[scyld-users] how to let bbq migrate batch jobs to compute nodes?

Weirong Zhu wrzhu at etinternational.com
Tue Mar 14 10:24:38 PST 2006


Thanks for the information.

This seems a feasible method. Thanks very much.

Have you ever tried bbq?  Since with a batch system, it will be easier 
to control jobs.  For example, you can watch the status of all the jobs 
in the queue, and easily delete a job from a queue.  Now if we do the 
batch ourselves,  we need to use "ps" to find the corresponding pid, and 
kill the job.

So I still want to know how make use of "bbq".

-- Weirong

Bishop, Ryan S SAJ Contractor wrote:

>We use a batch script that sets a variable, something like "NODE='beomap
>--nolocal'"  and then runs a bpsh NODE [command].  That will issue the job to
>the next free node.  YMMV - make sure to check out the beomap and bpsh man
>pages.
>
>--Schuyler
>
>-----Original Message-----
>From: scyld-users-bounces at beowulf.org
>[mailto:scyld-users-bounces at beowulf.org] On Behalf Of Weirong Zhu
>Sent: Tuesday, March 14, 2006 1:23 AM
>To: scyld-users at beowulf.org
>Subject: [scyld-users] how to let bbq migrate batch jobs to compute nodes?
>
>We have just got our new Penguin Computing cluster.
>Since one of our main purpose it to submit a lot of batch jobs to the
>cluster, I tried to learn how to use bbq provided by scyld.
>
>As a simple test,
>
>(1) I wrote a C program, which has a while(1) loop.  Then I compile it to
>generate the binary a.out.
>(2) Write a simple job file with only one command "./a.out".  And name this
>file as run.
>(3) submit the job by "batch now -f run"
>(4) do step (3) a lot of times.
>
>Then by using command "bbq" I saw a lot of jobs were listed.  And I assume
>those jobs would be migrated to computing nodes.
>
>However,  when I use "beostat -C", find all the computing nodes are actually
>idle, and all those instances are running on master node.
>
>Did I do something wrong to submit my simple batch jobs?
>How should I do?
>
>Moreover,  I tried to use "atrm" to delete my jobs from the queue. After
>that, when I use "bbq" command, there is nothing in the queue. However, when
>I did a "top" or "ps -fu myname".  Those jobs are still running on the master
>node. 
>Did I do something wrong to delete a batch job from the queue?
>How should I do?
>
>I am really confused with the bbq batch system and it seems that there is no
>PBS avaliable on this cluster.
>
>Any help and suggestions are welcome!
>
>Regards,
>Weirong
>
>
>
>_______________________________________________
>Scyld-users mailing list, Scyld-users at beowulf.org To change your subscription
>(digest mode or unsubscribe) visit
>http://www.beowulf.org/mailman/listinfo/scyld-users
>
>  
>




More information about the Scyld-users mailing list