Archives


- Beowulf
- Beowulf Announce
- Scyld-users
- Beowulf on Debian

[scyld-users] how to let bbq migrate batch jobs to compute nodes?

Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.

Search

Bishop, Ryan S SAJ Contractor Ryan.S.Bishop at saj02.usace.army.mil
Tue Mar 14 10:31:11 PST 2006


We use the methodology I described below with batch, which feeds bbq.  Works
like a charm. 

-----Original Message-----
From: scyld-users-bounces at beowulf.org
[mailto:scyld-users-bounces at beowulf.org] On Behalf Of Weirong Zhu
Sent: Tuesday, March 14, 2006 1:25 PM
To: scyld-users at beowulf.org
Subject: Re: [scyld-users] how to let bbq migrate batch jobs to compute
nodes?

Thanks for the information.

This seems a feasible method. Thanks very much.

Have you ever tried bbq?  Since with a batch system, it will be easier to
control jobs.  For example, you can watch the status of all the jobs in the
queue, and easily delete a job from a queue.  Now if we do the batch
ourselves,  we need to use "ps" to find the corresponding pid, and kill the
job.

So I still want to know how make use of "bbq".

-- Weirong

Bishop, Ryan S SAJ Contractor wrote:

>We use a batch script that sets a variable, something like 
>"NODE='beomap --nolocal'"  and then runs a bpsh NODE [command].  That 
>will issue the job to the next free node.  YMMV - make sure to check 
>out the beomap and bpsh man pages.
>
>--Schuyler
>
>-----Original Message-----
>From: scyld-users-bounces at beowulf.org
>[mailto:scyld-users-bounces at beowulf.org] On Behalf Of Weirong Zhu
>Sent: Tuesday, March 14, 2006 1:23 AM
>To: scyld-users at beowulf.org
>Subject: [scyld-users] how to let bbq migrate batch jobs to compute nodes?
>
>We have just got our new Penguin Computing cluster.
>Since one of our main purpose it to submit a lot of batch jobs to the 
>cluster, I tried to learn how to use bbq provided by scyld.
>
>As a simple test,
>
>(1) I wrote a C program, which has a while(1) loop.  Then I compile it 
>to generate the binary a.out.
>(2) Write a simple job file with only one command "./a.out".  And name 
>this file as run.
>(3) submit the job by "batch now -f run"
>(4) do step (3) a lot of times.
>
>Then by using command "bbq" I saw a lot of jobs were listed.  And I 
>assume those jobs would be migrated to computing nodes.
>
>However,  when I use "beostat -C", find all the computing nodes are 
>actually idle, and all those instances are running on master node.
>
>Did I do something wrong to submit my simple batch jobs?
>How should I do?
>
>Moreover,  I tried to use "atrm" to delete my jobs from the queue. 
>After that, when I use "bbq" command, there is nothing in the queue. 
>However, when I did a "top" or "ps -fu myname".  Those jobs are still 
>running on the master node.
>Did I do something wrong to delete a batch job from the queue?
>How should I do?
>
>I am really confused with the bbq batch system and it seems that there 
>is no PBS avaliable on this cluster.
>
>Any help and suggestions are welcome!
>
>Regards,
>Weirong
>
>
>
>_______________________________________________
>Scyld-users mailing list, Scyld-users at beowulf.org To change your 
>subscription (digest mode or unsubscribe) visit 
>http://www.beowulf.org/mailman/listinfo/scyld-users
>
>  
>

_______________________________________________
Scyld-users mailing list, Scyld-users at beowulf.org To change your subscription
(digest mode or unsubscribe) visit
http://www.beowulf.org/mailman/listinfo/scyld-users




More information about the Scyld-users mailing list