Archives


- Beowulf
- Beowulf Announce
- Scyld-users
- Beowulf on Debian

[Beowulf] Customize Comparison between Torque(PBS)/SGE and LSF

Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.

Search

Rayson Ho raysonlogin at gmail.com
Fri Aug 5 20:25:54 PDT 2005


On 8/4/05, Sebastian <ChiefShadow at gmx.de> wrote:
> SGE:
> + GUI
> + Support (Mailing List)
> + all demands are meet
> + LAM-Support with Reutis Howto
> + Basics for Checkpointing/Migration are available
> + OpenSource

Gridengine supports checkpoint/migrate, but you may need to configure
a ckpt interface. It may involve some scripting to do to invoke
external commands to do checkpointing (if any provided by LS-DYNA?).
http://gridengine.sunsource.net/nonav/source/browse/~checkout~/gridengine/doc/htmlman/htmlman1/sge_ckpt.html


You can get commerical support from:
http://www.sun.com/software/gridware/partners/index.xml


Webinterfaces are also available... in fact there are 4 or 5 written:
- the simplest one is the GE momitor: http://www.veus.hr/php/gemonitor/
- or this one with XML->html: http://scripps.bos.bioteam.net/xml-qstat/
- or http://gridengine.sunsource.net/gep/GEP_Intro.html
- and there's one from the Rocks cluster guys


> Torque:
> + Integration Torque and LAM/MPI with the tm-Modul
> 
> - requirements only available with MAUI Scheduler
> - no GUI when using MAUI (is that right??)
> - xpbs&xpbsmon isn't good looking

Maui is supposed to be the most advanced scheduler, with backfilling
and advanced reservation...

But with the size of the cluster, I am not sure if it's needed or
not... and in fact, Maui+Torque is slower than SGE in terms of
throughtput:

http://arxiv.org/abs/cs.DC/0506006

>From page 7 of the paper:

                                SGE        TORQUE TORQUE+MAUI OAR   OAR(2)
Available Processors    34              34              34            
 34        34
Jobmix work (CPU-sec) 443340    443340  443340          443340 443340
Elapsed Time               14164      14818    15115            15264    14037
Efficiency                     0,9206     0,8800    0,8627         
0,8543  0.9289

This is kind of expected, since [SGE and Torque] "all the jobs
requiring few processors are scheduled first while all the big
parallel jobs are delayed until the end of the schedule."

Rayson

> 
> This is actually my small list with advantages/disadvantes, are there some
> wrong or are there some missing?
> 
> I read of some problems with PBS when you submit more than hundret jobs the
> same time, because i think we don't have more than 20 jobs in a queue this
> shouldn't be problematical.
> 
> Now i listen what you will tell me and i'm amused of many answers.
> 
> Thx and Kind Regards,
> 
> Sebastian
> 
> --
> GMX DSL = Maximale Leistung zum minimalen Preis!
> 2000 MB nur 2,99, Flatrate ab 4,99 Euro/Monat: http://www.gmx.net/de/go/dsl
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
>




More information about the Beowulf mailing list