[Beowulf] Fwd: [torqueusers] TORQUE 2.0.0p1 Release

Chris Samuel csamuel at vpac.org
Wed Nov 9 17:09:19 PST 2005


This patch also fixes a minor regression in Torque 2.0.0p0 which broke 
redirecting stdout when using Pete Wyckoff's excellent mpiexec from OSC.

----------  Forwarded Message  ----------

Subject: [torqueusers] TORQUE 2.0.0p1 Release
Date: Thu, 10 Nov 2005 03:25 am
From: "jonathan ryskamp" <jryskamp at clusterresources.com>
To: torqueusers at supercluster.org
Cc: mauiusers at supercluster.org

Greetings,

  The next patch release of TORQUE, TORQUE 2.0.0 patch 1, is now
available.  While this comes right on the heels of a previous release,
this latest distribution contains many significant improvements
including the following:

  qstat modifications for massive job queue support (>50,000 jobs)
  enhanced momctl control and diagnostics
  multi-server support allowing moms to communicate with multiple
    server daemons simultaneously
  faster job submission
  fixes for resource availability, data staging, and job management
  support for transient tmpdirs
  improved usability and documentation

  Also, be sure to try the EXPERIMENTAL features and provide feedback.
See pbs_server_attributes(7B) for "down_on_error", "job_nanny", and
"mom_job_sync".  These are well-tested, production-ready features that
simply require more conceptual vetting.  They are indicative of future
directions of TORQUE development.  In essence, these features do the
following:

  - mark compute nodes down when various system failures are detected
  - address job deletion when compute nodes are non-responsive
  - synchronize mom and server job state to remove stale jobs

For more detailed information, see the CHANGELOG at

 http://clusterresources.com/torquedocs/changelog.shtml

  Work has already begun on the next release.  Currently, the following
enhancements are under development:

 - improved high availability support
 - job array support
 - queue based scalability enhancements
 - qstat based job completion reporting
 - simplified installation for distributed systems
 - data staging diagnostics
 - queue hostlists for direct queue to node mapping
 - import of user umask for TM* module (FNAL)
 - the 'long-awaited' TORQUE documentation WIKI

  TORQUE is moving forward at an amazing pace in terms of both
development and adoption.  Again, thanks go out to all the contributing
sites.  Please continue to offer us your feedback.  Let us know how
TORQUE can be made more scalable, more stable, more capable, and more
user-friendly.

Regards,
Jonathan




-------------------------------------------------------

-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia
