[Beowulf] Fwd: [torqueusers] TORQUE 2.0.0p1 Release
Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.
Chris Samuel csamuel at vpac.orgWed Nov 9 17:09:19 PST 2005
- Previous message: [Beowulf] Turion 64 floating point perfomance
- Next message: [Beowulf] Question on hgh performance, low cost Fileserver
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
This patch also fixes a minor regression in Torque 2.0.0p0 which broke redirecting stdout when using Pete Wyckoff's excellent mpiexec from OSC. ---------- Forwarded Message ---------- Subject: [torqueusers] TORQUE 2.0.0p1 Release Date: Thu, 10 Nov 2005 03:25 am From: "jonathan ryskamp" <jryskamp at clusterresources.com> To: torqueusers at supercluster.org Cc: mauiusers at supercluster.org Greetings, The next patch release of TORQUE, TORQUE 2.0.0 patch 1, is now available. While this comes right on the heals of a previous release, this latest distribution contains many significant improvements including the following: qstat modifications for massive job queue support (>50,000 jobs) enhanced momctl control and diagnostics multi-server support allowing mom's to communicate with multiple server daemons simultaneously faster job submission fixes for resource availability, data staging, and job management support for transient tmpdirs improved usability and documentation Also, be sure to try the EXPERIMENTAL features and provide feedback. See pbs_server_attributes(7B) for "down_on_error", "job_nanny", and "mom_job_sync". These are well tested, production-ready features that simply require more conceptual vetting. They are indicative of future directions of TORQUE development. In essence, these features do the following: - mark compute nodes down when various system failures are detected - address job deletion when compute nodes are non-responsive - synchronize mom and server job state to remove stale jobs For more detailed information, see the CHANGELOG at http://clusterresources.com/torquedocs/changelog.shtml Work has already begun on the next release. Currently, the following enhancements are under development: - improved high availability support - job array support - queue based scalability enhancements - qstat based job completion reporting - simplified installation for distributed systems - data staging diagnostics - queue hostlists for direct queue to node mapping - import of user umask for TM* module (FNAL) - the 'long-awaited' TORQUE documentation WIKI TORQUE is moving forward at an amazing pace in terms of both development and adoption. Again, thanks go out to all the contributing sites. Please continue to offer us your feedback. Let us know how TORQUE can be made more scalable, more stable, more capable, and more user friendly. Regards, Jonathan _______________________________________________ torqueusers mailing list torqueusers at supercluster.org http://www.supercluster.org/mailman/listinfo/torqueusers ------------------------------------------------------- -- Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager Victorian Partnership for Advanced Computing http://www.vpac.org/ Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 189 bytes Desc: not available Url : http://www.scyld.com/pipermail/beowulf/attachments/20051110/21b47f2b/attachment.bin
- Previous message: [Beowulf] Turion 64 floating point perfomance
- Next message: [Beowulf] Question on hgh performance, low cost Fileserver
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
More information about the Beowulf mailing list
