[Beowulf] tracejob command shows error

rigved sharma rigved.sharma123 at gmail.com
Wed Jun 16 18:47:18 PDT 2010


Hi,

We have a 16-node cluster with Torque and Maui installed on it. We have
just migrated from Torque 2.3.6 to Torque 2.4.8, but the tracejob command
is not working. Its output is:

/usr/spool/PBS/server_priv/accounting/20100617: No matching job records located
/usr/spool/PBS/server_logs/20100617: No matching job records located
/usr/spool/PBS/mom_logs/20100617: No such file or directory
/usr/spool/PBS/sched_logs/20100617: No such file or directory
*** glibc detected *** tracejob: malloc(): memory corruption: 0x0000000003a74140 ***
======= Backtrace: =========
/lib64/libc.so.6[0x3845871cd1]
/lib64/libc.so.6(__libc_malloc+0x7d)[0x3845872e8d]
/lib64/libc.so.6(popen+0x23)[0x3845862a63]
tracejob[0x401218]
tracejob[0x401bcf]
/lib64/libc.so.6(__libc_start_main+0xf4)[0x384581d8b4]
tracejob[0x400e09]
======= Memory map: ========
00400000-00403000 r-xp 00000000 68:05 6094878 /usr/spool/PBS/bin/tracejob
00603000-00604000 rw-p 00003000 68:05 6094878 /usr/spool/PBS/bin/tracejob
03a74000-03a95000 rw-p 03a74000 00:00 0
3844800000-384481a000 r-xp 00000000 68:02 896307 /lib64/ld-2.5.so
3844a1a000-3844a1b000 r--p 0001a000 68:02 896307 /lib64/ld-2.5.so
3844a1b000-3844a1c000 rw-p 0001b000 68:02 896307 /lib64/ld-2.5.so
3845800000-384594a000 r-xp 00000000 68:02 896308 /lib64/libc-2.5.so
384594a000-3845b49000 ---p 0014a000 68:02 896308 /lib64/libc-2.5.so
3845b49000-3845b4d000 r--p 00149000 68:02 896308 /lib64/libc-2.5.so
3845b4d000-3845b4e000 rw-p 0014d000 68:02 896308 /lib64/libc-2.5.so
3845b4e000-3845b53000 rw-p 3845b4e000 00:00 0
3849c00000-3849c0d000 r-xp 00000000 68:02 896314 /lib64/libgcc_s-4.1.2-20080102.so.1
3849c0d000-3849e0d000 ---p 0000d000 68:02 896314 /lib64/libgcc_s-4.1.2-20080102.so.1
3849e0d000-3849e0e000 rw-p 0000d000 68:02 896314 /lib64/libgcc_s-4.1.2-20080102.so.1
2abb01fde000-2abb01fe0000 rw-p 2abb01fde000 00:00 0
2abb01fe0000-2abb02009000 r-xp 00000000 68:05 6072097 /usr/spool/PBS/lib/libtorque.so.2.0.0
2abb02009000-2abb02209000 ---p 00029000 68:05 6072097 /usr/spool/PBS/lib/libtorque.so.2.0.0
2abb02209000-2abb0220b000 rw-p 00029000 68:05 6072097 /usr/spool/PBS/lib/libtorque.so.2.0.0
2abb0220b000-2abb022ee000 rw-p 2abb0220b000 00:00 0
2abb0231a000-2abb0231b000 rw-p 2abb0231a000 00:00 0
2abb04000000-2abb04021000 rw-p 2abb04000000 00:00 0
2abb04021000-2abb08000000 ---p 2abb04021000 00:00 0
7fffa8ab6000-7fffa8acc000 rw-p 7fffa8ab6000 00:00 0 [stack]
ffffffffff600000-ffffffffffe00000 ---p 00000000 00:00 0 [vdso]
Aborted.

Kindly suggest a fix.

