[Beowulf] Re: static pvm library won't link anymore

David Mathog mathog at caltech.edu
Mon Jul 9 11:04:41 PDT 2007


Turns out that the difficulties getting pvm to run were not just
pvm problems.  There is a bizarre problem where every program, even
"Hello World" built on Mandriva 2007.1 (gcc 4.1.2,2.6.17-14mdv) 
will not run on Mandriva 2007.0 (gcc 4.1.1, 2.6.19.3).  Going
in the other direction works, as does Mandriva 2006.0 to either,
and of course all programs run wherever they were built
in the first place.  Here's hello refusing to run on 2007.0 under gdb:

(master)
# cat hello.c
#include <stdio.h>
int main(void){
  (void) fprintf(stdout,"HELLO\n");
}
# gcc -g -o hello hello.c
# hello
HELLO
# cp hello /usr/common/bin

(slave)
# gdb /usr/common/tmp/hello
GNU gdb 6.3-8mdv2007.0 (Mandriva Linux release 2007.0)
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain
conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "i586-mandriva-linux-gnu"...Using host
libthread_db library "/lib/i686/libthread_db.so.1".

(gdb) run
Starting program: /usr/common/tmp/hello 
Failed to read a valid object file image from memory.

Program received signal SIGFPE, Arithmetic exception.
0xb7f8b96f in do_lookup_x (undef_name=0xb7e598d3 "_res", hash=420035,
ref=0xb7e52834, result=0xbffef1f0, scope=0xb7f9c838, i=0, 
    version=0xb7f78328, flags=0, skip=0x0, type_class=Variable
"type_class" is not available.
) at do-lookup.h:72
72      do-lookup.h: No such file or directory.
        in do-lookup.h
(gdb) bt
#0  0xb7f8b96f in do_lookup_x (undef_name=0xb7e598d3 "_res",
hash=420035, ref=0xb7e52834, result=0xbffef1f0, scope=0xb7f9c838, 
    i=0, version=0xb7f78328, flags=0, skip=0x0, type_class=Variable
"type_class" is not available.
) at do-lookup.h:72
#1  0xb7f8bc87 in _dl_lookup_symbol_x (undef_name=0xb7e598d3 "_res",
undef_map=0xb7f78000, ref=0xbffef310, 
    symbol_scope=0xb7f781a8, version=0xb7f78328, type_class=0, flags=0,
skip_map=0x0) at dl-lookup.c:233
#2  0xb7f8d263 in _dl_relocate_object (l=Variable "l" is not available.
) at ../sysdeps/i386/dl-machine.h:354
#3  0xb7f8631f in dl_main (phdr=0x8048034, phnum=224,
user_entry=0xbffef700) at rtld.c:2235
#4  0xb7f9540e in _dl_sysdep_start (start_argptr=0xbffef760,
dl_main=0xb7f85050 <dl_main>) at ../elf/dl-sysdep.c:239
#5  0xb7f84709 in _dl_start (arg=0xbffef760) at rtld.c:333
#6  0xb7f83847 in _start () at rtld.c:788
(gdb) quit
The program is running.  Exit anyway? (y or n) y

When the master node was upgraded a short time ago from Mandriva 2006.0
to 2007.1 it introduced this problem, but it didn't manifest until the
first time a program was built on the master and distributed to the
slaves, and that was, unfortunately, pvm, which has enough build issues,
that it took a while to determine that this was yet _another_ issue
beyond those.

Regards,

David Mathog
mathog at caltech.edu
Manager, Sequence Analysis Facility, Biology Division, Caltech



More information about the Beowulf mailing list