Archives


- Beowulf
- Beowulf Announce
- Scyld-users
- Beowulf on Debian

[Beowulf] g03 and centos 3.6

Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.

Search

Mike Davis jmdavis1 at vcu.edu
Tue Jun 13 15:04:22 PDT 2006


Joe et al.

I'm not sure how many of you will be interested, but I think that the 
g03/Centos 3.6 issue is resolved.

In talking to David Hibbs in Australia (who had experienced a similar 
issue), I found out that the issue was likely a compiler bug. David had 
solved the problem by compiling with a pgi5 version.

Joe L. was very helpful today as well. I decided to try the compile with 
-O2 and -Mnosse. Sadly that resulted in a threads error. But it was 
definitely worth a try.

Final success came with an install of the pgi6.1 compilers and making 
sure that ulimit -s was set to unlimited.

So, Centos3.6 will run g03D02 on the opterons when compiled with pgi6.1. 
No special settings were necessary in my case.


Mike Davis





Joe Landman wrote:
> Hi Mike:
> 
>   You hit a segfault.  What options did you use with GO3?  The last
> build we worked on with a customer that we had to restrict it to -O2
> -Mnosse or something like this.  Try to compile/link with -g to see if
> you can include symbols, and force a core dump so you can work backwards
> from there.
> 
>   Also, what does your "ulimit -a" report (or limit for the tcsh folks).
> 
> Joe
> 
> Mike Davis wrote:
> 
>>Hey Everyone,
>>
>>I asked back in January about G03 running on platforms other than SuSe.
>>Thanks to everyone who responded. Now I'm trying to compile G03 D02 with
>>pgi 6 on Centos 3.5.
>>
>>The good news is that it compiles. It generates the needed 80 .exe
>>files. G03 generates output.
>>
>>The bad news is that every run fails at the same point.
>>
>>
>>IExCor=   0 DFT=F Ex=HF Corr=None ExCW=0 ScaHFX=  1.000000
>> ScaDFX=  1.000000  1.000000  1.000000  1.000000
>> IRadAn=      0 IRanWt=     -1 IRanGd=            0 ICorTp=0
>> NAtoms=   40 NActive=   40 NUniq=   40 SFac= 7.50D-01 NAtFMM=   80
>>NAOKFM=F Big=F
>> Leave Link  301 at Thu Jun  8 15:06:59 2006, MaxMem=   20000000 cpu:
>>     0.0
>> (Enter /usr/global/g03/l302.exe)
>> NPDir=0 NMtPBC=     1 NCelOv=     1 NCel=       1 NClECP=     1 NCelD=
>>     1
>>         NCelK=      1 NCelE2=     1 NClLst=     1 CellRange=     0.0.
>> One-electron integrals computed using PRISM.
>>
>>
>>
>>After that nothing... Below is an strace.
>>
>>
>>strace:
>>
>>[pid 10461] write(1, " One-electron integrals computed"..., 46) = 46
>>[pid 10461] --- SIGSEGV (Segmentation fault) @ 0 (0) ---
>>Process 10460 resumed
>>Process 10461 detached
>><... wait4 resumed> [{WIFSIGNALED(s) && WTERMSIG(s) == SIGSEGV}], 0,
>>NULL) = 10461
>>rt_sigaction(SIGINT, {SIG_DFL}, NULL, 8) = 0
>>rt_sigaction(SIGQUIT, {SIG_DFL}, NULL, 8) = 0
>>rt_sigprocmask(SIG_SETMASK, [RTMIN], NULL, 8) = 0
>>--- SIGCHLD (Child exited) @ 0 (0) ---
>>stat("/home/jmdavis/Gau-10460.inp", {st_mode=S_IFREG|0664, st_size=3980,
>>...}) = 0
>>unlink("/home/jmdavis/Gau-10460.inp")   = 0
>>munmap(0x2a9566d000, 4096)              = 0
>>exit_group(1)                           = ?
>>
>>David Hibbs in Australia was having a similar problem but I can't find
>>any answers to his query.
>>
>>
>>Mike Davis
>>_______________________________________________
>>Beowulf mailing list, Beowulf at beowulf.org
>>To change your subscription (digest mode or unsubscribe) visit
>>http://www.beowulf.org/mailman/listinfo/beowulf
> 
> 




More information about the Beowulf mailing list