[Beowulf] bizarre scaling behavior on a Nehalem
Mikhail Kuzminsky kus at free.net
Fri Aug 14 16:24:25 PDT 2009
- Previous message: [Beowulf] newbie beorun question
- Next message: [Beowulf] METIS Partitioning within program
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
In message from Bill Broadley <bill at cse.ucdavis.edu> (Fri, 14 Aug 2009 16:13:21 -0700):
>Mikhail Kuzminsky wrote:
>>> Your results look excellent, so I wouldn't be surprised if they are
>>> running at 1333.
>>
>> I have 12-18 GB/s on 4 threads of stream/ifort w/DDR3-1066 on dual
>> E5520 server. But it works under "numa-bad" kernel w/o control of
>> numa-efficient allocation.
>
>Sounds pretty bad.
>
>Why 4 threads? You need 8 cores to keep all 6 memory busses busy.

For comparison w/your tests: you have only 4 cores. On 8 threads I have 20-26 GB/s.

>
>Which compiler?

ifort pointed above means intel fortran 11.0.38.

Mikhail

> open64 does substantially better than gcc.
>
>--
>This message has been checked for viruses and other dangerous content
>by MailScanner, and we hope that it contains no malicious code.
>
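As a rough cross-check on the numbers quoted in this message, the sketch below estimates the theoretical peak memory bandwidth and the fraction of it that the reported STREAM results reach. It assumes a 64-bit (8-byte) DDR3 channel, 1066 MT/s for DDR3-1066, and six channels in total (the "6 memory busses" mentioned above); the function name `peak_bandwidth_gbs` is made up for illustration and is not part of STREAM or any tool discussed in the thread.

```python
def peak_bandwidth_gbs(transfers_mt_s, channels, bytes_per_transfer=8):
    """Theoretical peak bandwidth in GB/s (1 GB = 1e9 bytes).

    Assumes every channel moves `bytes_per_transfer` bytes per transfer;
    8 bytes corresponds to a 64-bit DDR3 channel.
    """
    return transfers_mt_s * 1e6 * bytes_per_transfer * channels / 1e9


# Assumption: DDR3-1066 on six channels in total across the two sockets.
peak = peak_bandwidth_gbs(1066, channels=6)

# STREAM ranges reported in the message, in GB/s.
reported = [("4 threads", 12, 18), ("8 threads", 20, 26)]

for label, low, high in reported:
    print(f"{label}: {low / peak:.0%} to {high / peak:.0%} "
          f"of {peak:.1f} GB/s theoretical peak")
```

This only bounds what the hardware could deliver. It says nothing about NUMA placement, which the message points to as a likely reason the measured figures vary so widely.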
