Archives


- Beowulf
- Beowulf Announce
- Scyld-users
- Beowulf on Debian

Cluster MOBO suggestions

Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.

Search

Mike Mettke mettke at lucent.com
Mon Jan 8 08:30:33 PST 2001


All,

did anybody run the same test on an Serverworks HE chipset ?
They claim 4.1GByte/s bandwidth which sounds, well, fantastic.
If that's anywhere near true we might get them.

regards,
Mike


Steven Castillo wrote:

> On Sat, 6 Jan 2001, Leo Magallon wrote:
>
> > Thanks,  I think i've almost decided on getting the Super 370DLE.   It  seems
> > like THE motherboard choice. By choosing this mobo I can save on buying the
> > NICs because it comes with it.
> >
> >
> > Leo.
> >
> > Joey Raheb wrote:
> >
>
> Hello Leo:
>
> You might want to consider the Super 370DER (built-in rage ati graphics)
> or 370DE6 instead.  These boards use the Serverworks HE-SL
> chipset rather than the Serverworks LE chipset.  The HE-SL chipset uses
> 2-way interleaved memory for improved memory bandwidth.  Please see the
> post from Steffan Persvold below listing stream benchmarks for these two
> different chipsets.  I happened to save the posting because we were
> considering different dual CPU mobo's.
>
>  --
> Steve Castillo
>
> Professor
> Klipsch School of Electrical and Computer Engineering
> New Mexico State University
> scastill at nmsu.edu
> (505)646-3117
>
> From sp at scali.no Mon Jan  8 08:08:42 2001
> Date: Mon, 11 Dec 2000 19:10:41 +0100
> From: Steffen Persvold <sp at scali.no>
> To: Ray Muno <muno at aem.umn.edu>
> Cc: beowulf at beowulf.org
> Subject: Re: Serverworks chip sets
>
> Here are a more meaningful comparision. The comparison is between a Tyan
> S2510 based on the LE chipset and a Supermicro 370DER based on the HE-SL
> chipset.
>
> Both machines are equipped with 2x PIII-800EB(133) CPUs and 512MByte
> PC-133 SDRAM
>
> Using one CPU:
>
> 370DER:
>    ----------------------------------------------
>     Double precision appears to have 16 digits of accuracy
>     Assuming 8 bytes per DOUBLE PRECISION word
>    ----------------------------------------------
>     Array size =    2000000
>     Offset     =          0
>     The total memory requirement is   45 MB
>     You are running each test  20 times
>     --
>     The *best* time for each test is used
>     *EXCLUDING* the first and last iterations
>     ----------------------------------------------------
>     Your clock granularity appears to be less than one microsecond
>     Your clock granularity/precision appears to be      1 microseconds
>     ----------------------------------------------------
>    Function     Rate (MB/s)  Avg time   Min time  Max time
>    Copy:        374.2689      0.0855      0.0855      0.0856
>    Scale:       401.9594      0.0796      0.0796      0.0797
>    Add:         475.5536      0.1010      0.1009      0.1010
>    Triad:       475.1911      0.1011      0.1010      0.1012
>     ----------------------------------------------------
>     Solution Validates!
>     ----------------------------------------------------
>
> S2510:
>    ----------------------------------------------
>     Double precision appears to have 16 digits of accuracy
>     Assuming 8 bytes per DOUBLE PRECISION word
>    ----------------------------------------------
>     Array size =    2000000
>     Offset     =          0
>     The total memory requirement is   45 MB
>     You are running each test  20 times
>     --
>     The *best* time for each test is used
>     *EXCLUDING* the first and last iterations
>     ----------------------------------------------------
>     Your clock granularity appears to be less than one microsecond
>     Your clock granularity/precision appears to be      1 microseconds
>     ----------------------------------------------------
>    Function     Rate (MB/s)  Avg time   Min time  Max time
>    Copy:        376.9848      0.0849      0.0849      0.0851
>    Scale:       375.0191      0.0854      0.0853      0.0854
>    Add:         483.7248      0.0993      0.0992      0.0994
>    Triad:       484.5352      0.0991      0.0991      0.0991
>     ----------------------------------------------------
>     Solution Validates!
>     ----------------------------------------------------
>
> Using both CPUs (OMP makes it a bit easier and adds the numbers for me):
> 370DER:
>    ----------------------------------------------
>     Double precision appears to have 16 digits of accuracy
>     Assuming 8 bytes per DOUBLE PRECISION word
>    ----------------------------------------------
>     Array size =    2000000
>     Offset     =          0
>     The total memory requirement is   45 MB
>     You are running each test  20 times
>     --
>     The *best* time for each test is used
>     *EXCLUDING* the first and last iterations
>     ----------------------------------------------------
>     Your clock granularity appears to be less than one microsecond
>     Your clock granularity/precision appears to be      1 microseconds
>     ----------------------------------------------------
>    Function     Rate (MB/s)  Avg time   Min time  Max time
>    Copy:        468.8234      0.0686      0.0683      0.0693
>    Scale:       479.3794      0.0669      0.0668      0.0671
>    Add:         541.0826      0.0888      0.0887      0.0888
>    Triad:       542.6429      0.0885      0.0885      0.0886
>     ----------------------------------------------------
>     Solution Validates!
>     ----------------------------------------------------
>
> S2510:
>    ----------------------------------------------
>     Double precision appears to have 16 digits of accuracy
>     Assuming 8 bytes per DOUBLE PRECISION word
>    ----------------------------------------------
>     Array size =    2000000
>     Offset     =          0
>     The total memory requirement is   45 MB
>     You are running each test  20 times
>     --
>     The *best* time for each test is used
>     *EXCLUDING* the first and last iterations
>     ----------------------------------------------------
>     Your clock granularity appears to be less than one microsecond
>     Your clock granularity/precision appears to be      1 microseconds
>     ----------------------------------------------------
>    Function     Rate (MB/s)  Avg time   Min time  Max time
>    Copy:        353.0918      0.0916      0.0906      0.0926
>    Scale:       355.3543      0.0918      0.0901      0.0931
>    Add:         397.2821      0.1213      0.1208      0.1222
>    Triad:       397.7756      0.1218      0.1207      0.1225
>     ----------------------------------------------------
>     Solution Validates!
>     ----------------------------------------------------
>
> As you can see the two-way interleaving helps a bit on a Dual-CPU run
> (~35% more bandwidth)
>
> Best regards,
> --
>   Steffen Persvold               Systems Engineer
>   Email : mailto:sp at scali.no     Scali AS (http://www.scali.com)
>   Tlf   : (+47) 22 62 89 50      Olaf Helsets vei 6
>   Fax   : (+47) 22 62 89 51      N-0621 Oslo, Norway
>
> _______________________________________________
> Beowulf mailing list
> Beowulf at beowulf.org
> http://www.beowulf.org/mailman/listinfo/beowulf
>
> _______________________________________________
> Beowulf mailing list
> Beowulf at beowulf.org
> http://www.beowulf.org/mailman/listinfo/beowulf





More information about the Beowulf mailing list