[Beowulf] likwid vs stream (after HPCG discussion)

Mikhail Kuzminsky kus at free.net
Mon Mar 21 09:46:31 UTC 2022


In message from Scott Atchley <e.scott.atchley at gmail.com> (Sun, 20 Mar
2022 14:52:10 -0400):
> On Sat, Mar 19, 2022 at 6:29 AM Mikhail Kuzminsky <kus at free.net> 
>wrote:
> 
>> If so, it turns out that for the HPC user, stream gives a more
>> important estimate - the application is translated by the compiler
>> (they do not write in assembler - except for modules from 
>>mathematical
>> libraries), and stream will give a real estimate of what will be
>> received in the application.
>>
> 
> When vendors advertise STREAM results, they compile the application 
>with
> non-temporal loads and stores. This means that all memory accesses 
>bypass
> the processor's caches. If your application of interest does a random 
>walk
> through memory and there is neither temporal or spatial locality, 
>then
> using non-temporal loads and stores makes sense and STREAM 
>irrelevant.

STREAM is not initially oriented to random access to memory. In this
case, memory latencies are important, and it makes more sense to get a
bandwidth estimate in the mega-sweep
(https://github.com/UK-MAC/mega-stream).


More information about the Beowulf mailing list