[Beowulf] Can one Infiniband net support MPI and a parallel filesystem?

Kilian CAVALOTTI kilian.cavalotti.work at gmail.com
Wed Aug 13 16:35:16 PDT 2008


Hi Chris,

On Tuesday 12 August 2008 08:29:31 pm Chris Samuel wrote:
> We do use things like cpusets to try and limit the impact
> that jobs can have on other jobs on the same nodes, 

I'm actually curious about how you implemented that. Do you have NUMA 
hardware? Do you use a resources manager, and is the cpusets creation 
process integrated with it? How do you manage concurrent jobs running 
on the same machine: do you pin them on specific CPUs and keep track of 
what CPU is busy and which is not, or do you have a way to just limit 
the number of CPUs they're using?

As you can guess, I'd be interested in some technical details. :)

Cheers,
-- 
Kilian



More information about the Beowulf mailing list