[Beowulf] scratch File system for small cluster

Joe Landman landman at scalableinformatics.com
Thu Sep 25 11:08:23 PDT 2008


Greg Lindahl wrote:
> On Thu, Sep 25, 2008 at 10:19:26AM -0400, Joe Landman wrote:
> 
>>   BLAST uses mmap'ed IO.  This has some interesting ... interactions ... 
>> with parallel file systems.
> 
> The PathScale compilers use mmap on their temporary files. This led to
> some interesting bugs being reported... fortunately, we were able to
> pinpoint the parallel filesystems as being the guilty parties without
> too much work.

   It looks like people use mmap files to explicitly avoid seeks, 
replacing semantics of file IO with memory access semantics.  We have a 
customer who uses mmap for some large files (multiple GB).  Sadly, mmap 
on linux uses the paging mechanism which is pretty much stuck at 4kB 
pages for most distributions.  I think the SiCortex folks and a few 
others are working with 64 kB page kernels.

   I am sure there are good reasons for using mmap.  I just don't know 
what they are, and in what contexts.  I would rather have 
direct/explicit control over the IO if possible.


-- 
Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
        http://jackrabbit.scalableinformatics.com
phone: +1 734 786 8423 x121
fax  : +1 866 888 3112
cell : +1 734 612 4615



More information about the Beowulf mailing list