[Beowulf] RE: [Bioclusters] FPGA in bioinformatics clusters (again?)
    bella at carolina.rr.com 
       
    Mon Jan 16 17:36:56 PST 2006
    
    
  
Mike Davis wrote:
> But BLAST is only a small part and arguably the easiest part of 
> genomics work. The advantages of parallelization and/or SMP come into 
> play when attempting to assemble the genome. Phred/Phrap can do the 
> work but starts to slow even large machines when you're talking 50k+ 
> sequences (which it wants to be in one folder). A quiz for the Unix 
> geeks out there: what happens when a folder has 50,000 files in it? 
> Can you say SLOOOOOOOOOWWWW?
>
> Mike Davis
>
Sorry... I just couldn't let this one go by.  And no offense meant to 
anyone but...
Many times I have found users and application folks creating inordinately 
(and, in my opinion, unacceptably) large numbers of files in a single 
sub-directory on one of "my" UNIX or Linux boxes.
I simply gently take them aside and have a little "prayer meetin'" with 
them.  There is always a way to fix this kind of problem by consulting 
with the applications folks, and helping them see a better way.  That's 
why God made "mkdir(2)".
In my opinion, if this "Phred/Phrap" thingy (about which I KNOW NOTHING 
- all disclaimers apply) _absolutely_  requires one to place 50,000 (or 
more) files in a single sub-directory... and therefore is slow... the 
application is simply broken.  Contact the developers, or get the 
source... and we'll go fix it.
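
For what it's worth, here is a minimal sketch of the sort of fix I have in 
mind, written in Python. It is not taken from Phred/Phrap or any other 
real application. The function names (shard_path, shard_directory), the 
two-level layout, and the choice of an MD5 prefix as the bucket name are 
all assumptions made for illustration. The idea is simply to spread the 
files across many small sub-directories, and to have the application find 
a file again by computing the same path from its name.

```python
import hashlib
import os
import shutil


def shard_path(root, filename, levels=2, width=2):
    """Return the sharded location for filename under root.

    Hypothetical layout: the first levels * width hex digits of the MD5
    of the file name become nested sub-directory names,
    e.g. root/ab/cd/filename.
    """
    digest = hashlib.md5(filename.encode("utf-8")).hexdigest()
    parts = [digest[i * width:(i + 1) * width] for i in range(levels)]
    return os.path.join(root, *parts, filename)


def shard_directory(flat_dir, root):
    """Move every regular file in flat_dir into its shard under root.

    Returns the number of files moved.
    """
    # Collect the names first so the directory is not modified
    # while it is still being scanned.
    with os.scandir(flat_dir) as entries:
        names = [entry.name for entry in entries if entry.is_file()]

    for name in names:
        target = shard_path(root, name)
        os.makedirs(os.path.dirname(target), exist_ok=True)
        shutil.move(os.path.join(flat_dir, name), target)
    return len(names)
```

With two levels of two hex digits there are 65,536 possible buckets, so 
50,000 files would average fewer than one file per bucket. A single level 
(256 buckets, roughly 200 files each) would probably do just as well. This 
only helps if the application can be taught to call something like 
shard_path() instead of expecting everything in one folder, which is 
exactly the conversation to have with the developers.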
My 1 & 1/2 cents worth.
Arthur Bell
Senior UNIX/Linux System Administrator
    
    