[Beowulf] GPU-HMMer for interested people


Olli-Pekka Lehto oplehto at csc.fi
Wed Dec 17 07:41:00 PST 2008


Joe Landman wrote:
> Hi folks
> 
>   GPU-HMMer (part of the MPI-HMMer effort) has just been
> announced/released at http://www.mpihmmer.org
> 
>   MPI-HMMer has itself been improved with parallel-IO and better
> scalability features.  JP has measured some large number (about 180x)
> over single cores on a cluster for the MPI run.
> 
>   Enjoy!
> 
> Joe
> 

Hi Joe,

Looks quite promising. Here are results from a simple real-world test case:

GPU: Dual GTX280, each with 1 GB RAM
CPU: Single Intel Core 2 Quad Q9550, 2.83 GHz

hmmsearch, 4 threads, sorted:                 274.49 s
hmmsearch, 4 threads, unsorted:               254.23 s
cuda_hmmsearch, unsorted:                     407.85 s
cuda_hmmsearch, sorted:                        62.69 s
cuda_hmmsearch, sorted, 2 simultaneous runs:   78.23 s, 80.79 s
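
For anyone who wants the ratios rather than the raw times, here is a small sketch that derives them from the numbers above. The timings are copied from the results as posted. The variable names, and the choice to treat the slower of the two simultaneous runs as the wall-clock time for the pair, are my own assumptions for illustration.

```python
# Timings in seconds, copied from the results as posted.
cpu_sorted = 274.49      # hmmsearch, 4 threads, sorted database
cpu_unsorted = 254.23    # hmmsearch, 4 threads, unsorted database
gpu_unsorted = 407.85    # cuda_hmmsearch, unsorted database
gpu_sorted = 62.69       # cuda_hmmsearch, sorted database
dual_gpu_runs = (78.23, 80.79)  # two simultaneous sorted runs, one per GPU


def speedup(baseline, candidate):
    """How many times faster `candidate` is than `baseline`."""
    return baseline / candidate


# Best CPU result versus best single-GPU result.
best_cpu = min(cpu_sorted, cpu_unsorted)
gpu_vs_cpu = speedup(best_cpu, gpu_sorted)

# Effect of sorting the database on each code path.
gpu_sort_gain = speedup(gpu_unsorted, gpu_sorted)
cpu_sort_penalty = cpu_sorted / cpu_unsorted - 1.0

# Assumption: both simultaneous runs are done when the slower one finishes,
# so the effective time per search is that wall-clock time divided by two.
dual_wall = max(dual_gpu_runs)
dual_per_search = dual_wall / len(dual_gpu_runs)
dual_throughput_gain = speedup(gpu_sorted, dual_per_search)

print(f"GPU (sorted) vs best CPU run:        {gpu_vs_cpu:.2f}x")
print(f"GPU sorted vs GPU unsorted:          {gpu_sort_gain:.2f}x")
print(f"CPU slowdown from sorting:           {cpu_sort_penalty:.1%}")
print(f"Dual-GPU throughput vs single GPU:   {dual_throughput_gain:.2f}x")
```

On these figures a single GPU with a sorted database is roughly four times faster than the best four-thread CPU run, and running one search per GPU gives roughly one and a half times the throughput of a single run.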

Remarks:

- Running hmmsort to sort the sequence database is critical to obtaining
reasonable performance from cuda_hmmsearch. However, the regular
hmmsearch is slightly slower with the sorted database.

- Running two simultaneous runs assigned to different GPUs on a dual-GPU
quad-core system incurs some performance penalty, but is still quite
feasible.

- I used the parameters THREADSIZE=320 BLOCKSIZE=64. I'm not completely
sure whether these are the optimum values for the GTX280. Any better
suggestions?
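
On the first remark about sorting, here is a toy model of one plausible reason it matters so much on the GPU. This is my assumption, not something measured in the runs above: if a group of GPU threads each scores one sequence and the group only finishes when its longest sequence is done, then mixing short and long sequences in one group wastes work. The sequence lengths below are synthetic, and the group size of 64 is arbitrary; it is not meant to correspond to the BLOCKSIZE parameter.

```python
import random


def batch_efficiency(lengths, batch_size):
    """Fraction of lockstep work that is useful.

    Model (an assumption for illustration): sequences are processed in
    consecutive batches, and every slot in a batch is busy for as long as
    the longest sequence in that batch.
    """
    useful = sum(lengths)
    spent = 0
    for start in range(0, len(lengths), batch_size):
        batch = lengths[start:start + batch_size]
        spent += max(batch) * len(batch)
    return useful / spent


random.seed(0)
# Synthetic sequence lengths, not taken from any real database.
lengths = [random.randint(50, 2000) for _ in range(6400)]

unsorted_eff = batch_efficiency(lengths, 64)
sorted_eff = batch_efficiency(sorted(lengths), 64)

print(f"unsorted database: {unsorted_eff:.0%} of the work is useful")
print(f"sorted database:   {sorted_eff:.0%} of the work is useful")
```

In this model the sorted order keeps similar lengths together, so each batch wastes very little time waiting on its longest member. It says nothing about why the CPU version gets slightly slower with the sorted database.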

Regards,
Olli-Pekka
-- 
Olli-Pekka Lehto, Systems Specialist, Special computing, CSC
PO Box 405 02101 Espoo, Finland; tel +358 9 4572215, fax +358 9 4572302
CSC is the Finnish IT Center for Science, www.csc.fi,
e-mail: Olli-Pekka.Lehto at csc.fi


