[Beowulf] Computation on the head node
Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.
Joe Landman landman at scalableinformatics.comSun May 18 11:12:31 PDT 2008
- Previous message: [Beowulf] Computation on the head node
- Next message: [Beowulf] Computation on the head node
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Perry E. Metzger wrote: > Joe Landman <landman at scalableinformatics.com> writes: >> [...] >> We approach it from a different view. Start out with a very fast RAID >> and attach good networking to it. Here is a RAID6 across 12 disks. >> [...] > > I have to emphasize, yet again, that the correct approach is not to > assume that you need a fast file system (or that you can make do with > a slow one) until you've actually benchmarked the application that > your cluster is intended to serve. One can have no idea without Perry, you are "preaching to the choir" in general. These are points we have made for quite some time on this group and elsewhere. The only app that matters is your app. Not spec-*, not NAS parallel B/C/... , ... This said, we (collectively) also have some experience dealing with I/O issues on clusters. This is, as it turns out, a hard problem. It is a problem exasperated by fast networks that can sink and source large data sets. Moreover, the problem is non-trivial in a number of cases, as you scale up, as you discover new and exciting points of serialization that have not been exposed before. This is not just true of IO but also of CPUs, of memory systems, etc. 8 processor cores sharing a single memory system can have an impact on codes. Specific algorithms map well into these systems, some do not. > checking. A very fast RAID array may be in order -- or it may be > completely unnecessary. One can't know without understanding one's > application intimately, and that requires testing. Of course. But there are quite a few people/groups on this list with decades of HPC experience that might have an inkling if a USB or similar connected drive "is a good idea" for an app, even prior to running it. Benchmarking is important, but it is important that the benchmark represent real runs. Experience can provide a rough guide in the case of no benchmark data availability. With clusters, you run into the very real problem of IO resource contention, quite quickly. Putting lower end IO devices in there rarely makes sense. Sure, you can benchmark it, and you should if possible. But it is also not a bad idea to listen to people whom have been working on this stuff for a while, they might have a clue about these things. > Picking hardware without knowing the app in detail is like deciding > that the right vehicle for you is a formula 1 race car before > discovering your application is hauling 50 passengers across town > through heavy traffic, or picking a bus before discovering your > application is breaking a land speed record. ... and we see this happen all the time. Not picking particular models as much as picking particular brands, or systems ill-suited for the tasks at hand, or ill-configured for the tasks, ... -- Joseph Landman, Ph.D Founder and CEO Scalable Informatics LLC, email: landman at scalableinformatics.com web : http://www.scalableinformatics.com http://jackrabbit.scalableinformatics.com phone: +1 734 786 8423 fax : +1 866 888 3112 cell : +1 734 612 4615
- Previous message: [Beowulf] Computation on the head node
- Next message: [Beowulf] Computation on the head node
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
More information about the Beowulf mailing list
