HD cloning
Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.
Donald Becker becker at scyld.comTue Dec 5 12:19:05 PST 2000
- Previous message: HD cloning
- Next message: HD cloning
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
On Wed, 6 Dec 2000, Bruce Janson wrote: > From: Daniel Ridge <newt at scyld.com> > On Wed, 6 Dec 2000, Bruce Janson wrote: > > Like you, installing makes me grumpy too, so I try not to do it > > more than once. Ideally all of our compute servers would share > > the same (network) file system. There are ways of doing this > > now (typically via NFS) but they tend to be hand-crafted and > > unpopular. > > In particular, I notice that the recent Scyld distribution > > assumes that files (libraries if I remember rightly) will be > > installed and available on the local computer. > > Why do people want to install locally? (Scyld people in particular > > are encouraged to reply.) > > While it is true that our (Scyld's) distribution places some files > on target nodes, the total volume is pretty tiny (a couple of tens of > megabytes for now, less in the future). These files, essentially > all shared libraries, are placed on the nodes just as a cache and > are not 'available' from most useful perspectives. They are 'available' > for a remote application to 'dlopen()' or certain other dynamic link > operations. > Yes, but storing any files locally suggests that you don't trust the > kernel's network file system caching. Is that why? If so, in what > way does such caching fail? ... > Sounds like you don't use a network file system at all, > which in itself is an interesting decision. > Care to give some reasons? A common misperception when people first see the Scyld Beowulf system is that it is based on a NFS root scheme. Using a NFS root has several problems: NFS is very slow NFS is unreliable NFS file caching has consistency and semantic problems. Instead our model is based on a ramdisk root and cache, along with using 'bproc' to migrate processes from a master node. All of the NFS problems are magnified and multiplied when working on a Beowulf cluster. Unlike a workstation network, where users are idle on average and working on different jobs, a cluster is all about hot spots. The NFS server quickly becomes a major serialization point. (The same observation is true of a NIS/Yellow-Pages server: when starting a cluster job, every processor will try to scan the password list at the same time.) While a ramdisk root initially sounds like a waste of memory, the semantics of a ramdisk fits very well with what we are doing. The Linux ramdisk code is unified with the buffer cache, rather than a separate page cache. The files cached in the root ramdisk are mostly contain hot pages on a running system: the "/", /etc, /lib and /dev directories, and the common libraries. The unified cache means that rather than costing performance by wasting 40MB of ramdisk memory, we have only have a few MB of dead pages. In some cases we have a performance improvement over a local disk root by effectively wiring down start-up library pages that tend to FIFO-thrash. > In addition to shared libraries, we also place a number of entries > for '/dev' on the nodes. > Well, now that you mention /dev, why don't you use devfs to automatically > populate your nodes' /dev directories? The devfs addition is controversial, at best. It would make our system smaller and less complex, and the drawback of retaining non-standard device ownership and permissions is much reduced on a slave node. But we don't want to introduce what some perceive as a gratuitous change on top of our required changes. Donald Becker becker at scyld.com Scyld Computing Corporation http://www.scyld.com 410 Severn Ave. Suite 210 Second Generation Beowulf Clusters Annapolis MD 21403 410-990-9993
- Previous message: HD cloning
- Next message: HD cloning
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
More information about the Beowulf mailing list
