clustermatic installation/loading of boot kernel

lothar at triumf.ca
Thu Dec 12 10:10:49 PST 2002


Erik A. Hendriks wrote:

>On Thu, Dec 12, 2002 at 09:22:43AM -0800, lothar at triumf.ca wrote:
>  
>
>>Hi,
>>I am trying to set up a clustermatic/bproc based system. I installed
>>your latest version of the CD-ROM and made boot images for a floppy
>>and for later loading. /etc/rc.d/init.d/beowulf starts without
>>problems. I have connected a video card and monitor to one of the
>>diskless, floppy-only slaves. The Ethernet cards on the slave and the
>>master are both 3com905. When I boot up, the floppy installation
>>seems to run flawlessly until it makes contact with the master. When
>>loading /var/beowulf/boot.img it starts to spill out messages of the
>>form
>>missed block nnnn (e.g. 1340)
>>for a while; later it switches to
>>rcv  /var/beowulf/boot.img
>>followed occasionally by missed block messages.
>>
>>Being busy with other things, I left it for two days.
>>Amazingly enough this morning it had successfully booted.
>>
>>What is going on?
>>    
>>
>
>Most likely your network switch is dropping a lot of the multicast
>traffic.  In my experience pretty much all managed switches can't
>handle even a few megabits per second of multicast traffic.
>
>The solution to this is to switch it to using broadcast instead of
>multicast.  Put the following in /etc/beowulf/config:
>
>mcastbcast ethX         # switch image service on ethX to broadcast
>
>You can also throttle the boot image transmits like this:
>
>mcastthrottle ethX NN   # throttle multicast/broadcast on ethX to NN megabits/sec.
>
>- Erik
>
>  
>
I put these commands into /etc/beowulf/config, with throttling set to
1 megabit/s.
I restarted /etc/rc.d/init.d/beowulf.
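
For reference, the two added lines would look roughly like this. The
interface name eth0 is an assumption (the actual interface is not stated
here); the directives and the 1 megabit/s value are as described above:

mcastbcast eth0         # eth0 is assumed; switch image service to broadcast
mcastthrottle eth0 1    # eth0 is assumed; throttle to 1 megabit/sec.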

Unfortunately the same messages appear.

Lothar



