[scyld-users] Beoboot failure

Donald Becker becker at scyld.com
Mon Jan 30 13:34:08 PST 2006


On Fri, 27 Jan 2006, Gardner Pomper wrote:

> I am trying to get my test Scyld cluster up. I have Scyld v29cz running on
> my head node, and I am trying to get one slave node booting from a CD. The
> slave finds the master, and is assigned an IP address, but it fails with the
> following error:

What version are you using?
Are you booting from the distribution CD, or an "iso image" created from 
Beosetup?
Are you running on a x86_64 or IA32?
 
> Boot information received for interface eth0 from RARP:
>   Assigned IP address: 192.168.1.10 / 255.255.255.0
>   Server: 192.168.1.50  boot file '/var/beowulf/boot.img'
>   BProc master 192.168.1.50 port 2223,
>   VMA server 192.168.1.50 port 1556, protocol tcp.
> boot: installing module "kmonte"
> connect: no route to host
> Boot image download failure: numerical result ouf of range

Hmmm, this appears to be a routing problem.

We have seen a similar problem once before in the quick-remaster case, 
where the old default route prevents a new route from being added, even 
though the interface was shut down.  (This is a kernel routing bug that 
can be worked around by explicitly deleting the old default route.)

> I hope I got all this right, because it automatically clears the screen and
> reboots after 5 seconds and I haven't found a way to stop it.

The parameter 'fatal_delay' should set that timeout, and the default is 
normally 120 seconds.  The only five second delay is in Two Kernel Monte, 
and that shouldn't be called.  (FYI: The TKM delay may changed with the 
'bootmode' flags.  The upper 8 bits specify the delay in seconds before 
switching kernels.  This is pretty much useful only for 
debugging and developers, and isn't likely the delay here.)



-- 
Donald Becker				becker at scyld.com
Scyld Software	 			Scyld Beowulf cluster systems
914 Bay Ridge Road, Suite 220		www.scyld.com
Annapolis MD 21403			410-990-9993




More information about the Scyld-users mailing list