Transmit timed out error

Orion Poplawski OPoplawski@cqg.com
Tue Apr 4 17:32:00 2000


> From: owner-linux-eepro100@beowulf.org
>
> On Mon, Apr 03, 2000 at 11:58:23AM -0600, Orion Poplawski wrote:
> > After some period of running (5-10 minutes), the eth1 interface
> locks up.
> > Some time after that (another couple of minutes) the machine locks up
> > completely.
>
> The complete lock is a strange thing.
> It may happen that you have another problem somewhere.
>
I'll believe that.  I've testing this out on a couple other hardware
platforms using similar Intel NICs and have not seen these problems.  I'm
also seeing the machine lock up when I try to initialize a 3rd network
interface.  Not sure what to look for though.

> Please take the 1.20.2.5 driver from
> ftp://ftp.sw.com.sg/pub/Linux/people/saw/kernel/v2.2/
> and run it with debug=3 option and with debugging kernel message
> logging (in
> your syslogd/klogd system).
>

Installed.  Did not see any "transmit timed out" message this time.

Without my program running, I see the following in /var/log/debug:

Apr  4 12:48:29 btstipc kernel: eth1: Sending a multicast list set command
from a timer routine.
Apr  4 12:48:31 btstipc kernel: eth0: Sending a multicast list set command
from a timer routine.
Apr  4 12:48:31 btstipc kernel: eth1: Sending a multicast list set command
from a timer routine.
Apr  4 12:48:37 btstipc last message repeated 3 times
Apr  4 12:48:39 btstipc kernel: eth0: Sending a multicast list set command
from a timer routine.
Apr  4 12:48:39 btstipc kernel: eth1: Sending a multicast list set command
from a timer routine.

After I've started my program, I see:

Apr  4 12:49:13 btstipc kernel: eth0: Sending a multicast list set command
from a timer routine.
Apr  4 12:49:49 btstipc last message repeated 4 times
Apr  4 12:50:59 btstipc last message repeated 11 times
Apr  4 12:51:59 btstipc last message repeated 14 times
Apr  4 12:53:01 btstipc last message repeated 10 times
Apr  4 12:54:01 btstipc last message repeated 13 times
Apr  4 12:55:01 btstipc last message repeated 8 times
....


Until the eth1 interface locks up, at which point I see:

Apr  4 13:19:01 btstipc last message repeated 12 times
Apr  4 13:19:03 btstipc kernel: eth1: Sending a multicast list set command
from a timer routine.
Apr  4 13:19:05 btstipc kernel: eth1: Sending a multicast list set command
from a timer routine.
Apr  4 13:19:07 btstipc kernel: eth0: Sending a multicast list set command
from a timer routine.
Apr  4 13:19:07 btstipc kernel: eth1: Sending a multicast list set command
from a timer routine.
Apr  4 13:19:09 btstipc kernel: eth1: Sending a multicast list set command
from a timer routine.

Which makes sense I suppose, since eth1 is no longer seeing any received
packets it starts to trigger the receiver lockup fix.

Again, startup message is:

Apr  4 12:45:42 btstipc kernel: eepro100.c: Debug level is 3.
Apr  4 12:45:42 btstipc kernel: eepro100.c:v1.09j-t 9/29/99 Donald Becker
http://cesdis.gsfc.nasa.gov/linux/drivers/eepro100.html
Apr  4 12:45:42 btstipc kernel: eepro100.c: $Revision: 1.20.2.5 $ 2000/03/28
Modified by Andrey V. Savochkin <saw@saw.sw.com.sg> and others
Apr  4 12:45:42 btstipc kernel: Found Intel i82557 PCI Speedo at I/O
0xc883d000, IRQ 11.
Apr  4 12:45:42 btstipc kernel:   PCI latency timer (CFLT) is 0x40.
Apr  4 12:45:42 btstipc kernel: eth0: Intel PCI EtherExpress Pro100 at
0xc883d000, 00:50:B7:11:07:FF, IRQ 11.
Apr  4 12:45:42 btstipc kernel:   Board assembly 701738-002, Physical
connectors present: RJ45
Apr  4 12:45:42 btstipc kernel:   Primary interface chip i82555 PHY #1.
Apr  4 12:45:42 btstipc kernel:   General self-test: passed.
Apr  4 12:45:42 btstipc kernel:   Serial sub-system self-test: passed.
Apr  4 12:45:42 btstipc kernel:   Internal registers self-test: passed.
Apr  4 12:45:42 btstipc kernel:   ROM checksum self-test: passed
(0x24c9f043).
Apr  4 12:45:42 btstipc kernel:   Receiver lock-up workaround activated.
Apr  4 12:45:42 btstipc kernel: Found Intel i82557 PCI Speedo at I/O
0xc883f000, IRQ 10.
Apr  4 12:45:42 btstipc kernel:   PCI latency timer (CFLT) is 0x40.
Apr  4 12:45:42 btstipc kernel: eth1: Intel PCI EtherExpress Pro100 at
0xc883f000, 00:A0:C9:89:92:2B, IRQ 10.
Apr  4 12:45:42 btstipc kernel:   Board assembly 667280-003, Physical
connectors present: RJ45
Apr  4 12:45:42 btstipc kernel:   Primary interface chip i82555 PHY #1.
Apr  4 12:45:42 btstipc kernel:   General self-test: passed.
Apr  4 12:45:42 btstipc kernel:   Serial sub-system self-test: passed.
Apr  4 12:45:42 btstipc kernel:   Internal registers self-test: passed.
Apr  4 12:45:42 btstipc kernel:   ROM checksum self-test: passed
(0x49caa8d6).
Apr  4 12:45:42 btstipc kernel:   Receiver lock-up workaround activated.

- Orion

-------------------------------------------------------------------
To unsubscribe send a message body containing "unsubscribe"
to linux-eepro100-request@beowulf.org