[Beowulf] InfiniBand VL15 error


Prentice Bisbal prentice at ias.edu
Tue Dec 2 13:33:20 PST 2008


Greg Lindahl wrote:
> On Tue, Dec 02, 2008 at 10:24:15AM -0500, Prentice Bisbal wrote:
> 
>> #warn: counter VL15Dropped = 476        (threshold 100) lid 1 port 1
>> Error check on lid 1 (aurora HCA-1) port 1:  FAILED
> 
> IB is blissfully fading from my brain, but I think this refers to
> control packets being dropped due to resource limits on the recipient.
> That takes talent if you're using a Mellanox HCA, as pretty much all
> of the VL15 packets are interpreted by the processor in the HCA.
> 
> -- greg
> 
> 

Just my luck. I'm using Cisco HCAs, which are really Mellanox HCAs:

# lspci | grep Infini
0b:00.0 InfiniBand: Mellanox Technologies MT25208 InfiniHost III Ex (Tavor compatibility mode) (rev 20)

Fortunately, Gilad from Mellanox has offered me some assistance off-list.
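
For anyone who hits the same warning and wants to track it across many nodes, here is a minimal Python sketch that pulls the fields out of a line in the format quoted above. The function name parse_warning and the regular expression are illustrative assumptions, not part of any InfiniBand tool, and other versions of the diagnostics may format the line differently.

```python
import re

# Matches the warning format quoted in this thread. This pattern is an
# assumption based on that single line; adjust it if your tool prints
# the warning differently.
WARN_RE = re.compile(
    r"#warn: counter (?P<name>\w+) = (?P<value>\d+)\s+"
    r"\(threshold (?P<threshold>\d+)\) lid (?P<lid>\d+) port (?P<port>\d+)"
)


def parse_warning(line):
    """Return the fields of one '#warn: counter' line as a dict, or None."""
    match = WARN_RE.search(line)
    if match is None:
        return None
    value = int(match.group("value"))
    threshold = int(match.group("threshold"))
    return {
        "name": match.group("name"),
        "value": value,
        "threshold": threshold,
        "lid": int(match.group("lid")),
        "port": int(match.group("port")),
        # How far the counter is over the reporting threshold.
        "excess": value - threshold,
    }


line = "#warn: counter VL15Dropped = 476        (threshold 100) lid 1 port 1"
result = parse_warning(line)
print(result)
```

Because the pattern is applied with search rather than match, it also works on lines that carry a quoting prefix such as ">> ".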

-- 
Prentice


