[Pkg-ofed-devel] ofa-kernel: ib_query_gid() failed
gmpc at sanger.ac.uk
Mon Oct 12 16:43:00 UTC 2009
> However, opensm fails to start which can be traced down
> to ibstat -p hanging. dmesg produces the following output upon
> /etc/init.d/opensm start:
> Oct 12 12:52:48 node1 kernel: [ 78.170077] ib0: ib_query_gid() failed
> Oct 12 12:52:58 node1 kernel: [ 89.272789] ib0: ib_query_port failed
> We dont get any other obvious dmesg errors.
> ipoib doesn't work since opensm can't be started. We didn't even try
> running openmpi in this mode since it seems pointless if ib_query_gid() fails.
> Summary: Mainline kernels from Debian (2.6.26-2-amd64 and
> 2.6.31-trunk-amd64) do work fine. opensm starts and ipoib works,
> however, XRC is missing from the kernel, which basically means
> most native infiniband applications (not using ipoib) fail.
ibstat only get used by the opensm init script to try and guess what ports to
start openSM on.
Does opensm run if you explicitly tell it to start on the port?
start-stop-daemon --start --quiet --make-pidfile \
--pidfile /var/run/opensm-$PORT --background --exec /usr/sbin/opensm \
-- -g 0x0003ba0001007209 -f /var/log/opensm.0x0003ba0001007209.log
Does ibstat hang if you do not give it the -p option? That sounds like a bug
that should be reported to the openfabrics people. (I can't reproduce it on my
setup, so it sounds like this is a hardware dependant bug.)
Dr. Guy Coates, Informatics System Group
The Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1HH, UK
Tel: +44 (0)1223 834244 x 6925
Fax: +44 (0)1223 496802
The Wellcome Trust Sanger Institute is operated by Genome Research
Limited, a charity registered in England with number 1021457 and a
company registered in England with number 2742969, whose registered
office is 215 Euston Road, London, NW1 2BE.
More information about the Pkg-ofed-devel