[Pkg-ofed-devel] lenny - openmpi problems

Guy Coates gmpc at sanger.ac.uk
Tue Sep 15 15:41:44 UTC 2009


Yann JOBIC wrote:
> Guy Coates wrote:
>>
>>> I installed the package, and now i cannot ibping , and ibstat isn't 
>>> working :
>>>
>>> Lilou:~# ibstat
>>> ibpanic: [6594] main: stat of IB device 'mlx4_0' failed: (Device or 
>>> resource busy)
>>
>> You will need to reboot once the kernel module package has been 
>> installed. Assuming that you have done that, is there anything odd in 
>> /var/log/messages /
>> dmesg?
>>
>> Cheers,
>>
>> Guy
>>
> Maybe i loaded the wrongs modules ?
> 
> Lidia:~# lsmod | grep mlx
> mlx4_ib                61632  0
> ib_mad                 39336  4 ib_umad,ib_cm,ib_sa,mlx4_ib
> ib_core                70656  10 
> ib_ipoib,ib_umad,rdma_ucm,rdma_cm,ib_cm,iw_cm,ib_sa,ib_uverbs,mlx4_ib,ib_mad 
> 

Those module sizes look correct; they match what I have on my machine (you can 
double check them with modinfo if you are still unsure).

Are there any unusual messages in the kernel log when the infiniband modules are 
loaded? My machine show just these messages:


[   2.291810] mlx4_core: Mellanox ConnectX core driver v1.0 (April 4, 2008)
[    2.291810] mlx4_core: Initializing 0000:0c:00.0
[    3.861825] mlx4_core 0000:0c:00.0: Requested number of MACs is too much for 
port 1, reducing to 1.
[    5.035171] ACPI: PCI Interrupt 0000:00:1d.0[A] -> GSI 18 (level, low) -> IRQ 18
[    5.035171] PCI: Setting latency timer of device 0000:00:1d.0 to 64
[   34.998522] mlx4_ib: Mellanox ConnectX InfiniBand driver v1.0 (April 4, 2008)


Did you install the new kernel modules on both of your test hosts?

As an outside chance, have you made sure that your infiniband card firmware is 
all up to date?

Guy

-- 
Dr. Guy Coates,  Informatics System Group
The Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1HH, UK
Tel: +44 (0)1223 834244 x 6925
Fax: +44 (0)1223 496802


-- 
 The Wellcome Trust Sanger Institute is operated by Genome Research 
 Limited, a charity registered in England with number 1021457 and a 
 company registered in England with number 2742969, whose registered 
 office is 215 Euston Road, London, NW1 2BE. 



More information about the Pkg-ofed-devel mailing list