Hi,
Excuse me, I am not very good at English. We have a Lustre cluster with an IB network.
1. The MGS is configured as shown below; I think it can be accessed over both tcp and o2ib.
$ more /etc/modprobe.d/lustre.conf
options lnet networks=o2ib0(ib0),tcp0(ib0)
$ tunefs.lustre /dev/mapper/mdt1
checking for existing Lustre data: found CONFIGS/mountdata
Reading CONFIGS/mountdata
Read previous values:
Target: majorbio-MDT0000
Index: 0
Lustre FS: majorbio
Mount type: ldiskfs
Flags: 0x5
(MDT MGS )
Persistent mount opts: user_xattr,errors=remount-ro
Parameters: mdt.identity_upcall=NONE mgsnode=192.168.10.110@o2ib,192.168.10.110@tcp
Permanent disk data:
Target: majorbio-MDT0000
Index: 0
Lustre FS: majorbio
Mount type: ldiskfs
Flags: 0x5
(MDT MGS )
Persistent mount opts: user_xattr,errors=remount-ro
Parameters: mdt.identity_upcall=NONE mgsnode=192.168.10.110@o2ib,192.168.10.110@tcp
exiting before disk write.
A client on the IB network can run the command "mount -t lustre
192.168.10.110@tcp:/majorbio /mnt/lustre" successfully.
Then I configured the IB network with VMware ESXi as described in this article:
http://www.bussink.ch/?p=1306 . VMware ESXi presents the IB network as an Ethernet
device.
On the virtual machine (CentOS 6.6), I installed the lustre and lnet modules:
$ lctl ping 192.168.10.110@o2ib
failed to ping 192.168.10.110@o2ib: Input/output error
$ lctl ping 192.168.10.110@tcp
12345-0@lo
192.168.10.110@o2ib
192.168.10.110@tcp
The problem is that mounting the Lustre file system on the virtual machine fails:
$ mount.lustre 192.168.10.110@tcp:/majorbio /mnt/lustre/
mount.lustre: mount 192.168.10.110@tcp:/majorbio at /mnt/lustre failed: No such file or directory
Is the MGS specification correct?
Is the filesystem name correct?
If upgrading, is the copied client log valid? (see upgrade docs)
$ tail -f /var/log/messages
Jul 22 15:01:27 localhost kernel: LustreError: 1517:0:(ldlm_lib.c:433:client_obd_setup()) can't add initial connection
Jul 22 15:01:27 localhost kernel: LustreError: 1517:0:(obd_config.c:572:class_setup()) setup majorbio-MDT0000-mdc-ffff88007a751000 failed (-2)
Jul 22 15:01:27 localhost kernel: LustreError: 1517:0:(obd_config.c:1591:class_config_llog_handler()) MGC192.168.10.110@tcp: cfg command failed: rc = -2
Jul 22 15:01:27 localhost kernel: Lustre: cmd=cf003 0:majorbio-MDT0000-mdc 1:majorbio-MDT0000_UUID 2:192.168.10.110@o2ib
Jul 22 15:01:27 localhost kernel: LustreError: 15c-8: MGC192.168.10.110@tcp: The configuration from log 'majorbio-client' failed (-2). This may be the result of communication errors between this node and the MGS, a bad configuration, or other errors. See the syslog for more information.
Jul 22 15:01:27 localhost kernel: LustreError: 1508:0:(llite_lib.c:1042:ll_fill_super()) Unable to process log: -2
Jul 22 15:01:27 localhost kernel: LustreError: 1508:0:(obd_config.c:619:class_cleanup()) Device 3 not setup
Jul 22 15:01:27 localhost kernel: Lustre: Unmounted majorbio-client
Jul 22 15:01:27 localhost kernel: LustreError: 1508:0:(obd_mount.c:1320:lustre_fill_super()) Unable to mount (-2)
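One detail in the log above may be worth noting: the failed setup command (cmd=cf003) names 192.168.10.110@o2ib, while the mount command uses 192.168.10.110@tcp, and this client cannot ping the o2ib NID. The short shell sketch below only makes that comparison explicit. nid_net is a hypothetical helper written for this illustration, not a Lustre command, and the sketch does not by itself establish the cause of the failure.

```shell
# Sketch only: compare the LNet network of two NIDs copied from the output above.
# nid_net is a hypothetical helper, not part of Lustre.
nid_net() {
    # Print everything after the last '@', e.g. "o2ib" for 192.168.10.110@o2ib.
    printf '%s\n' "${1##*@}"
}

# NID named in the failing setup line (cmd=cf003 ... 2:192.168.10.110@o2ib)
setup_nid="192.168.10.110@o2ib"
# NID given on the mount command line
mount_nid="192.168.10.110@tcp"

if [ "$(nid_net "$setup_nid")" != "$(nid_net "$mount_nid")" ]; then
    echo "setup NID is on $(nid_net "$setup_nid"), mount NID is on $(nid_net "$mount_nid")"
fi
```

If the two networks differ, as they do here, the client is being asked to connect over a network it may not have configured.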
Thanks for any help!
zhaohua.cui(a)majorbio.com