Main Menu

Search

INFINIBAND: How To Use "ibping" Command To Troubleshooting Infiniband (IB) Network Issues?

"ibping" command will be helpful to troubleshooting Infiniband network issues between two or more hosts. "ibping" command is a client-server command, one one host ibping is executed as a server (using -S option) where it listens for pings from clients. On other host ibping is executed client pinging the server node using LID. Output is similar to the Internet Protocol ping command. The ping repeats every second, until the Ctrl-C key combination exits on the server.

Below are steps to do ibping tests between two hosts.

1) Gather the Port GUID's and Base LIDs of IB ports on both the hosts using below command.

ibstat | egrep "Port|Base"

Below is example output of above command.
    Port 1:
                Base lid: 18
                Port GUID: 0xXXXXXX5ff9e65a9d
        Port 2:
                Base lid: 19
                Port GUID: 0xXXXXXX5ff9e65a9e

2) On on one host run ibping as Server using -S option as follows:
ibping -S -P <IB Port> -d

In above command change IB Port with the IB port number on which you want to do ping test. -d is the debug messages flag.

For e.g. if you want to do ping test on Port 1, your server host side command looks as follows:

ibping -S -P 1 -d

3) Now on other host run the ibping command as follows to connect to Base LID of host in above step 2 where ibping is running in server mode. Replace the IB port to IB port number on client host which you want to use to ping ibping server host.

ibping -P <IB Port> <Base LID of ibping server>

For e.g. if Base LID of ibping server host is 19 and you want to do ibping test using Port 1 of cient host, your command looks as follows:

ibping -P 1 19

Below are example outputs from ibping server and ibping client hosts

On ibping client side we see resonses from ibpingserverhost as follows if there are no issues.

#ibping -P 1 19
Pong from ibpingserverhost.testing.com (Lid 19): time 0.358 ms
Pong from ibpingserverhost.testing.com (Lid 19): time 0.479 ms
Pong from ibpingserverhost.testing.com (Lid 19): time 0.390 ms
Pong from ibpingserverhost.testing.com (Lid 19): time 0.500 ms
Pong from ibpingserverhost.testing.com (Lid 19): time 0.377 ms

On Server side we see ibping responses debug messages as follows if there are no issues.

# ibping -S -P 2 -d
ibdebug: [14359] ibping_serv: starting to serve...
ibdebug: [14359] ibping_serv: Pong: ibpingserverhost.testing.com
ibwarn: [14359] mad_respond_via: dest Lid 20
ibwarn: [14359] mad_respond_via: qp 0x1 class 0x32 method 129 attr 0x0 mod 0x0 datasz 0 off 0 qkey 80010000
ibdebug: [14359] ibping_serv: Pong: ibpingserverhost.testing.com
ibwarn: [14359] mad_respond_via: dest Lid 20
ibwarn: [14359] mad_respond_via: qp 0x1 class 0x32 method 129 attr 0x0 mod 0x0 datasz 0 off 0 qkey 80010000
ibdebug: [14359] ibping_serv: Pong: ibpingserverhost.testing.com
ibwarn: [14359] mad_respond_via: dest Lid 20
ibwarn: [14359] mad_respond_via: qp 0x1 class 0x32 method 129 attr 0x0 mod 0x0 datasz 0 off 0 qkey 80010000

NOTE: Please note that if you started ibping server connection on Port 1 and if you are tring to connect to ibping server on Port 2 Base LID, the test will fail.

Products to which Article Applies

All hosts using Infiniband.

Additional Reference

https://docs.oracle.com/cd/E19632-01/835-0783-03/bbggcjji.html

tarun boyella


No comments:

Post a Comment