lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Message-ID: <201108261709.10119.jlieb@panasas.com>
Date:	Fri, 26 Aug 2011 17:09:09 -0700
From:	Jim Lieb <jlieb@...asas.com>
To:	<netdev@...r.kernel.org>
Subject: Mellanox mlx4_en update

I am having trouble getting the mlx4_en driver to work on 3.0.1 kernels.  On 
the advice of Yevgeny, I have pulled patches from net-next and applied them to 
a 3.0.1 stable tree which compiles and runs just fine except for the driver.  
What I know so far:

* The driver in the mlnx_en-1.5.6.tgz tarball from the Mellanox download site 
works just fine with Fedora 12/RHEL6 and earlier kernels.

* Mainline and stable + patches don't on 3.0.1.  See commands and logs below.

* A diff between the two (~15k lines) shows significant differences (mlx4_en is a 
newer version, 1.5.6)

* udev seems to want to install mlx4_core alone but removing and re-installing 
via modprobe of mlx4_en does the loading properly but the driver doesn't init 
properly as shown in the logs.

See the end of the log for a git log short of the patches on top of 3.0.1.

post-boot, the driver had not fully loaded so I removed the only module that 
did load.  I did not show the boot time dmesg but it is identical to the first 
part in the modprobe install log below.

The base install is Fedora 12 with updates on an x86_64 Westmere class server.  
Everything minus 10Ge worked for extensive internal testing.

Am I missing patches?  Might the firmware be out of date?  Note that it works 
w/ F12|RHEL[4-6].

If I can get working patches/firmware/??? I can repackage kernel RPMs so our 
lab folks can deploy this.

Thanks

Jim
Please CC me on the reply.

[root@...twin-22a-boot ~]# modprobe -r mlx4_core
[root@...twin-22a-boot ~]# cat /proc/modules |grep mlx
[root@...twin-22a-boot ~]# tail /var/log/messages
Aug 25 16:13:18 ca-twin-22a-boot sudo:    jlieb : TTY=pts/0 ; 
PWD=/net/nfs.panwest.panasas.com/home/jlieb ; USER=root ; COMMAND=/bin/bash
Aug 25 16:13:56 ca-twin-22a-boot kernel: [  219.388210] mlx4_core 
0000:08:00.0: PCI INT A disabled
Aug 25 16:13:56 ca-twin-22a-boot kernel: [  219.402558] mlx4_core 
0000:0a:00.0: PCI INT A disabled


[root@...twin-22a-boot ~]# modprobe mlx4_en
Aug 25 16:15:17 ca-twin-22a-boot kernel: [  300.481764] mlx4_core: Mellanox 
ConnectX core driver v1.0 (July 14, 2011)
Aug 25 16:15:17 ca-twin-22a-boot kernel: [  300.481769] mlx4_core: 
Initializing 0000:0a:00.0
Aug 25 16:15:17 ca-twin-22a-boot kernel: [  300.481789] mlx4_core 
0000:0a:00.0: PCI INT A -> GSI 26 (level, low) -> IRQ 26
Aug 25 16:15:19 ca-twin-22a-boot kernel: [  302.511615] mlx4_core 
0000:0a:00.0: Sense command failed for port: 1
Aug 25 16:15:19 ca-twin-22a-boot kernel: [  302.511856] mlx4_core: 
Initializing 0000:08:00.0
Aug 25 16:15:19 ca-twin-22a-boot kernel: [  302.511874] mlx4_core 
0000:08:00.0: PCI INT A -> GSI 30 (level, low) -> IRQ 30
Aug 25 16:15:21 ca-twin-22a-boot kernel: [  305.045626] mlx4_core 
0000:08:00.0: Sense command failed for port: 1
Aug 25 16:15:21 ca-twin-22a-boot kernel: [  305.045829] mlx4_core 
0000:08:00.0: Sense command failed for port: 2
Aug 25 16:15:21 ca-twin-22a-boot kernel: [  305.104652] mlx4_en: Mellanox 
ConnectX HCA Ethernet driver v1.5.4.1 (March 2011)
Aug 25 16:15:21 ca-twin-22a-boot kernel: [  305.105401] mlx4_en 0000:0a:00.0: 
UDP RSS is not supported on this device.
Aug 25 16:15:21 ca-twin-22a-boot kernel: [  305.105784] mlx4_en 0000:08:00.0: 
UDP RSS is not supported on this device.
Aug 25 16:15:21 ca-twin-22a-boot kernel: [  305.105838] mlx4_en 0000:08:00.0: 
Activating port:1
Aug 25 16:15:21 ca-twin-22a-boot kernel: [  305.107174] mlx4_en: 0000:08:00.0: 
Port 1: Using 8 TX rings
Aug 25 16:15:21 ca-twin-22a-boot kernel: [  305.107177] mlx4_en: 0000:08:00.0: 
Port 1: Using 16 RX rings
Aug 25 16:15:21 ca-twin-22a-boot kernel: [  305.107319] mlx4_en: 0000:08:00.0: 
Port 1: Initializing port
Aug 25 16:15:21 ca-twin-22a-boot kernel: [  305.129594] udev: renamed network 
interface eth2 to eth3
Aug 25 16:15:22 ca-twin-22a-boot kernel: [  305.594246] mlx4_en 0000:08:00.0: 
Activating port:2
Aug 25 16:15:22 ca-twin-22a-boot kernel: [  305.953213] mlx4_en: eth3: Failed 
to allocate RSS indirection QP
Aug 25 16:15:22 ca-twin-22a-boot kernel: [  305.954608] mlx4_en: eth3: Failed 
configuring rss steering
Aug 25 16:15:22 ca-twin-22a-boot kernel: [  305.988594] mlx4_en: eth3: Failed 
starting port:1
Aug 25 16:15:22 ca-twin-22a-boot kernel: [  305.989008] mlx4_en: 0000:08:00.0: 
Port 2: Using 8 TX rings
Aug 25 16:15:22 ca-twin-22a-boot kernel: [  305.989012] mlx4_en: 0000:08:00.0: 
Port 2: Using 16 RX rings
Aug 25 16:15:22 ca-twin-22a-boot kernel: [  305.989107] mlx4_en: 0000:08:00.0: 
Port 2: Initializing port
Aug 25 16:15:22 ca-twin-22a-boot kernel: [  306.022225] udev: renamed network 
interface eth2 to eth4


[root@...twin-22a-boot ~]# ifconfig -a
eth0      Link encap:Ethernet  HWaddr E0:CB:4E:64:EA:89  
          inet addr:10.6.2.143  Bcast:10.6.2.255  Mask:255.255.255.0
          UP BROADCAST MULTICAST  MTU:1500  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:0 (0.0 b)  TX bytes:0 (0.0 b)
          Interrupt:16 Memory:fbce0000-fbd00000 

eth1      Link encap:Ethernet  HWaddr E0:CB:4E:64:EA:88  
          inet addr:10.6.1.143  Bcast:10.6.1.255  Mask:255.255.255.0
          inet6 addr: fe80::e2cb:4eff:fe64:ea88/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:2066 errors:0 dropped:0 overruns:0 frame:0
          TX packets:1558 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:560875 (547.7 KiB)  TX bytes:160892 (157.1 KiB)
          Interrupt:17 Memory:fbbe0000-fbc00000 

eth3      Link encap:Ethernet  HWaddr 00:02:C9:08:14:B6  
          BROADCAST MULTICAST  MTU:1500  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:0 (0.0 b)  TX bytes:0 (0.0 b)

eth4      Link encap:Ethernet  HWaddr 00:02:C9:08:14:B7  
          BROADCAST MULTICAST  MTU:1500  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:0 (0.0 b)  TX bytes:0 (0.0 b)

lo        Link encap:Local Loopback  
          inet addr:127.0.0.1  Mask:255.0.0.0
          inet6 addr: ::1/128 Scope:Host
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
          RX packets:82 errors:0 dropped:0 overruns:0 frame:0
          TX packets:82 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0 
          RX bytes:5500 (5.3 KiB)  TX bytes:5500 (5.3 KiB)

Scripts etc. in the fedora install:

[root@...twin-22a-boot network-scripts]# less ifcfg-eth3
# Please read /usr/share/doc/initscripts-*/sysconfig.txt
# for the documentation of these parameters.
GATEWAY=10.6.2.254
DNS1=172.17.132.60
DEVICE=eth3
BOOTPROTO=static
NETMASK=255.255.255.0
DNS2=172.17.132.29
TYPE=Ethernet
HWADDR=00:02:c9:08:14:b6
IPADDR=10.6.2.143
[root@...twin-22a-boot network-scripts]# host ca-twin-22a
ca-twin-22a.calab.panasas.com has address 10.6.2.143

restart the network

[root@...twin-22a-boot network-scripts]# service network restart
Shutting down interface eth0:  [  OK  ]
Shutting down interface eth1:  [  OK  ]
Shutting down loopback interface:  [  OK  ]
Bringing up loopback interface:  [  OK  ]
Bringing up interface eth0:  [  OK  ]
Bringing up interface eth1:  [  OK  ]
Bringing up interface eth3:  RTNETLINK answers: Bad file descriptor
Failed to bring up eth3.
[FAILED]

What the log shows during the restart:

Aug 25 16:17:26 ca-twin-22a-boot ntpd[2586]: Deleting interface #2 eth1, 
fe80::e2cb:4eff:fe64:ea88#123, interface stats: received=0, sent=0, dropped=0, 
active_time=273 secs
Aug 25 16:17:26 ca-twin-22a-boot ntpd[2586]: Deleting interface #5 eth0, 
10.6.2.143#123, interface stats: received=0, sent=0, dropped=0, 
active_time=273 secs
Aug 25 16:17:26 ca-twin-22a-boot ntpd[2586]: Deleting interface #6 eth1, 
10.6.1.143#123, interface stats: received=14, sent=14, dropped=0, 
active_time=273 secs
Aug 25 16:17:30 ca-twin-22a-boot avahi-daemon[2237]: Joining mDNS multicast 
group on interface eth0.IPv4 with address 10.6.2.143.
Aug 25 16:17:30 ca-twin-22a-boot avahi-daemon[2237]: New relevant interface 
eth0.IPv4 for mDNS.
Aug 25 16:17:30 ca-twin-22a-boot avahi-daemon[2237]: Registering new address 
record for 10.6.2.143 on eth0.IPv4.
Aug 25 16:17:30 ca-twin-22a-boot kernel: [  433.835452] ADDRCONF(NETDEV_UP): 
eth1: link is not ready
Aug 25 16:17:31 ca-twin-22a-boot ntpd[2586]: Listening on interface #7 eth0, 
10.6.2.143#123 Enabled
Aug 25 16:17:33 ca-twin-22a-boot kernel: [  436.917985] e1000e: eth1 NIC Link 
is Up 1000 Mbps Full Duplex, Flow Control: None
Aug 25 16:17:33 ca-twin-22a-boot kernel: [  436.919503] 
ADDRCONF(NETDEV_CHANGE): eth1: link becomes ready
Aug 25 16:17:35 ca-twin-22a-boot avahi-daemon[2237]: Registering new address 
record for fe80::e2cb:4eff:fe64:ea88 on eth1.*.
Aug 25 16:17:35 ca-twin-22a-boot avahi-daemon[2237]: Joining mDNS multicast 
group on interface eth1.IPv4 with address 10.6.1.143.
Aug 25 16:17:35 ca-twin-22a-boot avahi-daemon[2237]: New relevant interface 
eth1.IPv4 for mDNS.
Aug 25 16:17:35 ca-twin-22a-boot avahi-daemon[2237]: Registering new address 
record for 10.6.1.143 on eth1.IPv4.
Aug 25 16:17:35 ca-twin-22a-boot kernel: [  438.989438] mlx4_core 
0000:08:00.0: Failed to bring QP to state: 1 with error: -9
Aug 25 16:17:35 ca-twin-22a-boot kernel: [  438.990609] mlx4_en: eth3: Failed 
configuring rss steering
Aug 25 16:17:35 ca-twin-22a-boot kernel: [  439.026160] mlx4_en: eth3: Failed 
starting port:1
Aug 25 16:17:37 ca-twin-22a-boot ntpd[2586]: Listening on interface #8 eth1, 
fe80::e2cb:4eff:fe64:ea88#123 Enabled
Aug 25 16:17:37 ca-twin-22a-boot ntpd[2586]: Listening on interface #9 eth1, 
10.6.1.143#123 Enabled


git short log...

commit 31a5037e908bf73ac8c185ebd58727ed4af50785
Author: Yevgeny Petrilin <yevgenyp@...lanox.co.il>

    mlx4: decreasing ref count when removing mac

commit 2c0ff8f3327ec1fcf4d184ea49281559955ffa59
Author: Yevgeny Petrilin <yevgenyp@...lanox.co.il>

    mlx4: Fixing Ethernet unicast packet steering

commit d540b5c19da7e3e3fb17dcf508fe4674d915bb76
Author: Dotan Barak <dotanb@....mellanox.co.il>

    mlx4_core: Bump the driver version to 1.0

commit f2a0b4681c54bf7915e3ce65f1496eef9792fe00
Author: Jiri Pirko <jpirko@...hat.com>

    mlx4: do vlan cleanup

commit 9a3d46b2b017fface2355d150fb5cbf798066dd0
Author: Or Gerlitz <ogerlitz@...lanox.com>

    mlx4_core: Add network flow counters

commit ac7af6feb51a5d5243e306b7ab1ef89436aa1756
Author: Or Gerlitz <ogerlitz@...lanox.com>

    mlx4_core: Read extended capabilities into the flags field

commit 878184cd92ee22314f489392698972f8167ccd5d
Author: Or Gerlitz <ogerlitz@...lanox.com>

    mlx4_core: Extend capability flags to 64 bits

commit 348a8184aaad001bf85d7be4ae5d9bd4a176aae3
Author: Jon Mason <jdmason@...zu.us>

    mlx4: remove unnecessary read of PCI_CAP_ID_EXP

commit c27c320827959efd1caf2f89c7991bffe5051d13
Author: Sergei Shtylyov <sshtylyov@...mvista.com>

    mlx4: use pci_dev->revision

commit e2074fa29c96b799dc9db0880c7bb6a850b77220
Author: Joe Perches <joe@...ches.com>

    drivers/net: Remove casts of void *

commit 94ed5b4788a7cdbe68bc7cb8516972cbebdc8274
Author: Greg Kroah-Hartman <gregkh@...e.de>

    Linux 3.0.1


-- 
Jim Lieb
Linux Systems Engineer
Panasas Inc.
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ