lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <91545596-f932-8834-f613-feda3edc9b84@tomt.net>
Date:   Sat, 27 Oct 2018 23:41:54 +0200
From:   Andre Tomt <andre@...t.net>
To:     Eric Dumazet <eric.dumazet@...il.com>,
        Eric Dumazet <edumazet@...gle.com>
Cc:     Stephen Hemminger <stephen@...workplumber.org>,
        netdev <netdev@...r.kernel.org>, rossi.f@...ind.it,
        Dimitris Michailidis <dmichail@...gle.com>
Subject: Re: Fw: [Bug 201423] New: eth0: hw csum failure

On 26.10.2018 13:45, Andre Tomt wrote:
> On 25.10.2018 19:38, Eric Dumazet wrote:
>>
>>
>> On 10/24/2018 12:41 PM, Andre Tomt wrote:
>>>
>>> It eventually showed up again with mlx4, on 4.18.16 + fix and also on 
>>> 4.19. I still do not have a useful packet capture.
>>>
>>> It is running a torrent client serving up various linux distributions.
>>>
>>
>> Have you also applied this fix ?
>>
>> https://git.kernel.org/pub/scm/linux/kernel/git/davem/net.git/commit/?id=db4f1be3ca9b0ef7330763d07bf4ace83ad6f913 
>>
>>
> 
> No. I've applied it now to 4.19 and will report back if anything shows up.

Just hit it on the simpler server; no VRF, no tunnels, no nat/conntrack. 
Only a basic stateless nftables ruleset and a vlan netdev (unlikely to 
be the one triggering this I guess; it has only v4 traffic).

On 4.19 + above commit:
> [158269.360271] p0xe0: hw csum failure
> [158269.360286] CPU: 3 PID: 0 Comm: swapper/3 Tainted: P           O      4.19.0-1 #1
> [158269.360287] Hardware name: Supermicro Super Server/X10SRL-F, BIOS 2.0b 05/02/2017
> [158269.360288] Call Trace:
> [158269.360290]  <IRQ>
> [158269.360295]  dump_stack+0x5c/0x7b
> [158269.360299]  __skb_gro_checksum_complete+0x9a/0xa0
> [158269.360301]  udp6_gro_receive+0x211/0x290
> [158269.360303]  ipv6_gro_receive+0x1b1/0x3a0
> [158269.360306]  ? ip_sublist_rcv_finish+0x70/0x70
> [158269.360307]  dev_gro_receive+0x3a0/0x620
> [158269.360309]  ? __build_skb+0x25/0xe0
> [158269.360310]  napi_gro_frags+0xa8/0x220
> [158269.360314]  mlx4_en_process_rx_cq+0xa01/0xb40 [mlx4_en]
> [158269.360322]  ? mlx4_cq_completion+0x23/0x70 [mlx4_core]
> [158269.360325]  ? mlx4_eq_int+0x373/0xc80 [mlx4_core]
> [158269.360327]  mlx4_en_poll_rx_cq+0x55/0xf0 [mlx4_en]
> [158269.360329]  net_rx_action+0xe0/0x2e0
> [158269.360330]  __do_softirq+0xd8/0x2ff
> [158269.360333]  irq_exit+0xbd/0xd0
> [158269.360334]  do_IRQ+0x85/0xd0
> [158269.360336]  common_interrupt+0xf/0xf
> [158269.360337]  </IRQ>
> [158269.360339] RIP: 0010:cpuidle_enter_state+0xb3/0x310
> [158269.360340] Code: 31 ff e8 e0 e0 bb ff 45 84 f6 74 17 9c 58 0f 1f 44 00 00 f6 c4 02 0f 85 3f 02 00 00 31 ff e8 64 cc c0 ff fb 66 0f 1f 44 00 00 <4c> 29 fb 48 ba cf f7 53 e3 a5 9b c4 20 48 89 d8 48 c1 fb 3f 48 f7
> [158269.360341] RSP: 0018:ffffaf28c634bea8 EFLAGS: 00000246 ORIG_RAX: ffffffffffffffd9
> [158269.360342] RAX: ffff9a9f7fae0fc0 RBX: 00008ff1f4ff622a RCX: 000000000000001f
> [158269.360343] RDX: 00008ff1f4ff622a RSI: 0000000022983893 RDI: 0000000000000000
> [158269.360343] RBP: 0000000000000001 R08: 0000000000000002 R09: 0000000000020840
> [158269.360344] R10: ffffaf28c634be88 R11: 0000000000000036 R12: ffff9a9f7fae9aa8
> [158269.360344] R13: ffffffffaa0ac638 R14: 0000000000000000 R15: 00008ff1f4f09d43
> [158269.360347]  ? cpuidle_enter_state+0x90/0x310
> [158269.360349]  do_idle+0x1d0/0x240
> [158269.360351]  cpu_startup_entry+0x5f/0x70
> [158269.360352]  start_secondary+0x185/0x1a0
> [158269.360354]  secondary_startup_64+0xa4/0xb0

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ