[<prev] [next>] [day] [month] [year] [list]
Message-ID: <1479309161.3755.41.camel@decadent.org.uk>
Date: Wed, 16 Nov 2016 15:12:41 +0000
From: Ben Hutchings <ben@...adent.org.uk>
To: netdev <netdev@...r.kernel.org>
Cc: Panagiotis Malakoudis <malakudi@...il.com>,
Debian kernel maintainers <debian-kernel@...ts.debian.org>
Subject: Re: linux kernel 4.6 and 4.7 from backports have bug in congestion
module refcount
[Moving to netdev and debian-kernel lists]
On Wed, 2016-11-16 at 16:12 +0200, Panagiotis Malakoudis wrote:
> I am using Debian Jessie with backports and with the last kernels,
> 4.6.0-0.bpo.1 and 4.7.0-bpo.1 I experience the following WARNING after a
> few hours of heavy load tcp traffic.
>
> Nov 15 01:40:00 archive kernel: WARNING: CPU: 8 PID: 0 at /build/linux-lVEVrl/linux-4.7.8/kernel/module.c:1107 module_put+0x8d/0xa0
> Nov 15 01:40:00 archive kernel: Modules linked in: seqiv xfrm6_mode_tunnel xfrm4_mode_tunnel twofish_generic twofish_avx_x86_64 twofish_x86_64_3way twofish_x86_64 twofish_common serpent_avx2 serpent_avx_x86_64 serpent_sse2_x86_64 serpent_generic blowfish_generic blowfish_x86_64 blowfish_common cast5_avx_x86_64 cast5_generic cast_common ctr ecb des_generic cbc algif_skcipher camellia_generic camellia_aesni_avx2 camellia_aesni_avx_x86_64 camellia_x86_64 xts xcbc sha512_ssse3 sha512_generic md4 algif_hash af_alg xfrm_user xfrm4_tunnel ipcomp xfrm_ipcomp esp4 ah4 af_key xfrm_algo nfsd auth_rpcgss nfs_acl nfs lockd grace fscache sunrpc ipip tunnel4 ip_tunnel xt_TCPMSS xt_tcpmss nf_conntrack_ipv6 nf_defrag_ipv6 ip6table_filter ip6_tables xt_recent xt_tcpudp xt_policy nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack
> Nov 15 01:40:00 archive kernel: iptable_filter ip_tables x_tables xfs libcrc32c crc32c_generic intel_rapl sb_edac edac_core x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel iTCO_wdt iTCO_vendor_support jitterentropy_rng hmac drbg ansi_cprng aesni_intel ast aes_x86_64 lrw ttm gf128mul glue_helper ablk_helper cryptd drm_kms_helper pcspkr drm lpc_ich mfd_core i2c_i801 sg mei_me joydev evdev ipmi_si mei ioatdma shpchp wmi button ipmi_msghandler tpm_tis acpi_pad tpm acpi_power_meter tcp_htcp autofs4 ext4 crc16 jbd2 mbcache dm_mod md_mod hid_generic usbhid hid sd_mod crc32c_intel xhci_pci ahci xhci_hcd ehci_pci libahci ehci_hcd libata ixgbe mpt3sas igb usbcore raid_class scsi_transport_sas i2c_algo_bit usb_common dca scsi_mod ptp pps_core mdio fjes
> Nov 15 01:40:00 archive kernel: CPU: 8 PID: 0 Comm: swapper/8 Tainted: G W 4.7.0-0.bpo.1-amd64 #1 Debian 4.7.8-1~bpo8+1
> Nov 15 01:40:00 archive kernel: Hardware name: Supermicro Super Server/X10SRH-CF, BIOS 1.0c 09/14/2015
> Nov 15 01:40:00 archive kernel: 0000000000000286 c4c1e09c90bbaf17 ffffffffa3d1c805 0000000000000000
> Nov 15 01:40:00 archive kernel: 0000000000000000 ffffffffa3a7c9c4 ffff881024255000 ffffffffc032a0c0
> Nov 15 01:40:00 archive kernel: ffff881024255130 0000000000000000 ffff881024255000 ffff881024255000
> Nov 15 01:40:00 archive kernel: Call Trace:
> Nov 15 01:40:00 archive kernel: <IRQ> [<ffffffffa3d1c805>] ? dump_stack+0x5c/0x77
> Nov 15 01:40:00 archive kernel: [<ffffffffa3a7c9c4>] ? __warn+0xc4/0xe0
> Nov 15 01:40:00 archive kernel: [<ffffffffa3afe6dd>] ? module_put+0x8d/0xa0
> Nov 15 01:40:00 archive kernel: [<ffffffffa3f41ed0>] ? tcp_v4_destroy_sock+0x20/0x290
> Nov 15 01:40:00 archive kernel: [<ffffffffa3f292e7>] ? inet_csk_destroy_sock+0x47/0x160
> Nov 15 01:40:00 archive kernel: [<ffffffffa3f38c37>] ? tcp_rcv_state_process+0xcc7/0xcd0
> Nov 15 01:40:00 archive kernel: [<ffffffffa3fd14e0>] ? tcp_v4_inbound_md5_hash+0x67/0x177
> Nov 15 01:40:00 archive kernel: [<ffffffffa3f41d20>] ? tcp_v4_do_rcv+0x70/0x200
> Nov 15 01:40:00 archive kernel: [<ffffffffa3f435e2>] ? tcp_v4_rcv+0x8a2/0xa90
> Nov 15 01:40:00 archive kernel: [<ffffffffc06fac01>] ? ipv4_confirm+0x61/0xf0 [nf_conntrack_ipv4]
> Nov 15 01:40:00 archive kernel: [<ffffffffa3f1d86b>] ? ip_local_deliver_finish+0x8b/0x1c0
> Nov 15 01:40:00 archive kernel: [<ffffffffa3f1db3b>] ? ip_local_deliver+0x6b/0xf0
> Nov 15 01:40:00 archive kernel: [<ffffffffa3f1d7e0>] ? ip_rcv_finish+0x3e0/0x3e0
> Nov 15 01:40:00 archive kernel: [<ffffffffa3f1de41>] ? ip_rcv+0x281/0x3b0
> Nov 15 01:40:00 archive kernel: [<ffffffffa3f1d400>] ? inet_del_offload+0x40/0x40
> Nov 15 01:40:00 archive kernel: [<ffffffffa3edf11e>] ? __netif_receive_skb_core+0x2be/0xa50
> Nov 15 01:40:00 archive kernel: [<ffffffffa3f58865>] ? inet_gro_receive+0x265/0x2a0
> Nov 15 01:40:00 archive kernel: [<ffffffffa3edf93f>] ? netif_receive_skb_internal+0x2f/0xa0
> Nov 15 01:40:00 archive kernel: [<ffffffffa3ee088b>] ? napi_gro_receive+0xbb/0x110
> Nov 15 01:40:00 archive kernel: [<ffffffffc0332522>] ? ixgbe_clean_rx_irq+0x542/0xaf0 [ixgbe]
> Nov 15 01:40:00 archive kernel: [<ffffffffc033381c>] ? ixgbe_poll+0x44c/0x7a0 [ixgbe]
> Nov 15 01:40:00 archive kernel: [<ffffffffa3ee00e8>] ? net_rx_action+0x238/0x370
> Nov 15 01:40:00 archive kernel: [<ffffffffa3fe20b6>] ? __do_softirq+0x106/0x294
> Nov 15 01:40:00 archive kernel: [<ffffffffa3a82306>] ? irq_exit+0x86/0x90
> Nov 15 01:40:00 archive kernel: [<ffffffffa3fe1dff>] ? do_IRQ+0x4f/0xd0
> Nov 15 01:40:00 archive kernel: [<ffffffffa3fdff42>] ? common_interrupt+0x82/0x82
> Nov 15 01:40:00 archive kernel: <EOI> [<ffffffffa3ea38e2>] ? cpuidle_enter_state+0x112/0x260
> Nov 15 01:40:00 archive kernel: [<ffffffffa3abe31e>] ? cpu_startup_entry+0x2be/0x360
> Nov 15 01:40:00 archive kernel: [<ffffffffa3a4e1c1>] ? start_secondary+0x151/0x190
> Nov 15 01:40:00 archive kernel: ---[ end trace cdbbaec80305a0cc ]---
>
> Tracing the code I found that the issue comes from the usage of a tcp
> congestion module.
> I use sysctl -w net.ipv4.tcp_congestion_control=htcp, which is compiled as
> module. Default congestion algo, cubic, is compiled inside kernel, not
> module.
>
> When the warning is triggered, /sys/module/tcp_htcp/refcnt is -1, which is
> something that should not happen. It is triggered from inside
> tcp_v4_destroy_sock, when calling tcp_cleanup_congestion_control, which
> then calls module_put.
>
> This doesn't happen in 4.5.0-0.bpo.2 (2016-05-13) so I guess there is a bug
> introduced from 4.5 to 4.6 which still lives in 4.7 and decrements
> reference count of htcp congestion module when it shouldn't. The issue is
> only triggered under heavy load (the above mentioned server is a 10g SFP+
> server with more than 4 Gbps traffic at certain times of day).
I didn't see any bug fixes relating to congestion control refcounts
after 4.7, so I'm guessing this bug is still present upstream.
Ben.
> I will test with a different congestion module (tcp_highspeed) to see if
> the issue is only htcp related or it is general issue of using congestion
> tcp modules.
>
> Panagiotis Malakoudis
--
Ben Hutchings
Time is nature's way of making sure that everything doesn't happen at
once.
Download attachment "signature.asc" of type "application/pgp-signature" (802 bytes)
Powered by blists - more mailing lists