[<prev] [next>] [day] [month] [year] [list]
Date: Wed, 04 Apr 2012 16:47:04 +0200
From: Jean-Philippe Menil <jean-philippe.menil@...v-nantes.fr>
To: netdev@...r.kernel.org
Subject: INFO: rcu_sched detected stall on CPU 0 (with whost module)
Hi,
several times a day, i can observe the following trace:
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600004]
INFO: rcu_sched detected stall on CPU 0 (t=6000 jiffies)
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
Pid: 2487, comm: vhost-2475 Not tainted 3.2.1-dsiun-120113 #55
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
Call Trace:
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
<IRQ> [<ffffffff810ac986>] ? __rcu_pending+0x1e6/0x400
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff810ace3b>] ? rcu_check_callbacks+0x6b/0x1d0
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff810586df>] ? update_process_times+0x3f/0x80
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff8107998b>] ? tick_sched_timer+0x5b/0xb0
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff8106d099>] ? __run_hrtimer+0x69/0x1e0
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff81079930>] ? tick_nohz_handler+0xe0/0xe0
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff81009e85>] ? read_tsc+0x5/0x20
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff8106d9d5>] ? hrtimer_interrupt+0xe5/0x200
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff8101f563>] ? smp_apic_timer_interrupt+0x63/0xa0
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff813ad79e>] ? apic_timer_interrupt+0x6e/0x80
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff812f1d01>] ? __kfree_skb+0x11/0x90
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffffa00d4437>] ? bnx2_poll_work+0x247/0x1370 [bnx2]
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffffa00d4437>] ? bnx2_poll_work+0x247/0x1370 [bnx2]
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff813a6e7c>] ? put_cpu_partial+0x90/0x90
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff8111b453>] ? kmem_cache_free+0x103/0x110
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffffa00d4437>] ? bnx2_poll_work+0x247/0x1370 [bnx2]
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffffa00d5691>] ? bnx2_poll+0x61/0x264 [bnx2]
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff812febd9>] ? net_rx_action+0x119/0x260
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff8104fa1d>] ? __do_softirq+0x9d/0x1f0
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff813aef2c>] ? call_softirq+0x1c/0x30
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff813aef2c>] ? call_softirq+0x1c/0x30
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
<EOI> [<ffffffff81004725>] ? do_softirq+0x65/0xa0
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff812ff05e>] ? netif_rx_ni+0x1e/0x30
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffffa04fb5af>] ? tun_get_user+0x30f/0x4c0 [tun]
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffffa04fb780>] ? tun_sendmsg+0x20/0x30 [tun]
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffffa051426e>] ? handle_tx+0x27e/0x4f0 [vhost_net]
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffffa0511781>] ? vhost_worker+0xc1/0x150 [vhost_net]
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffffa05116c0>] ? memory_access_ok.isra.11+0xd0/0xd0 [vhost_net]
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff8106905e>] ? kthread+0x7e/0x90
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff813aee34>] ? kernel_thread_helper+0x4/0x10
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff81068fe0>] ? kthread_worker_fn+0x190/0x190
Apr 3 19:51:41 ayrshire.u06.univ-nantes.prive kernel: [1659525.600015]
[<ffffffff813aee30>] ? gs_change+0x13/0x13
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950000]
INFO: rcu_sched detected stall on CPU 0 (t=6000 jiffies)
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
Pid: 2485, comm: vhost-2475 Not tainted 3.2.1-dsiun-120113 #55
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
Call Trace:
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
<IRQ> [<ffffffff810ac986>] ? __rcu_pending+0x1e6/0x400
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff810ace3b>] ? rcu_check_callbacks+0x6b/0x1d0
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff810586df>] ? update_process_times+0x3f/0x80
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff8107998b>] ? tick_sched_timer+0x5b/0xb0
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff8106d099>] ? __run_hrtimer+0x69/0x1e0
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff81079930>] ? tick_nohz_handler+0xe0/0xe0
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff81009e85>] ? read_tsc+0x5/0x20
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff8106d9d5>] ? hrtimer_interrupt+0xe5/0x200
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff8101f563>] ? smp_apic_timer_interrupt+0x63/0xa0
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff8111b1c7>] ? unfreeze_partials+0xc7/0x250
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff813ad79e>] ? apic_timer_interrupt+0x6e/0x80
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff812f1fc5>] ? skb_release_head_state+0xa5/0x110
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff813a6e33>] ? put_cpu_partial+0x47/0x90
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff812f1fc5>] ? skb_release_head_state+0xa5/0x110
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff812f1cf9>] ? __kfree_skb+0x9/0x90
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffffa00d4437>] ? bnx2_poll_work+0x247/0x1370 [bnx2]
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffffa00d5691>] ? bnx2_poll+0x61/0x264 [bnx2]
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff8134e3d0>] ? tcp_init_xmit_timers+0x20/0x20
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff812febd9>] ? net_rx_action+0x119/0x260
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff8104fa1d>] ? __do_softirq+0x9d/0x1f0
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff813aef2c>] ? call_softirq+0x1c/0x30
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
<EOI> [<ffffffff81004725>] ? do_softirq+0x65/0xa0
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff812ff05e>] ? netif_rx_ni+0x1e/0x30
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffffa04fb5af>] ? tun_get_user+0x30f/0x4c0 [tun]
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffffa04fb780>] ? tun_sendmsg+0x20/0x30 [tun]
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffffa051426e>] ? handle_tx+0x27e/0x4f0 [vhost_net]
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffffa0511781>] ? vhost_worker+0xc1/0x150 [vhost_net]
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffffa05116c0>] ? memory_access_ok.isra.11+0xd0/0xd0 [vhost_net]
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff8106905e>] ? kthread+0x7e/0x90
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff813aee34>] ? kernel_thread_helper+0x4/0x10
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff81068fe0>] ? kthread_worker_fn+0x190/0x190
Apr 3 19:59:23 ayrshire.u06.univ-nantes.prive kernel: [1659986.950014]
[<ffffffff813aee30>] ? gs_change+0x13/0x13
This host run several kvm (with multiple tun/tap devices attached on
bridge) with the vhost module with experimental_zcopytx to 1
Both eth0 and eth2 are BCM5708S, with followinf parameter:
root@...shire:~# ethtool -k eth0
Offload parameters for eth0:
rx-checksumming: on
tx-checksumming: on
scatter-gather: on
tcp-segmentation-offload: on
udp-fragmentation-offload: off
generic-segmentation-offload: on
generic-receive-offload: on
large-receive-offload: off
ntuple-filters: off
receive-hashing: on
root@...shire:~# ethtool -i eth0
driver: bnx2
version: 2.1.11
firmware-version: 5.2.7 bc 5.0.5
bus-info: 0000:04:00.0
Is it a know issue?
Regards
--
Jean-Philippe Menil - Pôle réseau Service IRTS
DSI Université de Nantes
jean-philippe.menil@...v-nantes.fr
Tel : 02.53.48.49.27 - Fax : 02.53.48.49.09
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists