Message-ID: <1273128429.2304.5.camel@edumazet-laptop>
Date:	Thu, 06 May 2010 08:47:09 +0200
From:	Eric Dumazet <eric.dumazet@...il.com>
To:	Martín Ferrari <martin.ferrari@...il.com>
Cc:	netdev <netdev@...r.kernel.org>,
	Mathieu Lacage <mathieu.lacage@...hia.inria.fr>,
	David Miller <davem@...emloft.net>
Subject: Re: kernel panic when using netns+bridges+tc(netem)

On Thursday, 06 May 2010 at 08:40 +0200, Eric Dumazet wrote:
> On Thursday, 06 May 2010 at 03:01 +0200, Martín Ferrari wrote:
> > Hi there,
> > 
> > While working on my project that uses netns, I found another bug. This
> > one causes a "Kernel panic - not syncing: Fatal exception in
> > interrupt", and I can reproduce it in 2.6.33 and 2.6.34-rc5, but not
> > in 2.6.32. It dies during a call to __kfree_skb.
> > I tested this on my x86_64 laptop (2 cores) and on qemu. In qemu it
> > was not triggered until I asked it to emulate 2 CPUs instead of one,
> > so it is probably an SMP-only issue.
> > 
> > Scenario:
> > 
> > I set up a number of network namespaces, each with two veths to netns
> > 1. In the main namespace I take those veths and bridge them in pairs,
> > to configure a linear topology; also I configure the netem qdisc to
> > simulate link delay.
> > 
> > Once the network is set up, I run a client/server program to send UDP
> > packets from one end of the topology to the other. After a few seconds
> > of sending packets (not really deterministic) it panics.
> > 
> > Note that I didn't experience this problem when using only 2
> > namespaces (so, no routing)
> > 
> > The dumps are below. They all come from qemu, as I couldn't use
> > netconsole on the network at work, but I checked and the backtraces
> > on the laptop were essentially the same.
> > 
> > First, two panics with 2.6.34-rc5, each with a slightly different backtrace:
> > 
> > [   65.272131] ------------[ cut here ]------------
> > [   65.272135] kernel BUG at mm/slub.c:2846!
> > [   65.272135] invalid opcode: 0000 [#1] SMP
> > [   65.272135] last sysfs file: /sys/devices/virtual/net/lo/operstate
> > [   65.272135] CPU 1
> > [   65.272135] Modules linked in: sch_netem veth bridge stp netconsole
> > configfs loop parport_pc parport evdev tpm_tis tpm snd_pcm tpm_bios
> > snd_timer snd soundcore snd_page_alloc pcspkr psmouse serio_raw
> > i2c_piix4 button i2c_core processor ext3 jbd mbcache ide_cd_mod cdrom
> > ide_gd_mod ata_generic ata_piix libata 8139too scsi_mod floppy piix
> > 8139cp mii ide_core thermal thermal_sys [last unloaded: configfs]
> > [   65.272135]
> > [   65.272135] Pid: 1518, comm: udp-perf Not tainted 2.6.34-rc5 #1 /
> > [   65.272135] RIP: 0010:[<ffffffff810e0d6b>]  [<ffffffff810e0d6b>]
> > kfree+0x55/0xc6
> > [   65.272135] RSP: 0018:ffff880001a23d90  EFLAGS: 00010246
> > [   65.272135] RAX: 0100000000000000 RBX: ffff88007d6bc600 RCX: 0000000000012850
> > [   65.272135] RDX: ffff88007d6bc600 RSI: 000000000000000e RDI: ffffea0001b6f610
> > [   65.272135] RBP: ffff88007d6ae200 R08: ffff88007d6bc600 R09: ffffffffa0280690
> > [   65.272135] R10: 0000000000000002 R11: ffff88007d6bc500 R12: ffffffff8123a77f
> > [   65.272135] R13: 000000000000002b R14: ffff88007d39b600 R15: ffff88007d6bc600
> > [   65.272135] FS:  00007f637c9dd6f0(0000) GS:ffff880001a20000(0000)
> > knlGS:0000000000000000
> > [   65.272135] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> > [   65.272135] CR2: 00000000009deaa0 CR3: 000000007d82d000 CR4: 00000000000006e0
> > [   65.272135] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> > [   65.272135] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> > [   65.272135] Process udp-perf (pid: 1518, threadinfo
> > ffff88007d92a000, task ffff88007e7ff810)
> > [   65.272135] Stack:
> > [   65.272135]  ffff88007d6bc600 ffff88007d6bc600 0000000000000246
> > ffffffff8123a77f
> > [   65.272135] <0> ffff880001a32860 ffffffff81241e01 ffffe8ffff623280
> > ffffe8ffff83ffc0
> > [   65.272135] <0> ffff88007d6bc600 ffffffffa028057d 000000027d4c06c8
> > ffff88007d4c0600
> > [   65.272135] Call Trace:
> > [   65.272135]  <IRQ>
> > [   65.272135]  [<ffffffff8123a77f>] ? __kfree_skb+0x11/0x7d
> > [   65.272135]  [<ffffffff81241e01>] ? netif_rx+0xe2/0xee
> > [   65.272135]  [<ffffffffa028057d>] ? veth_xmit+0x6e/0xad [veth]
> > [   65.272135]  [<ffffffff8124301f>] ? dev_hard_start_xmit+0x221/0x301
> > [   65.272135]  [<ffffffff81256d9a>] ? sch_direct_xmit+0x5b/0x15d
> > [   65.272135]  [<ffffffff81256f55>] ? __qdisc_run+0xb9/0xd8
> > [   65.272135]  [<ffffffff81240511>] ? net_tx_action+0xd6/0x149
> > [   65.272135]  [<ffffffff8104ba02>] ? __do_softirq+0xdd/0x19f
> > [   65.272135]  [<ffffffff8101e515>] ? lapic_next_event+0x18/0x1d
> > [   65.272135]  [<ffffffff810099dc>] ? call_softirq+0x1c/0x30
> > [   65.272135]  [<ffffffff8100b863>] ? do_softirq+0x3f/0x79
> > [   65.272135]  [<ffffffff8104b88e>] ? irq_exit+0x36/0x76
> > [   65.272135]  [<ffffffff8101f000>] ? smp_apic_timer_interrupt+0x86/0x94
> > [   65.272135]  [<ffffffff81009493>] ? apic_timer_interrupt+0x13/0x20
> > [   65.272135]  <EOI>
> > [   65.272135]  [<ffffffff8114c1ea>] ? cap_sk_getsecid+0x0/0x1
> > [   65.272135]  [<ffffffff812a0b0f>] ? __xfrm_lookup+0x2/0xb04
> > [   65.272135]  [<ffffffff81264b8d>] ? ip_route_output_flow+0x77/0x1cc
> > [   65.272135]  [<ffffffff812887c6>] ? udp_sendmsg+0x32d/0x5f3
> > [   65.272135]  [<ffffffff81009400>] ? irq_entries_start+0x3c0/0x400
> > [   65.272135]  [<ffffffff8128e112>] ? inet_sendmsg+0x53/0x58
> > [   65.272135]  [<ffffffff8123388d>] ? sock_sendmsg+0x83/0x9b
> > [   65.272135]  [<ffffffff8103a042>] ? pick_next_task_fair+0xca/0xd6
> > [   65.272135]  [<ffffffff812ef74a>] ? schedule+0x52b/0x593
> > [   65.272135]  [<ffffffff8100948e>] ? apic_timer_interrupt+0xe/0x20
> > [   65.272135]  [<ffffffff8100948e>] ? apic_timer_interrupt+0xe/0x20
> > [   65.272135]  [<ffffffff81185d4b>] ? _copy_from_user+0x1b/0x30
> > [   65.272135]  [<ffffffff8100765f>] ? __switch_to+0x1b5/0x2a6
> > [   65.272135]  [<ffffffff81232227>] ? copy_from_user+0x13/0x25
> > [   65.272135]  [<ffffffff812353cb>] ? sys_sendto+0xd7/0x117
> > [   65.272135]  [<ffffffff8103f7b1>] ? finish_task_switch+0x34/0xa1
> > [   65.272135]  [<ffffffff812ef74a>] ? schedule+0x52b/0x593
> > [   65.272135]  [<ffffffff8100948e>] ? apic_timer_interrupt+0xe/0x20
> > [   65.272135]  [<ffffffff81008ac2>] ? system_call_fastpath+0x16/0x1b
> > [   65.272135] Code: 83 c3 08 48 83 3b 00 eb ec 48 83 fd 10 0f 86 84
> > 00 00 00 48 89 ef e8 f4 e2 ff ff 48 89 c7 48 8b 00 84 c0 78 13 66 a9
> > 00 c0 75 04 <0f> 0b eb fe 5b 5d 41 5c e9 fb 57 fd ff 48 8b 4c 24 18 4c
> > 8b 47
> > [   65.272135] RIP  [<ffffffff810e0d6b>] kfree+0x55/0xc6
> > [   65.272135]  RSP <ffff880001a23d90>
> > [   65.385803] ---[ end trace 42d2fb5b94980ab5 ]---
> > [   65.386337] Kernel panic - not syncing: Fatal exception in interrupt
> > [   65.386943] Pid: 1518, comm: udp-perf Tainted: G      D    2.6.34-rc5 #1
> > [   65.387557] Call Trace:
> > [   65.388011]  <IRQ>  [<ffffffff812eef9b>] ? panic+0x77/0xf7
> > [   65.388729]  [<ffffffff81046b88>] ? kmsg_dump+0xa6/0x13e
> > [   65.389292]  [<ffffffff8100c8c2>] ? oops_end+0xa7/0xb4
> > [   65.389871]  [<ffffffff8123a77f>] ? __kfree_skb+0x11/0x7d
> > [   65.390441]  [<ffffffff8100a695>] ? do_invalid_op+0x8b/0x95
> > [   65.391032]  [<ffffffff810e0d6b>] ? kfree+0x55/0xc6
> > [   65.391587]  [<ffffffffa026f301>] ?
> > br_nf_pre_routing_finish+0x0/0x25e [bridge]
> > [   65.392512]  [<ffffffffa026f301>] ?
> > br_nf_pre_routing_finish+0x0/0x25e [bridge]
> > [   65.393399]  [<ffffffff8100975b>] ? invalid_op+0x1b/0x20
> > [   65.393978]  [<ffffffff8123a77f>] ? __kfree_skb+0x11/0x7d
> > [   65.394547]  [<ffffffff810e0d6b>] ? kfree+0x55/0xc6
> > [   65.395091]  [<ffffffff8123a77f>] ? __kfree_skb+0x11/0x7d
> > [   65.395668]  [<ffffffff81241e01>] ? netif_rx+0xe2/0xee
> > [   65.396438]  [<ffffffffa028057d>] ? veth_xmit+0x6e/0xad [veth]
> > [   65.397026]  [<ffffffff8124301f>] ? dev_hard_start_xmit+0x221/0x301
> > [   65.397640]  [<ffffffff81256d9a>] ? sch_direct_xmit+0x5b/0x15d
> > [   65.398222]  [<ffffffff81256f55>] ? __qdisc_run+0xb9/0xd8
> > [   65.398788]  [<ffffffff81240511>] ? net_tx_action+0xd6/0x149
> > [   65.399365]  [<ffffffff8104ba02>] ? __do_softirq+0xdd/0x19f
> > [   65.399935]  [<ffffffff8101e515>] ? lapic_next_event+0x18/0x1d
> > [   65.400560]  [<ffffffff810099dc>] ? call_softirq+0x1c/0x30
> > [   65.401125]  [<ffffffff8100b863>] ? do_softirq+0x3f/0x79
> > [   65.401708]  [<ffffffff8104b88e>] ? irq_exit+0x36/0x76
> > [   65.402267]  [<ffffffff8101f000>] ? smp_apic_timer_interrupt+0x86/0x94
> > [   65.402869]  [<ffffffff81009493>] ? apic_timer_interrupt+0x13/0x20
> > [   65.403463]  <EOI>  [<ffffffff8114c1ea>] ? cap_sk_getsecid+0x0/0x1
> > [   65.404210]  [<ffffffff812a0b0f>] ? __xfrm_lookup+0x2/0xb04
> > [   65.404779]  [<ffffffff81264b8d>] ? ip_route_output_flow+0x77/0x1cc
> > [   65.405375]  [<ffffffff812887c6>] ? udp_sendmsg+0x32d/0x5f3
> > [   65.405945]  [<ffffffff81009400>] ? irq_entries_start+0x3c0/0x400
> > [   65.406534]  [<ffffffff8128e112>] ? inet_sendmsg+0x53/0x58
> > [   65.407102]  [<ffffffff8123388d>] ? sock_sendmsg+0x83/0x9b
> > [   65.407674]  [<ffffffff8103a042>] ? pick_next_task_fair+0xca/0xd6
> > [   65.408296]  [<ffffffff812ef74a>] ? schedule+0x52b/0x593
> > [   65.408854]  [<ffffffff8100948e>] ? apic_timer_interrupt+0xe/0x20
> > [   65.409468]  [<ffffffff8100948e>] ? apic_timer_interrupt+0xe/0x20
> > [   65.410084]  [<ffffffff81185d4b>] ? _copy_from_user+0x1b/0x30
> > [   65.410667]  [<ffffffff8100765f>] ? __switch_to+0x1b5/0x2a6
> > [   65.510031]  [<ffffffff81232227>] ? copy_from_user+0x13/0x25
> > [   65.510818]  [<ffffffff812353cb>] ? sys_sendto+0xd7/0x117
> > [   65.511608]  [<ffffffff8103f7b1>] ? finish_task_switch+0x34/0xa1
> > [   65.512465]  [<ffffffff812ef74a>] ? schedule+0x52b/0x593
> > [   65.513229]  [<ffffffff8100948e>] ? apic_timer_interrupt+0xe/0x20
> > [   65.514040]  [<ffffffff81008ac2>] ? system_call_fastpath+0x16/0x1b
> > 
> > [ 1438.042568] ------------[ cut here ]------------
> > [ 1438.043170] kernel BUG at mm/slub.c:2846!
> > [ 1438.043711] invalid opcode: 0000 [#1] SMP
> > [ 1438.044531] last sysfs file: /sys/devices/virtual/net/lo/operstate
> > [ 1438.045148] CPU 1
> > [ 1438.045350] Modules linked in: sch_netem veth bridge stp netconsole
> > configfs loop parport_pc tpm_tis tpm snd_pcm psmouse parport tpm_bios
> > snd_timer snd soundcore snd_page_alloc evdev pcspkr serio_raw
> > i2c_piix4 i2c_core button processor ext3 jbd mbcache ide_cd_mod cdrom
> > ide_gd_mod ata_generic ata_piix 8139too libata scsi_mod floppy 8139cp
> > mii thermal thermal_sys piix ide_core [last unloaded: scsi_wait_scan]
> > [ 1438.046215]
> > [ 1438.046215] Pid: 1476, comm: udp-perf Not tainted 2.6.34-rc5 #1 /
> > [ 1438.046215] RIP: 0010:[<ffffffff810e0d6b>]  [<ffffffff810e0d6b>]
> > kfree+0x55/0xc6
> > [ 1438.046215] RSP: 0018:ffff880001a23d90  EFLAGS: 00010246
> > [ 1438.046215] RAX: 0100000000000000 RBX: ffff88007d882200 RCX: 0000000000012850
> > [ 1438.046215] RDX: ffff88007d882200 RSI: 000000000000000e RDI: ffffea0001b972a0
> > [ 1438.046215] RBP: ffff88007e20c000 R08: ffff88007d882200 R09: ffffffffa026c690
> > [ 1438.046215] R10: 0000000000000002 R11: ffff88007d882100 R12: ffffffff8123a77f
> > [ 1438.046215] R13: 0000000000000032 R14: ffff8800378e0700 R15: ffff88007d882200
> > [ 1438.046215] FS:  00007f1c1d07f6f0(0000) GS:ffff880001a20000(0000)
> > knlGS:0000000000000000
> > [ 1438.046215] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> > [ 1438.046215] CR2: 00007f6ea9e07310 CR3: 0000000037834000 CR4: 00000000000006e0
> > [ 1438.046215] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> > [ 1438.046215] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> > [ 1438.046215] Process udp-perf (pid: 1476, threadinfo
> > ffff88007db0e000, task ffff8800379a2350)
> > [ 1438.046215] Stack:
> > [ 1438.046215]  ffff88007d882200 ffff88007d882200 0000000000000246
> > ffffffff8123a77f
> > [ 1438.046215] <0> ffff880001a32860 ffffffff81241e01 ffffe8ffffa3f430
> > ffffe8ffffa3d180
> > [ 1438.046215] <0> ffff88007d882200 ffffffffa026c57d 0000000201a30500
> > ffff88007d171200
> > [ 1438.046215] Call Trace:
> > [ 1438.046215]  <IRQ>
> > [ 1438.046215]  [<ffffffff8123a77f>] ? __kfree_skb+0x11/0x7d
> > [ 1438.046215]  [<ffffffff81241e01>] ? netif_rx+0xe2/0xee
> > [ 1438.046215]  [<ffffffffa026c57d>] ? veth_xmit+0x6e/0xad [veth]
> > [ 1438.046215]  [<ffffffff8124301f>] ? dev_hard_start_xmit+0x221/0x301
> > [ 1438.046215]  [<ffffffff81256d9a>] ? sch_direct_xmit+0x5b/0x15d
> > [ 1438.046215]  [<ffffffff81256f55>] ? __qdisc_run+0xb9/0xd8
> > [ 1438.046215]  [<ffffffff81240511>] ? net_tx_action+0xd6/0x149
> > [ 1438.046215]  [<ffffffff8104ba02>] ? __do_softirq+0xdd/0x19f
> > [ 1438.046215]  [<ffffffff8101e515>] ? lapic_next_event+0x18/0x1d
> > [ 1438.046215]  [<ffffffff810099dc>] ? call_softirq+0x1c/0x30
> > [ 1438.046215]  [<ffffffff8100b863>] ? do_softirq+0x3f/0x79
> > [ 1438.046215]  [<ffffffff8104b88e>] ? irq_exit+0x36/0x76
> > [ 1438.046215]  [<ffffffff8101f000>] ? smp_apic_timer_interrupt+0x86/0x94
> > [ 1438.046215]  [<ffffffff81009493>] ? apic_timer_interrupt+0x13/0x20
> > [ 1438.046215]  <EOI>
> > [ 1438.046215]  [<ffffffff81288499>] ? udp_sendmsg+0x0/0x5f3
> > [ 1438.046215]  [<ffffffff8128850f>] ? udp_sendmsg+0x76/0x5f3
> > [ 1438.046215]  [<ffffffff812889a7>] ? udp_sendmsg+0x50e/0x5f3
> > [ 1438.046215]  [<ffffffff8123388d>] ? sock_sendmsg+0x83/0x9b
> > [ 1438.046215]  [<ffffffff8103a042>] ? pick_next_task_fair+0xca/0xd6
> > [ 1438.046215]  [<ffffffff812ef74a>] ? schedule+0x52b/0x593
> > [ 1438.046215]  [<ffffffff8100948e>] ? apic_timer_interrupt+0xe/0x20
> > [ 1438.046215]  [<ffffffff8100948e>] ? apic_timer_interrupt+0xe/0x20
> > [ 1438.046215]  [<ffffffff8100948e>] ? apic_timer_interrupt+0xe/0x20
> > [ 1438.046215]  [<ffffffff810e93ed>] ? fget_light+0x0/0xa1
> > [ 1438.046215]  [<ffffffff81185e40>] ? copy_user_generic_string+0x30/0x40
> > [ 1438.046215]  [<ffffffff81232227>] ? copy_from_user+0x13/0x25
> > [ 1438.046215]  [<ffffffff812353cb>] ? sys_sendto+0xd7/0x117
> > [ 1438.046215]  [<ffffffff8103f7b1>] ? finish_task_switch+0x34/0xa1
> > [ 1438.046215]  [<ffffffff812ef74a>] ? schedule+0x52b/0x593
> > [ 1438.046215]  [<ffffffff8100948e>] ? apic_timer_interrupt+0xe/0x20
> > [ 1438.046215]  [<ffffffff81008ac2>] ? system_call_fastpath+0x16/0x1b
> > [ 1438.046215] Code: 83 c3 08 48 83 3b 00 eb ec 48 83 fd 10 0f 86 84
> > 00 00 00 48 89 ef e8 f4 e2 ff ff 48 89 c7 48 8b 00 84 c0 78 13 66 a9
> > 00 c0 75 04 <0f> 0b eb fe 5b 5d 41 5c e9 fb 57 fd ff 48 8b 4c 24 18 4c
> > 8b 47
> > [ 1438.046215] RIP  [<ffffffff810e0d6b>] kfree+0x55/0xc6
> > [ 1438.046215]  RSP <ffff880001a23d90>
> > [ 1438.102706] ---[ end trace ab36062dcf233d6a ]---
> > [ 1438.103251] Kernel panic - not syncing: Fatal exception in interrupt
> > [ 1438.103912] Pid: 1476, comm: udp-perf Tainted: G      D    2.6.34-rc5 #1
> > [ 1438.104563] Call Trace:
> > [ 1438.105017]  <IRQ>  [<ffffffff812eef9b>] ? panic+0x77/0xf7
> > [ 1438.105718]  [<ffffffff81046b88>] ? kmsg_dump+0xa6/0x13e
> > [ 1438.106293]  [<ffffffff8100c8c2>] ? oops_end+0xa7/0xb4
> > [ 1438.106866]  [<ffffffff8123a77f>] ? __kfree_skb+0x11/0x7d
> > [ 1438.107451]  [<ffffffff8100a695>] ? do_invalid_op+0x8b/0x95
> > [ 1438.108036]  [<ffffffff810e0d6b>] ? kfree+0x55/0xc6
> > [ 1438.108799]  [<ffffffffa025b301>] ?
> > br_nf_pre_routing_finish+0x0/0x25e [bridge]
> > [ 1438.109699]  [<ffffffffa025b301>] ?
> > br_nf_pre_routing_finish+0x0/0x25e [bridge]
> > [ 1438.110604]  [<ffffffff8100975b>] ? invalid_op+0x1b/0x20
> > [ 1438.111172]  [<ffffffff8123a77f>] ? __kfree_skb+0x11/0x7d
> > [ 1438.111753]  [<ffffffff810e0d6b>] ? kfree+0x55/0xc6
> > [ 1438.112359]  [<ffffffff8123a77f>] ? __kfree_skb+0x11/0x7d
> > [ 1438.112952]  [<ffffffff81241e01>] ? netif_rx+0xe2/0xee
> > [ 1438.113527]  [<ffffffffa026c57d>] ? veth_xmit+0x6e/0xad [veth]
> > [ 1438.114123]  [<ffffffff8124301f>] ? dev_hard_start_xmit+0x221/0x301
> > [ 1438.114734]  [<ffffffff81256d9a>] ? sch_direct_xmit+0x5b/0x15d
> > [ 1438.115334]  [<ffffffff81256f55>] ? __qdisc_run+0xb9/0xd8
> > [ 1438.115912]  [<ffffffff81240511>] ? net_tx_action+0xd6/0x149
> > [ 1438.116544]  [<ffffffff8104ba02>] ? __do_softirq+0xdd/0x19f
> > [ 1438.117128]  [<ffffffff8101e515>] ? lapic_next_event+0x18/0x1d
> > [ 1438.117732]  [<ffffffff810099dc>] ? call_softirq+0x1c/0x30
> > [ 1438.118420]  [<ffffffff8100b863>] ? do_softirq+0x3f/0x79
> > [ 1438.119066]  [<ffffffff8104b88e>] ? irq_exit+0x36/0x76
> > [ 1438.147879]  [<ffffffff8101f000>] ? smp_apic_timer_interrupt+0x86/0x94
> > [ 1438.148615]  [<ffffffff81009493>] ? apic_timer_interrupt+0x13/0x20
> > [ 1438.149294]  <EOI>  [<ffffffff81288499>] ? udp_sendmsg+0x0/0x5f3
> > [ 1438.150101]  [<ffffffff8128850f>] ? udp_sendmsg+0x76/0x5f3
> > [ 1438.150681]  [<ffffffff812889a7>] ? udp_sendmsg+0x50e/0x5f3
> > [ 1438.151276]  [<ffffffff8123388d>] ? sock_sendmsg+0x83/0x9b
> > [ 1438.151853]  [<ffffffff8103a042>] ? pick_next_task_fair+0xca/0xd6
> > [ 1438.152508]  [<ffffffff812ef74a>] ? schedule+0x52b/0x593
> > [ 1438.153078]  [<ffffffff8100948e>] ? apic_timer_interrupt+0xe/0x20
> > [ 1438.153687]  [<ffffffff8100948e>] ? apic_timer_interrupt+0xe/0x20
> > [ 1438.154295]  [<ffffffff8100948e>] ? apic_timer_interrupt+0xe/0x20
> > [ 1438.154902]  [<ffffffff810e93ed>] ? fget_light+0x0/0xa1
> > [ 1438.155477]  [<ffffffff81185e40>] ? copy_user_generic_string+0x30/0x40
> > [ 1438.156097]  [<ffffffff81232227>] ? copy_from_user+0x13/0x25
> > [ 1438.156720]  [<ffffffff812353cb>] ? sys_sendto+0xd7/0x117
> > [ 1438.157376]  [<ffffffff8103f7b1>] ? finish_task_switch+0x34/0xa1
> > [ 1438.157970]  [<ffffffff812ef74a>] ? schedule+0x52b/0x593
> > [ 1438.158557]  [<ffffffff8100948e>] ? apic_timer_interrupt+0xe/0x20
> > [ 1438.159153]  [<ffffffff81008ac2>] ? system_call_fastpath+0x16/0x1b
> > 
> > ------------
> > 
> > Finally, a panic in 2.6.33. Note that the line in which BUG is
> > triggered is different.
> > 
> > [  102.442815] ------------[ cut here ]------------
> > [  102.443433] kernel BUG at
> > /build/mattems-linux-2.6_2.6.33-1~experimental.4-amd64-ieqSsa/linux-2.6-2.6.33-1~experimental.4/debian/build/source_amd64_none/mm/slub.c:2969!
> > [  102.444874] invalid opcode: 0000 [#1] SMP
> > [  102.444958] last sysfs file: /sys/devices/virtual/net/lo/operstate
> > [  102.444958] CPU 0
> > [  102.444958] Pid: 4, comm: ksoftirqd/0 Not tainted 2.6.33-2-amd64 #1 /
> > [  102.444958] RIP: 0010:[<ffffffff810e1e2c>]  [<ffffffff810e1e2c>]
> > kfree+0x55/0xcb
> > [  102.444958] RSP: 0018:ffff880001a03df8  EFLAGS: 00010246
> > [  102.444958] RAX: 0100000000000000 RBX: ffff88007e439000 RCX: 0000000000012d70
> > [  102.444958] RDX: 000000000000006a RSI: ffffea0001b76a70 RDI: ffffea0000c1a328
> > [  102.444958] RBP: ffff880037533c00 R08: ffff88007e21e500 R09: ffffffff8162bbe0
> > [  102.444958] R10: 000000037e439e00 R11: ffff88007e439e00 R12: ffffffff81239cf2
> > [  102.444958] R13: 000000000000006a R14: ffff88007efb3000 R15: ffff88007e21e500
> > [  102.444958] FS:  0000000000000000(0000) GS:ffff880001a00000(0000)
> > knlGS:0000000000000000
> > [  102.444958] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> > [  102.444958] CR2: 00007fc8e25cd0f0 CR3: 000000007d088000 CR4: 00000000000006f0
> > [  102.444958] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> > [  102.444958] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> > [  102.444958] Process ksoftirqd/0 (pid: 4, threadinfo
> > ffff88007fb78000, task ffff88007fb61500)
> > [  102.444958] Stack:
> > [  102.444958]  ffff88007e439000 ffff88007e439000 0000000000000246
> > ffffffff81239cf2
> > [  102.444958] <0> ffff880001a12d80 ffffffff8124125d ffffe8ffff818190
> > ffffe8ffff815ee0
> > [  102.444958] <0> ffff88007e439000 ffffffffa025254b ffff880001a10010
> > ffff88007ef95200
> > [  102.444958] Call Trace:
> > [  102.444958]  <IRQ>
> > [  102.444958]  [<ffffffff81239cf2>] ? __kfree_skb+0x11/0x7d
> > [  102.444958]  [<ffffffff8124125d>] ? netif_rx+0xe2/0xee
> > [  102.444958]  [<ffffffffa025254b>] ? veth_xmit+0x6e/0xad [veth]
> > [  102.444958]  [<ffffffff8124243a>] ? dev_hard_start_xmit+0x221/0x2dc
> > [  102.444958]  [<ffffffff81255514>] ? sch_direct_xmit+0x5b/0x15d
> > [  102.444958]  [<ffffffff812556cf>] ? __qdisc_run+0xb9/0xda
> > [  102.444958]  [<ffffffff8123fa99>] ? net_tx_action+0xd6/0x149
> > [  102.444958]  [<ffffffff8104c918>] ? __do_softirq+0xdd/0x1a1
> > [  102.444958]  [<ffffffff810099dc>] ? call_softirq+0x1c/0x30
> > [  102.444958]  <EOI>
> > [  102.444958]  [<ffffffff8100b85f>] ? do_softirq+0x3f/0x79
> > [  102.444958]  [<ffffffff8104c504>] ? run_ksoftirqd+0x6a/0x118
> > [  102.444958]  [<ffffffff8104c49a>] ? run_ksoftirqd+0x0/0x118
> > [  102.444958]  [<ffffffff8105ee99>] ? kthread+0x79/0x81
> > [  102.444958]  [<ffffffff810098e4>] ? kernel_thread_helper+0x4/0x10
> > [  102.444958]  [<ffffffff8105ee20>] ? kthread+0x0/0x81
> > [  102.444958]  [<ffffffff810098e0>] ? kernel_thread_helper+0x0/0x10
> > [  102.444958] Code: 83 c3 08 48 83 3b 00 eb ec 48 83 fd 10 0f 86 89
> > 00 00 00 48 89 ef e8 f0 e6 ff ff 48 89 c7 48 8b 00 84 c0 78 13 66 a9
> > 00 c0 75 04 <0f> 0b eb fe 5b 5d 41 5c e9 45 5a fd ff 48 8b 4c 24 18 4c
> > 8b 4f
> > [  102.444958] RIP  [<ffffffff810e1e2c>] kfree+0x55/0xcb
> > [  102.444958]  RSP <ffff880001a03df8>
> > [  102.484000] ---[ end trace b1157390d40df1cb ]---
> > [  102.485018] Kernel panic - not syncing: Fatal exception in interrupt
> > [  102.485647] Pid: 4, comm: ksoftirqd/0 Tainted: G      D    2.6.33-2-amd64 #1
> > [  102.486630] Call Trace:
> > [  102.487112]  <IRQ>  [<ffffffff812ec605>] ? panic+0x86/0x14b
> > [  102.487870]  [<ffffffff8104c799>] ? irq_exit+0x48/0x76
> > [  102.488474]  [<ffffffff812ee893>] ? ret_from_intr+0x0/0x11
> > [  102.489068]  [<ffffffff810478c0>] ? kmsg_dump+0xa6/0x13e
> > [  102.489661]  [<ffffffff8100c89a>] ? oops_end+0xa7/0xb4
> > [  102.490245]  [<ffffffff81239cf2>] ? __kfree_skb+0x11/0x7d
> > [  102.490836]  [<ffffffff8100a690>] ? do_invalid_op+0x8b/0x95
> > [  102.491436]  [<ffffffff810e1e2c>] ? kfree+0x55/0xcb
> > [  102.492020]  [<ffffffffa0244aee>] ?
> > br_nf_pre_routing_finish+0x284/0x2a6 [bridge]
> > [  102.492942]  [<ffffffffa024486a>] ?
> > br_nf_pre_routing_finish+0x0/0x2a6 [bridge]
> > [  102.493857]  [<ffffffff8125f204>] ? nf_hook_slow+0x62/0xc3
> > [  102.523082]  [<ffffffffa024486a>] ?
> > br_nf_pre_routing_finish+0x0/0x2a6 [bridge]
> > [  102.524008]  [<ffffffff8100975b>] ? invalid_op+0x1b/0x20
> > [  102.524631]  [<ffffffff81239cf2>] ? __kfree_skb+0x11/0x7d
> > [  102.525225]  [<ffffffff810e1e2c>] ? kfree+0x55/0xcb
> > [  102.525795]  [<ffffffff810e1e1c>] ? kfree+0x45/0xcb
> > [  102.526402]  [<ffffffff81239cf2>] ? __kfree_skb+0x11/0x7d
> > [  102.526992]  [<ffffffff8124125d>] ? netif_rx+0xe2/0xee
> > [  102.527576]  [<ffffffffa025254b>] ? veth_xmit+0x6e/0xad [veth]
> > [  102.528199]  [<ffffffff8124243a>] ? dev_hard_start_xmit+0x221/0x2dc
> > [  102.528821]  [<ffffffff81255514>] ? sch_direct_xmit+0x5b/0x15d
> > [  102.529429]  [<ffffffff812556cf>] ? __qdisc_run+0xb9/0xda
> > [  102.530018]  [<ffffffff8123fa99>] ? net_tx_action+0xd6/0x149
> > [  102.530618]  [<ffffffff8104c918>] ? __do_softirq+0xdd/0x1a1
> > [  102.531214]  [<ffffffff810099dc>] ? call_softirq+0x1c/0x30
> > [  102.531804]  <EOI>  [<ffffffff8100b85f>] ? do_softirq+0x3f/0x79
> > [  102.532572]  [<ffffffff8104c504>] ? run_ksoftirqd+0x6a/0x118
> > [  102.533169]  [<ffffffff8104c49a>] ? run_ksoftirqd+0x0/0x118
> > [  102.533764]  [<ffffffff8105ee99>] ? kthread+0x79/0x81
> > [  102.534340]  [<ffffffff810098e4>] ? kernel_thread_helper+0x4/0x10
> > [  102.534954]  [<ffffffff8105ee20>] ? kthread+0x0/0x81
> > [  102.535526]  [<ffffffff810098e0>] ? kernel_thread_helper+0x0/0x10
> > 
> 
> Could you please try following patch ?
> 
> Thanks
> 
> [PATCH] veth: Dont kfree_skb() after dev_forward_skb()
> 
> In case of congestion, dev_forward_skb() already frees the skb.
> 
> Reported-by: Martín Ferrari <martin.ferrari@...il.com>
> Signed-off-by: Eric Dumazet <eric.dumazet@...il.com>
> ---
> diff --git a/drivers/net/veth.c b/drivers/net/veth.c
> index f9f0730..5ec542d 100644
> --- a/drivers/net/veth.c
> +++ b/drivers/net/veth.c
> @@ -187,7 +187,6 @@ tx_drop:
>  	return NETDEV_TX_OK;
>  
>  rx_drop:
> -	kfree_skb(skb);
>  	rcv_stats->rx_dropped++;
>  	return NETDEV_TX_OK;
>  }
> 

Hmm, scratch that one, I'll resubmit a proper fix in a few minutes

(We must change dev_forward_skb() too)
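
To illustrate the ownership problem behind the double free, here is a minimal userspace sketch in C. It is not kernel code and not the resubmitted fix: `fake_skb`, `fake_kfree_skb()`, `fake_forward_skb()`, `xmit_buggy()` and `xmit_fixed()` are hypothetical names invented for this model. It assumes one possible contract, namely that the forwarding helper releases the buffer itself whenever it reports a drop, so the caller must only account for the drop and never free again.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical userspace stand-in for struct sk_buff; not kernel code. */
struct fake_skb {
	int freed;   /* how many times the buffer was released */
	int queued;  /* set once the buffer was handed to the receiver */
};

/* Stand-in for kfree_skb(): it only counts releases, so a double free
 * shows up as freed == 2 instead of corrupting the allocator. */
static void fake_kfree_skb(struct fake_skb *skb)
{
	assert(skb != NULL);
	skb->freed++;
}

/* Assumed contract for this sketch: the helper consumes the buffer on
 * every path. On congestion it releases the buffer and reports a drop. */
static int fake_forward_skb(struct fake_skb *skb, int congested)
{
	if (congested) {
		fake_kfree_skb(skb);
		return -1;          /* dropped, buffer already released */
	}
	skb->queued = 1;            /* receiver now owns the buffer */
	return 0;
}

/* Caller that frees again on the drop path, like the rx_drop label
 * before the patch. Returns the number of releases. */
static int xmit_buggy(int congested)
{
	struct fake_skb skb = { 0, 0 };

	if (fake_forward_skb(&skb, congested) != 0)
		fake_kfree_skb(&skb);   /* second release of the same buffer */
	return skb.freed;
}

/* Caller that only accounts for the drop and leaves the release to
 * the helper. Returns the number of releases. */
static int xmit_fixed(int congested)
{
	struct fake_skb skb = { 0, 0 };
	int rx_dropped = 0;

	if (fake_forward_skb(&skb, congested) != 0)
		rx_dropped++;
	(void)rx_dropped;
	return skb.freed;
}
```

Under congestion, `xmit_buggy(1)` counts two releases of the same buffer while `xmit_fixed(1)` counts one; without congestion neither caller releases anything. The model only holds if the helper frees the buffer on every path where it reports a drop, which is presumably why dev_forward_skb() needs changing as well.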



