[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <20140311.131615.1099547048239011196.davem@davemloft.net>
Date: Tue, 11 Mar 2014 13:16:15 -0400 (EDT)
From: David Miller <davem@...emloft.net>
To: nhorman@...driver.com
Cc: netdev@...r.kernel.org, sbhatewara@...are.com,
pv-drivers@...are.com, stable@...r.kernel.org
Subject: Re: [PATCH v2] vmxnet3: fix netpoll race condition
From: Neil Horman <nhorman@...driver.com>
Date: Mon, 10 Mar 2014 06:55:55 -0400
> vmxnet3's netpoll driver is incorrectly coded. It directly calls
> vmxnet3_do_poll, which is the driver internal napi poll routine. As the netpoll
> controller method doesn't block real napi polls in any way, there is a potential
> for race conditions in which the netpoll controller method and the napi poll
> method run concurrently. The result is data corruption causing panics such as this
> one recently observed:
> PID: 1371 TASK: ffff88023762caa0 CPU: 1 COMMAND: "rs:main Q:Reg"
> #0 [ffff88023abd5780] machine_kexec at ffffffff81038f3b
> #1 [ffff88023abd57e0] crash_kexec at ffffffff810c5d92
> #2 [ffff88023abd58b0] oops_end at ffffffff8152b570
> #3 [ffff88023abd58e0] die at ffffffff81010e0b
> #4 [ffff88023abd5910] do_trap at ffffffff8152add4
> #5 [ffff88023abd5970] do_invalid_op at ffffffff8100cf95
> #6 [ffff88023abd5a10] invalid_op at ffffffff8100bf9b
> [exception RIP: vmxnet3_rq_rx_complete+1968]
> RIP: ffffffffa00f1e80 RSP: ffff88023abd5ac8 RFLAGS: 00010086
> RAX: 0000000000000000 RBX: ffff88023b5dcee0 RCX: 00000000000000c0
> RDX: 0000000000000000 RSI: 00000000000005f2 RDI: ffff88023b5dcee0
> RBP: ffff88023abd5b48 R8: 0000000000000000 R9: ffff88023a3b6048
> R10: 0000000000000000 R11: 0000000000000002 R12: ffff8802398d4cd8
> R13: ffff88023af35140 R14: ffff88023b60c890 R15: 0000000000000000
> ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
> #7 [ffff88023abd5b50] vmxnet3_do_poll at ffffffffa00f204a [vmxnet3]
> #8 [ffff88023abd5b80] vmxnet3_netpoll at ffffffffa00f209c [vmxnet3]
> #9 [ffff88023abd5ba0] netpoll_poll_dev at ffffffff81472bb7
>
> The fix is to do as other drivers do, and have the poll controller call the top
> half interrupt handler, which schedules a napi poll properly to recieve frames
>
> Tested by myself, successfully.
>
> Signed-off-by: Neil Horman <nhorman@...driver.com>
VMware folks, please review.
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists