[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CA+55aFw64xTR7LXjo5OfUvRwDb9UF8EcfjguAbF0C3VsUXCecg@mail.gmail.com>
Date: Mon, 14 Oct 2013 10:31:14 -0700
From: Linus Torvalds <torvalds@...ux-foundation.org>
To: Dave Jones <davej@...hat.com>,
Linux Kernel <linux-kernel@...r.kernel.org>,
Al Viro <viro@...iv.linux.org.uk>,
Davide Libenzi <davidel@...ilserver.org>,
Eric Wong <normalperson@...t.net>,
Oleg Nesterov <oleg@...hat.com>
Subject: Re: epoll oops.
On Mon, Oct 14, 2013 at 8:46 AM, Dave Jones <davej@...hat.com> wrote:
> Machine is wedged and I can't get to it until tomorrow, but this is what was on serial console.
> kernel running was from some time last Friday, I can get exact info tomorrow, though
> I don't think there's anything epoll related recently that could explain this.
It looks like it is the access to "lock->key" that takes a page fault.
The pointer looks good (%r13=ffff8801654cec98), so I'm pretty sure
this is due to DEBUG_PAGEALLOC and a free'd page.
So it looks like ep_unregister_pollwait() calls remove_wait_queue() on
a wait-queue head that has already been free'd.
I have this dim memory of us having fought this before. But maybe I'm
just remembering some of the old signalfd-vs-epoll races.
Oleg, does this trigger any memory for you? Commit 971316f0503a
("epoll: ep_unregister_pollwait() can use the freed pwq->whead") just
makes me go "Hmm, this is *exactly* that that commit is talking
about.."
Linus
---
> Oops: 0000 [#1] PREEMPT SMP DEBUG_PAGEALLOC
> CPU: 3 PID: 449 Comm: trinity-main Not tainted 3.12.0-rc4+ #98
> task: ffff88023e239560 ti: ffff880083082000 task.ti: ffff880083082000
> RIP: 0010:[<ffffffff810c9f98>] [<ffffffff810c9f98>] __lock_acquire+0x58/0x1be0
> Call Trace:
> [<ffffffff810cc2d3>] lock_acquire+0x93/0x200
> [<ffffffff81732c3b>] _raw_spin_lock_irqsave+0x4b/0x90
> [<ffffffff810848d9>] remove_wait_queue+0x19/0x40
> [<ffffffff812120eb>] ep_unregister_pollwait.isra.14+0x5b/0x1e0
> [<ffffffff81212786>] ep_remove+0x26/0x140
> [<ffffffff81213391>] eventpoll_release_file+0x71/0xa0
> [<ffffffff811c4faa>] __fput+0x2aa/0x2d0
> [<ffffffff811c501e>] ____fput+0xe/0x10
> [<ffffffff8107d67c>] task_work_run+0xac/0xe0
> [<ffffffff81056bd7>] do_exit+0x2c7/0xcc0
> [<ffffffff810589cc>] do_group_exit+0x4c/0xc0
> [<ffffffff81058a54>] SyS_exit_group+0x14/0x20
> [<ffffffff8173bf64>] tracesys+0xdd/0xe2
> Code: 85 c0 8b 05 4b d6 bc 00 45 0f 45 e0 85 c0 0f 84 07 01 00 00 8b 05 31 af 00 01 49 89 fd 41 89 f7 41 89 d3 85 c0 0f 84 08 01 00 00 <49> 8b 45 00 ba 01 00 00 00 48 3d 60 6a 13 82 44 0f 44 e2 41 83
> RIP [<ffffffff810c9f98>] __lock_acquire+0x58/0x1be0
> RSP <ffff880083083c18>
> CR2: ffff8801654cec98
> ---[ end trace 044e98c2d3aab216 ]---
>
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists