[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20120910131603.GA29742@hmsreliant.think-freely.org>
Date: Mon, 10 Sep 2012 09:16:03 -0400
From: Neil Horman <nhorman@...driver.com>
To: "Srivatsa S. Bhat" <srivatsa.bhat@...ux.vnet.ibm.com>
Cc: Gao feng <gaofeng@...fujitsu.com>, eric.dumazet@...il.com,
davem@...emloft.net, linux-kernel@...r.kernel.org,
netdev@...r.kernel.org, mark.d.rustad@...el.com,
john.r.fastabend@...el.com, lizefan@...wei.com
Subject: Re: [PATCH] net, cgroup: Fix boot failure due to iteration of
uninitialized list
On Mon, Sep 10, 2012 at 02:59:18PM +0530, Srivatsa S. Bhat wrote:
> On 07/23/2012 05:10 PM, Neil Horman wrote:
> > On Mon, Jul 23, 2012 at 09:15:05AM +0800, Gao feng wrote:
> >> 于 2012年07月20日 00:27, Srivatsa S. Bhat 写道:
> >>> After commit ef209f15 (net: cgroup: fix access the unallocated memory in
> >>> netprio cgroup), boot fails with the following NULL pointer dereference:
> >>>
> [...]
> >>> Call Trace:
> >>> [<ffffffff81b1cb78>] cgroup_init_subsys+0x83/0x169
> >>> [<ffffffff81b1ce13>] cgroup_init+0x36/0x119
> >>> [<ffffffff81affef7>] start_kernel+0x3ba/0x3ef
> >>> [<ffffffff81aff95b>] ? kernel_init+0x27b/0x27b
> >>> [<ffffffff81aff356>] x86_64_start_reservations+0x131/0x136
> >>> [<ffffffff81aff45e>] x86_64_start_kernel+0x103/0x112
> >>> RIP [<ffffffff8145e8d6>] cgrp_create+0xf6/0x190
> >>> RSP <ffffffff81a01ea8>
> >>> CR2: 0000000000000698
> >>> ---[ end trace a7919e7f17c0a725 ]---
> >>> Kernel panic - not syncing: Attempted to kill the idle task!
> >>>
> >>> The code corresponds to:
> >>>
> >>> update_netdev_tables():
> >>> for_each_netdev(&init_net, dev) {
> >>> map = rtnl_dereference(dev->priomap); <---- HERE
> >>>
> >>>
> >>> The list head is initialized in netdev_init(), which is called much
> >>> later than cgrp_create(). So the problem is that we are calling
> >>> update_netdev_tables() way too early (in cgrp_create()), which will
> >>> end up traversing the not-yet-circular linked list. So at some point,
> >>> the dev pointer will become NULL and hence dev->priomap becomes an
> >>> invalid access.
> >>>
> >>> To fix this, just remove the update_netdev_tables() function entirely,
> >>> since it appears that write_update_netdev_table() will handle things
> >>> just fine.
> >>
> >> The reason I add update_netdev_tables in cgrp_create is to avoid additional
> >> bound checkings when we accessing the dev->priomap.priomap.
> >>
> >> Eric,can we revert this commit 91c68ce2b26319248a32d7baa1226f819d283758 now?
> >> I think it's safe enough to access priomap without bound check.
> >>
> >
> > I think its probably safe, yes, but lets leave it there for just a bit. Its not
> > hurting anything, and I'd like to look into getting Srivatsa' patch in first.
>
> Hi Neil,
>
> Did you get around to look into this again?
>
I haven't looked at it specifically no, I apologize. That said I think the
other changes that went in back in that time frame have had time to soak, and
looking at the way we current update the priomap table, I think its safe for us
to remove the update_netdev_table call and definition. If you repost your
patch, I'll ack it.
Thanks!
Neil
> Regards,
> Srivatsa S. Bhat
>
>
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists