[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20220423160723.GA20330@wunner.de>
Date: Sat, 23 Apr 2022 18:07:23 +0200
From: Lukas Wunner <lukas@...ner.de>
To: Paolo Abeni <pabeni@...hat.com>
Cc: Oliver Neukum <oneukum@...e.com>,
"David S. Miller" <davem@...emloft.net>,
Jakub Kicinski <kuba@...nel.org>, Jann Horn <jannh@...gle.com>,
Oleksij Rempel <o.rempel@...gutronix.de>,
Eric Dumazet <edumazet@...gle.com>, netdev@...r.kernel.org,
linux-usb@...r.kernel.org, Andrew Lunn <andrew@...n.ch>,
Jacky Chou <jackychou@...x.com.tw>, Willy Tarreau <w@....eu>,
Lino Sanfilippo <LinoSanfilippo@....de>,
Philipp Rosenberger <p.rosenberger@...bus.com>,
Heiner Kallweit <hkallweit1@...il.com>,
Greg Kroah-Hartman <gregkh@...uxfoundation.org>
Subject: Re: [PATCH] net: linkwatch: ignore events for unregistered netdevs
On Thu, Apr 21, 2022 at 10:02:43AM +0200, Paolo Abeni wrote:
> On Sun, 2022-04-17 at 09:04 +0200, Lukas Wunner wrote:
> > --- a/net/core/link_watch.c
> > +++ b/net/core/link_watch.c
> > @@ -107,7 +107,8 @@ static void linkwatch_add_event(struct net_device *dev)
> > unsigned long flags;
> >
> > spin_lock_irqsave(&lweventlist_lock, flags);
> > - if (list_empty(&dev->link_watch_list)) {
> > + if (list_empty(&dev->link_watch_list) &&
> > + dev->reg_state < NETREG_UNREGISTERED) {
> > list_add_tail(&dev->link_watch_list, &lweventlist);
> > dev_hold_track(dev, &dev->linkwatch_dev_tracker, GFP_ATOMIC);
> >
>
> What about testing dev->reg_state in linkwatch_fire_event() before
> setting the __LINK_STATE_LINKWATCH_PENDING bit, so that we don't leave
> the device in an unexpected state?
That would be racy because linkwatch_fire_event() may see a reg_state of
REGISTERED or UNREGISTERING and then add the device to link_watch_list,
even though reg_state may be changed to UNREGISTERED in-between.
That race is avoided by performing the reg_state check under
lweventlist_lock:
Scenario 1:
CPU 1: CPU 2:
linkwatch_add_event(dev);
dev->reg_state = NETREG_UNREGISTERED;
linkwatch_forget_dev(dev);
In this scenario, CPU 2 sees the old value of dev->reg_state and
adds the device to link_watch_list, but CPU 1 will subsequently
delete it from the list.
Scenario 2:
CPU 1: CPU 2:
dev->reg_state = NETREG_UNREGISTERED;
linkwatch_forget_dev(dev);
linkwatch_add_event(dev);
In this scenario, CPU 2 refrains from adding the device to
link_watch_list. It is guaranteed to see the new reg_state
due to the memory barriers implied by lweventlist_lock,
which is taken both by linkwatch_forget_dev() and
linkwatch_add_event().
Note that an unregistered netdev has been stopped, so the portion
of linkwatch_do_dev() which is constrained to the netdev being IFF_UP
is skipped. The only portion that's executed is rfc2863_policy(),
which updates the operstate.
I believe that operstate changes are irrelevant and unnecessary after
the netdev has been unregistered.
Same for the fact that __LINK_STATE_LINKWATCH_PENDING may be set even
though the netdev is not on link_watch_list. That should be irrelevant
for an unregistered netdev.
> Other than that, it looks good to me, but potentially quite risky.
To mitigate risk I suggest letting the patch bake in linux-next
for a couple of weeks.
However I would then have to respin it because the declaration of
linkwatch_run_queue() was moved from include/linux/netdevice.h to
net/core/dev.h by Jakub's net-next commit 6264f58ca0e5 ("net:
extract a few internals from netdevice.h").
Let me know if you want me to respin the patch based on net-next.
> Looking at the original report it looks like the issue could be
> resolved with a more usb-specific change: e.g. it looks like
> usbnet_defer_kevent() is not acquiring a dev reference as it should.
>
> Have you considered that path?
First of all, the diffstat of the patch shows this is an opportunity
to reduce LoC as well as simplify and speed up device teardown.
Second, the approach you're proposing won't work if a driver calls
netif_carrier_on/off() after unregister_netdev().
It seems prudent to prevent such a misbehavior in *any* driver,
not just usbnet. usbnet may not be the only one doing it wrong.
Jann pointed out that there are more syzbot reports related
to a UAF in linkwatch:
https://lore.kernel.org/netdev/?q=__linkwatch_run_queue+syzbot
Third, I think an API which schedules work, invisibly to the driver,
is dangerous and misguided. If it is illegal to call
netif_carrier_on/off() for an unregistered but not yet freed netdev,
catch that in core networking code and don't expect drivers to respect
a rule which isn't even documented.
Thanks,
Lukas
Powered by blists - more mailing lists