[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <6692b7d912e3e_8f74d2945a@dwillia2-xfh.jf.intel.com.notmuch>
Date: Sat, 13 Jul 2024 10:22:33 -0700
From: Dan Williams <dan.j.williams@...el.com>
To: Tetsuo Handa <penguin-kernel@...ove.sakura.ne.jp>, Dan Williams
<dan.j.williams@...el.com>, <gregkh@...uxfoundation.org>
CC: <syzbot+4762dd74e32532cda5ff@...kaller.appspotmail.com>,
<stable@...r.kernel.org>, Ashish Sangwan <a.sangwan@...sung.com>, Namjae Jeon
<namjae.jeon@...sung.com>, Dirk Behme <dirk.behme@...bosch.com>, "Rafael J.
Wysocki" <rafael@...nel.org>, <linux-kernel@...r.kernel.org>,
<linux-cxl@...r.kernel.org>
Subject: Re: [PATCH] driver core: Fix uevent_show() vs driver detach race
Tetsuo Handa wrote:
> On 2024/07/13 8:49, Dan Williams wrote:
> >>> + /* Synchronize with dev_uevent() */
> >>> + synchronize_rcu();
> >>> +
> >>
> >> this synchronize_rcu(), in order to make sure that
> >> READ_ONCE(dev->driver) in dev_uevent() observes NULL?
> >
> > No, this synchronize_rcu() is to make sure that if dev_uevent() wins the
> > race and observes that dev->driver is not NULL that it is still safe to
> > dereference that result because the 'struct device_driver' object is
> > still live.
>
> I can't catch what the pair of rcu_read_lock()/rcu_read_unlock() in dev_uevent()
> and synchronize_rcu() in module_remove_driver() is for.
It is to extend the lifetime of @driver if dev_uevent() observes
non-NULL @dev->driver.
> I think that the below race is possible.
> Please explain how "/* Synchronize with module_remove_driver() */" works.
It is for this race:
Thread1: Thread2:
dev_uevent(...) delete_module()
driver = dev->driver; mod->exit()
if (driver) driver_unregister()
driver_detach() // <-- @dev->driver marked NULL
module_remove_driver()
free_module() // <-- @driver object destroyed
add_uevent_var(env, "DRIVER=%s", driver->name); // <-- use after free of @driver
If driver_detach() happens before Thread1 reads dev->driver then there
is no use after free risk.
The previous attempt to fix this held the device_lock() over
dev_uevent() which prevents driver_detach() from even starting, but that
causes lockdep issues and is even more heavy-handed than the
synchronize_rcu() delay. RCU makes sure that @driver stays alive between
reading @dev->driver and reading @driver->name.
Powered by blists - more mailing lists