[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CAHS8izP=AuPbV6N=c05J2kJLJ16-AmRzu983khXaR91Pti=cNw@mail.gmail.com>
Date: Fri, 23 May 2025 09:45:28 -0700
From: Mina Almasry <almasrymina@...gle.com>
To: Yunsheng Lin <linyunsheng@...wei.com>
Cc: Dong Chenchen <dongchenchen2@...wei.com>, hawk@...nel.org, ilias.apalodimas@...aro.org,
davem@...emloft.net, edumazet@...gle.com, kuba@...nel.org, pabeni@...hat.com,
horms@...nel.org, netdev@...r.kernel.org, linux-kernel@...r.kernel.org,
zhangchangzhong@...wei.com,
syzbot+204a4382fcb3311f3858@...kaller.appspotmail.com
Subject: Re: [PATCH net] page_pool: Fix use-after-free in page_pool_recycle_in_ring
On Fri, May 23, 2025 at 1:31 AM Yunsheng Lin <linyunsheng@...wei.com> wrote:
>
> On 2025/5/23 14:45, Dong Chenchen wrote:
>
> >
> > static bool page_pool_recycle_in_ring(struct page_pool *pool, netmem_ref netmem)
> > {
> > + bool in_softirq;
> > int ret;
> int -> bool?
>
> > /* BH protection not needed if current is softirq */
> > - if (in_softirq())
> > - ret = ptr_ring_produce(&pool->ring, (__force void *)netmem);
> > - else
> > - ret = ptr_ring_produce_bh(&pool->ring, (__force void *)netmem);
> > -
> > - if (!ret) {
> > + in_softirq = page_pool_producer_lock(pool);
> > + ret = !__ptr_ring_produce(&pool->ring, (__force void *)netmem);
> > + if (ret)
> > recycle_stat_inc(pool, ring);
> > - return true;
> > - }
> > + page_pool_producer_unlock(pool, in_softirq);
> >
> > - return false;
> > + return ret;
> > }
> >
> > /* Only allow direct recycling in special circumstances, into the
> > @@ -1091,10 +1088,14 @@ static void page_pool_scrub(struct page_pool *pool)
> >
> > static int page_pool_release(struct page_pool *pool)
> > {
> > + bool in_softirq;
> > int inflight;
> >
> > page_pool_scrub(pool);
> > inflight = page_pool_inflight(pool, true);
> > + /* Acquire producer lock to make sure producers have exited. */
> > + in_softirq = page_pool_producer_lock(pool);
> > + page_pool_producer_unlock(pool, in_softirq);
>
> Is a compiler barrier needed to ensure compiler doesn't optimize away
> the above code?
>
I don't want to derail this conversation too much, and I suggested a
similar fix to this initially, but now I'm not sure I understand why
it works.
Why is the existing barrier not working and acquiring/releasing the
producer lock fixes this issue instead? The existing barrier is the
producer thread incrementing pool->pages_state_release_cnt, and
page_pool_release() is supposed to block the freeing of the page_pool
until it sees the
`atomic_inc_return_relaxed(&pool->pages_state_release_cnt);` from the
producer thread. Any idea why this barrier is not working? AFAIU it
should do the exact same thing as acquiring/dropping the producer
lock.
--
Thanks,
Mina
Powered by blists - more mailing lists