[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20200917123316.GA113655@nvidia.com>
Date: Thu, 17 Sep 2020 09:33:16 -0300
From: Jason Gunthorpe <jgg@...dia.com>
To: Leon Romanovsky <leon@...nel.org>
CC: Doug Ledford <dledford@...hat.com>,
Leon Romanovsky <leonro@...dia.com>,
Eli Cohen <eli@....mellanox.co.il>,
<linux-kernel@...r.kernel.org>, <linux-rdma@...r.kernel.org>,
Roland Dreier <rolandd@...co.com>
Subject: Re: [PATCH rdma-next 0/8] Cleanup and fix the CMA state machine
On Wed, Sep 02, 2020 at 11:11:14AM +0300, Leon Romanovsky wrote:
> From: Leon Romanovsky <leonro@...dia.com>
>
> >From Jason:
>
> The RDMA CMA continues to attract syzkaller bugs due to its somewhat loose
> operation of its FSM. Audit and scrub the whole thing to follow modern
> expectations.
>
> Overall the design elements are broadly:
>
> - The ULP entry points MUST NOT run in parallel with each other. The ULP
> is solely responsible for preventing this.
>
> - If the ULP returns !0 from it's event callback it MUST guarentee that no
> other ULP threads are touching the cm_id or calling into any RDMA CM
> entry point.
>
> - ULP entry points can sometimes run conurrently with handler callbacks,
> although it is tricky because there are many entry points that exist
> in the flow before the handler is registered.
>
> - Some ULP entry points are called from the ULP event handler callback,
> under the handler_mutex. (however ucma never does this)
>
> - state uses a weird double locking scheme, in most cases one should hold
> the handler_mutex. (It is somewhat unclear what exactly the spinlock is
> for)
>
> - Reading the state without holding the spinlock should use READ_ONCE,
> even if the handler_mutex is held.
>
> - There are certain states which are 'stable' under the handler_mutex,
> exit from that state requires also holding the handler_mutex. This
> explains why testing the test under only the handler_mutex makes sense.
>
> Thanks
>
> Jason Gunthorpe (8):
> RDMA/cma: Fix locking for the RDMA_CM_CONNECT state
> RDMA/cma: Make the locking for automatic state transition more clear
> RDMA/cma: Fix locking for the RDMA_CM_LISTEN state
> RDMA/cma: Remove cma_comp()
> RDMA/cma: Combine cma_ndev_work with cma_work
> RDMA/cma: Remove dead code for kernel rdmacm multicast
> RDMA/cma: Consolidate the destruction of a cma_multicast in one place
> RDMA/cma: Fix use after free race in roce multicast join
Applied to for-next
Jason
Powered by blists - more mailing lists