[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20241125201036.GK160612@unreal>
Date: Mon, 25 Nov 2024 22:10:36 +0200
From: Leon Romanovsky <leon@...nel.org>
To: Parav Pandit <parav@...dia.com>
Cc: "NBU-Contact-longli (EXTERNAL)" <longli@...rosoft.com>,
Konstantin Taranov <kotaranov@...rosoft.com>,
Konstantin Taranov <kotaranov@...ux.microsoft.com>,
Wei Hu <weh@...rosoft.com>,
"sharmaajay@...rosoft.com" <sharmaajay@...rosoft.com>,
"jgg@...pe.ca" <jgg@...pe.ca>,
"linux-rdma@...r.kernel.org" <linux-rdma@...r.kernel.org>,
linux-netdev <netdev@...r.kernel.org>,
"open list:Hyper-V/Azure CORE AND DRIVERS" <linux-hyperv@...r.kernel.org>
Subject: Re: [PATCH rdma-next 1/1] RDMA/mana_ib: Set correct device into ib
On Mon, Nov 25, 2024 at 03:56:01PM +0000, Parav Pandit wrote:
>
>
> > From: Long Li <longli@...rosoft.com>
> > Sent: Thursday, November 21, 2024 5:34 AM
> >
> > > >
> > > > Actually, another alternative solution for mana_ib is always set the
> > > > slave device, but in the GID mgmt code we need the following patch.
> > > > The problem is that it may require testing/confirmation from other
> > > > ib providers
> > > as in the worst case some GIDs will not be listed.
> > >
> > > is_eth_active_slave_of_bonding_rcu() is for bonding.
> >
> > Sorry, need to bring this issue up again.
> >
> > This patch has broken user-space programs (e.g DPDK) that requires to
> > export a kernel device to user-mode.
> >
> > With this patch, the RDMA driver grabbed a reference from the master
> > device, it's impossible to move the master device to user-mode.
> >
> > I think the root cause is that the individual driver should not decide on which
> > (master or slave) address should be used for GID. roce_gid_mgmt.c should
> > handle this situation.
> >
> > I think Konstantin's suggestion makes sense, how about we do this (don't
> > need to define netdev_is_slave(dev)):
> >
> > --- a/drivers/infiniband/core/roce_gid_mgmt.c
> > +++ b/drivers/infiniband/core/roce_gid_mgmt.c
> > @@ -161,7 +161,7 @@ is_eth_port_of_netdev_filter(struct ib_device
> > *ib_dev, u32 port,
> > res = ((rdma_is_upper_dev_rcu(rdma_ndev, cookie) &&
> > (is_eth_active_slave_of_bonding_rcu(rdma_ndev, real_dev) &
> > REQUIRED_BOND_STATES)) ||
> > - real_dev == rdma_ndev);
> > + (real_dev == rdma_ndev &&
> > + !netif_is_bond_slave(rdma_ndev)));
> >
> > rcu_read_unlock();
> > return res;
> >
> >
> > is_eth_port_of_netdev_filter() should not return true if this netdev is a
> > bonded slave. In this case, only use the address of its bonded master.
> >
> Right. This change makes sense to me.
> I don't have a setup presently to verify it to ensure I didn't miss a corner case.
> Leon,
> Can you or others please test the regression once with the formal patch?
Sure, once Long will send the patch, I'll make sure that it is tested.
Thanks
>
Powered by blists - more mailing lists