[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20200728210116.56potw45eyptmlc7@bsd-mbp.dhcp.thefacebook.com>
Date: Tue, 28 Jul 2020 14:01:16 -0700
From: Jonathan Lemon <jonathan.lemon@...il.com>
To: Jason Gunthorpe <jgg@...dia.com>
Cc: Christoph Hellwig <hch@....de>, netdev@...r.kernel.org,
kernel-team@...com, robin.murphy@....com,
akpm@...ux-foundation.org, davem@...emloft.net, kuba@...nel.org,
willemb@...gle.com, edumazet@...gle.com,
steffen.klassert@...unet.com, saeedm@...lanox.com,
maximmi@...lanox.com, bjorn.topel@...el.com,
magnus.karlsson@...el.com, borisp@...lanox.com, david@...hat.com
Subject: Re: [RFC PATCH v2 21/21] netgpu/nvidia: add Nvidia plugin for netgpu
On Tue, Jul 28, 2020 at 03:19:04PM -0300, Jason Gunthorpe wrote:
> On Mon, Jul 27, 2020 at 06:48:12PM -0700, Jonathan Lemon wrote:
>
> > While the current GPU utilized is nvidia, there's nothing in the rest of
> > the patches specific to Nvidia - an Intel or AMD GPU interface could be
> > equally workable.
>
> I think that is very misleading.
>
> It looks like this patch, and all the ugly MM stuff, is done the way
> it is *specifically* to match the clunky nv_p2p interface that only
> the NVIDIA driver exposes.
For /this/ patch [21], this is quite true. I'm forced to use the nv_p2p
API if I want to use the hardware that I have. What's being overlooked
is that the host mem driver does not do this, nor would another GPU
if it used p2p_dma. I'm just providing get_page, put_page, get_dma.
> Any approach done in tree, where we can actually modify the GPU
> driver, would do sane things like have the GPU driver itself create
> the MEMORY_DEVICE_PCI_P2PDMA pages, use the P2P DMA API framework, use
> dmabuf for the cross-driver attachment, etc, etc.
So why doesn't Nvidia implement the above in the driver?
Actually a serious question, not trolling here.
> If you are serious about advancing this then the initial patches in a
> long road must be focused on building up the core kernel
> infrastructure for P2P DMA to a point where netdev could consume
> it. There has been a lot of different ideas thrown about on how to do
> this over the years.
Yes, I'm serious about doing this work, and may not have seen or
remember all the various ideas I've seen over time. The netstack
operates on pages - are you advocating replacing them with sglists?
> > I think this is a better patch than all the various implementations of
> > the protocol stack in the form of RDMA, driver code and device firmware.
>
> Oh? You mean "better" in the sense the header split offload in the NIC
> is better liked than a full protocol running in the NIC?
Yes. The NIC firmware should become simpler, not more complicated.
--
Jonathan
Powered by blists - more mailing lists