[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <8fefa815a3924fb3b893371c988781ad@AcuMS.aculab.com>
Date: Wed, 22 Jul 2020 08:39:06 +0000
From: David Laight <David.Laight@...LAB.COM>
To: 'Christoph Hellwig' <hch@....de>, Andy Lutomirski <luto@...nel.org>
CC: Jens Axboe <axboe@...nel.dk>,
linux-arch <linux-arch@...r.kernel.org>,
Linux API <linux-api@...r.kernel.org>,
LKML <linux-kernel@...r.kernel.org>,
"io-uring@...r.kernel.org" <io-uring@...r.kernel.org>
Subject: RE: io_uring vs in_compat_syscall()
From: Christoph Hellwig
> Sent: 22 July 2020 07:31
...
> > I agree that MSG_CMSG_COMPAT is nasty, but I think the concept is
> > sound -- rather than tracking whether we're compat by using a
> > different function or a per-thread variable, actually explicitly
> > tracking the mode seems sensible.
>
> I very strongly disagree. Two recent projects I did was to remove
> the compat_exec mess, and the compat get/setsockopt mess, and each
> time it removed hundreds of lines of code duplicating native
> functionality, often in slightly broken ways. We need a generic
> out of band way to transfer the information down and just check in
> in a few strategic places, and in_compat_syscall() does the right
> thing for that.
Hmmm... set_fs(KERNEL_DS) is a per-thread variable that indicates
that the current system call is being done by the kernel.
So you are pulling two different bits of code in opposite directions.
It has to be safer to track the flag through with the request.
Then once any conversion has been done the flag can be corrected.
Imagine something like a bpf hook on a compat syscall.
Having read the user buffer into kernel space it may decide
to reformat it to the native layout to process it.
After which the code has a native format buffer even though
in_compat_syscall() returns true.
To the native/compat flag is actually a property of the buffer
much the same as whether it is user/kernel.
The other property of the buffer is whether embedded addresses
are user or kernel.
If the buffer has been read from userspace then they are user.
If the request originated in the kernel they are kernel.
This difference may matter in the future.
David
-
Registered Address Lakeside, Bramley Road, Mount Farm, Milton Keynes, MK1 1PT, UK
Registration No: 1397386 (Wales)
Powered by blists - more mailing lists