[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <X/2fP9LNWXvp7up9@kroah.com>
Date: Tue, 12 Jan 2021 14:08:15 +0100
From: Greg KH <gregkh@...uxfoundation.org>
To: Adrian Catangiu <acatan@...zon.com>
Cc: linux-doc@...r.kernel.org, linux-kernel@...r.kernel.org,
qemu-devel@...gnu.org, kvm@...r.kernel.org,
linux-s390@...r.kernel.org, graf@...zon.com, arnd@...db.de,
ebiederm@...ssion.com, rppt@...nel.org, 0x7f454c46@...il.com,
borntraeger@...ibm.com, Jason@...c4.com, jannh@...gle.com,
w@....eu, colmmacc@...zon.com, luto@...nel.org, tytso@....edu,
ebiggers@...nel.org, dwmw@...zon.co.uk, bonzini@....org,
sblbir@...zon.com, raduweis@...zon.com, corbet@....net,
mst@...hat.com, mhocko@...nel.org, rafael@...nel.org, pavel@....cz,
mpe@...erman.id.au, areber@...hat.com, ovzxemul@...il.com,
avagin@...il.com, ptikhomirov@...tuozzo.com, gil@...l.com,
asmehra@...hat.com, dgunigun@...hat.com, vijaysun@...ibm.com,
oridgar@...il.com, ghammer@...hat.com
Subject: Re: [PATCH v4 1/2] drivers/misc: sysgenid: add system generation id
driver
On Tue, Jan 12, 2021 at 02:15:59PM +0200, Adrian Catangiu wrote:
> +``read()``:
> + Read is meant to provide the *new* system generation counter when a
> + generation change takes place. The read operation blocks until the
> + associated counter is no longer up to date, at which point the new
> + counter is provided/returned.
> + Nonblocking ``read()`` uses ``EAGAIN`` to signal that there is no
> + *new* counter value available. The generation counter is considered
> + *new* for each open file descriptor that hasn't confirmed the new
> + value following a generation change. Therefore, once a generation
> + change takes place, all ``read()`` calls will immediately return the
> + new generation counter and will continue to do so until the
> + new value is confirmed back to the driver through ``write()``.
> + Partial reads are not allowed - read buffer needs to be at least
> + ``sizeof(unsigned)`` in size.
"sizeof(unsigned)"? How about being specific and making this a real "X
bits big" value please.
"unsigned" does not work well across user/kernel boundries. Ok, that's
on understatement, the correct thing is "does not work at all".
Please be specific in your apis.
This is listed elsewhere also.
> +``write()``:
> + Write is used to confirm the up-to-date Sys Gen counter back to the
> + driver.
> + Following a VM generation change, all existing watchers are marked
> + as *outdated*. Each file descriptor will maintain the *outdated*
> + status until a ``write()`` confirms the up-to-date counter back to
> + the driver.
> + Partial writes are not allowed - write buffer should be exactly
> + ``sizeof(unsigned)`` in size.
> +
> +``poll()``:
> + Poll is implemented to allow polling for generation counter updates.
> + Such updates result in ``EPOLLIN`` polling status until the new
> + up-to-date counter is confirmed back to the driver through a
> + ``write()``.
> +
> +``ioctl()``:
> + The driver also adds support for tracking count of open file
> + descriptors that haven't acknowledged a generation counter update,
> + as well as a mechanism for userspace to *force* a generation update:
> +
> + - SYSGENID_GET_OUTDATED_WATCHERS: immediately returns the number of
> + *outdated* watchers - number of file descriptors that were open
> + during a system generation change, and which have not yet confirmed
> + the new generation counter.
But this number can instantly change after it is read, what good is it?
It should never be relied on, so why is this needed at all?
What can userspace do with this information?
thanks,
greg k-h
Powered by blists - more mailing lists