[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <dh6gl6xzufrxk23dwei3dcyljjlspjx3gwdhi6o3umtltpypkw@ef4n6llndg5p>
Date: Tue, 18 Nov 2025 10:51:11 +0100
From: Stefano Garzarella <sgarzare@...hat.com>
To: Michal Luczaj <mhal@...x.co>
Cc: "David S. Miller" <davem@...emloft.net>,
Eric Dumazet <edumazet@...gle.com>, Jakub Kicinski <kuba@...nel.org>,
Paolo Abeni <pabeni@...hat.com>, Simon Horman <horms@...nel.org>,
George Zhang <georgezhang@...are.com>, Andy King <acking@...are.com>, Dmitry Torokhov <dtor@...are.com>,
virtualization@...ts.linux.dev, netdev@...r.kernel.org, linux-kernel@...r.kernel.org
Subject: Re: [PATCH net] vsock: Ignore signal/timeout on connect() if already
established
On Mon, Nov 17, 2025 at 09:57:25PM +0100, Michal Luczaj wrote:
>During connect(), acting on a signal/timeout by disconnecting an already
>established socket leads to several issues:
>
>1. connect() invoking vsock_transport_cancel_pkt() ->
> virtio_transport_purge_skbs() may race with sendmsg() invoking
> virtio_transport_get_credit(). This results in a permanently elevated
> `vvs->bytes_unsent`. Which, in turn, confuses the SOCK_LINGER handling.
>
>2. connect() resetting a connected socket's state may race with socket
> being placed in a sockmap. A disconnected socket remaining in a sockmap
> breaks sockmap's assumptions. And gives rise to WARNs.
>
>3. connect() transitioning SS_CONNECTED -> SS_UNCONNECTED allows for a
> transport change/drop after TCP_ESTABLISHED. Which poses a problem for
> any simultaneous sendmsg() or connect() and may result in a
> use-after-free/null-ptr-deref.
>
>Do not disconnect socket on signal/timeout. Keep the logic for unconnected
>sockets: they don't linger, can't be placed in a sockmap, are rejected by
>sendmsg().
>
>[1]: https://lore.kernel.org/netdev/e07fd95c-9a38-4eea-9638-133e38c2ec9b@rbox.co/
>[2]: https://lore.kernel.org/netdev/20250317-vsock-trans-signal-race-v4-0-fc8837f3f1d4@rbox.co/
>[3]: https://lore.kernel.org/netdev/60f1b7db-3099-4f6a-875e-af9f6ef194f6@rbox.co/
>
>Fixes: d021c344051a ("VSOCK: Introduce VM Sockets")
>Signed-off-by: Michal Luczaj <mhal@...x.co>
>---
>Note that this patch does not tackle related problems described in
>https://lore.kernel.org/netdev/70371863-fa71-48e0-a1e5-fee83e7ca37c@rbox.co/
Ooops, it seems I forgot to reply. Thanks for bringing this to my
attention agan. Next time feel free to ping me :-)
I'll reply in that thread.
>---
> net/vmw_vsock/af_vsock.c | 48 ++++++++++++++++++++++++++++++++++++++++--------
> 1 file changed, 40 insertions(+), 8 deletions(-)
>
>diff --git a/net/vmw_vsock/af_vsock.c b/net/vmw_vsock/af_vsock.c
>index 76763247a377..a52e7dbe7878 100644
>--- a/net/vmw_vsock/af_vsock.c
>+++ b/net/vmw_vsock/af_vsock.c
>@@ -1528,6 +1528,23 @@ static void vsock_connect_timeout(struct work_struct *work)
> sock_put(sk);
> }
>
>+static void vsock_reset_interrupted(struct sock *sk)
>+{
>+ struct vsock_sock *vsk = vsock_sk(sk);
>+
>+ /* Try to cancel VIRTIO_VSOCK_OP_REQUEST skb sent out by
>+ * transport->connect().
>+ */
>+ vsock_transport_cancel_pkt(vsk);
>+
>+ /* Listener might have already responded with VIRTIO_VSOCK_OP_RESPONSE.
>+ * Its handling expects our sk_state == TCP_SYN_SENT, which hereby we
>+ * break. In such case VIRTIO_VSOCK_OP_RST will follow.
>+ */
>+ sk->sk_state = TCP_CLOSE;
>+ sk->sk_socket->state = SS_UNCONNECTED;
>+}
>+
> static int vsock_connect(struct socket *sock, struct sockaddr *addr,
> int addr_len, int flags)
> {
>@@ -1661,18 +1678,33 @@ static int vsock_connect(struct socket *sock, struct sockaddr *addr,
> timeout = schedule_timeout(timeout);
> lock_sock(sk);
>
>+ /* Connection established. Whatever happens to socket once we
>+ * release it, that's not connect()'s concern. No need to go
>+ * into signal and timeout handling. Call it a day.
>+ *
>+ * Note that allowing to "reset" an already established socket
>+ * here is racy and insecure.
>+ */
>+ if (sk->sk_state == TCP_ESTABLISHED)
>+ break;
>+
>+ /* If connection was _not_ established and a signal/timeout came
>+ * to be, we want the socket's state reset. User space may want
>+ * to retry.
>+ *
>+ * sk_state != TCP_ESTABLISHED implies that socket is not on
>+ * vsock_connected_table. We keep the binding and the transport
>+ * assigned.
>+ */
> if (signal_pending(current)) {
> err = sock_intr_errno(timeout);
>- sk->sk_state = sk->sk_state == TCP_ESTABLISHED ? TCP_CLOSING : TCP_CLOSE;
>- sock->state = SS_UNCONNECTED;
>- vsock_transport_cancel_pkt(vsk);
>- vsock_remove_connected(vsk);
>+ vsock_reset_interrupted(sk);
> goto out_wait;
>- } else if ((sk->sk_state != TCP_ESTABLISHED) && (timeout == 0)) {
>+ }
>+
>+ if (timeout == 0) {
> err = -ETIMEDOUT;
>- sk->sk_state = TCP_CLOSE;
>- sock->state = SS_UNCONNECTED;
>- vsock_transport_cancel_pkt(vsk);
>+ vsock_reset_interrupted(sk);
> goto out_wait;
I'm fine with the change, but now both code blocks are the same, so
can we unify them?
I mean something like this:
if (signal_pending(current) || timeout == 0 {
err = timeout == 0 ? -ETIMEDOUT : sock_intr_errno(timeout);
...
}
Maybe at that point we can also remove the vsock_reset_interrupted()
function and put the code right there.
BTW I don't have a strong opinion, what do you prefer?
Thanks,
Stefano
Powered by blists - more mailing lists