lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Date:   Fri, 31 Jan 2020 16:48:40 +0000
From:   David Laight <David.Laight@...LAB.COM>
To:     'Eric Dumazet' <eric.dumazet@...il.com>,
        netdev <netdev@...r.kernel.org>
Subject: RE: Freeing 'temporary' IPv4 route table entries.

From: Eric Dumazet
> Sent: 31 January 2020 15:54
> 
> On 1/31/20 2:26 AM, David Laight wrote:
> > If I call sendmsg() on a raw socket (or probably
> > an unconnected UDP one) rt_dst_alloc() is called
> > in the bowels of ip_route_output_flow() to hold
> > the remote address.
> >
> > Much later __dev_queue_xmit() calls dst_release()
> > to delete the 'dst' referenced from the skb.
> >
> > Prior to f8864972 it did just that.
> > Afterwards the actual delete is 'laundered' through the
> > rcu callbacks.
> > This is probably ok for dst that are actually attached
> > to sockets or tunnels (which aren't freed very often).
> > But it leads to horrid long rcu callback sequences
> > when a lot of messages are sent.
> > (A sample of 1 gave nearly 100 deletes in one go.)
> > There is also the additional cost of deferring the free
> > (and the extra retpoline etc).
> >
> > ISTM that the dst_alloc() done during a send should
> > set a flag so that the 'dst' can be immediately
> > freed since it is known that no one can be picking up
> > a reference as it is being freed.
> >
> > Thoughts?
> >
> 
> I thought these routes were cached in per-cpu caches.
> 
> At least for UDP I do not see rcu callbacks being queueed.

For rawip (actually sending UDP) all the sends go:

282710.407759 |   0) pid-29158  |               |          inet_sendmsg() {
282710.407759 |   0) pid-29158  |   0.093 us    |            inet_send_prepare();
282710.407759 |   0) pid-29158  |               |            raw_sendmsg() {
282710.407759 |   0) pid-29158  |   0.094 us    |              security_sk_classify_flow();
282710.407760 |   0) pid-29158  |               |              ip_route_output_flow() {
282710.407760 |   0) pid-29158  |               |                ip_route_output_key_hash() {
282710.407760 |   0) pid-29158  |               |                  ip_route_output_key_hash_rcu() {
282710.407760 |   0) pid-29158  |   0.160 us    |                    fib_table_lookup();
282710.407760 |   0) pid-29158  |               |                    fib_select_path() {
282710.407760 |   0) pid-29158  |   0.102 us    |                      fib_result_prefsrc();
282710.407760 |   0) pid-29158  |   0.302 us    |                    } /* fib_select_path */
282710.407761 |   0) pid-29158  |   0.103 us    |                    find_exception();
282710.407761 |   0) pid-29158  |               |                    rt_dst_alloc() {
282710.407761 |   0) pid-29158  |               |                      dst_alloc() {
282710.407761 |   0) pid-29158  |               |                        kmem_cache_alloc() {
282710.407761 |   0) pid-29158  |   0.102 us    |                          should_failslab();
282710.407761 |   0) pid-29158  |   0.362 us    |                        } /* kmem_cache_alloc */
282710.407761 |   0) pid-29158  |   0.098 us    |                        dst_init();
282710.407762 |   0) pid-29158  |   0.747 us    |                      } /* dst_alloc */
282710.407762 |   0) pid-29158  |   0.936 us    |                    } /* rt_dst_alloc */

Then dst_release() gets called from __dev_queue_xmit().
In this test all the destinations are the same, but in real life
there will be 100s of different addresses.

Are the addresses inserted in the cache at the start of the send and
then deleted once the message is sent?

I'm not sure there are any 'connected' sockets to that IP address.
(This is SIP + RTP testing.)

	David

-
Registered Address Lakeside, Bramley Road, Mount Farm, Milton Keynes, MK1 1PT, UK
Registration No: 1397386 (Wales)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ