lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <55006C14.6030209@redhat.com>
Date:	Wed, 11 Mar 2015 09:23:48 -0700
From:	Alexander Duyck <alexander.h.duyck@...hat.com>
To:	Dave Taht <dave.taht@...il.com>,
	Alexander Duyck <alexander.duyck@...il.com>
CC:	"netdev@...r.kernel.org" <netdev@...r.kernel.org>,
	Stephen Hemminger <stephen@...workplumber.org>,
	Jiří Pírko <jiri@...nulli.us>,
	Scott Feldman <sfeldma@...il.com>,
	"davem@...emloft.net" <davem@...emloft.net>,
	Sabrina Dubroca <sd@...asysnail.net>
Subject: Re: [net-next PATCH] ipv4: FIB Local/MAIN table collapse

On 03/11/2015 09:03 AM, Dave Taht wrote:
> On Wed, Mar 11, 2015 at 7:58 AM, Alexander Duyck
> <alexander.duyck@...il.com> wrote:
>> On 03/06/2015 01:47 PM, Alexander Duyck wrote:
>>> This patch is meant to collapse local and main into one by converting
>>> tb_data from an array to a pointer.  Doing this allows us to point the
>>> local table into the main while maintaining the same variables in the
>>> table.
>>>
>>> As such the tb_data was converted from an array to a pointer, and a new
>>> array called data is added in order to still provide an object for tb_data
>>> to point to.
>>>
>>> In order to track the origin of the fib aliases a tb_id value was added in
>>> a hole that existed on 64b systems.  Using this we can also reverse the
>>> merge in the event that custom FIB rules are enabled.
>>>
>>> With this patch I am seeing an improvement of 20ns to 30ns for routing
>>> lookups as long as custom rules are not enabled, with custom rules enabled
>>> we fall back to split tables and the original behavior.
>>>
>>> Signed-off-by: Alexander Duyck <alexander.h.duyck@...hat.com>
>>> ---
>>>
>>> Changes since the RFC:
>>>    Added tb_id value so I could split main and local for custom rules
>>>    Added functionality to split tables if custom rules were enabled
>>>    Added table replacement and unmerge functions
>>>
>>> I have done some testing on this to verify performance gains and that I can
>>> split the tables correctly when I enable custom rules, but this patch is
>>> what I would consider to be high risk since I am certain there are things I
>>> have not considered.
>>>
>>> If this gets pulled into someone's switchdev tree instead of into net-next I
>>> would be perfectly fine with that as I am sure this can use some additional
>>> testing.
>> Has anyone out there had a chance to review this patch?  I had suggested
>> holding off on applying it to net-next for additional testing, but I
>> haven't found anything, and the only related issue is the one issue
>> Sabrina reported for the RTNL locking which was already in net-next anyway.
>
> My environment consists largely of 32 bit routing platforms (openwrt)
> running the current homenet routing protocol (babel), using, in
> particular, the babeld source specific routing extensions ( see
> http://arxiv.org/pdf/1403.0445v3.pdf  ) which do use multiple routing
> tables, and other uncommon things like p2p routes and odd netmasks
> like /27 and /30.

Can you define "use multiple routing tables"?  If you are adding other 
routing tables than local/main and adding custom rules to determine the 
order then this latest patch should automatically be switched off.  This 
patch only applies to the case where custom rules are not enabled, 
otherwise it splits the tables and you are right back to two tables one 
for local and one for main.

This patch is based on work Dave Miller originally sent me to merge the 
tables for local/main in order to improve performance in the standard 
end user case.  As far as Quagga and such I am not that familiar with 
how they configure the tables so I cannot say if there is a performance 
gain available or not.

> But: I simply can't keep up with you and this entire patchset changes
> so dramatically how routing works that you make me nervous. I would
> like to see quagga (and babeld) improved to use atomic updates in
> particular (They do a delete + add for no good reason), and for
> protocols like ospf, bgp, etc, to be actively tested on real traffic
> loads on live data against this entire patchset.

Yeah, looking back on it I probably shouldn't have sat on my patches for 
an entire release cycle before pushing them but as it was I was trying 
to stretch things out over two releases.  As far as Quagga and such that 
is several levels above this work as I have no idea about the internals 
of the user space routing daemons.

> Your 64 bit x86 benchmarks are very exciting and I do look forward to
> one day soon attempting to evaluate and benchmark these changes on
> teeny 32 bit platforms both in the general case and in this
> environment, and am mostly just hoping that others that are doing
> higher end real world routing with big tables (such as BGP) are paying
> more attention than I can.

Merging the two tables is one of the requests that came out of netconf 
this year.  I hadn't anticipated the amount of improvement that I have 
seen, though it makes sense since we are essentially reducing the memory 
utilization significantly as we reduce the backtrace and main table 
look-up to something like one tnode since in the case of most /24 
subnets you only have 3 or 4 addresses that are tracked in the trie and 
the route from main will usually just end up overlapping with the .0 
broadcast address.

> What test procedures are you using at present (and what is everyone else using)?

What I have been doing is mostly performance testing for different 
portions of the trie, performance of shortest path to find a match, 
performance of longest path to find a match, load/unload/reload 
interface, insert/delete route or address, dumping route, fib_trie, and 
fib_triestat, and with this most recent patch adding custom rules to 
force the tables to split.

- Alex
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ