[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <5791BA22.7050309@cumulusnetworks.com>
Date: Thu, 21 Jul 2016 23:16:02 -0700
From: Roopa Prabhu <roopa@...ulusnetworks.com>
To: "Eric W. Biederman" <ebiederm@...ssion.com>
CC: Magnus Bergroth <bergroth@...du.net>, netdev@...r.kernel.org,
Robert Shearman <rshearma@...cade.com>
Subject: Re: iproute2 mpls max labels
On 7/21/16, 1:00 PM, Eric W. Biederman wrote:
> Roopa Prabhu <roopa@...ulusnetworks.com> writes:
>
>> On 7/16/16, 11:24 AM, Magnus Bergroth wrote:
>>> Wanted to use more than the default maximum of 8 mpls labels. Max labels
>>> seems to be hardcode to 8 in two places.
>>>
>>> --- iproute2-4.6.0/lib/utils.c 2016-05-18 20:56:02.000000000 +0200
>>> +++ iproute2-4.6.0-bergroth/lib/utils.c 2016-07-16 20:12:10.714958071
>>> +0200
>>> @@ -476,7 +476,7 @@
>>> addr->bytelen = 4;
>>> addr->bitlen = 20;
>>> /* How many bytes do I need? */
>>> - for (i = 0; i < 8; i++) {
>>> + for (i = 0; i < MPLS_MAX_LABELS; i++) {
>>> if (ntohl(addr->data[i]) & MPLS_LS_S_MASK) {
>>> addr->bytelen = (i + 1)*4;
>>> break;
>>>
>>>
>>> --- iproute2-4.6.0/include/utils.h 2016-05-18 20:56:02.000000000 +0200
>>> +++ iproute2-4.6.0-bergroth/include/utils.h 2016-07-15
>>> 11:55:57.297681742 +0200
>>> @@ -54,6 +54,9 @@
>>> #define NEXT_ARG_FWD() do { argv++; argc--; } while(0)
>>> #define PREV_ARG() do { argv--; argc++; } while(0)
>>>
>>> +/* Maximum number of labels the mpls helpers support */
>>> +#define MPLS_MAX_LABELS 8
>>> +
>>> typedef struct
>>> {
>>> __u16 flags;
>>> @@ -61,7 +64,7 @@
>>> __s16 bitlen;
>>> /* These next two fields match rtvia */
>>> __u16 family;
>>> - __u32 data[8];
>>> + __u32 data[MPLS_MAX_LABELS];
>>> } inet_prefix;
>>>
> This structure is not MPLS specific so that is not appropriate to use
> MPLS_MAX_LABELS when definiting the structure. Likewise changing this
> structure and limiting the changes to mpls parts of the code is not
> appropriate.
>
>>> #define PREFIXLEN_SPECIFIED 1
>>> @@ -88,9 +91,6 @@
>>> # define AF_MPLS 28
>>> #endif
>>>
>>> -/* Maximum number of labels the mpls helpers support */
>>> -#define MPLS_MAX_LABELS 8
>>> -
>>> __u32 get_addr32(const char *name);
>>> int get_addr_1(inet_prefix *dst, const char *arg, int family);
>>> int get_prefix_1(inet_prefix *dst, char *arg, int family);
>>>
>>>
>> I did not realize it is hardcoded to 8 in iproute2. Because kernel has a hard coded limit of
>> 2.
>> I think we need to fix it in a few places:
>> a) we should move the kernel #define to a uapi header file which iproute2 can use
>> b) there has been a general ask to bump the kernel MAX_LABELS from 2 and I don't see
>> a problem with it yet. so, we could bump it to 8.
>>
>> Were you planning to post patches for one or both of the above ?.
>>
>> I can post them too. Let me know.
> a) I just looked and the kernel netlink protocol does not have a limit.
> The kernel does have a limit but the netlink protocol does not so
> there is no point in exporting a limit in a uapi header, it will
> just be out of date and wrong.
sure, if you have concerns about making it part of uapi, we can
separately maintain the same limit in iproute2 and kernel.
>
> b) I can see in principle bumping up the kernels MAX_LABELS past two
> although I haven't heard those requests, or understand the use cases.
> I don't recall seeing any ducumentation on cases where it is
> desirable to push a lot of labels at once. (Do hardware
> implementations support pushing a lot of labels at once?)
I don't know of any use cases either. But i have received multiple requests
on bumping the current limit of two
>
> Bumping past 8 seems quite a lot. That starts feeling like people
> trying to break other peoples mpls stacks. That is asking for more
> packet space for labels than ipv6 uses for addresses and ipv6 is way
> oversized. The commonly agreed wisdom is the world only needs 40 to
> 48 bits to route on to reach the entire world.
>
> I can completely understand a few specialty labels going beyond what
> is needed for general purpose routing but pushing more that 8 at
> once seems huge. Especially since you can recirculate packets if
> you really need to and push more labels that way.
I don't think there is an ask for going more than 8. anything greater than
current 2 is good.
>
> Add to that for a software implementation we have these pesky things
> called cache lines. I can see in the kernel pushing struct
> mpls_route towards the size of a full cacheline. Today we are at 52
> bytes not counting the via adress. With the via address we are at 56
> (ipv4), 58 (ethernet), and 60 (ipv6) bytes. Which means in we have
> to make the kernel data structures smarter or we risk messing up the
> performance of the common case.
>
> Also we do need some kind of limit in the kernel to protect against
> insane inputs.
>
> So while I can imagine there are reasonable cases for bumping up the
> maximum number of labels in the kernel I think we need to be smart if
> we ware going to do that. Which probably means we will want a
> __mpls_nh_label helper function.
>
sure, yes, the current static label array works well for the common case
of 2 labels. does it make sense for it to be configurable
with the default being 2 and max something like 8 ?
Thanks,
Roopa
Powered by blists - more mailing lists