[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <49060561.4060607@redhat.com>
Date: Mon, 27 Oct 2008 14:16:01 -0400
From: Chris Snook <csnook@...hat.com>
To: Matt Mackall <mpm@...enic.com>
CC: Linux Kernel Mailing List <linux-kernel@...r.kernel.org>,
akpm <akpm@...uxfoundation.org>
Subject: Re: Bloatwatch 2.6.28-rc1: last_sysfs_file
Matt Mackall wrote:
> On Mon, 2008-10-27 at 12:50 -0400, Chris Snook wrote:
>> Matt Mackall wrote:
>>> On Sat, 2008-10-25 at 06:38 -0400, Chris Snook wrote:
>>>> Matt Mackall wrote:
>>>>> On Fri, 2008-10-24 at 15:52 -0500, Matt Mackall wrote:
>>>>>> 2.6.28-rc1 adds 4k for last_sysfs_file debug tracking. That's one hell
>>>>>> of a long sysfs path.
>>>>>>
>>>>>> http://www.selenic.com/bloatwatch/?cmd=compare;v1=2.6.27;v2=2.6.28-rc1;part=/built-in/fs/sysfs
>>>>> ..especially given that printk is limited to 1k at a time.
>>>>>
>>>>>
>>>>> sysfs: shrink last_sysfs_file to a reasonable size
>>>>>
>>>>> sysfs was reserving 4k to store filenames for debug despite printk being
>>>>> limited to 1k. Shrink this to something more reasonable.
>>>>>
>>>>> Signed-off-by: Matt Mackall <mpm@...enic.com>
>>>>>
>>>>> diff -r ac8c82ff3be7 fs/sysfs/file.c
>>>>> --- a/fs/sysfs/file.c Fri Oct 24 13:13:04 2008 -0500
>>>>> +++ b/fs/sysfs/file.c Fri Oct 24 16:11:53 2008 -0500
>>>>> @@ -25,7 +25,7 @@
>>>>> #include "sysfs.h"
>>>>>
>>>>> /* used in crash dumps to help with debugging */
>>>>> -static char last_sysfs_file[PATH_MAX];
>>>>> +static char last_sysfs_file[200]; /* allow for disgustingly long paths */
>>>>> void sysfs_printk_last_file(void)
>>>>> {
>>>>> printk(KERN_EMERG "last sysfs file: %s\n", last_sysfs_file);
>>>>>
>>>>>
>>>> Please don't use magic numbers. Use a symbolic constant, and modify printk.c to
>>>> use the same.
>>> I'm explicitly not using a magic number because the relevant magic
>>> number is absurdly large. It is in fact 4 times larger than printk can
>>> print. And its 40 times larger than any path we are liable to encounter.
>>> Inventing a new use-once #define meaning "good enough for debugging
>>> 99.99% of sysfs bugs" one line up is not an improvement here.
>>>
>>> And why on earth would I modify anything in printk?
>> Because the 1024 limit is a created by an array declaration in printk.c, which
>> uses the number 1024. Instead, put something like this in a header file:
>>
>> #define PRINTK_MAX 1024
>
> That 1024 is supposed to effectively be infinity. Printk is intended to
> be a largely line-oriented facility. If you're printing something large
> enough that you even have to ask yourself 'hmm, how big a buffer can I
> print?', you're printing something -way- too large.
>
> So no, I think printk is fine as it stands.
>
>> Arbitrarily clamping critical debugging features to limits pulled out of one's
>> ass is generally frowned upon.
>
> First, I don't think it's accurate to apply the adjective 'critical' to
> a facility that's existed in mainline for mere days, while sysfs itself
> has existed many years.
It's critical for debugging *new* things like PCI domain support, that people
using very non-embedded systems care a whole lot about.
> Second, I didn't pull it out of my ass, actually. I did a histogram of
> sysfs path lengths on my system, found it cut off at ~100, and then
> added generous padding.
Does your system have multiple NUMA nodes, cascaded PCI bridges, and multipathed
fibre channel storage? The point of this debugging feature is to help debug the
cases that are *not* trivial. Your workstation probably isn't very interesting
in this regard.
>> If you can justify some limit lower than the
>> length of the printk buffer, great, but "I want to save a few hundred bytes of
>> RAM, total" is insufficient, unless you want to restrict it to the
>> CONFIG_EMBEDDED world.
>
> I think you've got this exactly backwards. This is just a debugging
> hint. It is not a standards correctness or kernel robustness issue.
> Reporting the last 20 bytes of the filename would be a sufficient hint
> for most purposes. On the vast majority of machines, this feature will
> be effectively dormant, and this buffer will be wasted space so we have
> to weigh the usefulness of storing unusually long filenames against the
> value of that space for -all other purposes-. If it's going to be on
> unconditionally, it should be as small as possible.
Showing the last 20 characters doesn't help in a highly repetitive file
hierarchy where how you got there may be more important than where you ended up.
I agree with you that it won't be terribly interesting on most systems.
Perhaps we should find a way to resolve that. Would you be satisfied if the
full buffer were enabled by a debug config option?
As for the savings of 824 bytes, you're going to have a very hard time
convincing the people who want this feature un-castrated to debug their
multi-chassis Itanium machines that saving 824 bytes is worth losing valuable
debugging data.
> Just for fun, let's imagine for a moment that someone -did- have a 1k
> sysfs path. And, further, we happen to have a fatal oops at some point
> after it. And, further, we were lucky enough for it to get directed to a
> text console. But unfortunately, our debugging hint has helpfully taken
> up half the screen, and we've actually scrolled the much more important
> *cause of the oops* off the screen. Hurray! In fact, we'll probably
> scroll most of the giant sysfs path off too when we print a backtrace.
On the systems where we care about extremely long sysfs paths, screen space
isn't an issue, because the thing is sitting in a data center and you're
debugging with netconsole or a serial console.
> Given that the only purpose of this feature is to be printked in an
> already tightly constrained oops report where it's quite likely to be
> completely extraneous, spending more than a couple lines on it is a good
> way to fail.
Now, this is valid. "200 prevents the printk from consuming more than 3 lines
on an 80 column terminal." is a much more useful comment than "200 seems to be
big enough on my box." So, is 3 lines within budget? Would 4 lines be too much?
-- Chris
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists