[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <200709192143.14635.david-b@pacbell.net>
Date: Wed, 19 Sep 2007 21:43:14 -0700
From: David Brownell <david-b@...bell.net>
To: Andrew Morton <akpm@...ux-foundation.org>
Cc: Chuck Ebbert <cebbert@...hat.com>, tilman@...p.cc,
linux-kernel@...r.kernel.org, gregkh@...e.de, a.zummo@...ertech.it
Subject: Re: 2.6.23-rc6-mm1
On Wednesday 19 September 2007, Andrew Morton wrote:
> On Wed, 19 Sep 2007 16:44:48 -0700
> David Brownell <david-b@...bell.net> wrote:
>
> > > > <4>[ 21.211942] Duplicate file names "rtc" detected
> > >
> > > Nah, that's an rtc-specific problem.
> >
> > RTC-related ... but it's a procfs bug, since it's procfs which doesn't
> > even bother to check for duplicate names before it registers files.
>
> So you keep on claiming, but I don't think I've yet seen a description of
> the *reason* why two copies of this file are being created,
Yes you did .... most recently in http://lkml.org/lkml/2007/9/19/18 is:
> > The missing step seems to be that proc_register() doesn't bother
> > to check whether there's already an entry for that file. Which
> > is what the appended *UNTESTED* patch does (it compiles though).
Although maybe you meant me to parse that differently ... as in,
not "why is procfs doing this broken thing?" but rather "how is it
that procfs fault (non)handling code ended up getting used?".
That's always seemed self-evident: the RTC framework was creating
it for /dev/rtc0 (presumably, rtc-cmos), while at the same time the
legacy drivers/char/rtc.c was creating it for /dev/rtc.
Workaround by configuring just one, and the problem goes away.
(Although it *ought* to be OK to configure both, with all the normal
resource exclusions kicking in to ensure only one will run.)
> and a
> description of why that is an OK thing for the kernel to be doing.
It's not a wrong thing, at any rate. This is a fairly basic task
in any filesystem: mutual exclusion. Code is allowed to rely on
filesystems acting correctly...
> > > From: Chuck Ebbert <cebbert@...hat.com>
> > >
> > > AFAICT the rtc problem is caused by misconfiguration: both the new
> > > and old rtc driver have been built and they are both trying to load.
> >
> > That _shouldn't_ be a problem at all; only one of them should be
> > able to bind to that hardware.
> >
> > The only problem I see in these messages is that procfs bug.
> >
>
> It's not obvious that this is only a procfs bug. If some part of the
> kernel tries to add a procfs file which is already there, that's often a
> bug in the caller.
Not really; procfs is supposed to not create it if it's already there!!
Reasonable callers will cope with "it didn't get created", and that's
all they should really need to do.
> Yes, procfs should have been checking for this. But it is too late now for
> us to just fail out of the procfs registration code. Because this can
> cause previously buggy-but-works-ok code to now fail completely.
What do you mean by too late "now" ... just-before-2.6.23?
I'm a bit puzzled why this issue cropped up suddenly, when the
"two RTC drivers" configs have been behaving fine (presumably
failing properly, but at least not generating problem reports)
for some time. One of the procfs changes must have caused this
trouble.
And what would an example be of buggy-but-works code, which would
then be broken if the procfs stopped being buggy?
> So I think the best we can do now is to retain the runtime warning and to
> continue to "succeed" and to identify all the problematic codesites and to
> either fix them or to convince ourselves that they really are working as
> intended.
At a micro level, both the relevant call sites already have code
to tolerate the "couldn't create that file" error, which looked
correct at a quick reading.
- Dave
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists