[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20150417195238.GH16743@htj.duckdns.org>
Date: Fri, 17 Apr 2015 15:52:38 -0400
From: Tejun Heo <tj@...nel.org>
To: David Miller <davem@...emloft.net>
Cc: penguin-kernel@...ove.SAKURA.ne.jp, akpm@...ux-foundation.org,
linux-kernel@...r.kernel.org, netdev@...r.kernel.org
Subject: Re: [PATCHSET] printk, netconsole: implement reliable netconsole
Hello,
On Fri, Apr 17, 2015 at 02:55:37PM -0400, David Miller wrote:
> > * The bulk of patches are to pipe extended log messages to console
> > drivers and let netconsole relay them to the receiver (and quite a
> > bit of refactoring in the process), which, regardless of the
> > reliability logic, is beneficial as we're currently losing
> > structured logging (dictionary) and other metadata over consoles and
> > regardless of where the reliability logic is implemented, it's a lot
> > easier to have messages IDs.
>
> I do not argue against cleanups and good restructuring of the existing
> code. But you have decided to mix that up with something that is not
> exactly non-controversial.
Is the controlversial part referring to sending extended messages or
the reliability part or both?
> You'd do well to seperate the cleanups from the fundamental changes,
> so they can be handled separately.
Hmmm... yeah, probably would have been a better idea. FWIW, the
patches are stacked roughly in the order of escalating
controversiness. Will split the series up.
> > * The only thing necessary for reliable transmission are timer and
> > netpoll. There sure are cases where they go down too but there's a
> > pretty big gap between those two going down and userland getting
> > hosed, but where to put the retransmission and reliability logic
> > definitely is debatable.
>
> I fundamentally disagree, exactly on this point.
>
> If you take an OOPS in a software interrupt handler (basically, all of
> the networking receive paths and part of the transmit paths, for
> example) you're not going to be taking timer interrupts.
Sure, if irq handling is hosed, this won't work but I think there are
enough other failure modes like oopsing while holding a mutex or
falling into infinite loop while holding task_list lock (IIRC we had
something simliar a while ago due to iterator bug). Whether being
more robust in those cases is worthwhile is definitely debatable. I
thought the added complexity was small enough but the judgement can
easily fall on the other side.
> And that's the value of netconsole, the chance (albeit not %100) of
> getting messages in those scenerios.
None of the changes harm that in any way. Anyways, I'll split up the
extended message and the rest.
Thanks.
--
tejun
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists