[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <49C78DD1.30406@fujitsu-siemens.com>
Date: Mon, 23 Mar 2009 14:25:37 +0100
From: Martin Wilck <martin.wilck@...itsu-siemens.com>
To: Corey Minyard <minyard@....org>
CC: Greg KH <greg@...ah.com>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
"openipmi-developer@...ts.sourceforge.net"
<openipmi-developer@...ts.sourceforge.net>
Subject: Re: [PATCH] limit CPU time spent in kipmid
Corey Minyard wrote:
>>> It's actually already sort of self-tuning. kipmid sleeps unless there is
>>> IPMI activity. It only spins if it is expecting something from the
>>> controller.
The self-tuning is fine (as long as there is no CPU load, which may slow
down stuff a lot, see my other posting). But on systems with many
sensors it will lead to considerable CPU time shown in "top" and other
tools for kipmid. And this confuses users. Users think that this is the
hardware vendor's fault - that's why I am sending this patch (if you so
wish, it is indeed the vendor's fault to use the outdated KCS interface
- but that's a different discussion, please let's keep it separate).
>>> I've been thinking about this a little more. Assuming that the self-tuning
>>> is working (and it appears to be working fine on my systems), that means
>>> that something is causing the IPMI driver to constantly talk to the
>>> management controller. I can think of three things:
>>>
>>> 1. The user is constantly sending messages to management controller.
This is what I did in my benchmark, of course. But also in real systems,
there are now many sensors (think dozens of DIMMs with several sensors
on each DIMM) and many events, causing constant IPMI traffic.
>>> 2. There is something wrong with the hardware, like the ATTN bit is
>>> stuck high, causing the driver to constantly poll the management
>>> controller.
>>> 3. The driver either has a bug or needs some more work to account for
>>> something the hardware needs it to do to clear the ATTN bit.
I think both 2.) and 3.) is not the case here.
>>> If it's #1 above, then I don't know if there is anything we can do about
>>> it. The patch Martin sent will simply slow things down.
True, but only a little bit. Please look at the numbers in my other posting.
Best regards
Martin
--
Martin Wilck
PRIMERGY System Software Engineer
FSC IP ESP DEV 6
Fujitsu Siemens Computers GmbH
Heinz-Nixdorf-Ring 1
33106 Paderborn
Germany
Tel: ++49 5251 525 2796
Fax: ++49 5251 525 2820
Email: mailto:martin.wilck@...itsu-siemens.com
Internet: http://www.fujitsu-siemens.com
Company Details: http://www.fujitsu-siemens.com/imprint.html
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists