[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <46E7EE89.9060006@katalix.com>
Date: Wed, 12 Sep 2007 14:50:01 +0100
From: James Chapman <jchapman@...alix.com>
To: hadi@...erus.ca, Bill Fink <billfink@...dspring.com>
CC: netdev@...r.kernel.org, davem@...emloft.net, jeff@...zik.org,
mandeep.baines@...il.com, ossthema@...ibm.com,
Stephen Hemminger <shemminger@...l.org>
Subject: Re: RFC: possible NAPI improvements to reduce interrupt rates for
low traffic rates
jamal wrote:
> On Wed, 2007-12-09 at 03:04 -0400, Bill Fink wrote:
>> On Fri, 07 Sep 2007, jamal wrote:
>
>>> I am going to be the devil's advocate[1]:
>> So let me be the angel's advocate. :-)
>
> I think this would make you God's advocate ;->
> (http://en.wikipedia.org/wiki/God%27s_advocate)
>
>> I view his results much more favorably.
>
> The challenge is, under _low traffic_: bad bad CPU use.
> Thats what is at stake, correct?
By low traffic, I assume you mean a rate at which the NAPI driver
doesn't stay in polled mode. The problem is that that rate is getting
higher all the time, as interface and CPU speeds increase. This results
in too many interrupts and NAPI thrashing in/out of polled mode very
quickly.
> Lets bury the stats for a sec ...
Yes please. We need an analysis of what happens to cpu usage, latency,
pps etc when various factors are changed, e.g. input pps, NAPI busy-idle
delay etc. The main purpose of my RFC wasn't to push a patch into the
kernel right now, it was to highlight the issue and to find out if
others were already working on it. The feedback has been good so far. I
just need to find some time to do some testing. :)
> People are bitching about NAPI abusing CPU, is the
> answer to abuse more CPU than NAPI?;->
Jamal, do you have more details? Are people saying NAPI gets too much of
the CPU pie because they profiled it? Are they complaining that system
behavior degrades too much under certain network traffic conditions?
Mouse cursor movement jittery? Real-time apps such as music/video
players starved of CPU? Is it possible they blame NAPI because they see
tangible effects on their system, not because measured CPU usage is
high? I say this because my music/video player and mouse cursor behave
_much_ better with my NAPI changes during general use, despite the
increase in measured cpu load. Even ftp can make my system's mouse
cursor jitter...
> The answer could be "I am not solving that problem anymore" - at least
> thats what James is saying;->
I'm investigating whether the symptoms I describe above can be reduced
or eliminated without resorting to hardware interrupt mitigation.
Specifically, I want to do more testing on the idle polling scheme which
seems to improve system behavior in my tests. This will involve more
than doing a flood ping or two. :)
>> Sometimes there
>> are tradeoffs to be made to be decided by the user based on what's most
>> important to that user and his specific workload. And the suggested
>> ethtool option (defaulting to current behavior) would enable the user
>> to make that decision.
>
> And the challenge is:
> What workload is willing to invest that much cpu for low traffic?
> Can you name one? One that may come close is database benchmarks for
> latency - but those folks wouldnt touch this with a mile-long pole if
> you told them their cpu use is going to get worse than what NAPI (that
> big bad CPU hog under low traffic) is giving them.
I agree with both of you. But we need more test results first to know
whether it will be useful to offer NAPI idle polling as an _option_.
--
James Chapman
Katalix Systems Ltd
http://www.katalix.com
Catalysts for your Embedded Linux software development
-
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists