[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20080110093126.GA360@elte.hu>
Date: Thu, 10 Jan 2008 10:31:26 +0100
From: Ingo Molnar <mingo@...e.hu>
To: Andi Kleen <ak@...e.de>
Cc: linux-kernel@...r.kernel.org, Thomas Gleixner <tglx@...utronix.de>,
"H. Peter Anvin" <hpa@...or.com>,
Venki Pallipadi <venkatesh.pallipadi@...el.com>,
suresh.b.siddha@...el.com, Arjan van de Ven <arjan@...radead.org>,
Dave Jones <davej@...hat.com>
Subject: Re: CPA patchset
Andi,
finally managed to get the time to review your CPA patchset, and i
fundamentally agree with most of the detail changes done in it. But here
are a few structural high-level observations:
- firstly, there's no rationale given. So we'll change ioremap()/etc.
from doing a cflush-range instruction instead of a WBINVD. But why?
WBINVD isnt particular fast (takes a few msecs), but why is that a
problem? Drivers dont do high-frequency ioremap-ing. It's typically
only done at driver/device startup and that's it. Whether module load
time takes 1254 msecs instead of 1250 msecs is no big deal.
- secondly, obviously doing a 'flush some' instead of a 'flush all'
operation is risky. There's not many ways we can get the 'flush all'
WBINVD instruction wrong (as long as we do it on all cpus, etc.). But
with this specific range flush we've got all the risks of accidentally
not flushing _enough_. Especially if some boundary of a mapped area is
imprecisely. Accidentally leaving aliases around in the cache is
asking for trouble as it's very hard to debug and the operations here
(ioremap) are typically done only once per system bootup. So we'll add
_fundamental_ fragility to an already historically fragile and
bug-ridden piece of code. That does not sound too smart to me and you
do not analyze these concerns in your submission.
- the bugfixes and cleanups to pgattr.c i like very much - but shouldnt
they come first in the series, instead of being mixed into it?
so i'd suggest for you to reshape this ontop of the PAT patchset done by
Venki and Suresh, with the cflush stuff kept at the very end of the
series, and even that should be boot and .config configurable, with
default off - at least initially. The cflush performance advantages seem
dubious at best, and they bring in very real regression dangers.
Ingo
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists