lists.openwall.net | lists / announce owl-users owl-dev john-users john-dev passwdqc-users yescrypt popa3d-users / oss-security kernel-hardening musl sabotage tlsify passwords / crypt-dev xvendor / Bugtraq Full-Disclosure linux-kernel linux-netdev linux-ext4 linux-hardening linux-cve-announce PHC | |
Open Source and information security mailing list archives
| ||
|
Date: Tue, 3 Jun 2008 17:18:42 +1000 From: Nick Piggin <nickpiggin@...oo.com.au> To: Paul Mackerras <paulus@...ba.org> Cc: benh@...nel.crashing.org, Russell King <rmk+lkml@....linux.org.uk>, Linus Torvalds <torvalds@...ux-foundation.org>, David Miller <davem@...emloft.net>, linux-arch@...r.kernel.org, scottwood@...escale.com, linuxppc-dev@...abs.org, alan@...rguk.ukuu.org.uk, linux-kernel@...r.kernel.org, tpiepho@...escale.com Subject: Re: MMIO and gcc re-ordering issue On Tuesday 03 June 2008 16:53, Paul Mackerras wrote: > Nick Piggin writes: > > So your readl can pass an earlier cacheable store or earlier writel? > > No. It's quite gross at the moment, it has a sync before the access > (i.e. a full mb()) and a twi; isync sequence after the access that > stalls execution until the data comes back. OK. > > > We don't provide meaningless ones like writel + cacheable store for > > > example. (PCI posting would defeat it anyway). > > > > What do you mean by meaningless? Ordering of writel followed by a > > cacheable store is meaningful eg. for retaining io operations within > > locks. OK, you explicitly have some extra code for spin_unlock, but > > not for bit locks, mutexes, etc. It would make sense to have the > > default operations _very_ strongly ordered IMO, and then move drivers > > to be more relaxed when they are verified. > > It's meaningless in the sense that nothing guarantees that the writel > has actually hit the device, even if we put a full mb() barrier in > between the writel and the cacheable write. That would guarantee that > the writel had got to the PCI host bridge, and couldn't be reordered > with other accesses to the device, but not that the device had > actually seen it. OK, but I think fits OK with our SMP ordering model for cacheable stores: no amount of barriers on CPU0 will guarantee that CPU1 has seen the store, you actually have to observe a causual effect of the store before you can say that. > I don't mind adding code to the mutex unlock to do the same as > spin_unlock, but I really don't want to have *two* sync instructions > for every MMIO write. One is bad enough. So you can't provide iostore/store ordering without another sync after the writel store? I guess the problem with providing exceptions is that it makes it hard for people who absolutely don't know or care about ordering. I don't like having to think about it "hmm, we can allow this type of reordering... oh, unless some silly device does X...". If we come up with a sane set of weakly ordered accessors (including io_*mb()), it will make it easier to go through the important drivers and improve them. We don't have to enforce the the new semantics strictly until then if they'll slow you down too much in the meantime. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@...r.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists