[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <1174522152.2713.73.camel@localhost.localdomain>
Date: Thu, 22 Mar 2007 11:09:12 +1100
From: Rusty Russell <rusty@...tcorp.com.au>
To: "Eric W. Biederman" <ebiederm@...ssion.com>
Cc: Andrew Morton <akpm@...ux-foundation.org>,
Ingo Molnar <mingo@...e.hu>, Andi Kleen <andi@...stfloor.org>,
lkml - Kernel Mailing List <linux-kernel@...r.kernel.org>
Subject: Re: [PATCH] Allow per-cpu variables to be page-aligned
On Wed, 2007-03-21 at 10:49 -0600, Eric W. Biederman wrote:
> Rusty Russell <rusty@...tcorp.com.au> writes:
>
> > On Wed, 2007-03-21 at 03:21 -0600, Eric W. Biederman wrote:
> >> Do we really want to allow modules to be able to allocate page sized
> >> per cpu memory.
> >
> > Hi Eric!
> >
> > They always could, of course, they just wouldn't get correct alignment.
> > I think the principle of least surprise says that if we support this, it
> > will also work in modules...
>
> The module load would fail.
Hi again Eric,
Unfortunately not. It probably should, though: people ignore printks.
I was probably thinking that large alignment constraints were only for
performance when I wrote this code, but a page-aligned requirement for
hypervisors changes that.
> > Looking at the module per-cpu code again, the rounding up of the memory
> > used by the kernel seems unnecessary though. I'll try ripping that
> > out...
>
> I want to say that when dealing with cpu stuff aligning to a cache
> line makes sense as it prevents multiple variables from sharing
> the same cache line. However we rarely access per cpu variables from
> other cpus (the point) so the extra alignment doesn't seem to have
> a justification in this context.
Um, yes, always good to remember. I wrote the per-cpu infrastructure,
and I haven't forgotten 8)
> Although I'm not quite certain what this will do to the per cpu
> memory allocator...
It should Just Work. My only hesitation is that I obviously thought
different when I wrote this code, so am I smarter now, or then?
> After increasing NR_IRQS on x86_64 to (NR_CPUS*32) the per cpu irq
> stats got much bigger especially as NR_CPUS went up. The only
> reasonable way I could see to fix this at the time was to just make
> PER_CPU_ENOUGH_ROOM do the right thing and change size dynamically
> with the size of the per cpu section. I added PERCPU_MODULE_RESERVE
> to allocate the amount that we did not have compile information on.
> 8K was roughly what we had left over for modules before I made the
> change so I just preserved that.
This makes a lot of sense. A fixed constant seemed sensible at the
time, but now we know that the majority of per-cpu vars are in code
which can never be a module. Reasons are obvious, and seem unlikely to
change.
> > It means the x86 cpu_pda initialization would have to be done in
> > smp_prepare_boot_cpu tho...
>
> Well that is earlier than trap_init so it shouldn't be a problem...
But it doesn't get called on UP. Don't know if that matters, but it
wasn't immediately obvious.
Thanks,
Rusty.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists