[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <471E7B1D.4050107@zytor.com>
Date: Tue, 23 Oct 2007 15:52:13 -0700
From: "H. Peter Anvin" <hpa@...or.com>
To: Jeremy Fitzhardinge <jeremy@...p.org>
CC: "Huang, Ying" <ying.huang@...el.com>, Andi Kleen <ak@...e.de>,
"Eric W. Biederman" <ebiederm@...ssion.com>,
akpm@...ux-foundation.org,
Linus Torvalds <torvalds@...ux-foundation.org>,
linux-kernel@...r.kernel.org
Subject: Re: [PATCH -v7 1/3] x86 boot: setup data
Jeremy Fitzhardinge wrote:
>
> As a general comment, I can't say I'm thrilled about sticking the copied
> setup data at the end of the initial pagetables. This is already a
> fairly complex area, and changing it touches a surprisingly large number
> of places. I wonder if there isn't a better way to deal with this
> (possibly by fixing up the generation of the initial pagetables in the
> process).
>
> Or more simply, define a max size for this data, and copy it into an
> initdata area (which, being initdata, can be quite large without wasting
> lots of memory).
>
The initdata solution is fairly sensible. It's easy, and this data
really isn't expected to grow very fast at all. Unfortunately we don't
*have* an initdata bss section at the moment, so a large buffer would
bloat the kernel binary -- at least the uncompressed one. One school of
thought says that this should be fixed :) [*]
However, appending to bss (like the pagetables already are) *should* be
a reasonable option; what really should be done there is instead of
replacing init_pg_tables_end with setup_data_end we should except
additional dynamic data items here, and have an initial_data_end
variable for the extreme end of any initial data no matter the source.
It still does touch a lot of places, but at least that way they will be
fixed once and for all. It's somewhat questionable if it isn't easier
for this particular job to just fix the initial bss issue.
Furthermore, on looking through the code again, I see a bunch of
"init_pg_tables_end + setup_data_len" which really is ugly.
>> +
>> +/* extensible setup data list node */
>> +struct setup_data {
>> + u64 next;
>> + u32 type;
>> + u32 len;
>> + u8 data[0];
>> +};
>>
>
> What are the alignment rules for this structure? Is it always 64-bit
> aligned? What about the relationship of len and data?
>
It's x86, so alignment is soft - it presumably *should* be 64-bit
aligned, but nothing break if the boot loader doesn't.
-hpa
[*] A very clean way to do that is to put said section right at the
beginning of the bss, and move __init_end after it:
.bss : AT(ADDR(.bss) - LOAD_OFFSET) {
__bss_start = .; /* BSS */
*(.init.bss)
. = ALIGN(4096);
__init_end = .;
*(.bss.page_aligned)
*(.bss)
. = ALIGN(4);
__bss_stop = .;
_end = . ;
/* This is where the kernel creates the early boot page tables */
. = ALIGN(4096);
pg0 = . ;
}
On x86-64, this entails moving .data_nosave; it probably can be moved to
the same relative position as on i386 (although I have not verified that
that is true.)
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists