[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <mafs0wm4yluej.fsf@kernel.org>
Date: Mon, 13 Oct 2025 16:59:32 +0200
From: Pratyush Yadav <pratyush@...nel.org>
To: Breno Leitao <leitao@...ian.org>
Cc: Changyuan Lyu <changyuanl@...gle.com>, rppt@...nel.org,
akpm@...ux-foundation.org, linux-kernel@...r.kernel.org,
anthony.yznaga@...cle.com, arnd@...db.de, ashish.kalra@....com,
benh@...nel.crashing.org, bp@...en8.de, catalin.marinas@....com,
corbet@....net, dave.hansen@...ux.intel.com,
devicetree@...r.kernel.org, dwmw2@...radead.org, ebiederm@...ssion.com,
graf@...zon.com, hpa@...or.com, jgowans@...zon.com,
kexec@...ts.infradead.org, krzk@...nel.org,
linux-arm-kernel@...ts.infradead.org, linux-doc@...r.kernel.org,
linux-mm@...ck.org, luto@...nel.org, mark.rutland@....com,
mingo@...hat.com, pasha.tatashin@...een.com, pbonzini@...hat.com,
peterz@...radead.org, robh@...nel.org, rostedt@...dmis.org,
saravanak@...gle.com, skinsburskii@...ux.microsoft.com,
tglx@...utronix.de, thomas.lendacky@....com, will@...nel.org,
x86@...nel.org
Subject: Re: [PATCH v8 01/17] memblock: add MEMBLOCK_RSRV_KERN flag
On Fri, Oct 10 2025, Breno Leitao wrote:
> Hello Chanyuan, Mike,
>
> On Fri, May 09, 2025 at 12:46:19AM -0700, Changyuan Lyu wrote:
>> --- a/mm/memblock.c
>> +++ b/mm/memblock.c
>> @@ -492,7 +492,7 @@ static int __init_memblock memblock_double_array(struct memblock_type *type,
>> * needn't do it
>> */
>> if (!use_slab)
>> - BUG_ON(memblock_reserve(addr, new_alloc_size));
>> + BUG_ON(memblock_reserve_kern(addr, new_alloc_size));
>>
>> /* Update slab flag */
>> *in_slab = use_slab;
>> @@ -642,7 +642,7 @@ static int __init_memblock memblock_add_range(struct memblock_type *type,
>> #ifdef CONFIG_NUMA
>> WARN_ON(nid != memblock_get_region_node(rgn));
>> #endif
>> - WARN_ON(flags != rgn->flags);
>> + WARN_ON(flags != MEMBLOCK_NONE && flags != rgn->flags);
>
> I am hitting some sporadic warnings at early boot on a production kernel
> (6.16). Unfortunately this issue is not easily reproduce for me to test on
> upstream.
>
> 09:14:44 BIOS-provided physical RAM map:
> 09:14:44 BIOS-e820: [mem 0x0000000000000000-0x000000000009ffff] usable
> 09:14:44 BIOS-e820: [mem 0x00000000000a0000-0x00000000000fffff] reserved
> 09:14:44 BIOS-e820: [mem 0x0000000000100000-0x0000000064cb7fff] usable
> 09:14:44 BIOS-e820: [mem 0x0000000064cb8000-0x0000000064dc3fff] reserved
> 09:14:44 BIOS-e820: [mem 0x0000000064dc4000-0x0000000065b13fff] usable
> 09:14:44 BIOS-e820: [mem 0x0000000065b14000-0x0000000065b61fff] reserved
> 09:14:44 BIOS-e820: [mem 0x0000000065b62000-0x0000000065ed0fff] usable
> 09:14:44 BIOS-e820: [mem 0x0000000065ed1000-0x0000000065f2bfff] reserved
> 09:14:44 BIOS-e820: [mem 0x0000000065f2c000-0x0000000066621fff] usable
> 09:14:44 BIOS-e820: [mem 0x0000000066622000-0x0000000066630fff] reserved
> 09:14:44 BIOS-e820: [mem 0x0000000066631000-0x0000000068107fff] usable
> 09:14:44 BIOS-e820: [mem 0x0000000068108000-0x000000006819dfff] ACPI data
> 09:14:44 BIOS-e820: [mem 0x000000006819e000-0x000000006a48cfff] usable
> 09:14:44 BIOS-e820: [mem 0x000000006a48d000-0x000000006c58cfff] reserved
> 09:14:44 BIOS-e820: [mem 0x000000006c58d000-0x000000006c5dcfff] ACPI data
> 09:14:44 BIOS-e820: [mem 0x000000006c5dd000-0x000000006cfdcfff] ACPI NVS
> 09:14:44 BIOS-e820: [mem 0x000000006cfdd000-0x000000006e9fcfff] reserved
> 09:14:44 BIOS-e820: [mem 0x000000006e9fd000-0x000000006fffffff] usable
> 09:14:44 BIOS-e820: [mem 0x0000000070000000-0x000000008fffffff] reserved
> 09:14:44 BIOS-e820: [mem 0x00000000fd000000-0x00000000fe7fffff] reserved
> 09:14:44 BIOS-e820: [mem 0x00000000fed20000-0x00000000fed44fff] reserved
> 09:14:44 BIOS-e820: [mem 0x00000000ff000000-0x00000000ffffffff] reserved
> 09:14:44 BIOS-e820: [mem 0x0000000100000000-0x000000107fff655f] usable
> 09:14:44 BIOS-e820: [mem 0x000000107fff6560-0x000000107fff656f] type 128
> 09:14:44 BIOS-e820: [mem 0x000000107fff6570-0x000000107fffffff] usable
> 09:14:44 random: crng init done
> 09:14:44 ------------[ cut here ]------------
> 09:14:44 WARNING: CPU: 0 PID: 0 at mm/memblock.c:668 memblock_add_range (mm/memblock.c:668)
> 09:14:44 Modules linked in:
> 09:14:44 Tainted: [S]=CPU_OUT_OF_SPEC
> 09:14:44 RIP: 0010:memblock_add_range (mm/memblock.c:668)
> 09:14:44 Code: 28 80 3c 01 00 0f 84 04 fd ff ff 4c 89 ef e8 6c 77 09 00 e9 f7 fc ff ff 0f 0b 83 7c 24 1c 00 0f 85 9c fd ff ff e9 c5 fd ff ff <0f> 0b e9 be fd ff ff 44 89 f1 80 e1 07 80 c1 03 38 c1 0f 8c 6b fd
> All code
> ========
> 0: 28 80 3c 01 00 0f sub %al,0xf00013c(%rax)
> 6: 84 04 fd ff ff 4c 89 test %al,-0x76b30001(,%rdi,8)
> d: ef out %eax,(%dx)
> e: e8 6c 77 09 00 call 0x9777f
> 13: e9 f7 fc ff ff jmp 0xfffffffffffffd0f
> 18: 0f 0b ud2
> 1a: 83 7c 24 1c 00 cmpl $0x0,0x1c(%rsp)
> 1f: 0f 85 9c fd ff ff jne 0xfffffffffffffdc1
> 25: e9 c5 fd ff ff jmp 0xfffffffffffffdef
> 2a:* 0f 0b ud2 <-- trapping instruction
> 2c: e9 be fd ff ff jmp 0xfffffffffffffdef
> 31: 44 89 f1 mov %r14d,%ecx
> 34: 80 e1 07 and $0x7,%cl
> 37: 80 c1 03 add $0x3,%cl
> 3a: 38 c1 cmp %al,%cl
> 3c: 0f .byte 0xf
> 3d: 8c 6b fd mov %gs,-0x3(%rbx)
>
> Code starting with the faulting instruction
> ===========================================
> 0: 0f 0b ud2
> 2: e9 be fd ff ff jmp 0xfffffffffffffdc5
> 7: 44 89 f1 mov %r14d,%ecx
> a: 80 e1 07 and $0x7,%cl
> d: 80 c1 03 add $0x3,%cl
> 10: 38 c1 cmp %al,%cl
> 12: 0f .byte 0xf
> 13: 8c 6b fd mov %gs,-0x3(%rbx)
> 09:14:44 RSP: 0000:ffffffff85e07d48 EFLAGS: 00010083 ORIG_RAX: 0000000000000000
> 09:14:44 RAX: 0000000000000020 RBX: 0000000000001c00 RCX: dffffc0000000000
> 09:14:44 RDX: 000000000009f000 RSI: 000000000009d000 RDI: ffffffff8685ebf8
> 09:14:44 RBP: 0000000000000002 R08: 0000000000000020 R09: 0000000000000000
> 09:14:44 R10: ffffffffff200570 R11: fffffbffffe400b2 R12: 000000000009d000
> 09:14:44 R13: 0000000000100000 R14: ffffffff8edf5ce4 R15: ffffffff8edf5ce0
> 09:14:44 FS: 0000000000000000(0000) GS:0000000000000000(0000) knlGS:0000000000000000
> 09:14:44 CR2: ffff888059e2dff8 CR3: 000000005bc1d000 CR4: 00000000000000b0
> 09:14:44 Call Trace:
> 09:14:44 <TASK>
> 09:14:44 ? __memblock_reserve (mm/memblock.c:936)
> 09:14:44 ? add_early_ima_buffer (arch/x86/kernel/setup.c:413)
> 09:14:44 ? parse_setup_data (arch/x86/kernel/setup.c:500)
> 09:14:44 ? setup_arch (arch/x86/kernel/setup.c:245 arch/x86/kernel/setup.c:958)
> 09:14:44 ? start_kernel (init/main.c:922)
> 09:14:44 ? x86_64_start_reservations (arch/x86/kernel/ebda.c:57)
> 09:14:44 ? x86_64_start_kernel (arch/x86/kernel/head64.c:231)
> 09:14:44 ? common_startup_64 (arch/x86/kernel/head_64.S:419)
> 09:14:44 </TASK>
> ....
> Memory: 49640988K/66772816K available (54946K kernel code, 19058K rwdata, 22636K rodata, 2940K init, 120968K bss, 10650188K reserved, 6291456K cma-reserved)
>
> So, there is a memory override, and I am curious about it. Do you think it
Yeah, it seems IMA is reserving a region that overlaps a region reserved
by something else that doesn't use memblock_reserve_kern().
> would be useful to expand this warning to dump more information about the
> issue? (only compiled tested)
>
> if (flags != MEMBLOCK_NONE && flags != rgn->flags) {
> pr_warn("memblock: Flag mismatch at region [%pa-%pa]\n",
> &rgn->base, &rend);
> pr_warn(" Existing region flags: %#x\n", rgn->flags);
> pr_warn(" New range flags: %#x\n", flags);
> pr_warn(" New range: [%pa-%pa]\n", &base, &end);
> WARN_ON_ONCE(1);
> }
I suppose this would be useful. I think enabling memblock debug prints
would also be helpful (using the "memblock=debug" commandline parameter)
if it doesn't impact your production environment too much.
--
Regards,
Pratyush Yadav
Powered by blists - more mailing lists