[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <8c987a26-d499-4ceb-905e-750bb24fcb08@quicinc.com>
Date: Thu, 31 Oct 2024 14:05:58 -0700
From: Oreoluwa Babatunde <quic_obabatun@...cinc.com>
To: Geert Uytterhoeven <geert@...ux-m68k.org>
CC: <robh@...nel.org>, <aisheng.dong@....com>, <andy@...ck.fi.intel.com>,
<catalin.marinas@....com>, <devicetree@...r.kernel.org>, <hch@....de>,
<iommu@...ts.linux.dev>, <kernel@...cinc.com>, <klarasmodin@...il.com>,
<linux-kernel@...r.kernel.org>, <m.szyprowski@...sung.com>,
<quic_ninanaik@...cinc.com>, <robin.murphy@....com>,
<saravanak@...gle.com>, <will@...nel.org>
Subject: Re: [PATCH v10 1/2] of: reserved_mem: Restruture how the reserved
memory regions are processed
On 10/29/2024 5:41 AM, Geert Uytterhoeven wrote:
> Hi Oreoluwa,
>
> On Wed, Oct 9, 2024 at 12:08 AM Oreoluwa Babatunde
> <quic_obabatun@...cinc.com> wrote:
>> Reserved memory regions defined in the devicetree can be broken up into
>> two groups:
>> i) Statically-placed reserved memory regions
>> i.e. regions defined with a static start address and size using the
>> "reg" property.
>> ii) Dynamically-placed reserved memory regions.
>> i.e. regions defined by specifying an address range where they can be
>> placed in memory using the "alloc_ranges" and "size" properties.
>>
>> These regions are processed and set aside at boot time.
>> This is done in two stages as seen below:
>>
>> Stage 1:
>> At this stage, fdt_scan_reserved_mem() scans through the child nodes of
>> the reserved_memory node using the flattened devicetree and does the
>> following:
>>
>> 1) If the node represents a statically-placed reserved memory region,
>> i.e. if it is defined using the "reg" property:
>> - Call memblock_reserve() or memblock_mark_nomap() as needed.
>> - Add the information for that region into the reserved_mem array
>> using fdt_reserved_mem_save_node().
>> i.e. fdt_reserved_mem_save_node(node, name, base, size).
>>
>> 2) If the node represents a dynamically-placed reserved memory region,
>> i.e. if it is defined using "alloc-ranges" and "size" properties:
>> - Add the information for that region to the reserved_mem array with
>> the starting address and size set to 0.
>> i.e. fdt_reserved_mem_save_node(node, name, 0, 0).
>> Note: This region is saved to the array with a starting address of 0
>> because a starting address is not yet allocated for it.
>>
>> Stage 2:
>> After iterating through all the reserved memory nodes and storing their
>> relevant information in the reserved_mem array,fdt_init_reserved_mem() is
>> called and does the following:
>>
>> 1) For statically-placed reserved memory regions:
>> - Call the region specific init function using
>> __reserved_mem_init_node().
>> 2) For dynamically-placed reserved memory regions:
>> - Call __reserved_mem_alloc_size() which is used to allocate memory
>> for each of these regions, and mark them as nomap if they have the
>> nomap property specified in the DT.
>> - Call the region specific init function.
>>
>> The current size of the resvered_mem array is 64 as is defined by
>> MAX_RESERVED_REGIONS. This means that there is a limitation of 64 for
>> how many reserved memory regions can be specified on a system.
>> As systems continue to grow more and more complex, the number of
>> reserved memory regions needed are also growing and are starting to hit
>> this 64 count limit, hence the need to make the reserved_mem array
>> dynamically sized (i.e. dynamically allocating memory for the
>> reserved_mem array using membock_alloc_*).
>>
>> On architectures such as arm64, memory allocated using memblock is
>> writable only after the page tables have been setup. This means that if
>> the reserved_mem array is going to be dynamically allocated, it needs to
>> happen after the page tables have been setup, not before.
>>
>> Since the reserved memory regions are currently being processed and
>> added to the array before the page tables are setup, there is a need to
>> change the order in which some of the processing is done to allow for
>> the reserved_mem array to be dynamically sized.
>>
>> It is possible to process the statically-placed reserved memory regions
>> without needing to store them in the reserved_mem array until after the
>> page tables have been setup because all the information stored in the
>> array is readily available in the devicetree and can be referenced at
>> any time.
>> Dynamically-placed reserved memory regions on the other hand get
>> assigned a start address only at runtime, and hence need a place to be
>> stored once they are allocated since there is no other referrence to the
>> start address for these regions.
>>
>> Hence this patch changes the processing order of the reserved memory
>> regions in the following ways:
>>
>> Step 1:
>> fdt_scan_reserved_mem() scans through the child nodes of
>> the reserved_memory node using the flattened devicetree and does the
>> following:
>>
>> 1) If the node represents a statically-placed reserved memory region,
>> i.e. if it is defined using the "reg" property:
>> - Call memblock_reserve() or memblock_mark_nomap() as needed.
>>
>> 2) If the node represents a dynamically-placed reserved memory region,
>> i.e. if it is defined using "alloc-ranges" and "size" properties:
>> - Call __reserved_mem_alloc_size() which will:
>> i) Allocate memory for the reserved region and call
>> memblock_mark_nomap() as needed.
>> ii) Call the region specific initialization function using
>> fdt_init_reserved_mem_node().
>> iii) Save the region information in the reserved_mem array using
>> fdt_reserved_mem_save_node().
>>
>> Step 2:
>> 1) This stage of the reserved memory processing is now only used to add
>> the statically-placed reserved memory regions into the reserved_mem
>> array using fdt_scan_reserved_mem_reg_nodes(), as well as call their
>> region specific initialization functions.
>>
>> 2) This step has also been moved to be after the page tables are
>> setup. Moving this will allow us to replace the reserved_mem
>> array with a dynamically sized array before storing the rest of
>> these regions.
>>
>> Signed-off-by: Oreoluwa Babatunde <quic_obabatun@...cinc.com>
> Thanks for your patch, which is now commit 8a6e02d0c00e7b62
> ("of: reserved_mem: Restructure how the reserved memory regions
> are processed") in dt-rh/for-next.
>
> I have bisected a boot issue on RZ/Five to this commit.
> With "earlycon keep_bootcon" (else there is no output):
>
> Oops - store (or AMO) access fault [#1]
> CPU: 0 UID: 0 PID: 1 Comm: swapper Not tainted
> 6.12.0-rc1-00015-g8a6e02d0c00e #201
> Hardware name: Renesas SMARC EVK based on r9a07g043f01 (DT)
> epc : __memset+0x60/0x100
> ra : __dma_alloc_from_coherent+0x150/0x17a
> epc : ffffffff8062d2bc ra : ffffffff80053a94 sp : ffffffc60000ba20
> gp : ffffffff812e9938 tp : ffffffd601920000 t0 : ffffffc6000d0000
> t1 : 0000000000000000 t2 : ffffffffe9600000 s0 : ffffffc60000baa0
> s1 : ffffffc6000d0000 a0 : ffffffc6000d0000 a1 : 0000000000000000
> a2 : 0000000000001000 a3 : ffffffc6000d1000 a4 : 0000000000000000
> a5 : 0000000000000000 a6 : ffffffd601adacc0 a7 : ffffffd601a841a8
> s2 : ffffffd6018573c0 s3 : 0000000000001000 s4 : ffffffd6019541e0
> s5 : 0000000200000022 s6 : ffffffd6018f8410 s7 : ffffffd6018573e8
> s8 : 0000000000000001 s9 : 0000000000000001 s10: 0000000000000010
> s11: 0000000000000000 t3 : 0000000000000000 t4 : ffffffffdefe62d1
> t5 : 000000001cd6a3a9 t6 : ffffffd601b2aad6
> status: 0000000200000120 badaddr: ffffffc6000d0000 cause: 0000000000000007
> [<ffffffff8062d2bc>] __memset+0x60/0x100
> [<ffffffff80053e1a>] dma_alloc_from_global_coherent+0x1c/0x28
> [<ffffffff80053056>] dma_direct_alloc+0x98/0x112
> [<ffffffff8005238c>] dma_alloc_attrs+0x78/0x86
> [<ffffffff8035fdb4>] rz_dmac_probe+0x3f6/0x50a
> [<ffffffff803a0694>] platform_probe+0x4c/0x8a
> [<ffffffff8039ea16>] really_probe+0xe4/0x1c8
> [<ffffffff8039ebc4>] __driver_probe_device+0xca/0xce
> [<ffffffff8039ec48>] driver_probe_device+0x34/0x92
> [<ffffffff8039ede8>] __driver_attach+0xb4/0xbe
> [<ffffffff8039ce58>] bus_for_each_dev+0x60/0xa0
> [<ffffffff8039e26a>] driver_attach+0x1a/0x22
> [<ffffffff8039dc20>] bus_add_driver+0xa4/0x184
> [<ffffffff8039f65c>] driver_register+0x8a/0xb4
> [<ffffffff803a051c>] __platform_driver_register+0x1c/0x24
> [<ffffffff808202f6>] rz_dmac_driver_init+0x1a/0x22
> [<ffffffff80800ef6>] do_one_initcall+0x64/0x134
> [<ffffffff8080122e>] kernel_init_freeable+0x200/0x202
> [<ffffffff80638126>] kernel_init+0x1e/0x10a
> [<ffffffff8063d58e>] ret_from_fork+0xe/0x18
> Code: 1007 82b3 40e2 0797 0000 8793 00e7 8305 97ba 8782 (b023) 00b2
> ---[ end trace 0000000000000000 ]---
> Kernel panic - not syncing: Fatal exception in interrupt
> ---[ end Kernel panic - not syncing: Fatal exception in interrupt ]---
>
> Nothing really stands out in the kernel log, except for a delayed
> initialization of the reserved mem nodes (they are the same
> before/after):
>
> printk: debug: ignoring loglevel setting.
> -OF: reserved mem: 0x0000000000030000..0x000000000003ffff (64 KiB)
> nomap non-reusable mmode_resv0@...00
> -OF: reserved mem: 0x0000000000040000..0x000000000004ffff (64 KiB)
> nomap non-reusable mmode_resv1@...00
> -OF: reserved mem: 0x0000000044000000..0x000000004403ffff (256 KiB)
> nomap non-reusable mmode_resv3@...00000
> -OF: reserved mem: 0x0000000044040000..0x000000004405ffff (128 KiB)
> nomap non-reusable mmode_resv2@...40000
> +earlycon: scif0 at MMIO 0x000000001004b800 (options '115200n8')
> +printk: legacy bootconsole [scif0] enabled
> +printk: debug: skip boot console de-registration.
> Reserved memory: created DMA memory pool at 0x0000000058000000, size 128 MiB
> OF: reserved mem: initialized node pma_resv0@...00000, compatible id
> shared-dma-pool
> OF: reserved mem: 0x0000000058000000..0x000000005fffffff (131072 KiB)
> nomap non-reusable pma_resv0@...00000
> +OF: reserved mem: 0x0000000000030000..0x000000000003ffff (64 KiB)
> nomap non-reusable mmode_resv0@...00
> +OF: reserved mem: 0x0000000000040000..0x000000000004ffff (64 KiB)
> nomap non-reusable mmode_resv1@...00
> +OF: reserved mem: 0x0000000044040000..0x000000004405ffff (128 KiB)
> nomap non-reusable mmode_resv2@...40000
> +OF: reserved mem: 0x0000000044000000..0x000000004403ffff (256 KiB)
> nomap non-reusable mmode_resv3@...00000
> Zone ranges:
> DMA32 [mem 0x0000000048000000-0x000000007fffffff]
> Normal empty
>
> Reverting commits 00c9a452a235c61f ("of: reserved_mem: Add code to
> dynamically allocate reserved_mem array") and 8a6e02d0c00e7b62 fixes
> the issue.
>
> root@...rc-rzfive:/sys/firmware/devicetree/base/reserved-memory# ls -l
> total 0
> -r--r--r-- 1 root root 4 Oct 29 12:37 #address-cells
> -r--r--r-- 1 root root 4 Oct 29 12:37 #size-cells
> drwxr-xr-x 2 root root 0 Oct 29 12:37 mmode_resv0@...00
> drwxr-xr-x 2 root root 0 Oct 29 12:37 mmode_resv1@...00
> drwxr-xr-x 2 root root 0 Oct 29 12:37 mmode_resv2@...40000
> drwxr-xr-x 2 root root 0 Oct 29 12:37 mmode_resv3@...00000
> -r--r--r-- 1 root root 16 Oct 29 12:37 name
> drwxr-xr-x 2 root root 0 Oct 29 12:37 pma_resv0@...00000
> -r--r--r-- 1 root root 0 Oct 29 12:37 ranges
Hi Geert,
Thanks for reaching out and sorry you're seeing this issue.
Please can you provide reproduction steps? I tried booting up
risc-v arch with qemu but did not run into the issue you are
seeing.
Regards,
Oreoluwa
Powered by blists - more mailing lists