lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <C38C154E-3D45-4EB4-AC41-99E7CD534CC6@gmail.com>
Date: Wed, 18 Jun 2025 16:32:31 +0530
From: Abhigyan ghosh <zscript.team.zs@...il.com>
To: Marco Bonelli <marco@...eim.net>
CC: linux-kernel@...r.kernel.org
Subject: Re: RISC-V 32-bit debug builds reaching breaking point: too many symbols

Hi Marco,

Thanks for this incredibly detailed analysis — those stats are eye-opening.

I had a quick question as I'm exploring ELF internals and kernel compilation workflows, especially in the RISC-V context:

Would using split debug info earlier in the build (e.g. extracting `.debug_*` before `vmlinux.o` grows too large) help mitigate the symbol table overflow — or is the linker still forced to carry them through regardless due to reloc dependencies?

Also, does this behavior differ between GNU ld and LLD in your tests?

Appreciate the time you've taken to compile all this — the plots and breakdown by section/reloc type really help visualize the scale.

Best regards,  
Abhigyan Ghosh  
@zscript.team.zs (zsml.zscript.org)

On 18 June 2025 4:25:08 pm IST, Marco Bonelli <marco@...eim.net> wrote:
>RISC-V debug builds generate *millions* of symbols in vmlinux.o, and this number
>is now getting beyond the limit of the maximum symbol table index that can be
>represented by 32-bit Rela relocations. The ELF32_R_SYM portion of Elf32_Rela
>r_info is only 24 bits, therefore symtab indexes larger than 16777215 overflow
>and cause bogus Rela entries pointing to wrong symbols, ultimately breaking the
>build.
>
>I recently noticed this [1] when "MODPOST vmlinux.symvers" failed with thousands
>of errors and warnings on a particular build of mine for v6.15 RISC-V 32-bit.
>
>The majority (99%) of the symbols are for local temporary labels (.Lxxx) that
>would normally be stripped by default at link time, but seems they cannot be
>stripped as they are referenced by Rela relocations. The number of such symbols
>has always been huge from the very first RISC-V Linux version (v4.15) and has
>been steadily growing since (I plotted a bar chart for v6.5-v6.15 here [3]).
>
>For reference, on v4.15 a simple defconfig + debug build with GCC 11.1 produces
>a vmlinux.o with around 7 million such symbols. On v6.15 with GCC 14.2 it gets
>to around 15 million. We are close to the 16.8M limit, and already exceeding it
>on some configurations (the discussion in [1] is an example).
>
>What can be done to reduce them down to an acceptable number, or even better get
>rid of them entirely? These local temporary symbols referenced by relocations
>seem to be ubiquitous, so I suppose this is some RISC-V ELF ABI design choice.
>Could those Rela relocs just avoid referencing any symbol? Or could Rel/Relr be
>used instead?
>
>Otherwise, if those really need to be kept as is, perhaps splitting debug info
>out of vmlinux.o before linking into final vmlinux (when those are finally
>stripped) could also be a viable solution, though the debug info would have to
>be split into multiple files or the same problem would arise.
>
>Thoughts?
>
>Here are some stats from a custom script I hacked together (can provide source
>if needed).
>
>	# Clean v6.15 tree
>	export PATH=/path/to/gcc-14.2.0-nolibc/riscv32-linux/bin:$PATH
>	export ARCH=riscv CROSS_COMPILE=riscv32-linux-
>
>	make defconfig
>	make 32-bit.config
>	./scripts/config -e DEBUG_INFO_DWARF_TOOLCHAIN_DEFAULT
>	make olddefconfig
>	make -j vmlinux
>
>Stats:
>
>	Total symbols: 15138556
>	Temporary local symbols: 14972044
>	Referenced by relocations: 14971527
>	By section:
>		.rela.debug_info                  8783573
>		.rela.debug_line                  3649412
>		.rela.debug_ranges                1110915
>		.rela.debug_loc                   972349
>		.rela.text                        398789
>		.rela.debug_frame                 283422
>		.rela.rodata                      17182
>		.rela.init.text                   15732
>		.rela__bug_table                  11163
>		.rela__jump_table                 10830
>		.rela.debug_aranges               10253
>		.rela.alternative                 4510
>		.rela.data                        3504
>		.rela.text.unlikely               2650
>		.rela__ex_table                   2044
>		.rela.sched.text                  1405
>		.rela.init.rodata                 541
>		.rela.exit.text                   538
>		.rela.ref.text                    344
>		.rela.init.data                   321
>		.rela.noinstr.text                234
>		.rela.spinlock.text               194
>		.rela.data..ro_after_init         98
>		.rela.cpuidle.text                36
>		.rela.srodata                     30
>		.rela__modver                     27
>		.rela.head.text                   21
>		.rela.sdata                       15
>		.rela.irqentry.text               12
>		.relaruntime_shift_d_hash_shift   10
>		.relaruntime_ptr_dentry_hashtable 10
>		.rela.lsm_info.init               3
>		.rela.data..percpu                1
>	By relocation kind:
>		R_RISCV_32           9665623
>		R_RISCV_SUB16        3652043
>		R_RISCV_ADD16        3647245
>		R_RISCV_ADD32        1174187
>		R_RISCV_SUB32        345881
>		R_RISCV_BRANCH       168074
>		R_RISCV_RVC_BRANCH   104031
>		R_RISCV_SET6         87790
>		R_RISCV_SUB6         87790
>		R_RISCV_PCREL_LO12_I 81819
>		R_RISCV_RVC_JUMP     76539
>		R_RISCV_SET8         29260
>		R_RISCV_SUB8         29260
>		R_RISCV_PCREL_HI20   23416
>		R_RISCV_SET16        4798
>		R_RISCV_JAL          4676
>		R_RISCV_PCREL_LO12_S 1410
>		R_RISCV_CALL_PLT     1
>
>[1]: https://lore.kernel.org/lkml/960240908.630790.1748641210849@privateemail.com/
>[2]: https://x.com/mebeim/status/1934950596693635410
>
>--
>Marco Bonelli
>

aghosh

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ