linux-kernel - Re: [perf] fuzzer triggers "BUG: kernel NULL pointer dereference"

Message-ID: <88331bf6-5bfc-476b-a7cb-48596d979fe8@linux.intel.com>
Date: Wed, 30 Jul 2025 09:54:02 +0800
From: "Mi, Dapeng" <dapeng1.mi@...ux.intel.com>
To: Vince Weaver <vincent.weaver@...ne.edu>
Cc: linux-kernel@...r.kernel.org, linux-perf-users@...r.kernel.org,
 "Liang, Kan" <kan.liang@...ux.intel.com>,
 Peter Zijlstra <peterz@...radead.org>, Ingo Molnar <mingo@...hat.com>,
 Arnaldo Carvalho de Melo <acme@...nel.org>,
 Namhyung Kim <namhyung@...nel.org>, Mark Rutland <mark.rutland@....com>,
 Alexander Shishkin <alexander.shishkin@...ux.intel.com>,
 Jiri Olsa <jolsa@...nel.org>, Ian Rogers <irogers@...gle.com>,
 Adrian Hunter <adrian.hunter@...el.com>
Subject: Re: [perf] fuzzer triggers "BUG: kernel NULL pointer dereference"

Thanks for the information. I will look into the issue.

On 7/30/2025 2:06 AM, Vince Weaver wrote:
> On Tue, 29 Jul 2025, Vince Weaver wrote:
>
>> This is on a RaptorLake system.
>>
>> I can reproduce this issue with the perf_fuzzer but it is possibly timing 
>> sensitive and so if I enable fuzzer trace logging to try to make a 
>> reproducible test case it won't trigger anymore.
>>
>> The system locks up extremely hard and so I can't really get the panic 
>> message besides taking a picture of the screen.
>>
>> I can try enabling KASAN to see if that helps get better debug messages.
> I managed to get KASAN to trigger the issue without crashing the system, 
> not sure if this helps at all:
>
> [  115.636383] ==================================================================
> [  115.636432] BUG: KASAN: slab-use-after-free in rb_erase+0xd25/0x1370
> [  115.636467] Read of size 8 at addr ffff8881151dc0b8 by task swapper/5/0
>
> [  115.636508] CPU: 5 UID: 0 PID: 0 Comm: swapper/5 Not tainted 6.16.0+ #13 PREEMPT(voluntary) 
> [  115.636517] Hardware name: Dell Inc. Precision 3660/0VJ7G2, BIOS 2.17.0 08/09/2024
> [  115.636520] Call Trace:
> [  115.636524]  <TASK>
> [  115.636528]  dump_stack_lvl+0x64/0x80
> [  115.636536]  print_report+0xce/0x650
> [  115.636547]  ? rb_erase+0xd25/0x1370
> [  115.636552]  kasan_report+0xce/0x100
> [  115.636559]  ? rb_erase+0xd25/0x1370
> [  115.636565]  rb_erase+0xd25/0x1370
> [  115.636570]  ? __tmigr_cpu_activate+0x13d/0x310
> [  115.636578]  timerqueue_del+0x68/0x120
> [  115.636585]  __remove_hrtimer+0x84/0x200
> [  115.636592]  hrtimer_try_to_cancel+0x19c/0x350
> [  115.636597]  hrtimer_cancel+0x15/0x30
> [  115.636602]  tick_nohz_restart_sched_tick+0x5b/0x210
> [  115.636609]  tick_nohz_idle_exit+0xfc/0x180
> [  115.636615]  do_idle+0x258/0x410
> [  115.636623]  ? __pfx_do_idle+0x10/0x10
> [  115.636631]  ? do_idle+0x2c1/0x410
> [  115.636638]  cpu_startup_entry+0x54/0x60
> [  115.636645]  start_secondary+0x20f/0x290
> [  115.636654]  ? __pfx_start_secondary+0x10/0x10
> [  115.636661]  common_startup_64+0x13e/0x141
> [  115.636671]  </TASK>
>
> [  115.637037] Allocated by task 1091:
> [  115.637054]  kasan_save_stack+0x33/0x60
> [  115.637062]  kasan_save_track+0x14/0x30
> [  115.637069]  __kasan_slab_alloc+0x89/0x90
> [  115.637076]  kmem_cache_alloc_node_noprof+0x136/0x450
> [  115.637086]  perf_event_alloc+0x100/0x41e0
> [  115.637093]  __do_sys_perf_event_open+0x39b/0x1c30
> [  115.637098]  do_syscall_64+0x82/0x2f0
> [  115.637105]  entry_SYSCALL_64_after_hwframe+0x76/0x7e
>
> [  115.637121] Freed by task 0:
> [  115.637136]  kasan_save_stack+0x33/0x60
> [  115.637143]  kasan_save_track+0x14/0x30
> [  115.637149]  kasan_save_free_info+0x3b/0x60
> [  115.637155]  __kasan_slab_free+0x51/0x70
> [  115.637162]  kmem_cache_free+0x2f2/0x540
> [  115.637167]  rcu_do_batch+0x39a/0xe00
> [  115.637174]  rcu_core+0x3f0/0xb10
> [  115.637180]  handle_softirqs+0x1bf/0x5d0
> [  115.637188]  __irq_exit_rcu+0x14e/0x1a0
> [  115.637193]  sysvec_apic_timer_interrupt+0x72/0x90
> [  115.637201]  asm_sysvec_apic_timer_interrupt+0x1a/0x20
>
> [  115.637216] Last potentially related work creation:
> [  115.637235]  kasan_save_stack+0x33/0x60
> [  115.637242]  kasan_record_aux_stack+0xa7/0xc0
> [  115.637248]  __call_rcu_common.constprop.0+0xc8/0x1160
> [  115.637255]  perf_event_release_kernel+0x2f6/0x400
> [  115.637262]  perf_release+0x31/0x50
> [  115.637270]  __fput+0x35b/0xac0
> [  115.637276]  task_work_run+0x11a/0x1f0
> [  115.637282]  do_exit+0x6c6/0x2400
> [  115.637289]  do_group_exit+0xac/0x230
> [  115.637297]  get_signal+0x1cbc/0x1e40
> [  115.637304]  arch_do_signal_or_restart+0x8d/0x5e0
> [  115.637312]  irqentry_exit_to_user_mode+0x12c/0x1f0
> [  115.637319]  asm_sysvec_reschedule_ipi+0x1a/0x20
>
> [  115.637334] The buggy address belongs to the object at ffff8881151dbf40
>                 which belongs to the cache perf_event of size 1344
> [  115.637376] The buggy address is located 376 bytes inside of
>                 freed 1344-byte region [ffff8881151dbf40, ffff8881151dc480)
>
> [  115.637428] The buggy address belongs to the physical page:
> [  115.637450] page: refcount:0 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x1151d8
> [  115.637457] head: order:3 mapcount:0 entire_mapcount:0 nr_pages_mapped:0 pincount:0
> [  115.637461] flags: 0x17ffffc0000040(head|node=0|zone=2|lastcpupid=0x1fffff)
> [  115.637470] page_type: f5(slab)
> [  115.637477] raw: 0017ffffc0000040 ffff88810004b040 dead000000000122 0000000000000000
> [  115.637483] raw: 0000000000000000 0000000000160016 00000000f5000000 0000000000000000
> [  115.637488] head: 0017ffffc0000040 ffff88810004b040 dead000000000122 0000000000000000
> [  115.637492] head: 0000000000000000 0000000000160016 00000000f5000000 0000000000000000
> [  115.637497] head: 0017ffffc0000003 ffffea0004547601 00000000ffffffff 00000000ffffffff
> [  115.637501] head: ffffffffffffffff 0000000000000000 00000000ffffffff 0000000000000008
> [  115.637504] page dumped because: kasan: bad access detected
>
> [  115.637517] Memory state around the buggy address:
> [  115.637537]  ffff8881151dbf80: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
> [  115.637564]  ffff8881151dc000: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
> [  115.637590] >ffff8881151dc080: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
> [  115.637615]                                         ^
> [  115.637636]  ffff8881151dc100: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
> [  115.637661]  ffff8881151dc180: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
> [  115.637687] ==================================================================
> [  115.637712] Disabling lock debugging due to kernel taint
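
As a quick sanity check on the numbers in the KASAN report above, the reported offset and object bounds can be recomputed from the raw addresses. This is only an illustrative sketch: the variable names are made up here, the values are copied from the report, and the only outside assumption is the generic-KASAN granule of 8 bytes per shadow byte (consistent with the 0x80 step between the 16-byte shadow rows in the dump). It does not say which perf_event field lives at that offset; that depends on the kernel build and could be checked with something like `pahole -C perf_event vmlinux` against the matching image.

```python
# Values copied from the KASAN report; names are illustrative only.
access_addr = 0xffff8881151dc0b8   # "Read of size 8 at addr ..."
obj_start   = 0xffff8881151dbf40   # start of the freed perf_event object
obj_end     = 0xffff8881151dc480   # exclusive end of the freed region
obj_size    = 1344                 # object size of the perf_event cache

# Offset of the bad access inside the freed object.
offset = access_addr - obj_start

# The region bounds should match the cache object size, and the
# access should fall inside the region.
assert obj_end - obj_start == obj_size
assert obj_start <= access_addr < obj_end

# Assumption: generic KASAN, one shadow byte per 8 bytes of memory.
KASAN_GRANULE = 8
row_start = 0xffff8881151dc080     # shadow row marked with '>' in the dump
shadow_index = (access_addr - row_start) // KASAN_GRANULE

print(f"offset into object: {offset} bytes (0x{offset:x})")
print(f"shadow byte index in marked row: {shadow_index}")
```

The computed offset agrees with the "376 bytes inside of freed 1344-byte region" line, so the report is internally consistent: the rb_erase() call made from the hrtimer cancel path read memory inside a perf_event object that had already been freed via RCU.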