Date:   Fri, 11 Mar 2022 15:53:20 +0800
From:   kernel test robot <oliver.sang@...el.com>
To:     Song Liu <song@...nel.org>
Cc:     0day robot <lkp@...el.com>, LKML <linux-kernel@...r.kernel.org>,
        lkp@...ts.01.org, bpf@...r.kernel.org, netdev@...r.kernel.org,
        ast@...nel.org, daniel@...earbox.net, andrii@...nel.org,
        kernel-team@...com, edumazet@...gle.com, Song Liu <song@...nel.org>
Subject: [bpf]  4d06f388e1: WARNING:at_kernel/bpf/core.c:#bpf_prog_pack_free



Greetings,

FYI, we noticed the following commit (built with gcc-9):

commit: 4d06f388e14c69d938cbc1e4081029c14d8bc654 ("[PATCH v3 bpf-next] bpf: select proper size for bpf_prog_pack")
url: https://github.com/0day-ci/linux/commits/Song-Liu/bpf-select-proper-size-for-bpf_prog_pack/20220310-023737
base: https://git.kernel.org/cgit/linux/kernel/git/bpf/bpf-next.git master
patch link: https://lore.kernel.org/bpf/20220309183523.3308210-1-song@kernel.org

in testcase: stress-ng
version: stress-ng-x86_64-0.11-06_20220309
with the following parameters:

	nr_threads: 100%
	testtime: 60s
	class: memory
	test: stream
	cpufreq_governor: performance
	ucode: 0xd000331



on test machine: 128 threads 2 sockets Intel(R) Xeon(R) Platinum 8358 CPU @ 2.60GHz with 128G memory

caused the changes below (please refer to the attached dmesg/kmsg for the entire log/backtrace):



If you fix the issue, kindly add the following tag:
Reported-by: kernel test robot <oliver.sang@...el.com>


[ 18.435242][ T767] WARNING: CPU: 2 PID: 767 at kernel/bpf/core.c:946 bpf_prog_pack_free (kernel/bpf/core.c:946 (discriminator 3)) 
[   18.444373][  T767] Modules linked in: ip_tables
[   18.444375][  T767] CPU: 2 PID: 767 Comm: kworker/2:1 Not tainted 5.17.0-rc6-01953-g4d06f388e14c #1
[   18.444378][  T767] Workqueue: events bpf_prog_free_deferred
[ 18.444380][ T767] RIP: 0010:bpf_prog_pack_free (kernel/bpf/core.c:946 (discriminator 3)) 
[ 18.444382][ T767] Code: ed 61 f6 ff 48 89 ef e8 e5 21 11 00 eb 9f 80 3d 7a dd b5 01 00 75 96 48 c7 c7 22 e1 56 82 c6 05 6a dd b5 01 01 e8 86 31 a4 00 <0f> 0b e9 7c ff ff ff 0f 0b 0f 0b b9 ff ff ff ff e9 59 ff ff ff 66
All code
========
   0:	ed                   	in     (%dx),%eax
   1:	61                   	(bad)  
   2:	f6 ff                	idiv   %bh
   4:	48 89 ef             	mov    %rbp,%rdi
   7:	e8 e5 21 11 00       	callq  0x1121f1
   c:	eb 9f                	jmp    0xffffffffffffffad
   e:	80 3d 7a dd b5 01 00 	cmpb   $0x0,0x1b5dd7a(%rip)        # 0x1b5dd8f
  15:	75 96                	jne    0xffffffffffffffad
  17:	48 c7 c7 22 e1 56 82 	mov    $0xffffffff8256e122,%rdi
  1e:	c6 05 6a dd b5 01 01 	movb   $0x1,0x1b5dd6a(%rip)        # 0x1b5dd8f
  25:	e8 86 31 a4 00       	callq  0xa431b0
  2a:*	0f 0b                	ud2    		<-- trapping instruction
  2c:	e9 7c ff ff ff       	jmpq   0xffffffffffffffad
  31:	0f 0b                	ud2    
  33:	0f 0b                	ud2    
  35:	b9 ff ff ff ff       	mov    $0xffffffff,%ecx
  3a:	e9 59 ff ff ff       	jmpq   0xffffffffffffff98
  3f:	66                   	data16

Code starting with the faulting instruction
===========================================
   0:	0f 0b                	ud2    
   2:	e9 7c ff ff ff       	jmpq   0xffffffffffffff83
   7:	0f 0b                	ud2    
   9:	0f 0b                	ud2    
   b:	b9 ff ff ff ff       	mov    $0xffffffff,%ecx
  10:	e9 59 ff ff ff       	jmpq   0xffffffffffffff6e
  15:	66                   	data16
[   18.444384][  T767] RSP: 0000:ffa00000086a7e30 EFLAGS: 00010286
[   18.444386][  T767] RAX: 0000000000000000 RBX: ffffffffc0400000 RCX: c0000000ffff7fff
[   18.444387][  T767] RDX: ffa00000086a7c58 RSI: 00000000ffff7fff RDI: 0000000000000000
[   18.444388][  T767] RBP: ffffffff82c4c5f0 R08: 0000000000000000 R09: ffa00000086a7c50
[   18.444389][  T767] R10: 0000000000000001 R11: 0000000000000001 R12: ffffffffc0600640
[   18.444389][  T767] R13: ff1100103f2af400 R14: 0000000000000000 R15: ff1100103f2af405
[   18.444390][  T767] FS:  0000000000000000(0000) GS:ff1100103f280000(0000) knlGS:0000000000000000
[   18.444391][  T767] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   18.444392][  T767] CR2: 000055c0f768b248 CR3: 000000010005e002 CR4: 0000000000771ee0
[   18.444393][  T767] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   18.444393][  T767] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[   18.444394][  T767] PKRU: 55555554
[   18.444395][  T767] Call Trace:
[   18.444397][  T767]  <TASK>
[ 18.444398][ T767] bpf_jit_binary_pack_free (kernel/bpf/core.c:1151) 
[ 18.444400][ T767] bpf_jit_free+0x6c/0x80 
[ 18.444401][ T767] process_one_work (kernel/workqueue.c:2307) 
[ 18.444405][ T767] worker_thread (include/linux/list.h:292 kernel/workqueue.c:2455) 
[ 18.444407][ T767] ? process_one_work (kernel/workqueue.c:2397) 
[ 18.444409][ T767] kthread (kernel/kthread.c:377) 
[ 18.444410][ T767] ? kthread_complete_and_exit (kernel/kthread.c:332) 
[ 18.444412][ T767] ret_from_fork (arch/x86/entry/entry_64.S:295) 
[   18.444417][  T767]  </TASK>
[   18.444417][  T767] ---[ end trace 0000000000000000 ]---
[   18.789890][ T1318] IPMI message handler: version 39.2
[   18.835704][ T1345] ioatdma: Intel(R) QuickData Technology Driver 5.00
[   18.841494][ T1324] ipmi_si IPI0001:00: ipmi_platform: probing via ACPI
[   18.855954][ T1324] ipmi_si IPI0001:00: ipmi_platform: [io  0x0ca2-0x0ca3] regsize 1 spacing 1 irq 0
[   18.875833][ T1323] ACPI: bus type drm_connector registered
[   18.883069][ T1324] ipmi_si: Adding ACPI-specified kcs state machine
[   18.890060][ T1324] ipmi_si: Trying ACPI-specified kcs state machine at i/o address 0xca2, slave address 0x0, irq 0
[   18.892421][ T1317] power_meter ACPI000D:00: Found ACPI power meter.
[   18.906871][ T1317] power_meter ACPI000D:00: Ignoring unsafe software power cap!
[   18.914268][ T1317] power_meter ACPI000D:00: hwmon_device_register() is deprecated. Please convert the driver to use hwmon_device_register_with_info().
[   18.936700][ T1174] ahci 0000:00:17.0: version 3.0
[   18.952686][ T1174] ahci 0000:00:17.0: flags: 64bit ncq sntf led clo only pio slum part ems deso sadm sds apst
[   18.953557][ T1369] RAPL PMU: API unit is 2^-32 Joules, 2 fixed counters, 655360 ms ovfl timer
[   18.961014][ T1324] ipmi_si IPI0001:00: IPMI message handler: Found new BMC (man_id: 0x000157, prod_id: 0x0098, dev_id: 0x22)
[   18.982661][ T1369] RAPL PMU: hw unit of domain package 2^-14 Joules
[   18.989691][ T1174] scsi host0: ahci
[   19.000736][ T1174] scsi host1: ahci
[   19.005180][ T1174] scsi host2: ahci
[   19.010631][ T1174] scsi host3: ahci
[   19.019744][ T1324] ipmi_si IPI0001:00: IPMI kcs interface initialized
[   19.031746][ T1174] scsi host6: ahci
[   19.041630][ T1174] ata1: SATA max UDMA/133 abar m524288@...3100000 port 0x93100100 irq 405
[   19.049974][ T1174] ata2: SATA max UDMA/133 abar m524288@...3100000 port 0x93100180 irq 405
[   19.058319][ T1174] ata3: SATA max UDMA/133 abar m524288@...3100000 port 0x93100200 irq 405
[   19.066665][ T1174] ata4: SATA max UDMA/133 abar m524288@...3100000 port 0x93100280 irq 405
[   19.075012][ T1174] ata5: SATA max UDMA/133 abar m524288@...3100000 port 0x93100300 irq 405
[   19.083358][ T1174] ata6: SATA max UDMA/133 abar m524288@...3100000 port 0x93100380 irq 405
[   19.091702][ T1174] ata7: SATA max UDMA/133 abar m524288@...3100000 port 0x93100400 irq 405
[   19.100048][ T1174] ata8: SATA max UDMA/133 abar m524288@...3100000 port 0x93100480 irq 405
[   19.109981][ T1174] mei_me 0000:00:16.0: Device doesn't have valid ME Interface
[   19.261650][   T17] intel_rapl_common: Found RAPL domain dram
[   19.267390][   T17] intel_rapl_common: DRAM domain energy unit 15300pj
[   19.273915][   T17] intel_rapl_common: RAPL package-0 domain package locked by BIOS
[   19.284649][  T173] intel_rapl_common: Found RAPL domain package
[   19.290669][  T173] intel_rapl_common: Found RAPL domain dram
[   19.290671][  T173] intel_rapl_common: DRAM domain energy unit 15300pj
[   19.290673][  T173] intel_rapl_common: RAPL package-1 domain package locked by BIOS
[   19.424065][ T1493] ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[   19.430950][ T1507] ata8: SATA link down (SStatus 0 SControl 300)
[   19.437060][ T1503] ata6: SATA link down (SStatus 0 SControl 300)
[   19.443169][ T1505] ata7: SATA link down (SStatus 0 SControl 300)
[   19.449282][ T1495] ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[   19.456170][ T1497] ata3: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[   19.463064][ T1499] ata4: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[   19.469951][ T1493] ata1.00: ATA-10: INTEL SSDSC2KB240G8, XCV10120, max UDMA/133
[   19.477346][ T1493] ata1.00: 468862128 sectors, multi 1: LBA48 NCQ (depth 32)
[   19.484506][ T1499] ata4.00: ATA-10: WDC WD20SPZX-08UA7, 02.01A02, max UDMA/133
[   19.491816][ T1497] ata3.00: ATA-9: INTEL SSDSC2BB800G4, D2010370, max UDMA/133
[   19.499128][ T1497] ata3.00: 1562824368 sectors, multi 1: LBA48 NCQ (depth 32)
[   19.506351][ T1495] ata2.00: ATA-9: INTEL SSDSC2BA400G4, G2010150, max UDMA/133
[   19.513659][ T1495] ata2.00: 781422768 sectors, multi 1: LBA48 NCQ (depth 32)
[   19.520805][ T1493] ata1.00: configured for UDMA/133
[   19.526310][  T721] scsi 0:0:0:0: Direct-Access     ATA      INTEL SSDSC2KB24 0120 PQ: 0 ANSI: 5
[   19.526457][ T1495] ata2.00: configured for UDMA/133
[   19.540115][ T1497] ata3.00: configured for UDMA/133
[   19.540121][ T1499] ata4.00: 3907029168 sectors, multi 16: LBA48 NCQ (depth 32), AA
[   19.540124][ T1499] ata4.00: Features: NCQ-prio
[   19.540446][  T656] scsi 1:0:0:0: Direct-Access     ATA      INTEL SSDSC2BA40 0150 PQ: 0 ANSI: 5
[   19.583118][  T682] scsi 3:0:0:0: Direct-Access     ATA      WDC WD20SPZX-08U 1A02 PQ: 0 ANSI: 5
[   19.607486][ T1263] scsi 0:0:0:0: Attached scsi generic sg0 type 0
[   19.613842][ T1263] scsi 1:0:0:0: Attached scsi generic sg1 type 0
[   19.659635][  T740] ata2.00: Enabling discard_zeroes_data
[   19.665037][  T677] ata3.00: Enabling discard_zeroes_data
[   19.670457][  T684] sd 3:0:0:0: [sdd] 3907029168 512-byte logical blocks: (2.00 TB/1.82 TiB)
[   19.670457][  T740] sd 1:0:0:0: [sdb] 781422768 512-byte logical blocks: (400 GB/373 GiB)
[   19.670457][  T677] sd 2:0:0:0: [sdc] 1562824368 512-byte logical blocks: (800 GB/745 GiB)
[   19.670460][  T740] sd 1:0:0:0: [sdb] 4096-byte physical blocks
[   19.670460][  T677] sd 2:0:0:0: [sdc] 4096-byte physical blocks
[   19.670464][  T677] sd 2:0:0:0: [sdc] Write Protect is off
[   19.670465][  T677] sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00
[   19.670472][  T677] sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[   19.671106][  T677] ata3.00: Enabling discard_zeroes_data
[   19.671113][  T679] sd 0:0:0:0: [sda] 468862128 512-byte logical blocks: (240 GB/224 GiB)
[   19.671115][  T679] sd 0:0:0:0: [sda] 4096-byte physical blocks
[   19.671119][  T679] sd 0:0:0:0: [sda] Write Protect is off
[   19.671120][  T679] sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
[   19.671125][  T679] sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[   19.671893][  T679] ata1.00: Enabling discard_zeroes_data
[   19.674306][  T679]  sda: sda1 sda2
[   19.674488][  T679] ata1.00: Enabling discard_zeroes_data
[   19.674517][  T679] sd 0:0:0:0: [sda] Attached SCSI disk
[   19.677790][  T677]  sdc: sdc1 sdc2 sdc3
[   19.677979][  T677] ata3.00: Enabling discard_zeroes_data
[   19.678012][  T677] sd 2:0:0:0: [sdc] Attached SCSI disk
[   19.678886][  T684] sd 3:0:0:0: [sdd] 4096-byte physical blocks
[   19.678893][  T684] sd 3:0:0:0: [sdd] Write Protect is off
[   19.687063][  T740] sd 1:0:0:0: [sdb] Write Protect is off
[   19.695318][  T684] sd 3:0:0:0: [sdd] Mode Sense: 00 3a 00 00
[   19.701237][  T740] sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00
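
When triaging a log like the one above, it can help to pull the key fields
out of the WARNING header and to check which byte of the "Code:" dump is
marked as the trapping one. The following is only a minimal sketch: the two
input strings are copied from this report, while the helper names
parse_warning() and trapping_offset() are hypothetical and not part of
lkp-tests or any kernel script.

```python
import re

# WARNING header copied from the log in this report.
WARNING_LINE = (
    "[ 18.435242][ T767] WARNING: CPU: 2 PID: 767 at kernel/bpf/core.c:946 "
    "bpf_prog_pack_free (kernel/bpf/core.c:946 (discriminator 3))"
)

# "Code:" bytes copied from the log; the <..> token marks the trapping byte.
CODE_LINE = (
    "ed 61 f6 ff 48 89 ef e8 e5 21 11 00 eb 9f 80 3d 7a dd b5 01 00 75 96 "
    "48 c7 c7 22 e1 56 82 c6 05 6a dd b5 01 01 e8 86 31 a4 00 <0f> 0b e9 "
    "7c ff ff ff 0f 0b 0f 0b b9 ff ff ff ff e9 59 ff ff ff 66"
)

# Assumed layout of the header: "WARNING: CPU: N PID: N at file:line func".
WARNING_RE = re.compile(
    r"WARNING: CPU: (?P<cpu>\d+) PID: (?P<pid>\d+) at "
    r"(?P<file>\S+):(?P<line>\d+) (?P<func>\w+)"
)


def parse_warning(line):
    """Return CPU, PID, source file, line and function, or None if no match."""
    match = WARNING_RE.search(line)
    if match is None:
        return None
    info = match.groupdict()
    for key in ("cpu", "pid", "line"):
        info[key] = int(info[key])
    return info


def trapping_offset(code):
    """Return the index of the byte wrapped in <..>, or None if absent."""
    for offset, token in enumerate(code.split()):
        if token.startswith("<") and token.endswith(">"):
            return offset
    return None


if __name__ == "__main__":
    print(parse_warning(WARNING_LINE))
    offset = trapping_offset(CODE_LINE)
    print("trapping byte at offset", hex(offset))
```

The offset reported for the marked byte can be compared against the line
flagged with "*" in the "All code" disassembly listing.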


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        sudo bin/lkp install job.yaml           # job file is attached in this email
        bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
        sudo bin/lkp run generated-yaml-file

        # if you come across any failure that blocks the test,
        # please remove the ~/.lkp and /lkp directories to run from a clean state.



---
0-DAY CI Kernel Test Service
https://lists.01.org/hyperkitty/list/lkp@lists.01.org

Thanks,
Oliver Sang


View attachment "config-5.17.0-rc6-01953-g4d06f388e14c" of type "text/plain" (162334 bytes)

View attachment "job-script" of type "text/plain" (8139 bytes)

Download attachment "dmesg.xz" of type "application/x-xz" (37812 bytes)

View attachment "stress-ng" of type "text/plain" (14636 bytes)

View attachment "job.yaml" of type "text/plain" (5496 bytes)

View attachment "reproduce" of type "text/plain" (340 bytes)
