[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <202511061432.4e534796-lkp@intel.com>
Date: Thu, 6 Nov 2025 14:53:09 +0800
From: kernel test robot <oliver.sang@...el.com>
To: Mathieu Desnoyers <mathieu.desnoyers@...icios.com>
CC: <oe-lkp@...ts.linux.dev>, <lkp@...el.com>, Andrew Morton
<akpm@...ux-foundation.org>, "Paul E. McKenney" <paulmck@...nel.org>, "Steven
Rostedt" <rostedt@...dmis.org>, Masami Hiramatsu <mhiramat@...nel.org>,
"Dennis Zhou" <dennis@...nel.org>, Tejun Heo <tj@...nel.org>, Christoph
Lameter <cl@...ux.com>, Martin Liu <liumartin@...gle.com>, David Rientjes
<rientjes@...gle.com>, Shakeel Butt <shakeel.butt@...ux.dev>, SeongJae Park
<sj@...nel.org>, Michal Hocko <mhocko@...e.com>, Johannes Weiner
<hannes@...xchg.org>, Sweet Tea Dorminy <sweettea@...gle.com>, "Lorenzo
Stoakes" <lorenzo.stoakes@...cle.com>, "Liam R . Howlett"
<liam.howlett@...cle.com>, Mike Rapoport <rppt@...nel.org>, "Suren
Baghdasaryan" <surenb@...gle.com>, Vlastimil Babka <vbabka@...e.cz>,
"Christian Brauner" <brauner@...nel.org>, Wei Yang
<richard.weiyang@...il.com>, "David Hildenbrand" <david@...hat.com>, Miaohe
Lin <linmiaohe@...wei.com>, Al Viro <viro@...iv.linux.org.uk>, Yu Zhao
<yuzhao@...gle.com>, Roman Gushchin <roman.gushchin@...ux.dev>, Mateusz Guzik
<mjguzik@...il.com>, Matthew Wilcox <willy@...radead.org>, Baolin Wang
<baolin.wang@...ux.alibaba.com>, "Aboorva Devarajan"
<aboorvad@...ux.ibm.com>, <linux-mm@...ck.org>,
<linux-kernel@...r.kernel.org>, <linux-trace-kernel@...r.kernel.org>,
"Mathieu Desnoyers" <mathieu.desnoyers@...icios.com>,
<christian.koenig@....com>, <oliver.sang@...el.com>
Subject: Re: [RFC PATCH v7 2/2] mm: Fix OOM killer inaccuracy on large
many-core systems
Hello,
kernel test robot noticed "BUG:Bad_rss-counter_state_mm:#type:MM_ANONPAGES_val:#Comm:kworker##Pid" on:
commit: 25ae03e80acad812e536694c1a07a3f57784ae23 ("[RFC PATCH v7 2/2] mm: Fix OOM killer inaccuracy on large many-core systems")
url: https://github.com/intel-lab-lkp/linux/commits/Mathieu-Desnoyers/lib-Introduce-hierarchical-per-cpu-counters/20251031-224455
base: https://git.kernel.org/cgit/linux/kernel/git/akpm/mm.git mm-everything
patch link: https://lore.kernel.org/all/20251031144232.15284-3-mathieu.desnoyers@efficios.com/
patch subject: [RFC PATCH v7 2/2] mm: Fix OOM killer inaccuracy on large many-core systems
in testcase: boot
config: x86_64-randconfig-002-20251103
compiler: clang-20
test machine: qemu-system-x86_64 -enable-kvm -cpu SandyBridge -smp 2 -m 16G
(please refer to attached dmesg/kmsg for entire log/backtrace)
in fact, we observed various BUG:Bad_rss-counter_state_mm issues for this commit
but clean on parent, as below
+------------------------------------------------------------------------+------------+------------+
| | 05880dc4af | 25ae03e80a |
+------------------------------------------------------------------------+------------+------------+
| BUG:Bad_rss-counter_state_mm:#type:MM_FILEPAGES_val:#Comm:kworker##Pid | 0 | 10 |
| BUG:Bad_rss-counter_state_mm:#type:MM_ANONPAGES_val:#Comm:kworker##Pid | 0 | 17 |
| BUG:Bad_rss-counter_state_mm:#type:MM_ANONPAGES_val:#Comm:swapper_Pid | 0 | 2 |
| BUG:Bad_rss-counter_state_mm:#type:MM_ANONPAGES_val:#Comm:modprobe_Pid | 0 | 3 |
| BUG:Bad_rss-counter_state_mm:#type:MM_FILEPAGES_val:#Comm:modprobe_Pid | 0 | 1 |
+------------------------------------------------------------------------+------------+------------+
If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@...el.com>
| Closes: https://lore.kernel.org/oe-lkp/202511061432.4e534796-lkp@intel.com
[ 14.858862][ T67] BUG: Bad rss-counter state mm:ffff8881000655c0 type:MM_ANONPAGES val:0 Comm:kworker/u9:0 Pid:67
[ 14.894890][ T69] BUG: Bad rss-counter state mm:ffff888100061cc0 type:MM_FILEPAGES val:0 Comm:kworker/u9:0 Pid:69
[ 14.896108][ T69] BUG: Bad rss-counter state mm:ffff888100061cc0 type:MM_ANONPAGES val:0 Comm:kworker/u9:0 Pid:69
[ 14.918858][ T71] module: module-autoload: duplicate request for module crypto-aes
[ 14.919479][ T71] module: module-autoload: duplicate request for module crypto-aes-all
[ 14.920801][ T1] krb5: Running aes128-cts-hmac-sha256-128 enc plain<block
[ 14.921844][ T1] krb5: Running aes128-cts-hmac-sha256-128 enc plain==block
[ 14.922852][ T1] krb5: Running aes128-cts-hmac-sha256-128 enc plain>block
[ 14.923843][ T1] krb5: Running aes256-cts-hmac-sha384-192 enc no plain
[ 14.939591][ T1] krb5: Running aes256-cts-hmac-sha384-192 enc plain<block
[ 14.940614][ T1] krb5: Running aes256-cts-hmac-sha384-192 enc plain==block
[ 14.941586][ T1] krb5: Running aes256-cts-hmac-sha384-192 enc plain>block
[ 14.942547][ T1] krb5: Running camellia128-cts-cmac enc no plain
[ 15.018568][ T85] BUG: Bad rss-counter state mm:ffff888160f81340 type:MM_ANONPAGES val:0 Comm:kworker/u9:0 Pid:85
[ 15.054490][ T89] module: module-autoload: duplicate request for module crypto-camellia
[ 15.055466][ T89] module: module-autoload: duplicate request for module crypto-camellia-all
[ 15.056999][ T1] krb5: Running camellia128-cts-cmac enc 1 plain
[ 15.057912][ T1] krb5: Running camellia128-cts-cmac enc 9 plain
[ 15.058781][ T1] krb5: Running camellia128-cts-cmac enc 13 plain
[ 15.059603][ T1] krb5: Running camellia128-cts-cmac enc 30 plain
[ 15.061279][ T1] krb5: Running camellia256-cts-cmac enc no plain
[ 15.062207][ T1] krb5: Running camellia256-cts-cmac enc 1 plain
[ 15.063150][ T1] krb5: Running camellia256-cts-cmac enc 9 plain
[ 15.072917][ T1] krb5: Running camellia256-cts-cmac enc 13 plain
[ 15.073896][ T1] krb5: Running camellia256-cts-cmac enc 30 plain
[ 15.074834][ T1] krb5: Running aes128-cts-hmac-sha256-128 mic
[ 15.075625][ T1] krb5: Running aes256-cts-hmac-sha384-192 mic
[ 15.076396][ T1] krb5: Running camellia128-cts-cmac mic abc
[ 15.077225][ T1] krb5: Running camellia128-cts-cmac mic ABC
[ 15.078052][ T1] krb5: Running camellia256-cts-cmac mic 123
[ 15.078853][ T1] krb5: Running camellia256-cts-cmac mic !@#
[ 15.079621][ T1] krb5: Selftests succeeded
[ 15.080683][ T1] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 248)
[ 15.081610][ T1] io scheduler kyber registered
[ 15.082527][ T1] test_mul_u64_u64_div_u64: Starting mul_u64_u64_div_u64() test
[ 15.083365][ T1] test_mul_u64_u64_div_u64: ERROR: 0x000000000000000b * 0x0000000000000007 +/ 0x0000000000000003
[ 15.086382][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: 000000000000001a
[ 15.087178][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: 0000000000000019
[ 15.088064][ T1] test_mul_u64_u64_div_u64: ERROR: 0x00000000ffffffff * 0x00000000ffffffff +/ 0x0000000000000002
[ 15.089105][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: 7fffffff00000001
[ 15.089924][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: 7fffffff00000000
[ 15.090696][ T1] test_mul_u64_u64_div_u64: ERROR: 0x00000001ffffffff * 0x00000000ffffffff +/ 0x0000000000000002
[ 15.091734][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: fffffffe80000001
[ 15.092502][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: fffffffe80000000
[ 15.093281][ T1] test_mul_u64_u64_div_u64: ERROR: 0x00000001ffffffff * 0x00000001ffffffff +/ 0x0000000000000004
[ 15.094337][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: ffffffff00000001
[ 15.095172][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: ffffffff00000000
[ 15.095953][ T1] test_mul_u64_u64_div_u64: ERROR: 0xffff000000000000 * 0xffff000000000000 +/ 0xffff000000000001
[ 15.097175][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: ffff000000000000
[ 15.098020][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: fffeffffffffffff
[ 15.098837][ T1] test_mul_u64_u64_div_u64: ERROR: 0x3333333333333333 * 0x3333333333333333 +/ 0x5555555555555555
[ 15.099924][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: 1eb851eb851eb852
[ 15.100721][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: 1eb851eb851eb851
[ 15.101542][ T1] test_mul_u64_u64_div_u64: ERROR: 0x7fffffffffffffff * 0x0000000000000002 +/ 0x0000000000000003
[ 15.102565][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: 5555555555555555
[ 15.103368][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: 5555555555555554
[ 15.107134][ T1] test_mul_u64_u64_div_u64: ERROR: 0xffffffffffffffff * 0x0000000000000002 +/ 0x8000000000000000
[ 15.108196][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: 0000000000000004
[ 15.109049][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: 0000000000000003
[ 15.109887][ T1] test_mul_u64_u64_div_u64: ERROR: 0xffffffffffffffff * 0x0000000000000002 +/ 0xc000000000000000
[ 15.111017][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: 0000000000000003
[ 15.111907][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: 0000000000000002
[ 15.112666][ T1] test_mul_u64_u64_div_u64: ERROR: 0xffffffffffffffff * 0x4000000000000004 +/ 0x8000000000000000
[ 15.113703][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: 8000000000000008
[ 15.114527][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: 8000000000000007
[ 15.115424][ T1] test_mul_u64_u64_div_u64: ERROR: 0xffffffffffffffff * 0x4000000000000001 +/ 0x8000000000000000
[ 15.116279][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: 8000000000000002
[ 15.116882][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: 8000000000000001
[ 15.117490][ T1] test_mul_u64_u64_div_u64: ERROR: 0xfffffffffffffffe * 0x8000000000000001 +/ 0xffffffffffffffff
[ 15.118363][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: 8000000000000001
[ 15.119240][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: 8000000000000000
[ 15.119914][ T1] test_mul_u64_u64_div_u64: ERROR: 0xffffffffffffffff * 0x8000000000000001 +/ 0xfffffffffffffffe
[ 15.120785][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: 8000000000000002
[ 15.121627][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: 8000000000000001
[ 15.122503][ T1] test_mul_u64_u64_div_u64: ERROR: 0xffffffffffffffff * 0x8000000000000001 +/ 0xfffffffffffffffd
[ 15.123624][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: 8000000000000003
[ 15.124521][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: 8000000000000002
[ 15.125399][ T1] test_mul_u64_u64_div_u64: ERROR: 0x7fffffffffffffff * 0xffffffffffffffff +/ 0xc000000000000000
[ 15.126592][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: aaaaaaaaaaaaaaa9
[ 15.127438][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: aaaaaaaaaaaaaaa8
[ 15.128411][ T1] test_mul_u64_u64_div_u64: ERROR: 0xffffffffffffffff * 0x7fffffffffffffff +/ 0xa000000000000000
[ 15.129565][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: cccccccccccccccb
[ 15.130454][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: ccccccccccccccca
[ 15.131239][ T1] test_mul_u64_u64_div_u64: ERROR: 0xffffffffffffffff * 0x7fffffffffffffff +/ 0x9000000000000000
[ 15.132213][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: e38e38e38e38e38c
[ 15.132793][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: e38e38e38e38e38b
[ 15.133374][ T1] test_mul_u64_u64_div_u64: ERROR: 0x7fffffffffffffff * 0x7fffffffffffffff +/ 0x5000000000000000
[ 15.134101][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: ccccccccccccccca
[ 15.134674][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: ccccccccccccccc9
[ 15.135235][ T1] test_mul_u64_u64_div_u64: ERROR: 0xe6102d256d7ea3ae * 0x70a77d0be4c31201 +/ 0xd63ec35ab3220357
[ 15.135984][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: 78f8bf8cc86c6e19
[ 15.136587][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: 78f8bf8cc86c6e18
[ 15.137140][ T1] test_mul_u64_u64_div_u64: ERROR: 0xf53bae05cb86c6e1 * 0x3847b32d2f8d32e0 +/ 0xcfd4f55a647f403c
[ 15.137964][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: 42687f79d8998d36
[ 15.138541][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: 42687f79d8998d35
[ 15.139135][ T1] test_mul_u64_u64_div_u64: ERROR: 0x9951c5498f941092 * 0x1f8c8bfdf287a251 +/ 0xa3c8dc5f81ea3fe2
[ 15.139884][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: 1d887cb259000920
[ 15.140444][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: 1d887cb25900091f
[ 15.141025][ T1] test_mul_u64_u64_div_u64: ERROR: 0x374fee9daa1bb2bb * 0x0d0bfbff7b8ae3ef +/ 0xc169337bd42d5179
[ 15.141759][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: 03bb2dbaffcbb962
[ 15.142324][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: 03bb2dbaffcbb961
[ 15.142890][ T1] test_mul_u64_u64_div_u64: ERROR: 0xeac0d03ac10eeaf0 * 0x89be05dfa162ed9b +/ 0x92bb1679a41f0e4b
[ 15.143618][ T1] test_mul_u64_u64_div_u64: ERROR: expected result: dc5f5cc9e270d217
[ 15.144200][ T1] test_mul_u64_u64_div_u64: ERROR: obtained result: dc5f5cc9e270d216
[ 15.144767][ T1] test_mul_u64_u64_div_u64: Completed mul_u64_u64_div_u64() test, 56 tests, 23 errors, 61402015 ns
[ 15.147067][ T1] gpio_virtuser: Failed to create the debugfs tree: -2
[ 15.148313][ T1] gpio_winbond: chip ID at 2e is ffff
[ 15.148884][ T1] gpio_winbond: not an our chip
[ 15.149345][ T1] gpio_winbond: chip ID at 4e is ffff
[ 15.149721][ T1] gpio_winbond: not an our chip
[ 15.151343][ T1] IPMI message handler: version 39.2
[ 15.151885][ T1] ipmi device interface
[ 15.152644][ T1] ipmi_si: IPMI System Interface driver
[ 15.153494][ T1] ipmi_si: Unable to find any System Interface(s)
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20251106/202511061432.4e534796-lkp@intel.com
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
Powered by blists - more mailing lists