[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CABXGCsNYzqmuYmWj4jd3BYYgzXoUFwfUC0CyZe5fjWU9Ta9aiw@mail.gmail.com>
Date: Wed, 11 Jan 2023 02:52:25 +0500
From: Mikhail Gavrilov <mikhail.v.gavrilov@...il.com>
To: Felix Fietkau <nbd@....name>
Cc: Linux regressions mailing list <regressions@...ts.linux.dev>,
lorenzo@...nel.org, sujuan.chen@...iatek.com,
Linux List Kernel Mailing <linux-wireless@...r.kernel.org>,
Linux List Kernel Mailing <linux-kernel@...r.kernel.org>,
"David S. Miller" <davem@...emloft.net>,
Eric Dumazet <edumazet@...gle.com>,
Jakub Kicinski <kuba@...nel.org>,
Paolo Abeni <pabeni@...hat.com>, spasswolf@....de
Subject: Re: [6.2][regression] after commit cd372b8c99c5a5cf6a464acebb7e4a79af7ec8ae
stopping working wifi mt7921e
On Tue, Jan 10, 2023 at 1:00 PM Felix Fietkau <nbd@....name> wrote:
>
>
> Johannes told me on IRC that he will review my patch soon. He simply has too many things to do at the moment.
>
Hi Felix,
I sometimes get this kernel oops.
[ 15.658988] mt7921e 0000:05:00.0: enabling device (0000 -> 0002)
[ 15.686595] BUG: unable to handle page fault for address: ffffb2758525a6a9
[ 15.687243] #PF: supervisor read access in kernel mode
[ 15.687806] #PF: error_code(0x0000) - not-present page
[ 15.687806] PGD 100000067 P4D 100000067 PUD 10020f067 PMD 11f02a067 PTE 0
[ 15.688647] Oops: 0000 [#1] PREEMPT SMP NOPTI
[ 15.688647] CPU: 10 PID: 728 Comm: systemd-udevd Tainted: G
W L ------- ---
6.2.0-0.rc3.20230110git5a41237ad1d4.25.fc38.x86_64 #1
[ 15.689537] Hardware name: ASUSTeK COMPUTER INC. ROG Strix
G513QY_G513QY/G513QY, BIOS G513QY.320 09/07/2022
[ 15.689537] RIP: 0010:mt7921_check_offload_capability+0xcb/0x100
[mt7921_common]
[ 15.689537] Code: 38 0f b7 03 0f b6 53 02 01 d0 48 98 48 8d 5c 05
00 48 39 cb 73 23 80 7b 03 04 48 8d 6b 04 75 e1 e8 fa 06 fe ee 48 85
ed 74 14 <0f> b6 43 05 48 83 c4 08 5b 5d c3 cc cc cc cc e8 e1 06 fe ee
48 83
[ 15.691541] RSP: 0018:ffffb27581b77b38 EFLAGS: 00010282
[ 15.691541] RAX: 0000000000000001 RBX: ffffb2758525a6a4 RCX: 000000008080003c
[ 15.691541] RDX: 000000008080003d RSI: ffffd478c5fe82c0 RDI: 0000000040000000
[ 15.691541] RBP: ffffb2758525a6a8 R08: 0000000000000000 R09: 000000008080003c
[ 15.693538] R10: ffff98463fa0bb60 R11: 0000000000000000 R12: ffff9845d33490d0
[ 15.693538] R13: ffff9845d33490d0 R14: ffff9845dbb9dda8 R15: ffffb27581b77da0
[ 15.693538] FS: 00007f498fed3b40(0000) GS:ffff985498a00000(0000)
knlGS:0000000000000000
[ 15.693538] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 15.693538] CR2: ffffb2758525a6a9 CR3: 000000017ebbc000 CR4: 0000000000750ee0
[ 15.695563] PKRU: 55555554
[ 15.695563] Call Trace:
[ 15.696548] <TASK>
[ 15.696548] mt7921_pci_probe+0xa6/0x340 [mt7921e]
[ 15.697540] ? __pm_runtime_resume+0x54/0x90
[ 15.697540] local_pci_probe+0x41/0x80
[ 15.698542] pci_device_probe+0xb3/0x220
[ 15.698542] really_probe+0xde/0x380
[ 15.698542] ? pm_runtime_barrier+0x50/0x90
[ 15.699559] __driver_probe_device+0x78/0x170
[ 15.699559] driver_probe_device+0x1f/0x90
[ 15.700547] __driver_attach+0xd2/0x1c0
[ 15.700547] ? __pfx___driver_attach+0x10/0x10
[ 15.701539] bus_for_each_dev+0x76/0xa0
[ 15.701539] bus_add_driver+0x1b1/0x200
[ 15.701539] driver_register+0x89/0xe0
[ 15.702537] ? __pfx_init_module+0x10/0x10 [mt7921e]
[ 15.703136] do_one_initcall+0x6e/0x330
[ 15.703562] do_init_module+0x4a/0x200
[ 15.703583] __do_sys_init_module+0x16a/0x1a0
[ 15.704344] ? sched_clock_local+0xe/0x80
[ 15.704565] do_syscall_64+0x5b/0x80
[ 15.704565] ? lock_release+0x14b/0x440
[ 15.704565] ? up_read+0x17/0x20
[ 15.705541] ? lock_is_held_type+0xe8/0x140
[ 15.705541] ? asm_exc_page_fault+0x22/0x30
[ 15.705541] ? lockdep_hardirqs_on+0x7d/0x100
[ 15.705541] entry_SYSCALL_64_after_hwframe+0x72/0xdc
[ 15.705541] RIP: 0033:0x7f499091100e
[ 15.707560] Code: 48 8b 0d fd 4d 0c 00 f7 d8 64 89 01 48 83 c8 ff
c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca b8 af 00 00
00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d ca 4d 0c 00 f7 d8 64 89
01 48
[ 15.707560] RSP: 002b:00007ffd7b33ed98 EFLAGS: 00000246 ORIG_RAX:
00000000000000af
[ 15.707560] RAX: ffffffffffffffda RBX: 00005626f8635cf0 RCX: 00007f499091100e
[ 15.707560] RDX: 00007f4990de2453 RSI: 0000000000030f76 RDI: 00005626f8b50d70
[ 15.707560] RBP: 00007f4990de2453 R08: 00005626f8634050 R09: ffffd6297d846130
[ 15.709539] R10: 0000000000000005 R11: 0000000000000246 R12: 0000000000020000
[ 15.709539] R13: 00005626f8634210 R14: 0000000000000000 R15: 00005626f8a11670
[ 15.709539] </TASK>
[ 15.709539] Modules linked in: snd_intel_dspcfg mt7921e(+)
snd_intel_sdw_acpi mt7921_common amd64_edac(-) btusb binfmt_misc
edac_mce_amd snd_soc_core snd_hda_codec mt76_connac_lib btrtl
snd_compress mt76 ac97_bus btbcm kvm_amd snd_pcm_dmaengine
snd_hda_core btintel snd_pci_ps snd_rpl_pci_acp6x btmtk mac80211
snd_pci_acp6x snd_hwdep kvm snd_seq bluetooth irqbypass libarc4
snd_seq_device snd_pcm vfat rapl snd_pci_acp5x asus_nb_wmi fat
wmi_bmof cfg80211 pcspkr snd_timer snd_rn_pci_acp3x snd_acp_config
k10temp snd_soc_acpi snd i2c_piix4 snd_pci_acp3x soundcore
asus_wireless joydev amd_pmc acpi_cpufreq zram amdgpu drm_ttm_helper
ttm hid_asus nvme asus_wmi iommu_v2 drm_buddy ledtrig_audio
sparse_keymap gpu_sched platform_profile nvme_core crct10dif_pclmul
drm_display_helper crc32_pclmul crc32c_intel polyval_clmulni rfkill
polyval_generic ucsi_acpi hid_multitouch ghash_clmulni_intel
sha512_ssse3 typec_ucsi serio_raw ccp sp5100_tco cec r8169 nvme_common
typec video i2c_hid_acpi i2c_hid wmi
[ 15.709539] ip6_tables ip_tables fuse
[ 15.712545] CR2: ffffb2758525a6a9
[ 15.712545] ---[ end trace 0000000000000000 ]---
[ 15.713540] RIP: 0010:mt7921_check_offload_capability+0xcb/0x100
[mt7921_common]
[ 15.713540] Code: 38 0f b7 03 0f b6 53 02 01 d0 48 98 48 8d 5c 05
00 48 39 cb 73 23 80 7b 03 04 48 8d 6b 04 75 e1 e8 fa 06 fe ee 48 85
ed 74 14 <0f> b6 43 05 48 83 c4 08 5b 5d c3 cc cc cc cc e8 e1 06 fe ee
48 83
[ 15.714539] RSP: 0018:ffffb27581b77b38 EFLAGS: 00010282
[ 15.714539] RAX: 0000000000000001 RBX: ffffb2758525a6a4 RCX: 000000008080003c
[ 15.714539] RDX: 000000008080003d RSI: ffffd478c5fe82c0 RDI: 0000000040000000
[ 15.715558] RBP: ffffb2758525a6a8 R08: 0000000000000000 R09: 000000008080003c
[ 15.715558] R10: ffff98463fa0bb60 R11: 0000000000000000 R12: ffff9845d33490d0
[ 15.716656] R13: ffff9845d33490d0 R14: ffff9845dbb9dda8 R15: ffffb27581b77da0
[ 15.716656] FS: 00007f498fed3b40(0000) GS:ffff985498a00000(0000)
knlGS:0000000000000000
[ 15.717536] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 15.717536] CR2: ffffb2758525a6a9 CR3: 000000017ebbc000 CR4: 0000000000750ee0
[ 15.717536] PKRU: 55555554
[ 15.759488] intel_rapl_common: Found RAPL domain package
[ 15.760623] intel_rapl_common: Found RAPL domain core
Is it somehow related to the tested patch?
Unfortunately, it happens too rarely and randomly that I do not have a
reproduction scenario for its exact repetition.
One thing I can say when this happens WiFi disappears.
I also attached the full kernel log here.
--
Best Regards,
Mike Gavrilov.
View attachment "dmesg.txt" of type "text/plain" (189849 bytes)
Powered by blists - more mailing lists