Message-ID: <616b754a-deb2-c80d-cbb4-7f0172b90f3a@pp.inet.fi>
Date:   Sat, 12 Nov 2022 23:03:10 +0200
From:   Ilkka Prusi <ilkka.prusi@...inet.fi>
To:     linux-kernel@...r.kernel.org
Subject: AMD GPU splat and network error

Hi,

There are two problems that have started appearing with 6.0. They might be 
unrelated.


1) There is a repeating warning from AMD GPU in logs (see below).

I don't see it causing any visible problems, but I'm including it in case 
it is related to the second issue.


2) Network connections start silently failing after the computer has 
been on for a while.

DNS requests start failing, downloads slow to a trickle and so on. 
Restarting the network hardware does not solve this; only rebooting the 
affected computer helps.


Is there some commonality that might cause both? Or are they two entirely 
separate things? The GPU is a discrete graphics card while the Ethernet 
controller is integrated on the motherboard, so I don't see them being 
related on that level.


Hardware:

NIC: 03:00.0 Ethernet controller: Intel Corporation I211 Gigabit Network 
Connection (rev 03)
GPU: 09:00.0 VGA compatible controller: Advanced Micro Devices, Inc. 
[AMD/ATI] Vega 10 XL/XT [Radeon RX Vega 56/64] (rev c3)


[  147.235047] ------------[ cut here ]------------
[  147.235051] WARNING: CPU: 6 PID: 2559 at drivers/gpu/drm/drm_modeset_lock.c:276 drm_modeset_drop_locks+0x4b/0x60
[  147.235058] Modules linked in: snd_seq_dummy(E) snd_hrtimer(E) 
snd_seq(E) binfmt_misc(E) nls_ascii(E) nls_cp850(E) vfat(E) fat(E) 
amdgpu(E) intel_rapl_msr(E) intel_rapl_common(E) iosf_mbi(E) 
gpu_sched(E) drm_buddy(E) edac_mce_amd(E) snd_usb_audio(E) kvm_amd(E) 
drm_display_helper(E) kvm(E) cec(E) snd_usbmidi_lib(E) 
snd_hda_codec_hdmi(E) snd_hda_codec_realtek(E) drm_ttm_helper(E) 
snd_hda_codec_generic(E) ledtrig_audio(E) ttm(E) snd_hda_intel(E) 
snd_intel_dspcfg(E) snd_rawmidi(E) snd_hda_codec(E) drm_kms_helper(E) 
snd_seq_device(E) irqbypass(E) hid_sony(E) evdev(E) snd_hda_core(E) 
mc(E) syscopyarea(E) crct10dif_pclmul(E) snd_hwdep(E) ff_memless(E) 
sysfillrect(E) input_leds(E) sysimgblt(E) snd_pcm(E) joydev(E) ccp(E) 
crc32_pclmul(E) snd_timer(E) fb_sys_fops(E) snd(E) 
ghash_clmulni_intel(E) aesni_intel(E) rng_core(E) k10temp(E) 
crypto_simd(E) soundcore(E) sg(E) wmi_bmof(E) cryptd(E) 
tiny_power_button(E) button(E) acpi_cpufreq(E) rapl(E) nfsd(E) 
auth_rpcgss(E) nfs_acl(E)
[  147.235088]  lockd(E) grace(E) sunrpc(E) msr(E) fuse(E) configfs(E) 
efi_pstore(E) dmi_sysfs(E) ip_tables(E) x_tables(E) ipv6(E) autofs4(E) 
efivarfs(E) raid10(E) raid456(E) async_raid6_recov(E) async_memcpy(E) 
async_pq(E) async_xor(E) xor(E) async_tx(E) raid6_pq(E) libcrc32c(E) 
raid1(E) raid0(E) multipath(E) linear(E) md_mod(E) hid_generic(E) 
usbhid(E) hid(E) xhci_pci(E) crc32c_intel(E) xhci_hcd(E) sd_mod(E) 
i2c_piix4(E) t10_pi(E) usbcore(E) igb(E) crc64_rocksoft(E) crc64(E) 
usb_common(E) dca(E) wmi(E) thermal(E)
[  147.235111] CPU: 6 PID: 2559 Comm: kwin_wayland Tainted: G            E      6.0.8-stable #111
[  147.235114] Hardware name: Gigabyte Technology Co., Ltd. X570 AORUS ELITE/X570 AORUS ELITE, BIOS F37c 05/12/2022
[  147.235116] RIP: 0010:drm_modeset_drop_locks+0x4b/0x60
[  147.235118] Code: 8b 50 08 48 8d b8 60 ff ff ff 48 89 51 08 48 89 0a 
48 89 00 48 89 40 08 e8 c2 08 29 00 48 8b 43 70 48 39 c5 75 d2 5b 5d c3 
cc <0f> 0b 8b 7f 68 e8 db fa ff ff eb b5 66 0f 1f 84 00 00 00 00 00 53
[  147.235120] RSP: 0018:ffffc90003327c70 EFLAGS: 00010282
[  147.235122] RAX: ffff888197fb3b01 RBX: ffffc90003327ce8 RCX: 
0000000000012406
[  147.235123] RDX: 0000000000012386 RSI: 00000000000000c0 RDI: 
ffffc90003327ce8
[  147.235124] RBP: 0000000000000000 R08: ffff8881097e8d50 R09: 
ffff8881097e8d50
[  147.235125] R10: 0000000000000000 R11: ffff8881097e8d50 R12: 
0000000000000000
[  147.235126] R13: ffff888197fb3140 R14: 0000000000000000 R15: 
0000000000000000
[  147.235127] FS:  00007fe7d088df00(0000) GS:ffff88881e980000(0000) 
knlGS:0000000000000000
[  147.235129] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  147.235130] CR2: 0000564e5d4923f8 CR3: 00000001a2e0c000 CR4: 
0000000000350ee0
[  147.235131] Call Trace:
[  147.235132]  <TASK>
[  147.235134]  drm_mode_atomic_ioctl+0x345/0xae0
[  147.235137]  ? find_held_lock+0x2b/0x80
[  147.235140]  ? drm_atomic_set_property+0xb30/0xb30
[  147.235142]  drm_ioctl_kernel+0x9c/0x140
[  147.235145]  drm_ioctl+0x217/0x410
[  147.235146]  ? drm_atomic_set_property+0xb30/0xb30
[  147.235148]  ? lock_release+0x93/0x1c0
[  147.235150]  ? _raw_spin_unlock_irqrestore+0x1e/0x40
[  147.235153]  amdgpu_drm_ioctl+0x45/0x80 [amdgpu]
[  147.235283]  __x64_sys_ioctl+0x88/0xc0
[  147.235287]  do_syscall_64+0x3a/0x80
[  147.235290]  entry_SYSCALL_64_after_hwframe+0x46/0xb0
[  147.235292] RIP: 0033:0x7fe7d551caeb
[  147.235293] Code: 00 48 89 44 24 18 31 c0 48 8d 44 24 60 c7 04 24 10 
00 00 00 48 89 44 24 08 48 8d 44 24 20 48 89 44 24 10 b8 10 00 00 00 0f 
05 <89> c2 3d 00 f0 ff ff 77 1c 48 8b 44 24 18 64 48 2b 04 25 28 00 00
[  147.235295] RSP: 002b:00007ffc54a2de50 EFLAGS: 00000246 ORIG_RAX: 
0000000000000010
[  147.235297] RAX: ffffffffffffffda RBX: 0000564e5cc68f00 RCX: 
00007fe7d551caeb
[  147.235298] RDX: 00007ffc54a2def0 RSI: 00000000c03864bc RDI: 
0000000000000017
[  147.235299] RBP: 00007ffc54a2def0 R08: 0000000000000000 R09: 
0000000000000000
[  147.235300] R10: 0000000000000000 R11: 0000000000000246 R12: 
00000000c03864bc
[  147.235301] R13: 0000000000000017 R14: 0000564e5ddd6940 R15: 
0000564e5ddd6960
[  147.235303]  </TASK>
[  147.235304] ---[ end trace 0000000000000000 ]---
[  147.235387] drm_modeset_lock attempting to lock a contended lock without backoff:
                   modeset_lock+0xef/0x1e0
                   drm_atomic_get_plane_state+0x73/0x170
                   drm_atomic_normalize_zpos+0x188/0x2b0
                   amdgpu_dm_atomic_check+0x3c2/0x1060 [amdgpu]
                   drm_atomic_check_only+0x5c0/0xa30
                   drm_mode_atomic_ioctl+0x6d5/0xae0
                   drm_ioctl_kernel+0x9c/0x140
                   drm_ioctl+0x217/0x410

--

  - Ilkka

