[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CABXGCsPeeHWUYCuAiZVSbn1Pq2mKK1umtcRYZFcG4z9712xdDg@mail.gmail.com>
Date: Fri, 9 Aug 2019 23:55:27 +0500
From: Mikhail Gavrilov <mikhail.v.gavrilov@...il.com>
To: Alex Deucher <alexdeucher@...il.com>
Cc: Michel Dänzer <michel@...nzer.net>,
Hillf Danton <hdanton@...a.com>,
Dave Airlie <airlied@...il.com>,
Linux List Kernel Mailing <linux-kernel@...r.kernel.org>,
amd-gfx list <amd-gfx@...ts.freedesktop.org>,
Linux Memory Management List <linux-mm@...ck.org>,
dri-devel <dri-devel@...ts.freedesktop.org>,
"Deucher, Alexander" <Alexander.Deucher@....com>,
Harry Wentland <harry.wentland@....com>,
"Koenig, Christian" <Christian.Koenig@....com>
Subject: Re: The issue with page allocation 5.3 rc1-rc2 (seems drm culprit here)
On Thu, 8 Aug 2019 at 19:26, Alex Deucher <alexdeucher@...il.com> wrote:
>
>
> Yup, good catch. Updated patch attached.
>
> Alex
Finally initial problem "gnome-shell: page allocation failure:
order:4, mode:0x40cc0(GFP_KERNEL|__GFP_COMP),
nodemask=(null),cpuset=/,mems_allowed=0" did not happens anymore with
latest version of the patch (I tested more than 23 hours)
But I hit a new problem:
[73808.088801] ------------[ cut here ]------------
[73808.088806] DEBUG_LOCKS_WARN_ON(ww_ctx->contending_lock)
[73808.088813] WARNING: CPU: 8 PID: 1348877 at
kernel/locking/mutex.c:757 __ww_mutex_lock.constprop.0+0xb0f/0x10c0
[73808.088815] Modules linked in: crypto_user sha512_ssse3
sha512_generic macvtap macvlan tap rfcomm xt_CHECKSUM xt_MASQUERADE
nf_nat_tftp nf_conntrack_tftp tun bridge stp llc
nf_conntrack_netbios_ns nf_conntrack_broadcast xt_CT ip6t_REJECT
nf_reject_ipv6 ip6t_rpfilter ipt_REJECT nf_reject_ipv4 xt_conntrack
ebtable_nat ip6table_nat ip6table_mangle ip6table_raw
ip6table_security iptable_nat nf_nat iptable_mangle iptable_raw
iptable_security nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c
ip_set nfnetlink ebtable_filter ebtables ip6table_filter ip6_tables
iptable_filter cmac bnep sunrpc vfat fat snd_hda_codec_realtek
snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi edac_mce_amd
kvm_amd snd_hda_intel snd_usb_audio snd_hda_codec rtwpci
snd_usbmidi_lib rtw88 kvm snd_rawmidi uvcvideo snd_hda_core xpad
snd_hwdep videobuf2_vmalloc mac80211 ff_memless irqbypass
videobuf2_memops snd_seq videobuf2_v4l2 snd_seq_device
videobuf2_common crct10dif_pclmul btusb crc32_pclmul snd_pcm btrtl
videodev
[73808.088845] cfg80211 btbcm btintel eeepc_wmi asus_wmi bluetooth
ghash_clmulni_intel joydev sparse_keymap mc wmi_bmof video k10temp
sp5100_tco snd_timer snd ecdh_generic soundcore i2c_piix4 ccp rfkill
libarc4 ecc gpio_amdpt gpio_generic acpi_cpufreq binfmt_misc ip_tables
hid_logitech_hidpp amdgpu amd_iommu_v2 gpu_sched ttm drm_kms_helper
drm igb crc32c_intel nvme dca i2c_algo_bit hid_logitech_dj nvme_core
wmi pinctrl_amd
[73808.088866] CPU: 8 PID: 1348877 Comm: Youngblood_x64v Not tainted
5.3.0-0.rc3.git1.2.fc31.x86_64 #1
[73808.088868] Hardware name: System manufacturer System Product
Name/ROG STRIX X470-I GAMING, BIOS 2406 06/21/2019
[73808.088871] RIP: 0010:__ww_mutex_lock.constprop.0+0xb0f/0x10c0
[73808.088873] Code: 28 00 74 28 e8 42 29 a6 ff 85 c0 74 1f 8b 05 f8
6a e0 00 85 c0 75 15 48 c7 c6 70 35 32 a5 48 c7 c7 f0 67 30 a5 e8 e9
84 5c ff <0f> 0b 4d 89 74 24 28 b8 dd ff ff ff 65 48 8b 14 25 40 8e 01
00 48
[73808.088876] RSP: 0018:ffffbe618c84b760 EFLAGS: 00010286
[73808.088878] RAX: 0000000000000000 RBX: ffff96f007450000 RCX: 0000000000000000
[73808.088880] RDX: 0000000000000002 RSI: 0000000000000001 RDI: 0000000000000246
[73808.088881] RBP: ffffbe618c84b820 R08: 0000000000000000 R09: 0000000000000000
[73808.088883] R10: ffffffffa6d3f740 R11: 00000000a6d3f373 R12: ffffbe618c84bb90
[73808.088884] R13: ffffbe618c84b7c0 R14: ffff96f6967d4258 R15: ffff96f6967d4260
[73808.088886] FS: 00000000078c3700(0000) GS:ffff96f6fb000000(0000)
knlGS:00007fffffe6c000
[73808.088888] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[73808.088890] CR2: 00007fc785db1000 CR3: 00000007a5f56000 CR4: 00000000003406e0
[73808.088892] Call Trace:
[73808.088896] ? _raw_spin_unlock_irq+0x29/0x40
[73808.088904] ? ttm_mem_evict_first+0x1ed/0x4f0 [ttm]
[73808.088908] ? ww_mutex_lock_interruptible+0x43/0xb0
[73808.088910] ww_mutex_lock_interruptible+0x43/0xb0
[73808.088916] ttm_mem_evict_first+0x1ed/0x4f0 [ttm]
[73808.088922] ttm_bo_mem_space+0x229/0x2c0 [ttm]
[73808.088928] ttm_bo_validate+0xe5/0x190 [ttm]
[73808.088933] ? lockdep_hardirqs_on+0xf0/0x180
[73808.088983] amdgpu_cs_bo_validate+0xaa/0x1b0 [amdgpu]
[73808.089033] amdgpu_cs_validate+0x3b/0x260 [amdgpu]
[73808.089081] amdgpu_cs_list_validate+0x110/0x180 [amdgpu]
[73808.089130] amdgpu_cs_ioctl+0x5a9/0x1d10 [amdgpu]
[73808.089188] ? amdgpu_cs_find_mapping+0x120/0x120 [amdgpu]
[73808.089200] drm_ioctl_kernel+0xaa/0xf0 [drm]
[73808.089212] drm_ioctl+0x208/0x390 [drm]
[73808.089258] ? amdgpu_cs_find_mapping+0x120/0x120 [amdgpu]
[73808.089262] ? sched_clock_cpu+0xc/0xc0
[73808.089265] ? lockdep_hardirqs_on+0xf0/0x180
[73808.089308] amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
[73808.089313] do_vfs_ioctl+0x411/0x750
[73808.089318] ksys_ioctl+0x5e/0x90
[73808.089320] __x64_sys_ioctl+0x16/0x20
[73808.089323] do_syscall_64+0x5c/0xb0
[73808.089325] entry_SYSCALL_64_after_hwframe+0x49/0xbe
[73808.089328] RIP: 0033:0x7f6b3167b07b
[73808.089330] Code: 0f 1e fa 48 8b 05 0d 9e 0c 00 64 c7 00 26 00 00
00 48 c7 c0 ff ff ff ff c3 66 0f 1f 44 00 00 f3 0f 1e fa b8 10 00 00
00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d dd 9d 0c 00 f7 d8 64 89
01 48
[73808.089331] RSP: 002b:00000000078bbce8 EFLAGS: 00000246 ORIG_RAX:
0000000000000010
[73808.089333] RAX: ffffffffffffffda RBX: 00000000078bbd60 RCX: 00007f6b3167b07b
[73808.089335] RDX: 00000000078bbd60 RSI: 00000000c0186444 RDI: 0000000000000103
[73808.089336] RBP: 00000000c0186444 R08: 00000000078bbe70 R09: 0000000000000030
[73808.089337] R10: 00000000078bbe70 R11: 0000000000000246 R12: 0000000000000004
[73808.089338] R13: 0000000000000103 R14: 0000000000000000 R15: 0000000000000000
[73808.089343] irq event stamp: 85143093
[73808.089346] hardirqs last enabled at (85143093):
[<ffffffffa4b24d19>] _raw_spin_unlock_irq+0x29/0x40
[73808.089348] hardirqs last disabled at (85143092):
[<ffffffffa4b1d188>] __schedule+0xc8/0x900
[73808.089350] softirqs last enabled at (85140666):
[<ffffffffa4e0035d>] __do_softirq+0x35d/0x45d
[73808.089353] softirqs last disabled at (85140569):
[<ffffffffa40f1e37>] irq_exit+0xf7/0x100
[73808.089355] ---[ end trace 14afc4859d6718ca ]---
So I needed to report it separately (in another thread) or we continue here?
View attachment "dmesg.txt" of type "text/plain" (245305 bytes)
Powered by blists - more mailing lists