lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Message-ID: <20170717083825.7fc26834@xeon-e3>
Date:   Mon, 17 Jul 2017 08:38:25 -0700
From:   Stephen Hemminger <stephen@...workplumber.org>
To:     netdev@...r.kernel.org
Subject: Fw: [Bug 196399] New: WARNING at net/sched/sch_generic.c:316
 dev_watchdog[...] when suspending



Begin forwarded message:

Date: Mon, 17 Jul 2017 08:34:16 +0000
From: bugzilla-daemon@...zilla.kernel.org
To: stephen@...workplumber.org
Subject: [Bug 196399] New: WARNING at net/sched/sch_generic.c:316 dev_watchdog[...] when suspending


https://bugzilla.kernel.org/show_bug.cgi?id=196399

            Bug ID: 196399
           Summary: WARNING at net/sched/sch_generic.c:316
                    dev_watchdog[...] when suspending
           Product: Networking
           Version: 2.5
    Kernel Version: 4.11+
          Hardware: All
                OS: Linux
              Tree: Mainline
            Status: NEW
          Severity: normal
          Priority: P1
         Component: Other
          Assignee: stephen@...workplumber.org
          Reporter: martin.peres@...e.fr
        Regression: No

Hello,

We have found out that since at least 4.11-rc1, some machines in the Intel GFX
CI lab have been generating the following warning when suspending to s4
(suspend to disk):

[  287.212825] ------------[ cut here ]------------
[  287.212829] WARNING: CPU: 0 PID: 3165 at net/sched/sch_generic.c:316
dev_watchdog+0x218/0x220
[  287.212830] Modules linked in: mcs7830 usbnet mii snd_hda_codec_hdmi
snd_hda_codec_realtek snd_hda_codec_generic i915 x86_pkg_temp_thermal
intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul snd_hda_intel
snd_hda_codec snd_hwdep ghash_clmulni_intel snd_hda_core snd_pcm
i2c_designware_platform i2c_designware_core mei_me mei prime_numbers i2c_hid
pinctrl_sunrisepoint pinctrl_intel
[  287.212864] CPU: 0 PID: 3165 Comm: gem_exec_suspen Tainted: G     U         
4.12.0-CI-CI_DRM_2829+ #1
[  287.212865] Hardware name: Dell Inc. XPS 13 9360/093TW6, BIOS 1.3.2
01/18/2017
[  287.212867] task: ffff8801b4084f40 task.stack: ffffc900001d8000
[  287.212869] RIP: 0010:dev_watchdog+0x218/0x220
[  287.212870] RSP: 0018:ffff88027e403e38 EFLAGS: 00010292
[  287.212872] RAX: 000000000000005a RBX: 0000000000000000 RCX:
0000000000000000
[  287.212874] RDX: 0000000000000002 RSI: ffffffff81cbcf89 RDI:
ffffffff81c9c627
[  287.212875] RBP: ffff88027e403e68 R08: 0000000000000000 R09:
0000000000000001
[  287.212876] R10: 0000000028e9c215 R11: 0000000000000000 R12:
ffff88026e08a848
[  287.212877] R13: 0000000000000000 R14: ffff88026e050020 R15:
0000000000000001
[  287.212878] FS:  00007f345056a8c0(0000) GS:ffff88027e400000(0000)
knlGS:0000000000000000
[  287.212880] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  287.212881] CR2: 00000000008d7008 CR3: 00000001b4314000 CR4:
00000000003406f0
[  287.212882] Call Trace:
[  287.212883]  <IRQ>
[  287.212886]  ? qdisc_rcu_free+0x40/0x40
[  287.212888]  ? qdisc_rcu_free+0x40/0x40
[  287.212891]  call_timer_fn+0x8e/0x370
[  287.212894]  ? qdisc_rcu_free+0x40/0x40
[  287.212896]  expire_timers+0x150/0x1f0
[  287.212899]  run_timer_softirq+0x7c/0x160
[  287.212903]  __do_softirq+0x116/0x4a0
[  287.212906]  irq_exit+0xa9/0xc0
[  287.212909]  smp_apic_timer_interrupt+0x38/0x50
[  287.212912]  apic_timer_interrupt+0x90/0xa0
[  287.212914] RIP: 0010:delay_tsc+0x33/0xc0
[  287.212916] RSP: 0018:ffffc900001dbcd8 EFLAGS: 00000286 ORIG_RAX:
ffffffffffffff10
[  287.212918] RAX: 0000000080000000 RBX: 00000005964f23a0 RCX:
0000000000000001
[  287.212919] RDX: 0000000080000001 RSI: ffffffff81c8e23a RDI:
00000000ffffffff
[  287.212920] RBP: ffffc900001dbcf8 R08: 0000000000000000 R09:
0000000000000001
[  287.212921] R10: 0000000000000000 R11: 0000000000000000 R12:
000000059633478e
[  287.212922] R13: 0000000000249f13 R14: 0000000000000000 R15:
ffff880272eac008
[  287.212924]  </IRQ>
[  287.212929]  ? delay_tsc+0x6b/0xc0
[  287.212932]  __delay+0xa/0x10
[  287.212934]  __const_udelay+0x31/0x40
[  287.212936]  hibernation_debug_sleep+0x20/0x30
[  287.212938]  hibernation_snapshot+0x2bc/0x5f0
[  287.212940]  hibernate+0x159/0x2f0
[  287.212943]  state_store+0xe0/0xf0
[  287.212947]  kobj_attr_store+0xf/0x20
[  287.212949]  sysfs_kf_write+0x40/0x50
[  287.212951]  kernfs_fop_write+0x130/0x1b0
[  287.212955]  __vfs_write+0x23/0x120
[  287.212957]  ? rcu_read_lock_sched_held+0x75/0x80
[  287.212959]  ? rcu_sync_lockdep_assert+0x2a/0x50
[  287.212961]  ? __sb_start_write+0xfa/0x1f0
[  287.212964]  vfs_write+0xc5/0x1d0
[  287.212966]  ? trace_hardirqs_on_caller+0xe7/0x1c0
[  287.212969]  SyS_write+0x44/0xb0
[  287.212972]  entry_SYSCALL_64_fastpath+0x1c/0xb1
[  287.212973] RIP: 0033:0x7f344ed4a4a0
[  287.212974] RSP: 002b:00007ffef50dfaa8 EFLAGS: 00000246 ORIG_RAX:
0000000000000001
[  287.212977] RAX: ffffffffffffffda RBX: ffffffff81470683 RCX:
00007f344ed4a4a0
[  287.212978] RDX: 0000000000000004 RSI: 000000000041d211 RDI:
0000000000000006
[  287.212979] RBP: ffffc900001dbf88 R08: 00000000008d6a50 R09:
0000000000000000
[  287.212980] R10: 0000000000000000 R11: 0000000000000246 R12:
000000000041d211
[  287.212981] R13: 0000000000000006 R14: 0000000000000000 R15:
0000000000000000
[  287.212984]  ? __this_cpu_preempt_check+0x13/0x20
[  287.212988] Code: 63 8e 18 04 00 00 eb 93 4c 89 f7 c6 05 77 5c 77 00 01 e8
dc 7f fd ff 89 d9 48 89 c2 4c 89 f6 48 c7 c7 18 f4 cf 81 e8 f1 c4 9d ff <0f> ff
eb c3 0f 1f 40 00 48 c7 47 08 00 00 00 00 55 48 c7 07 00 
[  287.213051] ---[ end trace b6016dcc7544a681 ]---

This is caught while running the intel-gpu-tools test named
'igt@..._exec_suspend@...ic-s4-devices' on the following machines:

 - Intel Kaby Lake-R RVP: Failure rate 123/135 run(s) (91%), last occurence:
https://intel-gfx-ci.01.org/CI/CI_DRM_2828/fi-kbl-r/igt@gem_exec_suspend@basic-s4-devices.html
 - Intel Kaby Lake i7-7560u: Failure rate 196/305 run(s) (64%), last occurence:
https://intel-gfx-ci.01.org/CI/CI_DRM_2827/fi-kbl-7560u/igt@gem_exec_suspend@basic-s4-devices.html 
 - Intel Skylake i7-6600u: Failure rate 23/75 run(s) (30%), last occurence:
https://intel-gfx-ci.01.org/CI/CI_DRM_2824/fi-skl-6600u/igt@gem_exec_suspend@basic-s4-devices.html
 - Intel Sandy Bridge i7-2600: Failure rate 10/293 run(s) (3%), last occurence:
https://intel-gfx-ci.01.org/CI/CI_DRM_2816/fi-snb-2600/igt@gem_exec_suspend@basic-s4-devices.html 

We have plenty of other machines that do not trigger this warning at all.

The bug used to live in fd.o's bugzilla, but it had no business being there:
https://bugs.freedesktop.org/show_bug.cgi?id=100125

Let me know if I can help in some ways.

-- 
You are receiving this mail because:
You are the assignee for the bug.

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ