Date:	Tue, 08 Jan 2013 10:45:25 +0100
From:	Mike Galbraith <bitbucket@...ine.de>
To:	Shawn Bohrer <sbohrer@...advisors.com>
Cc:	linux-kernel@...r.kernel.org, mingo@...e.hu, peterz@...radead.org
Subject: Re: kernel BUG at kernel/sched_rt.c:493!

On Mon, 2013-01-07 at 17:02 -0600, Shawn Bohrer wrote: 
> On Mon, Jan 07, 2013 at 11:58:18AM -0600, Shawn Bohrer wrote:
> > On Sat, Jan 05, 2013 at 11:46:32AM -0600, Shawn Bohrer wrote:
> > > I've tried reproducing the issue, but so far I've been unsuccessful;
> > > I believe that is because my RT tasks aren't using enough CPU to
> > > cause borrowing from the other runqueues.  Normally our RT tasks use
> > > very little CPU, so I'm not entirely sure what conditions caused them
> > > to run into throttling on the day this happened.
> > 
> > I've managed to reproduce this a couple of times now on 3.1.9; I'll
> > give this a try later with a more recent kernel.  Here is what I've
> > done to reproduce the issue.
> > 
> > 
> > # Setup in shell 1
> > root@...box39:/cgroup/cpuset# mkdir package0
> > root@...box39:/cgroup/cpuset# echo 0 > package0/cpuset.mems
> > root@...box39:/cgroup/cpuset# echo 0,2,4,6 > package0/cpuset.cpus
> > root@...box39:/cgroup/cpuset# cat cpuset.sched_load_balance
> > 1
> > root@...box39:/cgroup/cpuset# cat package0/cpuset.sched_load_balance
> > 1
> > root@...box39:/cgroup/cpuset# cat sysdefault/cpuset.sched_load_balance
> > 1
> > root@...box39:/cgroup/cpuset# echo 1,3,5,7 > sysdefault/cpuset.cpus
> > root@...box39:/cgroup/cpuset# echo 0 > sysdefault/cpuset.mems
> > root@...box39:/cgroup/cpuset# echo $$ > package0/tasks
> > 
> > # Setup in shell 2
> > root@...box39:~# cd /cgroup/cpuset/
> > root@...box39:/cgroup/cpuset# chrt -f -p 60 $$
> > root@...box39:/cgroup/cpuset# echo $$ > sysdefault/tasks
> > 
> > # In shell 1
> > root@...box39:/cgroup/cpuset# chrt -f 1 /root/burn.sh &
> > root@...box39:/cgroup/cpuset# chrt -f 1 /root/burn.sh &
> > 
> > # In shell 2
> > root@...box39:/cgroup/cpuset# echo 0 > cpuset.sched_load_balance
> > root@...box39:/cgroup/cpuset# echo 1 > cpuset.sched_load_balance
> > root@...box39:/cgroup/cpuset# echo 0 > cpuset.sched_load_balance
> > root@...box39:/cgroup/cpuset# echo 1 > cpuset.sched_load_balance
> > 
> > I haven't found the exact magic combination, but by going back and
> > forth adding/killing burn.sh processes and toggling
> > cpuset.sched_load_balance, I can usually get the machine to trigger
> > the bug within a couple of minutes.
> 
> I've also managed to reproduce this on 3.8.0-rc2, so it appears the
> bug is still present in the latest kernel.
> 
> Also, just re-reading my instructions above, /root/burn.sh is just a
> simple loop:
> 
> while true; do : ; done
> 
> I've also had to make the kworker threads SCHED_FIFO with a higher
> priority than burn.sh, otherwise, as expected, I can lock up the
> system due to some xfs threads getting starved.
> 
> Let me know if anyone needs any more information, or needs me to try
> anything since it appears I can trigger this fairly easily now.
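
For anyone not steeped in the RT bandwidth code, the "borrowing" Shawn
mentions above is the runtime balancing in kernel/sched_rt.c: a runqueue
that exhausts its local rt_runtime pulls a share of the spare budget
from its peers in the root domain, capped at the full period.  A toy
user-space model of that logic (names, numbers and structure are
simplified for illustration; this is not the kernel source):

/* Toy model of RT runtime borrowing (illustration only, not kernel code). */
#include <stdio.h>

#define NR_CPUS      8
#define RT_PERIOD    1000000000LL   /* 1s period, in ns */
#define RT_RUNTIME    950000000LL   /* default 950ms budget per runqueue */

struct toy_rt_rq {
        long long rt_runtime;       /* current budget */
        long long rt_time;          /* runtime consumed this period */
};

static struct toy_rt_rq rq[NR_CPUS];

/* A starved runqueue takes 1/NR_CPUS of each peer's spare budget,
 * but never grows beyond the full period. */
static void toy_borrow(int cpu)
{
        for (int i = 0; i < NR_CPUS; i++) {
                long long diff;

                if (i == cpu)
                        continue;
                diff = rq[i].rt_runtime - rq[i].rt_time;  /* peer's spare time */
                if (diff <= 0)
                        continue;
                diff /= NR_CPUS;
                if (rq[cpu].rt_runtime + diff > RT_PERIOD)
                        diff = RT_PERIOD - rq[cpu].rt_runtime;
                rq[i].rt_runtime   -= diff;   /* this peer gets "pilfered" */
                rq[cpu].rt_runtime += diff;   /* this runqueue is the "bandit" */
                if (rq[cpu].rt_runtime == RT_PERIOD)
                        break;
        }
}

int main(void)
{
        for (int i = 0; i < NR_CPUS; i++)
                rq[i].rt_runtime = RT_RUNTIME;

        rq[0].rt_time = RT_RUNTIME;   /* cpu0 burned its entire budget */
        toy_borrow(0);

        for (int i = 0; i < NR_CPUS; i++)
                printf("cpu%d rt_runtime = %lld\n", i, rq[i].rt_runtime);
        return 0;
}

Running it leaves cpu0 at the full period (1000000000) and cpu1 below
the default 950000000, the same pattern as in the runqueue dump further
down.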

I hit something very similar.

In my case, I had booted with sched_debug and a 57600 serial console
connected to my local box via a horribly slow internet link, which led
to it taking 40 seconds to get just 2 of 80 CPUs down for reboot.  There
was a gaggle of CPUs with pilfered rt_runtime and a matching gaggle of
bandits; no runqueues were throttled, nor had any accrued rt_time at
crash time.  But as CPU4 tried to go down, it couldn't find its
property, so it cleverly fired a thermonuclear pop-flare, lest thieves
escape unseen, precious rt_runtime in hand.
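
The BUG_ON that fired lives in __disable_runtime(): when a runqueue is
taken offline it has to settle its accounts, taking back whatever it
lent out (or handing back whatever it borrowed) so that the global
budget stays consistent, and if after walking every peer it is still
owed runtime, the kernel treats that as leaked bandwidth and blows up;
that is the kernel BUG at kernel/sched_rt.c:493 from the subject.
Continuing the toy model above (again simplified, not the real
__disable_runtime()):

/* Toy model of the settle-up a runqueue does when it goes offline.
 * Reuses struct toy_rt_rq, rq[], NR_CPUS and RT_RUNTIME from above. */
#include <assert.h>

static void toy_disable_runtime(int cpu)
{
        /* How much this runqueue is owed: positive if it lent budget
         * out, negative if it borrowed beyond its base share. */
        long long want = RT_RUNTIME - rq[cpu].rt_runtime;

        for (int i = 0; i < NR_CPUS && want != 0; i++) {
                long long diff;

                if (i == cpu || rq[i].rt_runtime < 0)
                        continue;       /* skip ourselves and disabled peers */
                if (want > 0) {
                        /* Reclaim as much as this peer can give back. */
                        diff = rq[i].rt_runtime < want ? rq[i].rt_runtime : want;
                        rq[i].rt_runtime -= diff;
                        want -= diff;
                } else {
                        /* We borrowed too much; hand the surplus to a peer. */
                        rq[i].rt_runtime -= want;
                        want = 0;
                }
        }

        /* The kernel's equivalent of this check is a BUG_ON() in
         * __disable_runtime(), which is what produced the backtrace below:
         * runtime was lent out but can no longer be found on any peer,
         * i.e. bandwidth leaked. */
        assert(want == 0);

        rq[cpu].rt_runtime = -1;   /* stand-in for RUNTIME_INF: no more lending */
}

Calling toy_disable_runtime() right after toy_borrow() settles cleanly;
the crash case is want staying positive because no peer still holds the
runtime this CPU lent out.  The real code only walks the CPUs in the
runqueue's root domain, so my guess is that toggling
cpuset.sched_load_balance, which tears those root domains down and
rebuilds them, is what lets lent-out runtime end up where the reclaim
can no longer see it.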

     runqueues 
     0xffff88047f811180      rt_runtime = 888192922   cpu0
     0xffff880c7f811180      rt_runtime = RUNTIME_INF cpu1
     0xffff88087f811180      rt_runtime = RUNTIME_INF cpu2
     0xffff880e7f811180      rt_runtime = 699462573   cpu3
     0xffff88047f851180      rt_runtime = 841094505   cpu4
     0xffff880c7f851180      rt_runtime = 926250000   cpu5
     0xffff88087f851180      rt_runtime = 926250000   cpu6
     0xffff880e7f851180      rt_runtime = 926250000   cpu7
     0xffff88047f891180      rt_runtime = 926250000   cpu8
     0xffff880c7f891180      rt_runtime = 926250000   cpu9
     0xffff88087f891180      rt_runtime = 926250000   cpu10
     0xffff880e7f891180      rt_runtime = 926250000   cpu11
     0xffff88047f8d1180      rt_runtime = 926250000   cpu12
     0xffff880c7f8d1180      rt_runtime = 926250000   cpu13
     0xffff88087f8d1180      rt_runtime = 1000000000  cpu14
     0xffff880e7f8d1180      rt_runtime = 1000000000  cpu15
     0xffff88047f911180      rt_runtime = 1000000000  cpu16
     0xffff880c7f911180      rt_runtime = 1000000000  cpu17
     0xffff88087f911180      rt_runtime = 1000000000  cpu18
     0xffff880e7f911180      rt_runtime = 1000000000  cpu19
     0xffff88047f951180      rt_runtime = 1000000000  cpu20
     0xffff880c7f951180      rt_runtime = 1000000000  cpu21
     0xffff88087f951180      rt_runtime = 1000000000  cpu23
     0xffff880e7f951180      rt_runtime = 1000000000  cpu24
     0xffff88047f991180      rt_runtime = 1000000000  cpu25
     0xffff880c7f991180      rt_runtime = 950000000   cpu26
     0xffff88087f991180      rt_runtime = 1000000000  cpu27
     0xffff880e7f991180      rt_runtime = 950000000   cpu28
     ....
     0xffff88087fcd1180      rt_runtime = 950000000   cpu79

crash> bt
PID: 14735  TASK: ffff880e75e86500  CPU: 4   COMMAND: "reboot"
 #0 [ffff880e74b3d890] machine_kexec at ffffffff8102676e
 #1 [ffff880e74b3d8e0] crash_kexec at ffffffff810a3a3a
 #2 [ffff880e74b3d9b0] oops_end at ffffffff81446238
 #3 [ffff880e74b3d9d0] do_invalid_op at ffffffff810035d4
 #4 [ffff880e74b3da70] invalid_op at ffffffff8144d9bb
    [exception RIP: __disable_runtime+494]
    RIP: ffffffff81045dae  RSP: ffff880e74b3db28  RFLAGS: 00010082
    RAX: 000000000000046a  RBX: ffff880e7fcd1310  RCX: ffffffff81d88a90
    RDX: 000000000000046a  RSI: 0000000000000050  RDI: ffff880e7fcd19b0
    RBP: ffff880e74b3db98   R8: 0000000000000010   R9: ffff88047f423408
    R10: 0000000000000050  R11: ffffffff81250190  R12: 0000000000000050
    R13: fffffffffde9f140  R14: ffff880e7fcd1310  R15: 000000000000122f
    ORIG_RAX: ffffffffffffffff  CS: 0010  SS: 0018
 #5 [ffff880e74b3dba0] rq_offline_rt at ffffffff8104b36a
 #6 [ffff880e74b3dbc0] set_rq_offline at ffffffff81043c9e
 #7 [ffff880e74b3dbe0] rq_attach_root at ffffffff8104c130
 #8 [ffff880e74b3dc10] cpu_attach_domain at ffffffff81054f44
 #9 [ffff880e74b3dc80] build_sched_domains at ffffffff81055525
#10 [ffff880e74b3dcd0] partition_sched_domains at ffffffff81055835
#11 [ffff880e74b3dd70] cpuset_cpu_inactive at ffffffff81055a0d
#12 [ffff880e74b3dd80] notifier_call_chain at ffffffff81448907
#13 [ffff880e74b3ddb0] _cpu_down at ffffffff8142b57b
#14 [ffff880e74b3de10] disable_nonboot_cpus at ffffffff8105ba23
#15 [ffff880e74b3de40] kernel_restart at ffffffff81071dde
#16 [ffff880e74b3de50] sys_reboot at ffffffff81071fc3
#17 [ffff880e74b3df80] system_call_fastpath at ffffffff8144ca12
    RIP: 00007fed35778316  RSP: 00007fff23f7e198  RFLAGS: 00010202
    RAX: 00000000000000a9  RBX: ffffffff8144ca12  RCX: 00007fed35771130
    RDX: 0000000001234567  RSI: 0000000028121969  RDI: fffffffffee1dead
    RBP: 0000000000000000   R8: 0000000000020fd0   R9: 000000000060d120
    R10: 00007fff23f7df40  R11: 0000000000000202  R12: 0000000000000000
    R13: 0000000000000000  R14: 0000000000000001  R15: 0000000000000000
    ORIG_RAX: 00000000000000a9  CS: 0033  SS: 002b

