[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <1205678588.15075.563.camel@localhost>
Date: Sun, 16 Mar 2008 15:43:08 +0100
From: Laurent GUERBY <laurent@...rby.net>
To: linux-kernel@...r.kernel.org
Subject: Re: BUG: soft lockup detected on Phenom with Debian 2.6.24-4
On Sat, 2008-03-08 at 23:00 +0100, Laurent GUERBY wrote:
> Hi,
>
> I have a system with an "AMD64 Phenom 9500" quad core cpu, 4GB RAM,
> "ASUS M3A32 MVP Deluxe wifi" motherboard with latest vendor BIOS
> (0801).
>
> I tried stock debian etch kernel (Debian 2.6.18.dfsg.1-18etch1),
> machine
> froze with no message, debian etch backport kernel same, and then
> Debian 2.6.24-4 from unstable and I got some messages: machine
> is not frozen but some userland processes are (ps says "Dl" state
> with child in "Zs" state) and "events/3" is taking 100% cpu
> according to top:
>
> 18 root 15 -5 0 0 0 R 100 0.0 74:59.46
> events/3
>
> Got to the same state with ubuntu hardy 2.6.24-8-server kernel. All
> kernels are untainted, no X running anyway.
>
> It takes a few hours of doing some stuff, in my case bootstraping or
> testing GCC at -j 4, and then the problem happens.
On 2.6.24-1-amd64 (Debian 2.6.24-4) I got a slightly different
backtrace in /var/log/messages after a few hours of stressing the
machine with compilations, see below. The given process
was stuck and unkillable.
Any idea on what to do/try?
Thanks in advance,
Laurent
BUG: soft lockup - CPU#3 stuck for 11s! [sh:7652]
CPU 3:
Modules linked in: nfs tun nfsd lockd nfs_acl auth_rpcgss sunrpc exportfs ac battery ipv6 it87 hwmon_vid eeprom loop snd_hda_intel snd_pcm snd_timer snd serio_raw i2c_piix4 soundcore snd_page_alloc ath5k pcspkr psmouse i2c_core mac80211 cfg80211 button sky2 evdev ext3 jbd mbcache dm_mirror dm_snapshot dm_mod sd_mod ata_generic firewire_ohci firewire_core atiixp crc_itu_t ehci_hcd ahci pata_marvell ohci_hcd libata scsi_mod generic ide_core thermal processor fan
Pid: 7652, comm: sh Not tainted 2.6.24-1-amd64 #1
RIP: 0010:[<ffffffff8021bc79>] [<ffffffff8021bc79>] __smp_call_function_mask+0x9f/0xbf
RSP: 0018:ffff81000131bd68 EFLAGS: 00000297
RAX: 00000000000008fc RBX: 0000000000000003 RCX: 0000000000000001
RDX: 00000000000008fc RSI: 00000000000000fc RDI: 0000000000000007
RBP: ffff81000000e000 R08: ffff81011bab5982 R09: ffff81000131bce8
R10: ffff81000131bce8 R11: 0000000000000062 R12: 0000000000000000
R13: 00000000000000d0 R14: ffff81000131bcb8 R15: ffff81000131bcb8
FS: 00002b61abbe4ae0(0000) GS:ffff81011bab58c0(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 00000000005e1808 CR3: 0000000001303000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Call Trace:
[<ffffffff8021bc64>] __smp_call_function_mask+0x8a/0xbf
[<ffffffff80275939>] smp_drain_local_pages+0x0/0x5
[<ffffffff80275939>] smp_drain_local_pages+0x0/0x5
[<ffffffff8021bcf7>] smp_call_function_mask+0x5e/0x70
[<ffffffff80276c3a>] __alloc_pages+0x1ef/0x309
[<ffffffff8020be2e>] system_call+0x7e/0x83
[<ffffffff802762a4>] __get_free_pages+0xe/0x32
[<ffffffff802334bb>] copy_process+0xc1/0x148f
[<ffffffff8022d224>] set_next_entity+0x18/0x3a
[<ffffffff8022d2a3>] pick_next_task_fair+0x2d/0x3f
[<ffffffff8020be2e>] system_call+0x7e/0x83
[<ffffffff80234985>] do_fork+0x70/0x204
[<ffffffff8023f29a>] recalc_sigpending+0xe/0x3c
[<ffffffff8023f366>] sigprocmask+0x9e/0xc0
[<ffffffff8020c147>] ptregscall_common+0x67/0xb0
guerby@...04:~$ cat /proc/7652/stat
7652 (sh) R 7633 3857 3069 34816 3069 4194368 114 0 0 0 0 294472 0 0 20 0 1 0 2369557 8212480 332 18446744073709551615 4194304 4924460 140737476255664 18446744073709551615 47698491444962 262400 65538 0 65538 0 0 0 17 3 0 0 0 0 0
guerby@...04:~$ cat /proc/7652/status
Name: sh
State: R (running)
Tgid: 7652
Pid: 7652
PPid: 7633
TracerPid: 27382
Uid: 1000 1000 1000 1000
Gid: 1000 1000 1000 1000
FDSize: 256
Groups: 1000
VmPeak: 8020 kB
VmSize: 8020 kB
VmLck: 0 kB
VmHWM: 1328 kB
VmRSS: 1328 kB
VmData: 1168 kB
VmStk: 92 kB
VmExe: 716 kB
VmLib: 1560 kB
VmPTE: 24 kB
Threads: 1
SigQ: 12/18446744073709551615
SigPnd: 0000000000040100
ShdPnd: 00000000000f4102
SigBlk: 0000000000010002
SigIgn: 0000000000000000
SigCgt: 0000000000010002
CapInh: 0000000000000000
CapPrm: 0000000000000000
CapEff: 0000000000000000
Cpus_allowed: 0000000f
Mems_allowed: 00000000,00000001
voluntary_ctxt_switches: 0
nonvoluntary_ctxt_switches: 1
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists