[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <200712081920.28748.m.kozlowski@tuxland.pl>
Date: Sat, 8 Dec 2007 19:20:28 +0100
From: Mariusz Kozlowski <m.kozlowski@...land.pl>
To: Andrew Morton <akpm@...ux-foundation.org>,
David Miller <davem@...emloft.net>
Cc: linux-kernel@...r.kernel.org, sparclinux@...r.kernel.org
Subject: Re: 2.6.24-rc4-mm1: some issues on sparc64
Hello,
The box is sun ultra 60 (dual sparc64). This was caught when
system (gentoo) was emerging some package.
[27006.402237] kernel BUG at fs/jbd/transaction.c:1894!
[27006.402268] \|/ ____ \|/
[27006.402274] "@'/ .. \`@"
[27006.402279] /_| \__/ |_\
[27006.402285] \__U_/
[27006.402298] rm(4713): Kernel bad sw trap 5 [#1]
[27006.402538] TSTATE: 0000009911009605 TPC: 000000000053b1cc TNPC: 000000000053b1d0 Y: 00000000 Not tainted
[27006.402579] TPC: <journal_invalidatepage+0x3d4/0x460>
[27006.402593] g0: 0000000000000002 g1: 0000000000000000 g2: 0000000000000001 g3: fffff800a7d90000
[27006.402610] g4: fffff800b54ea460 g5: fffff8007f832000 g6: fffff800a7d90000 g7: 000000000076d868
[27006.402627] o0: 000000000072b660 o1: 0000000000000766 o2: 0000000000000002 o3: 0000000000000001
[27006.402644] o4: 00000000008a2940 o5: 0000000000000000 sp: fffff800a7d92c91 ret_pc: 000000000053b1c4
[27006.402665] RPC: <journal_invalidatepage+0x3cc/0x460>
[27006.402679] l0: fffff800afbf4070 l1: 000000000069511c l2: 0000000000002000 l3: 0000000000000000
[27006.402696] l4: 0000000000000001 l5: fffff800ba4cb730 l6: fffff800bf1cd338 l7: 0000000000000001
[27006.402713] i0: fffff800bf1cd000 i1: 0000000201db2708 i2: 0000000000000000 i3: 0000000000727000
[27006.402730] i4: 0000000000200000 i5: fffff800bf1cd028 i6: fffff800a7d92d51 i7: 0000000000529254
[27006.402763] I7: <ext3_invalidatepage+0x3c/0x60>
[27006.402776] Caller[0000000000529254]: ext3_invalidatepage+0x3c/0x60
[27006.402800] Caller[00000000004b22fc]: do_invalidatepage+0x24/0x60
[27006.402826] Caller[00000000004b29c4]: truncate_complete_page+0x6c/0x80
[27006.402849] Caller[00000000004b2a6c]: truncate_inode_pages_range+0x94/0x440
[27006.402872] Caller[00000000004b2e2c]: truncate_inode_pages+0x14/0x20
[27006.402894] Caller[0000000000529888]: ext3_delete_inode+0x10/0x160
[27006.402918] Caller[00000000004e7ca0]: generic_delete_inode+0x88/0x120
[27006.402949] Caller[00000000004e7e60]: generic_drop_inode+0x128/0x1c0
[27006.402971] Caller[00000000004e75d4]: iput+0x7c/0xa0
[27006.402992] Caller[00000000004dd680]: do_unlinkat+0x108/0x1a0
[27006.403024] Caller[00000000004dd884]: sys_unlinkat+0x2c/0x60
[27006.403047] Caller[00000000004062d4]: linux_sparc_syscall32+0x3c/0x40
[27006.403081] Caller[00000000f7e7d0ec]: 0xf7e7d0f4
[27006.403102] Instruction DUMP: 92102766 7ffbbeaf 90122260 <91d02005> 92102780 7ffbbeab 90122260 91d02005 7ffbbea8
After this happend, one (out of two) cpu got consumed (in kernel space) trying to
complete io. Process stuck in D state, wchan says it was in sync_buffer() which
you can see also in 'SysRq : Show Blocked State' below.
[27422.874858] SysRq : Show Blocked State
[27422.877086] task PC stack pid father
[27422.877143] rm D 00000000004f8f68 0 4966 4860
[27422.877160] Call Trace:
[27422.877167] [0000000000692840] io_schedule+0x28/0x40
[27422.877182] [00000000004f8f68] sync_buffer+0x50/0x60
[27422.877198] [0000000000692a58] __wait_on_bit_lock+0x60/0xa0
[27422.877213] [0000000000692ae4] out_of_line_wait_on_bit_lock+0x4c/0x60
[27422.877228] [00000000004f9328] __lock_buffer+0x30/0x40
[27422.877242] [000000000053b024] journal_invalidatepage+0x22c/0x460
[27422.877268] [0000000000529254] ext3_invalidatepage+0x3c/0x60
[27422.877297] [00000000004b22fc] do_invalidatepage+0x24/0x60
[27422.877316] [00000000004b29c4] truncate_complete_page+0x6c/0x80
[27422.877332] [00000000004b2a6c] truncate_inode_pages_range+0x94/0x440
[27422.877349] [00000000004b2e2c] truncate_inode_pages+0x14/0x20
[27422.877364] [0000000000529888] ext3_delete_inode+0x10/0x160
[27422.877381] [00000000004e7ca0] generic_delete_inode+0x88/0x120
[27422.877405] [00000000004e7e60] generic_drop_inode+0x128/0x1c0
[27422.877421] [00000000004e75d4] iput+0x7c/0xa0
[27422.877435] [00000000004dd680] do_unlinkat+0x108/0x1a0
The downside is that it is unclear to me how to reproduce that - it just happens sometimes.
Also from time to time I get warnings about tcp_fastretrans_alert(), but it seems they do no harm.
[30014.779310] WARNING: at net/ipv4/tcp_input.c:2518 tcp_fastretrans_alert()
[30014.781630] Call Trace:
[30014.783976] [00000000006551c8] tcp_fastretrans_alert+0x70/0xe00
[30014.786312] [0000000000657c60] tcp_ack+0x988/0x10c0
[30014.788702] [000000000065bd80] tcp_rcv_established+0x408/0x840
[30014.791074] [00000000006634dc] tcp_v4_do_rcv+0xe4/0x4a0
[30014.793440] [000000000066632c] tcp_v4_rcv+0xa34/0xb20
[30014.795762] [0000000000643a10] ip_local_deliver+0xd8/0x2c0
[30014.798102] [0000000000643ed4] ip_rcv+0x2dc/0x640
[30014.800431] [000000000062424c] netif_receive_skb+0x334/0x400
[30014.802762] [0000000000627228] process_backlog+0x90/0x140
[30014.805097] [0000000000626d28] net_rx_action+0x190/0x260
[30014.807462] [0000000000475ea8] __do_softirq+0x90/0x140
[30014.809794] [0000000000475fe0] do_softirq+0x88/0xa0
[30014.812134] [000000000047608c] irq_exit+0x94/0xc0
[30014.814453] [000000000042f53c] handler_irq+0xa4/0xc0
[30014.816800] [0000000000426f30] sunos_sys_table+0x560/0x728
[30014.819133] [00000000004286d8] cpu_idle+0x20/0xe0
Linux sparc64 2.6.24-rc4-mm1 #2 SMP PREEMPT Sat Dec 8 10:59:35 CET 2007 sparc64 sun4u TI UltraSparc II (BlackBird) GNU/Linux
Gnu C 4.1.1
Gnu make 3.81
binutils 2.18
util-linux 2.12r
mount 2.12r
module-init-tools 3.2.2
e2fsprogs 1.40.2
Linux C Library 2.5
Dynamic linker (ldd) 2.5
Procps 3.2.7
Net-tools 1.60
Kbd 1.13
Sh-utils 6.9
udev 104
Modules Loaded sr_mod cdrom sg
Regards,
Mariusz
View attachment "sparc64-2.6.24-rc4-mm1.config" of type "text/plain" (25619 bytes)
Powered by blists - more mailing lists