[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20110831194826.GA28483@msgid.wurtel.net>
Date: Wed, 31 Aug 2011 21:48:26 +0200
From: Paul Slootman <paul@...tel.net>
To: Maciej Rutecki <maciej.rutecki@...il.com>
Cc: linux-btrfs@...r.kernel.org, chris.mason@...cle.com,
linux-kernel@...r.kernel.org
Subject: Re: linux 3.0: kernel BUG at fs/btrfs/inode.c:1584!
On Wed 31 Aug 2011, Maciej Rutecki wrote:
> On czwartek, 25 sierpnia 2011 o 11:32:42 Paul Slootman wrote:
>>
>> I got the following message just now:
>>
>> Aug 25 09:53:14 duimalot kernel: kernel BUG at fs/btrfs/inode.c:1584!
>> Aug 25 09:53:14 duimalot kernel: invalid opcode: 0000 [#1] SMP
>> Aug 25 09:53:14 duimalot kernel: CPU 1
>> Aug 25 09:53:14 duimalot kernel: Modules linked in: loop 8021q acpi_cpufreq mperf e1000 sg sr_mod e1000e cdrom processor container button evdev thermal_sys i3000_edac arcmsr edac_core
>> Aug 25 09:53:14 duimalot kernel:
>> Aug 25 09:53:14 duimalot kernel: Pid: 12162, comm: btrfs-fixup-0 Tainted: G W 3.0.0-vs2.3.1-pre8 #2 Supermicro PDSM4+/PDSM4+
>> Aug 25 09:53:14 duimalot kernel: RIP: 0010:[<ffffffff812220e8>] [<ffffffff812220e8>] btrfs_writepage_fixup_worker+0x158/0x160
>> Aug 25 09:53:14 duimalot kernel: RSP: 0018:ffff880117f13df0 EFLAGS: 00010246
>> Aug 25 09:53:14 duimalot kernel: RAX: 0000000000000000 RBX: 000000000a45b000 RCX: 0000000000000000
>> Aug 25 09:53:14 duimalot kernel: RDX: 000000025a32b7f4 RSI: 000000000a45b000 RDI: ffff880016ca90a0
>> Aug 25 09:53:14 duimalot kernel: RBP: ffffea00038a4058 R08: ffffea00002c30a8 R09: ffffffff8124290f
>> Aug 25 09:53:14 duimalot kernel: R10: 0000000000000010 R11: 0000000000000000 R12: 000000000a45bfff
>> Aug 25 09:53:14 duimalot kernel: R13: ffff880016ca9178 R14: ffff880016ca9010 R15: 0000000000000000
>> Aug 25 09:53:14 duimalot kernel: FS: 0000000000000000(0000) GS:ffff88011fd00000(0000) knlGS:0000000000000000
>> Aug 25 09:53:14 duimalot kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
>> Aug 25 09:53:14 duimalot kernel: CR2: 000000001b0a62f8 CR3: 0000000025608000 CR4: 00000000000006e0
>> Aug 25 09:53:14 duimalot kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
>> Aug 25 09:53:14 duimalot kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
>> Aug 25 09:53:14 duimalot kernel: Process btrfs-fixup-0 (pid: 12162, threadinfo ffff880117f12000, task ffff880118920760)
>> Aug 25 09:53:14 duimalot kernel: Stack:
>> Aug 25 09:53:14 duimalot kernel: ffff880056cbb420 0000000000000000 ffff8800748aec00 ffff880056cbb428
>> Aug 25 09:53:14 duimalot kernel: ffff880056cbb450 ffff8801064aa840 ffff8801064aa890 ffff8801064aa858
>> Aug 25 09:53:14 duimalot kernel: ffff8801064aa848 ffffffff8124d419 ffff880117f13e90 ffff880117f13e80
>> Aug 25 09:53:14 duimalot kernel: Call Trace:
>> Aug 25 09:53:14 duimalot kernel: [<ffffffff8124d419>] ? worker_loop+0x169/0x520
>> Aug 25 09:53:14 duimalot kernel: [<ffffffff8124d2b0>] ? btrfs_queue_worker+0x2f0/0x2f0
>> Aug 25 09:53:14 duimalot kernel: [<ffffffff81065956>] ? kthread+0x96/0xb0
>> Aug 25 09:53:14 duimalot kernel: [<ffffffff81495394>] ? kernel_thread_helper+0x4/0x10
>> Aug 25 09:53:14 duimalot kernel: [<ffffffff810658c0>] ? kthread_worker_fn+0x180/0x180
>> Aug 25 09:53:14 duimalot kernel: [<ffffffff81495390>] ? gs_change+0xb/0xb
>> Aug 25 09:53:14 duimalot kernel: Code: 5c 41 5d 41 5e 41 5f c3 0f 1f 00 41 b8 50 00 00 00 48 8d 4c 24 10 4c 89 e2 48 89 de 4c 89 f7 e8 7f 1e 02 00 eb ba 0f 1f 44 00 00 <0f> 0b eb fe 0f 1f 40 00 41 56 49 89 fe 41 55 41 54 55 53 48 83
>> Aug 25 09:53:14 duimalot kernel: RIP [<ffffffff812220e8>] btrfs_writepage_fixup_worker+0x158/0x160
>> Aug 25 09:53:14 duimalot kernel: RSP <ffff880117f13df0>
>> Aug 25 09:53:14 duimalot kernel: ---[ end trace 948b88b1d8dd7dc8 ]---
>>
>> The system has been running previously just fine for more than a year
>> before I upgraded it to linux 3.0 and added a btrfs filesystem.
>> This is a storage system, and the btrfs filesystem is 41TB in size.
>> Currently it's 17% full.
>>
>> If any other info is needed, let me know.
>
> Cany you check that eg 2.6.39 works OK and/or try bisecton?
After the above message, creating snapshots hung in state D, so I
rebooted.
After mounting the btrfs filesystem again, and trying 'ls' on it, the
kernel gave these messages:
btrfs: truncated 1 orphans
btrfs: truncated 1 orphans
btrfs: truncated 2 orphans
btrfs: truncated 3 orphans
btrfs: truncated 4 orphans
btrfs: truncated 5 orphans
btrfs: truncated 6 orphans
btrfs: truncated 7 orphans
btrfs: truncated 8 orphans
btrfs: truncated 8 orphans
btrfs: truncated 8 orphans
btrfs: truncated 9 orphans
btrfs: truncated 10 orphans
btrfs: truncated 10 orphans
btrfs: truncated 11 orphans
btrfs: truncated 11 orphans
btrfs: truncated 11 orphans
btrfs: truncated 12 orphans
btrfs: truncated 13 orphans
btrfs: truncated 14 orphans
btrfs: truncated 14 orphans
btrfs: truncated 14 orphans
btrfs: truncated 5 orphans
btrfs: truncated 6 orphans
btrfs: truncated 7 orphans
btrfs: truncated 8 orphans
btrfs: truncated 8 orphans
btrfs: truncated 9 orphans
btrfs: truncated 10 orphans
btrfs: truncated 10 orphans
btrfs: truncated 10 orphans
btrfs: truncated 10 orphans
btrfs: truncated 10 orphans
btrfs: truncated 11 orphans
btrfs: truncated 12 orphans
btrfs: truncated 12 orphans
btrfs: truncated 13 orphans
btrfs: free space inode generation (0) did not match free space cache generation (507)
btrfs: failed to load free space cache for block group 263364542464
But after that the filesystem is working just fine again.
One thing I failed to mention: a thing that bugs me a bit when making
snapshots, is that an empty directory is created for every snapshot that
already existed on that subvolume already. On another system I found
that I could delete those directories inside the snapshots just fine.
I had tried that on this system, but that gave the following curious
results:
# mount | grep mirror
/dev/sdc on /extra/vservers/debian-mirror/pub type btrfs (rw,compress=lzo,space_cache,subvol=debian-mirror)
# ls /extra/vservers/debian-mirror/pub
20050101 20090206 20090409 20090611 20090812 20091013 20091214 20100214 20100417 20101121
20050401 20090207 20090410 20090612 20090813 20091014 20091215 20100215 20100418 20101123
20050501 20090208 20090411 20090613 20090814 20091015 20091216 20100216 20100419 20101124
20050601 20090209 20090412 20090614 20090815 20091016 20091217 20100217 20100420 20101127
...
# ls -l /extra/vservers/debian-mirror/pub/20050401
ls: cannot access /extra/vservers/debian-mirror/pub/20050401/20050101: No such file or directory
total 3
d????????? ? ? ? ? ? 20050101
-rwxr-xr-x 1 root root 3094 Jul 10 2007 linkit
drwxr-xr-x 1 108 nogroup 98 Jul 26 2007 mirrors
Nothing can be done with those un-stat-able directories. Not that I'm
very worried about this, but it might be related. Any it's wrong in any
case :-)
There's been a lot of IO on that filesystem since the reboot (6 days
now), and everything seems to be working fine.
# cat /sys/block/sdc/stat
152272235 8500861 4097794632 220201180 35993505 1966359 3228040504 500809840 0 118998200 720954250
thanks,
Paul
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists