[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <49AF9932.2040301@dti2.net>
Date: Thu, 05 Mar 2009 10:19:46 +0100
From: "Jorge Boncompte [DTI2]" <jorge@...2.net>
To: npiggin@...e.de
CC: ext-adrian.hunter@...ia.com, LKML <linux-kernel@...r.kernel.org>
Subject: Re: Error testing ext3 on brd ramdisk
Nick Piggin escribió:
> On Mon, Mar 02, 2009 at 06:42:18PM +0100, Jorge Boncompte [DTI2] wrote:
>> Nick Piggin escribió:
>>> On Fri, Feb 27, 2009 at 07:08:46PM +0100, Jorge Boncompte [DTI2] wrote:
>>>> Hi,
>>>>
>>>> I have added Nick Piggin to the CC: as maintainer of the brd driver.
>>>>
>>>> After switching an embedded distribution that /etc on a ramdisk
>>>> based minix filesystem from 2.6.23.17 to 2.6.29-rcX i am too getting
>>>> errors ant the filesystem is corrupted. Does not happen always. The
>>>> visible effect with text files after reboot is getting the old version of
>>>> the file and "\0"'s at the end.
>>>>
>>>> Did you found a solution?
>>> What architectures are you using? It's possible that brd is missing
>>> a cacheflush. I test it pretty heavily on x86 and no problems, so
>>> this might point to an arch specific problem.
>>>
>>> ---
>>> drivers/block/brd.c | 4 +++-
>>> 1 file changed, 3 insertions(+), 1 deletion(-)
>>>
>>> Index: linux-2.6/drivers/block/brd.c
>>> ===================================================================
>>> --- linux-2.6.orig/drivers/block/brd.c
>>> +++ linux-2.6/drivers/block/brd.c
>>> @@ -275,8 +275,10 @@ static int brd_do_bvec(struct brd_device
>>> if (rw == READ) {
>>> copy_from_brd(mem + off, brd, sector, len);
>>> flush_dcache_page(page);
>>> - } else
>>> + } else {
>>> + flush_dcache_page(page);
>>> copy_to_brd(brd, mem + off, sector, len);
>>> + }
>>> kunmap_atomic(mem, KM_USER0);
>>>
>>> out:
>> Hi, I am on 32bits x86, 2 x Xeon with HT CPUs, but I have seen the
>> same corruption on a KVM/QEMU guest with single emulated CPU.
>>
>> With your patch on top of vanilla 2.6.29-rc3+plus some networking
>> patches I still get corruption sometimes.
>>
>> The script that saves the configuration does...
>>
>> ------------
>> mount -no remount,ro /dev/ram0
>> dd if=/dev/ram0 of=config.bin bs=1k count=1000
>> mount -no remount,rw /dev/ram0
>> md5sum config.bin
>> dd if=config.bin of=/dev/hda1
>> echo $md5sum | dd of=/dev/hda1 bs=1k seek=1100 count=32
>> ------------
>>
>> on system boot
>>
>> ------------
>> CHECK MD5SUM
>> dd if=/dev/hda1 of=/dev/ram0 bs=1k count=1000
>> fsck.minix -a /dev/ram0
>> mount -nt minix /dev/ram0 /etc -o rw
>> ------------
>>
>> I have never seen a MD5 failure on boot, just sometimes the
>> filesystem is corrupted. Kernel config attached.
>
> Hi Jorge,
>
> Well I found and fixed something :) (see other mail) but I don't know
> whether that applies to you here if you're running with a single CPU
> and no preemption. But still, it might be worth trying that patch? I'm
I first saw the corruption on the 2 x Xeon system. I'll try it and let
you know.
> sorry I'm still unable to reproduce a problem with your script
> (although you don't describe how you create the filesystem before
> you remount it).
If the MD5Sum check fails the startup script copy a templated image of
the filesystem contained on a file on another partition to the ramdisk.
>>>From your description, it suggests that the corrupted image is being
> read from /dev/ram0 (becuase the md5sum passes).
No, it is read from /dev/hda1.
> In your script, can you run fsck.minix on config.bin when you first
> create it? What if you unmount /dev/ram0 before copying the image?
Yesterday I did some tests and found that doing...
-----------
umount /etc (/etc is what is mounted from /dev/ram0)
dd if=/dev/zero of=/dev/ram0 bs=1k count=1000
mount /dev/ram0 /etc -t minix -o rw
-----------
...succeds and mounts a corrupted filesystem with the old content. Doing
the same with the all ramdisk driver fails on mount with "no filesystem
found".
If I do...
-----------
umount /etc (/etc is what is mounted from /dev/ram0)
echo 3 > /proc/sys/vm/drop_caches
dd if=/dev/zero of=/dev/ram0 bs=1k count=1000
mount /dev/ram0 /etc -t minix -o rw
----------
... then the mount fails with no filesystem found as it should.
Does this ring any bell? :-)
Regards,
Jorge
--
==============================================================
Jorge Boncompte - Ingenieria y Gestion de RED
DTI2 - Desarrollo de la Tecnologia de las Comunicaciones
--------------------------------------------------------------
C/ Abogado Enriquez Barrios, 5 14004 CORDOBA (SPAIN)
Tlf: +34 957 761395 / FAX: +34 957 450380
==============================================================
- Sin pistachos no hay Rock & Roll...
- Without wicker a basket cannot be made.
==============================================================
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists