lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <alpine.DEB.1.10.0809220926240.23159@p34.internal.lan>
Date:	Mon, 22 Sep 2008 09:26:33 -0400 (EDT)
From:	Justin Piszcz <jpiszcz@...idpixels.com>
To:	Brian Rademacher <rad@...files.net>
cc:	linux-ide@...r.kernel.org, linux-raid@...r.kernel.org,
	linux-kernel@...r.kernel.org
Subject: Re: exception Emask 0x0 SAct 0x1 / SErr 0x0 action 0x2 frozen

>From Brian's earlier e-mail:

> > I filed this kernel bug:
> > https://bugzilla.redhat.com/show_bug.cgi?id=462425


On Mon, 22 Sep 2008, Justin Piszcz wrote:

> I could not agree more.
>
> CC'ing the relevant mailing lists to see if someone out there has any idea 
> what more we could do as this has been affecting you (more so than myself, 
> but I would still like to get some sort of resolution as well, as it still 
> happens to me too):
>
> Similar, but not the same issue:
>
> Sep 17 20:20:05 p34 kernel: [1422169.440538] ata5.00: exception Emask 0x0 
> SAct 0x0 SErr 0x0 action 0x6 frozen
> Sep 17 20:20:05 p34 kernel: [1422169.440549] ata5.00: cmd 
> b0/d8:00:00:4f:c2/00:00:00:00:00/00 tag 0
> Sep 17 20:20:05 p34 kernel: [1422169.440551]          res 
> 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> Sep 17 20:20:05 p34 kernel: [1422169.440556] ata5.00: status: { DRDY }
> Sep 17 20:20:05 p34 kernel: [1422169.440561] ata5: hard resetting link
> Sep 17 20:20:06 p34 kernel: [1422169.744980] ata5: SATA link up 3.0 Gbps 
> (SStatus 123 SControl 300)
> Sep 17 20:20:06 p34 kernel: [1422169.770448] ata5.00: configured for UDMA/133
> Sep 17 20:20:06 p34 kernel: [1422169.770461] ata5: EH complete
>
> (2.6.23.3) above
>
> On Mon, 22 Sep 2008, Brian Rademacher wrote:
>
>> Works fine...Also works under heavy load with only 4 drives.  I could only 
>> get it to fail by doing a raid resync with 4 drives, except for the newer 
>> kernel, which dies pretty easily..
>> 
>> What is really frustrating about it is that short of the bugzilla bug I 
>> submitted, I don't know who would be willing to listen...A lot of the 
>> google hits when searching "action 0x2 frozen" are related to a particular 
>> CDROM drive, or general hardware failure.  I really don't think that is the 
>> case here, but I bet most of the kernel people think the same thing, so 
>> they have no reason to care...
>> 
>> 
>> Sent: Monday, September 22, 2008 7:04 AM
>> Subject: Re: Hardware RAID
>> 
>> 
>>> What about if you just 'stress' one drive?
>>> 
>>> 1. dd if=/dev/sda of=/dev/null bs=1M &
>>> Does it do it?
>>> 2. Same thing for sdb?
>>> 
>>> Justin.
>>> 
>>> On Mon, 22 Sep 2008, Brian Rademacher wrote:
>>> 
>>>> I killed smartd for testing.  Other than that, it seems entirely load 
>>>> based. Anything disk intensive (backups, raid resync, a bunch of spam 
>>>> comes in at once, etc.) makes it fail...
>>>> 
>>>> Sent: Monday, September 22, 2008 6:29 AM
>>>> Subject: Re: Hardware RAID
>>>> 
>>>> 
>>>>> While the error happens for me as well it does NOT happen with that much 
>>>>> consistency, if I were you, I would start testing different kernels and 
>>>>> run it in single user mode (or as close to it as you can) to see if you 
>>>>> can narrow down what is causing it, also boot knoppix and see if it 
>>>>> occurs-- ?
>>>>> 
>>>>> Justin.
>>>>> 
>>>>> On Mon, 22 Sep 2008, Brian Rademacher wrote:
>>>>> 
>>>>>> Doesn't look like a very powerful RAID card, so I may pass on it.  I 
>>>>>> don't think it will have the BW to run as fast as the software RAID 
>>>>>> currently does since it's only a 64bit/66mhz PCI slot...
>>>>>> 
>>>>>> I hate to do the hardware RAID thing, but this error is killing me:
>>>>>> Sep 21 12:05:19 radfiles kernel: ata1.00: exception Emask 0x0 SAct 0x1 
>>>>>> SErr 0x0 action 0x2 frozen
>>>>>> Sep 21 12:32:12 radfiles kernel: ata1.00: exception Emask 0x0 SAct 0x1 
>>>>>> SErr 0x0 action 0x2 frozen
>>>>>> Sep 21 12:41:34 radfiles kernel: ata1.00: exception Emask 0x0 SAct 0x1 
>>>>>> SErr 0x0 action 0x2 frozen
>>>>>> Sep 21 12:58:22 radfiles kernel: ata1.00: exception Emask 0x0 SAct 0x1 
>>>>>> SErr 0x0 action 0x2 frozen
>>>>>> Sep 21 13:11:04 radfiles kernel: ata1.00: exception Emask 0x0 SAct 0x1 
>>>>>> SErr 0x0 action 0x2 frozen
>>>>>> Sep 21 13:23:55 radfiles kernel: ata1.00: exception Emask 0x0 SAct 0x1 
>>>>>> SErr 0x0 action 0x2 frozen
>>>>>> Sep 21 13:54:23 radfiles kernel: ata1.00: exception Emask 0x0 SAct 0x1 
>>>>>> SErr 0x0 action 0x2 frozen
>>>>>> Sep 21 15:15:04 radfiles kernel: ata1.00: exception Emask 0x0 SAct 0x1 
>>>>>> SErr 0x0 action 0x2 frozen
>>>>>> Sep 21 15:44:06 radfiles kernel: ata1.00: exception Emask 0x0 SAct 0x1 
>>>>>> SErr 0x0 action 0x2 frozen
>>>>>> Sep 21 21:15:12 radfiles kernel: ata1.00: exception Emask 0x0 SAct 0x1 
>>>>>> SErr 0x0 action 0x2 frozen
>>>>>> 
>>>>>> And at this point, I can either regress to a 4 drive RAID and don't 
>>>>>> update the kernel, or move forward with hardware...
>>>>>> 
>>>>>> I don't see a fix coming any time soon, but maybe I'll try one of the 
>>>>>> latest F10 kernels just to see if anything has changed...
>>>>>> 
>>>>>> 
>>>>>> ----- Original Message ----- From: "Justin Piszcz" Sent: Monday, 
>>>>>> September 22, 2008 2:05 AM
>>>>>> Subject: Re: Hardware RAID
>>>>>> 
>>>>>> 
>>>>>>> 
>>>>>>> 
>>>>>>> On Sun, 21 Sep 2008, Brian Rademacher wrote:
>>>>>>> 
>>>>>>>> The RAID gods must have been thinking about me.  My MB has one of 
>>>>>>>> these funny slots and supports ZCR, so for the price I'm going to 
>>>>>>>> jump ship. I would guess (and hope) this solves the problem, 
>>>>>>>> especially since I'll have to reconstruct the entire array...
>>>>>>>> 
>>>>>>>> http://cgi.ebay.com/2113600-R-Adaptec-Serial-ATA-RAID-2025SA-Storage_W0QQitemZ250295938636QQihZ015QQcategoryZ167QQssPageNameZWDVWQQrdZ1QQcmdZViewItem
>>>>>>> 
>>>>>>> Hm cool-- let me know how it goes.
>
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ