lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <4B7EDE2F.8030107@gmx.de>
Date:	Fri, 19 Feb 2010 19:53:35 +0100
From:	Axel Uhl <axel.uhl@....de>
To:	Mikael Pettersson <mikpe@...uu.se>
CC:	Tejun Heo <tj@...nel.org>, linux-ide@...r.kernel.org,
	linux-kernel@...r.kernel.org
Subject: Re: Kernel Bug in ATA or SMART area

I now enabled IO/APIC in my kernel. See attached .config. I also enabled 
  pata_via but was unsure which IDE driver to disable. The kernel 
rebooted fine. The following appeared in my syslog when the smartctl 
command spinned up the disk:

Feb 19 18:57:09 homemp3 kernel: ata5.00: exception Emask 0x0 SAct 0x0 
SErr 0x0 action 0x6 frozen
Feb 19 18:57:09 homemp3 kernel: ata5.00: failed command: SMART
Feb 19 18:57:09 homemp3 kernel: ata5.00: cmd 
b0/da:00:00:4f:c2/00:00:00:00:00/00 tag 0
Feb 19 18:57:09 homemp3 kernel:          res 
40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Feb 19 18:57:09 homemp3 kernel: ata5.00: status: { DRDY }
Feb 19 18:57:09 homemp3 kernel: ata5: soft resetting link
Feb 19 18:57:09 homemp3 kernel: ata5.00: configured for UDMA/133
Feb 19 18:57:09 homemp3 kernel: ata5: EH complete


At least it seems that the kernel recovered better from this exception 
than before. In particular, IRQ10 didn't get disabled and so I/O 
continued to work fine. Thanks for the hint.

Would you consider the exception above a serious problem that should be 
taken care of somehow?

Best,
-- Axel

Mikael Pettersson wrote:
> Axel Uhl writes:
>  > Here's what /var/log/dmesg contains:
> ...
>  > Tejun Heo wrote:
>  > > Hello,
>  > > 
>  > > On 02/12/2010 05:46 PM, Axel Uhl wrote:
>  > >> I don't have a /var/log/boot.msg, only a /var/log/boot. Its contents:
>  > > 
>  > > Then please attach output of dmesg after boot.
> 
> Judging from your initial message and this one, it appears that the
> problematic disk is driven by sata_via, but sata_via shares IRQ with
> an awful lot of other junk.
> 
> Suggestions:
> 1. Enable IO/APIC support in the kernel. According to your previous lspci
>    the chipset does have one. This should reduce IRQ sharing and make IRQ
>    handling generally better. If there's still some sharing going on,
>    try moving PCI cards to other slots.
> 2. Move the problematic disk around to e.g. the Promise controller. Do the
>    stray exceptions persist? If so, then it's the disk that's at fault.
>    As long as the libata's error handling recovers, things should work anyway.
> 3. (Unrelated but...) Why use Old IDE to drive the VIA PATA controller?
>    Just use pata_via for that one, enable SCSI SR+SG support, and disable IDE.
> 

-- 
Find Security Certificate at http://www.axel-uhl.de/cgi-bin/cacert.cgi

View attachment ".config" of type "text/plain" (66427 bytes)

Download attachment "smime.p7s" of type "application/x-pkcs7-signature" (2124 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ