Date:	Sun, 05 Jul 2009 12:40:25 -0600
From:	Robert Hancock <hancockrwd@...il.com>
To:	Lars Kunert <lkunert@...-sb.mpg.de>
CC:	linux-kernel@...r.kernel.org
Subject: Re: kernel:Disabling IRQ #23

On 07/05/2009 06:50 AM, Lars Kunert wrote:
> Hi,
> yesterday evening I lost the ssh connection to my server,
> the last message was:
>
>> Message from syslogd@...st-195 at Jul  4 23:29:08 ...
>> kernel:Disabling IRQ #23
>
> I could not renew the connection. After a reboot I found the following
> messages in /var/log/messages (attached below)
>
> The server contains 10 harddisks
> - 2 SAS drives connected as SAS drives
> - 6 SATA drives connected via SAS, and
> - 2 SATA drives connected via SATA
>
> Do these messages point to a single harddisk as the source of the problem?

Something generated spurious interrupts on that IRQ line, so the kernel
disabled the interrupt. From that point on, nothing that shares the line
will function properly. The two handlers registered on IRQ 23 in your
trace are ata_sff_interrupt and usb_hcd_irq, so this is likely a driver
bug in either the USB or the ATA driver.

You'll likely want to try a newer kernel.
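
For illustration, here is a rough Python model of how the kernel decides
to give up on an IRQ line. It is only a sketch, not the kernel code: the
class and method names are made up for the example, and the thresholds
(a window of 100,000 interrupts, more than 99,900 of them unhandled) are
an assumption based on the check in kernel/irq/spurious.c as I remember
it. The real code has further details that are left out here.

```python
# Simplified model of the spurious-interrupt check. Assumed thresholds:
# the line is disabled when more than 99,900 of the last 100,000
# interrupts were not claimed by any registered handler.
WINDOW = 100_000
UNHANDLED_LIMIT = 99_900


class IrqLine:
    """Hypothetical stand-in for the per-IRQ bookkeeping."""

    def __init__(self, number):
        self.number = number
        self.irq_count = 0        # interrupts seen in the current window
        self.irqs_unhandled = 0   # of those, how many nobody claimed
        self.disabled = False

    def note_interrupt(self, handled):
        """Record one interrupt; return True if this call disabled the line."""
        if self.disabled:
            return False
        if not handled:
            self.irqs_unhandled += 1
        self.irq_count += 1
        if self.irq_count < WINDOW:
            return False
        # End of a window: decide whether the line is stuck.
        if self.irqs_unhandled > UNHANDLED_LIMIT:
            # This is the point where "nobody cared" and
            # "Disabling IRQ #N" would be logged.
            self.disabled = True
            return True
        # Otherwise start a fresh window.
        self.irq_count = 0
        self.irqs_unhandled = 0
        return False
```

Feeding such a line 100,000 interrupts with handled=False disables it on
the last one. Once that happens, every device sharing the line stops
receiving interrupts, which would fit the ata3 timeouts that start about
30 seconds later in your log.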

>
>
>> distribution
> Fedora 10 server
>
>> uname -a
> Linux guest-195.mpi-sb.mpg.de 2.6.27.24-170.2.68.fc10.x86_64 #1 SMP Wed
> May 20 22:47:23 EDT 2009 x86_64 x86_64 x86_64 GNU/Linux
>
>> cat /var/log/messages
> # at this point I lost the ssh connection to the server
> Jul  4 23:29:08 guest-195 kernel: irq 23: nobody cared (try booting with
> the "irqpoll" option)
> Jul  4 23:29:08 guest-195 kernel: Pid: 0, comm: swapper Tainted:
> P          2.6.27.21-170.2.56.fc10.x86_64 #1
> Jul  4 23:29:08 guest-195 kernel:
> Jul  4 23:29:08 guest-195 kernel: Call Trace:
> Jul  4 23:29:08 guest-195 kernel:<IRQ>   [<ffffffff81083523>]
> __report_bad_irq+0x38/0x7c
> Jul  4 23:29:08 guest-195 kernel: [<ffffffff8108376f>]
> note_interrupt+0x208/0x26d
> Jul  4 23:29:08 guest-195 kernel: [<ffffffff81083e9c>]
> handle_fasteoi_irq+0xbb/0xeb
> Jul  4 23:29:08 guest-195 kernel: [<ffffffff810130ce>] do_IRQ+0xf7/0x169
> Jul  4 23:29:08 guest-195 kernel: [<ffffffff81010963>]
> ret_from_intr+0x0/0x2e
> Jul  4 23:29:08 guest-195 kernel:<EOI>   [<ffffffff810173a9>] ?
> mwait_idle+0x3e/0x4f
> Jul  4 23:29:08 guest-195 kernel: [<ffffffff810173a0>] ?
> mwait_idle+0x35/0x4f
> Jul  4 23:29:08 guest-195 kernel: [<ffffffff8100f2a7>] ? cpu_idle+0xb2/0x10b
> Jul  4 23:29:08 guest-195 kernel: [<ffffffff8132e04b>] ?
> start_secondary+0x16e/0x173
> Jul  4 23:29:08 guest-195 kernel:
> Jul  4 23:29:08 guest-195 kernel: handlers:
> Jul  4 23:29:08 guest-195 kernel: [<ffffffff812256e5>]
> (ata_sff_interrupt+0x0/0xc2)
> Jul  4 23:29:08 guest-195 kernel: [<ffffffff8123c306>]
> (usb_hcd_irq+0x0/0xb3)
> Jul  4 23:29:08 guest-195 kernel: Disabling IRQ #23
> Jul  4 23:29:39 guest-195 kernel: ata3.00: exception Emask 0x0 SAct 0x0
> SErr 0x0 action 0x6 frozen
> Jul  4 23:29:39 guest-195 kernel: ata3.00: cmd
> a0/00:00:00:00:00/00:00:00:00:00/a0 tag 0
> Jul  4 23:29:39 guest-195 kernel:         cdb 00 00 00 00 00 00 00 00
> 00 00 00 00 00 00 00 00
> Jul  4 23:29:39 guest-195 kernel:         res
> 40/00:02:00:08:00/00:00:00:00:00/a0 Emask 0x4 (timeout)
> Jul  4 23:29:39 guest-195 kernel: ata3.00: status: { DRDY }
> Jul  4 23:29:39 guest-195 kernel: ata3: soft resetting link
> Jul  4 23:29:44 guest-195 kernel: ata3.01: qc timeout (cmd 0x27)
> Jul  4 23:29:44 guest-195 kernel: ata3.01: failed to read native max
> address (err_mask=0x4)
> Jul  4 23:29:44 guest-195 kernel: ata3.01: HPA support seems broken,
> skipping HPA handling
> Jul  4 23:29:44 guest-195 kernel: ata3.01: revalidation failed (errno=-5)
> Jul  4 23:29:44 guest-195 kernel: ata3: soft resetting link
> Jul  4 23:29:44 guest-195 kernel: ata3.00: configured for UDMA/100
> Jul  4 23:29:44 guest-195 kernel: ata3.01: configured for UDMA/133
> Jul  4 23:29:44 guest-195 kernel: ata3: EH complete
>
> # the log continues with the following message, repeated every 30 seconds...
>
> Jul  4 23:30:14 guest-195 kernel: ata3.00: exception Emask 0x0 SAct 0x0
> SErr 0x0 action 0x6 frozen
> Jul  4 23:30:14 guest-195 kernel: ata3.00: cmd
> a0/00:00:00:00:00/00:00:00:00:00/a0 tag 0
> Jul  4 23:30:14 guest-195 kernel:         cdb 00 00 00 00 00 00 00 00
> 00 00 00 00 00 00 00 00
> Jul  4 23:30:14 guest-195 kernel:         res
> 40/00:02:00:08:00/00:00:00:00:00/a0 Emask 0x4 (timeout)
> Jul  4 23:30:14 guest-195 kernel: ata3.00: status: { DRDY }
> Jul  4 23:30:14 guest-195 kernel: ata3: soft resetting link
> Jul  4 23:30:14 guest-195 kernel: ata3.00: configured for UDMA/100
> Jul  4 23:30:14 guest-195 kernel: ata3.01: configured for UDMA/133
> Jul  4 23:30:14 guest-195 kernel: ata3: EH complete
>
>
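
To see at a glance which drivers share a disabled line, the "handlers:"
part of such a report can be pulled out of the log with a few lines of
Python. This is a minimal sketch: the function name shared_handlers and
the regular expressions are my own, and it assumes each log entry is on
one line, as in /var/log/messages, rather than wrapped as in the mail
above.

```python
import re

# Patterns for the three kinds of lines we care about in the report.
NOBODY_CARED = re.compile(r"irq (\d+): nobody cared")
HANDLER = re.compile(r"\[<[0-9a-f]+>\]\s*\((\w+)\+0x[0-9a-f]+/0x[0-9a-f]+\)")
DISABLING = re.compile(r"Disabling IRQ #(\d+)")

# Sample taken from the log quoted above, with the mail wrapping undone.
SAMPLE = """\
Jul  4 23:29:08 guest-195 kernel: irq 23: nobody cared (try booting with the "irqpoll" option)
Jul  4 23:29:08 guest-195 kernel: handlers:
Jul  4 23:29:08 guest-195 kernel: [<ffffffff812256e5>] (ata_sff_interrupt+0x0/0xc2)
Jul  4 23:29:08 guest-195 kernel: [<ffffffff8123c306>] (usb_hcd_irq+0x0/0xb3)
Jul  4 23:29:08 guest-195 kernel: Disabling IRQ #23
"""


def shared_handlers(log_text):
    """Map each reported IRQ number to the handler names listed for it."""
    result = {}
    current = None        # IRQ number of the report being read, if any
    in_handlers = False   # True once the "handlers:" line has been seen
    for line in log_text.splitlines():
        match = NOBODY_CARED.search(line)
        if match:
            current = int(match.group(1))
            result[current] = []
            in_handlers = False
            continue
        if current is None:
            continue
        if "handlers:" in line:
            in_handlers = True
            continue
        if DISABLING.search(line):
            # End of this report.
            current = None
            in_handlers = False
            continue
        if in_handlers:
            match = HANDLER.search(line)
            if match:
                result[current].append(match.group(1))
    return result
```

Calling shared_handlers(SAMPLE) gives ata_sff_interrupt and usb_hcd_irq
for IRQ 23, i.e. the ATA and USB drivers mentioned above.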
