lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Date:	Thu, 25 Aug 2011 11:49:35 -0300
From:	Rogério Brito <rbrito@....usp.br>
To:	Bjorn Helgaas <bhelgaas@...gle.com>
Cc:	Linus Torvalds <torvalds@...ux-foundation.org>,
	Ivan Kokshaysky <ink@...assic.park.msu.ru>,
	Thomas Meyer <thomas@...3r.de>,
	Gary Hade <garyhade@...ibm.com>, Ingo Molnar <mingo@...e.hu>,
	Stefan Richter <stefanr@...6.in-berlin.de>,
	linux-kernel@...r.kernel.org,
	Jesse Barnes <jbarnes@...tuousgeek.org>,
	bugme-daemon@...zilla.kernel.org
Subject: Re: [Bug 41622] [REGRESSION][BISECTED] Notebook crashes upon
 detecting the PCI subsystem with kernels >= 2.6.24-rc7

Hi everybody.

2011/8/24 Bjorn Helgaas <bhelgaas@...gle.com>:
>> If there is anything else that you would like me to change or to
>> provide any extra information, then please let me know and I will do
>> my best.
>
> I'd still like to see a dmesg log with no arguments (remove the
> "acpi=off pnpbios=off noapic nolapic" arguments).  Your machine is new
> enough that we'll use PCI _CRS by default, and I'd like to make sure
> we're doing the right thing.

OK. I put them there one by one, but let me report what I get with
those disabled, but *with* that small patch applied:

* "acpi=off pnpbios=off noapic": it works well, no problems with
dropping the nolapic.
* "acpi=off noapic": it works well, with no problems dropping
"pnpbios=off, aside from the "[Firmware Bug]: powernow-k8: No PSB or
ACPI _PSS objects" message.
* "acpi=off": does not work well---it books OK, but some accesses to
disk get very long and I get the following in the dmesg log:

[  243.389359] ata1: EH in SWNCQ mode,QC:qc_active 0x7 sactive 0x7
[  243.389367] ata1: SWNCQ:qc_active 0x3 defer_bits 0x4 last_issue_tag 0x1
[  243.389368]   dhfis 0x3 dmafis 0x0 sdbfis 0x0
[  243.389374] ata1: ATA_REG 0x40 ERR_REG 0x0
[  243.389377] ata1: tag : dhfis dmafis sdbfis sactive
[  243.389380] ata1: tag 0x0: 1 0 0 1
[  243.389383] ata1: tag 0x1: 1 0 0 1
[  243.389398] ata1.00: exception Emask 0x0 SAct 0x7 SErr 0x0 action 0x6 frozen
[  243.389403] ata1.00: failed command: READ FPDMA QUEUED
[  243.389411] ata1.00: cmd 60/08:00:c9:10:be/00:00:07:00:00/40 tag 0
ncq 4096 in
[  243.389413]          res 40/00:00:00:4f:c2/00:00:00:00:00/00 Emask
0x4 (timeout)
[  243.389417] ata1.00: status: { DRDY }
[  243.389420] ata1.00: failed command: READ FPDMA QUEUED
[  243.389428] ata1.00: cmd 60/08:08:19:10:be/00:00:07:00:00/40 tag 1
ncq 4096 in
[  243.389430]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask
0x4 (timeout)
[  243.389433] ata1.00: status: { DRDY }
[  243.389437] ata1.00: failed command: WRITE FPDMA QUEUED
[  243.389444] ata1.00: cmd 61/58:10:79:4f:62/00:00:06:00:00/40 tag 2
ncq 45056 out
[  243.389446]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask
0x4 (timeout)
[  243.389450] ata1.00: status: { DRDY }
[  243.389457] ata1: hard resetting link
[  243.389460] ata1: nv: skipping hardreset on occupied port
[  244.289547] ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[  244.306186] Clocksource tsc unstable (delta = 429752755 ns)
[  244.306214] Switching to clocksource jiffies
[  244.309862] ata1.00: configured for UDMA/100
[  244.309862] ata1.00: device reported invalid CHS sector 0
[  244.309862] ata1.00: device reported invalid CHS sector 0
[  244.309862] ata1.00: device reported invalid CHS sector 0
[  244.309862] sd 0:0:0:0: [sda]  Result: hostbyte=DID_OK
driverbyte=DRIVER_SENSE
[  244.309862] sd 0:0:0:0: [sda]  Sense Key : Aborted Command
[current] [descriptor]
[  244.309862] Descriptor sense data with sense descriptors (in hex):
[  244.309862]         72 0b 00 00 00 00 00 0c 00 0a 80 00 00 00 00 00
[  244.309862]         00 00 00 00
[  244.309862] sd 0:0:0:0: [sda]  Add. Sense: No additional sense information
[  244.309862] sd 0:0:0:0: [sda] CDB: Read(10): 28 00 07 be 10 c9 00 00 08 00
[  244.309862] end_request: I/O error, dev sda, sector 129896649
[  244.309862] ata1: EH complete
[  293.834434] ata1: EH in SWNCQ mode,QC:qc_active 0x7FF sactive 0x7FF
[  293.834442] ata1: SWNCQ:qc_active 0x1F defer_bits 0x7E0 last_issue_tag 0x4
[  293.834444]   dhfis 0x1F dmafis 0x0 sdbfis 0x0
[  293.834449] ata1: ATA_REG 0x40 ERR_REG 0x0
[  293.834452] ata1: tag : dhfis dmafis sdbfis sactive
[  293.834456] ata1: tag 0x0: 1 0 0 1
[  293.834459] ata1: tag 0x1: 1 0 0 1
[  293.834462] ata1: tag 0x2: 1 0 0 1
[  293.834465] ata1: tag 0x3: 1 0 0 1
[  293.834468] ata1: tag 0x4: 1 0 0 1
[  293.834483] ata1.00: exception Emask 0x0 SAct 0x7ff SErr 0x0 action
0x6 frozen
[  293.834488] ata1.00: failed command: WRITE FPDMA QUEUED
[  293.834496] ata1.00: cmd 61/80:00:49:eb:6e/00:00:06:00:00/40 tag 0
ncq 65536 out
[  293.834498]          res 40/00:00:00:4f:c2/00:00:00:00:00/00 Emask
0x4 (timeout)
[  293.834502] ata1.00: status: { DRDY }
[  293.834505] ata1.00: failed command: WRITE FPDMA QUEUED
[  293.834513] ata1.00: cmd 61/00:08:c9:f2:6e/02:00:06:00:00/40 tag 1
ncq 262144 out
[  293.834515]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask
0x4 (timeout)
[  293.834518] ata1.00: status: { DRDY }
[  293.834522] ata1.00: failed command: WRITE FPDMA QUEUED
[  293.834530] ata1.00: cmd 61/c8:10:c9:f5:6e/01:00:06:00:00/40 tag 2
ncq 233472 out
[  293.834531]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask
0x4 (timeout)
(...)
[  395.538669] ata1: tag 0x9: 1 0 0 1
[  395.538672] ata1: tag 0xa: 1 0 0 1
[  395.538675] ata1: tag 0xb: 1 0 0 1
[  395.538678] ata1: tag 0xc: 1 0 0 1
[  395.538681] ata1: tag 0xd: 1 0 0 1
[  395.538684] ata1: tag 0xe: 0 0 0 1
[  395.538697] ata1.00: NCQ disabled due to excessive errors
[  395.538703] ata1.00: exception Emask 0x0 SAct 0x7fe00 SErr 0x0
action 0x6 frozen
[  395.538708] ata1.00: failed command: WRITE FPDMA QUEUED
[  395.538717] ata1.00: cmd 61/80:48:49:34:8e/00:00:06:00:00/40 tag 9
ncq 65536 out
[  395.538718]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask
0x4 (timeout)
[  395.538722] ata1.00: status: { DRDY }
[  395.538725] ata1.00: failed command: WRITE FPDMA QUEUED
[  395.538733] ata1.00: cmd 61/00:50:c9:37:8e/01:00:06:00:00/40 tag 10
ncq 131072 out
[  395.538735]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask
0x4 (timeout)
[  395.538739] ata1.00: status: { DRDY }
[  395.538742] ata1.00: failed command: WRITE FPDMA QUEUED
[  395.538749] ata1.00: cmd 61/80:58:b9:37:7b/00:00:06:00:00/40 tag 11
ncq 65536 out
[  395.538751]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask
0x4 (timeout)
[  395.538755] ata1.00: status: { DRDY }
[  395.538758] ata1.00: failed command: WRITE FPDMA QUEUED
[  395.538766] ata1.00: cmd 61/08:60:c9:18:8e/00:00:06:00:00/40 tag 12
ncq 4096 out
[  395.538767]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask
0x4 (timeout)
[  395.538771] ata1.00: status: { DRDY }
[  395.538774] ata1.00: failed command: WRITE FPDMA QUEUED
[  395.538782] ata1.00: cmd 61/40:68:89:43:8e/00:00:06:00:00/40 tag 13
ncq 32768 out
[  395.538783]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask
0x4 (timeout)
[  395.538787] ata1.00: status: { DRDY }
[  395.538790] ata1.00: failed command: WRITE FPDMA QUEUED
[  395.538798] ata1.00: cmd 61/08:70:d9:ff:6e/00:00:06:00:00/40 tag 14
ncq 4096 out
[  395.538800]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask
0x4 (timeout)
[  395.538803] ata1.00: status: { DRDY }
[  395.538806] ata1.00: failed command: WRITE FPDMA QUEUED
[  395.538814] ata1.00: cmd 61/08:78:a9:34:6a/00:00:06:00:00/40 tag 15
ncq 4096 out
[  395.538816]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask
0x4 (timeout)
[  395.538819] ata1.00: status: { DRDY }
[  395.538822] ata1.00: failed command: WRITE FPDMA QUEUED
[  395.538830] ata1.00: cmd 61/10:80:71:d7:6f/00:00:06:00:00/40 tag 16
ncq 8192 out
[  395.538832]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask
0x4 (timeout)
[  395.538835] ata1.00: status: { DRDY }
[  395.538839] ata1.00: failed command: WRITE FPDMA QUEUED
[  395.538846] ata1.00: cmd 61/08:88:61:ff:6d/00:00:06:00:00/40 tag 17
ncq 4096 out
[  395.538848]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask
0x4 (timeout)
[  395.538852] ata1.00: status: { DRDY }
[  395.538855] ata1.00: failed command: WRITE FPDMA QUEUED
[  395.538862] ata1.00: cmd 61/08:90:e9:62:62/01:00:06:00:00/40 tag 18
ncq 135168 out
[  395.538864]          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask
0x4 (timeout)
[  395.538868] ata1.00: status: { DRDY }
[  395.538874] ata1: hard resetting link
[  395.538877] ata1: nv: skipping hardreset on occupied port
[  396.005559] ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[  396.025796] ata1.00: configured for UDMA/100
[  396.025796] ata1.00: device reported invalid CHS sector 0
[  396.025796] ata1.00: device reported invalid CHS sector 0
[  396.025796] ata1.00: device reported invalid CHS sector 0
[  396.025796] ata1.00: device reported invalid CHS sector 0
[  396.025796] ata1.00: device reported invalid CHS sector 0
[  396.025796] ata1.00: device reported invalid CHS sector 0
[  396.025796] ata1.00: device reported invalid CHS sector 0
[  396.025796] ata1.00: device reported invalid CHS sector 0
[  396.025796] ata1.00: device reported invalid CHS sector 0
[  396.025796] ata1.00: device reported invalid CHS sector 0
[  396.025796] ata1: EH complete

During that, the computer is frozen (well, not actually, the trackpad
is able to move the mouse).  For all the details, please see the full
dmesg attached to this mail. After that, it seems that I can use the
computer and it doesn't happen anymore, but I get a a very high (98+%
of time) amount of hardware interrupts happening while I am using X
(not sure yet when I quit X). See the attached /proc/interrupts.

* without any boot options: without "acpi=off", the machine just hangs
at (hand copied, but photographed, if you want it):

(...)
CPU: Mobile AMD Sempron (tm) Processor 3400+ stepping 02
ACPI: Core revision 20110623

And nothing else happens.

> I assume that with no arguments, you still need the "skip transparent
> bridge sizing" change to boot.
>
> I don't really like that change because in __pci_bus_size_bridges(),
> it's not obvious why sizing transparent bridges should be a problem.
> If growing transparent bridge windows makes us run out of space, let's
> put the smarts ("this bridge is transparent, we can take advantage of
> subtractive decode so we may not need to grow the positive decode
> windows") at the point where we grow, not at the point where we size.
> If we do have enough space, growing the positive decode windows is
> better because they're faster than subtractive decode.

I'm afraid that I have understood almost nothing of what you just said
:-), but I will try to read some of the code.

Oh, Linux, all this is with "CONFIG_PCI_DEBUG=y" since the very first beginning.


Thanks,

-- 
Rogério Brito : rbrito@...e.usp.br,gmail.com} : GPG key 4096R/BCFCAAAA
http://rb.doesntexist.org : Packages for LaTeX : algorithms.berlios.de
DebianQA: http://qa.debian.org/developer.php?login=rbrito%40ime.usp.br

View attachment "dmesg-3.1.0-rc3+.txt" of type "text/plain" (59150 bytes)

View attachment "proc-interrupts.txt" of type "text/plain" (857 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ