Message-ID: <CABhMZUVhM3PU5BUu=k-KfR5injzFM4VoABKtN8HxXW2HiPStQQ@mail.gmail.com>
Date:   Mon, 3 Dec 2018 17:36:15 -0600
From:   Bjorn Helgaas <bjorn.helgaas@...il.com>
To:     Keith Busch <keith.busch@...el.com>,
        Oza Pawandeep <poza@...eaurora.org>
Cc:     linux-pci@...r.kernel.org, mikhail.v.gavrilov@...il.com,
        emteeelp@...il.com, linux-kernel@...r.kernel.org
Subject: Fwd: [Bug 201517] New: pcieport 0000:00:03.1: AER: Corrected error
 received: 0000:00:00.0

[Forwarding this to linux-pci since nobody really monitors the bugzilla]

Possibly the same issue reported here:

  https://bugzilla.kernel.org/show_bug.cgi?id=109691
  https://bugzilla.kernel.org/show_bug.cgi?id=111601
  https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1588428/
  https://lore.kernel.org/linux-pci/20160215171423.GA12641@localhost/

I had a theory about the problem (see the lore.kernel link above), but
that was before a lot of AER rework, and I haven't checked the code
since then.

---------- Forwarded message ---------
From: <bugzilla-daemon@...zilla.kernel.org>
Date: Thu, Oct 25, 2018 at 12:45 AM
Subject: [Bug 201517] New: pcieport 0000:00:03.1: AER: Corrected error received: 0000:00:00.0
To: <bugzilla.pci@...il.com>


https://bugzilla.kernel.org/show_bug.cgi?id=201517

            Bug ID: 201517
           Summary: pcieport 0000:00:03.1: AER: Corrected error received:
                    0000:00:00.0
           Product: Drivers
           Version: 2.5
    Kernel Version: 4.19
          Hardware: All
                OS: Linux
              Tree: Mainline
            Status: NEW
          Severity: normal
          Priority: P1
         Component: PCI
          Assignee: drivers_pci@...nel-bugs.osdl.org
          Reporter: mikhail.v.gavrilov@...il.com
        Regression: No

Created attachment 279149
  --> https://bugzilla.kernel.org/attachment.cgi?id=279149&action=edit
dmesg

I often get a strange error in the kernel log:

[ 8885.590311] pcieport 0000:00:03.1: AER: Corrected error received: 0000:00:00.0
[ 8885.590320] pcieport 0000:00:03.1: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
[ 8885.590324] pcieport 0000:00:03.1:   device [1022:1453] error status/mask=00001000/00006000
[ 8885.590328] pcieport 0000:00:03.1:    [12] Timeout

This does not happen on every boot. If the message starts to appear after a
reboot, it keeps appearing again and again; if it does not appear after a
reboot, it does not appear at all.
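
For readers decoding the "error status/mask=00001000/00006000" line in the log
above, here is a minimal sketch in Python. It assumes the bit layout of the
Correctable Error Status register from the PCIe AER capability. The table and
the helper names (CORRECTABLE_BITS, decode, parse_status_mask) are illustrative
and are not taken from the kernel source. The kernel prints a shorter label for
bit 12 ("Timeout") than the spec name used here.

```python
# Bit positions of the AER Correctable Error Status register,
# named after the PCIe specification (assumed layout, not kernel strings).
CORRECTABLE_BITS = {
    0: "Receiver Error",
    6: "Bad TLP",
    7: "Bad DLLP",
    8: "REPLAY_NUM Rollover",
    12: "Replay Timer Timeout",
    13: "Advisory Non-Fatal Error",
    14: "Corrected Internal Error",
    15: "Header Log Overflow",
}


def parse_status_mask(field):
    """Split a 'status/mask' hex pair as printed in the AER log line."""
    status, mask = field.split("/")
    return int(status, 16), int(mask, 16)


def decode(value):
    """Return (bit, name) for every bit set in a 32-bit register value."""
    return [
        (bit, CORRECTABLE_BITS.get(bit, "Reserved/unknown"))
        for bit in range(32)
        if (value >> bit) & 1
    ]


status, mask = parse_status_mask("00001000/00006000")
print("status:", decode(status))
print("mask:  ", decode(mask))
```

With the values from the log, the status word has only bit 12 set (Replay Timer
Timeout), and the mask covers bits 13 and 14, so the reported error is not one
of the masked ones.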

# lspci -nn
00:00.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Root Complex [1022:1450]
00:00.2 IOMMU [0806]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) I/O Memory Management Unit [1022:1451]
00:01.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) PCIe Dummy Host Bridge [1022:1452]
00:01.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) PCIe GPP Bridge [1022:1453]
00:01.3 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) PCIe GPP Bridge [1022:1453]
00:02.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) PCIe Dummy Host Bridge [1022:1452]
00:03.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) PCIe Dummy Host Bridge [1022:1452]
00:03.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) PCIe GPP Bridge [1022:1453]
00:04.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) PCIe Dummy Host Bridge [1022:1452]
00:07.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) PCIe Dummy Host Bridge [1022:1452]
00:07.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Internal PCIe GPP Bridge 0 to Bus B [1022:1454]
00:08.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) PCIe Dummy Host Bridge [1022:1452]
00:08.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Internal PCIe GPP Bridge 0 to Bus B [1022:1454]
00:14.0 SMBus [0c05]: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller [1022:790b] (rev 59)
00:14.3 ISA bridge [0601]: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge [1022:790e] (rev 51)
00:18.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 0 [1022:1460]
00:18.1 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 1 [1022:1461]
00:18.2 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 2 [1022:1462]
00:18.3 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 3 [1022:1463]
00:18.4 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 4 [1022:1464]
00:18.5 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 5 [1022:1465]
00:18.6 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 6 [1022:1466]
00:18.7 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) Data Fabric: Device 18h; Function 7 [1022:1467]
01:00.0 Non-Volatile memory controller [0108]: Intel Corporation Optane SSD 900P Series [8086:2700]
02:00.0 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] Device [1022:43d0] (rev 01)
02:00.1 SATA controller [0106]: Advanced Micro Devices, Inc. [AMD] Device [1022:43c8] (rev 01)
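
To cross-check the "device [1022:1453]" in the AER message against the listing
above, the sketch below parses "lspci -nn" output into a table keyed by
bus:device.function. The regular expression and the names LSPCI_RE,
parse_lspci_nn, and SAMPLE are hypothetical helpers written for this note. They
assume one device per line, as lspci prints it before any mail-client wrapping.

```python
import re

# One "lspci -nn" line: BDF, class name [class code]: device name [vendor:device] (rev NN)
LSPCI_RE = re.compile(
    r"^(?P<bdf>[0-9a-f]{2}:[0-9a-f]{2}\.[0-7]) "
    r"(?P<cls>.+?) \[(?P<class_code>[0-9a-f]{4})\]: "
    r"(?P<name>.+) \[(?P<vendor>[0-9a-f]{4}):(?P<device>[0-9a-f]{4})\]"
    r"(?: \(rev (?P<rev>[0-9a-f]{2})\))?$"
)


def parse_lspci_nn(text):
    """Map each bus:device.function to its parsed fields; skip other lines."""
    devices = {}
    for line in text.splitlines():
        match = LSPCI_RE.match(line.strip())
        if match:
            devices[match.group("bdf")] = match.groupdict()
    return devices


# Three lines copied from the listing above, unwrapped.
SAMPLE = """\
00:03.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-0fh) PCIe GPP Bridge [1022:1453]
01:00.0 Non-Volatile memory controller [0108]: Intel Corporation Optane SSD 900P Series [8086:2700]
02:00.0 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] Device [1022:43d0] (rev 01)
"""

devices = parse_lspci_nn(SAMPLE)
port = devices["00:03.1"]
print(port["cls"], port["vendor"] + ":" + port["device"])
```

For 00:03.1 this yields a PCI bridge with ID 1022:1453, which matches the
device ID printed in the AER log.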

# uname -r
4.19.0-0.rc8.git4.1.fc30.x86_64

--
You are receiving this mail because:
You are watching the assignee of the bug.
