lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CAPoiz9yhALYy5ThFh_TtpFH0GbEkQGPzCnmik0iw3YuAsdooZA@mail.gmail.com>
Date:	Tue, 4 Oct 2011 15:12:01 -0500
From:	Jon Mason <mason@...i.com>
To:	Avi Kivity <avi@...hat.com>
Cc:	Sven Schnelle <svens@...ckframe.org>, Simon Kirby <sim@...tway.ca>,
	Eric Dumazet <eric.dumazet@...il.com>,
	Niels Ole Salscheider <niels_ole@...scheider-online.de>,
	Jesse Barnes <jbarnes@...tuousgeek.org>,
	Linus Torvalds <torvalds@...ux-foundation.org>,
	linux-kernel <linux-kernel@...r.kernel.org>,
	"linux-pci@...r.kernel.org" <linux-pci@...r.kernel.org>,
	Ben Hutchings <bhutchings@...arflare.com>
Subject: Re: Workaround for Intel MPS errata

On Tue, Oct 4, 2011 at 8:06 AM, Avi Kivity <avi@...hat.com> wrote:
> On 10/04/2011 11:46 AM, Avi Kivity wrote:
>>
>> On 10/03/2011 05:12 PM, Jon Mason wrote:
>>>
>>>     PCI: Workaround for Intel MPS errata
>>>
>>>     Intel 5000 and 5100 series memory controllers have a known issue if
>>> read
>>>     completion coalescing is enabled (the default setting) and the PCI-E
>>>     Maximum Payload Size is set to 256B.  To work around this issue,
>>> disable
>>>     read completion coalescing if the MPS is 256B.
>>>
>>>     It is worth noting that there is no function to undo the disable of
>>> read
>>>     completion coalescing, and the performance benefit of read completion
>>>     coalescing will be lost if the MPS is set from 256B to 128B.  It is
>>> only
>>>     possible to have this issue via hotplug removing the only 256B MPS
>>>     device in the system (thus making all of the other devices in the
>>> system
>>>     have a performance degradation without the benefit of any 256B
>>>     transfers).  Therefore, this trade off is acceptable.
>>>
>>>
>>> http://www.intel.com/content/dam/doc/specification-update/5000-chipset-memory-controller-hub-specification-update.pdf
>>>
>>> http://www.intel.com/content/dam/doc/specification-update/5100-memory-controller-hub-chipset-specification-update.pdf
>>>
>>>     Thanks to Jesse Brandeburg and Ben Hutchings for providing insight
>>> into
>>>     the problem.
>>>
>>>     Reported-by: Avi Kivity<avi@...hat.com>
>>>     Signed-off-by: Jon Mason<mason@...i.com>
>>>
>>> +
>>> +        if (!(val&  (1<<  10))) {
>>> +            done = true;
>>> +            return;
>>> +        }
>>
>> Here, you bail out if bit 10 is clear.  So if we're here, it's set.
>>
>>> +
>>> +        val |= (1<<  10);
>>
>> Now it's even more set?
>>
>
> Even with this line changed to clear bit 10, I still get a hard lockup.  Do
> we need to clear this bit on the other 5000 devices?  I notice they have
> similar values in word 0x48, with bits 10 set in them.
>
> What does "Device 7-2,0" refer to in the workaround description?  Seems to
> me we need to apply the workaround to the PCIe ports as well.

I believe you are correct.  On my system (which I still can't get to
fail by enabling the RCC bit), I have
00:00.0 Host bridge: Intel Corporation 5000X Chipset Memory Controller
Hub (rev 12)
00:02.0 PCI bridge: Intel Corporation 5000 Series Chipset PCI Express
x4 Port 2 (rev 12)
00:03.0 PCI bridge: Intel Corporation 5000 Series Chipset PCI Express
x4 Port 3 (rev 12)
00:04.0 PCI bridge: Intel Corporation 5000 Series Chipset PCI Express
x8 Port 4-5 (rev 12)
00:05.0 PCI bridge: Intel Corporation 5000 Series Chipset PCI Express
x4 Port 5 (rev 12)
00:06.0 PCI bridge: Intel Corporation 5000 Series Chipset PCI Express
x8 Port 6-7 (rev 12)
00:07.0 PCI bridge: Intel Corporation 5000 Series Chipset PCI Express
x4 Port 7 (rev 12)

Those PCI devices numbers match perfectly to the ones from the
erratum.  Patch to disable the bit on those devices coming shortly.

Thanks,
Jon

>
> --
> error compiling committee.c: too many arguments to function
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-pci" in
> the body of a message to majordomo@...r.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ