lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Date:   Tue, 19 Dec 2017 03:34:21 -0500
From:   Alexandru Chirvasitu <achirvasub@...il.com>
To:     Pavel Machek <pavel@....cz>
Cc:     kernel list <linux-kernel@...r.kernel.org>,
        Ingo Molnar <mingo@...hat.com>,
        "Maciej W. Rozycki" <macro@...ux-mips.org>,
        Mikael Pettersson <mikpelinux@...il.com>,
        Thomas Gleixner <tglx@...utronix.de>,
        Josh Poulson <jopoulso@...rosoft.com>,
        Mihai Costache <v-micos@...rosoft.com>,
        Stephen Hemminger <sthemmin@...rosoft.com>,
        Marc Zyngier <marc.zyngier@....com>, linux-pci@...r.kernel.org,
        Haiyang Zhang <haiyangz@...rosoft.com>,
        Dexuan Cui <decui@...rosoft.com>,
        Simon Xiao <sixiao@...rosoft.com>,
        Saeed Mahameed <saeedm@...lanox.com>,
        Jork Loeser <Jork.Loeser@...rosoft.com>,
        Bjorn Helgaas <bhelgaas@...gle.com>,
        devel@...uxdriverproject.org, KY Srinivasan <kys@...rosoft.com>
Subject: Re: PROBLEM: 4.15.0-rc3 APIC causes lockups on Core 2 Duo laptop

Thank you!

On Mon, Dec 18, 2017 at 11:11:31AM +0100, Pavel Machek wrote:
> Hi!
> On Mon 2017-12-18 03:20:11, Alexandru Chirvasitu wrote:
> > Short description of the problem: latest rc kernel results in seemingly APIC-caused hard lockups, whereas latest stable kernel works fine.
> > 
> > I have an old ASUS F5RL laptop with an Intel Core 2 Duo CPU T5450 @1.66GHz. It is currently running Debian 9.3 stable 32 bit (by default on a 4.9-series kernel), but I have been compiling and installing the latest kernels.
> > 
> 
> Thanks for doing that.
> 
> > The latest rc kernel at the time of this writing (4.15.0-rc3) boots but then results in hard lockups on both CPUs after login. Starting in recovery mode returns the error
> > 
> > "spurious APIC interrupt through vector ff on CPU#0, should never happen"
> > 
> > before lockip up the CPUs again. A hard reboot is necessary. 
> > 
> > Starting with kernel option noapic logs me in uneventfully, but for some reason has the effect of rendering my ethrenet card inoperable. It is a Qualcomm Atheros Attansic L2 Fast Ethernet (rev a0), handled by kernel module atl2. In noapic mode the card is still seen by the system, can be brought up / down, etc., but dhclient never manages to acquire a lease.
> > 
> > Starting with kernel option nolapic instead brings up the network and logs me in, but only sees one CPU instead of two, as usual.
> > 
> > The latest kernel that exhibits none of these issues is the latest stable one as of this writing: 4.14.7.
> > 
> > ---
> > 
> > As this seems to be APIC-related, I am sending the message to the maintainers mentioned in arch/x86/kernel/apic/apic.c. I am unsure whether this is the correct procedure however.
> > 
> 
> Good enough procedure. You want to always copy linux-kernel mailing
> list, and you should probably look for X86 maintainers in MAINTAINERS
> file, and  cc them, too.
> 
> If you run out of other options, you can always do "git bisect"...
>


I had never heard of 'bisect' before this casual mention (you might tell I am a bit out of my depth). I've since applied it to Linus' tree between

bebc608 Linux 4.14 (good)

and

4fbd8d1 Linux 4.15-rc1 (bad)

It took about 13 attempts (I had access to a faster machine to compile on, and ccache helped once the cache built up some momentum). The result is (as presented by 'git bisect' at the end of the process, between the --- dividers added by me for clarity):

--- start of output ---

2b5175c4fa974b6aa05bbd2ee8d443a8036a1714 is the first bad commit
commit 2b5175c4fa974b6aa05bbd2ee8d443a8036a1714
Author: Thomas Gleixner <tglx@...utronix.de>
Date:   Tue Oct 17 09:54:57 2017 +0200

    genirq: Add config option for reservation mode
    
    The interrupt reservation mode requires reactivation of PCI/MSI
    interrupts. Create a config option, so the PCI code can set the
    corresponding flag when required.
    
    Signed-off-by: Thomas Gleixner <tglx@...utronix.de>
    Cc: Josh Poulson <jopoulso@...rosoft.com>
    Cc: Mihai Costache <v-micos@...rosoft.com>
    Cc: Stephen Hemminger <sthemmin@...rosoft.com>
    Cc: Marc Zyngier <marc.zyngier@....com>
    Cc: linux-pci@...r.kernel.org
    Cc: Haiyang Zhang <haiyangz@...rosoft.com>
    Cc: Dexuan Cui <decui@...rosoft.com>
    Cc: Simon Xiao <sixiao@...rosoft.com>
    Cc: Saeed Mahameed <saeedm@...lanox.com>
    Cc: Jork Loeser <Jork.Loeser@...rosoft.com>
    Cc: Bjorn Helgaas <bhelgaas@...gle.com>
    Cc: devel@...uxdriverproject.org
    Cc: KY Srinivasan <kys@...rosoft.com>
    Link: https://lkml.kernel.org/r/20171017075600.369375409@linutronix.de

:040000 040000 5e73031cc0c8411a20722cce7876ab7b82ed3858 dcf98e7a6b7d5f7c5353b7ccab02125e6d332ec8 M      kernel

--- end of output ---

Consequently, I am cc-ing in the listed addresses.


Thank you,

Alex Chirvasitu

> Best regards,								Pavel
> -- 
> (english) http://www.livejournal.com/~pavelmachek
> (cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html


Powered by blists - more mailing lists