[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20161206093038.GA21270@dhcp22.suse.cz>
Date: Tue, 6 Dec 2016 10:30:39 +0100
From: Michal Hocko <mhocko@...nel.org>
To: Yu Zhao <yuzhao@...gle.com>
Cc: Dan Streetman <ddstreet@...e.org>,
Seth Jennings <sjenning@...hat.com>,
Linux-MM <linux-mm@...ck.org>,
linux-kernel <linux-kernel@...r.kernel.org>,
Avi Kivity <avi@...ranet.com>,
Thomas Gleixner <tglx@...utronix.de>,
Ingo Molnar <mingo@...nel.org>
Subject: Re: [PATCH] hotplug: make register and unregister notifier API
symmetric
On Mon 05-12-16 12:59:02, Yu Zhao wrote:
> On Fri, Dec 02, 2016 at 04:19:36PM +0100, Michal Hocko wrote:
> > [Let's CC more people - the thread started
> > http://lkml.kernel.org/r/1480540516-6458-1-git-send-email-yuzhao@google.com]
> >
> > On Fri 02-12-16 09:56:26, Dan Streetman wrote:
> > > On Fri, Dec 2, 2016 at 9:44 AM, Michal Hocko <mhocko@...nel.org> wrote:
> > > > On Fri 02-12-16 15:38:48, Michal Hocko wrote:
> > > >> On Fri 02-12-16 09:24:35, Dan Streetman wrote:
> > > >> > On Fri, Dec 2, 2016 at 8:46 AM, Michal Hocko <mhocko@...nel.org> wrote:
> > > >> > > On Wed 30-11-16 13:15:16, Yu Zhao wrote:
> > > >> > >> __unregister_cpu_notifier() only removes registered notifier from its
> > > >> > >> linked list when CPU hotplug is configured. If we free registered CPU
> > > >> > >> notifier when HOTPLUG_CPU=n, we corrupt the linked list.
> > > >> > >>
> > > >> > >> To fix the problem, we can either use a static CPU notifier that walks
> > > >> > >> through each pool or just simply disable CPU notifier when CPU hotplug
> > > >> > >> is not configured (which is perfectly safe because the code in question
> > > >> > >> is called after all possible CPUs are online and will remain online
> > > >> > >> until power off).
> > > >> > >>
> > > >> > >> v2: #ifdef for cpu_notifier_register_done during cleanup.
> > > >> > >
> > > >> > > this ifedfery is just ugly as hell. I am also wondering whether it is
> > > >> > > really needed. __register_cpu_notifier and __unregister_cpu_notifier are
> > > >> > > noops for CONFIG_HOTPLUG_CPU=n. So what's exactly that is broken here?
> > > >> >
> > > >> > hmm, that's interesting, __unregister_cpu_notifier is always a noop if
> > > >> > HOTPLUG_CPU=n, but __register_cpu_notifier is only a noop if
> > > >> > HOTPLUG_CPU=n *and* MODULE. If !MODULE, __register_cpu_notifier does
> > > >>
> > > >> OK, I've missed the MODULE part
> > > >>
> > > >> > actually register! This was added by commit
> > > >> > 47e627bc8c9a70392d2049e6af5bd55fae61fe53 ('hotplug: Allow modules to
> > > >> > use the cpu hotplug notifiers even if !CONFIG_HOTPLUG_CPU') and looks
> > > >> > like it's to allow built-ins to register so they can notice during
> > > >> > boot when cpus are initialized.
> > > >>
> > > >> I cannot say I wound understand the motivation but that is not really
> > > >> all that important.
> > > >>
> > > >> > IMHO, that is the real problem - sure, without HOTPLUG_CPU, nobody
> > > >> > should ever get a notification that a cpu is dying, but that doesn't
> > > >> > mean builtins that register notifiers will never unregister their
> > > >> > notifiers and then free them.
> > > >>
> > > >> Yes that is true. That suggests that __unregister_cpu_notifier should
> > > >> the the symmetric thing to the __register_cpu_notifier for
> > > >> CONFIG_MODULE, right?
> > > >
> > > > I meant the following. Completely untested
> > >
> > > agreed, but also needs the non-__ version, and kernel/cpu.c needs
> > > tweaking to move those functions out of the #ifdef CONFIG_HOTPLUG_CPU
> > > section.
> >
> > OK, this is still only compile tested. Yu Zhao, assuming you were able
> > to trigger the original problem could you test with the below patch
> > please?
>
> This patch (plus the latest fix in this thread) solves the problem.
>
> Just for the record, the problem is when CONFIG_HOTPLUG_CPU=n, changing
> /sys/module/zswap/parameters/compressor multiple times will cause:
>
> [ 144.964346] BUG: unable to handle kernel paging request at ffff880658a2be78
> [ 144.971337] IP: [<ffffffffa290b00b>] raw_notifier_chain_register+0x1b/0x40
> <snipped>
> [ 145.122628] Call Trace:
> [ 145.125086] [<ffffffffa28e5cf8>] __register_cpu_notifier+0x18/0x20
> [ 145.131350] [<ffffffffa2a5dd73>] zswap_pool_create+0x273/0x400
> [ 145.137268] [<ffffffffa2a5e0fc>] __zswap_param_set+0x1fc/0x300
> [ 145.143188] [<ffffffffa2944c1d>] ? trace_hardirqs_on+0xd/0x10
> [ 145.149018] [<ffffffffa2908798>] ? kernel_param_lock+0x28/0x30
> [ 145.154940] [<ffffffffa2a3e8cf>] ? __might_fault+0x4f/0xa0
> [ 145.160511] [<ffffffffa2a5e237>] zswap_compressor_param_set+0x17/0x20
> [ 145.167035] [<ffffffffa2908d3c>] param_attr_store+0x5c/0xb0
> [ 145.172694] [<ffffffffa290848d>] module_attr_store+0x1d/0x30
> [ 145.178443] [<ffffffffa2b2b41f>] sysfs_kf_write+0x4f/0x70
> [ 145.183925] [<ffffffffa2b2a5b9>] kernfs_fop_write+0x149/0x180
> [ 145.189761] [<ffffffffa2a99248>] __vfs_write+0x18/0x40
> [ 145.194982] [<ffffffffa2a9a412>] vfs_write+0xb2/0x1a0
> [ 145.200122] [<ffffffffa2a9a732>] SyS_write+0x52/0xa0
> [ 145.205177] [<ffffffffa2ff4d97>] entry_SYSCALL_64_fastpath+0x12/0x17
Thanks for this additional information which I have added to the
changelog. I have also added your Tested-by unless you have any
objections and will repost soon.
--
Michal Hocko
SUSE Labs
Powered by blists - more mailing lists