[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <201006221119.06210.rjw@sisk.pl>
Date: Tue, 22 Jun 2010 11:19:05 +0200
From: "Rafael J. Wysocki" <rjw@...k.pl>
To: Maxim Levitsky <maximlevitsky@...il.com>
Cc: Adrian Hunter <adrian.hunter@...ia.com>,
"linux-mmc" <linux-mmc@...r.kernel.org>,
Andrew Morton <akpm@...ux-foundation.org>,
"linux-pm" <linux-pm@...ts.linux-foundation.org>,
"linux-kernel" <linux-kernel@...r.kernel.org>,
Philip Langdale <philipl@...rt.org>
Subject: Re: [PATCH 1/2] MMC: fix all hangs related to mmc/sd card insert/removal during suspend/resume.
On Tuesday, June 22, 2010, Maxim Levitsky wrote:
> On Mon, 2010-06-21 at 22:26 +0200, Rafael J. Wysocki wrote:
> > On Monday, June 21, 2010, Maxim Levitsky wrote:
> > > On Mon, 2010-06-21 at 23:04 +0300, Adrian Hunter wrote:
> > > > ext Maxim Levitsky wrote:
> > > > > If you don't use CONFIG_MMC_UNSAFE_RESUME, as soon as you attempt to
> > > > > suspend, the card will be removed, therefore this patch doesn't change
> > > > > the behavior of this option.
> > > > >
> > > > > However the removal will be done by pm notifier, which runs while
> > > > > userspace is still not frozen and thus can freely use del_gendisk,
> > > > > without the risk of deadlock which would happen otherwise.
> > > > >
> > > > >
> > > > > Card detect workqueue is now freezeable,
> > > > > therefore if you do use CONFIG_MMC_UNSAFE_RESUME,
> > > > > and remove the card during suspend, the removal will be
> > > > > detected as soon as userspace is unfrozen, again at the moment
> > > > > it is safe to call del_gendisk.
> > > > >
> > > > > Tested with and without CONFIG_MMC_UNSAFE_RESUME with suspend and hibernate.
> > > > >
> > > > > Signed-off-by: Maxim Levitsky <maximlevitsky@...il.com>
> > > > > ---
> > > > > drivers/mmc/core/core.c | 54 +++++++++++++++++++++++++++------------------
> > > > > drivers/mmc/core/host.c | 6 +++++
> > > > > include/linux/mmc/host.h | 3 ++
> > > > > 3 files changed, 41 insertions(+), 22 deletions(-)
> > > > >
> > > > > diff --git a/drivers/mmc/core/core.c b/drivers/mmc/core/core.c
> > > > > index 569e94d..0cba53a 100644
> > > > > --- a/drivers/mmc/core/core.c
> > > > > +++ b/drivers/mmc/core/core.c
> > > > > @@ -1259,26 +1259,11 @@ int mmc_suspend_host(struct mmc_host *host)
> > > > >
> > > > > if (host->caps & MMC_CAP_DISABLE)
> > > > > cancel_delayed_work(&host->disable);
> > > > > - cancel_delayed_work(&host->detect);
> > > > > - mmc_flush_scheduled_work();
> > > > >
> > > > > mmc_bus_get(host);
> > > > > if (host->bus_ops && !host->bus_dead) {
> > > > > if (host->bus_ops->suspend)
> > > > > err = host->bus_ops->suspend(host);
> > > > > - if (err == -ENOSYS || !host->bus_ops->resume) {
> > > > > - /*
> > > > > - * We simply "remove" the card in this case.
> > > > > - * It will be redetected on resume.
> > > > > - */
> > > > > - if (host->bus_ops->remove)
> > > > > - host->bus_ops->remove(host);
> > > > > - mmc_claim_host(host);
> > > > > - mmc_detach_bus(host);
> > > > > - mmc_release_host(host);
> > > > > - host->pm_flags = 0;
> > > > > - err = 0;
> > > > > - }
> > > > > }
> > > > > mmc_bus_put(host);
> > > > >
> > > > > @@ -1310,12 +1295,6 @@ int mmc_resume_host(struct mmc_host *host)
> > > > > printk(KERN_WARNING "%s: error %d during resume "
> > > > > "(card was removed?)\n",
> > > > > mmc_hostname(host), err);
> > > > > - if (host->bus_ops->remove)
> > > > > - host->bus_ops->remove(host);
> > > > > - mmc_claim_host(host);
> > > > > - mmc_detach_bus(host);
> > > > > - mmc_release_host(host);
> > > > > - /* no need to bother upper layers */
> > > > > err = 0;
> > > > > }
> > > > > }
> > > > > @@ -1330,6 +1309,37 @@ int mmc_resume_host(struct mmc_host *host)
> > > > > return err;
> > > > > }
> > > > >
> > > > > +/* Do the card removal on suspend if card is assumed removeable
> > > > > + * Do that in pm notifier while userspace isn't yet frozen, so we will be able
> > > > > + to sync the card.
> > > > > +*/
> > > > > +int mmc_pm_notify(struct notifier_block *notify_block,
> > > > > + unsigned long mode, void *unused)
> > > > > +{
> > > > > + struct mmc_host *host = container_of(
> > > > > + notify_block, struct mmc_host, pm_notify);
> > > > > +
> > > > > +
> > > > > + switch (mode) {
> > > > > + case PM_HIBERNATION_PREPARE:
> > > > > + case PM_SUSPEND_PREPARE:
> > > > > +
> > > > > + if (!host->bus_ops || host->bus_ops->suspend)
> > > > > + break;
> > > > > +
> > > > > + if (host->bus_ops->remove)
> > > > > + host->bus_ops->remove(host);
> > > > > + mmc_claim_host(host);
> > > > > + mmc_detach_bus(host);
> > > > > + mmc_release_host(host);
> > > > > + host->pm_flags = 0;
> > > > > + break;
> > > >
> > > > Is it possible that you receive PM_SUSPEND_PREPARE
> > > > but there is no suspend and therefore no resume
> > > > and therefore the card is removed but not detected
> > > > again?
> > > This is very good point.
> > > The solution is to kick mmc detection thread from this notifier.
> > > on resume.
> > > I update the patch.
> > >
> > > >
> > > > Is it possible that you are racing with kmmcd and the
> > > > card is added after you receive PM_SUSPEND_PREPARE but
> > > > before kmmcd is frozen?
> > > This is unlikely but valid race.
> > > I afraid I don't know nice way to solve it right now.
> > > I can add some ad-hoc variable to tell interrupt handler not to kick the
> > > detection workqueue after suspend notifier was called.
> > >
> > > I wish there was a generic freeze_workqueue function.
> >
> > There are freezable workqueues that are automatically frozen during suspend
> > by the process freezer. However, at the moment they need to be singlethread
> > and I'm not sure if using one in this particular case is appropriate.
>
> I *do* use freezable work-queue.
I overlooked that, sorry.
> However since this is pm notifier, it is called before userspace and the
> workqueue is frozen.
> Therefore I would like manually to freeze the workqueue from the pm
> notifier.
No, that won't work. You need to find an alternative solution. I guess you
may insert a work item that's going to sleep until a condition is
satisfied (analogous to a workqueue barrier) and wait for it to run.
Rafael
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists