[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20120914150306.GQ28039@dhcp22.suse.cz>
Date: Fri, 14 Sep 2012 17:03:06 +0200
From: Michal Hocko <mhocko@...e.cz>
To: Tejun Heo <tj@...nel.org>
Cc: containers@...ts.linux-foundation.org, cgroups@...r.kernel.org,
linux-kernel@...r.kernel.org, Li Zefan <lizefan@...wei.com>,
Glauber Costa <glommer@...allels.com>,
Peter Zijlstra <peterz@...radead.org>,
Paul Turner <pjt@...gle.com>,
Johannes Weiner <hannes@...xchg.org>,
Thomas Graf <tgraf@...g.ch>,
"Serge E. Hallyn" <serue@...ibm.com>,
Paul Mackerras <paulus@...ba.org>,
Ingo Molnar <mingo@...hat.com>,
Arnaldo Carvalho de Melo <acme@...stprotocols.net>,
Neil Horman <nhorman@...driver.com>,
"Aneesh Kumar K.V" <aneesh.kumar@...ux.vnet.ibm.com>
Subject: Re: [RFC] cgroup TODOs
On Thu 13-09-12 13:58:27, Tejun Heo wrote:
[...]
> 2. memcg's __DEPRECATED_clear_css_refs
>
> This is a remnant of another weird design decision of requiring
> synchronous draining of refcnts on cgroup removal and allowing
> subsystems to veto cgroup removal - what's the userspace supposed to
> do afterwards? Note that this also hinders co-mounting different
> controllers.
>
> The behavior could be useful for development and debugging but it
> unnecessarily interlocks userland visible behavior with in-kernel
> implementation details. To me, it seems outright wrong (either
> implement proper severing semantics in the controller or do full
> refcnting) and disallows, for example, lazy drain of caching refs.
> Also, it complicates the removal path with try / commit / revert
> logic which has never been fully correct since the beginning.
>
> Currently, the only left user is memcg.
>
> Solution:
>
> * Update memcg->pre_destroy() such that it never fails.
>
> * Drop __DEPRECATED_clear_css_refs and all related logic.
> Convert pre_destroy() to return void.
>
> Who:
>
> KAMEZAWA, Michal, PLEASE. I will make __DEPRECATED_clear_css_refs
> trigger WARN sooner or later. Let's please get this settled.
I think we are almost there. One big step was that we no longer charge
to the parent and only move statistics but there are still some corner
cases when we race with LRU handling.
[...]
> * memcg can be fully hierarchical but we need to phase out the flat
> hierarchy support. Unfortunately, this involves flipping the
> behavior for the existing users. Upstream will try to nudge users
> with warning messages. Most burden would be on the distros and at
> least SUSE seems to be on board with it. Needs coordination with
> other distros.
I am currently planning to add a warning to most of the currenly
maintained distributions to have as big coverage as possible. No default
switch for obvious reasons but hopefuly we will get some feedback at
least.
Thanks Tejun for doing this. We needed it for a long time.
--
Michal Hocko
SUSE Labs
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists