[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <alpine.DEB.2.02.1311261931210.5973@chino.kir.corp.google.com>
Date: Tue, 26 Nov 2013 19:33:12 -0800 (PST)
From: David Rientjes <rientjes@...gle.com>
To: Johannes Weiner <hannes@...xchg.org>
cc: Andrew Morton <akpm@...ux-foundation.org>,
Michal Hocko <mhocko@...e.cz>, linux-mm@...ck.org,
cgroups@...r.kernel.org, linux-kernel@...r.kernel.org
Subject: Re: [patch] mm: memcg: do not declare OOM from __GFP_NOFAIL
allocations
On Tue, 26 Nov 2013, David Rientjes wrote:
> On Fri, 22 Nov 2013, Johannes Weiner wrote:
>
> > diff --git a/mm/memcontrol.c b/mm/memcontrol.c
> > index 13b9d0f..cc4f9cb 100644
> > --- a/mm/memcontrol.c
> > +++ b/mm/memcontrol.c
> > @@ -2677,6 +2677,9 @@ static int __mem_cgroup_try_charge(struct mm_struct *mm,
> > if (unlikely(task_in_memcg_oom(current)))
> > goto bypass;
> >
> > + if (gfp_mask & __GFP_NOFAIL)
> > + oom = false;
> > +
> > /*
> > * We always charge the cgroup the mm_struct belongs to.
> > * The mm_struct's mem_cgroup changes on task migration if the
>
> Sorry, I don't understand this. What happens in the following scenario:
>
> - memory.usage_in_bytes == memory.limit_in_bytes,
>
> - memcg reclaim fails to reclaim memory, and
>
> - all processes (perhaps only one) attached to the memcg are doing one of
> the over dozen __GFP_NOFAIL allocations in the kernel?
>
> How do we make forward progress if you cannot oom kill something?
>
Ah, this is because of 3168ecbe1c04 ("mm: memcg: use proper memcg in limit
bypass") which just bypasses all of these allocations and charges the root
memcg. So if allocations want to bypass memcg isolation they just have to
be __GFP_NOFAIL?
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists