[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <YZiBH6GxlkFFuyqa@casper.infradead.org>
Date: Sat, 20 Nov 2021 05:01:19 +0000
From: Matthew Wilcox <willy@...radead.org>
To: Mina Almasry <almasrymina@...gle.com>
Cc: Jonathan Corbet <corbet@....net>,
Alexander Viro <viro@...iv.linux.org.uk>,
Andrew Morton <akpm@...ux-foundation.org>,
Johannes Weiner <hannes@...xchg.org>,
Michal Hocko <mhocko@...nel.org>,
Vladimir Davydov <vdavydov.dev@...il.com>,
Hugh Dickins <hughd@...gle.com>, Shuah Khan <shuah@...nel.org>,
Shakeel Butt <shakeelb@...gle.com>,
Greg Thelen <gthelen@...gle.com>,
Dave Chinner <david@...morbit.com>,
Roman Gushchin <guro@...com>, Theodore Ts'o <tytso@....edu>,
linux-kernel@...r.kernel.org, linux-fsdevel@...r.kernel.org,
linux-mm@...ck.org
Subject: Re: [PATCH v4 0/4] Deterministic charging of shared memory
On Fri, Nov 19, 2021 at 08:50:06PM -0800, Mina Almasry wrote:
> 1. One complication to address is the behavior when the target memcg
> hits its memory.max limit because of remote charging. In this case the
> oom-killer will be invoked, but the oom-killer may not find anything
> to kill in the target memcg being charged. Thera are a number of considerations
> in this case:
>
> 1. It's not great to kill the allocating process since the allocating process
> is not running in the memcg under oom, and killing it will not free memory
> in the memcg under oom.
> 2. Pagefaults may hit the memcg limit, and we need to handle the pagefault
> somehow. If not, the process will forever loop the pagefault in the upstream
> kernel.
>
> In this case, I propose simply failing the remote charge and returning an ENOSPC
> to the caller. This will cause will cause the process executing the remote
> charge to get an ENOSPC in non-pagefault paths, and get a SIGBUS on the pagefault
> path. This will be documented behavior of remote charging, and this feature is
> opt-in. Users can:
> - Not opt-into the feature if they want.
> - Opt-into the feature and accept the risk of received ENOSPC or SIGBUS and
> abort if they desire.
> - Gracefully handle any resulting ENOSPC or SIGBUS errors and continue their
> operation without executing the remote charge if possible.
Why is ENOSPC the right error instead of ENOMEM?
Powered by blists - more mailing lists