[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <20231207002759.51418-8-gregory.price@memverge.com>
Date: Wed, 6 Dec 2023 19:27:55 -0500
From: Gregory Price <gourry.memverge@...il.com>
To: linux-mm@...ck.org, jgroves@...ron.com, ravis.opensrc@...ron.com,
sthanneeru@...ron.com, emirakhur@...ron.com, Hasan.Maruf@....com
Cc: linux-doc@...r.kernel.org, linux-fsdevel@...r.kernel.org,
linux-api@...r.kernel.org, linux-arch@...r.kernel.org,
linux-kernel@...r.kernel.org, akpm@...ux-foundation.org,
arnd@...db.de, tglx@...utronix.de, luto@...nel.org,
mingo@...hat.com, bp@...en8.de, dave.hansen@...ux.intel.com,
x86@...nel.org, hpa@...or.com, mhocko@...nel.org, tj@...nel.org,
ying.huang@...el.com, gregory.price@...verge.com, corbet@....net,
rakie.kim@...com, hyeongtak.ji@...com, honggyu.kim@...com,
vtavarespetr@...ron.com, peterz@...radead.org,
Frank van der Linden <fvdl@...gle.com>
Subject: [RFC PATCH 07/11] mm/mempolicy: add userland mempolicy arg structure
This patch adds the new user-api argument structure intended for
set_mempolicy2 and mbind2.
struct mpol_args {
/* Basic mempolicy settings */
unsigned short mode;
unsigned short mode_flags;
unsigned long *pol_nodes;
unsigned long pol_maxnodes;
/* get_mempolicy2: policy information (e.g. next interleave node) */
int policy_node;
/* get_mempolicy2: memory range policy */
unsigned long addr;
int addr_node;
/* all operations: policy home node */
unsigned long home_node;
/* mbind2: address ranges to apply the policy */
const struct iovec __user *vec;
size_t vlen;
};
This structure is intended to be extensible as new mempolicy extensions
are added.
For example, set_mempolicy_home_node was added to allow vma mempolicies
to have a preferred/home node assigned. This structure allows the
addition of that setting at the time the mempolicy is set, rather
than requiring additional calls to modify the policy.
Another suggested extension is to allow mbind2 to operate on multiple
memory ranges with a single call. mbind presently operates on a single
(address, length) tuple. It was suggested that mbind2 should operate
on an iovec, which allows many memory ranges to have the same mempolicy
applied to it with a single system call.
Full breakdown of arguments as of this patch:
mode: Mempolicy mode (MPOL_DEFAULT, MPOL_INTERLEAVE)
mode_flags: Flags previously or'd into mode in set_mempolicy
(e.g.: MPOL_F_STATIC_NODES, MPOL_F_RELATIVE_NODES)
pol_nodes: Policy nodemask
pol_maxnodes: Max number of nodes in the policy nodemask
policy_node: for get_mempolicy2. Returns extended information
about a policy that was previously reported by
passing MPOL_F_NODE to get_mempolicy. Instead of
overriding the mode value, simply add a field.
addr: for get_mempolicy2. Used with MPOL_F_ADDR to run
get_mempolicy against the vma the address belongs
to instead of the task.
addr_node: for get_mempolicy2. Returns the node the address
belongs to. Previously get_mempolicy() would
override the output value of (mode) if MPOL_F_ADDR
and MPOL_F_NODE were set. Instead, we extend
mpol_args to do this by default if MPOL_F_ADDR is
set and do away with MPOL_F_NODE.
vec/vlen: Used by mbind2 to apply the mempolicy to all
address ranges described by the iovec.
Suggested-by: Frank van der Linden <fvdl@...gle.com>
Suggested-by: Vinicius Tavares Petrucci <vtavarespetr@...ron.com>
Suggested-by: Hasan Al Maruf <Hasan.Maruf@....com>
Signed-off-by: Gregory Price <gregory.price@...verge.com>
Co-developed-by: Vinicius Tavares Petrucci <vtavarespetr@...ron.com>
Signed-off-by: Vinicius Tavares Petrucci <vtavarespetr@...ron.com>
---
.../admin-guide/mm/numa_memory_policy.rst | 31 +++++++++++++++++++
include/uapi/linux/mempolicy.h | 18 +++++++++++
2 files changed, 49 insertions(+)
diff --git a/Documentation/admin-guide/mm/numa_memory_policy.rst b/Documentation/admin-guide/mm/numa_memory_policy.rst
index b7b8d3dd420f..6d645519c2c1 100644
--- a/Documentation/admin-guide/mm/numa_memory_policy.rst
+++ b/Documentation/admin-guide/mm/numa_memory_policy.rst
@@ -488,6 +488,37 @@ closest to which page allocation will come from. Specifying the home node overri
the default allocation policy to allocate memory close to the local node for an
executing CPU.
+Extended Mempolicy Arguments::
+
+ struct mpol_args {
+ /* Basic mempolicy settings */
+ unsigned short mode;
+ unsigned short mode_flags;
+ unsigned long *pol_nodes;
+ unsigned long pol_maxnodes;
+
+ /* get_mempolicy2: policy node information */
+ int policy_node;
+
+ /* get_mempolicy2: memory range policy */
+ unsigned long addr;
+ int addr_node;
+
+ /* mbind2: policy home node */
+ unsigned long home_node;
+
+ /* mbind2: address ranges to apply the policy */
+ struct iovec *vec;
+ size_t vlen;
+ };
+
+The extended mempolicy argument structure is defined to allow the mempolicy
+interfaces future extensibility without the need for additional system calls.
+
+The core arguments (mode, mode_flags, pol_nodes, and pol_maxnodes) apply to
+all interfaces relative to their non-extended counterparts. Each additional
+field may only apply to specific extended interfaces. See the respective
+extended interface man page for more details.
Memory Policy Command Line Interface
====================================
diff --git a/include/uapi/linux/mempolicy.h b/include/uapi/linux/mempolicy.h
index 1f9bb10d1a47..e6b50903047c 100644
--- a/include/uapi/linux/mempolicy.h
+++ b/include/uapi/linux/mempolicy.h
@@ -27,6 +27,24 @@ enum {
MPOL_MAX, /* always last member of enum */
};
+struct mpol_args {
+ /* Basic mempolicy settings */
+ unsigned short mode;
+ unsigned short mode_flags;
+ unsigned long *pol_nodes;
+ unsigned long pol_maxnodes;
+ /* get_mempolicy: policy node information */
+ int policy_node;
+ /* get_mempolicy: memory range policy */
+ unsigned long addr;
+ int addr_node;
+ /* mbind2: policy home node */
+ int home_node;
+ /* mbind2: address ranges to apply the policy */
+ struct iovec *vec;
+ size_t vlen;
+};
+
/* Flags for set_mempolicy */
#define MPOL_F_STATIC_NODES (1 << 15)
#define MPOL_F_RELATIVE_NODES (1 << 14)
--
2.39.1
Powered by blists - more mailing lists