[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <1403289203-6371-3-git-send-email-paulmck@linux.vnet.ibm.com>
Date: Fri, 20 Jun 2014 11:33:21 -0700
From: "Paul E. McKenney" <paulmck@...ux.vnet.ibm.com>
To: linux-kernel@...r.kernel.org
Cc: mingo@...nel.org, laijs@...fujitsu.com, dipankar@...ibm.com,
akpm@...ux-foundation.org, mathieu.desnoyers@...icios.com,
josh@...htriplett.org, niv@...ibm.com, tglx@...utronix.de,
peterz@...radead.org, rostedt@...dmis.org, dhowells@...hat.com,
edumazet@...gle.com, dvhart@...ux.intel.com, fweisbec@...il.com,
oleg@...hat.com, sbw@....edu,
"Paul E. McKenney" <paulmck@...ux.vnet.ibm.com>,
Andi Kleen <ak@...ux.intel.com>,
Christoph Lameter <cl@...two.org>,
Mike Galbraith <umgwanakikbuti@...il.com>,
Eric Dumazet <eric.dumazet@...il.com>
Subject: [PATCH RFC tip/core/rcu 3/5] rcu: Add RCU_COND_RESCHED_QS for large systems
From: "Paul E. McKenney" <paulmck@...ux.vnet.ibm.com>
People with low-latency kernel-intensive workloads object to the
extra test of a per-CPU variable added to cond_resched(). This commit
therefore adds a Kconfig variable RCU_COND_RESCHED_QS that enables this
extra test. Therefore, people who would like large systems to operate
reliably can enable this Kconfig variable, while people with low-latency
kernel-intensive workloads can leave it disabled. People setting defaults
for Linux distributions should choose wisely. ;-)
Suggested-by: Christoph Lameter <cl@...two.org>
Suggested-by: Josh Triplett <josh@...htriplett.org>
Signed-off-by: Paul E. McKenney <paulmck@...ux.vnet.ibm.com>
Cc: Josh Triplett <josh@...htriplett.org>
Cc: Andi Kleen <ak@...ux.intel.com>
Cc: Christoph Lameter <cl@...two.org>
Cc: Mike Galbraith <umgwanakikbuti@...il.com>
Cc: Eric Dumazet <eric.dumazet@...il.com>
---
include/linux/rcutree.h | 15 +++++++++++++--
init/Kconfig | 26 ++++++++++++++++++++++++++
2 files changed, 39 insertions(+), 2 deletions(-)
diff --git a/include/linux/rcutree.h b/include/linux/rcutree.h
index ca7d34027935..69ccd4163de2 100644
--- a/include/linux/rcutree.h
+++ b/include/linux/rcutree.h
@@ -56,12 +56,23 @@ static inline bool rcu_should_resched(void)
}
/* Report quiescent states to RCU if it is time to do so. */
-static inline void rcu_cond_resched(void)
+static inline void __rcu_cond_resched(void)
{
if (unlikely(rcu_should_resched()))
rcu_resched();
}
+/*
+ * Report quiescent states to RCU in kernels that are not configured
+ * for low-latency kernel-intensive workloads.
+ */
+static inline void rcu_cond_resched(void)
+{
+#ifdef CONFIG_RCU_COND_RESCHED_QS
+ __rcu_cond_resched();
+#endif /* #ifdef CONFIG_RCU_COND_RESCHED_QS */
+}
+
/**
* cond_resched_rcu_qs - Report potential quiescent states to RCU
*
@@ -71,7 +82,7 @@ static inline void rcu_cond_resched(void)
*/
#define cond_resched_rcu_qs() \
do { \
- rcu_cond_resched(); \
+ __rcu_cond_resched(); \
cond_resched(); \
} while (0)
diff --git a/init/Kconfig b/init/Kconfig
index 9d76b99af1b9..6457a7f1f0be 100644
--- a/init/Kconfig
+++ b/init/Kconfig
@@ -781,6 +781,32 @@ config RCU_NOCB_CPU_ALL
endchoice
+config RCU_COND_RESCHED_QS
+ bool "Use cond_resched() calls as RCU quiescent states"
+ depends on TREE_RCU || TREE_PREEMPT_RCU
+ default n
+ help
+ Use this option to allow RCU grace periods to make progress
+ on large systems that can execute for extended time periods
+ in the kernel. Workloads that contain processes with large
+ numbers of open files, systems with large quantities of
+ memory, workloads that do mm-related system calls on large
+ regions, and systems with large numbers of mass-storage
+ devices are particularly prone to this behavior. Without
+ this option set, RCU CPU stall warnings or even OOMs can
+ result.
+
+ This option causes cond_resched() to check to see if the
+ current RCU grace period has been waiting for too long for
+ this CPU, and to emulate a zero-duration dyntick-idle
+ period if so. RCU's grace-period kthreads will then note
+ this dyntick-idle period and report a quiescent state to
+ the RCU core on this CPU's behalf, thus avoiding both RCU
+ CPU stall warnings and OOMs.
+
+ Say Y here for reliable operation on large systems.
+ Say N here for kernel-intensive low-latency workloads.
+
endmenu # "RCU Subsystem"
config IKCONFIG
--
1.8.1.5
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists