[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <5166F062.2090007@infradead.org>
Date: Thu, 11 Apr 2013 10:18:26 -0700
From: Randy Dunlap <rdunlap@...radead.org>
To: "Paul E. McKenney" <paulmck@...ux.vnet.ibm.com>
CC: linux-kernel@...r.kernel.org, mingo@...e.hu, laijs@...fujitsu.com,
dipankar@...ibm.com, akpm@...ux-foundation.org,
mathieu.desnoyers@...ymtl.ca, josh@...htriplett.org,
niv@...ibm.com, tglx@...utronix.de, peterz@...radead.org,
rostedt@...dmis.org, Valdis.Kletnieks@...edu, dhowells@...hat.com,
edumazet@...gle.com, darren@...art.com, fweisbec@...il.com,
sbw@....edu, Borislav Petkov <bp@...en8.de>,
Arjan van de Ven <arjan@...ux.intel.com>,
Kevin Hilman <khilman@...aro.org>,
Christoph Lameter <cl@...ux.com>
Subject: Re: [PATCH documentation 2/2] kthread: Document ways of reducing
OS jitter due to per-CPU kthreads
On 04/11/2013 09:05 AM, Paul E. McKenney wrote:
> From: "Paul E. McKenney" <paulmck@...ux.vnet.ibm.com>
>
> The Linux kernel uses a number of per-CPU kthreads, any of which might
> contribute to OS jitter at any time. The usual approach to normal
> kthreads, namely to affinity them to a "housekeeping" CPU, does not
ugh. to affine them
> work with these kthreads because they cannot operate correctly if moved
> to some other CPU. This commit therefore lists ways of controlling OS
> jitter from the Linux kernel's per-CPU kthreads.
>
> Signed-off-by: Paul E. McKenney <paulmck@...ux.vnet.ibm.com>
> Cc: Frederic Weisbecker <fweisbec@...il.com>
> Cc: Steven Rostedt <rostedt@...dmis.org>
> Cc: Borislav Petkov <bp@...en8.de>
> Cc: Arjan van de Ven <arjan@...ux.intel.com>
> Cc: Kevin Hilman <khilman@...aro.org>
> Cc: Christoph Lameter <cl@...ux.com>
> ---
> Documentation/kernel-per-CPU-kthreads.txt | 159 ++++++++++++++++++++++++++++++
> 1 file changed, 159 insertions(+)
> create mode 100644 Documentation/kernel-per-CPU-kthreads.txt
>
> diff --git a/Documentation/kernel-per-CPU-kthreads.txt b/Documentation/kernel-per-CPU-kthreads.txt
> new file mode 100644
> index 0000000..495dacf
> --- /dev/null
> +++ b/Documentation/kernel-per-CPU-kthreads.txt
> @@ -0,0 +1,159 @@
> +REDUCING OS JITTER DUE TO PER-CPU KTHREADS
> +
> +This document lists per-CPU kthreads in the Linux kernel and presents
> +options to control OS jitter due to these kthreads. Note that kthreads
> +that are not per-CPU are not listed here -- to reduce OS jitter from
> +non-per-CPU kthreads, bind them to a "housekeeping" CPU that is dedicated
> +to such work.
> +
> +
> +Name: ehca_comp/%u
> +Purpose: Periodically process Infiniband-related work.
> +To reduce corresponding OS jitter, do any of the following:
> +1. Don't use EHCA Infiniband hardware. This will prevent these
> + kthreads from being created in the first place. (This will
> + work for most people, as this hardware, though important,
> + is relatively old as is produced in relatively low unit
> + volumes.)
> +2. Do all EHCA-Infiniband-related work on other CPUs, including
> + interrupts.
> +
> +
> +Name: irq/%d-%s
> +Purpose: Handle threaded interrupts.
> +To reduce corresponding OS jitter, do the following:
> +1. Use irq affinity to force the irq threads to execute on
> + some other CPU.
It would be very nice to explain here how that is done.
> +
> +Name: kcmtpd_ctr_%d
> +Purpose: Handle Bluetooth work.
> +To reduce corresponding OS jitter, do one of the following:
> +1. Don't use Bluetooth, in cwhich case these kthreads won't be
which
> + created in the first place.
> +2. Use irq affinity to force Bluetooth-related interrupts to
> + occur on some other CPU and furthermore initiate all
> + Bluetooth activity from some other CPU.
> +
> +Name: ksoftirqd/%u
> +Purpose: Execute softirq handlers when threaded or when under heavy load.
> +To reduce corresponding OS jitter, each softirq vector must be handled
> +separately as follows:
> +TIMER_SOFTIRQ:
> +1. Build with CONFIG_HOTPLUG_CPU=y.
> +2. To the extent possible, keep the CPU out of the kernel when it
I guess I have a different viewpoint. I would say: keep the kernel
off of that CPU ....
> + is non-idle, for example, by forcing user and kernel threads as
> + well as interrupts to execute elsewhere.
> +3. Force the CPU offline, then bring it back online. This forces
> + recurring timers to migrate elsewhere. If you are concerned
> + with multiple CPUs, force them all offline before bringing the
> + first one back online.
> +NET_TX_SOFTIRQ and NET_RX_SOFTIRQ: Do all of the following:
> +1. Force networking interrupts onto other CPUs.
> +2. Initiate any network I/O on other CPUs.
> +3. Prevent CPU-hotplug operations from being initiated from tasks
> + that might run on the CPU to be de-jittered.
> +BLOCK_SOFTIRQ: Do all of the following:
> +1. Force block-device interrupts onto some other CPU.
> +2. Initiate any block I/O on other CPUs.
> +3. Prevent CPU-hotplug operations from being initiated from tasks
> + that might run on the CPU to be de-jittered.
> +BLOCK_IOPOLL_SOFTIRQ: Do all of the following:
> +1. Force block-device interrupts onto some other CPU.
> +2. Initiate any block I/O and block-I/O polling on other CPUs.
> +3. Prevent CPU-hotplug operations from being initiated from tasks
> + that might run on the CPU to be de-jittered.
> +TASKLET_SOFTIRQ: Do one or more of the following:
> +1. Avoid use of drivers that use tasklets.
> +2. Convert all drivers that you must use from tasklets to workqueues.
> +3. Force interrupts for drivers using tasklets onto other CPUs,
> + and also do I/O involving these drivers on other CPUs.
> +SCHED_SOFTIRQ: Do all of the following:
> +1. Avoid sending scheduler IPIs to the CPU to be de-jittered,
> + for example, ensure that at most one runnable kthread is
> + present on that CPU. If a thread awakens that expects
> + to run on the de-jittered CPU, the scheduler will send
> + an IPI that can result in a subsequent SCHED_SOFTIRQ.
> +2. Build with CONFIG_RCU_NOCB_CPU=y, CONFIG_RCU_NOCB_CPU_ALL=y,
> + CONFIG_NO_HZ_EXTENDED=y, and in addition ensure that the CPU
> + to be de-jittered is marked as an adaptive-ticks CPU using the
> + "nohz_extended=" boot parameter. This reduces the number of
> + scheduler-clock interrupts that the de-jittered CPU receives,
> + minimizing its chances of being selected to do load balancing,
> + which happens in SCHED_SOFTIRQ context.
> +3. To the extent possible, keep the CPU out of the kernel when it
same viewpoint point.
> + is non-idle, for example, by forcing user and kernel threads as
> + well as interrupts to execute elsewhere. This further reduces
> + the number of scheduler-clock interrupts that the de-jittered
> + CPU receives.
> +HRTIMER_SOFTIRQ: Do all of the following:
> +1. Build with CONFIG_HOTPLUG_CPU=y.
> +2. To the extent possible, keep the CPU out of the kernel when it
> + is non-idle, for example, by forcing user and kernel threads as
> + well as interrupts to execute elsewhere.
> +3. Force the CPU offline, then bring it back online. This forces
> + recurring timers to migrate elsewhere. If you are concerned
> + with multiple CPUs, force them all offline before bringing the
> + first one back online.
> +RCU_SOFTIRQ: Do at least one of the following:
> +1. Offload callbacks and keep the CPU in either dyntick-idle or
> + adaptive-ticks state by doing all of the following:
> + a. Build with CONFIG_RCU_NOCB_CPU=y, CONFIG_RCU_NOCB_CPU_ALL=y,
> + CONFIG_NO_HZ_EXTENDED=y, and in addition ensure that
> + the CPU to be de-jittered is marked as an adaptive-ticks CPU
> + using the "nohz_extended=" boot parameter.
> + b. To the extent possible, keep the CPU out of the kernel
viewpoint?
> + when it is non-idle, for example, by forcing user and
> + kernel threads as well as interrupts to execute elsewhere.
> +2. Enable RCU to do its processing remotely via dyntick-idle by
> + doing all of the following:
> + a. Build with CONFIG_NO_HZ=y and CONFIG_RCU_FAST_NO_HZ=y.
> + b. To the extent possible, keep the CPU out of the kernel
viewpoint?
> + when it is non-idle, for example, by forcing user and
> + kernel threads as well as interrupts to execute elsewhere.
> + c. Ensure that the CPU goes idle frequently, allowing other
> + CPUs to detect that it has passed through an RCU
> + quiescent state.
> +
> +Name: rcuc/%u
> +Purpose: Execute RCU callbacks in CONFIG_RCU_BOOST=y kernels.
> +To reduce corresponding OS jitter, do at least one of the following:
> +1. Build the kernel with CONFIG_PREEMPT=n. This prevents these
> + kthreads from being created in the first place, and also prevents
> + RCU priority boosting from ever being required. This approach
> + is feasible for workloads that do not require high degrees of
> + responsiveness.
> +2. Build the kernel with CONFIG_RCU_BOOST=n. This prevents these
> + kthreads from being created in the first place. This approach
> + is feasible only if your workload never requires RCU priority
> + boosting, for example, if you ensure ample idle time on all CPUs
> + that might execute within the kernel.
> +3. Build with CONFIG_RCU_NOCB_CPU=y and CONFIG_RCU_NOCB_CPU_ALL=y,
> + which offloads all RCU callbacks to kthreads that can be moved
> + off of CPUs susceptible to OS jitter. This approach prevents the
> + rcuc/%u kthreads from having any work to do, and are therefore
> + never awakened.
> +4. Ensure that then CPU never enters the kernel and avoid any
the
viewpoint?
> + CPU hotplug operations. This is another way of preventing any
> + callbacks from being queued on the CPU, again preventing the
> + rcuc/%u kthreads from having any work to do.
> +
> +Name: rcuob/%d, rcuop/%d, and rcuos/%d
> +Purpose: Offload RCU callbacks from the corresponding CPU.
> +To reduce corresponding OS jitter, do at least one of the following:
> +1. Use affinity, cgroups, or other mechanism to force these kthreads
> + to execute on some other CPU.
> +2. Build with CONFIG_RCU_NOCB_CPUS=n, which will prevent these
> + kthreads from being created in the first place. However,
> + please note that this will not eliminate the corresponding
> + OS jitter, but will instead merely shift it to softirq.
> +
> +Name: watchdog/%u
> +Purpose: Detect software lockups on each CPU.
> +To reduce corresponding OS jitter, do at least one of the following:
> +1. Build with CONFIG_LOCKUP_DETECTOR=n, which will prevent these
> + kthreads from being created in the first place.
> +2. Echo a zero to /proc/sys/kernel/watchdog to disable the
> + watchdog timer.
> +3. Echo a large number of /proc/sys/kernel/watchdog_thresh in
> + order to reduce the frequency of OS jitter due to the watchdog
> + timer down to a level that is acceptable for your workload.
>
Reviewed-by: Randy Dunlap <rdunlap@...radead.org>
--
~Randy
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists