[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20201209011124.GA31164@paulmck-ThinkPad-P72>
Date: Tue, 8 Dec 2020 17:11:24 -0800
From: "Paul E. McKenney" <paulmck@...nel.org>
To: rcu@...r.kernel.org
Cc: linux-kernel@...r.kernel.org, kernel-team@...com, mingo@...nel.org,
jiangshanlai@...il.com, akpm@...ux-foundation.org,
mathieu.desnoyers@...icios.com, josh@...htriplett.org,
tglx@...utronix.de, peterz@...radead.org, rostedt@...dmis.org,
dhowells@...hat.com, edumazet@...gle.com, fweisbec@...il.com,
oleg@...hat.com, joel@...lfernandes.org
Subject: [PATCH RFC v2 sl-b] Export return addresses etc. for better
diagnostics
Hello!
This is v2 of the series the improves diagnostics by providing access to
additional information including the return addresses collected by the
sl*b allocators and by vmalloc(). If the allocator is not configured
to collect this information, the diagnostics fall back to a reasonable
approximation of their earlier state.
One use case is the queue_rcu_work() function, which might be used
by any number of kernel subsystems. If the caller does back-to-back
invocations of queue_rcu_work(), this constitutes a double-free bug,
and (if so configured) the debug-objects system will flag this, printing
the callback function. In most cases, printing this function suffices.
However, for double-free bugs involving queue_rcu_work(), the RCU callback
function will always be rcu_work_rcufn(), which provides almost no help to
the poor person trying to find this double-free bug. The return address
from the allocator of the memory containing the rcu_work structure can
provide an additional valuable clue.
Another use case is the percpu_ref_switch_to_atomic_rcu() function,
which detects percpu_ref reference-count underflow. Unfortunately,
the only data that this function has access to doesn't have much in the
way of identifying characteristics. Yes, it might be possible to gain
more information from a crash dump, but it is more convenient for the
needed hints to be in the console log.
Unfortunately, printing the return address in this case is of little help
because this object is allocated from percpu_ref_init(), regardless of
what part of the kernel is responsible for the reference-count underflow.
However, CONFIG_STACKTRACE=y kernels (such as those enabling ftrace)
using slub with debugging enabled also collect stack traces. This series
therefore also provides a way of extracting these stack traces to provide
additional information to those debugging percpu_ref reference-count
underflows.
The patches are as follows:
1. Add mem_dump_obj() to print source of memory block.
2. Make mem_dump_obj() handle NULL and zero-sized pointers.
3. Make mem_dump_obj() handle vmalloc() memory.
4. Make call_rcu() print mem_dump_obj() info for double-freed
callback.
5. Dump mem_dump_obj() info upon reference-count underflow.
Thanx, Paul
Changes since v1:
o Apply feedback from Joonsoo Kim, mostly around naming and
code structure.
o Apply fix suggested by Stephen Rothwell for a bug that was
also located by kbuild test robot.
o Add support for vmalloc().
o Add support for special pointers.
o Additional rework simplifying use of mem_dump_obj(), which
simplifies both the RCU and the percpu_ref uses.
------------------------------------------------------------------------
include/linux/mm.h | 2 +
include/linux/slab.h | 2 +
include/linux/vmalloc.h | 6 ++++
kernel/rcu/tree.c | 7 +++-
lib/percpu-refcount.c | 12 ++++++--
mm/slab.c | 28 +++++++++++++++++++
mm/slab.h | 11 +++++++
mm/slab_common.c | 69 ++++++++++++++++++++++++++++++++++++++++++++++++
mm/slob.c | 7 ++++
mm/slub.c | 40 +++++++++++++++++++++++++++
mm/util.c | 44 ++++++++++++++++++++++++++----
mm/vmalloc.c | 12 ++++++++
12 files changed, 229 insertions(+), 11 deletions(-)
Powered by blists - more mailing lists