lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Message-Id: <1360060336-16498-1-git-send-email-chenggang.qin@gmail.com>
Date:	Tue,  5 Feb 2013 18:32:16 +0800
From:	chenggang <chenggang.qin@...il.com>
To:	linux-kernel@...r.kernel.org
Cc:	chenggang <chenggang.qin@...il.com>, chenggang.qcg@...bao.com,
	Peter Zijlstra <peterz@...radead.org>,
	Paul Mackerras <paulus@...ba.org>,
	Ingo Molnar <mingo@...hat.com>,
	Arnaldo Carvalho de Melo <acme@...stprotocols.net>,
	David Ahern <dsahern@...il.com>,
	Arjan van de Ven <arjan@...ux.intel.com>,
	Namhyung Kim <namhyung@...il.com>,
	Yanmin Zhang <yanmin.zhang@...el.com>,
	Wu Fengguang <fengguang.wu@...el.com>,
	Mike Galbraith <efault@....de>,
	Andrew Morton <akpm@...ux-foundation.org>
Subject: [PATCH v2] Add a python script to statistic direct io behavior

From: chenggang.qcg@...bao.com

The last version of this patch need to introduce 2 new tracepoint events in VFS,
but introduce new tracepoint events into VFS is not a clever idea. So, I modified
this patch, and only use a existing tracepoint event (ext4:ext4_direct_IO_exit).

If the engineers want to analyze the direct io behavior of some applications
without source code, perf tools with some appropriate tracepoints events in the
VFS subsystem are excellent choice.

Many database systems use their own page cache subsystems and use the direct IO
to access the disks. Sometimes, the system engineers need to know the misses rate
of the database system's page cache. This requirements can be satisfied by recording
the database's file access behavior through the way of direct IO. So, we use
tracepoint event, ext4:ext4_direct_IO_exit, to record the system wide's direct IO behavior.
The script direct-io.py are introduced by this patch can record the tracepoint events,
ext4:ext4_direct_IO_exit, analyse the sample data, and give a concise report.

usage:
        "perf script record direct-io\n"
        "perf script report direct-io [comm|pid]\n"

Cc: Peter Zijlstra <peterz@...radead.org>
Cc: Paul Mackerras <paulus@...ba.org>
Cc: Ingo Molnar <mingo@...hat.com>
Cc: Arnaldo Carvalho de Melo <acme@...stprotocols.net>
Cc: David Ahern <dsahern@...il.com>
Cc: Arjan van de Ven <arjan@...ux.intel.com>
Cc: Namhyung Kim <namhyung@...il.com>
Cc: Yanmin Zhang <yanmin.zhang@...el.com>
Cc: Wu Fengguang <fengguang.wu@...el.com>
Cc: Mike Galbraith <efault@....de>
Cc: Andrew Morton <akpm@...ux-foundation.org>
Signed-off-by: Chenggang Qin <chenggang.qcg@...bao.com>

---
 tools/perf/scripts/python/bin/direct-io-record |    2 +
 tools/perf/scripts/python/bin/direct-io-report |   21 +++
 tools/perf/scripts/python/direct-io.py         |  197 ++++++++++++++++++++++++
 3 files changed, 220 insertions(+)
 create mode 100755 tools/perf/scripts/python/bin/direct-io-record
 create mode 100644 tools/perf/scripts/python/bin/direct-io-report
 create mode 100644 tools/perf/scripts/python/direct-io.py

diff --git a/tools/perf/scripts/python/bin/direct-io-record b/tools/perf/scripts/python/bin/direct-io-record
new file mode 100755
index 0000000..f38d5fc
--- /dev/null
+++ b/tools/perf/scripts/python/bin/direct-io-record
@@ -0,0 +1,2 @@
+#!/bin/bash
+perf record -e ext4:ext4_direct_IO_exit  $@
diff --git a/tools/perf/scripts/python/bin/direct-io-report b/tools/perf/scripts/python/bin/direct-io-report
new file mode 100644
index 0000000..828d9c6
--- /dev/null
+++ b/tools/perf/scripts/python/bin/direct-io-report
@@ -0,0 +1,21 @@
+#!/bin/bash
+# description: direct_io statistic
+# args: [comm|pid]
+n_args=0
+for i in "$@"
+do
+    if expr match "$i" "-" > /dev/null ; then
+	break
+    fi
+    n_args=$(( $n_args + 1 ))
+done
+if [ "$n_args" -gt 1 ] ; then
+    echo "usage: perf script report direct-io [comm|pid]"
+    exit
+fi
+
+if [ "$n_args" -gt 0 ] ; then
+    comm=$1
+    shift
+fi
+perf script $@ -s "$PERF_EXEC_PATH"/scripts/python/direct-io.py $comm
diff --git a/tools/perf/scripts/python/direct-io.py b/tools/perf/scripts/python/direct-io.py
new file mode 100644
index 0000000..b609e95
--- /dev/null
+++ b/tools/perf/scripts/python/direct-io.py
@@ -0,0 +1,197 @@
+# direct IO counts
+# (c) 2013, Chenggang Qin <chenggang.qcg@...baba-inc.com>
+# Licensed under the terms of the GNU GPL License version 2
+
+# Displays system-wide file direct IO behavior.
+# It helps us to investigate which processes trigger a direct IO,
+# and what files are accessed by these processes.
+#
+# options
+# comm, pid: show details of the file r/w behavior of a special process.
+
+import os, sys
+
+sys.path.append(os.environ['PERF_EXEC_PATH'] + \
+	'/scripts/python/Perf-Trace-Util/lib/Perf/Trace')
+
+from perf_trace_context import *
+from Core import *
+from Util import *
+
+MINORBITS = 20
+MINORMASK = ((1 << MINORBITS) - 1)
+
+usage = "perf script record direct-io\n" \
+	"perf script report direct-io [comm|pid]\n"
+
+for_comm = None
+for_pid = None
+pid_2_comm = None
+
+if len(sys.argv) > 2:
+	sys.exit(usage)
+
+if len(sys.argv) > 1:
+	try:
+		for_pid = int(sys.argv[1])
+	except:
+		for_comm = sys.argv[1]
+
+direct_write = autodict()
+direct_read = autodict()
+
+direct_write_bytes = autodict()
+direct_read_bytes = autodict()
+
+comm_read_info = autodict()
+comm_write_info = autodict()
+
+wevent_count = 0
+revent_count = 0
+
+comm_revent_count = 0;
+comm_wevent_count = 0;
+
+def MAJOR(dev):
+	return (dev) >> MINORBITS
+
+def MINOR(dev):
+	return (dev) & MINORMASK
+
+def trace_begin():
+	print "Press control+C to stop and show the summary"
+
+def trace_end():
+	if (for_comm is not None) or (for_pid is not None):
+		print_direct_io_event_for_comm()
+	else:
+		print_direct_io_event_totals()
+
+def ext4__ext4_direct_IO_exit(event_name, context, common_cpu,
+	common_secs, common_nsecs, common_pid, common_comm,
+	ino, dev, pos, len, rw, ret):
+	global wevent_count
+	global comm_wevent_count
+	global pid_2_comm
+
+	if rw is 1: #write
+		if (for_comm is not None) or (for_pid is not None):
+			if (common_comm != for_comm) and (common_pid != for_pid):
+				return
+			else:
+				if (pid_2_comm is None) and (for_pid is not None):
+					pid_2_comm = common_comm
+				comm_write_info[comm_wevent_count] = (common_pid, \
+						common_cpu, common_secs, common_nsecs, \
+						ino, dev, pos, ret)
+				comm_wevent_count += 1
+				return
+
+		wevent_count += 1
+		try:
+			direct_write[(common_pid, common_comm, ino, dev)] += 1
+			direct_write_bytes[(common_pid, common_comm, ino, dev)] += ret
+		except TypeError:
+			direct_write[(common_pid, common_comm, ino, dev)] = 1
+			direct_write_bytes[(common_pid, common_comm, ino, dev)] = ret
+
+	elif rw is 0: #read
+		if (for_comm is not None) or (for_pid is not None):
+			if (common_comm != for_comm) and (common_pid != for_pid):
+				return
+			else:
+				if (pid_2_comm is None) and (for_pid is not None):
+					pid_2_comm = common_comm
+				comm_read_info[comm_revent_count] = (common_pid, \
+						common_cpu, common_secs, common_nsecs, \
+						ino, dev, pos, ret)
+				comm_revent_count += 1
+				return
+		try:
+			revent_count += 1
+			direct_read[(common_pid, common_comm, ino, dev)] += 1
+			direct_read_bytes[(common_pid, common_comm, ino, dev)] += ret
+		except TypeError:
+			direct_read[(common_pid, common_comm, ino, dev)] = 1
+			direct_read_bytes[(common_pid, common_comm, ino, dev)] = ret
+
+def print_direct_io_event_totals():
+	clear_term()
+	print "\nDirect IO:\n\n",
+	print "All read events: %d\n" % (revent_count)
+	print "%5s %20s %10s %10s %10s %10s\n" % ("pid", "comm", "inode no", "dev", \
+	      "count", "Bytes"),
+	print "%5s %20s %10s %10s %10s %10s\n" % ("-----", "----------",\
+					      "----------", "----------", \
+					      "----------", "----------"),
+
+	for tuple, val in sorted(direct_read.iteritems(), key = lambda(k, v): (v, k), \
+				 reverse = True):
+		try:
+			print "%5d  %20s %10s %4d.%4d %10d %10d\n" % \
+			      (tuple[0], tuple[1], tuple[2], MAJOR(tuple[3]), MINOR(tuple[3]), \
+			      val, direct_read_bytes[tuple[0], tuple[1], tuple[2], tuple[3]])
+		except TypeError:
+			pass
+
+	print "\n\n"
+
+	print "All write events: %d\n" % (wevent_count)
+	print "%5s %20s %10s %10s %10s %10s\n" % ("pid", "comm", "inode no", "dev", \
+	      "count", "Bytes"),
+	print "%5s %20s %10s %10s %10s %10s\n" % ("-----", "----------",\
+					      "----------", "----------", \
+					      "----------", "----------"),
+
+	for tuple, val in sorted(direct_write.iteritems(), key = lambda(k, v): (v, k), \
+				reverse = True):
+		try:
+			print "%5d  %20s %10s %4d.%4d %10d %10d\n" % \
+			      (tuple[0], tuple[1], tuple[2], MAJOR(tuple[3]), MINOR(tuple[3]), \
+			      val, direct_write_bytes[tuple[0], tuple[1], tuple[2], tuple[3]])
+		except TypeError:
+			pass
+
+def print_direct_io_event_for_comm():
+	if for_comm is not None:
+		print "Direct IO record for %s:\n" % (for_comm)
+	else:
+		print "Direct IO record for %s (pid: %d):\n" % (pid_2_comm, for_pid)
+	print "Number of read: %d\n" % (comm_revent_count)
+	print "%5s %3s %16s %10s %10s %10s %10s\n" % ("pid", "cpu", "timestamp", \
+						"inode no", "dev", "position", "Bytes")
+	print "%5s %3s %16s %10s %10s %10s %10s\n" % ("-----", "---", "----------", \
+						 "----------", "----------", \
+						 "----------", "----------"),
+
+	for i in range(0, comm_revent_count):
+		try:
+			print "%5d %3d %6d.%09d %10s %4d.%-4d %10s %10s\n" % \
+				(comm_read_info[i][0], comm_read_info[i][1], \
+				comm_read_info[i][2], comm_read_info[i][3], \
+				comm_read_info[i][4], MAJOR(comm_read_info[i][5]), \
+				MINOR(comm_read_info[i][5]), comm_read_info[i][6], \
+				comm_read_info[i][7])
+		except TypeError:
+			pass
+
+	print "\nNumber of write: %d\n" % (comm_wevent_count)
+	for i in range(0, comm_wevent_count):
+		if i % 20 == 0:
+			print "\n%5s %3s %16s %10s %10s %10s %10s\r" %  \
+				("pid", "cpu", "timestamp", "inode no", \
+				"dev", "position", "Bytes")
+			print "%5s %3s %16s %10s %10s %10s %10s\n" % ("-----", "---", 
+							"----------", "----------",\
+							"----------", "----------",\
+							"----------"),
+		try:
+			print "%5d %3d %6d.%09d %10s %4d.%-4d %10s %10s\r" % \
+				(comm_write_info[i][0], comm_write_info[i][1], \
+				comm_write_info[i][2], comm_write_info[i][3], \
+				comm_write_info[i][4], MAJOR(comm_write_info[i][5]), \
+				MINOR(comm_write_info[i][5]), comm_write_info[i][6], \
+				comm_write_info[i][7])
+		except TypeError:
+			pass
+
-- 
1.7.9.5

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ