[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <20200623230803.3987674-1-yhs@fb.com>
Date: Tue, 23 Jun 2020 16:08:03 -0700
From: Yonghong Song <yhs@...com>
To: <bpf@...r.kernel.org>, <netdev@...r.kernel.org>
CC: Alexei Starovoitov <ast@...nel.org>,
Daniel Borkmann <daniel@...earbox.net>, <kernel-team@...com>,
Martin KaFai Lau <kafai@...com>
Subject: [PATCH bpf-next v5 00/15] implement bpf iterator for tcp and udp sockets
bpf iterator implments traversal of kernel data structures and these
data structures are passed to a bpf program for processing.
This gives great flexibility for users to examine kernel data
structure without using e.g. /proc/net which has limited and
fixed format.
Commit 138d0be35b14 ("net: bpf: Add netlink and ipv6_route bpf_iter targets")
implemented bpf iterators for netlink and ipv6_route.
This patch set intends to implement bpf iterators for tcp and udp.
Currently, /proc/net/tcp is used to print tcp4 stats and /proc/net/tcp6
is used to print tcp6 stats. /proc/net/udp[6] have similar usage model.
In contrast, only one tcp iterator is implemented and it is bpf program
resposibility to filter based on socket family. The same is for udp.
This will avoid another unnecessary traversal pass if users want
to check both tcp4 and tcp6.
Several helpers are also implemented in this patch
bpf_skc_to_{tcp, tcp6, tcp_timewait, tcp_request, udp6}_sock
The argument for these helpers is not a fixed btf_id. For example,
bpf_skc_to_tcp(struct sock_common *), or
bpf_skc_to_tcp(struct sock *), or
bpf_skc_to_tcp(struct inet_sock *), ...
are all valid. At runtime, the helper will check whether pointer cast
is legal or not. Please see Patch #5 for details.
Since btf_id's for both arguments and return value are known at
build time, the btf_id's are pre-computed once vmlinux btf becomes
valid. Jiri's "adding d_path helper" patch set
https://lore.kernel.org/bpf/20200616100512.2168860-1-jolsa@kernel.org/T/
provides a way to pre-compute btf id during vmlinux build time.
This can be applied here as well. A followup patch can convert
to build time btf id computation after Jiri's patch landed.
Changelogs:
v4 -> v5:
- fix bpf_skc_to_udp6_sock helper as besides sk_protocol, sk_family,
sk_type == SOCK_DGRAM is also needed to differentiate from
SOCK_RAW (Eric)
v3 -> v4:
- fix bpf_skc_to_{tcp_timewait, tcp_request}_sock helper implementation
as just checking sk->sk_state is not enough (Martin)
- fix a few kernel test robot reported failures
- move bpf_tracing_net.h from libbpf to selftests (Andrii)
- remove __weak attribute from selftests CONFIG_HZ variables (Andrii)
v2 -> v3:
- change sock_cast*/SOCK_CAST* names to btf_sock* names for generality (Martin)
- change gpl_license to false (Martin)
- fix helper to cast to tcp timewait/request socket. (Martin)
v1 -> v2:
- guard init_sock_cast_types() defination properly with CONFIG_NET (Martin)
- reuse the btf_ids, computed for new helper argument, for return
values (Martin)
- using BTF_TYPE_EMIT to express intent of btf type generation (Andrii)
- abstract out common net macros into bpf_tracing_net.h (Andrii)
Yonghong Song (15):
net: bpf: add bpf_seq_afinfo in tcp_iter_state
net: bpf: implement bpf iterator for tcp
bpf: support 'X' in bpf_seq_printf() helper
bpf: allow tracing programs to use bpf_jiffies64() helper
bpf: add bpf_skc_to_tcp6_sock() helper
bpf: add bpf_skc_to_{tcp,tcp_timewait,tcp_request}_sock() helpers
net: bpf: add bpf_seq_afinfo in udp_iter_state
net: bpf: implement bpf iterator for udp
bpf: add bpf_skc_to_udp6_sock() helper
selftests/bpf: move newer bpf_iter_* type redefining to a new header
file
selftests/bpf: refactor some net macros to bpf_tracing_net.h
selftests/bpf: add more common macros to bpf_tracing_net.h
selftests/bpf: implement sample tcp/tcp6 bpf_iter programs
selftests/bpf: implement sample udp/udp6 bpf_iter programs
selftests/bpf: add tcp/udp iterator programs to selftests
include/linux/bpf.h | 16 ++
include/net/tcp.h | 1 +
include/net/udp.h | 1 +
include/uapi/linux/bpf.h | 37 ++-
kernel/bpf/btf.c | 1 +
kernel/bpf/verifier.c | 43 ++-
kernel/trace/bpf_trace.c | 15 +-
net/core/filter.c | 166 ++++++++++++
net/ipv4/tcp_ipv4.c | 153 ++++++++++-
net/ipv4/udp.c | 144 +++++++++-
scripts/bpf_helpers_doc.py | 10 +
tools/include/uapi/linux/bpf.h | 37 ++-
.../selftests/bpf/prog_tests/bpf_iter.c | 68 +++++
tools/testing/selftests/bpf/progs/bpf_iter.h | 80 ++++++
.../selftests/bpf/progs/bpf_iter_bpf_map.c | 18 +-
.../selftests/bpf/progs/bpf_iter_ipv6_route.c | 25 +-
.../selftests/bpf/progs/bpf_iter_netlink.c | 22 +-
.../selftests/bpf/progs/bpf_iter_task.c | 18 +-
.../selftests/bpf/progs/bpf_iter_task_file.c | 20 +-
.../selftests/bpf/progs/bpf_iter_tcp4.c | 234 ++++++++++++++++
.../selftests/bpf/progs/bpf_iter_tcp6.c | 250 ++++++++++++++++++
.../selftests/bpf/progs/bpf_iter_test_kern3.c | 17 +-
.../selftests/bpf/progs/bpf_iter_test_kern4.c | 17 +-
.../bpf/progs/bpf_iter_test_kern_common.h | 18 +-
.../selftests/bpf/progs/bpf_iter_udp4.c | 71 +++++
.../selftests/bpf/progs/bpf_iter_udp6.c | 79 ++++++
.../selftests/bpf/progs/bpf_tracing_net.h | 51 ++++
27 files changed, 1443 insertions(+), 169 deletions(-)
create mode 100644 tools/testing/selftests/bpf/progs/bpf_iter.h
create mode 100644 tools/testing/selftests/bpf/progs/bpf_iter_tcp4.c
create mode 100644 tools/testing/selftests/bpf/progs/bpf_iter_tcp6.c
create mode 100644 tools/testing/selftests/bpf/progs/bpf_iter_udp4.c
create mode 100644 tools/testing/selftests/bpf/progs/bpf_iter_udp6.c
create mode 100644 tools/testing/selftests/bpf/progs/bpf_tracing_net.h
--
2.24.1
Powered by blists - more mailing lists