[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CAEf4Bzb30P_jcWmjX-oo3VxRmPGyjDfULgQM0xz9JOmdgKkcRw@mail.gmail.com>
Date: Fri, 25 Oct 2019 15:08:16 -0700
From: Andrii Nakryiko <andrii.nakryiko@...il.com>
To: Daniel Borkmann <daniel@...earbox.net>
Cc: Alexei Starovoitov <ast@...nel.org>, bpf <bpf@...r.kernel.org>,
Networking <netdev@...r.kernel.org>
Subject: Re: [PATCH bpf-next 3/5] bpf: Add probe_read_{user,kernel} and
probe_read_str_{user,kernel} helpers
On Fri, Oct 25, 2019 at 1:44 PM Daniel Borkmann <daniel@...earbox.net> wrote:
>
> The current bpf_probe_read() and bpf_probe_read_str() helpers are broken
> in that they assume they can be used for probing memory access for kernel
> space addresses /as well as/ user space addresses.
>
> However, plain use of probe_kernel_read() for both cases will attempt to
> always access kernel space address space given access is performed under
> KERNEL_DS and some archs in-fact have overlapping address spaces where a
> kernel pointer and user pointer would have the /same/ address value and
> therefore accessing application memory via bpf_probe_read{,_str}() would
> read garbage values.
>
> Lets fix BPF side by making use of recently added 3d7081822f7f ("uaccess:
> Add non-pagefault user-space read functions"). Unfortunately, the only way
> to fix this status quo is to add dedicated bpf_probe_read_{user,kernel}()
> and bpf_probe_read_str_{user,kernel}() helpers. The bpf_probe_read{,_str}()
> helpers are aliased to the *_kernel() variants to retain their current
> behavior; for API consistency and ease of use the latter have been added
> so that it is immediately *obvious* which address space the memory is being
> probed on (user,kernel). The two *_user() variants attempt the access under
> USER_DS set.
>
> Fixes: a5e8c07059d0 ("bpf: add bpf_probe_read_str helper")
> Fixes: 2541517c32be ("tracing, perf: Implement BPF programs attached to kprobes")
> Signed-off-by: Daniel Borkmann <daniel@...earbox.net>
> ---
> include/uapi/linux/bpf.h | 119 ++++++++++++++++++++---------
> kernel/trace/bpf_trace.c | 133 ++++++++++++++++++++++-----------
> tools/include/uapi/linux/bpf.h | 119 ++++++++++++++++++++---------
> 3 files changed, 253 insertions(+), 118 deletions(-)
>
> diff --git a/include/uapi/linux/bpf.h b/include/uapi/linux/bpf.h
> index 4af8b0819a32..b8ffb419df51 100644
> --- a/include/uapi/linux/bpf.h
> +++ b/include/uapi/linux/bpf.h
> @@ -564,7 +564,11 @@ union bpf_attr {
> * int bpf_probe_read(void *dst, u32 size, const void *src)
> * Description
> * For tracing programs, safely attempt to read *size* bytes from
> - * address *src* and store the data in *dst*.
> + * kernel space address *src* and store the data in *dst*.
> + *
> + * This helper is an alias to bpf_probe_read_kernel().
> + *
> + * Generally, use bpf_probe_read_user() or bpf_probe_read_kernel() instead.
> * Return
> * 0 on success, or a negative error in case of failure.
> *
> @@ -1428,43 +1432,14 @@ union bpf_attr {
> *
> * int bpf_probe_read_str(void *dst, int size, const void *unsafe_ptr)
seems like an approriate time to standardize terminology. Should it be
unsafe_ptr like here, or src like in bpf_probe_read description?
> * Description
> - * Copy a NUL terminated string from an unsafe address
> - * *unsafe_ptr* to *dst*. The *size* should include the
> - * terminating NUL byte. In case the string length is smaller than
> - * *size*, the target is not padded with further NUL bytes. If the
> - * string length is larger than *size*, just *size*-1 bytes are
> - * copied and the last byte is set to NUL.
> - *
[...]
> #define __BPF_FUNC_MAPPER(FN) \
> FN(unspec), \
> @@ -2888,7 +2929,11 @@ union bpf_attr {
> FN(sk_storage_delete), \
> FN(send_signal), \
> FN(tcp_gen_syncookie), \
> - FN(skb_output),
> + FN(skb_output), \
> + FN(probe_read_user), \
> + FN(probe_read_kernel), \
> + FN(probe_read_str_user), \
> + FN(probe_read_str_kernel),
naming is subjective, but I'd go with probe_{user,kernel}_read[_str]
scheme, but given bpf_probe_write_user and desire to stay consistent,
I'd still stick to slightly different probe_read_{user,kernel}[_str]
scheme.
>
> /* integer value in 'imm' field of BPF_CALL instruction selects which helper
> * function eBPF program intends to call
> diff --git a/kernel/trace/bpf_trace.c b/kernel/trace/bpf_trace.c
> index 79919a26cd59..ff001b766799 100644
> --- a/kernel/trace/bpf_trace.c
> +++ b/kernel/trace/bpf_trace.c
> @@ -138,12 +138,52 @@ static const struct bpf_func_proto bpf_override_return_proto = {
> };
> #endif
>
[...]
Powered by blists - more mailing lists