lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CAP-5=fWZH20L4kv-BwVtGLwR=Em3AOOT+Q4QGivvQuYn5AsPRg@mail.gmail.com>
Date:   Mon, 24 Jan 2022 18:09:50 -0800
From:   Ian Rogers <irogers@...gle.com>
To:     Arnaldo Carvalho de Melo <acme@...nel.org>,
        Riccardo Mancini <rickyman7@...il.com>
Cc:     Jiri Olsa <jolsa@...hat.com>,
        Peter Zijlstra <peterz@...radead.org>,
        Ingo Molnar <mingo@...hat.com>,
        Mark Rutland <mark.rutland@....com>,
        Alexander Shishkin <alexander.shishkin@...ux.intel.com>,
        Namhyung Kim <namhyung@...nel.org>,
        linux-perf-users@...r.kernel.org, linux-kernel@...r.kernel.org,
        Stephane Eranian <eranian@...gle.com>,
        Sohaib Mohamed <sohaib.amhmd@...il.com>,
        Eric Dumazet <edumazet@...gle.com>,
        Dmitry Vyukov <dvyukov@...gle.com>
Subject: Re: [PATCH] perf map: Fix namespace memory leak

Hi,

As with:
https://lore.kernel.org/linux-perf-users/20220122045811.3402706-4-irogers@google.com/
I implemented REFCNT_CHECKING for nsinfo (I'll mail it soon). This
more accurately identifies the problem. In one run I got:
```
=================================================================
==3900722==ERROR: AddressSanitizer: attempting double-free on
0x602000026710 in thread T6:
    #0 0x7f9a1dd6f4d7 in __interceptor_free
../../../../src/libsanitizer/asan/asan_malloc_linux.cpp:127
    #1 0x5650d04cb29c in nsinfo__put util/namespaces.c:259
    #2 0x5650d04aba93 in map__new util/map.c:198
    #3 0x5650d04a11f6 in machine__process_mmap2_event util/machine.c:1798
    #4 0x5650d04a24bf in machine__process_event util/machine.c:2019
    #5 0x5650d03e78b3 in perf_event__process util/event.c:567
    #6 0x5650d056e3bb in perf_tool__process_synth_event
util/synthetic-events.c:65
    #7 0x5650d05711a6 in perf_event__synthesize_mmap_events
util/synthetic-events.c:501
    #8 0x5650d0572fc0 in __event__synthesize_thread util/synthetic-events.c:792
    #9 0x5650d05739fd in __perf_event__synthesize_threads
util/synthetic-events.c:929
    #10 0x5650d0573cbd in synthesize_threads_worker util/synthetic-events.c:961
    #11 0x7f9a1dc48d7f in start_thread nptl/pthread_create.c:481
    #12 0x7f9a1cfdcb6e in clone (/lib/x86_64-linux-gnu/libc.so.6+0xfcb6e)

0x602000026710 is located 0 bytes inside of 8-byte region
[0x602000026710,0x602000026718)
freed by thread T13 here:
    #0 0x7f9a1dd6f4d7 in __interceptor_free
../../../../src/libsanitizer/asan/asan_malloc_linux.cpp:127
    #1 0x5650d04cb29c in nsinfo__put util/namespaces.c:259
    #2 0x5650d04aba93 in map__new util/map.c:198
    #3 0x5650d04a11f6 in machine__process_mmap2_event util/machine.c:1798
    #4 0x5650d04a24bf in machine__process_event util/machine.c:2019
    #5 0x5650d03e78b3 in perf_event__process util/event.c:567
    #6 0x5650d056e3bb in perf_tool__process_synth_event
util/synthetic-events.c:65
    #7 0x5650d05711a6 in perf_event__synthesize_mmap_events
util/synthetic-events.c:501
    #8 0x5650d0572fc0 in __event__synthesize_thread util/synthetic-events.c:792
    #9 0x5650d05739fd in __perf_event__synthesize_threads
util/synthetic-events.c:929
    #10 0x5650d0573cbd in synthesize_threads_worker util/synthetic-events.c:961
    #11 0x7f9a1dc48d7f in start_thread nptl/pthread_create.c:481

previously allocated by thread T3 here:
    #0 0x7f9a1dd6f7cf in __interceptor_malloc
../../../../src/libsanitizer/asan/asan_malloc_linux.cpp:145
    #1 0x5650d04cb17f in nsinfo__get util/namespaces.c:244
    #2 0x5650d04ab7df in map__new util/map.c:150
    #3 0x5650d04a11f6 in machine__process_mmap2_event util/machine.c:1798
    #4 0x5650d04a24bf in machine__process_event util/machine.c:2019
    #5 0x5650d03e78b3 in perf_event__process util/event.c:567
    #6 0x5650d056e3bb in perf_tool__process_synth_event
util/synthetic-events.c:65
    #7 0x5650d05711a6 in perf_event__synthesize_mmap_events
util/synthetic-events.c:501
    #8 0x5650d0572fc0 in __event__synthesize_thread util/synthetic-events.c:792
    #9 0x5650d05739fd in __perf_event__synthesize_threads
util/synthetic-events.c:929
    #10 0x5650d0573cbd in synthesize_threads_worker util/synthetic-events.c:961
    #11 0x7f9a1dc48d7f in start_thread nptl/pthread_create.c:481

Thread T6 created by T0 here:
    #0 0x7f9a1dd17716 in __interceptor_pthread_create
../../../../src/libsanitizer/asan/asan_interceptors.cpp:216
    #1 0x5650d05745eb in perf_event__synthesize_threads
util/synthetic-events.c:1039
    #2 0x5650d05789fc in __machine__synthesize_threads
util/synthetic-events.c:1791
    #3 0x5650d0578a81 in machine__synthesize_threads
util/synthetic-events.c:1802
    #4 0x5650d024edd8 in __cmd_top
/usr/local/google/home/irogers/kernel-trees/kernel.org/tip/tools/perf/builtin-top.c:1273
    #5 0x5650d025629b in cmd_top
/usr/local/google/home/irogers/kernel-trees/kernel.org/tip/tools/perf/builtin-top.c:1774
    #6 0x5650d03b132b in run_builtin
/usr/local/google/home/irogers/kernel-trees/kernel.org/tip/tools/perf/perf.c:313
    #7 0x5650d03b188d in handle_internal_command
/usr/local/google/home/irogers/kernel-trees/kernel.org/tip/tools/perf/perf.c:365
    #8 0x5650d03b1c52 in run_argv
/usr/local/google/home/irogers/kernel-trees/kernel.org/tip/tools/perf/perf.c:409
    #9 0x5650d03b246b in main
/usr/local/google/home/irogers/kernel-trees/kernel.org/tip/tools/perf/perf.c:539
    #10 0x7f9a1cf077ec in __libc_start_main ../csu/libc-start.c:332

Thread T13 created by T0 here:
    #0 0x7f9a1dd17716 in __interceptor_pthread_create
../../../../src/libsanitizer/asan/asan_interceptors.cpp:216
    #1 0x5650d05745eb in perf_event__synthesize_threads
util/synthetic-events.c:1039
    #2 0x5650d05789fc in __machine__synthesize_threads
util/synthetic-events.c:1791
```
That is thread 3 had created the wrapper object around the nsinfo,
thread 13 had put and freed it which cause a double free when thread 6
tried to do the same thing. How did thread 6 and 13 have the same
value for dso->nsinfo? In this patch we read then put the value, as
the put is slow another thread reads and puts the value too. We need
to atomically swap the value of dso->nsinfo with nsi and then do a put
on nsi which holds the old value of dso->nsinfo. The question is how
to do the swap?

C11 has atomic_exchange, but Makefile.config sets the std as gnu99.
C11 is used in some selftests. atomic_t and atomic_long_t aren't
guaranteed to be 64-bit, a long can be 32-bit with 64-bit pointers.
Using a non-atomic swap fixes the issue, memory leak, etc. but clearly
this needs to be atomic. My preference would be to bump to C11, but I
can do a linux atomic or poor man's swap if others prefer. Wdyt?

Thanks,
Ian

On Tue, Nov 30, 2021 at 9:20 AM Ian Rogers <irogers@...gle.com> wrote:
>
> On Tue, Nov 30, 2021 at 7:00 AM Arnaldo Carvalho de Melo
> <acme@...nel.org> wrote:
> >
> > Em Sun, Nov 28, 2021 at 04:40:26PM +0100, Jiri Olsa escreveu:
> > > On Thu, Nov 18, 2021 at 11:37:14AM -0800, Ian Rogers wrote:
> > > > This leak was happening reliably with test "Lookup mmap thread" with
> > > > stack traces like:
> > > >
> > > > Direct leak of 5504 byte(s) in 172 object(s) allocated from:
> > > >     #0 0x7f4685e47987 in __interceptor_calloc
> > > >     #1 0x56063b974c2a in nsinfo__new util/namespaces.c:142
> > > >     #2 0x56063b9781ff in thread__new util/thread.c:70
> > > >     #3 0x56063b944953 in ____machine__findnew_thread util/machine.c:543
> > > >     #4 0x56063b944ac6 in __machine__findnew_thread util/machine.c:574
> > > >     #5 0x56063b944b36 in machine__findnew_thread util/machine.c:584
> > > >     #6 0x56063b94c892 in machine__process_fork_event util/machine.c:1954
> > > >     #7 0x56063b94cc1f in machine__process_event util/machine.c:2019
> > > >     #8 0x56063b894f18 in perf_event__process util/event.c:567
> > > >     #9 0x56063ba17951 in perf_tool__process_synth_event util/synthetic-events.c:65
> > > >     #10 0x56063ba19086 in perf_event__synthesize_fork util/synthetic-events.c:287
> > > >     #11 0x56063ba1c39d in __event__synthesize_thread util/synthetic-events.c:775
> > > >     #12 0x56063ba1cf6f in __perf_event__synthesize_threads util/synthetic-events.c:929
> > > >     #13 0x56063ba1d4ab in perf_event__synthesize_threads util/synthetic-events.c:1000
> > > >     #14 0x56063b821a3d in synth_all tests/mmap-thread-lookup.c:136
> > > >     #15 0x56063b821c86 in mmap_events tests/mmap-thread-lookup.c:174
> > > >     #16 0x56063b8221b7 in test__mmap_thread_lookup tests/mmap-thread-lookup.c:230
> > > >
> > > > The dso->nsinfo is overwritten, but without a nsinfo__put this can leak
> > > > the overwritten nsinfo.
> > > >
> > > > Signed-off-by: Ian Rogers <irogers@...gle.com>
> > >
> > > nice catch!
> > >
> > > Acked-by: Jiri Olsa <jolsa@...hat.com>
> >
> > This one is tricky, I have to retrieve the details from last time this
> > surfaced, but try:
> >
> > # perf top -F 10000
> >
> > leave it for a while and press 'q':
> >
> > perf: /home/acme/git/perf/tools/include/linux/refcount.h:131: refcount_sub_and_test: Assertion `!(new > val)' failed.
> >                                                                                                                      Aborted (core dumped)
> >
> > I reproduced the above assertion a few times, now I'm not being able,
> > probably some namespace related activity took place when it hit.
>
> I was able to reproduce in about 1 run in 3. With address sanitizer
> the core dump turns into a heap-use-after-free with a report of:
>
> ==235440==ERROR: AddressSanitizer: heap-use-after-free on address
> 0x6030000485c8 at pc 0x55d8b882bfb7 bp 0x7fffd3426e90 sp
> 0x7fffd3426e88
> READ of size 4 at 0x6030000485c8 thread T0
>     #0 0x55d8b882bfb6 in __read_once_size ./tools/include/linux/compiler.h:132
>     #1 0x55d8b882bfb6 in atomic_read
> ./tools/include/asm/../../arch/x86/include/asm/atomic.h:28
>     #2 0x55d8b882c35c in refcount_sub_and_test
> ./tools/include/linux/refcount.h:123
>     #3 0x55d8b882c45e in refcount_dec_and_test
> ./tools/include/linux/refcount.h:148
>     #4 0x55d8b882d4af in nsinfo__put util/namespaces.c:204
>     #5 0x55d8b87adecd in __nsinfo__zput util/namespaces.h:65
>     #6 0x55d8b87b4bcc in dso__delete util/dso.c:1327
>     #7 0x55d8b87b4c8d in dso__put util/dso.c:1342
>     #8 0x55d8b87fb23b in dsos__purge util/machine.c:182
>     #9 0x55d8b87fb2e3 in dsos__exit util/machine.c:190
>     #10 0x55d8b87fb46e in machine__exit util/machine.c:222
>     #11 0x55d8b87fb715 in machines__exit util/machine.c:262
>     #12 0x55d8b88149d2 in perf_session__delete util/session.c:305
>     #13 0x55d8b85c0924 in cmd_top ./tools/perf/builtin-top.c:1781
>     #14 0x55d8b87176b7 in run_builtin ./tools/perf/perf.c:313
>     #15 0x55d8b8717c10 in handle_internal_command ./tools/perf/perf.c:365
>     #16 0x55d8b8717fcd in run_argv ./tools/perf/perf.c:409
>     #17 0x55d8b87187bc in main ./tools/perf/perf.c:539
>
> 0x6030000485c8 is located 24 bytes inside of 32-byte region
> [0x6030000485b0,0x6030000485d0)
> freed by thread T0 here:
>     #0 0x7fb6e7c454d7 in __interceptor_free
> ../../../../src/libsanitizer/asan/asan_malloc_linux.cpp:127
>     #1 0x55d8b882d41a in nsinfo__delete util/namespaces.c:192
>     #2 0x55d8b882d4bf in nsinfo__put util/namespaces.c:205
>     #3 0x55d8b87adecd in __nsinfo__zput util/namespaces.h:65
>     #4 0x55d8b87b4bcc in dso__delete util/dso.c:1327
>     #5 0x55d8b87b4c8d in dso__put util/dso.c:1342
>     #6 0x55d8b87fb23b in dsos__purge util/machine.c:182
>     #7 0x55d8b87fb2e3 in dsos__exit util/machine.c:190
>     #8 0x55d8b87fb46e in machine__exit util/machine.c:222
>     #9 0x55d8b87fb715 in machines__exit util/machine.c:262
>     #10 0x55d8b88149d2 in perf_session__delete util/session.c:305
>     #11 0x55d8b85c0924 in cmd_top ./tools/perf/builtin-top.c:1781
>     #12 0x55d8b87176b7 in run_builtin ./tools/perf/perf.c:313
>     #13 0x55d8b8717c10 in handle_internal_command ./tools/perf/perf.c:365
>     #14 0x55d8b8717fcd in run_argv ./tools/perf/perf.c:409
>     #15 0x55d8b87187bc in main ./tools/perf/perf.c:539
>
> previously allocated by thread T11 here:
>     #0 0x7fb6e7c45987 in __interceptor_calloc
> ../../../../src/libsanitizer/asan/asan_malloc_linux.cpp:154
>     #1 0x55d8b882ceaa in nsinfo__new util/namespaces.c:142
>     #2 0x55d8b883047f in thread__new util/thread.c:70
>     #3 0x55d8b87fcb93 in ____machine__findnew_thread util/machine.c:543
>     #4 0x55d8b87fcd06 in __machine__findnew_thread util/machine.c:574
>     #5 0x55d8b87fcd76 in machine__findnew_thread util/machine.c:584
>     #6 0x55d8b8804ad2 in machine__process_fork_event util/machine.c:1954
>     #7 0x55d8b8804e5f in machine__process_event util/machine.c:2019
>     #8 0x55d8b874cf18 in perf_event__process util/event.c:567
>     #9 0x55d8b88cfbd1 in perf_tool__process_synth_event
> util/synthetic-events.c:65
>     #10 0x55d8b88d1306 in perf_event__synthesize_fork
> util/synthetic-events.c:287
>     #11 0x55d8b88d461d in __event__synthesize_thread util/synthetic-events.c:775
>     #12 0x55d8b88d51ef in __perf_event__synthesize_threads
> util/synthetic-events.c:929
>     #13 0x55d8b88d54af in synthesize_threads_worker util/synthetic-events.c:961
>     #14 0x7fb6e7b1eead in start_thread nptl/pthread_create.c:463
>
> Thread T11 created by T0 here:
>     #0 0x7fb6e7bed716 in __interceptor_pthread_create
> ../../../../src/libsanitizer/asan/asan_interceptors.cpp:216
>     #1 0x55d8b88d5ddd in perf_event__synthesize_threads
> util/synthetic-events.c:1039
>     #2 0x55d8b88da3b8 in __machine__synthesize_threads
> util/synthetic-events.c:1791
>     #3 0x55d8b88da43d in machine__synthesize_threads
> util/synthetic-events.c:1802
>     #4 0x55d8b85b9378 in __cmd_top ./tools/perf/builtin-top.c:1273
>     #5 0x55d8b85c0811 in cmd_top ./tools/perf/builtin-top.c:1774
>     #6 0x55d8b87176b7 in run_builtin ./tools/perf/perf.c:313
>     #7 0x55d8b8717c10 in handle_internal_command ./tools/perf/perf.c:365
>     #8 0x55d8b8717fcd in run_argv ./tools/perf/perf.c:409
>     #9 0x55d8b87187bc in main ./tools/perf/perf.c:539
>
> SUMMARY: AddressSanitizer: heap-use-after-free
> ./tools/include/linux/compiler.h:132 in __read_once_size
> Shadow bytes around the buggy address:
>   0x0c0680001060: fa fa fa fa fa fa fa fa fd fd fd fd fa fa fa fa
>   0x0c0680001070: fa fa fa fa fa fa fa fa fa fa fd fd fd fd fa fa
>   0x0c0680001080: fa fa fa fa fa fa fa fa fa fa fa fa fd fd fd fd
>   0x0c0680001090: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
>   0x0c06800010a0: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
> =>0x0c06800010b0: fa fa fa fa fa fa fd fd fd[fd]fa fa fa fa fa fa
>   0x0c06800010c0: fa fa fa fa fa fa fa fa fd fd fd fa fa fa fd fd
>   0x0c06800010d0: fd fa fa fa fd fd fd fd fa fa fd fd fd fa fa fa
>   0x0c06800010e0: fd fd fd fd fa fa fd fd fd fd fa fa fd fd fd fa
>   0x0c06800010f0: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fd fd
>   0x0c0680001100: fd fd fa fa fa fa fa fa fa fa fa fa fa fa fa fa
> Shadow byte legend (one shadow byte represents 8 application bytes):
>   Addressable:           00
>   Partially addressable: 01 02 03 04 05 06 07
>   Heap left redzone:       fa
>   Freed heap region:       fd
>   Stack left redzone:      f1
>   Stack mid redzone:       f2
>   Stack right redzone:     f3
>   Stack after return:      f5
>   Stack use after scope:   f8
>   Global redzone:          f9
>   Global init order:       f6
>   Poisoned by user:        f7
>   Container overflow:      fc
>   Array cookie:            ac
>   Intra object redzone:    bb
>   ASan internal:           fe
>   Left alloca redzone:     ca
>   Right alloca redzone:    cb
>   Shadow gap:              cc
> ==235440==ABORTING
>
> My hope was to land this so that 'perf test' wouldn't have new
> failures caused by leak sanitizer. So one option could be to guard the
> change with an address sanitizer #ifdef. The larger fix is to rework
> the reference counting logic, but I'd really hope we could switch that
> work to being in a language like Rust to get us out of the current
> memory leak vs crash wac-a-mole.
>
> +Riccardo as he did some great and related fixes over the summer.
>
> Thanks,
> Ian
>
>
>
> > - Arnaldo
> >
> > > thanks,
> > > jirka
> > >
> > > > ---
> > > >  tools/perf/util/map.c | 1 +
> > > >  1 file changed, 1 insertion(+)
> > > >
> > > > diff --git a/tools/perf/util/map.c b/tools/perf/util/map.c
> > > > index 8af693d9678c..ceed8f407bc0 100644
> > > > --- a/tools/perf/util/map.c
> > > > +++ b/tools/perf/util/map.c
> > > > @@ -192,6 +192,7 @@ struct map *map__new(struct machine *machine, u64 start, u64 len,
> > > >                     if (!(prot & PROT_EXEC))
> > > >                             dso__set_loaded(dso);
> > > >             }
> > > > +           nsinfo__put(dso->nsinfo);
> > > >             dso->nsinfo = nsi;
> > > >
> > > >             if (build_id__is_defined(bid))
> > > > --
> > > > 2.34.0.rc2.393.gf8c9666880-goog
> > > >
> >
> > --
> >
> > - Arnaldo

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ