lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Date:   Mon, 21 Dec 2020 16:12:43 +0800
From:   Tiezhu Yang <yangtiezhu@...ngson.cn>
To:     Arnaldo Carvalho de Melo <acme@...nel.org>,
        Jiaxun Yang <jiaxun.yang@...goat.com>,
        Ralf Baechle <ralf@...ux-mips.org>
Cc:     linux-mips@...r.kernel.org, linux-kernel@...r.kernel.org,
        Xuefeng Li <lixuefeng@...ngson.cn>,
        Juxin Gao <gaojuxin@...ngson.cn>,
        Archer Yan <ayan@...ecomp.com>,
        David Daney <david.daney@...ium.com>
Subject: Re: [QUESTION] support perf record --call-graph dwarf for mips

On 12/17/2020 08:48 PM, Tiezhu Yang wrote:
> On 12/16/2020 11:16 PM, Arnaldo Carvalho de Melo wrote:
>> Em Wed, Dec 16, 2020 at 11:30:47AM -0300, Arnaldo Carvalho de Melo 
>> escreveu:
>>> Em Wed, Dec 16, 2020 at 07:14:02PM +0800, Jiaxun Yang escreveu:
>>>>
>>>> 在 2020/12/16 下午6:05, Tiezhu Yang 写道:
>>>>> Hi,
>>>>>
>>>>> In the current upstream mainline kernel, perf record --call-graph 
>>>>> dwarf
>>>>> is not supported for architecture mips64. I find the following 
>>>>> related
>>>>> patches about this feature by David Daney <david.daney@...ium.com> 
>>>>> and
>>>>> Archer Yan <ayan@...ecomp.com> in Sep 2019.
>>>
...
> (3)[loongson@...ux perf]$ ./perf record --call-graph dwarf cd
> Error:
> The sys_perf_event_open() syscall returned with 89 (Function not 
> implemented) for event (cycles:u).
> /bin/dmesg | grep -i perf may provide additional information.
>
> Call Trace:
> record__open
>   evsel__open()
>     evsel__open_cpu()
>       perf_event_open()
>   evsel__open_strerror
>
> Maybe we need tools/perf/arch/mips/entry/syscalls/syscall.tbl?

The code about mips kernel and perf tool are debugged successfully
on the Loongson 3A4000 CPU platform, we can see the following result,
I will make and submit some patches based on 5.11-rc1 in the next week.

[root@...ux perf]# uname -r
5.10.0-rc7
[root@...ux perf]# ./perf record --call-graph dwarf -F 1000 lscpu
Architecture:          mips64
Byte Order:            Little Endian
CPU(s):                4
On-line CPU(s) list:   0-3
Thread(s) per core:    1
Core(s) per socket:    4
Socket(s):             1
NUMA node(s):          1
L1d cache:             64K
L1i cache:             64K
L2 cache:              2048K
NUMA node0 CPU(s):     0-3
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.078 MB perf.data (8 samples) ]
[root@...ux perf]# ./perf report
# To display the perf.data header info, please use 
--header/--header-only options.
#
#
# Total Lost Samples: 0
#
# Samples: 8  of event 'cycles'
# Event count (approx.): 5682386
#
# Children      Self  Command  Shared Object Symbol
# ........  ........  .......  ................ ...........................
#
     94.86%    94.86%  lscpu    [kernel.vmlinux]  [k] get_page_from_freelist
             |
             ---__GI_access (inlined)
                syscall_common
                do_faccessat
                filename_lookup
                path_lookupat
                walk_component
                __lookup_slow
                d_alloc_parallel
                d_alloc
                __d_alloc
                kmem_cache_alloc
                __slab_alloc.isra.96
                ___slab_alloc
                allocate_slab
                __alloc_pages_nodemask
                get_page_from_freelist

     94.86%     0.00%  lscpu    libc-2.20.so      [.] __GI_access (inlined)
             |
             ---__GI_access (inlined)
                syscall_common
                do_faccessat
                filename_lookup
                path_lookupat
                walk_component
                __lookup_slow
                d_alloc_parallel
                d_alloc
                __d_alloc
                kmem_cache_alloc
                __slab_alloc.isra.96
                ___slab_alloc
                allocate_slab
                __alloc_pages_nodemask
                get_page_from_freelist

     94.86%     0.00%  lscpu    [kernel.vmlinux]  [k] syscall_common
             |
             ---syscall_common
                do_faccessat
                filename_lookup
                path_lookupat
                walk_component
                __lookup_slow
                d_alloc_parallel
                d_alloc
                __d_alloc
                kmem_cache_alloc
                __slab_alloc.isra.96
                ___slab_alloc
                allocate_slab
                __alloc_pages_nodemask
                get_page_from_freelist

     94.86%     0.00%  lscpu    [kernel.vmlinux]  [k] do_faccessat
             |
             ---do_faccessat
                filename_lookup
                path_lookupat
                walk_component
                __lookup_slow
                d_alloc_parallel
                d_alloc
                __d_alloc
                kmem_cache_alloc
                __slab_alloc.isra.96
                ___slab_alloc
                allocate_slab
                __alloc_pages_nodemask
                get_page_from_freelist

     94.86%     0.00%  lscpu    [kernel.vmlinux]  [k] filename_lookup
             |
             ---filename_lookup
                path_lookupat
                walk_component
                __lookup_slow
                d_alloc_parallel
                d_alloc
                __d_alloc
                kmem_cache_alloc
                __slab_alloc.isra.96
                ___slab_alloc
                allocate_slab
                __alloc_pages_nodemask
                get_page_from_freelist

     94.86%     0.00%  lscpu    [kernel.vmlinux]  [k] path_lookupat
             |
             ---path_lookupat
                walk_component
                __lookup_slow
                d_alloc_parallel
                d_alloc
                __d_alloc
                kmem_cache_alloc
                __slab_alloc.isra.96
                ___slab_alloc
                allocate_slab
                __alloc_pages_nodemask
                get_page_from_freelist

     94.86%     0.00%  lscpu    [kernel.vmlinux]  [k] walk_component
             |
             ---walk_component
                __lookup_slow
                d_alloc_parallel
                d_alloc
                __d_alloc
                kmem_cache_alloc
                __slab_alloc.isra.96
                ___slab_alloc
                allocate_slab
                __alloc_pages_nodemask
                get_page_from_freelist

     94.86%     0.00%  lscpu    [kernel.vmlinux]  [k] __lookup_slow
             |
             ---__lookup_slow
                d_alloc_parallel
                d_alloc
                __d_alloc
                kmem_cache_alloc
                __slab_alloc.isra.96
                ___slab_alloc
                allocate_slab
                __alloc_pages_nodemask
                get_page_from_freelist

     94.86%     0.00%  lscpu    [kernel.vmlinux]  [k] d_alloc_parallel
             |
             ---d_alloc_parallel
                d_alloc
                __d_alloc
                kmem_cache_alloc
                __slab_alloc.isra.96
                ___slab_alloc
                allocate_slab
                __alloc_pages_nodemask
                get_page_from_freelist

     94.86%     0.00%  lscpu    [kernel.vmlinux]  [k] d_alloc
             |
             ---d_alloc
                __d_alloc
                kmem_cache_alloc
                __slab_alloc.isra.96
                ___slab_alloc
                allocate_slab
                __alloc_pages_nodemask
                get_page_from_freelist

     94.86%     0.00%  lscpu    [kernel.vmlinux]  [k] __d_alloc
             |
             ---__d_alloc
                kmem_cache_alloc
                __slab_alloc.isra.96
                ___slab_alloc
                allocate_slab
                __alloc_pages_nodemask
                get_page_from_freelist

     94.86%     0.00%  lscpu    [kernel.vmlinux]  [k] kmem_cache_alloc
             |
             ---kmem_cache_alloc
                __slab_alloc.isra.96
                ___slab_alloc
                allocate_slab
                __alloc_pages_nodemask
                get_page_from_freelist

     94.86%     0.00%  lscpu    [kernel.vmlinux]  [k] __slab_alloc.isra.96
             |
             ---__slab_alloc.isra.96
                ___slab_alloc
                allocate_slab
                __alloc_pages_nodemask
                get_page_from_freelist

     94.86%     0.00%  lscpu    [kernel.vmlinux]  [k] ___slab_alloc
             |
             ---___slab_alloc
                allocate_slab
                __alloc_pages_nodemask
                get_page_from_freelist

     94.86%     0.00%  lscpu    [kernel.vmlinux]  [k] allocate_slab
             |
             ---allocate_slab
                __alloc_pages_nodemask
                get_page_from_freelist

     94.86%     0.00%  lscpu    [kernel.vmlinux]  [k] __alloc_pages_nodemask
             |
             ---__alloc_pages_nodemask
                get_page_from_freelist

      5.00%     5.00%  lscpu    ld-2.20.so        [.] dl_main
             |
             ---dl_main

      0.13%     0.13%  lscpu    [kernel.vmlinux]  [k] perf_event_comm_output
      0.13%     0.00%  lscpu    [kernel.vmlinux]  [k] merge_sched_in
      0.13%     0.00%  lscpu    [kernel.vmlinux]  [k] 
event_sched_in.isra.132
      0.00%     0.00%  perf     [kernel.vmlinux]  [k] arch_local_irq_restore

#
# (Tip: List events using substring match: perf list <keyword>)
#

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ