Date:   Mon, 28 Nov 2016 18:31:11 +0800
From:   "Wangnan (F)" <wangnan0@...wei.com>
To:     Alexei Starovoitov <alexei.starovoitov@...il.com>
CC:     <acme@...hat.com>, <ast@...com>, <lizefan@...wei.com>,
        <hekuang@...wei.com>, <linux-kernel@...r.kernel.org>,
        <pi3orama@....com>, <joe@....org>, Jiri Olsa <jolsa@...nel.org>
Subject: Re: [PATCH v3 14/30] perf clang: Support compile IR to BPF object and
 add testcase



On 2016/11/28 14:32, Wangnan (F) wrote:
>
>
> On 2016/11/27 1:25, Alexei Starovoitov wrote:
>> On Sat, Nov 26, 2016 at 07:03:38AM +0000, Wang Nan wrote:
>>> getBPFObjectFromModule() is introduced to compile LLVM IR(Module)
>>> to BPF object. Add new testcase for it.
>>>
>>> Test result:
>>>    $ ./buildperf/perf test -v clang
>>>    51: Test builtin clang support                               :
>>>    51.1: Test builtin clang compile C source to IR              :
>>>    --- start ---
>>>    test child forked, pid 21822
>>>    test child finished with 0
>>>    ---- end ----
>>>    Test builtin clang support subtest 0: Ok
>>>    51.2: Test builtin clang compile C source to ELF object      :
>>>    --- start ---
>>>    test child forked, pid 21823
>>>    test child finished with 0
>>>    ---- end ----
>>>    Test builtin clang support subtest 1: Ok
>>>
>>> Signed-off-by: Wang Nan <wangnan0@...wei.com>
>> ...
>>> +    legacy::PassManager PM;
>>> +    if (TargetMachine->addPassesToEmitFile(PM, ostream,
>>> +                           TargetMachine::CGFT_ObjectFile)) {
>>> +        llvm::errs() << "TargetMachine can't emit a file of this type\n";
>>> +        return std::unique_ptr<llvm::SmallVectorImpl<char>>(nullptr);;
>>> +    }
>>> +    PM.run(*Module);
>> I'm pretty sure you want to add FunctionInlingPass as well otherwise
>> I think llvm won't be doing much inlining and only very very simple
>> programs will compile fine. See what we did on bcc side.
>
> Thank you for the information. I thought inlining was done during the
> C-to-IR phase, and we already use -O2 for it. Let me check.
>

I did a simple test. It seems that even without FunctionInliningPass, clang/llvm
can inline a static function with no problem. For example, in the sample code
from the cover letter, I extracted a static function like this:

   static void inc_counter(u64 id)
   {
       u64 *counter;

       counter = bpf_map_lookup_elem(&syscall_counter, &id);
       if (!counter) {
           u64 value = 1;
           bpf_map_update_elem(&syscall_counter, &id, &value, 0);
           return;
       }
       __sync_fetch_and_add(counter, 1);
       return;
   }

Then I enabled llvm.dump-obj = true in ~/.perfconfig so we can see the
resulting ELF object.
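
For reference, a sketch of how that setting would look in ~/.perfconfig,
assuming the usual section/key layout of perf config files (section "llvm",
key "dump-obj"):

   [llvm]
       dump-obj = true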

The script worked correctly. readelf finds no inc_counter symbol:

  $ readelf -a ./count_syscalls.o | grep inc_counter
  $

I also inserted output commands into PerfModule::prepareBPF and
PerfModule::prepareJIT to print the names of functions; inc_counter does not
appear there either.

Then I removed -O2 from the cflags in createCompilerInvocation. Result:

# ./perf record -e ./count_syscalls.c -a sleep 1
LLVM ERROR: Cannot select: t38: ch,glue = BPFISD::CALL t37, t31, Register:i64 %R1, Register:i64 %R2, t37:1
   t31: i64,ch = load<LD8[@bpf_map_lookup_elem]> t51, t58, undef:i64
     t58: i64 = BPFISD::Wrapper TargetGlobalAddress:i64<i8* (i8*, i8*)** @bpf_map_lookup_elem> 0
       t57: i64 = TargetGlobalAddress<i8* (i8*, i8*)** @bpf_map_lookup_elem> 0
     t5: i64 = undef
   t34: i64 = Register %R1
   t36: i64 = Register %R2
   t37: ch,glue = CopyToReg t35, Register:i64 %R2, FrameIndex:i64<5>, t35:1
     t36: i64 = Register %R2
     t8: i64 = FrameIndex<5>
     t35: ch,glue = CopyToReg t33, Register:i64 %R1, t56
       t34: i64 = Register %R1
       t56: i64 = BPFISD::Wrapper TargetGlobalAddress:i64<%struct.bpf_map_def* @GVALS> 0
         t55: i64 = TargetGlobalAddress<%struct.bpf_map_def* @GVALS> 0
In function: func

I don't know whether -O2 implies inlining.

In bcc, you not only use FunctionInlining, but also add AlwaysInlinerPass
and use populateModulePassManager to append other optimizations. I tried to
mimic your code, but it seems the perfhook functions are optimized out
by some optimization added by populateModulePassManager.
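
For context, the always-inliner acts on functions that carry the
always_inline attribute. Below is a minimal sketch, in plain C++ that runs in
user space, of the inc_counter helper marked that way. The std::map is only a
hypothetical stand-in for the BPF map, and count_twice is a made-up caller;
neither is part of the patch or of the cover letter's sample.

```cpp
#include <cstdint>
#include <map>

// Hypothetical user-space stand-in for the syscall_counter BPF map.
static std::map<uint64_t, uint64_t> syscall_counter;

// always_inline requests inlining regardless of the optimization level;
// this is the attribute that an always-inliner pass looks for.
static inline __attribute__((always_inline)) void inc_counter(uint64_t id)
{
    auto it = syscall_counter.find(id);

    if (it == syscall_counter.end()) {
        // First event for this id: create the entry with value 1.
        syscall_counter[id] = 1;
        return;
    }
    // Entry already exists: bump it.
    it->second += 1;
}

// Made-up caller: after inlining, no separate inc_counter symbol is needed.
uint64_t count_twice(uint64_t id)
{
    inc_counter(id);
    inc_counter(id);
    return syscall_counter[id];
}
```

Whether marking helpers like this is enough for the perf use case is an
assumption; it only illustrates what the always-inliner operates on.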

Although this is not quite clear to me, I'll make the following change.
Please help me check it.

Thank you.

diff --git a/tools/perf/util/c++/clang.cpp b/tools/perf/util/c++/clang.cpp
index d05ab6f..d6d1959 100644
--- a/tools/perf/util/c++/clang.cpp
+++ b/tools/perf/util/c++/clang.cpp
@@ -22,6 +22,8 @@
  #include "llvm/Support/TargetSelect.h"
  #include "llvm/Target/TargetMachine.h"
  #include "llvm/Target/TargetOptions.h"
+#include "llvm-c/Transforms/IPO.h"
+#include "llvm/Transforms/IPO.h"
  #include <memory>

  #include "clang.h"
@@ -133,6 +135,13 @@ getBPFObjectFromModule(llvm::Module *Module)
         raw_svector_ostream ostream(*Buffer);

         legacy::PassManager PM;
+
+    PM.add(createFunctionInliningPass());
+    /*
+     * LLVM is changing its interface. Use a stable workaround.
+     */
+    LLVMAddAlwaysInlinerPass(reinterpret_cast<LLVMPassManagerRef>(&PM));
+
      if (TargetMachine->addPassesToEmitFile(PM, ostream,
                             TargetMachine::CGFT_ObjectFile)) {
          llvm::errs() << "TargetMachine can't emit a file of this type\n";

