lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [day] [month] [year] [list]
Message-ID: <20210912132923.GC25450@xsang-OptiPlex-9020>
Date:   Sun, 12 Sep 2021 21:29:23 +0800
From:   kernel test robot <oliver.sang@...el.com>
To:     Ville Syrjälä <ville.syrjala@...ux.intel.com>
Cc:     Chris Wilson <chris@...is-wilson.co.uk>,
        LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
        lkp@...el.com, ying.huang@...el.com, feng.tang@...el.com,
        zhengjun.xing@...ux.intel.com, guobing.chen@...el.com,
        ming.a.chen@...el.com, frank.du@...el.com, Shuhua.Fan@...el.com,
        wangyang.guo@...el.com, Wenhuan.Huang@...el.com,
        jessica.ji@...el.com, shan.kang@...el.com, guangli.li@...el.com,
        tiejun.li@...el.com, yu.ma@...el.com, dapeng1.mi@...el.com,
        jiebin.sun@...el.com, gengxin.xie@...el.com, fan.zhao@...el.com
Subject: [drm/i915]  36b6b68169:
 phoronix-test-suite.supertuxkart.1024x768.Windowed.Basic.1.ZenGarden.frames_per_second
 14.4% improvement



Greeting,

FYI, we noticed a 14.4% improvement of phoronix-test-suite.supertuxkart.1024x768.Windowed.Basic.1.ZenGarden.frames_per_second due to commit:


commit: 36b6b6816989cf6f468eea82694e83211a066fa4 ("drm/i915: Fix MOCS PTE setting for gen9+")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master


in testcase: phoronix-test-suite
on test machine: 12 threads 1 sockets Intel(R) Core(TM) i7-8700 CPU @ 3.20GHz with 16G memory
with following parameters:

	need_x: true
	test: supertuxkart-1.5.2
	option_a: 1024 x 768
	option_b: Windowed
	option_c: Basic
	option_d: 1
	option_e: Zen Garden [Low poly]
	cpufreq_governor: performance
	ucode: 0xde

test-description: The Phoronix Test Suite is the most comprehensive testing and benchmarking platform available that provides an extensible framework for which new tests can be easily added.
test-url: http://www.phoronix-test-suite.com/





Details are as below:
-------------------------------------------------------------------------------------------------->


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        bin/lkp install                job.yaml  # job file is attached in this email
        bin/lkp split-job --compatible job.yaml  # generate the yaml file for lkp run
        bin/lkp run                    generated-yaml-file

=========================================================================================
compiler/cpufreq_governor/kconfig/need_x/option_a/option_b/option_c/option_d/option_e/rootfs/tbox_group/test/testcase/ucode:
  gcc-9/performance/x86_64-rhel-8.3/true/1024 x 768/Windowed/Basic/1/Zen Garden [Low poly]/debian-x86_64-phoronix/lkp-cfl-d1/supertuxkart-1.5.2/phoronix-test-suite/0xde

commit: 
  d46b60a2e8 ("drm/i915: Mark ininitial fb obj as WT on eLLC machines to avoid rcu lockup during fbdev init")
  36b6b68169 ("drm/i915: Fix MOCS PTE setting for gen9+")

d46b60a2e8d246f1 36b6b6816989cf6f468eea82694 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
    638.12 ±  2%     +14.4%     730.32 ±  2%  phoronix-test-suite.supertuxkart.1024x768.Windowed.Basic.1.ZenGarden.frames_per_second
      5.98 ±  5%     +52.2%       9.10 ±  3%  turbostat.GFXWatt
     31.18 ±  6%     +12.8%      35.19 ±  2%  turbostat.PkgWatt
      0.08 ±  6%     +29.7%       0.11 ± 15%  perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
      5.42 ±  3%     -12.0%       4.77 ±  5%  perf-sched.wait_and_delay.avg.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
      6.68 ±  4%     -10.6%       5.97 ±  8%  perf-sched.wait_and_delay.avg.ms.worker_thread.kthread.ret_from_fork
      6287 ±  3%     +11.0%       6976 ±  3%  perf-sched.wait_and_delay.count.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
      5.40 ±  3%     -12.1%       4.75 ±  5%  perf-sched.wait_time.avg.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
      6.67 ±  4%     -10.6%       5.96 ±  8%  perf-sched.wait_time.avg.ms.worker_thread.kthread.ret_from_fork
     37.42 ±  2%     -12.2       25.26 ±  5%  perf-stat.i.cache-miss-rate%
  78410775 ±  3%     -26.6%   57551017 ±  6%  perf-stat.i.cache-misses
      0.79           +10.8%       0.88 ±  2%  perf-stat.i.ipc
      0.00 ± 63%      +0.0        0.00 ± 93%  perf-stat.i.node-load-miss-rate%
   7810539 ±  3%     -40.4%    4657311 ± 10%  perf-stat.i.node-loads
   1653601 ±  2%     -26.6%    1213931 ±  5%  perf-stat.i.node-stores
      0.82 ± 11%      -0.4        0.47 ± 45%  perf-profile.calltrace.cycles-pp.eb_lookup_vmas.i915_gem_do_execbuffer.i915_gem_execbuffer2_ioctl.drm_ioctl_kernel.drm_ioctl
      0.76 ± 12%      +0.2        0.97 ± 11%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.entry_SYSCALL_64_after_hwframe
      1.11 ± 15%      -0.4        0.74 ± 20%  perf-profile.children.cycles-pp.__queue_work
      0.87 ± 17%      -0.4        0.51 ± 26%  perf-profile.children.cycles-pp.__i915_vma_retire
      0.91 ± 17%      -0.3        0.59 ± 24%  perf-profile.children.cycles-pp.queue_work_on
      0.83 ± 11%      -0.3        0.55 ± 10%  perf-profile.children.cycles-pp.eb_lookup_vmas
      0.68 ± 26%      -0.2        0.46 ± 24%  perf-profile.children.cycles-pp.process_csb
      0.60 ± 30%      -0.2        0.37 ± 20%  perf-profile.children.cycles-pp.execlists_schedule_out
      0.72 ± 24%      -0.2        0.51 ± 23%  perf-profile.children.cycles-pp.execlists_submission_tasklet
      0.36 ± 16%      -0.1        0.22 ± 21%  perf-profile.children.cycles-pp.run_posix_cpu_timers
      0.43 ± 15%      -0.1        0.29 ± 22%  perf-profile.children.cycles-pp.eb_pin_engine
      0.39 ± 14%      -0.1        0.26 ± 23%  perf-profile.children.cycles-pp.__intel_context_do_pin_ww
      0.33 ±  5%      -0.1        0.22 ± 25%  perf-profile.children.cycles-pp.__active_lookup
      0.24 ± 13%      -0.1        0.15 ± 25%  perf-profile.children.cycles-pp.__perf_event_header__init_id
      0.11 ± 11%      -0.1        0.05 ± 76%  perf-profile.children.cycles-pp.security_socket_recvmsg
      0.07 ± 50%      +0.0        0.10 ± 15%  perf-profile.children.cycles-pp.irqentry_exit
      0.19 ± 10%      +0.1        0.24 ± 14%  perf-profile.children.cycles-pp.printk
      0.19 ± 10%      +0.1        0.24 ± 14%  perf-profile.children.cycles-pp.vprintk_emit
      0.19 ± 10%      +0.1        0.24 ± 14%  perf-profile.children.cycles-pp.console_unlock
      0.19 ± 10%      +0.1        0.24 ± 14%  perf-profile.children.cycles-pp.serial8250_console_write
      0.19 ± 10%      +0.1        0.24 ± 14%  perf-profile.children.cycles-pp.uart_console_write
      0.08 ± 33%      +0.1        0.13 ± 34%  perf-profile.children.cycles-pp.rcu_gp_kthread
      0.64 ± 12%      +0.2        0.80 ±  7%  perf-profile.children.cycles-pp._raw_spin_lock_irqsave
      0.87 ± 17%      -0.4        0.51 ± 27%  perf-profile.self.cycles-pp.__i915_vma_retire
      0.58 ± 17%      -0.2        0.38 ±  9%  perf-profile.self.cycles-pp.eb_lookup_vmas
      0.36 ± 16%      -0.1        0.22 ± 21%  perf-profile.self.cycles-pp.run_posix_cpu_timers
      0.32 ±  6%      -0.1        0.20 ± 28%  perf-profile.self.cycles-pp.__active_lookup
      0.39 ± 15%      -0.1        0.28 ± 16%  perf-profile.self.cycles-pp.__radix_tree_lookup
      0.12 ± 18%      -0.1        0.06 ± 48%  perf-profile.self.cycles-pp.kmem_cache_alloc
      0.12 ± 16%      -0.0        0.08 ± 20%  perf-profile.self.cycles-pp.rcu_nmi_enter
      0.59 ± 12%      +0.2        0.74 ± 10%  perf-profile.self.cycles-pp._raw_spin_lock_irqsave


                                                                                
                                                                                
                                                                                
  800 +---------------------------------------------------------------------+   
      |O    O  OO     OO     OO    O   O  OO O O O  O  OOO O  OO  O   O OO O|   
  700 |-+O       O O+.   OOO    O    O.+O     O   O  O      O    O O O      |   
  600 |+.++.++.+++.+  ++.+++.++.++.+++  +.++.+                              |   
      |                                                                     |   
  500 |-+                                                                   |   
      |                                                                     |   
  400 |-+                                                                   |   
      |                                                                     |   
  300 |-+                                                                   |   
  200 |-+                                                                   |   
      |                                                                     |   
  100 |-+                                                                   |   
      |                                                                     |   
    0 +---------------------------------------------------------------------+   
                                                                                
                                                                                
[*] bisect-good sample
[O] bisect-bad  sample



Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


---
0DAY/LKP+ Test Infrastructure                   Open Source Technology Center
https://lists.01.org/hyperkitty/list/lkp@lists.01.org       Intel Corporation

Thanks,
Oliver Sang


View attachment "config-5.9.0-rc2-00398-g36b6b6816989c" of type "text/plain" (168571 bytes)

View attachment "job-script" of type "text/plain" (7804 bytes)

View attachment "job.yaml" of type "text/plain" (5168 bytes)

View attachment "reproduce" of type "text/plain" (304 bytes)

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ