lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite for Android: free password hash cracker in your pocket
[<prev] [next>] [day] [month] [year] [list]
Date:	Thu, 26 Feb 2015 13:20:22 +0800
From:	Huang Ying <ying.huang@...el.com>
To:	Mel Gorman <mgorman@...e.de>
Cc:	Linus Torvalds <torvalds@...ux-foundation.org>,
	LKML <linux-kernel@...r.kernel.org>, LKP ML <lkp@...org>
Subject: [LKP] [mm] 8a0516ed8b9: -1.7% netperf.Throughput_Mbps, +2189.6%
 netperf.time.minor_page_faults, +3987.5% proc-vmstat.numa_pte_updates

FYI, we noticed the below changes on

git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git master
commit 8a0516ed8b90c95ffa1363b420caa37418149f21 ("mm: convert p[te|md]_numa users to p[te|md]_protnone_numa")


testbox/testcase/testparams: lkp-sbx04/netperf/performance-900s-200%-TCP_MAERTS

e7bb4b6d1609cce3  8a0516ed8b90c95ffa1363b420  
----------------  --------------------------  
         %stddev     %change         %stddev
             \          |                \  
    226261 ±  1%   +2189.6%    5180560 ±  0%  netperf.time.minor_page_faults
       721 ±  0%      -1.7%        709 ±  0%  netperf.Throughput_Mbps
     12341 ± 16%    -100.0%          0 ±  0%  proc-vmstat.numa_pages_migrated
    364595 ±  3%    -100.0%          0 ±  0%  proc-vmstat.numa_hint_faults_local
    388922 ±  4%    -100.0%          0 ±  0%  proc-vmstat.numa_hint_faults
    226261 ±  1%   +2189.6%    5180560 ±  0%  time.minor_page_faults
    388831 ±  3%   +3987.5%   15893407 ±  0%  proc-vmstat.numa_pte_updates
     12341 ± 16%    -100.0%          0 ±  0%  proc-vmstat.pgmigrate_success
        47 ± 42%     -60.3%         18 ± 13%  sched_debug.cfs_rq[5]:/.blocked_load_avg
        73 ± 19%     -53.9%         34 ± 18%  sched_debug.cfs_rq[46]:/.load
        32 ± 20%     +75.0%         56 ± 32%  sched_debug.cpu#32.load
        27 ± 37%     +61.1%         43 ± 27%  sched_debug.cfs_rq[15]:/.blocked_load_avg
        54 ± 20%     -43.8%         30 ±  5%  sched_debug.cfs_rq[17]:/.load
        57 ± 30%     -39.8%         34 ± 17%  sched_debug.cfs_rq[53]:/.load
        70 ± 29%     -41.3%         41 ±  8%  sched_debug.cfs_rq[5]:/.tg_load_contrib
        64 ± 20%     -27.9%         46 ± 14%  sched_debug.cpu#26.load
        34 ± 21%     +68.6%         57 ±  1%  sched_debug.cfs_rq[15]:/.load
        60 ± 21%     -28.2%         43 ± 26%  sched_debug.cfs_rq[6]:/.load
        50 ± 18%     +33.2%         67 ± 18%  sched_debug.cfs_rq[15]:/.tg_load_contrib
        62 ± 28%     -40.6%         37 ± 32%  sched_debug.cfs_rq[30]:/.load
        59 ± 18%     -33.5%         39 ± 14%  sched_debug.cfs_rq[62]:/.load
       556 ± 25%     -54.2%        255 ± 36%  sched_debug.cpu#59.sched_goidle
      1.63 ±  2%     -31.2%       1.12 ±  0%  perf-profile.cpu-cycles._raw_spin_lock.free_one_page.__free_pages_ok.free_compound_page.put_compound_page
        50 ± 40%     -35.5%         32 ± 16%  sched_debug.cpu#43.load
        31 ± 18%     +39.7%         44 ± 22%  sched_debug.cpu#53.load
      2.18 ±  3%     -29.0%       1.55 ±  3%  perf-profile.cpu-cycles.free_one_page.__free_pages_ok.free_compound_page.put_compound_page.put_page
        46 ± 13%     -37.6%         29 ± 31%  sched_debug.cfs_rq[16]:/.blocked_load_avg
        51 ± 26%     -36.6%         32 ±  6%  sched_debug.cpu#7.load
        73 ± 13%     -20.8%         58 ±  9%  sched_debug.cfs_rq[51]:/.tg_load_contrib
      1.77 ±  2%     -25.1%       1.33 ±  1%  perf-profile.cpu-cycles._raw_spin_lock_irqsave.get_page_from_freelist.__alloc_pages_nodemask.alloc_pages_current.skb_page_frag_refill
        58 ± 23%     -38.4%         35 ± 24%  sched_debug.cfs_rq[2]:/.load
   8833788 ±  8%     +22.5%   10821104 ± 12%  sched_debug.cfs_rq[12]:/.max_vruntime
   8833787 ±  8%     +22.5%   10821104 ± 12%  sched_debug.cfs_rq[12]:/.MIN_vruntime
      1951 ± 12%     +20.1%       2343 ± 12%  sched_debug.cpu#9.curr->pid
    112948 ±  2%     +25.6%     141909 ± 11%  sched_debug.cpu#32.sched_count
      1955 ±  9%     +17.3%       2293 ±  9%  sched_debug.cpu#46.curr->pid
   9533920 ± 16%     +31.8%   12561711 ± 13%  sched_debug.cfs_rq[53]:/.max_vruntime
   9533919 ± 16%     +31.8%   12561711 ± 13%  sched_debug.cfs_rq[53]:/.MIN_vruntime
      0.97 ± 10%     -15.7%       0.82 ±  6%  perf-profile.cpu-cycles.tcp_send_mss.tcp_sendmsg.inet_sendmsg.do_sock_sendmsg.SYSC_sendto
     59313 ± 24%     -21.3%      46703 ±  2%  sched_debug.cpu#25.ttwu_count
      3.92 ±  2%     -17.1%       3.25 ±  0%  perf-profile.cpu-cycles.put_compound_page.put_page.skb_release_data.skb_release_all.__kfree_skb
      3.72 ±  2%     -16.4%       3.11 ±  0%  perf-profile.cpu-cycles.free_compound_page.put_compound_page.put_page.skb_release_data.skb_release_all
      3.65 ±  1%     -16.8%       3.04 ±  0%  perf-profile.cpu-cycles.__free_pages_ok.free_compound_page.put_compound_page.put_page.skb_release_data
      1853 ±  9%     +15.7%       2144 ±  5%  sched_debug.cpu#45.curr->pid
      1769 ±  4%     +19.9%       2121 ±  6%  sched_debug.cpu#61.curr->pid
      5.97 ±  2%     -16.1%       5.01 ±  0%  perf-profile.cpu-cycles.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv.ip_local_deliver_finish.ip_local_deliver
      1.59 ±  2%     -14.2%       1.37 ±  2%  perf-profile.cpu-cycles.sk_stream_alloc_skb.tcp_sendmsg.inet_sendmsg.do_sock_sendmsg.SYSC_sendto
      2.65 ±  3%     -17.1%       2.20 ±  1%  perf-profile.cpu-cycles.tcp_transmit_skb.tcp_write_xmit.__tcp_push_pending_frames.tcp_rcv_established.tcp_v4_do_rcv
      6.19 ±  1%     -15.3%       5.25 ±  0%  perf-profile.cpu-cycles.tcp_v4_do_rcv.tcp_v4_rcv.ip_local_deliver_finish.ip_local_deliver.ip_rcv_finish
      4.45 ±  1%     -15.6%       3.75 ±  0%  perf-profile.cpu-cycles.put_page.skb_release_data.skb_release_all.__kfree_skb.tcp_recvmsg
    693150 ±  3%     +14.6%     794663 ±  6%  sched_debug.cpu#1.avg_idle
      8.14 ±  1%     -14.7%       6.94 ±  0%  perf-profile.cpu-cycles.ip_rcv.__netif_receive_skb_core.__netif_receive_skb.process_backlog.net_rx_action
      7.37 ±  2%     -14.8%       6.28 ±  1%  perf-profile.cpu-cycles.tcp_v4_rcv.ip_local_deliver_finish.ip_local_deliver.ip_rcv_finish.ip_rcv
      6.70 ±  1%     -14.3%       5.74 ±  1%  perf-profile.cpu-cycles.__kfree_skb.tcp_recvmsg.inet_recvmsg.sock_recvmsg.SYSC_recvfrom
      1988 ±  7%      -8.0%       1829 ±  7%  sched_debug.cpu#33.curr->pid
      5.21 ±  1%     -14.0%       4.49 ±  0%  perf-profile.cpu-cycles.skb_release_data.skb_release_all.__kfree_skb.tcp_recvmsg.inet_recvmsg
      7.93 ±  2%     -14.5%       6.78 ±  0%  perf-profile.cpu-cycles.ip_rcv_finish.ip_rcv.__netif_receive_skb_core.__netif_receive_skb.process_backlog
      7.73 ±  2%     -14.3%       6.62 ±  0%  perf-profile.cpu-cycles.ip_local_deliver.ip_rcv_finish.ip_rcv.__netif_receive_skb_core.__netif_receive_skb
      7.70 ±  2%     -14.6%       6.58 ±  0%  perf-profile.cpu-cycles.ip_local_deliver_finish.ip_local_deliver.ip_rcv_finish.ip_rcv.__netif_receive_skb_core
      8.90 ±  1%     -14.1%       7.65 ±  0%  perf-profile.cpu-cycles.__do_softirq.do_softirq_own_stack.do_softirq.__local_bh_enable_ip.ip_finish_output
     39826 ±  8%     -11.0%      35446 ±  1%  sched_debug.cpu#22.ttwu_local
      6.29 ±  1%     -14.3%       5.38 ±  0%  perf-profile.cpu-cycles.skb_release_all.__kfree_skb.tcp_recvmsg.inet_recvmsg.sock_recvmsg
      8.68 ±  2%     -14.1%       7.45 ±  0%  perf-profile.cpu-cycles.net_rx_action.__do_softirq.do_softirq_own_stack.do_softirq.__local_bh_enable_ip
      8.43 ±  2%     -13.5%       7.29 ±  0%  perf-profile.cpu-cycles.process_backlog.net_rx_action.__do_softirq.do_softirq_own_stack.do_softirq
      8.30 ±  2%     -13.5%       7.18 ±  0%  perf-profile.cpu-cycles.__netif_receive_skb_core.__netif_receive_skb.process_backlog.net_rx_action.__do_softirq
      8.32 ±  2%     -13.4%       7.21 ±  0%  perf-profile.cpu-cycles.__netif_receive_skb.process_backlog.net_rx_action.__do_softirq.do_softirq_own_stack
      8.99 ±  2%     -13.3%       7.79 ±  1%  perf-profile.cpu-cycles.do_softirq.part.13.__local_bh_enable_ip.ip_finish_output.ip_output.ip_local_out_sk
      9.02 ±  1%     -13.3%       7.82 ±  1%  perf-profile.cpu-cycles.__local_bh_enable_ip.ip_finish_output.ip_output.ip_local_out_sk.ip_queue_xmit
      1.02 ±  6%     -15.9%       0.86 ±  3%  perf-profile.cpu-cycles.ip_queue_xmit.tcp_transmit_skb.tcp_write_xmit.__tcp_push_pending_frames.tcp_rcv_established
  10430785 ±  7%     -16.8%    8673532 ±  9%  sched_debug.cfs_rq[49]:/.max_vruntime
  10430785 ±  7%     -16.8%    8673530 ±  9%  sched_debug.cfs_rq[49]:/.MIN_vruntime
  11509303 ±  9%     -23.1%    8846526 ± 14%  sched_debug.cfs_rq[13]:/.MIN_vruntime
  11509303 ±  9%     -23.1%    8846526 ± 14%  sched_debug.cfs_rq[13]:/.max_vruntime
      8.95 ±  1%     -13.2%       7.77 ±  0%  perf-profile.cpu-cycles.do_softirq_own_stack.do_softirq.__local_bh_enable_ip.ip_finish_output.ip_output
     40589 ±  6%     -10.4%      36365 ±  3%  sched_debug.cpu#54.ttwu_local
      4.34 ±  1%     -12.8%       3.78 ±  1%  perf-profile.cpu-cycles.alloc_pages_current.skb_page_frag_refill.sk_page_frag_refill.tcp_sendmsg.inet_sendmsg
      3.74 ±  1%     -14.7%       3.19 ±  1%  perf-profile.cpu-cycles.get_page_from_freelist.__alloc_pages_nodemask.alloc_pages_current.skb_page_frag_refill.sk_page_frag_refill
      1.02 ±  6%     -14.0%       0.88 ±  4%  perf-profile.cpu-cycles.skb_release_head_state.skb_release_all.__kfree_skb.tcp_recvmsg.inet_recvmsg
     15.16 ±  0%     +15.4%      17.50 ±  0%  perf-profile.cpu-cycles.copy_user_generic_string.tcp_sendmsg.inet_sendmsg.do_sock_sendmsg.SYSC_sendto
     10.04 ±  1%     -12.1%       8.82 ±  0%  perf-profile.cpu-cycles.ip_output.ip_local_out_sk.ip_queue_xmit.tcp_transmit_skb.tcp_send_ack
     10.11 ±  1%     -12.1%       8.88 ±  0%  perf-profile.cpu-cycles.ip_local_out_sk.ip_queue_xmit.tcp_transmit_skb.tcp_send_ack.tcp_cleanup_rbuf
     38453 ±  6%      -9.4%      34832 ±  2%  sched_debug.cpu#12.ttwu_local
     10.22 ±  1%     -12.5%       8.94 ±  0%  perf-profile.cpu-cycles.ip_finish_output.ip_output.ip_local_out_sk.ip_queue_xmit.tcp_transmit_skb
      4.16 ±  1%     -13.3%       3.60 ±  1%  perf-profile.cpu-cycles.__alloc_pages_nodemask.alloc_pages_current.skb_page_frag_refill.sk_page_frag_refill.tcp_sendmsg
      3.71 ±  2%     -11.8%       3.27 ±  0%  perf-profile.cpu-cycles.__tcp_push_pending_frames.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv.ip_local_deliver_finish
        24 ±  9%     -14.4%         20 ±  3%  sched_debug.cfs_rq[23]:/.runnable_load_avg
        23 ±  4%     -10.6%         21 ±  3%  sched_debug.cpu#23.cpu_load[0]
      5196 ±  6%     +19.6%       6213 ± 18%  numa-vmstat.node1.nr_anon_pages
     20787 ±  6%     +19.6%      24855 ± 18%  numa-meminfo.node1.AnonPages
      3.57 ±  2%     -12.9%       3.11 ±  0%  perf-profile.cpu-cycles.tcp_write_xmit.__tcp_push_pending_frames.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv
     10.44 ±  1%     -12.2%       9.17 ±  0%  perf-profile.cpu-cycles.ip_queue_xmit.tcp_transmit_skb.tcp_send_ack.tcp_cleanup_rbuf.tcp_recvmsg
     52615 ±  7%     -10.5%      47095 ±  3%  sched_debug.cpu#27.ttwu_count
    747189 ±  2%      +9.4%     817106 ±  5%  sched_debug.cpu#5.avg_idle
      4.58 ±  2%     -13.0%       3.98 ±  1%  perf-profile.cpu-cycles.skb_page_frag_refill.sk_page_frag_refill.tcp_sendmsg.inet_sendmsg.do_sock_sendmsg
      4.95 ±  1%     -12.0%       4.36 ±  1%  perf-profile.cpu-cycles.sk_page_frag_refill.tcp_sendmsg.inet_sendmsg.do_sock_sendmsg.SYSC_sendto
     12.85 ±  1%     -11.8%      11.33 ±  0%  perf-profile.cpu-cycles.tcp_cleanup_rbuf.tcp_recvmsg.inet_recvmsg.sock_recvmsg.SYSC_recvfrom
      9.45 ±  1%     -11.5%       8.36 ±  0%  perf-profile.cpu-cycles.tcp_transmit_skb.tcp_send_ack.tcp_cleanup_rbuf.tcp_recvmsg.inet_recvmsg
     12.65 ±  1%     -11.8%      11.16 ±  0%  perf-profile.cpu-cycles.tcp_send_ack.tcp_cleanup_rbuf.tcp_recvmsg.inet_recvmsg.sock_recvmsg
      1967 ±  5%     -11.3%       1744 ±  4%  sched_debug.cpu#38.curr->pid
      1.50 ±  3%     -11.2%       1.33 ±  2%  perf-profile.cpu-cycles.__alloc_skb.sk_stream_alloc_skb.tcp_sendmsg.inet_sendmsg.do_sock_sendmsg
      1796 ±  7%     +16.9%       2101 ±  3%  sched_debug.cpu#20.curr->pid
     52803 ±  8%      -9.1%      48004 ±  2%  sched_debug.cpu#22.ttwu_count
    121926 ±  3%      -7.3%     113073 ±  3%  sched_debug.cpu#27.nr_switches
    122808 ±  3%      -7.8%     113235 ±  3%  sched_debug.cpu#27.sched_count
    121572 ±  5%      -7.1%     112948 ±  2%  sched_debug.cpu#46.nr_switches
     39590 ±  3%      -6.3%      37095 ±  4%  sched_debug.cpu#43.ttwu_local
     18366 ±  0%      -5.0%      17440 ±  0%  vmstat.system.cs
     70241 ±  0%      -2.8%      68273 ±  0%  vmstat.system.in

lkp-sbx04: Sandy Bridge-EX
Memory: 64G




                                time.minor_page_faults

  5.5e+06 ++----------------------------------------------------------------+
    5e+06 ++                                                       O O OO O O
          |                                                                 |
  4.5e+06 ++                                                                |
    4e+06 ++                                                                |
  3.5e+06 ++                                                                |
    3e+06 ++                                                                |
          |                                                                 |
  2.5e+06 ++                                                                |
    2e+06 ++                                                                |
  1.5e+06 ++                                                                |
    1e+06 ++                                                                |
          |                                                                 |
   500000 O+O.O.OO.O.O.O.O.OO.O.O.O.O.OO.O.O.O.O.OO.O.O.O.O OO O O          |
        0 ++----------------------------------------------------------------+


                            netperf.time.minor_page_faults

  5.5e+06 ++----------------------------------------------------------------+
    5e+06 ++                                                       O O OO O O
          |                                                                 |
  4.5e+06 ++                                                                |
    4e+06 ++                                                                |
  3.5e+06 ++                                                                |
    3e+06 ++                                                                |
          |                                                                 |
  2.5e+06 ++                                                                |
    2e+06 ++                                                                |
  1.5e+06 ++                                                                |
    1e+06 ++                                                                |
          |                                                                 |
   500000 O+O.O.OO.O.O.O.O.OO.O.O.O.O.OO.O.O.O.O.OO.O.O.O.O OO O O          |
        0 ++----------------------------------------------------------------+


	[*] bisect-good sample
	[O] bisect-bad  sample

To reproduce:

	apt-get install ruby
	git clone git://git.kernel.org/pub/scm/linux/kernel/git/wfg/lkp-tests.git
	cd lkp-tests
	bin/setup-local job.yaml # the job file attached in this email
	bin/run-local   job.yaml


Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


Thanks,
Ying Huang


View attachment "job.yaml" of type "text/plain" (1725 bytes)

View attachment "reproduce" of type "text/plain" (4490 bytes)

_______________________________________________
LKP mailing list
LKP@...ux.intel.com

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ