lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <96fa41bb-7e9f-ae47-bf34-accef5c36fa8@linux.intel.com>
Date:   Tue, 30 Nov 2021 14:46:06 +0800
From:   Xing Zhengjun <zhengjun.xing@...ux.intel.com>
To:     kernel test robot <oliver.sang@...el.com>,
        Dave Chinner <dchinner@...hat.com>
Cc:     "Darrick J. Wong" <djwong@...nel.org>,
        Chandan Babu R <chandanrlinux@...il.com>,
        Brian Foster <bfoster@...hat.com>,
        Allison Henderson <allison.henderson@...cle.com>,
        LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
        lkp@...el.com
Subject: Re: [LKP] Re: [xfs] bad77c375e: stress-ng.fallocate.ops_per_sec
 -10.0% regression

Hi Dave,

    Do you have time to look at this? It still existed in v5.16-rc3. Thanks

On 9/3/2021 3:40 PM, Xing Zhengjun wrote:
> Hi Dave,
> 
>     Do you have time to look at this? Thanks.
> 
> On 9/2/2021 3:27 PM, kernel test robot wrote:
>>
>>
>> Greeting,
>>
>> FYI, we noticed a -10.0% regression of stress-ng.fallocate.ops_per_sec 
>> due to commit:
>>
>>
>> commit: bad77c375e8de6c776c848e443f7dc2d0d909be5 ("xfs: CIL checkpoint 
>> flushes caches unconditionally")
>> https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
>>
>>
>> in testcase: stress-ng
>> on test machine: 96 threads 2 sockets Intel(R) Xeon(R) Gold 6252 CPU @ 
>> 2.10GHz with 512G memory
>> with following parameters:
>>
>>     nr_threads: 10%
>>     disk: 1HDD
>>     testtime: 60s
>>     fs: xfs
>>     class: filesystem
>>     test: fallocate
>>     cpufreq_governor: performance
>>     ucode: 0x5003006
>>
>>
>> In addition to that, the commit also has significant impact on the 
>> following tests:
>>
>>
>>
>> If you fix the issue, kindly add following tag
>> Reported-by: kernel test robot <oliver.sang@...el.com>
>>
>>
>> Details are as below:
>> --------------------------------------------------------------------------------------------------> 
>>
>>
>>
>> To reproduce:
>>
>>          git clone https://github.com/intel/lkp-tests.git
>>          cd lkp-tests
>>          bin/lkp install                job.yaml  # job file is 
>> attached in this email
>>          bin/lkp split-job --compatible job.yaml  # generate the yaml 
>> file for lkp run
>>          bin/lkp run                    generated-yaml-file
>>
>> ========================================================================================= 
>>
>> class/compiler/cpufreq_governor/disk/fs/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime/ucode: 
>>
>>    
>> filesystem/gcc-9/performance/1HDD/xfs/x86_64-rhel-8.3/10%/debian-10.4-x86_64-20200603.cgz/lkp-csl-2sp7/fallocate/stress-ng/60s/0x5003006 
>>
>>
>> commit:
>>    0431d926b3 ("xfs: async blkdev cache flush")
>>    bad77c375e ("xfs: CIL checkpoint flushes caches unconditionally")
>>
>> 0431d926b399d74f bad77c375e8de6c776c848e443f
>> ---------------- ---------------------------
>>           %stddev     %change         %stddev
>>               \          |                \
>>      618.67           -10.0%     556.67 ±  2%  stress-ng.fallocate.ops
>>       10.31           -10.0%       9.28 ±  2%  
>> stress-ng.fallocate.ops_per_sec
>>      111618            -5.7%     105270        
>> stress-ng.time.voluntary_context_switches
>>      502.67 ±  2%     +52.0%     763.83 ± 47%  
>> interrupts.CPU44.CAL:Function_call_interrupts
>>        2.10 ±  2%     -25.1%       1.57        iostat.cpu.iowait
>>        2.16 ±  2%      -0.5        1.62        mpstat.cpu.all.iowait%
>>        0.09 ± 11%     +45.0%       0.14 ± 34%  
>> perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork
>>      151.00 ± 19%     +20.2%     181.50 ±  7%  
>> perf-sched.wait_and_delay.count.smpboot_thread_fn.kthread.ret_from_fork
>>        6749            +6.4%       7178        
>> perf-stat.i.context-switches
>>        6640            +6.4%       7063        
>> perf-stat.ps.context-switches
>>      353877            -1.4%     348924        proc-vmstat.pgfree
>>       39122           -15.1%      33221        proc-vmstat.pgpgout
>>        1225 ± 14%      -9.0%       1115        turbostat.Bzy_MHz
>>       86.82 ± 17%     -10.4%      77.83        turbostat.PkgWatt
>>      602.83           -15.1%     511.83        vmstat.io.bo
>>        6953            +6.1%       7376        vmstat.system.cs
>>        1.33 ±  6%      +0.4        1.70 ± 13%  
>> perf-profile.calltrace.cycles-pp.perf_mux_hrtimer_handler.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt 
>>
>>        0.08 ± 22%      +0.1        0.17 ± 23%  
>> perf-profile.children.cycles-pp.wait_for_completion
>>        0.07 ± 17%      +0.1        0.19 ± 64%  
>> perf-profile.children.cycles-pp.submit_bio
>>        0.07 ± 17%      +0.1        0.19 ± 64%  
>> perf-profile.children.cycles-pp.submit_bio_noacct
>>        1.40 ±  8%      +0.3        1.74 ± 12%  
>> perf-profile.children.cycles-pp.perf_mux_hrtimer_handler
>>
>>
>>                           stress-ng.fallocate.ops_per_sec
>>    12 
>> +----------------------------------------------------------------------+
>>       
>> |                                                                      |
>>    10 |.+.++.+.+.+  
>> +.+.+.++.+.+.++.+.+.+.++.+.+.++.+.+.++.+.+.+.++.+.+.++.+.|
>>       | O        :  :                             O O   OO O O O 
>> OO          |
>>       |   OO O   :O :            O             O O    
>> O                      |
>>     8 |-+      O :  : O O OO O O  O O O O OO 
>> O                               |
>>       |          : 
>> :                                                         |
>>     6 |-+        : 
>> :                                                         |
>>       |           
>> ::                                                         |
>>     4 |-+         
>> ::                                                         |
>>       |           
>> ::                                                         |
>>       |           
>> ::                                                         |
>>     2 |-+         
>> :                                                          |
>>       |           
>> :                                                          |
>>     0 
>> +----------------------------------------------------------------------+
>> [*] bisect-good sample
>> [O] bisect-bad  sample
>>
>>
>>
>>
>> Disclaimer:
>> Results have been estimated based on internal Intel analysis and are 
>> provided
>> for informational purposes only. Any difference in system hardware or 
>> software
>> design or configuration may affect actual performance.
>>
>>
>> ---
>> 0DAY/LKP+ Test Infrastructure                   Open Source Technology 
>> Center
>> https://lists.01.org/hyperkitty/list/lkp@lists.01.org       Intel 
>> Corporation
>>
>> Thanks,
>> Oliver Sang
>>
> 

-- 
Zhengjun Xing

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ