Message-ID: <02dcdbfe-605a-4acb-8f51-6ccae7fbee65@amd.com>
Date: Tue, 16 Sep 2025 08:49:46 +0530
From: K Prateek Nayak <kprateek.nayak@....com>
To: John Stultz <jstultz@...gle.com>, LKML <linux-kernel@...r.kernel.org>
CC: Joel Fernandes <joelagnelf@...dia.com>, Qais Yousef <qyousef@...alina.io>,
Ingo Molnar <mingo@...hat.com>, Peter Zijlstra <peterz@...radead.org>, "Juri
Lelli" <juri.lelli@...hat.com>, Vincent Guittot <vincent.guittot@...aro.org>,
Dietmar Eggemann <dietmar.eggemann@....com>, Valentin Schneider
<vschneid@...hat.com>, Steven Rostedt <rostedt@...dmis.org>, Ben Segall
<bsegall@...gle.com>, Zimuzo Ezeozue <zezeozue@...gle.com>, Mel Gorman
<mgorman@...e.de>, Will Deacon <will@...nel.org>, Waiman Long
<longman@...hat.com>, Boqun Feng <boqun.feng@...il.com>, "Paul E. McKenney"
<paulmck@...nel.org>, Metin Kaya <Metin.Kaya@....com>, Xuewen Yan
<xuewen.yan94@...il.com>, Thomas Gleixner <tglx@...utronix.de>, "Daniel
Lezcano" <daniel.lezcano@...aro.org>, Suleiman Souhlal <suleiman@...gle.com>,
kuyo chang <kuyo.chang@...iatek.com>, hupu <hupu.gm@...il.com>,
<kernel-team@...roid.com>
Subject: Re: [RESEND][PATCH v21 0/6] Donor Migration for Proxy Execution (v21)
Hello John,
On 9/4/2025 5:51 AM, John Stultz wrote:
> Also you can find the full proxy-exec series here:
> https://github.com/johnstultz-work/linux-dev/commits/proxy-exec-v21-6.17-rc4/
> https://github.com/johnstultz-work/linux-dev.git proxy-exec-v21-6.17-rc4
tl;dr
This series looks fine from a performance standpoint. The above branch
may have some performance issues, but take those numbers with a grain of
salt since this is not an apples-to-apples comparison.

For this series things are alright. My harness for the longer-running
benchmarks gave up for some reason, so I'll rerun those tests and report
back later. For netperf / tbench, either tip has some improvements or
"proxy-exec-v21-6.17-rc4" has some issues. I'll take a deeper look
later in the week.
o System Details
- 3rd Generation EPYC System
- 2 x 64C/128T
- NPS1 mode
o Kernels
- tip:        tip:sched/core at commit 5b726e9bf954
              ("sched/fair: Get rid of throttled_lb_pair()")
              (CONFIG_SCHED_PROXY_EXEC disabled)
- proxy-v21:  tip + this series as is
              (CONFIG_SCHED_PROXY_EXEC=y)
- proxy-full: proxy-exec-v21-6.17-rc4 as is
              (CONFIG_SCHED_PROXY_EXEC=y)
o Benchmark results
==================================================================
Test : hackbench
Units : Normalized time in seconds
Interpretation: Lower is better
Statistic : AMean
==================================================================
Case: tip[pct imp](CV) proxy-v21[pct imp](CV) proxy-full[pct imp](CV)
1-groups 1.00 [ -0.00](10.57) 0.94 [ 6.24]( 7.88) 0.91 [ 9.46](10.11)
2-groups 1.00 [ -0.00]( 3.33) 1.02 [ -1.75]( 3.16) 1.04 [ -4.17]( 2.51)
4-groups 1.00 [ -0.00]( 2.41) 1.01 [ -0.87]( 2.29) 1.03 [ -3.03]( 1.27)
8-groups 1.00 [ -0.00]( 2.67) 1.02 [ -1.66]( 2.10) 1.01 [ -0.55]( 1.45)
16-groups 1.00 [ -0.00]( 1.83) 1.01 [ -0.82]( 2.30) 1.00 [ -0.25]( 1.72)
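
A minimal sketch of how the value, [pct imp], and (CV) columns in these
tables can be read. It assumes each cell is the statistic named in the
table header, normalized to tip, that "pct imp" is the signed change
relative to tip (positive meaning better), and that CV is the sample
standard deviation as a percentage of the mean. The function names are
hypothetical and this is not the harness that produced the numbers.

```python
from statistics import mean, stdev


def pct_improvement(baseline, value, lower_is_better):
    """Signed percent improvement over baseline; negative means regression."""
    if lower_is_better:
        # e.g. hackbench time, schbench latency: smaller is an improvement
        return (baseline - value) / baseline * 100.0
    # e.g. tbench / netperf throughput: larger is an improvement
    return (value - baseline) / baseline * 100.0


def coefficient_of_variation(samples):
    """Run-to-run noise: sample standard deviation as a percent of the mean."""
    return stdev(samples) / mean(samples) * 100.0


def summarize(baseline_runs, test_runs, lower_is_better, stat=mean):
    """Build one table cell from raw per-run results.

    `stat` is the summary statistic (assumed: mean for AMean tables;
    statistics.harmonic_mean or statistics.median could be passed for
    the HMean and Median tables).
    """
    base = stat(baseline_runs)
    test = stat(test_runs)
    return {
        "normalized": test / base,
        "pct_imp": pct_improvement(base, test, lower_is_better),
        "cv": coefficient_of_variation(test_runs),
    }


if __name__ == "__main__":
    # Made-up run times, for illustration only (not from the results above).
    cell = summarize([10.0, 10.2, 9.8], [9.0, 9.5, 10.0], lower_is_better=True)
    print("{normalized:.2f} [{pct_imp:6.2f}]({cv:5.2f})".format(**cell))
```

Under these assumptions, a delta whose magnitude is smaller than the CV
of either column is hard to tell apart from run-to-run noise.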
==================================================================
Test : tbench
Units : Normalized throughput
Interpretation: Higher is better
Statistic : AMean
==================================================================
Clients: tip[pct imp](CV) proxy-v21[pct imp](CV) proxy-full[pct imp](CV)
1 1.00 [ 0.00]( 0.81) 1.00 [ -0.13]( 0.16) 0.92 [ -8.06]( 0.39)
2 1.00 [ 0.00]( 0.32) 0.99 [ -0.84]( 0.66) 0.91 [ -8.85]( 0.54)
4 1.00 [ 0.00]( 0.32) 0.98 [ -2.37]( 1.40) 0.92 [ -8.28]( 0.28)
8 1.00 [ 0.00]( 0.69) 0.98 [ -2.47]( 0.53) 0.90 [ -9.58]( 0.36)
16 1.00 [ 0.00]( 1.24) 0.96 [ -3.94]( 1.51) 0.90 [ -9.83]( 0.69)
32 1.00 [ 0.00]( 0.60) 0.99 [ -1.47]( 3.38) 0.89 [-11.43]( 5.60)
64 1.00 [ 0.00]( 1.22) 0.99 [ -1.33]( 0.88) 0.91 [ -8.52]( 2.67)
128 1.00 [ 0.00]( 0.34) 0.99 [ -1.48]( 0.99) 0.92 [ -7.51]( 0.13)
256 1.00 [ 0.00]( 1.32) 0.98 [ -1.75]( 0.96) 0.97 [ -3.35]( 1.22)
512 1.00 [ 0.00]( 0.25) 0.99 [ -1.29]( 0.41) 0.97 [ -2.90]( 0.17)
1024 1.00 [ 0.00]( 0.24) 0.99 [ -0.59]( 0.14) 0.98 [ -2.36]( 0.33)
==================================================================
Test : stream-10
Units : Normalized Bandwidth, MB/s
Interpretation: Higher is better
Statistic : HMean
==================================================================
Test: tip[pct imp](CV) proxy-v21[pct imp](CV) proxy-full[pct imp](CV)
Copy 1.00 [ 0.00](10.90) 1.07 [ 6.53]( 8.21) 1.07 [ 7.26]( 7.22)
Scale 1.00 [ 0.00]( 9.62) 1.04 [ 4.00]( 6.99) 1.05 [ 4.71]( 5.85)
Add 1.00 [ 0.00](10.17) 1.05 [ 5.07]( 6.14) 1.06 [ 6.03]( 6.56)
Triad 1.00 [ 0.00]( 8.48) 1.04 [ 4.34]( 5.09) 1.04 [ 4.07]( 4.40)
==================================================================
Test : stream-100
Units : Normalized Bandwidth, MB/s
Interpretation: Higher is better
Statistic : HMean
==================================================================
Test: tip[pct imp](CV) proxy-v21[pct imp](CV) proxy-full[pct imp](CV)
Copy 1.00 [ 0.00]( 1.38) 1.01 [ 0.99]( 1.21) 1.02 [ 1.68]( 1.50)
Scale 1.00 [ 0.00]( 6.19) 1.02 [ 1.94]( 4.34) 1.03 [ 3.00]( 1.19)
Add 1.00 [ 0.00]( 4.42) 1.01 [ 0.94]( 4.17) 1.02 [ 1.58]( 1.54)
Triad 1.00 [ 0.00]( 1.30) 1.01 [ 0.61]( 1.37) 1.00 [ 0.18]( 2.65)
==================================================================
Test : netperf
Units : Normalized Throughput
Interpretation: Higher is better
Statistic : AMean
==================================================================
Clients: tip[pct imp](CV) proxy-v21[pct imp](CV) proxy-full[pct imp](CV)
1-clients 1.00 [ 0.00]( 0.41) 0.99 [ -1.03]( 0.34) 0.90 [ -9.96]( 0.46)
2-clients 1.00 [ 0.00]( 0.31) 0.99 [ -1.17]( 0.72) 0.90 [ -9.77]( 0.78)
4-clients 1.00 [ 0.00]( 0.57) 0.99 [ -0.68]( 0.32) 0.90 [-10.21]( 0.89)
8-clients 1.00 [ 0.00]( 0.46) 0.99 [ -0.69]( 0.32) 0.90 [-10.20]( 0.70)
16-clients 1.00 [ 0.00]( 0.57) 0.99 [ -1.39]( 1.28) 0.90 [-10.37]( 1.34)
32-clients 1.00 [ 0.00]( 1.03) 0.97 [ -2.53]( 1.92) 0.90 [-10.00]( 1.23)
64-clients 1.00 [ 0.00]( 1.23) 0.97 [ -3.15]( 2.94) 0.90 [ -9.94]( 1.52)
128-clients 1.00 [ 0.00]( 1.14) 0.99 [ -1.07]( 0.95) 0.90 [ -9.91]( 0.90)
256-clients 1.00 [ 0.00]( 3.73) 0.98 [ -1.80]( 3.66) 0.97 [ -3.41]( 4.47)
512-clients 1.00 [ 0.00](54.79) 0.97 [ -3.03](48.98) 0.95 [ -4.63](51.77)
==================================================================
Test : schbench
Units : Normalized 99th percentile latency in us
Interpretation: Lower is better
Statistic : Median
==================================================================
#workers: tip[pct imp](CV) proxy-v21[pct imp](CV) proxy-full[pct imp](CV)
1 1.00 [ -0.00](30.14) 1.11 [-11.11](35.78) 1.31 [-30.56](42.87)
2 1.00 [ -0.00]( 7.87) 0.93 [ 7.14]( 8.45) 0.95 [ 4.76]( 7.50)
4 1.00 [ -0.00]( 7.87) 1.07 [ -7.14]( 7.36) 1.14 [-14.29](12.73)
8 1.00 [ -0.00]( 4.59) 1.08 [ -8.16]( 5.09) 1.12 [-12.24]( 7.44)
16 1.00 [ -0.00]( 5.33) 1.05 [ -5.08]( 0.93) 1.05 [ -5.08]( 2.75)
32 1.00 [ -0.00]( 1.04) 1.00 [ -0.00]( 3.12) 1.07 [ -7.29]( 4.49)
64 1.00 [ -0.00]( 1.04) 0.96 [ 3.50]( 3.78) 1.01 [ -1.00]( 2.24)
128 1.00 [ -0.00]( 5.11) 1.06 [ -6.11]( 7.56) 1.09 [ -8.60]( 6.26)
256 1.00 [ -0.00](19.39) 1.29 [-28.73](14.92) 1.15 [-14.71](14.83)
512 1.00 [ -0.00]( 0.15) 0.98 [ 2.02]( 1.85) 0.99 [ 1.01]( 1.66)
==================================================================
Test : new-schbench-requests-per-second
Units : Normalized Requests per second
Interpretation: Higher is better
Statistic : Median
==================================================================
#workers: tip[pct imp](CV) proxy-v21[pct imp](CV) proxy-full[pct imp](CV)
1 1.00 [ 0.00]( 0.26) 1.00 [ 0.29]( 0.15) 1.00 [ -0.29]( 0.30)
2 1.00 [ 0.00]( 0.00) 1.00 [ 0.00]( 0.15) 1.00 [ 0.00]( 0.15)
4 1.00 [ 0.00]( 0.00) 1.00 [ 0.00]( 0.15) 1.00 [ 0.00]( 0.00)
8 1.00 [ 0.00]( 0.15) 1.00 [ 0.29]( 0.15) 1.00 [ 0.29]( 0.15)
16 1.00 [ 0.00]( 0.00) 1.00 [ 0.00]( 0.00) 1.00 [ 0.00]( 0.00)
32 1.00 [ 0.00]( 1.86) 1.00 [ -0.31]( 0.28) 1.00 [ -0.31]( 2.12)
64 1.00 [ 0.00](13.62) 0.99 [ -0.77]( 4.78) 0.81 [-18.52](11.11)
128 1.00 [ 0.00]( 0.00) 1.00 [ 0.38]( 0.00) 1.00 [ 0.38]( 0.00)
256 1.00 [ 0.00]( 1.49) 1.02 [ 1.82]( 1.63) 1.00 [ 0.00]( 1.19)
512 1.00 [ 0.00]( 0.75) 1.01 [ 0.71]( 1.65) 1.01 [ 1.19]( 1.53)
==================================================================
Test : new-schbench-wakeup-latency
Units : Normalized 99th percentile latency in us
Interpretation: Lower is better
Statistic : Median
==================================================================
#workers: tip[pct imp](CV) proxy-v21[pct imp](CV) proxy-full[pct imp](CV)
1 1.00 [ -0.00]( 6.74) 1.00 [ -0.00]( 6.74) 1.12 [-12.50](19.26)
2 1.00 [ -0.00](11.18) 1.00 [ -0.00](17.21) 1.50 [-50.00]( 7.45)
4 1.00 [ -0.00]( 9.94) 1.00 [ -0.00](19.26) 1.56 [-55.56](15.78)
8 1.00 [ -0.00](10.68) 1.00 [ -0.00](10.68) 1.44 [-44.44](28.77)
16 1.00 [ -0.00]( 9.68) 1.00 [ -0.00]( 9.68) 1.20 [-20.00]( 8.15)
32 1.00 [ -0.00](14.08) 1.00 [ -0.00]( 5.34) 1.20 [-20.00](14.70)
64 1.00 [ -0.00]( 3.52) 1.13 [-13.33]( 5.26) 1.27 [-26.67]( 2.77)
128 1.00 [ -0.00]( 1.79) 1.07 [ -6.56]( 2.70) 1.07 [ -6.97]( 7.71)
256 1.00 [ -0.00]( 9.89) 1.04 [ -4.50]( 3.81) 1.02 [ -2.00]( 7.78)
512 1.00 [ -0.00]( 0.00) 1.01 [ -0.77]( 0.34) 1.00 [ -0.00]( 0.20)
==================================================================
Test : new-schbench-request-latency
Units : Normalized 99th percentile latency in us
Interpretation: Lower is better
Statistic : Median
==================================================================
#workers: tip[pct imp](CV) proxy-v21[pct imp](CV) proxy-full[pct imp](CV)
1 1.00 [ -0.00]( 1.33) 0.96 [ 3.89]( 1.46) 1.02 [ -1.82]( 3.02)
2 1.00 [ -0.00]( 0.14) 1.01 [ -1.09]( 0.24) 1.02 [ -2.44]( 2.73)
4 1.00 [ -0.00]( 1.24) 1.00 [ -0.26]( 1.69) 0.97 [ 2.65]( 0.14)
8 1.00 [ -0.00]( 0.54) 1.00 [ -0.00]( 1.02) 0.99 [ 1.31]( 2.16)
16 1.00 [ -0.00]( 0.36) 1.00 [ -0.00]( 1.70) 0.98 [ 1.59]( 1.00)
32 1.00 [ -0.00]( 5.51) 0.99 [ 0.73]( 2.09) 1.01 [ -1.45]( 7.52)
64 1.00 [ -0.00]( 5.38) 1.09 [ -9.27]( 0.88) 1.09 [ -8.56]( 0.11)
128 1.00 [ -0.00]( 0.32) 1.00 [ -0.36]( 0.32) 1.03 [ -2.54]( 1.15)
256 1.00 [ -0.00](10.51) 1.14 [-14.23](11.19) 1.00 [ 0.24](11.42)
512 1.00 [ -0.00]( 2.00) 1.03 [ -3.27]( 0.94) 1.02 [ -2.41]( 1.96)
--
Thanks and Regards,
Prateek