[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <X6/MyFRkEPGqnNtK@kroah.com>
Date: Sat, 14 Nov 2020 13:25:44 +0100
From: Greg Kroah-Hartman <gregkh@...uxfoundation.org>
To: Feng Tang <feng.tang@...el.com>
Cc: kernel test robot <oliver.sang@...el.com>,
Lee Jones <lee.jones@...aro.org>,
Daniel Vetter <daniel.vetter@...ll.ch>,
Russell King <linux@...linux.org.uk>,
Peilin Ye <yepeilin.cs@...il.com>,
LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
lkp@...el.com, ying.huang@...el.com, zhengjun.xing@...el.com
Subject: Re: [Fonts] 9522750c66: fio.read_iops 7.5% improvement
On Sat, Nov 14, 2020 at 03:19:17PM +0800, Feng Tang wrote:
> Hi Greg,
>
> On Fri, Nov 13, 2020 at 07:46:57AM +0100, Greg Kroah-Hartman wrote:
> > On Thu, Nov 12, 2020 at 10:06:25PM +0800, kernel test robot wrote:
> > >
> > > Greeting,
> > >
> > > FYI, we noticed a 7.5% improvement of fio.read_iops due to commit:
> > >
> > >
> > > commit: 9522750c66c689b739e151fcdf895420dc81efc0 ("Fonts: Replace discarded const qualifier")
> > > https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
> >
> > I strongly doubt this :)
>
> We just double checked, the test was run 4 times and the result are
> very stable.
>
> The commit does looks irrelevant to fio test, and we just further
> checked the System map of the 2 kernels, and many data's alignment
> have been changed (systemmaps attached).
>
> We have a hack debug patch to make data sections of each .o file to
> be aligned, with that the fio result gap could be reduced from +7.5%
> to +3.8%, so there is still some other factor affecting the benchmark,
> which need more checking. And with the same debug method of forcing
> data sections aligned, 2 other strange performance bumps[1][2] reported
> by 0day could be recovered.
>
> [1]. https://lore.kernel.org/lkml/20200205123216.GO12867@shao2-debian/
> [2]. https://lore.kernel.org/lkml/20200305062138.GI5972@shao2-debian/
That's really odd. Why wouldn't .o sections be aligned already and how
does that affect the real .ko files that are created from that? What
alignment are you forcing?
And also, what hardware is seeing this performance gains? Something is
fitting into a cache now that previously wasn't, and tracking that down
seems like it would be very worthwhile as that is a non-trivial speedup
that some developers take years to achieve with code changes.
thanks,
greg k-h
Powered by blists - more mailing lists