[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20250708105839.4ed8c792@pumpkin>
Date: Tue, 8 Jul 2025 10:58:39 +0100
From: David Laight <david.laight.linux@...il.com>
To: Oleg Nesterov <oleg@...hat.com>
Cc: Steven Rostedt <rostedt@...dmis.org>, "Li,Rongqing"
<lirongqing@...du.com>, Peter Zijlstra <peterz@...radead.org>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
"vschneid@...hat.com" <vschneid@...hat.com>, "mgorman@...e.de"
<mgorman@...e.de>, "bsegall@...gle.com" <bsegall@...gle.com>,
"dietmar.eggemann@....com" <dietmar.eggemann@....com>,
"vincent.guittot@...aro.org" <vincent.guittot@...aro.org>,
"juri.lelli@...hat.com" <juri.lelli@...hat.com>, "mingo@...hat.com"
<mingo@...hat.com>
Subject: Re: [????] Re: [????] Re: divide error in x86 and cputime
On Tue, 8 Jul 2025 04:17:04 +0200
Oleg Nesterov <oleg@...hat.com> wrote:
> On 07/07, Steven Rostedt wrote:
> >
> > On Tue, 8 Jul 2025 01:58:00 +0000
> > "Li,Rongqing" <lirongqing@...du.com> wrote:
> >
> > > But mul_u64_u64_div_u64() for x86 should not trigger a division error panic,
> > maybe should return a ULLONG_MAX on #DE (like non-x86 mul_u64_u64_div_u64(),)
> >
> > Perhaps.
>
> So do you think
>
> static inline u64 mul_u64_u64_div_u64(u64 a, u64 mul, u64 div)
> {
> int ok = 0;
> u64 q;
>
> asm ("mulq %3; 1: divq %4; movl $1,%1; 2:\n"
> _ASM_EXTABLE(1b, 2b)
> : "=a" (q), "+r" (ok)
> : "a" (a), "rm" (mul), "rm" (div)
> : "rdx");
>
> return ok ? q : -1ul;
You need to decide what to return/do when 'div' is zero.
So perhaps:
if (ok)
return q;
BUG_ON(!div);
return ~(u64)0;
But maybe 0/0 should return 0.
> }
>
> makes sense at least for consistency with the generic implementation
> in lib/math/div64.c ?
I don't like the way the current version handles divide by zero at all.
Even forcing the cpu to execute a 'divide by zero' doesn't seem right.
The result should be well defined (and useful).
It might even be worth adding an extra parameter to report overflow
and return ~0 for overflow and 0 for divide by zero (I think that is
less likely to cause grief in the following instructions).
That does 'pass the buck' to the caller.
>
> > But it is still producing garbage.
>
> Agreed. And not a solution to this particular problem.
Using mul_u64_u_64_div_u64() here is also horribly expensive for a
simple split between (IIRC) utime and stime.
It isn't too bad on x86-64, but everywhere else it is horrid.
For 'random' values the code hits 900 clocks on x86-32 - and that
is in userspace with cmov and %ebp as a general register.
My new version is ~230 for x86-32 and ~130 for x86-64 (not doing
the fast asm) on ivy bridge, ~80 for x86-64 on zen5.
(I'm on holiday and have limited systems available.)
David
>
> Oleg.
>
Powered by blists - more mailing lists