[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20180413183527.GC16308@e103592.cambridge.arm.com>
Date: Fri, 13 Apr 2018 19:35:38 +0100
From: Dave Martin <Dave.Martin@....com>
To: Russell King - ARM Linux <linux@...linux.org.uk>
Cc: Linus Torvalds <torvalds@...ux-foundation.org>,
Linux Kernel Mailing List <linux-kernel@...r.kernel.org>,
"Dmitry V. Levin" <ldv@...linux.org>,
"Eric W. Biederman" <ebiederm@...ssion.com>,
sparclinux <sparclinux@...r.kernel.org>,
ppc-dev <linuxppc-dev@...ts.ozlabs.org>,
linux-arm-kernel <linux-arm-kernel@...ts.infradead.org>
Subject: Re: sparc/ppc/arm compat siginfo ABI regressions: sending SIGFPE via
kill() returns wrong values in si_pid and si_uid
On Fri, Apr 13, 2018 at 06:54:08PM +0100, Russell King - ARM Linux wrote:
> On Fri, Apr 13, 2018 at 06:08:28PM +0100, Dave Martin wrote:
> > On Fri, Apr 13, 2018 at 09:33:17AM -0700, Linus Torvalds wrote:
> > > On Fri, Apr 13, 2018 at 2:42 AM, Russell King - ARM Linux
> > > <linux@...linux.org.uk> wrote:
> > > >
> > > > Yes, it does solve the problem at hand with strace - the exact patch I
> > > > tested against 4.16 is below.
> > >
> > > Ok, good.
> > >
> > > > However, FPE_FLTUNK is not defined in older kernels, so while we can
> > > > fix it this way for the current merge window, that doesn't help 4.16.
> > >
> > > I wonder if we should even bother with FPE_FLTUNK.
> > >
> > > I suspect we might as well use FPE_FLTINV, I suspect, and not have
> > > this complexity at all. That case is not worth worrying about, since
> > > it's a "this shouldn't happen anyway" and the *real* reason will be in
> > > the kernel logs due to vfs_panic().
> > >
> > > So it's not like this is something that the user should ever care
> > > about the si_code about.
> >
> > Ack, my intended meaning for FPE_FLTUNK is that the fp exception is
> > either spurious or we can't tell easily (or possibly at all) which
> > FPE_XXX should be returned. It's up to userspace to figure it out
> > if it really cares. Previously we were accidentally returning SI_USER
> > in si_code for arm64.
> >
> > This case on arm looks like a more serious error for which FPE_FLTINV
> > may be more appropriate anyway.
>
> No. The cases where we get to this point are:
>
> 1. A trap concerning a coprocessor register transfer instruction (iow, move
> between a VFP register and ARM register.)
> 2. A trap concerning a coprocessor register load or save instruction.
>
> (In both of these, "concerning" means that the VFP hardware provides
> such an instruction as the reason for the fault, *not* that it is the
> faulting instruction.)
>
> 3. A combination of the exception bits (EX and DEX) on certain VFP
> implementations.
>
> All of these can be summarised as "the hardware went wrong in some way"
> rather than "the user program did something wrong."
Although my understanding of VFP bounces is a bit hazy, I think this is
broadly in line with my assumptions.
> FPE_FLTINV means "floating point invalid operation". Does it really
> cover the case where hardware has failed, or is it intended to cover
> the case where userspace did something wrong and asked for an invalid
> operation from the FP hardware?
So, there's an argument that FPE_FLTINV is not really correct. My
rationale was that there is nothing correct that we can return, and
FPE_FLTINV may be no worse than the alternatives.
If we can only hit this case as the result of a hardware failure
or kernel bug though, should this be delivered as SIGKILL instead?
That's the approach I eventually followed for various exceptions
on arm64 that were theoretically delivered to userspace with si_code==0,
but really should be impossible unless and kernel and/or hardware
is buggy.
If that's the case though, I don't see how a userspace testsuite is
hitting this code path. Maybe I've misunderstood the context of this
thread.
Cheers
---Dave
Powered by blists - more mailing lists