[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CAObL_7GM0n80N7J_DFw_eQyfLyzq+sf4y2AvsCCV88Tb3AwEHA@mail.gmail.com>
Date: Wed, 8 Mar 2017 20:39:55 -0800
From: Andrew Lutomirski <luto@....edu>
To: "Dmitry V. Levin" <ldv@...linux.org>
Cc: Denys Vlasenko <vda.linux@...glemail.com>,
Linus Torvalds <torvalds@...ux-foundation.org>,
Indan Zupancic <indan@....nu>, Oleg Nesterov <oleg@...hat.com>,
Andi Kleen <andi@...stfloor.org>,
Jamie Lokier <jamie@...reable.org>,
Will Drewry <wad@...omium.org>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
Kees Cook <keescook@...omium.org>,
John Johansen <john.johansen@...onical.com>,
Serge Hallyn <serge.hallyn@...onical.com>,
coreyb@...ux.vnet.ibm.com, pmoore@...hat.com,
Eric Paris <eparis@...hat.com>, djm@...drot.org,
segoon@...nwall.com, Steven Rostedt <rostedt@...dmis.org>,
James Morris <jmorris@...ei.org>,
Chris Evans <scarybeasts@...il.com>,
Avi Kivity <avi@...hat.com>, penberg@...helsinki.fi,
Al Viro <viro@...iv.linux.org.uk>, Ingo Molnar <mingo@...e.hu>,
Andrew Morton <akpm@...ux-foundation.org>, khilman@...com,
borislav.petkov@....com, amwang@...hat.com,
Andi Kleen <ak@...ux.intel.com>,
Eric Dumazet <eric.dumazet@...il.com>, gregkh@...e.de,
dhowells@...hat.com, daniel.lezcano@...e.fr,
Linux FS Devel <linux-fsdevel@...r.kernel.org>,
linux-security-module <linux-security-module@...r.kernel.org>,
olofj@...omium.org, Michael Halcrow <mhalcrow@...gle.com>,
dlaor@...hat.com, Roland McGrath <mcgrathr@...omium.org>
Subject: Re: Compat 32-bit syscall entry from 64-bit task!?
On Wed, Mar 8, 2017 at 3:41 PM, Dmitry V. Levin <ldv@...linux.org> wrote:
> Hi,
>
> On Thu, Jan 26, 2012 at 07:03:43PM +0100, Denys Vlasenko wrote:
>> Hi Linus,
>>
>> On Thu, Jan 26, 2012 at 4:47 AM, Linus Torvalds
>> <torvalds@...ux-foundation.org> wrote:
>> >> Please look at strace source, get_scno() function, where
>> >> it reads syscall no and parameters. Let's see....
>> >> - POWERPC: has 32-bit and 64-bit mode
>> >> - X86_64: has 32-bit and 64-bit mode
>> >> - IA64: has i386-compat mode
>> >> - ARM: has more than one ABI
>> >> - SPARC: has 32-bit and 64-bit mode
>> >>
>> >> Do you want to re-invent a different arch-specific way to report
>> >> syscall type for each of these arches?
>> >
>> > I think an arch-specific one is better than trying to make some
>> > generic one that is messy.
>> >
>> > As you say, many architectures have multiple system call ABIs.
>> >
>> > But they tend to be very *different* issues. They can be about
>> > multiple ABI's, as you mention, and even when they *look* similar
>> > (32-bit vs 64-bit ABI's) they are actually totally different issues.
>> > [skip]
>>
>> I don't have a particular attachment to my solution,
>> and I think we already talk about this problem for
>> far too long.
>>
>> Looks like nobody is _strongly_ opposed to your patch
>> which uses a few bits in eflags to report bitness
>> of the x86 syscall.
>>
>> Lets just do that already. If you commit it to kernel git,
>> I will immediately change strace accordingly.
>
> Is there any progress with this (or any alternative) solution?
>
> I see the kernel side has changed a bit, and the strace part
> is in a better shape than 5 years ago (although I'm biased of course),
> but I don't see any kernel interface that would allow strace to reliably
> recognize this 0x80 case.
I am strongly opposed to fudging registers to half-arsedly slightly
improve the epicly crappy ptrace(2) interface for syscalls.
To fix this right, please just add PTRACE_GET_SYSCALL_INFO or similar
to, in one shot, read out all the syscall details. This means: arch,
no, arg0..arg5, and *whether it's entry or exit*. I propose returning
this structure:
struct ptrace_syscall_info {
u8 op; /* 0 for entry, 1 for exit */
u8 pad0;
u16 pad1;
u32 pad2;
union {
struct seccomp_data syscall_entry;
s64 syscall_exit_retval;
};
};
because struct seccomp_data already gets this right. There's plenty
of opportunity to fine-tune this. Now it works on all architectures.
Since struct seccomp_data may be extended in the future, the operation
should be:
ptrace(PTRACE_GET_SYSCALL_INFO, pid, (void *)sizeof(struct
ptrace_syscall_info), &info);
returns 0 on success and some error code if, for example, the current
ptrace stop isn't a syscall entry or exit.
--Andy
Powered by blists - more mailing lists