[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CALCETrVEnVV5gibE++gj=eudqEBFTckDasqcY2zZDC9Y+NMUrQ@mail.gmail.com>
Date: Mon, 5 Jan 2015 12:31:15 -0800
From: Andy Lutomirski <luto@...capital.net>
To: Borislav Petkov <bp@...en8.de>
Cc: X86 ML <x86@...nel.org>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
Frédéric Weisbecker <fweisbec@...il.com>,
Oleg Nesterov <oleg@...hat.com>, kvm list <kvm@...r.kernel.org>
Subject: Re: [PATCH 1/3] x86_64,entry: Fix RCX for traced syscalls
On Mon, Jan 5, 2015 at 4:59 AM, Borislav Petkov <bp@...en8.de> wrote:
> On Fri, Nov 07, 2014 at 03:58:17PM -0800, Andy Lutomirski wrote:
>> The int_ret_from_sys_call and syscall tracing code disagrees with
>> the sysret path as to the value of RCX.
>>
>> The Intel SDM, the AMD APM, and my laptop all agree that sysret
>> returns with RCX == RIP. The syscall tracing code does not respect
>> this property.
>>
>> For example, this program:
>>
>> int main()
>> {
>> extern const char syscall_rip[];
>> unsigned long rcx = 1;
>> unsigned long orig_rcx = rcx;
>> asm ("mov $-1, %%eax\n\t"
>> "syscall\n\t"
>> "syscall_rip:"
>> : "+c" (rcx) : : "r11");
>> printf("syscall: RCX = %lX RIP = %lX orig RCX = %lx\n",
>> rcx, (unsigned long)syscall_rip, orig_rcx);
>> return 0;
>> }
>>
>> prints:
>> syscall: RCX = 400556 RIP = 400556 orig RCX = 1
>>
>> Running it under strace gives this instead:
>> syscall: RCX = FFFFFFFFFFFFFFFF RIP = 400556 orig RCX = 1
>
> I can trigger the same even without tracing it:
>
> syscall: RCX = FFFFFFFFFFFFFFFF RIP = 40052C orig RCX = 1
Do you have context tracking on?
>
>> This changes FIXUP_TOP_OF_STACK to match sysret, causing the test to
>> show RCX == RIP even under strace.
>>
>> Signed-off-by: Andy Lutomirski <luto@...capital.net>
>> ---
>> arch/x86/kernel/entry_64.S | 3 ++-
>> 1 file changed, 2 insertions(+), 1 deletion(-)
>>
>> diff --git a/arch/x86/kernel/entry_64.S b/arch/x86/kernel/entry_64.S
>> index df088bb03fb3..3710b8241945 100644
>> --- a/arch/x86/kernel/entry_64.S
>> +++ b/arch/x86/kernel/entry_64.S
>> @@ -143,7 +143,8 @@ ENDPROC(native_usergs_sysret64)
>> movq \tmp,RSP+\offset(%rsp)
>> movq $__USER_DS,SS+\offset(%rsp)
>> movq $__USER_CS,CS+\offset(%rsp)
>> - movq $-1,RCX+\offset(%rsp)
>> + movq RIP+\offset(%rsp),\tmp /* get rip */
>> + movq \tmp,RCX+\offset(%rsp) /* copy it to rcx as sysret would do */
>> movq R11+\offset(%rsp),\tmp /* get eflags */
>> movq \tmp,EFLAGS+\offset(%rsp)
>> .endm
>> --
>
> For some reason this patch is causing ata resets on by box, see the
> end of this mail. So something's not kosher yet. If I boot the kernel
> without it, it all seems ok.
>
> Btw, this change got introduced in 2002 where it used to return rIP in
> %rcx before, but it got changed to return -1 for rIP for some reason.
Thanks! I assume that's in the historical tree?
[...]
>
> ---
>
> [ 180.059170] ata1.00: exception Emask 0x0 SAct 0x7fffffff SErr 0x0 action 0x6 frozen
> [ 180.066873] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.072158] ata1.00: cmd 61/08:00:a8:ac:d9/00:00:23:00:00/40 tag 0 ncq 4096 out
> [ 180.072158] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
That's really weird. The only thing I can think of is that somehow we
returned to user mode without enabling interrupts. This leads me to
wonder: why do we save eflags in the R11 pt_regs slot? This seems
entirely backwards, not to mention that it accounts for two
instructions in each of FIXUP_TOP_OF_STACK and RESTORE_TOP_OF_STACK
for no apparently reason whatsoever.
Can you send the full output from syscall_exit_regs_64 from here:
https://gitorious.org/linux-test-utils/linux-clock-tests/source/34884122b6ebe81d9b96e3e5128b6d6d95082c6e:
with the patch applied (assuming it even gets that far for you)? I
see results like:
[NOTE] syscall ffff: orig RCX = 1 ss = 2b orig_ss = 6b flags =
217 orig_flags = 217
which seems fine.
Are you seeing this with the whole series applied or with only this patch?
--Andy
> [ 180.086912] ata1.00: status: { DRDY }
> [ 180.090591] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.095846] ata1.00: cmd 61/08:08:18:ae:d9/00:00:23:00:00/40 tag 1 ncq 4096 out
> [ 180.095846] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.110603] ata1.00: status: { DRDY }
> [ 180.114283] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.119539] ata1.00: cmd 61/10:10:f0:b1:d9/00:00:23:00:00/40 tag 2 ncq 8192 out
> [ 180.119539] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.134292] ata1.00: status: { DRDY }
> [ 180.137973] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.143226] ata1.00: cmd 61/08:18:00:98:18/00:00:1d:00:00/40 tag 3 ncq 4096 out
> [ 180.143226] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.158105] ata1.00: status: { DRDY }
> [ 180.161809] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.167071] ata1.00: cmd 61/10:20:18:98:18/00:00:1d:00:00/40 tag 4 ncq 8192 out
> [ 180.167071] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.181822] ata1.00: status: { DRDY }
> [ 180.185503] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.190756] ata1.00: cmd 61/a0:28:e0:7c:5d/25:00:1d:00:00/40 tag 5 ncq 4931584 out
> [ 180.190756] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.205770] ata1.00: status: { DRDY }
> [ 180.209448] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.214692] ata1.00: cmd 61/80:30:80:a2:5d/4a:00:1d:00:00/40 tag 6 ncq 9764864 out
> [ 180.214692] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.229701] ata1.00: status: { DRDY }
> [ 180.233381] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.238631] ata1.00: cmd 61/08:38:00:ed:5d/22:00:1d:00:00/40 tag 7 ncq 4460544 out
> [ 180.238631] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.253647] ata1.00: status: { DRDY }
> [ 180.257321] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.262576] ata1.00: cmd 61/08:40:38:b8:18/00:00:1d:00:00/40 tag 8 ncq 4096 out
> [ 180.262576] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.277321] ata1.00: status: { DRDY }
> [ 180.281007] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.286263] ata1.00: cmd 61/10:48:78:b8:18/00:00:1d:00:00/40 tag 9 ncq 8192 out
> [ 180.286263] res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.301013] ata1.00: status: { DRDY }
> [ 180.304693] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.309947] ata1.00: cmd 61/08:50:60:98:d8/00:00:1d:00:00/40 tag 10 ncq 4096 out
> [ 180.309947] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.324781] ata1.00: status: { DRDY }
> [ 180.328461] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.333713] ata1.00: cmd 61/08:58:18:f8:a1/00:00:20:00:00/40 tag 11 ncq 4096 out
> [ 180.333713] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.348550] ata1.00: status: { DRDY }
> [ 180.352235] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.357484] ata1.00: cmd 61/08:60:00:88:a0/00:00:20:00:00/40 tag 12 ncq 4096 out
> [ 180.357484] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.372318] ata1.00: status: { DRDY }
> [ 180.375997] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.381251] ata1.00: cmd 61/48:68:c8:ac:dc/00:00:22:00:00/40 tag 13 ncq 36864 out
> [ 180.381251] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.396203] ata1.00: status: { DRDY }
> [ 180.399882] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.405134] ata1.00: cmd 61/08:70:40:c8:9d/00:00:20:00:00/40 tag 14 ncq 4096 out
> [ 180.405134] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.419993] ata1.00: status: { DRDY }
> [ 180.423674] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.428920] ata1.00: cmd 61/08:78:58:a7:9c/00:00:20:00:00/40 tag 15 ncq 4096 out
> [ 180.428920] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.443989] ata1.00: status: { DRDY }
> [ 180.447892] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.453367] ata1.00: cmd 61/08:80:28:c5:dd/00:00:25:00:00/40 tag 16 ncq 4096 out
> [ 180.453367] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.469053] ata1.00: status: { DRDY }
> [ 180.472950] ata1.00: failed command: READ FPDMA QUEUED
> [ 180.478330] ata1.00: cmd 60/20:88:00:42:3c/00:00:1d:00:00/40 tag 17 ncq 16384 in
> [ 180.478330] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.494011] ata1.00: status: { DRDY }
> [ 180.497910] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.503378] ata1.00: cmd 61/10:90:80:98:d8/00:00:23:00:00/40 tag 18 ncq 8192 out
> [ 180.503378] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.519072] ata1.00: status: { DRDY }
> [ 180.522970] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.528434] ata1.00: cmd 61/08:98:88:99:d8/00:00:23:00:00/40 tag 19 ncq 4096 out
> [ 180.528434] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.544106] ata1.00: status: { DRDY }
> [ 180.547997] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.553462] ata1.00: cmd 61/08:a0:c8:99:d8/00:00:23:00:00/40 tag 20 ncq 4096 out
> [ 180.553462] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.569137] ata1.00: status: { DRDY }
> [ 180.573030] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.578508] ata1.00: cmd 61/08:a8:e0:9c:d8/00:00:23:00:00/40 tag 21 ncq 4096 out
> [ 180.578508] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.594221] ata1.00: status: { DRDY }
> [ 180.598126] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.603591] ata1.00: cmd 61/10:b0:18:9f:d8/00:00:23:00:00/40 tag 22 ncq 8192 out
> [ 180.603591] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.619266] ata1.00: status: { DRDY }
> [ 180.623157] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.628614] ata1.00: cmd 61/08:b8:78:b2:d8/00:00:23:00:00/40 tag 23 ncq 4096 out
> [ 180.628614] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.644286] ata1.00: status: { DRDY }
> [ 180.648178] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.653642] ata1.00: cmd 61/08:c0:b8:a3:d9/00:00:23:00:00/40 tag 24 ncq 4096 out
> [ 180.653642] res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.669316] ata1.00: status: { DRDY }
> [ 180.673205] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.678667] ata1.00: cmd 61/08:c8:00:a5:d9/00:00:23:00:00/40 tag 25 ncq 4096 out
> [ 180.678667] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.694340] ata1.00: status: { DRDY }
> [ 180.698231] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.703690] ata1.00: cmd 61/08:d0:78:a9:d9/00:00:23:00:00/40 tag 26 ncq 4096 out
> [ 180.703690] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.719368] ata1.00: status: { DRDY }
> [ 180.723258] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.728720] ata1.00: cmd 61/08:d8:10:aa:d9/00:00:23:00:00/40 tag 27 ncq 4096 out
> [ 180.728720] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.744390] ata1.00: status: { DRDY }
> [ 180.748276] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.753737] ata1.00: cmd 61/08:e0:20:aa:d9/00:00:23:00:00/40 tag 28 ncq 4096 out
> [ 180.753737] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.769414] ata1.00: status: { DRDY }
> [ 180.773303] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.778760] ata1.00: cmd 61/10:e8:20:ab:d9/00:00:23:00:00/40 tag 29 ncq 8192 out
> [ 180.778760] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.794433] ata1.00: status: { DRDY }
> [ 180.798323] ata1.00: failed command: WRITE FPDMA QUEUED
> [ 180.803785] ata1.00: cmd 61/08:f0:90:ac:d9/00:00:23:00:00/40 tag 30 ncq 4096 out
> [ 180.803785] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
> [ 180.819455] ata1.00: status: { DRDY }
> [ 180.823358] ata1: hard resetting link
> [ 181.131630] ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
> [ 181.140035] ata1.00: configured for UDMA/133
> [ 181.144549] ata1.00: device reported invalid CHS sector 0
> [ 181.150207] ata1.00: device reported invalid CHS sector 0
> [ 181.155843] ata1.00: device reported invalid CHS sector 0
> [ 181.161481] ata1.00: device reported invalid CHS sector 0
> [ 181.167147] ata1.00: device reported invalid CHS sector 0
> [ 181.172782] ata1.00: device reported invalid CHS sector 0
> [ 181.178407] ata1.00: device reported invalid CHS sector 0
> [ 181.184033] ata1.00: device reported invalid CHS sector 0
> [ 181.189655] ata1.00: device reported invalid CHS sector 0
> [ 181.195267] ata1.00: device reported invalid CHS sector 0
> [ 181.200882] ata1.00: device reported invalid CHS sector 0
> [ 181.206489] ata1.00: device reported invalid CHS sector 0
> [ 181.212107] ata1.00: device reported invalid CHS sector 0
> [ 181.217720] ata1.00: device reported invalid CHS sector 0
> [ 181.223326] ata1.00: device reported invalid CHS sector 0
> [ 181.228936] ata1.00: device reported invalid CHS sector 0
> [ 181.234533] ata1.00: device reported invalid CHS sector 0
> [ 181.240134] ata1.00: device reported invalid CHS sector 0
> [ 181.245726] ata1.00: device reported invalid CHS sector 0
> [ 181.251323] ata1.00: device reported invalid CHS sector 0
> [ 181.256913] ata1.00: device reported invalid CHS sector 0
> [ 181.262505] ata1.00: device reported invalid CHS sector 0
> [ 181.268087] ata1.00: device reported invalid CHS sector 0
> [ 181.273672] ata1.00: device reported invalid CHS sector 0
> [ 181.279257] ata1.00: device reported invalid CHS sector 0
> [ 181.284836] ata1.00: device reported invalid CHS sector 0
> [ 181.290416] ata1.00: device reported invalid CHS sector 0
> [ 181.295988] ata1.00: device reported invalid CHS sector 0
> [ 181.301571] ata1.00: device reported invalid CHS sector 0
> [ 181.307139] ata1.00: device reported invalid CHS sector 0
> [ 181.312708] ata1.00: device reported invalid CHS sector 0
> [ 181.318426] ata1: EH complete
>
> --
> Regards/Gruss,
> Boris.
>
> Sent from a fat crate under my desk. Formatting is fine.
> --
--
Andy Lutomirski
AMA Capital Management, LLC
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists