[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <71849794.2361.1554824757946.JavaMail.zimbra@efficios.com>
Date: Tue, 9 Apr 2019 11:45:57 -0400 (EDT)
From: Mathieu Desnoyers <mathieu.desnoyers@...icios.com>
To: Carlos O'Donell <codonell@...hat.com>
Cc: Tulio Magno Quites Machado Filho <tuliom@...ii.art.br>,
Alan Modra <amodra@...il.com>,
Michael Ellerman <mpe@...erman.id.au>,
Florian Weimer <fweimer@...hat.com>,
Michael Meissner <meissner@...ux.ibm.com>,
Peter Bergner <bergner@...t.ibm.com>,
Paul Burton <paul.burton@...s.com>,
Will Deacon <will.deacon@....com>,
Boqun Feng <boqun.feng@...il.com>,
heiko carstens <heiko.carstens@...ibm.com>,
gor <gor@...ux.ibm.com>, schwidefsky <schwidefsky@...ibm.com>,
"Russell King, ARM Linux" <linux@...linux.org.uk>,
Benjamin Herrenschmidt <benh@...nel.crashing.org>,
Paul Mackerras <paulus@...ba.org>, carlos <carlos@...hat.com>,
Joseph Myers <joseph@...esourcery.com>,
Szabolcs Nagy <szabolcs.nagy@....com>,
libc-alpha <libc-alpha@...rceware.org>,
Thomas Gleixner <tglx@...utronix.de>,
Ben Maurer <bmaurer@...com>,
Peter Zijlstra <peterz@...radead.org>,
"Paul E. McKenney" <paulmck@...ux.vnet.ibm.com>,
Dave Watson <davejwatson@...com>, Paul Turner <pjt@...gle.com>,
Rich Felker <dalias@...c.org>,
linux-kernel <linux-kernel@...r.kernel.org>,
linux-api <linux-api@...r.kernel.org>
Subject: Re: [PATCH 1/4] glibc: Perform rseq(2) registration at C startup
and thread creation (v7)
----- On Apr 9, 2019, at 10:13 AM, Carlos O'Donell codonell@...hat.com wrote:
> On 4/9/19 9:58 AM, Tulio Magno Quites Machado Filho wrote:
>> Alan Modra <amodra@...il.com> writes:
>>> Yes, looks fine to me, except that in VLE mode (do we care?)
>>> ".long 0x0fe50553" disassembles as
>>> 0: 0f e5 se_cmphl r5,r30
>>> 2: 05 53 se_mullw r3,r5
>>> No illegal/trap/privileged insn there.
>>>
>>> ".long 0x0fe5000b" might be better to cover VLE.
>>
>> Looks good for me too.
>
> The requirement that it be a valid instruction is simply to aid in the
> disassembly of rseq regions which may be hand written assembly with a
> thin veneer of CFI/DWARF information.
>
> It has already been pointed out that POWER uses data in the instruction
> stream for jump tables to implement switch statements, but that specific
> use has compiler support and one presumes good debug information. So as
> Alan says, there is already data in the insn stream, though such things
> can't be good for performance (pollutes D-cache, problematic for
> speculative execution).
>
>> Actually, it better fits what Carlos O'Donnell had requested:
>>
>>>>> I think the order of preference is:
>>>>>
>>>>> 1. An uncommon insn (with random immediate values), in a literal pool, that is
>>>>> not a useful ROP/JOP sequence (very uncommon)
>>>>> 2a. A uncommon TRAP hopefully with some immediate data encoded (maybe uncommon)
>>>>> 2b. A NOP to avoid affecting speculative execution (maybe uncommon)
>>>>>
>>>>> With 2a/2b being roughly equivalent depending on speculative execution policy.
>
> Yes, though "in a literal pool" is something that is not required, since
> users might not want literal pools and so we shouldn't require that
> feature (it also pollutes D-cache).
>
> Keep in mind the insn will never execute.
>
> If a trap insn calls out the nature of the signature more clearly then
> use that instead.
So based on the recent discussions, there are a few things we can conclude:
- Choosing a random value and relying on literal pools is a bad idea, because some
compilation environments disable them entirely,
- We ideally want the signature to be a valid instruction in the instruction set
so disassembler/emulator tools don't get confused and we don't hurt speculative
execution.
- Best option is a trap with an unlikely immediate opcode, because it traps on the
instruction in case the program try to execute it by mistake,
- Second best would be a no-op with an unlikely immediate opcode,
- We may want to stay away from privileged instructions because they can confuse
emulators with may try to emulate them,
- Some architectures have big endian/little endian variants. We may need to carefully
#ifdef each case so the numeric value matches actual instructions,
- Some architectures have extensions to their instruction set (e.g. ARM thumb, power
VLE) which can be combined with the basic instruction set within the same program.
We need to decide whether we care what those signatures look like in those
instruction set extensions or not. Is it a best effort to match real instructions
or a hard requirement ? If it's a hard requirement, we may need to extend the rseq
system call with new flags to accept more than one signature.
Thoughts ?
Thanks,
Mathieu
--
Mathieu Desnoyers
EfficiOS Inc.
http://www.efficios.com
Powered by blists - more mailing lists