[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <676569856.13488.1456863792603.JavaMail.zimbra@efficios.com>
Date: Tue, 1 Mar 2016 20:23:12 +0000 (UTC)
From: Mathieu Desnoyers <mathieu.desnoyers@...icios.com>
To: Peter Zijlstra <peterz@...radead.org>,
"H. Peter Anvin" <hpa@...or.com>
Cc: Linus Torvalds <torvalds@...ux-foundation.org>,
Ben Maurer <bmaurer@...com>,
Thomas Gleixner <tglx@...utronix.de>,
Ingo Molnar <mingo@...hat.com>,
Russell King <linux@....linux.org.uk>,
linux-api <linux-api@...r.kernel.org>,
Andrew Morton <akpm@...ux-foundation.org>,
Michael Kerrisk <mtk.manpages@...il.com>,
Dave Watson <davejwatson@...com>,
rostedt <rostedt@...dmis.org>,
Andy Lutomirski <luto@...capital.net>,
Will Deacon <will.deacon@....com>,
"Paul E. McKenney" <paulmck@...ux.vnet.ibm.com>,
Chris Lameter <cl@...ux.com>, Andi Kleen <andi@...stfloor.org>,
Josh Triplett <josh@...htriplett.org>,
Paul Turner <pjt@...gle.com>,
Linux Kernel Mailing List <linux-kernel@...r.kernel.org>,
Catalin Marinas <catalin.marinas@....com>,
Andrew Hunter <ahh@...gle.com>
Subject: Re: [PATCH v4 1/5] getcpu_cache system call: cache CPU number of
running thread
----- On Feb 29, 2016, at 5:35 AM, Peter Zijlstra peterz@...radead.org wrote:
> On Sun, Feb 28, 2016 at 02:32:28PM +0000, Mathieu Desnoyers wrote:
>> The part of ABI I'm trying to express here is for discoverability
>> of available features by user-space. For instance, a kernel
>> could be configured with "CONFIG_RSEQ=n", and userspace should
>> not rely on the rseq fields of the thread-local ABI in that case.
>
> Per the just proposed interface; discoverability would end with:
>
> thread_local_abi_register(NULL, TLA_ENABLE_RSEQ, 0);
>
> failing. This would indicate your kernel does not support (or your glibc
> failed to register, depending on error code I suppose).
>
> Then your program can either fall back to full atomics or just bail.
I think it's important that user-space fast-paths can quickly
detect whether the feature is enabled without having to rely on
always reading a separate cache-line. I've put together an ABI
proposal that take into account the feedback received so far.
The main trick here is to use "-1" value in cpu_id and rseq_seqnum
to mean "the feature is inactive" so user-space can call the system
call to register the feature, and the value "-2" can be set by the
kernel when it knows the feature is not available. It does mean
that seqnum would wrap from MAX_INT to 0 in the kernel, skipping
negative values.
Please let me know if I missed anything.
#ifdef __LP64__
# define TLABI_FIELD_u32_u64(field) uint64_t field
#elif __BYTE_ORDER__ == __ORDER_LITTLE_ENDIAN__
# define TLABI_FIELD_u32_u64(field) uint32_t field, _padding ## field
#else
# define TLABI_FIELD_u32_u64(field) uint32_t _padding ## field, field
#endif
/*
* The thread-local ABI structure needs to be aligned at least on 32
* bytes multiples.
*/
#define TLABI_ALIGNMENT 32
struct thread_local_abi {
/*
* Thread-local ABI cpu_id field.
* Updated by the kernel, and read by user-space with
* single-copy atomicity semantics. Aligned on 32-bit.
* Values:
* >= 0: CPU number of running thread.
* -1 (initial value): means the cpu_id feature is inactive.
* -2: cpu_id feature is not available.
*/
int32_t cpu_id;
/*
* Thread-local ABI rseq_seqnum field.
* Updated by the kernel, and read by user-space with
* single-copy atomicity semantics. Aligned on 32-bit.
* Values:
* >= 0: current seqnum for this thread (feature is active).
* -1 (initial value): means the rseq feature is inactive.
* -2: rseq feature is not available.
*/
int32_t rseq_seqnum;
/*
* Thread-local ABI rseq_post_commit_ip field.
* Updated by user-space, and read by the kernel with
* single-copy atomicity semantics.
* Aligned on 64-bit.
*/
TLABI_FIELD_u32_u64(rseq_post_commit_ip);
/* Add new fields at the end. */
} __attribute__ ((aligned(TLABI_ALIGNMENT)));
enum thread_local_abi_feature {
TLA_FEATURE_NONE = 0,
TLA_FEATURE_CPU_ID = (1 << 0),
TLA_FEATURE_RSEQ = (1 << 1),
};
/*
* Thread local ABI system call.
*
* First call with (NULL, 0, 0), returns the size of the struct
* thread_local_abi expected by the kernel, or -1 on error.
*
* Second, allocate a memory area to hold the struct thread_local_abi,
* and call with (ptr, 0, 0). Returns 0 on success, or -1 on error.
*
* Third, enable specific features by passing a mask, e.g. call with
* (NULL, TLA_FEATURE_CPU_ID | TLA_FEATURE_RSEQ, 0).
* Returns 0 on success, -1 on error.
*
* Then the fields associated with the enabled features are managed by
* the kernel.
*/
ssize_t thread_local_abi(struct thread_local_abi *tlabi,
uint64_t feature_mask, int flags);
Thanks for your feedback!
Mathieu
--
Mathieu Desnoyers
EfficiOS Inc.
http://www.efficios.com
Powered by blists - more mailing lists