lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <a8e9dff20e63f135b512ea3235669120@she-devel.com>
Date: Sat, 24 May 2025 15:09:16 -0700
From: Alison Chaiken <alison@...-devel.com>
To: Sebastian Andrzej Siewior <bigeasy@...utronix.de>
Cc: corbet@....net, gratian.crisan@...com, triegel@...hat.com,
 rostedt@...dmis.org, linux-kernel@...r.kernel.org,
 linux-rt-users@...r.kernel.org, achaiken@...ora.tech
Subject: Re: [PATCH] Documentation: locking: update libc support status of PI
 futexes

On 2025-05-23 08:00, Sebastian Andrzej Siewior wrote:
> On 2025-01-11 10:55:55 [-0800], Alison Chaiken wrote:
>> > Are you sure? My memory is that glibc avoided using the internal mutex.
>> > The old problem should be gone and pthread_cond_signal() and
>> > pthread_cond_wait() should work.
>> 
>> Ignoring support for 64-bit time, the last substantive change to
>> pthread_cond_wait() and pthread_cond_signal() was Torvald Riegel's  
>> commit
>> ed19993b5b0d05d62cc883571519a67dae481a14 "New condvar implementation 
>> that
>> provides stronger ordering guarantees," which fixed problems with 
>> waking of
>> ineligible futex waiters and with ABA issues concerning the futex 
>> word.
>> What the patch does not do is made clear by the commit message:
>> 
>>      This condvar doesn't yet use a requeue optimization (ie, on a
>> broadcast,
>>      waking just one thread and requeueing all others on the futex of 
>> the
>>      mutex supplied by the program).
>> 
>> What futex-requeue-pi.rst directs is
>> 
>>      In order to support PI-aware pthread_condvar's, the kernel needs 
>> to
>>      be able to requeue tasks to PI futexes.
>> 
>> Riegel and Darren Hart discussed Riegel's patch in at length at the 
>> 2016 RT
>> Summit:
>> 
>> https://wiki.linuxfoundation.org/realtime/events/rt-summit2016/schedule
>> 
>> The related glibc bug report by Darren may be found at
>> 
>>     https://sourceware.org/bugzilla/show_bug.cgi?id=11588
>> 
>> The last comment on the bug from 2017 is by Riegel:
>> 
>>     So far, there is no known solution for how to achieve PI support 
>> given
>> the current constraints we have (eg, available futex operations, POSIX
>> requirements, ...).
>> 
>> I ran the bug reproducer posted by Darren in Qemu and found that it 
>> did not
>> fail.   I'm not sure if the result is valid given the peculiarities of 
>> Qemu,
>> or whether I made some other mistake.
> 
> I've been looking at this again for other reasons and looked at the
> code again…
> 
> Back then we use futex-requeue API and required both futex-object to
> have the PI bit set. This wasn't the case originally, hence the patch 
> by
> Darren which did not make it into the official libc.
> 
> With the rework by Riegel, the mutex within pthread's condvar
> implementation is gone also the usage of the requeue API. The
> pthread_cond_wait()/ pthread_cond_signal() API is back to use futex'
> wait/ wake.
> The glibc comments write something about important ordering constrains.
> The futex wait enqueues the waiter according to its priority. So the
> task with highest priority gets always a front seat. The futex wake
> function wakes always the first waiter in the queue.
> 
> With all this I would say that the glib'c condvar implementation does
> not have any issues since the rework.
> There were a few loops, with a 0 retry counter (basically dead) and 
> they
> have been removed.
> 
> Sebastian

Thanks, Sebastian, for looking into this question.

Torvald Riegel's last patch to pthread_cond_wait.c:

$ git log -n 1 --author=riegel -- pthread_cond_wait.c
commit ed19993b5b0d05d62cc883571519a67dae481a14
Author: Torvald Riegel <triegel@...hat.com>
Date:   Wed May 25 23:43:36 2016 +0200
     New condvar implementation that provides stronger ordering 
guarantees.

Speaking of ordering, the 2016 Linux Realtime Summit happened after, on 
11 October.   Torvald and Darren co-presented a talk about condition 
variables:

https://wiki.linuxfoundation.org/realtime/events/rt-summit2016/pthread-condvars

Torvald in his half of the talk discusses the POSIX requirement which 
necessitated a change to condvars and his redesign.  In the video at 
30:50,

---

Zijlstra:  Even for FIFO, in the previous slides, S2 will only wake W2, 
because W3 was not yet
eligible, but W3 might be the highest-priority waiter.  Strictly 
speaking, W3 was eligible at S2.
[p. 9, W3 was in G2, not G1, but "happened before" S2].    At S2, the 
only possible wakeup was W2,
even thought W3 might be the highest-priority waiter.

Hart: Correct.  Not in this scheme.

Zijstra: Sequence-wise, it's correct,

Hart: it's mathematically correct.

Zijlstra: But it's not the one we want to wake according to PI rules.

Hart: Yep.

Zijlstra: This scheme does not permit us doing so.

Hart: Noted.

---

Darren and Torvald agree that glibc cannot make pthread condvars 
PI-aware without breaking ABI.   Am I missing something?

Thanks,
Alison

---
Alison Chaiken                   alison@...-devel.com
https://she-devel.com
"What respite from her thrilling toil did Beauty ever take — But Work 
might be Electric Rest To those that Magic make" -- Emily Dickinson

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ