lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20250221161439.0e34fba9@eugeo>
Date: Fri, 21 Feb 2025 16:14:39 +0000
From: Gary Guo <gary@...yguo.net>
To: Tamir Duberstein <tamird@...il.com>
Cc: Miguel Ojeda <ojeda@...nel.org>, Alex Gaynor <alex.gaynor@...il.com>,
 Boqun Feng <boqun.feng@...il.com>, Björn Roy Baron
 <bjorn3_gh@...tonmail.com>, Benno Lossin <benno.lossin@...ton.me>, Andreas
 Hindborg <a.hindborg@...nel.org>, Alice Ryhl <aliceryhl@...gle.com>, Trevor
 Gross <tmgross@...ch.edu>, Danilo Krummrich <dakr@...nel.org>, Wedson
 Almeida Filho <walmeida@...rosoft.com>, Alex Mantel
 <alexmantel93@...lbox.org>, Will Deacon <will@...nel.org>, Peter Zijlstra
 <peterz@...radead.org>, Mark Rutland <mark.rutland@....com>,
 rust-for-linux@...r.kernel.org, linux-kernel@...r.kernel.org
Subject: Re: [PATCH v3 2/4] rust: convert `Arc` to use `Refcount`

On Wed, 19 Feb 2025 17:12:10 -0500
Tamir Duberstein <tamird@...il.com> wrote:

> On Wed, Feb 19, 2025 at 3:17 PM Gary Guo <gary@...yguo.net> wrote:
> >
> > With `Refcount` type created, `Arc` can use `Refcount` instead of
> > calling into FFI directly.
> >
> > Signed-off-by: Gary Guo <gary@...yguo.net>
> > ---
> >  rust/kernel/sync/arc.rs | 65 +++++++++++++++++------------------------
> >  1 file changed, 26 insertions(+), 39 deletions(-)
> >
> > diff --git a/rust/kernel/sync/arc.rs b/rust/kernel/sync/arc.rs
> > index 3cefda7a4372..1f5fbc6b3742 100644
> > --- a/rust/kernel/sync/arc.rs
> > +++ b/rust/kernel/sync/arc.rs
> > @@ -8,7 +8,7 @@
> >  //! threads.
> >  //!
> >  //! It is different from the standard library's [`Arc`] in a few ways:
> > -//! 1. It is backed by the kernel's `refcount_t` type.
> > +//! 1. It is backed by the kernel's [`Refcount`] type.
> >  //! 2. It does not support weak references, which allows it to be half the size.
> >  //! 3. It saturates the reference count instead of aborting when it goes over a threshold.
> >  //! 4. It does not provide a `get_mut` method, so the ref counted object is pinned.
> > @@ -18,10 +18,10 @@
> >
> >  use crate::{
> >      alloc::{AllocError, Flags, KBox},
> > -    bindings,
> >      init::{self, InPlaceInit, Init, PinInit},
> > +    sync::Refcount,
> >      try_init,
> > -    types::{ForeignOwnable, Opaque},
> > +    types::ForeignOwnable,
> >  };
> >  use core::{
> >      alloc::Layout,
> > @@ -143,7 +143,7 @@ pub struct Arc<T: ?Sized> {
> >  #[pin_data]
> >  #[repr(C)]
> >  struct ArcInner<T: ?Sized> {
> > -    refcount: Opaque<bindings::refcount_t>,
> > +    refcount: Refcount,
> >      data: T,
> >  }
> >
> > @@ -155,7 +155,7 @@ impl<T: ?Sized> ArcInner<T> {
> >      /// `ptr` must have been returned by a previous call to [`Arc::into_raw`], and the `Arc` must
> >      /// not yet have been destroyed.
> >      unsafe fn container_of(ptr: *const T) -> NonNull<ArcInner<T>> {
> > -        let refcount_layout = Layout::new::<bindings::refcount_t>();
> > +        let refcount_layout = Layout::new::<Refcount>();
> >          // SAFETY: The caller guarantees that the pointer is valid.
> >          let val_layout = Layout::for_value(unsafe { &*ptr });
> >          // SAFETY: We're computing the layout of a real struct that existed when compiling this
> > @@ -207,8 +207,7 @@ impl<T> Arc<T> {
> >      pub fn new(contents: T, flags: Flags) -> Result<Self, AllocError> {
> >          // INVARIANT: The refcount is initialised to a non-zero value.
> >          let value = ArcInner {
> > -            // SAFETY: There are no safety requirements for this FFI call.
> > -            refcount: Opaque::new(unsafe { bindings::REFCOUNT_INIT(1) }),
> > +            refcount: Refcount::new(1),
> >              data: contents,
> >          };
> >
> > @@ -290,7 +289,7 @@ pub fn ptr_eq(this: &Self, other: &Self) -> bool {
> >      /// use kernel::sync::{Arc, UniqueArc};
> >      ///
> >      /// let arc = Arc::new(42, GFP_KERNEL)?;
> > -    /// let unique_arc = arc.into_unique_or_drop();
> > +    /// let unique_arc = Arc::into_unique_or_drop(arc);
> >      ///
> >      /// // The above conversion should succeed since refcount of `arc` is 1.
> >      /// assert!(unique_arc.is_some());
> > @@ -306,35 +305,30 @@ pub fn ptr_eq(this: &Self, other: &Self) -> bool {
> >      /// let arc = Arc::new(42, GFP_KERNEL)?;
> >      /// let another = arc.clone();
> >      ///
> > -    /// let unique_arc = arc.into_unique_or_drop();
> > +    /// let unique_arc = Arc::into_unique_or_drop(arc);
> >      ///
> >      /// // The above conversion should fail since refcount of `arc` is >1.
> >      /// assert!(unique_arc.is_none());
> >      ///
> >      /// # Ok::<(), Error>(())
> >      /// ```
> > -    pub fn into_unique_or_drop(self) -> Option<Pin<UniqueArc<T>>> {
> > +    pub fn into_unique_or_drop(this: Self) -> Option<Pin<UniqueArc<T>>> {  
> 
> Why did this signature need to change?

I think I mentioned this in a earlier series. Smart pointers are not
supposed to have methods (i.e. with a self receiver) as it may shadow
deref'ed functions.

> 
> >          // We will manually manage the refcount in this method, so we disable the destructor.
> > -        let me = ManuallyDrop::new(self);
> > +        let this = ManuallyDrop::new(this);
> >          // SAFETY: We own a refcount, so the pointer is still valid.
> > -        let refcount = unsafe { me.ptr.as_ref() }.refcount.get();
> > +        let refcount = unsafe { &this.ptr.as_ref().refcount };
> >
> >          // If the refcount reaches a non-zero value, then we have destroyed this `Arc` and will
> >          // return without further touching the `Arc`. If the refcount reaches zero, then there are
> >          // no other arcs, and we can create a `UniqueArc`.
> > -        //
> > -        // SAFETY: We own a refcount, so the pointer is not dangling.
> > -        let is_zero = unsafe { bindings::refcount_dec_and_test(refcount) };
> > -        if is_zero {
> > -            // SAFETY: We have exclusive access to the arc, so we can perform unsynchronized
> > -            // accesses to the refcount.
> > -            unsafe { core::ptr::write(refcount, bindings::REFCOUNT_INIT(1)) };
> > +        if refcount.dec_and_test() {
> > +            refcount.set(1);  
> 
> We could retain the unsynchronized operation here by taking a mutable
> reference above and writing through it. Right? Could we remove `set`
> from the abstraction in the previous patch?

This was suggested as well in a previous series but I don't think it's
a good idea. Creating a mutable reference and using unsynchronized
write requires `unsafe`. `set` doesn't.

Note that the `set` here is relaxed order. I doubt (if things are
inlined properly) there'll be any codegen difference with a completely
unsynchronized write.

Not having an additional unsafe is a good trade-off to me.

> 
> >
> > -            // INVARIANT: We own the only refcount to this arc, so we may create a `UniqueArc`. We
> > -            // must pin the `UniqueArc` because the values was previously in an `Arc`, and they pin
> > -            // their values.
> > +            // INVARIANT: If the refcount failed to decrement because it is 1, then we have the
> > +            // exclusive ownership, so we may create a `UniqueArc`. We must pin the `UniqueArc`
> > +            // because the values was previously in an `Arc`, and they pin their values.  
> 
> Pre-existing typo you're taking ownership of: "the values" should be
> "the value". But why touch this comment at all?

I think this is a left-over that I forget to undo.

> 
> >              Some(Pin::from(UniqueArc {
> > -                inner: ManuallyDrop::into_inner(me),
> > +                inner: ManuallyDrop::into_inner(this),
> >              }))
> >          } else {
> >              None
> > @@ -396,14 +390,10 @@ fn as_ref(&self) -> &T {
> >
> >  impl<T: ?Sized> Clone for Arc<T> {
> >      fn clone(&self) -> Self {
> > -        // SAFETY: By the type invariant, there is necessarily a reference to the object, so it is
> > -        // safe to dereference it.
> > -        let refcount = unsafe { self.ptr.as_ref() }.refcount.get();
> > -
> > -        // INVARIANT: C `refcount_inc` saturates the refcount, so it cannot overflow to zero.
> > +        // INVARIANT: `Refcount` saturates the refcount, so it cannot overflow to zero.
> >          // SAFETY: By the type invariant, there is necessarily a reference to the object, so it is
> >          // safe to increment the refcount.
> > -        unsafe { bindings::refcount_inc(refcount) };
> > +        unsafe { self.ptr.as_ref().refcount.inc() };
> >
> >          // SAFETY: We just incremented the refcount. This increment is now owned by the new `Arc`.
> >          unsafe { Self::from_inner(self.ptr) }
> > @@ -412,16 +402,14 @@ fn clone(&self) -> Self {
> >
> >  impl<T: ?Sized> Drop for Arc<T> {
> >      fn drop(&mut self) {
> > -        // SAFETY: By the type invariant, there is necessarily a reference to the object. We cannot
> > -        // touch `refcount` after it's decremented to a non-zero value because another thread/CPU
> > -        // may concurrently decrement it to zero and free it. It is ok to have a raw pointer to
> > -        // freed/invalid memory as long as it is never dereferenced.
> > -        let refcount = unsafe { self.ptr.as_ref() }.refcount.get();
> > -
> >          // INVARIANT: If the refcount reaches zero, there are no other instances of `Arc`, and
> >          // this instance is being dropped, so the broken invariant is not observable.
> > -        // SAFETY: Also by the type invariant, we are allowed to decrement the refcount.
> > -        let is_zero = unsafe { bindings::refcount_dec_and_test(refcount) };
> > +        // SAFETY: By the type invariant, there is necessarily a reference to the object.
> > +        // NOTE: we cannot touch `refcount` after it's decremented to a non-zero value because
> > +        // another thread/CPU may concurrently decrement it to zero and free it. However it is okay
> > +        // to have a transient reference to decrement the refcount, see
> > +        // https://github.com/rust-lang/rust/issues/55005.
> > +        let is_zero = unsafe { self.ptr.as_ref().refcount.dec_and_test() };  
> 
> How come this careful handling is not required in into_unique_or_drop?
> At least, the SAFETY comment there is much more mundane.

Because `into_unique_or_drop` doesn't actually remove the allocation
(it only decrements refcount for non-zero or turn it into `UniqueArc`).

> 
> >          if is_zero {
> >              // The count reached zero, we must free the memory.
> >              //
> > @@ -673,8 +661,7 @@ pub fn new_uninit(flags: Flags) -> Result<UniqueArc<MaybeUninit<T>>, AllocError>
> >          // INVARIANT: The refcount is initialised to a non-zero value.
> >          let inner = KBox::try_init::<AllocError>(
> >              try_init!(ArcInner {
> > -                // SAFETY: There are no safety requirements for this FFI call.
> > -                refcount: Opaque::new(unsafe { bindings::REFCOUNT_INIT(1) }),
> > +                refcount: Refcount::new(1),
> >                  data <- init::uninit::<T, AllocError>(),
> >              }? AllocError),
> >              flags,
> > --
> > 2.47.2
> >  


Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ