[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20230708041049.2tdw6fnnjtqwuqlw@moria.home.lan>
Date: Sat, 8 Jul 2023 00:10:49 -0400
From: Kent Overstreet <kent.overstreet@...ux.dev>
To: Matthew Wilcox <willy@...radead.org>
Cc: James Bottomley <James.Bottomley@...senpartnership.com>,
Christian Brauner <brauner@...nel.org>,
"Darrick J. Wong" <djwong@...nel.org>,
Josef Bacik <josef@...icpanda.com>,
torvalds@...ux-foundation.org, linux-kernel@...r.kernel.org,
linux-fsdevel@...r.kernel.org, linux-bcachefs@...r.kernel.org,
dchinner@...hat.com, sandeen@...hat.com, tytso@....edu,
bfoster@...hat.com, jack@...e.cz, andreas.gruenbacher@...il.com,
peterz@...radead.org, akpm@...ux-foundation.org,
dhowells@...hat.com
Subject: Re: [GIT PULL] bcachefs
On Sat, Jul 08, 2023 at 04:54:22AM +0100, Matthew Wilcox wrote:
> One thing I particularly like about btrfs
:)
> compared to ntfs3 is that it doesn't use old legacy code like the buffer
> heads, which means that it doesn't add to the technical debt. From the
> page cache point of view, it's fairly clean. I wish it used iomap, but
> iomap would need quite a lot of new features to accommodate everything
> bcachefs wants to do. Maybe iomap will grow those features over time.
My big complaint with iomap is that it's still the old callback based
approach - an indirect function call into the filesystem to get a
mapping, then Doing Stuff, for every walk.
Instead of calling back and forth, we could be filling out a data
structure to represent the IO, then handing it off to the filesystem to
look up the mappings and send to the right place, splitting as needed.
Best part is, we already have such a data structure: struct bio. That's
the approach bcachefs takes.
It would be nice sharing the page cache management code, but like you
mentioned, iomap would have to grow a bunch of features. But, some of
those features other users might like: in particular bcachefs hangs disk
reservations and dirty sector (for i_blocks accounting) off the
pagecache, which to me is a total no brainer, it eliminates looking up
in a second data structure for e.g. the buffered write path.
Also worth noting - bcachefs has had large folio support for awhile now :)
Powered by blists - more mailing lists