[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-Id: <20240709130343.858363-1-dongsheng.yang@linux.dev>
Date: Tue, 9 Jul 2024 13:03:36 +0000
From: Dongsheng Yang <dongsheng.yang@...ux.dev>
To: axboe@...nel.dk,
dan.j.williams@...el.com,
gregory.price@...verge.com,
John@...ves.net,
Jonathan.Cameron@...wei.com,
bbhushan2@...vell.com,
chaitanyak@...dia.com,
rdunlap@...radead.org
Cc: linux-block@...r.kernel.org,
linux-kernel@...r.kernel.org,
linux-cxl@...r.kernel.org,
Dongsheng Yang <dongsheng.yang@...ux.dev>
Subject: [PATCH v1 0/7] Introduce CBD (CXL Block Device)
Hi all,
This is V1 for CXL Block Device. This patchset is based on v6.9 and
it's available at: https://github.com/DataTravelGuide/linux branch cbd.
changes from RFC: (https://lore.kernel.org/lkml/20240422071606.52637-1-dongsheng.yang@easystack.cn/)
(1) only support hardware-consistency cxl shared memory.
As discussed in the RFC, the current cbd only supports
hardware-consistency for CXL shared memory, and some code related to
software-consistency support has been removed from the RFC. In the
current tests, whether using local PMEM or QEMU-simulated shared memory
devices, they all are hardware-consistency.
(2) add a segment abstraction for transport data space management.
The layout of the transport remains essentially
unchanged, with the only difference being the addition of a segment
abstraction for scalability purposes. A channel is a type of segment
used for data transfer between the blkdev and the backend. In the
future, there will be more segment types, such as a cache segment for
caching data for the blkdev.
(3) add CONFIG_CBD_CRC option in Kconfig
We only support hardware-consistency, so theoretically,
there should be no data consistency issues when transferring data
between blkdev and the backend. However, cbd provides a verification
mechanism, offering CRC checks for both metadata and data to verify
after data reception. This method impacts performance, so it is an
option in Kconfig.
(4) allow user to clear dead object in transport metadata
When a host using cbd, whether backend or blkdev, dies
without unregistering, the metadata in the transport will retain some
dead information. In v1, users are allowed to clear this dead metadata
via sysfs. Of course, there is a heartbeat mechanism to ensure users do
not mistakenly delete alive metadata.
(5) allow user to force stop blkdev and reattach backend
This also handles scenarios where the host goes offline
unexpectedly. When the backend goes offline unexpectedly, the
corresponding blkdev might have I/O operations that cannot finish. In
such cases, cbd provides two ways to handle this:
a) If the backend can recover, we can re-add the backend to the
corresponding transport, allowing the blkdev's I/O operations to continue being processed.
b) If the backend cannot recover, the blkdev can be force-stopped, and
the incomplete I/O operations will return EIO, but they will no longer remain blocked.
(6) dont allocate new pages in hander for bio data.
The backend handler does not allocate pages for bio.
Instead, the handler can directly map the data pages from the transport
to the bio, and then send the bio to the backend disk, achieving zero
copy on the backend side.
(7) new test project cbd-tests:
cbd-tests (https://github.com/DataTravelGuide/cbd-tests), for testing cbd. It is
an automated testing project based on the Avocado testing framework. Currently,
it includes xfstests on cbd block devices with XFS, V1 Passed all 944 tests in xfstests
(https://datatravelguide.github.io/dtg-blog/cbd/test-results/test_result_v1/test-results/xfstests-1-xfstests.py_Xfstests.test_run-cbdd_timeout-no_timeout-disk_type-fs_type-fs_xfs-f090/debug.log). as well as fio performance testing directly on /dev/cbdX block devices.
The test results can be viewed here in [test results]:
https://datatravelguide.github.io/dtg-blog/cbd/cbd.html#test-results
Thanx
Dongsheng Yang (7):
cbd: introduce cbd_transport
cbd: introduce cbd_host
cbd: introduce cbd_segment
cbd: introduce cbd_channel
cbd: introduce cbd_blkdev
cbd: introduce cbd_backend
block: Init for CBD(CXL Block Device) module
drivers/block/Kconfig | 2 +
drivers/block/Makefile | 2 +
drivers/block/cbd/Kconfig | 23 +
drivers/block/cbd/Makefile | 3 +
drivers/block/cbd/cbd_backend.c | 296 ++++++++++
drivers/block/cbd/cbd_blkdev.c | 417 ++++++++++++++
drivers/block/cbd/cbd_channel.c | 153 ++++++
drivers/block/cbd/cbd_handler.c | 263 +++++++++
drivers/block/cbd/cbd_host.c | 128 +++++
drivers/block/cbd/cbd_internal.h | 848 ++++++++++++++++++++++++++++
drivers/block/cbd/cbd_main.c | 224 ++++++++
drivers/block/cbd/cbd_queue.c | 526 ++++++++++++++++++
drivers/block/cbd/cbd_segment.c | 108 ++++
drivers/block/cbd/cbd_transport.c | 883 ++++++++++++++++++++++++++++++
14 files changed, 3876 insertions(+)
create mode 100644 drivers/block/cbd/Kconfig
create mode 100644 drivers/block/cbd/Makefile
create mode 100644 drivers/block/cbd/cbd_backend.c
create mode 100644 drivers/block/cbd/cbd_blkdev.c
create mode 100644 drivers/block/cbd/cbd_channel.c
create mode 100644 drivers/block/cbd/cbd_handler.c
create mode 100644 drivers/block/cbd/cbd_host.c
create mode 100644 drivers/block/cbd/cbd_internal.h
create mode 100644 drivers/block/cbd/cbd_main.c
create mode 100644 drivers/block/cbd/cbd_queue.c
create mode 100644 drivers/block/cbd/cbd_segment.c
create mode 100644 drivers/block/cbd/cbd_transport.c
--
2.34.1
Powered by blists - more mailing lists