[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <b3cbc01c-e314-4df1-bf0c-b69bdcd638f7@suse.de>
Date: Mon, 23 Sep 2024 11:43:02 +0200
From: Hannes Reinecke <hare@...e.de>
To: John Garry <john.g.garry@...cle.com>, axboe@...nel.dk, hch@....de
Cc: linux-block@...r.kernel.org, linux-kernel@...r.kernel.org,
linux-raid@...r.kernel.org, martin.petersen@...cle.com
Subject: Re: [PATCH RFC 0/6] bio_split() error handling rework
On 9/23/24 09:19, John Garry wrote:
> On 23/09/2024 06:53, Hannes Reinecke wrote:
>> On 9/19/24 11:22, John Garry wrote:
>>> bio_split() error handling could be improved as follows:
>>> - Instead of returning NULL for an error - which is vague - return a
>>> PTR_ERR, which may hint what went wrong.
>>> - Remove BUG_ON() calls - which are generally not preferred - and
>>> instead
>>> WARN and pass an error code back to the caller. Many callers of
>>> bio_split() don't check the return code. As such, for an error we
>>> would
>>> be getting a crash still from an invalid pointer dereference.
>>>
>>> Most bio_split() callers don't check the return value. However, it could
>>> be argued the bio_split() calls should not fail. So far I have just
>>> fixed up the md RAID code to handle these errors, as that is my interest
>>> now.
>>>
>>> Sending as an RFC as unsure if this is the right direction.
>>>
>>> The motivator for this series was initial md RAID atomic write
>>> support in
>>> https://lore.kernel.org/linux-block/21f19b4b-4b83-4ca2-
>>> a93b-0a433741fd26@...cle.com/
>>>
>>> There I wanted to ensure that we don't split an atomic write bio, and it
>>> made more sense to handle this in bio_split() (instead of the
>>> bio_split()
>>> caller).
>>>
>>> John Garry (6):
>>> block: Rework bio_split() return value
>>> block: Error an attempt to split an atomic write in bio_split()
>>> block: Handle bio_split() errors in bio_submit_split()
>>> md/raid0: Handle bio_split() errors
>>> md/raid1: Handle bio_split() errors
>>> md/raid10: Handle bio_split() errors
>>>
>>> block/bio.c | 14 ++++++++++----
>>> block/blk-crypto-fallback.c | 2 +-
>>> block/blk-merge.c | 5 +++++
>>> drivers/md/raid0.c | 10 ++++++++++
>>> drivers/md/raid1.c | 8 ++++++++
>>> drivers/md/raid10.c | 18 ++++++++++++++++++
>>> 6 files changed, 52 insertions(+), 5 deletions(-)
>>>
>> You are missing '__bio_split_to_limits()' which looks as it would need
>> to be modified, too.
>>
>
> In __bio_split_to_limits(), for REQ_OP_DISCARD, REQ_OP_SECURE_ERASE, and
> REQ_OP_WRITE_ZEROES, we indirectly call bio_split(). And bio_split()
> might error. But functions like bio_split_discard() can return NULL for
> cases where a split is not required. So I suppose we need to check
> IS_ERR(split) for those request types mentioned. For NULL being
> returned, we would still have the __bio_split_to_limits() is "if
> (split)" check.
>
Indeed. And then you'll need to modify nvme:
diff --git a/drivers/nvme/host/multipath.c b/drivers/nvme/host/multipath.c
index f72c5a6a2d8e..c99f51e7730e 100644
--- a/drivers/nvme/host/multipath.c
+++ b/drivers/nvme/host/multipath.c
@@ -453,7 +453,7 @@ static void nvme_ns_head_submit_bio(struct bio *bio)
* pool from the original queue to allocate the bvecs from.
*/
bio = bio_split_to_limits(bio);
- if (!bio)
+ if (IS_ERR_OR_NULL(bio))
return;
srcu_idx = srcu_read_lock(&head->srcu);
Cheers,
Hannes
--
Dr. Hannes Reinecke Kernel Storage Architect
hare@...e.de +49 911 74053 688
SUSE Software Solutions GmbH, Frankenstr. 146, 90461 Nürnberg
HRB 36809 (AG Nürnberg), GF: I. Totev, A. McDonald, W. Knoblich
Powered by blists - more mailing lists