lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <Pine.LNX.4.64.0806241029120.23052@engineering.redhat.com>
Date:	Tue, 24 Jun 2008 10:36:27 -0400 (EDT)
From:	Mikulas Patocka <mpatocka@...hat.com>
To:	Neil Brown <neilb@...e.de>
cc:	linux-kernel@...r.kernel.org, axboe@...nel.dk
Subject: Re: [PATCH 1/2] Avoid bio_endio recursion



On Tue, 24 Jun 2008, Neil Brown wrote:

> On Tuesday June 24, mpatocka@...hat.com wrote:
>> Hi
>>
>> bio_endio calls bi_end_io callback. In case of stacked devices (raid, dm),
>> bio_end_io may call bio_endio again, up to an unspecified length.
>>
>> The crash because of stack overflow was really observed on sparc64. And
>> this recursion was one of the contributing factors (using 9 stack frames
>> --- that is 1728 bytes).
>>
>> This patch removes the recursion.
>
> This is very cool, thanks!  A close mirror of the recursion
> avoidance in generic_make_request.
>
> You use a per-cpu queue were generic_make_request uses a per-task
> queue.  This is fitting as bi_end_io doesn't have a process context,
> but is supposed to be fast and able to run with interrupts disabled,
> so tying to a cpu is no problem.

Yes. I think "current" variable can't be used in irq context, it would 
blow with irq-stacks (or access some weird unknown memory).

I had another version of the patch that doesn't disable interrupts and 
only disables preempt and uses local_t atomic cpu-local variables. It is 
somehow more tricky, because interrupt can be triggered any time while 
processing the queue and it can add anything to the queue. Then I realized 
that bio_endio runs most time with disabled interrupts anyway, so it'd be 
better to just disable interrupts and don't do that local_cmpxchg 
trickery.

Mikulas

>> +
>> +	bio_queue = NULL;
>> +queue_empty_next_bio:
>> +	*bio_end_queue_ptr = &bio_queue;
>> +next_bio:
>> +
>>   	if (error)
>>   		clear_bit(BIO_UPTODATE, &bio->bi_flags);
>>   	else if (!test_bit(BIO_UPTODATE, &bio->bi_flags))
>> @@ -1175,6 +1196,17 @@
>>
>>   	if (bio->bi_end_io)
>>   		bio->bi_end_io(bio, error);
>> +
>> +	if (bio_queue) {
>> +		bio = bio_queue;
>> +		bio_queue = bio->bi_next;
>> +		if (!bio_queue) goto queue_empty_next_bio;
>> +		goto next_bio;
>
> checkpatch.pl doesn't like that:
> ERROR: trailing statements should be on next line
>
> and I don't either.  I would not bother with the mini-optimisation at
> all.
> Discard the queue_empty_next_bio label and replace the "if () goto"
> with
>       if (!bio_queue)
> 		*bio_end_queue_ptr = &bio_queue;
>
> and leave gcc to optimise the assignment if it wants to.
>
> Reviewed-by: NeilBrown <neilb@...e.de>
>
> Thanks,
> NeilBrown
>
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ