[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20151215181629.GA19007@redhat.com>
Date: Tue, 15 Dec 2015 19:16:29 +0100
From: Oleg Nesterov <oleg@...hat.com>
To: Peter Zijlstra <peterz@...radead.org>
Cc: Paul Turner <pjt@...gle.com>, NeilBrown <nfbrown@...ell.com>,
Linus Torvalds <torvalds@...ux-foundation.org>,
Thomas Gleixner <tglx@...utronix.de>,
LKML <linux-kernel@...r.kernel.org>,
Mike Galbraith <efault@....de>, Ingo Molnar <mingo@...nel.org>,
Peter Anvin <hpa@...or.com>, vladimir.murzin@....com,
linux-tip-commits@...r.kernel.org, jstancek@...hat.com
Subject: Re: [tip:locking/core] sched/wait: Fix signal handling in bit wait
helpers
On 12/11, Peter Zijlstra wrote:
>
> On Fri, Dec 11, 2015 at 03:30:33AM -0800, Paul Turner wrote:
>
> > > Blergh, all I've managed to far is to confuse myself further. Even
> > > something like the original (+- the EINTR) should work when we consider
> > > the looping, even when mixed with an occasional spurious wakeup.
> > >
> > >
> > > int bit_wait()
> > > {
> > > if (signal_pending_state(current->state, current))
> > > return -EINTR;
> > > schedule();
> > > }
>
> So I asked Vladimir to test that (simply changing the return from 1 to
> -EINTR) and it made his fail much less likely but it still failed in the
> same way.
>
> So I'm fairly sure I'm still missing something :/
Same here...
Yes, "return 1" in bit_wait_io() doesn't look right. For example
do_generic_file_read() can wrongly return if lock_page_killable() returns
this error code. But I fail to understand how this can read to rcu-stall.
Oleg.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists