[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CALCETrWaVFPkP=aP+KW_sQQ+6M0gOtVbgg2awJqck93PFJHJsw@mail.gmail.com>
Date: Fri, 12 Aug 2016 13:18:24 -0700
From: Andy Lutomirski <luto@...capital.net>
To: Waiman Long <waiman.long@....com>
Cc: Thomas Gleixner <tglx@...utronix.de>,
John Stultz <john.stultz@...aro.org>,
Ingo Molnar <mingo@...hat.com>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
Borislav Petkov <bp@...e.de>,
Jiang Liu <jiang.liu@...ux.intel.com>,
Randy Wright <rwright@....com>,
Scott J Norton <scott.norton@....com>,
Douglas Hatch <doug.hatch@....com>,
Dave Hansen <dave.hansen@...el.com>,
Prarit Bhargava <prarit@...hat.com>, X86 ML <x86@...nel.org>,
"H. Peter Anvin" <hpa@...or.com>
Subject: Re: [RESEND PATCH v4] x86/hpet: Reduce HPET counter read contention
On Aug 12, 2016 9:31 PM, "Waiman Long" <waiman.long@....com> wrote:
>
> On 08/12/2016 01:16 PM, Dave Hansen wrote:
>>
>> On 08/12/2016 10:01 AM, Waiman Long wrote:
>>>
>>> The reason for using a special lock is that I want both sequence number
>>> update and locking to be done together atomically. They can be made
>>> separate as is in the seqlock. However, that will make the code more
>>> complex to make sure that all the threads see a consistent set of lock
>>> state and sequence number.
>>
>> Why do we need a sequence number? The "cached" HPET itself could be used.
>>
>> I'm thinking something like below could use a spinlock instead of the
>> doing a custom cmpxchg sequence. The spin_is_locked() should allow the
>> contended "readers" to avoid using atomics.
>>
>> spinlock_t hpet_lock;
>> u32 hpet_value;
>> ...
>> {
>> u32 old_hpet = READ_ONCE(hpet_value);
>> u32 new_hpet;
>>
>> // need to ensure that the spin_is_locked() is ordered after
>> // the READ_ONCE().
>> smp_rmb();
>> // spin_is_locked() doesn't do atomics
>> if (!spin_is_locked(&hpet_lock)&& spin_trylock(&hpet_lock)) {
>>
>> WRITE_ONCE(hpet_value, real_read_hpet());
>> spin_unlock(&hpet_lock);
>> return hpet_value;
>> }
>> // Contended case. We spin here waiting for the guy who holds
>> // the lock to write a new value to 'hpet_value'.
>> //
>> // We know that our old_hpet is older than our check for the
>> // spinlock being locked. So, someone must either have already
>> // updated it or be updating it.
>> do {
>> cpu_relax();
>> // We do not do a rmb() here. We don't need a guarantee
>> // that this read is up-to-date, just that it will
>> // _eventually_ see an up-to-date value.
>> new_hpet = READ_ONCE(hpet_value);
>> } while (old_hpet == new_hpet);
>> return new_hpet;
>> }
>
>
> Yes, I think that work too. I will update my patch accordingly. Thanks for the input.
Why is Dave more convincing than I was a couple months ago when I
asked a similar question? :)
I don't think this is right. If the HPET ever returns the same value
twice in a row (unlikely because it's generally too slow to read, but
it's plausible that someone will make a fast HPET some day), then this
could deadlock.
Also, does this code need to be NMI-safe? This implementation is
deadlocky if it's called from an NMI.
The original code was wait-free, right? That was a nice property, too.
--Andy
Powered by blists - more mailing lists