[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <4DF2099B.7030600@jp.fujitsu.com>
Date: Fri, 10 Jun 2011 21:10:03 +0900
From: KOSAKI Motohiro <kosaki.motohiro@...fujitsu.com>
To: peterz@...radead.org
CC: eric.dumazet@...il.com, david@...advisors.com,
linux-kernel@...r.kernel.org, sbohrer@...advisors.com,
zvonler@...advisors.com, hughd@...gle.com, tglx@...utronix.de,
dvhart@...ux.intel.com, mingo@...e.hu
Subject: Re: Change in functionality of futex() system call.
>> Urgh,. maybe something like the below but with more conditionals that
>> enable the extra logic only for FUTEX_WAIT..
>>
>> The idea is to try a RO gup() when the RW gup() fails so as not to slow
>> down the common path of writable anonymous maps and bail when we used
>> the RO path on anonymous memory.
>>
>> ---
>> diff --git a/kernel/futex.c b/kernel/futex.c
>> index fe28dc2..11f2ad1 100644
>> --- a/kernel/futex.c
>> +++ b/kernel/futex.c
>> @@ -234,7 +234,7 @@ get_futex_key(u32 __user *uaddr, int fshared, union futex_key *key)
>> unsigned long address = (unsigned long)uaddr;
>> struct mm_struct *mm = current->mm;
>> struct page *page, *page_head;
>> - int err;
>> + int err, ro = 0;
>>
>> /*
>> * The futex address must be "naturally" aligned.
>> @@ -262,6 +262,10 @@ get_futex_key(u32 __user *uaddr, int fshared, union futex_key *key)
>>
>> again:
>> err = get_user_pages_fast(address, 1, 1, &page);
>> + if (err == -EFAULT) {
>> + err = get_user_pages_fast(address, 1, 0, &page);
>> + ro = 1;
>> + }
>> if (err < 0)
>> return err;
>>
>> @@ -316,6 +320,11 @@ again:
>> * the object not the particular process.
>> */
>> if (PageAnon(page_head)) {
>> + if (ro) {
>> + err = -EFAULT;
>> + goto out;
>> + }
>> +
>> key->both.offset |= FUT_OFF_MMSHARED; /* ref taken on mm */
>> key->private.mm = mm;
>> key->private.address = address;
>> @@ -327,9 +336,10 @@ again:
>>
>> get_futex_key_refs(key);
>>
Need err=0 here. (note: get_user_pages_fast() return 1) Other than that looks
good to me and this patch passed my test.
Reviewed-and-tested-by: KOSAKI Motohiro <kosaki.motohiro@...fujitsu.com>
>> +out:
>> unlock_page(page_head);
>> put_page(page_head);
>> - return 0;
>> + return err;
>> }
>>
>> static inline void put_futex_key(union futex_key *key)
>>
>
> Hmm, wouldn't that still be susceptible to the zero-page thing if: we
> create a writable private file map of a sparse file, touch a page and
> then remap the thing RO?
After while thinking, I've conclude this is ok. Because 1) as Andrew and
Kyle described, RO mapping usage is not so sane. We need to care it for
only compatibility. 2) David Oliver's case is real compatibility issue.
but I doubt such mprotect() vs futex() race is happen on real world.
3) Anyway, overkill compatibility care might make code slower perhaps.
Off topic: current futex documentations are near terribly unclear and
many futex op are completely undocumented. They are one of root cause
that every change can make compatibility issue. (;_;
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists