[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <20100415092258.9f837c12.kamezawa.hiroyu@jp.fujitsu.com>
Date: Thu, 15 Apr 2010 09:22:58 +0900
From: KAMEZAWA Hiroyuki <kamezawa.hiroyu@...fujitsu.com>
To: Greg Thelen <gthelen@...gle.com>
Cc: balbir@...ux.vnet.ibm.com, Andrea Righi <arighi@...eler.com>,
Daisuke Nishimura <nishimura@....nes.nec.co.jp>,
Vivek Goyal <vgoyal@...hat.com>,
Peter Zijlstra <peterz@...radead.org>,
Trond Myklebust <trond.myklebust@....uio.no>,
Suleiman Souhlal <suleiman@...gle.com>,
"Kirill A. Shutemov" <kirill@...temov.name>,
Andrew Morton <akpm@...ux-foundation.org>,
containers@...ts.linux-foundation.org,
linux-kernel@...r.kernel.org, linux-mm@...ck.org
Subject: Re: [PATCH -mmotm 1/5] memcg: disable irq at page cgroup lock
On Wed, 14 Apr 2010 09:22:41 -0700
Greg Thelen <gthelen@...gle.com> wrote:
> On Wed, Apr 14, 2010 at 2:29 AM, KAMEZAWA Hiroyuki
> <kamezawa.hiroyu@...fujitsu.com> wrote:
>
> >> if (irqs_disabled()) {
> >> if (! trylock_page_cgroup(pc))
> >> return;
> >> } else
> >> lock_page_cgroup(pc);
> >>
> >
> > I prefer trylock_page_cgroup() always.
>
> What is your reason for preferring trylock_page_cgroup()? I assume
> it's for code simplicity, but I wanted to check.
>
> I had though about using trylock_page_cgroup() always, but I think
> that would make file_mapped accounting even more fuzzy that it already
> it is. I was trying to retain the current accuracy of file_mapped and
> only make new counters, like writeback/dirty/etc (those obtained in
> interrupt), fuzzy.
>
file_mapped should have different interface as mem_cgroup_update_stat_verrrry_safe().
or some.
I don't think accuracy is important (if it's doesn't go minus) but if people want,
I agree to keep it accurate.
> > I have another idea fixing this up _later_. (But I want to start from simple one.)
> >
> > My rough idea is following. Similar to your idea which you gave me before.
>
> Hi Kame-san,
>
> I like the general approach. The code I previously gave you appears
> to work and is faster than non-root memcgs using mmotm due to mostly
> being lockless.
>
I hope so.
> > ==
> > DEFINE_PERCPU(account_move_ongoing);
>
> What's the reason for having a per-cpu account_move_ongoing flag?
> Would a single system-wide global be sufficient? I assume the
> majority of the time this value will not be changing because
> accounting moves are rare.
>
> Perhaps all of the per-cpu variables are packed within a per-cpu
> cacheline making accessing it more likely to be local, but I'm not
> sure if this is true.
>
Yes. this value is rarely updated but update is not enough rare to put
this value to read_mostly section. We see cacheline ping-pong by random
placement of global variables. This is performance critical.
Recent updates for percpu variables accessor makes access to percpu
very efficient. I'd like to make use of it.
Thanks,
-Kame
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists