[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-Id: <20100902114227.34562723.kamezawa.hiroyu@jp.fujitsu.com>
Date: Thu, 2 Sep 2010 11:42:27 +0900
From: KAMEZAWA Hiroyuki <kamezawa.hiroyu@...fujitsu.com>
To: Vivek Goyal <vgoyal@...hat.com>
Cc: Gui Jianfeng <guijianfeng@...fujitsu.com>,
Jens Axboe <axboe@...nel.dk>, Jeff Moyer <jmoyer@...hat.com>,
Divyesh Shah <dpshah@...gle.com>,
Corrado Zoccolo <czoccolo@...il.com>,
Nauman Rafique <nauman@...gle.com>,
linux kernel mailing list <linux-kernel@...r.kernel.org>
Subject: Re: [RFC] [PATCH] cfq-iosched: add cfq group hierarchical
scheduling support
On Wed, 1 Sep 2010 22:29:36 -0400
Vivek Goyal <vgoyal@...hat.com> wrote:
> On Wed, Sep 01, 2010 at 06:02:43PM +0900, KAMEZAWA Hiroyuki wrote:
>
> [..]
> > > > One of the possible way forward could be this.
> > > >
> > > > - Treat queue and group at same level (like CFS)
> > > >
> > > > - Get rid of cfq_slice_offset() logic. That means without idling on, there
> > > > will be no ioprio difference between cfq queues. I think anyway as of
> > > > today that logic helps in so little situations that I would not mind
> > > > getting rid of it. Just that Jens should agree to it.
> > > >
> > > > - With this new scheme, it will break the existing semantics of root group
> > > > being at same level as child groups. To avoid that, we can probably
> > > > implement two modes (flat and hierarchical), something similar to what
> > > > memory cgroup controller has done. May be one tunable in root cgroup of
> > > > blkio "use_hierarchy". By default everything will be in flat mode and
> > > > if user wants hiearchical control, he needs to set user_hierarchy in
> > > > root group.
> > > >
> > > > I think memory controller provides "use_hierarchy" tunable in each
> > > > cgroup. I am not sure why do we need it in each cgroup and not just
> > > > in root cgroup.
> > >
> > > I think Kamezawa-san should be able to answer this question. :)
> > >
> >
> > At first, please be sure that "hierarchical accounting is _very_ slow".
> > Please measure how hierarchical accounting (of 4-6 levels) are slow ;)
> >
> > Then, there are 2 use cases.
> >
> > 1) root/to/some/directory/A
> > /B
> > /C
> > ....
> > All A, B, C ....are flat cgroup and has no relationship, not sharing limit.
> > In this case, hierarchy should not be enabled.
> >
> > 2) root/to/some/directory/Gold/A,B,C...
> > Silver/D,E,F
> >
> > All A, B, C ....are limited by "Gold" or "Silver".
> > But Gold and Silver has no relationthip, they has their own limitations.
> > But A, B, C, D, E, F shares limit under Gold or Silver.
> > In this case, hierarchy
> > "root/to/some/directory" should be disabled.
> > Gold/ and Silver should have use_hierarchy=1.
> >
> > (Assume these Gold and Silver as Container and the user of container
> > divides memory into A and B, C...)
> >
> > For example, libvirt makes very long "root/to/some/directory" ...
> > I never want to count-up all counters in the hierarchy even if
> > we'd like to use some fantasic hierarchical accounting under a container.
> >
> > I don't like "all or nothing" option (as making use_hierarchy as mount
> > option or has parameter on root cgroup etc..) Then, allowed mixture.
>
> Hi Kame San,
>
> If you don't want any relationship between Gold and Silver then one can
> make root as unlimited group (limit_in_bytes = -1) and practically Gold
> and Silver have no dependency. There is no need of setting use_hierarchy
> different at root level and inside Gold/ and Silver/ groups?
>
and counts up 4 levels accounting ? ;)
We allow mixuture of
/root/to/some/directory/Gold/
/Silver
/Extraone
Gold and Silver can be under some limitation, of course.
(For example, Extraone is for system-admin and not-for-user.
System admin is in another container than users.)
> It sounds like you did it for two reasons.
>
> - It can potentially provide more flexibility.
Right.
> - performance reason so that you can stop do hierarchical accounting
> all the way to root and stop before that (libvirt example).
Yes.
>
> I think for blkio controller we can probably begin with either a mount
> time option or a use_hierachy file in root group and then later make
> it per group if there are use cases.
>
I hope something flexible. Complexity to the code is not very big.
Thanks,
-Kame
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists