[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <20100702145042.5298416f.kamezawa.hiroyu@jp.fujitsu.com>
Date: Fri, 2 Jul 2010 14:50:42 +0900
From: KAMEZAWA Hiroyuki <kamezawa.hiroyu@...fujitsu.com>
To: Greg KH <gregkh@...e.de>
Cc: Dave Hansen <dave@...ux.vnet.ibm.com>,
KOSAKI Motohiro <kosaki.motohiro@...fujitsu.com>,
Nathan Fontenot <nfont@...tin.ibm.com>,
Andi Kleen <andi@...stfloor.org>, linux-kernel@...r.kernel.org,
"Eric W. Biederman" <ebiederm@...ssion.com>
Subject: Re: [PATCH] memory hotplug disable boot option
On Thu, 1 Jul 2010 16:26:34 -0700
Greg KH <gregkh@...e.de> wrote:
> On Thu, Jul 01, 2010 at 09:31:30AM +0900, KAMEZAWA Hiroyuki wrote:
> > On Wed, 30 Jun 2010 08:47:55 -0700
> > Greg KH <gregkh@...e.de> wrote:
> > > > and adding a scalable interface for large scale machines ?
> > > > I'd like to consider something..
> > >
> > > Dynamically changing the layout on big memory boxes makes sense to me,
> > > how about you?
> > >
> >
> > like this ?
> > ==
> > boot option:
> > memory_sysfs_layout=compact
> > memory_sysfs_layout=auto (default)
> > memory_sysfs_layout=full
> >
> > Considering briefly, how about this compact layout ?
> >
> > /sys/devices/system/memory/:
> > list, hide, show, memoryX...
> >
> > list: // show available memory index list.
> > #cat list
> > 0 1 2 ....10000...
> >
> > show: //an interface to enable the interface.
> > #echo INDEX > memory_index
> > will create memoryINDEX diretory.
> >
> > hide: //an interface to hide the interface.
> > #echo INDEX > memory_hide
> > will remove memoryINDEX sysfs directory.
>
> Ick, that can get confusing very quickly, and not really solve any of
> your root problems, right?
>
Why not ?
I'm sorry if I misunderstand the problem . It is tooo many entries under
a directory, which makes the routine slow. No ?
By this interface, the sysfs entries pops up on demand if the user hides
it at boot time (by boot options.)
IIUC, the most problematic IBM's 16MB-sections system has no notification
from firmware and their system management middleware controls all
memory hotplug information. IOW, all memory-hotplug actions are from
user-land, not from firmware. They should know which mem_section should be
visible under /memory before starting hotplug.
In other platforms, using ACPI, hot-added memory sections can be automatically
visible. At memory hot-remove, management software should now which phys_address
memory should be removed before it starts action. It can make visible
section directory.
For many many users, who never wants memory hotplug, they can avoid creating
sysfs for memory hotplug and will be happy.
> > In compact mode, all memoryX interface are hidden at boot.
> > In full mode, all memoryX interaface are shown.
> > The Boot option just affects status at boot. If users want, he can make
> > all memory sysfs in shown state.
> >
> > At hot-add event (via acpi) or probe-event, newly created memory section
> > should be start from "shown" mode. hotplug scirpt can hide it after online.
> >
> > At hot-remove, the users has to offline memory before hotplug. He'll has
> > to do check list and show interface.
> >
> > I think this change is not very difficult technically but can this kind of
> > interface be allowed ?
>
> Not really, I don't like it.
>
> Why not just simplify what you currently have to not use so many
> directories and files?
>
> And maybe, this doesn't belong in sysfs at all...
>
Because the system has topology and sysfs's /sys/devices/system/ shows
system's topology by directories and files, we use it. Especially, "node"
information is important in some system and we'd made use of it.
If you recommend guys to remake all NUMA-cpu-memory topology information
in other fs, someone will do.
Fujitsu will not complain about that and welcome it because we have a long
period until RHEL7 ages.
Thanks,
-Kame
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists