[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <m14ognkd26.fsf@fess.ebiederm.org>
Date: Sun, 27 Jun 2010 21:15:29 -0700
From: ebiederm@...ssion.com (Eric W. Biederman)
To: KOSAKI Motohiro <kosaki.motohiro@...fujitsu.com>
Cc: Nathan Fontenot <nfont@...tin.ibm.com>,
Andi Kleen <andi@...stfloor.org>, linux-kernel@...r.kernel.org,
Greg Kroah-Hartman <gregkh@...e.de>
Subject: Re: [PATCH] memory hotplug disable boot option
KOSAKI Motohiro <kosaki.motohiro@...fujitsu.com> writes:
>> On 06/25/2010 04:19 AM, Andi Kleen wrote:
>> > KOSAKI Motohiro <kosaki.motohiro@...fujitsu.com> writes:
>> >
>> >>> Proposed patch to disable memory hotplug via a boot option,
>> >>> mem_hotplug=[on|off]. The patch only disables memory hotplug in that it
>> >>> prevents the creation of the memory sysfs directories for memory sections.
>> >>>
>> >>> This patch is meant to help alleviate very long boot times on systems with
>> >>> large memory (1+ TB) and many memory sections (10's of thousands).
>> >>
>> >> Why making simple /sys file is so slowly? Couldn't we fix such performance
>> >> problem?
>>
>> The issue is the large number of sysfs memory directories that get created.
>> On a system with 1 TB of memory I am seeing ~63,00 directories. The long
>> creation time is due to the string compare check in sysfs code to ensure
>> we are not creating a directory with a duplicate name.
>
> Ah, I see. probably this is sysfs issue. So Let's cc Greg and Eric.
> Greg, I have dumb question. Why sysfs call strcmp() so heavily? I mean why sysfs
> don't have hash based name dupliation check?
Simplicity of the current implementation.
I have a prototype patch sitting around somewhere. I think ultimately
it makes sense to do something like extN's htree directory structure
in sysfs. I wanted to get the tagged sysfs support in before I worked
on scalability because that slightly changes the requirements.
Improving the scalability here is certainly worth doing, but I am slightly
concerned there is something else algorithmically wrong if this is still
going to take 33 minutes to boot with 2TB.
Eric
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists