[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <54F802C3.7070903@jp.fujitsu.com>
Date: Thu, 5 Mar 2015 16:16:19 +0900
From: Kamezawa Hiroyuki <kamezawa.hiroyu@...fujitsu.com>
To: Gu Zheng <guz.fnst@...fujitsu.com>
CC: Tejun Heo <tj@...nel.org>, <linux-kernel@...r.kernel.org>,
<laijs@...fujitsu.com>, <isimatu.yasuaki@...fujitsu.com>,
<tangchen@...fujitsu.com>
Subject: Re: [PATCH] workqueue: update numa affinity when node hotplug
On 2015/03/05 10:23, Gu Zheng wrote:
> Hi Kamazawa-san,
> On 03/04/2015 01:45 PM, Kamezawa Hiroyuki wrote:
>
>> On 2015/03/03 22:18, Tejun Heo wrote:
>>> Hello, Kame.
>>>
>>> On Tue, Mar 03, 2015 at 03:53:46PM +0900, Kamezawa Hiroyuki wrote:
>>>> relationship between proximity domain and lapic id doesn't change.
>>>> relationship between lapic-id and cpu-id changes.
>>>>
>>>> pxm <-> memory address : no change
>>>> pxm <-> lapicid : no change
>>>> pxm <-> node id : no change
>>>> lapicid <-> cpu id : change.
>>>
>>> So, we're changing the cpu ID to NUMA node mapping because current
>>> NUMA code is ignoring PXM for memoryless nodes? That's it?
>>>
>>
>> For memory-less node case, yes.
>> Another problem is that lapicid <-> cpuid relationship is not persistent.
>>
>>
>>>>>> I personally thinks proper fix is building persistent cpu-id <-> lapicid relationship as
>>>>>> pxm does rather than creating band-aid.
>>>>>
>>>>> Oh if this is possible, I agree that's the right direction too.
>>>>>
>>>>
>>>> Implementation is a bit complicated now :(.
>>>
>>> Ah well, even then, the obviously right thing to do is updating NUMA
>>> code to always keep track of PXM information. We don't really want to
>>> pile NUMA hacks in random users of NUMA code.
>>>
>>
>> We'd like to start from making apicid <-> cpuid persistent because memory-less
>> node case doesn't cause panic.
>>
>> Gu-san, how do you think ?
>
> Fine by me. But it seems that the change will break the re-use of free cpuid when
> hot add new cpus, I am afraid it may affect other sub-systems, though I can not
> point out the specific example.
>
That may be concern. But, IHMO, applications should not expect cpuid of newly added cpu.
If an application takes care of placement of [thread, memory], OS API should be refreshed
to allow user to tell "please pleace [thread, memory] nearby each other" rather than specifying
[cpu, memory]. (As Peter Z.'s schedNUMA)
If application takes care of physical placement of hardware, ethernet, SSD, etc...
it should not depends on cpuid.
Some of required features may not be archived yet but cpuid of newly added cpu
should not be a big problem for applications, I think.
Open dicussion with (x86/sched/mm) maintainers will be required, anyway.
I think it's worth trying.
Thanks,
-Kame
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists