[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <87ttg91046.fsf@yhuang6-desk2.ccr.corp.intel.com>
Date: Mon, 29 Jul 2024 09:02:33 +0800
From: "Huang, Ying" <ying.huang@...el.com>
To: Gregory Price <gourry@...rry.net>
Cc: linux-mm@...ck.org, akpm@...ux-foundation.org, dave.jiang@...el.com,
Jonathan.Cameron@...wei.com, horenchuang@...edance.com,
linux-kernel@...r.kernel.org, linux-acpi@...r.kernel.org,
dan.j.williams@...el.com, lenb@...nel.org
Subject: Re: [PATCH] acpi/hmat,mm/memtier: always register hmat adist
calculation callback
Gregory Price <gourry@...rry.net> writes:
> In the event that hmat data is not available for the DRAM tier,
> or if it is invalid (bandwidth or latency is 0), we can still register
> a callback to calculate the abstract distance for non-cpu nodes
> and simply assign it a different tier manually.
>
> In the case where DRAM HMAT values are missing or not sane we
> manually assign adist=(MEMTIER_ADISTANCE_DRAM + MEMTIER_CHUNK_SIZE).
>
> If the HMAT data for the non-cpu tier is invalid (e.g. bw = 0), we
> cannot reasonable determine where to place the tier, so it will default
> to MEMTIER_ADISTANCE_DRAM (which is the existing behavior).
Why do we need this? Do you have machines with broken HMAT table? Can
you ask the vendor to fix the HMAT table?
--
Best Regards,
Huang, Ying
> Signed-off-by: Gregory Price <gourry@...rry.net>
> ---
> drivers/acpi/numa/hmat.c | 6 ++++--
> mm/memory-tiers.c | 10 ++++++++--
> 2 files changed, 12 insertions(+), 4 deletions(-)
>
> diff --git a/drivers/acpi/numa/hmat.c b/drivers/acpi/numa/hmat.c
> index 2c8ccc91ebe6..1642d2bd83b5 100644
> --- a/drivers/acpi/numa/hmat.c
> +++ b/drivers/acpi/numa/hmat.c
> @@ -1080,8 +1080,10 @@ static __init int hmat_init(void)
> if (hotplug_memory_notifier(hmat_callback, HMAT_CALLBACK_PRI))
> goto out_put;
>
> - if (!hmat_set_default_dram_perf())
> - register_mt_adistance_algorithm(&hmat_adist_nb);
> + if (hmat_set_default_dram_perf())
> + pr_notice("Failed to set default dram perf\n");
> +
> + register_mt_adistance_algorithm(&hmat_adist_nb);
>
> return 0;
> out_put:
> diff --git a/mm/memory-tiers.c b/mm/memory-tiers.c
> index 6632102bd5c9..43bd508938ae 100644
> --- a/mm/memory-tiers.c
> +++ b/mm/memory-tiers.c
> @@ -765,8 +765,14 @@ int mt_perf_to_adistance(struct access_coordinate *perf, int *adist)
> perf->read_bandwidth + perf->write_bandwidth == 0)
> return -EINVAL;
>
> - if (default_dram_perf_ref_nid == NUMA_NO_NODE)
> - return -ENOENT;
> + /*
> + * If the DRAM tier did not have valid HMAT data, we can instead just
> + * assume that the non-cpu numa nodes are 1 tier below cpu nodes
> + */
> + if (default_dram_perf_ref_nid == NUMA_NO_NODE) {
> + *adist = MEMTIER_ADISTANCE_DRAM + MEMTIER_CHUNK_SIZE;
> + return 0;
> + }
>
> /*
> * The abstract distance of a memory node is in direct proportion to
Powered by blists - more mailing lists