[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID:
<OS9PR01MB124214F1E05F7D27C50591AF590AE2@OS9PR01MB12421.jpnprd01.prod.outlook.com>
Date: Thu, 3 Apr 2025 04:25:34 +0000
From: "Yasunori Gotou (Fujitsu)" <y-goto@...itsu.com>
To: "Zhijian Li (Fujitsu)" <lizhijian@...itsu.com>, Gregory Price
<gourry@...rry.net>, "linux-cxl@...r.kernel.org" <linux-cxl@...r.kernel.org>
CC: "linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
"kernel-team@...a.com" <kernel-team@...a.com>, "dan.j.williams@...el.com"
<dan.j.williams@...el.com>, "vishal.l.verma@...el.com"
<vishal.l.verma@...el.com>, "dave.jiang@...el.com" <dave.jiang@...el.com>,
"dave@...olabs.net" <dave@...olabs.net>, "jonathan.cameron@...wei.com"
<jonathan.cameron@...wei.com>, "alison.schofield@...el.com"
<alison.schofield@...el.com>, "ira.weiny@...el.com" <ira.weiny@...el.com>
Subject: RE: [PATCH v2] cxl: core/region - ignore interleave granularity when
ways=1
Hello, Gregory-san, and everyone
> Hi Gregory and CXL community
> Cc Goto-san
>
> Wow, our platform has encountered a similar issue,
Yes, I actually encountered this issue. The endpoint decoders show the granularity value as 1024 bytes,
but other decoders show 256 bytes, which is the default value, even when the interleave setting is one.
As a result, the error message that this patch avoids appeared, and initialization failed, as described in the following email:
https://lore.kernel.org/linux-cxl/OSYPR01MB53525FD64A9AFBAEEC1E19A390112@OSYPR01MB5352.jpnprd01.prod.outlook.com/T/#m811bdd93bca3887b4c14a2a20b8f21a77dcf2eae
So, Gregory's patch is welcome for me.
> and I am intending to consult the community regarding this matter.
> I drafted similar patch locally, but I wonder if we should "ignore" the IG or
> "program" the IG to the decoder.
I'll try to summarize Li-san's question.
Does anyone know what will happen if the CXL driver does not program the IG value to the decoder?
Will it work correctly without any problems?
My initial approach to avoid this problem was to "program" the decoder, as shown below.
This patch is a very early trial version to avoid the error we encountered.
However, Li-san's concern is that this patch writes the IG value to the decoders.
Is this "programming," as shown below, unnecessary?
---
diff --git a/drivers/cxl/core/hdm.c b/drivers/cxl/core/hdm.c
index 0f6661297152..46b933cc8b9f 100644
--- a/drivers/cxl/core/hdm.c
+++ b/drivers/cxl/core/hdm.c
@@ -937,6 +937,14 @@ static int init_hdm_decoder(struct cxl_port *port, struct cxl_decoder *cxld,
}
(snip)
+
+ if (cxld->interleave_granularity != 256) {
+ pr_info("y-goto: force set ig=256\n");
+ cxld->interleave_granularity = 256;
+ u32p_replace_bits(&ctrl, 0, CXL_HDM_DECODER0_CTRL_IG_MASK); <------- make the value from ig
+ writel(ctrl, hdm + CXL_HDM_DECODER0_CTRL_OFFSET(which)); <--- "program" the decoder of ig value
+ }
+
----
Thanks,
----
Yasunori Goto
>
> Let me post the mail(question) from my drafts in your thread(I hope you I hope
> you won't mind).
> ======================================
> [Question] granularity is a don't care if not interleaving?
> I saw this sentence " granularity is a don't care if not interleaving" in this patch
> "[ndctl PATCH 2/6] cxl/list: Add interleave parameters to decoder listings" [1]
>
> This reminds me that our platform programed an unmatched
> interleave_granularity in HDM decoders between endpoint and the host-bridge,
> see below:
>
> CXL Root
> CFMW0 / \ CFMW1
> decoder0.0 decoder0.1
> 128 GiB IW:1 IG:256 IW:1 IG:256 128 GiB
> \ /
> Host Bridge
> / \
> decoder5.0 decoder5.1
> IW:1 IG:256 IW:1 IG:256
> / \
> Endpoint9 Endpoint10
> | |
> decoder9.0 decoder10.0
> IW:1 IG:1024 IW:1 IG:1024
>
> With this setup, the Linux kernel attempts to create regions for Endpoint9 and
> Endpoint10 but fails because the endpoint decoders’ interleave granularity
> (IG=1024) does not match their parent decoders’ IG (256). Ideally, the
> endpoint decoders are expected to be configured for IG=256.
>
> Currently, we learned that we have only special handling for the root decoders
> [2][3].
>
> My question are:
> Q1: whether "granularity is a don't care if not interleaving" is applied to all HDM
> decoders(including root decoder and HDM decoder)
>
> In current cxl cli , it will not show any interleave_granularity at all when
> ways==1(no-interleaving) $ cxl list -PDE | grep granularity # show nothing
> when ways==1
>
> Per the CXL Spec r3.1
> IG: "The number of consecutive bytes that are assigned to each target in the
> Target List."
> Q2: Does this imply a configuration where the number of ways>1?
>
> Q3: Does the IG also represent the device's capabilities? When programming,
> should one also consider whether the device supports it?
>
>
> If "granularity is a don't care if not interleaving" is true, how about below
> changes
>
> diff --git a/drivers/cxl/core/region.c b/drivers/cxl/core/region.c index
> 75cd5dbb41e4..647fe2ce18ca 100644
> --- a/drivers/cxl/core/region.c
> +++ b/drivers/cxl/core/region.c
> @@ -1435,6 +1435,11 @@ static int cxl_port_setup_targets(struct cxl_port
> *port,
> + if (cxld->interleave_ways == 1 &&
> cxld->interleave_granularity != ig) {
> + cxld->interleave_granularity = ig;
> + /* program HDM decoder */
> + ...
> + }
> if (cxld->interleave_ways != iw ||
> cxld->interleave_granularity != ig ||
> cxld->hpa_range.start != p->res->start ||
>
>
>
> [1]
> https://lore.kernel.org/all/165973188300.1528532.222988685552982872.stgit@
> dwillia2-xfh.jf.intel.com/
> [2]
> https://lore.kernel.org/all/165853776917.2430596.16823264262010844458.stgi
> t@...llia2-xfh.jf.intel.com/
> [3]
> https://lore.kernel.org/all/169824893473.1403938.16110924262989774582.stgi
> t@...-140510-bm03.eng.stellus.in/
>
> Thanks
> Zhijian
>
> On 03/04/2025 07:25, Gregory Price wrote:
> > When validating decoder IW/IG when setting up regions, the granularity
> > is irrelevant when iw=1 - all accesses will always route to the only
> > target anyway - so all ig values are "correct". Loosen the requirement
> > that `ig = (parent_iw * parent_ig)` when iw=1.
> >
> > On some Zen5 platforms, the platform BIOS specifies a 256-byte
> > interleave granularity window for host bridges when there is only one
> > target downstream. This leads to Linux rejecting the configuration of
> > a region with a x2 root with two x1 hostbridges.
> >
> > Decoder Programming:
> > root - iw:2 ig:256
> > hb1 - iw:1 ig:256 (Linux expects 512)
> > hb2 - iw:1 ig:256 (Linux expects 512)
> > ep1 - iw:2 ig:256
> > ep2 - iw:2 ig:256
> >
> > This change allows all decoders downstream of a passthrough decoder to
> > also be configured as passthrough (iw:1 ig:X), but still disallows
> > downstream decoders from applying subsequent interleaves.
> >
> > e.g. in the above example if there was another decoder south of hb1
> > attempting to interleave 2 endpoints - Linux would enforce hb1.ig=512
> > because the southern decoder would have iw:2 and require ig=pig*piw.
> >
> > Signed-off-by: Gregory Price <gourry@...rry.net>
> > Reviewed-by: Dave Jiang <dave.jiang@...el.com>
> > ---
> > drivers/cxl/core/region.c | 2 +-
> > 1 file changed, 1 insertion(+), 1 deletion(-)
> >
> > diff --git a/drivers/cxl/core/region.c b/drivers/cxl/core/region.c
> > index 04bc6cad092c..dec262eadf9a 100644
> > --- a/drivers/cxl/core/region.c
> > +++ b/drivers/cxl/core/region.c
> > @@ -1553,7 +1553,7 @@ static int cxl_port_setup_targets(struct
> > cxl_port *port,
> >
> > if (test_bit(CXL_REGION_F_AUTO, &cxlr->flags)) {
> > if (cxld->interleave_ways != iw ||
> > - cxld->interleave_granularity != ig ||
> > + (iw > 1 && cxld->interleave_granularity != ig) ||
> > cxled->spa_range.start != p->res->start ||
> > cxled->spa_range.end != p->res->end ||
> > ((cxld->flags & CXL_DECODER_F_ENABLE) == 0)) {
Powered by blists - more mailing lists