[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <e7e6ea66-bcfe-4af4-9f82-ae39fef1a976@icloud.com>
Date: Tue, 10 Sep 2024 19:46:14 +0800
From: Zijun Hu <zijun_hu@...oud.com>
To: Dan Williams <dan.j.williams@...el.com>,
quic_zijuhu <quic_zijuhu@...cinc.com>, Ira Weiny <ira.weiny@...el.com>,
Greg Kroah-Hartman <gregkh@...uxfoundation.org>
Cc: Davidlohr Bueso <dave@...olabs.net>,
Jonathan Cameron <jonathan.cameron@...wei.com>,
Dave Jiang <dave.jiang@...el.com>,
Alison Schofield <alison.schofield@...el.com>,
Vishal Verma <vishal.l.verma@...el.com>, Timur Tabi <timur@...nel.org>,
"David S. Miller" <davem@...emloft.net>, Eric Dumazet <edumazet@...gle.com>,
Jakub Kicinski <kuba@...nel.org>, Paolo Abeni <pabeni@...hat.com>,
linux-cxl@...r.kernel.org, linux-kernel@...r.kernel.org,
netdev@...r.kernel.org
Subject: Re: [PATCH v4 1/2] cxl/region: Find free cxl decoder by
device_for_each_child()
On 2024/9/10 12:15, Dan Williams wrote:
> quic_zijuhu wrote:
>> On 9/10/2024 8:45 AM, Dan Williams wrote:
>>> Ira Weiny wrote:
>>> [..]
>>>>> This still feels more complex that I think it should be. Why not just
>>>>> modify the needed device information after the device is found? What
>>>>> exactly is being changed in the match_free_decoder that needs to keep
>>>>> "state"? This feels odd.
>>>>
>>>> Agreed it is odd.
>>>>
>>>> How about adding?
>>>
>>> I would prefer just dropping usage of device_find_ or device_for_each_
>>> with storing an array decoders in the port directly. The port already
>>> has arrays for dports , endpoints, and regions. Using the "device" APIs
>>> to iterate children was a bit lazy, and if the id is used as the array
>>> key then a direct lookup makes some cases simpler.
>>
>> it seems Ira and Dan have corrected original logic to ensure
>> that all child decoders are sorted by ID in ascending order as shown
>> by below link.
>>
>> https://lore.kernel.org/all/66df666ded3f7_3c80f229439@iweiny-mobl.notmuch/
>>
>> based on above correction, as shown by my another exclusive fix
>> https://lore.kernel.org/all/20240905-fix_cxld-v2-1-51a520a709e4@quicinc.com/
>> there are a very simple change to solve the remaining original concern
>> that device_find_child() modifies caller's match data.
>>
>> here is the simple change.
>>
>> --- a/drivers/cxl/core/region.c
>> +++ b/drivers/cxl/core/region.c
>> @@ -797,23 +797,13 @@ static size_t show_targetN(struct cxl_region
>> *cxlr, char *buf, int pos)
>> static int match_free_decoder(struct device *dev, void *data)
>> {
>> struct cxl_decoder *cxld;
>> - int *id = data;
>>
>> if (!is_switch_decoder(dev))
>> return 0;
>>
>> cxld = to_cxl_decoder(dev);
>>
>> - /* enforce ordered allocation */
>> - if (cxld->id != *id)
>> - return 0;
>> -
>> - if (!cxld->region)
>> - return 1;
>> -
>> - (*id)++;
>> -
>> - return 0;
>> + return cxld->region ? 0 : 1;
>
> So I wanted to write a comment here to stop the next person from
> tripping over this dependency on decoder 'add' order, but there is a
> problem. For this simple version to work it needs 3 things:
>
> 1/ decoders are added in hardware id order: done,
> devm_cxl_enumerate_decoders() handles that
>
do not known how you achieve it, perhaps, it is not simpler than
my below solution:
finding a free switch cxl decoder with minimal ID
https://lore.kernel.org/all/20240905-fix_cxld-v2-1-51a520a709e4@quicinc.com/
which has simple logic and also does not have any limitation related
to add/allocate/de-allocate a decoder.
i am curious why not to consider this solution ?
> 2/ search for decoders in their added order: done, device_find_child()
> guarantees this, although it is not obvious without reading the internals
> of device_add().
>
> 3/ regions are de-allocated from decoders in reverse decoder id order.
> This is not enforced, in fact it is impossible to enforce. Consider that
> any memory device can be removed at any time and may not be removed in
> the order in which the device allocated switch decoders in the topology.
>
sorry, don't understand, could you take a example ?
IMO, the simple change in question will always get a free decoder with
the minimal ID once 1/ is ensured regardless of de-allocation approach.
> So, that existing comment of needing to enforce ordered allocation is
> still relevant even though the implementation fails to handle the
> out-of-order region deallocation problem.
>
> I alluded to the need for a "tear down the world" implementation back in
> 2022 [1], but never got around to finishing that.
>
> Now, the cxl_port.hdm_end attribute tracks the "last" decoder to be
> allocated for endpoint ports. That same tracking needs to be added for
> switch ports, then this routine could check for ordering constraints by:
>
> /* enforce hardware ordered allocation */
> if (!cxld->region && port->hdm_end + 1 == cxld->id)
> return 1;
> return 0;
>
> As it stands now @hdm_end is never updated for switch ports.
>
> [1]: 176baefb2eb5 cxl/hdm: Commit decoder state to hardware
>
>
>
>
>
>
>
> Yes, that looks simple enough for now, although lets not use a ternary
> condition and lets leave a comment for the next person:
>
> /* decoders are added in hardware id order
> * (devm_cxl_enumerate_decoders), allocated to regions in id order
> * (device_find_child() walks children in 'add' order)
> */
>> }
>>
>> static int match_auto_decoder(struct device *dev, void *data)
>> @@ -840,7 +830,6 @@ cxl_region_find_decoder(struct cxl_port *port,
>> struct cxl_region *cxlr)
>> {
>> struct device *dev;
>> - int id = 0;
>>
>> if (port == cxled_to_port(cxled))
>> return &cxled->cxld;
>> @@ -849,7 +838,7 @@ cxl_region_find_decoder(struct cxl_port *port,
>> dev = device_find_child(&port->dev, &cxlr->params,
>> match_auto_decoder);
>> else
>> - dev = device_find_child(&port->dev, &id,
>> match_free_decoder);
>> + dev = device_find_child(&port->dev, NULL,
>> match_free_decoder);
>> if (!dev)
>> return NULL;
>>
>>
>
>
Powered by blists - more mailing lists