[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <eccb157e-10c3-4009-9036-dabccf4ba918@amd.com>
Date: Mon, 17 Feb 2025 13:41:02 +0000
From: Alejandro Lucero Palau <alucerop@....com>
To: Ira Weiny <ira.weiny@...el.com>, linux-cxl@...r.kernel.org,
netdev@...r.kernel.org, dan.j.williams@...el.com, edward.cree@....com,
davem@...emloft.net, kuba@...nel.org, pabeni@...hat.com,
edumazet@...gle.com, dave.jiang@...el.com
Subject: Re: [PATCH v10 09/26] cxl: support device identification without
mailbox
On 2/6/25 19:23, Ira Weiny wrote:
> Alejandro Lucero Palau wrote:
>> On 2/5/25 21:45, Ira Weiny wrote:
>>> alucerop@ wrote:
>>>> From: Alejandro Lucero <alucerop@....com>
>>>>
>>>> Type3 relies on mailbox CXL_MBOX_OP_IDENTIFY command for initializing
>>>> memdev state params.
>>>>
>>>> Allow a Type2 driver to initialize same params using an info struct and
>>>> assume partition alignment not required by now.
>>>>
>>>> Signed-off-by: Alejandro Lucero <alucerop@....com>
>>> This is exactly the type of thing I was hoping to avoid by removing these
>>> members from the mds. There is no reason you should have to fake these
>>> values within an mds just to create partitions in the device state.
>>
>> Let's be practical here.
>>
>>
>> A type2 without a mailbox needs to give that information for building up
>> the DPA partitions. Before it was about dealing with DPA resources from
>> the accel driver, but I do not think an accel driver should handle any
>> partition setup at all.
> I 100% totally agree! However, the dev state is where those partitions are
> managed. Not the memdev state.
But as I said in other previous patches, this patchset version does use
cxl_memdev_state as the opaque struct to be used by the accel driver.
>> Mainly because there is code now doing that in
>> the cxl core which can be used for accel drivers without requiring too
>> much effort. You can see what the sfc driver does now, and it is
>> equivalent to the current pci driver. An accel driver with a device
>> supporting a mailbox will do exactly the same than the pci driver.
>>
> I agree that the effort you made in these patches was not huge. Changing the
> types around and defining mds_info is not hard. But the final result is odd
> and does not fix a couple of the issues Dan had with the core architecture.
> First of which is the carrying of initialization values in the memdev
> state:[1]
>
> [1]
>
> > @@ -473,7 +488,9 @@ static inline struct cxl_dev_state *mbox_to_cxlds(struct cxl_mailbox *cxl_mbox)
> > * @dcd_cmds: List of DCD commands implemented by memory device
> > * @enabled_cmds: Hardware commands found enabled in CEL.
> > * @exclusive_cmds: Commands that are kernel-internal only
> > - * @total_bytes: sum of all possible capacities
> > + * @total_bytes: length of all possible capacities
> > + * @static_bytes: length of possible static RAM and PMEM partitions
> > + * @dynamic_bytes: length of possible DC partitions (DC Regions)
> > * @volatile_only_bytes: hard volatile capacity
> > * @persistent_only_bytes: hard persistent capacity
>
> I have regrets that cxl_memdev_state permanently carries runtime
> storage for init time variables, lets not continue down that path
> with DCD enabling.
>
> -- https://lore.kernel.org/all/67871f05cd767_20f32947f@dwillia2-xfh.jf.intel.com.notmuch/
>
>> For avoiding the mds fields the weight should not be on the accel
>> driver.
> I agree. So why would you want to use the mds fields at all?
I just wanted to have Type2 support patchset working with the new DPA
work. I was hoping those concerns not addressed with another patch or
patches Type2 work should be adapted to.
>
> I proposed a helper function to create cxl_dpa_info [cxl_add_partition] and Dan
> proposed a function to create the partitions from cxl_dpa_info
> [cxl_dpa_setup].[2]
>
> [2]
>
> void cxl_add_partition(struct cxl_dpa_info *info, u64 start, u64 size, enum cxl_partition_mode mode)
> int cxl_dpa_setup(struct cxl_dev_state *cxlds, const struct cxl_dpa_info *info)
>
> -- https://lore.kernel.org/all/20250128-rfc-rearch-mem-res-v1-2-26d1ca151376@intel.com/
>
> What more do you need?
I need a stable API to work with which is not going to change so quick
after a work like the DPA changes.
>> This patch adds a way for giving the required (and little) info
>> to the core for building the partitions.
> The second issue with your patch set is in the addition of struct mds_info.
> This has the same issue which Dan objected to about creating a temporary
> variable[3] but this is worse than my proposal in that your set continues to
> carry the initialization state around in the memdev forever.
>
> [3]
>
> The crux of the concern for me is less about the role of
> cxl_mem_get_partition_info() and more about the introduction of a new
> 'struct cxl_mem_dev_info' in/out parameter which is similar in function
> to 'struct cxl_dpa_info'. If you can find a way to avoid another level
> of indirection or otherwise consolidate all these steps into a straight
> line routine that does "all the DPA enumeration" things.
>
> -- https://lore.kernel.org/all/67a28921ca0b5_2d2c29434@dwillia2-xfh.jf.intel.com.notmuch/
>
>
> Note to Dan. I think doing 'all the DPA enumeration' things is the issue
> here. DCD further complicates this because it adds an additional DPA
> discovery mechanism. In summary we have:
>
> 1) Identify Memory Device (existing)
> 2) Hard coded values (Alejandro's type 2 set)
> 3) Get dynamic capacity configuration (DCD set)
>
> It is conceivable that a device might want to do some random combination of
> those. But the combinations we have in front of us are:
>
> A) 1 only
> B) 2 only
> C) 1 & 3
>
> I'm not sure it is worth having a single call which attempts to enumerate the
> dpa info. I'll explore having a call which does A & C for mailbox supported
> devices. But B was specifically in my mind when I came up with the
> cxl_add_partition() call. And I felt using it in A and C would work just
> fine.
>
>> So if you or Dan suggest this
>> is wrong and the accel driver should deal with the intrinsics of DPA
>> partitions, I will fight against it :-)
> I don't want an accel driver to deal with the intrinsics of the DPA
> partitions at all! But it should be able to specify the size parameters
> separate from creating dummy memdev state objects with values it does not
> care about.
Yes, but those objects have been there for a long time ...
I bet we can optimize other aspects of those structs as well, but this
is being done in the middle of patches like Type2 and DCD relying on them.
>> I'm quite happy with the DPA partition work,
> As am I. I'm just trying to go a step further so it fits a bit cleaner
> when DCD comes along. I do apologize for the delay and churn in your set.
> That was not my intention. But I thought the alterations of the memdev
> state were a good clean up.
>
>> with the result of current
>> v10 being simpler and cleaner. But it is time to get the patchsets
>> depending on that cleaning work going forward.
> Agreed.
>
> If Dan likes what you have here I will adjust the DCD work.
>
> Ira
Powered by blists - more mailing lists