[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <aLkiNdGIXsogC6Rr@zatzit>
Date: Thu, 4 Sep 2025 15:23:01 +1000
From: David Gibson <david@...son.dropbear.id.au>
To: Luca Ceresoli <luca.ceresoli@...tlin.com>
Cc: Krzysztof Kozlowski <krzk@...nel.org>, devicetree@...r.kernel.org,
Rob Herring <robh@...nel.org>, Ayush Singh <ayush@...gleboard.org>,
Jason Kridner <jkridner@...il.com>,
Geert Uytterhoeven <geert@...ux-m68k.org>,
Krzysztof Kozlowski <krzk+dt@...nel.org>,
Conor Dooley <conor+dt@...nel.org>,
devicetree-compiler@...r.kernel.org, linux-kernel@...r.kernel.org,
Thomas Petazzoni <thomas.petazzoni@...tlin.com>,
Herve Codina <herve.codina@...tlin.com>, Andrew Davis <afd@...com>
Subject: Re: Device tree representation of (hotplug) connectors: discussion
at ELCE
On Tue, Sep 02, 2025 at 10:57:10AM +0200, Luca Ceresoli wrote:
> Hello,
>
> [this main was co-written by Hervé and Luca]
>
> Hervé and I are working since 1+y to allow describing hot-pluggable
> add-ons and their connectors with device tree overlays. Our work is not
So.. I think this is a poor way of framing the question. Device tree
overlays were, frankly, a quick hack to get some sort device tree
variability happening. They were pretty easy to implement, but they
have a bunch of limitations. Then, as such things so often are, they
were used and overused.
IMO, a lot of the problems being encountered now are really those
fundamental limitations of the overlay approach. Trying to address
them "with device tree overlays" is constraining you to a poorly
thought out approach, adding hacks on top of hacks.
I proposed a possible "connector" format years ago (which I still
think could do with renewed consideration) as an *alternative to*
device tree overlays, not as an extension of them.
> very much progressing because discussions about device tree bindings
> has raised some issue that are not obvious to solve.
>
> This e-mail is a report of the efforts we did last week during the
> Embedded Linux Conference Europe to try to address the currently
> blocking issues.
>
> First, I gave a talk about the overall hotplug work, to provide a
> status update but also to clarify the goals and use cases. Slides are
> available at [2]. Goals include:
>
> - decoupling base board and add-on, so an addon can have a single dtbo
> valid for any base board, and vice versa
Good goal. This one can be somewhat addressed within the dtbo format
(e.g. the 'export-symbols' proposal'). However doing so leans into
the limitations of the dtbo format, which I think won't serve you
well.
> - supporting main boards with multiple connectors where multiple
> instances of the same addon model can be connected independently
Such as right here. The basic overlay approach is really badly
suited to this.
> - allowing overlay insertion and removal at runtime (hotplug)
Again, I think that's poor framing. You want to be able to insert and
remove things into connectors. Thinking of that as inserting and
removing overlays hotplug limits you to crappy solutions. The basic
definition of overlay application is fundmentally lossy - making them
removable requires a pile of ugly hacks. Better to look at an
approach at a different semantic layer.
> The first goal implies that addon overlays do not refer to anything
> (phandles) beyond the connector node.
>
> The talk has attracted a lot of people. All seats in the 200+ room were
> taken, and when I asked who has a connector use case about 40-50
> attendeed raised their hands. I also had several questions asked after
> the talk and in the hallway.
>
> After the talk we had planned a discussion about the topic. Krzysztof
> Kozlowski was present in person (thanks!), while Ayush Singh and
> Wolfram Sang connected remotely. Jason Kridner (beagleboard.org) and
> Geert Uytterhoeven were present and actively constributing to the
> discussion. Unfortunately Rob Herring was not connected, but still we
> tried to make the best out of the discussion. So we focused on
> discussing the current proposals to go past the issues with our
> export-symbols proposal raised mainly by Rob.
>
> Here is a summary of the ideas we have discussed, in order from
> simplest discussion (looking like not doable) to most complex (which
> look like doable).
>
> ---------------------------------------------------------------------
>
> Idea #1: Label on __overlay__
> Proposed by Rob in [0]
>
> > Couldn't we make something like this work:
> >
> > connector: __overlay__ {
> > node {
> > foo-gpio = <&connector 0 GPIO_ACTIVE_HIGH>;
> > };
> > };
>
> This would be OK for simple cases but it only allows exporting one
> label, for the connector (i.e. the overlay target node). More than one
> label need to be referenced from the overlay for cases such as:
>
> - pinmux, where each pinmux configuration is a node, and is defined
> in the pinmux node outside of the connector
> - HDMI ddc-i2c property, for HDMI chips in the overlay which needs to
> point at an I2C adapter in the base tree
Even wiring up plain old interrupts could be hard with this approach.
It's also not clear how it would be encoded in the dtb.
> ---------------------------------------------------------------------
>
> Idea #2: add /export/ keyword to mark labels to be exported
> Proposed by Rob in [1]
>
> The idea is to mark modes in the base tree that can be referenced by
> overlays:
>
> > /export/ label: node {
> > };
> >
> > And then __symbols__ can be only those exported labels (unless -@ is used).
>
> This is an opt-in version of the "global" __symbols__ to limit the
> issues __symbols__ introduces. However it is not sufficient for
> connectors because it tells what can be exported but not on which
> connector. Also, overlays would need to refer to the nodes in the main
> tree, thus not decoupling mainboard and addon.
Sounds like a strictly worse version of export-symbols.
> ---------------------------------------------------------------------
>
> Idea #3: label on empty (*) node
> (*) until overlay applied
> Proposed by Hervé at LPC2024 in a discussion with Krzysztof, later
> abandoned
>
> This is based on Idea #1 but tries to make HDMI ddc-i2c work:
>
> connector1: connector1 {
> #gpio-cells = <2>;
> gpio-map = <0 0 &soc_gpio 12 0>;
> gpio-map-mask = <0xf 0x0>;
> gpio-map-pass-thru = <0x0 0xf>;
>
> i2c8: i2c-hdmi { [**]
> i2c-parent = <&soc_i2c8>;
> }
> };
>
> connector: __overlay__ {
> node {
> foo-gpio = <&connector 0 GPIO_ACTIVE_HIGH>;
> };
> i2c_hdmi: i2c-hdmi {
> //empty
> };
> hdmictrl@...76 {
> ddc-i2c = <&i2c_hdmi>;
> };
> };
Having symbol / glue information that's local to a particular
connector is, I think, the way to go. But again, I think encoding
this in terms of overlays semantics is going to make it harder than it
needs to be.
> This would leverage the i2c-bus-extension work (also under discussion
> [3]). Since for HDMI an I2C device is not added it would have a node
> (i2c-hdmi) that is empty in the overlay (but not in the base tree and
> thus not in the live tree after the overlay is applied). This empty
> node is needed to ensure we can have a label (i2c_hdmi) that can be
> referenced from elsewhere in the overlay (ddc-i2c).
>
> However there are various issues with this approach:
>
> - mainlin, it does not handle pinumxes nicely
> - if the node that is overlayed by the empty node (i2c-hdmi) has a
> label in the base tree (line [**]), then the overlay-provided
> phandle ID would screw up the base-tree phandle ID
> - in dtbo, the empty node (i2c-hdmi) has a property in the overlay
> (phandle) but the node exists in the base tree, thus the property
> would leak on removal
>
> ---------------------------------------------------------------------
>
> Idea #4: resolving phandle errors by the connector driver
> Proposed by Rob in [1]
>
> > I'll throw out another idea. What if we make resolving phandle errors
> > something that can be handled by the connector driver? The driver
> > knows 'connector' resolves to the connector node it is applying the
> > overlay to.
>
> This idea looked promising, so we tried simulating the process with a
> dts/dtso example:
>
> Base tree:
>
> connector1 {
> compatible = "myvendor,myconn";
>
> #gpio-cells = <2>;
> gpio-map = <0 0 &soc_gpio1 12 0>, <1 0 &soc_gpio3 42 0>;
> gpio-map-mask = <0xf 0x0>;
> gpio-map-pass-thru = <0x0 0xf>;
>
> i2c-sensors {
> compatible = "i2c-bus-extension";
> i2c-parent = <&i2c@...d0000>;
> };
>
> hdmi-ddc-adapter = <&soc_i2c8>;
>
> // All pinctrls that addons may need
> pin12-pinctrl-i2c = <&pin12_mode_i2c>;
> pin1-pinctrl-gpio = <&pin1_mode_gpio>;
> pin2-pinctrl-gpio = <&pin2_mode_gpio>;
> };
>
> Overlay:
>
> / {
> fragment@0 {
> __overlay__ {
> node {
> foo-gpios = <&connector 0 GPIO_ACTIVE_HIGH>, <&connector 1 GPIO_ACTIVE_HIGH>;
> };
> i2c-sensors {
> thm: thermal@15 {reg = <15>;...};
> };
> hdmictrl@...45678 {
> ddc-i2c = <&ddc_adapter>; [*]
> };
> some_other_node {
> pinctrl-0 = <&pin12_pinctrl_i2c>;
> thermal = <&thm>;
> };
> };
>
> This is what would happen for the HDMI ddc-i2c at line [*]:
>
> 1. of_overlay_fdt_apply_new(..., resolve_dt_error_cb) is called;
> it is a variant of of_overlay_fdt_apply() (name to be defined!) that:
> a. takes a function pointer to invoke the connector for resolving
> unknown labels
> b. does not even try to resolve phandles beyond the connector
> c. if target node has no phandle, creates one with next unused
> number
> 2. resolver does not find 'ddc_adapter' label
> 3. before calling it a fatal error, resolver calls connector driver
> callback
> 4. connector driver callback knows the "ddc_adapter" string must be
> resolved using the "hdmi-ddc-adapter" property, returns soc_i2c8
> phandle ID
>
> connector driver callback in pseudocode:
>
> resolve_dt_error_cb(conn, label)
> {
> switch (label) {
> case "connector":
> return conn->of_node;
> case "ddc_adapter":
> return resolve(conn->of_node, "hdmi-ddc-adapter");
> case "pin12_pinctrl_i2c":
> return resolve(conn->of_node, "pin12-pinctrl-i2c");
> }
> }
The idea of putting logic into a connector driver makes sense.
However, it's unclear to me where those strings its resolving are
actually encoded in the dtb.
> We discussed some possible issues, such as: what if a label is actually
> found in the base tree and thus resolved? This is handled by point 1.b.
> above: the OF core does not even try to resolve phandles beyond the
> connector, it would not make sense for connector anyway. In other words
> it only resolves local fixups, which are internal to the overlay, such
> as "thm" in the example above.
>
> This looked like the most promising approach because it handles nicely
> HDMI DDC and pinmux and minimize pollution in the phandle ID space.
>
> ---------------------------------------------------------------------
>
> So that was what we discussed in the meeting last Tuesday. We hope this
> will help in setting the current point and let the discussion move
> forward.
Let me throw a few more ideas in the pot. None of these are total
solutions, but I think they may make good components of solutions.
1) Connector local labels/symbols/aliases
This is not a new idea - both the export-symbols proposal and my
ancient connector proposal had this in one form or another. I think
something along these lines is almost essential. Things that plug
into connectors almost always require references to several host board
resources (interrupt controller, gpio, ...). In order to be pluggable
on multiple host boards you want to refer to those symbolically. In
order to support multiple instances of the same connector type, you
need those symbols to refer to different things fordifferent connector
instances.
Whhat I think is a mistake is trying to tie this too closely to the
existing __symbols__ structure. Those have an ugly encoding that
requires tortured processing in a way that's not natural for dtb
handling. Plus they already kinda-sorta duplicate old-school aliases
in an odd way.
You want some sort of string => node mapping on the connector side,
and a way to mark portions of properties on the plugin side as being
resolved to some string reference. But we can take the opportunity to
design a better way of doing that than the ugly one we have now.
2) Extend dtb itself
A maor thing that makes current symbols and fixups ugly is the fact
that they are encoded into properties in the device tree itself,
despite being logically at a different semantic level. Obviously you
*can* do that, but it's not natural. It would make more sense to add
fixup tags into the dtb format itself.
3) bus-reg / bus-ranges
One thing that makes connector plugins a bit awkward is that they
often need to add things to multiple buses on the host system (MMIO &
i2c for a simple case). This means that once resolved the plugin
isn't neatly a single subtree. That's one factor making removal
really awkward. Here's an idea I had a while ago to allow plugins to
be a single subtree, by extending what's allowed in the tree content:
Currently a node can only really have a presence on its immediate
parent bus, as encoded in the 'reg' and 'ranges' properties.
'bus-reg' and 'bus-ranges' would extend that having a similar format
to 'reg' and 'ranges' but adding a phandle for each entry saying which
bus it lives on - somewhat similar to interrupt-map.
For example, here's an MMIO bus bridge of some sort, which has control
registers on I2C:
mmio-bus@... {
#address-cells = < 2 >;
#size-cells = < 2 >;
bridge@...X {
ranges = <...>;
bus-reg = <&i2c0 0x407>
}
}
i2c0: i2c@... {
#address-cells = < 1 >;
#size-cells = < 0 >;
}
In a sense this extends the device tree to a device DAG.
Obviously this does need changes at the OS device core level, but it
gives you a lot of flexibility having done so.
4) You don't necessarily need to build a "full" device tree
Flattened device trees (as opposed to original IEEE1275 device trees)
- by design - allow certain information to be omitted. The most
common example is that for introspectable buses, like PCI, it's normal
to have the DT only include a node for the host bridge, with devices
under it being discovered by their own bus specific methods. That's
discovery is handled by the bus/bridge driver.
Connectors usually aren't introspectable, but it's still possible to
use an approach like this where the connector driver's discovery
method is "look at a different device tree". So, for example,
Board device tree:
/ {
compatible = "board-with-foo-connector";
. . .
mmio@... {
foo-connector@... {
compatible = "foo-connector";
ranges = < ... >;
}
}
}
Foo device tree:
/ {
compatible = "foo-device";
foo-port-id = < 0x1234 >;
component@... {
reg = < ... >;
}
}
Obviously a "foo device tree" would have different conventions than a
board device tree. It wouldn't have /cpus, /memory, /chosen - but it
could have its own "magic" nodes that make sense for the properties of
the specific connector type.
Again, that would require work in the device core part of the OS. The
bonus is that runtime addition and removal is now trivial. No hacking
of the base device tree is needed, and so doesn't need to be reverted.
The connector driver just adds/removes the reference to its own
private tree.
This would, of course, need some way to refer to board resources
(interrupt controller, gpio controller) etc. I think that can be
assembled using some of the previous ideas, though.
--
David Gibson (he or they) | I'll have my music baroque, and my code
david AT gibson.dropbear.id.au | minimalist, thank you, not the other way
| around.
http://www.ozlabs.org/~dgibson
Download attachment "signature.asc" of type "application/pgp-signature" (834 bytes)
Powered by blists - more mailing lists