lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CAPDyKFqKRy6zJdBpK3bNTvkvAjty691-Vi_HV3E5CeqgRAWGmA@mail.gmail.com>
Date: Fri, 10 May 2024 12:43:25 +0200
From: Ulf Hansson <ulf.hansson@...aro.org>
To: Linux regressions mailing list <regressions@...ts.linux.dev>, Viresh Kumar <viresh.kumar@...aro.org>
Cc: Viresh Kumar <vireshk@...nel.org>, Nishanth Menon <nm@...com>, Stephen Boyd <sboyd@...nel.org>, 
	"Rafael J. Wysocki" <rafael@...nel.org>, linux-pm@...r.kernel.org, 
	Vincent Guittot <vincent.guittot@...aro.org>, Vladimir Lypak <vladimir.lypak@...il.com>, 
	linux-kernel@...r.kernel.org
Subject: Re: [PATCH V2] OPP: Fix required_opp_tables for multiple genpds using
 same table

On Fri, 10 May 2024 at 10:54, Linux regression tracking (Thorsten
Leemhuis) <regressions@...mhuis.info> wrote:
>
> On 10.05.24 10:37, Ulf Hansson wrote:
> > On Thu, 9 May 2024 at 14:35, Thorsten Leemhuis
> > <regressions@...mhuis.info> wrote:
> >> On 12.04.24 08:41, Viresh Kumar wrote:
> >>> The required_opp_tables parsing is not perfect, as the OPP core does the
> >>> parsing solely based on the DT node pointers.
> >>>
> >>> The core sets the required_opp_tables entry to the first OPP table in
> >>> the "opp_tables" list, that matches with the node pointer.
> >>>
> >>> If the target DT OPP table is used by multiple devices and they all
> >>> create separate instances of 'struct opp_table' from it, then it is
> >>> possible that the required_opp_tables entry may be set to the incorrect
> >>> sibling device.
> >>>
> >>> Unfortunately, there is no clear way to initialize the right values
> >>> during the initial parsing and we need to do this at a later point of
> >>> time.
> >>>
> >>> Cross check the OPP table again while the genpds are attached and fix
> >>> them if required.
> >>>
> >>> Also add a new API for the genpd core to fetch the device pointer for
> >>> the genpd.
> >>>
> >>> Cc: Thorsten Leemhuis <regressions@...mhuis.info>
> >>> Reported-by: Vladimir Lypak <vladimir.lypak@...il.com>
> >>> Closes: https://bugzilla.kernel.org/show_bug.cgi?id=218682
> >>
> >> Did this fall through the cracks? Just wondering, as from here it looks
> >> like for about four weeks now nothing happened to fix the regression
> >> linked above. But I might have missed something. Or is everybody waiting
> >> for a test from the reporter?
> >
> > I have chatted a bit with Viresh about this problem offlist, while
> > both me and him are/have been on vacations. Sorry for the delay and
> > confusion.
> >
> > The latest update from my side is that I am working on a solution,
> > that aim to remove the entire dev|devm_pm_opp_detach_genpd() API.
>
> That sounds like something that would have to wait for a merge window;
> so given the timing I assume this would mean that the earliest point in
> time to merge this would be for 6.11-rc1, which is ~2 months away --
> plus another 9 or 10 weeks until the fix would reach users.

Right.

>
> > Instead, the plan is to move consumer drivers to use
> > dev_pm_domain_attach_list() to attach multiple PM domains per device.
> > When it comes to hooking up the required-opps-tables/devs, I think
> > genpd should be able to manage this during the device attach process.
> > In this way, consumer drivers shouldn't need to care about this at
> > all.
> >
> > That said, I am hoping that $subject patch should not be needed.
> > Although, I need a bit more time before I am ready to post a patchset
> > for the above.
> >
> > What do you think?
>
> Given that the report is already more than a month old now and what I
> assumed above (which might be wrong), this makes me wonder: is there a
> downside if we apply this patch now, and simply revert this later when
> your proper solution is merged? I would assume that is what Linus want
> in this case to honor the "no regressions" rule.

Sure, I am certainly okay with this approach too!

>
> Might be something different if this is something like a really odd
> corner case we assume nobody (or nearly nobody) will run into in
> practice. But as somebody noticed this, I assume that is not the case.

I wasn't sure of the level of urgency in this case, as I don't think
we have that many DTSes upstream that could hit this case.

But nevermind, it should be easy to revert/replace the change when we
have something better to take over. Viresh, feel pick this up - or let
me know if you prefer me to pick it.

Reviewed-by: Ulf Hansson <ulf.hansson@...aro.org>

Kind regards
Uffe


>
> Ciao, Thorsten
>
> >>> Co-developed-by: Vladimir Lypak <vladimir.lypak@...il.com>
> >>> Signed-off-by: Viresh Kumar <viresh.kumar@...aro.org>
> >>> ---
> >>> V2:
> >>> - Fix an `if` condition.
> >>> - s/Bugzilla/Closes/ and change ordering.
> >>>
> >>>  drivers/opp/core.c        | 31 ++++++++++++++++++++++++++++++-
> >>>  drivers/pmdomain/core.c   | 10 ++++++++++
> >>>  include/linux/pm_domain.h |  6 ++++++
> >>>  3 files changed, 46 insertions(+), 1 deletion(-)
> >>>
> >>> diff --git a/drivers/opp/core.c b/drivers/opp/core.c
> >>> index e233734b7220..cb4611fe1b5b 100644
> >>> --- a/drivers/opp/core.c
> >>> +++ b/drivers/opp/core.c
> >>> @@ -2394,7 +2394,8 @@ static void _opp_detach_genpd(struct opp_table *opp_table)
> >>>  static int _opp_attach_genpd(struct opp_table *opp_table, struct device *dev,
> >>>                       const char * const *names, struct device ***virt_devs)
> >>>  {
> >>> -     struct device *virt_dev;
> >>> +     struct device *virt_dev, *gdev;
> >>> +     struct opp_table *genpd_table;
> >>>       int index = 0, ret = -EINVAL;
> >>>       const char * const *name = names;
> >>>
> >>> @@ -2427,6 +2428,34 @@ static int _opp_attach_genpd(struct opp_table *opp_table, struct device *dev,
> >>>                       goto err;
> >>>               }
> >>>
> >>> +             /*
> >>> +              * The required_opp_tables parsing is not perfect, as the OPP
> >>> +              * core does the parsing solely based on the DT node pointers.
> >>> +              * The core sets the required_opp_tables entry to the first OPP
> >>> +              * table in the "opp_tables" list, that matches with the node
> >>> +              * pointer.
> >>> +              *
> >>> +              * If the target DT OPP table is used by multiple devices and
> >>> +              * they all create separate instances of 'struct opp_table' from
> >>> +              * it, then it is possible that the required_opp_tables entry
> >>> +              * may be set to the incorrect sibling device.
> >>> +              *
> >>> +              * Cross check it again and fix if required.
> >>> +              */
> >>> +             gdev = dev_to_genpd_dev(virt_dev);
> >>> +             if (IS_ERR(gdev))
> >>> +                     return PTR_ERR(gdev);
> >>> +
> >>> +             genpd_table = _find_opp_table(gdev);
> >>> +             if (!IS_ERR(genpd_table)) {
> >>> +                     if (genpd_table != opp_table->required_opp_tables[index]) {
> >>> +                             dev_pm_opp_put_opp_table(opp_table->required_opp_tables[index]);
> >>> +                             opp_table->required_opp_tables[index] = genpd_table;
> >>> +                     } else {
> >>> +                             dev_pm_opp_put_opp_table(genpd_table);
> >>> +                     }
> >>> +             }
> >>> +
> >>>               /*
> >>>                * Add the virtual genpd device as a user of the OPP table, so
> >>>                * we can call dev_pm_opp_set_opp() on it directly.
> >>> diff --git a/drivers/pmdomain/core.c b/drivers/pmdomain/core.c
> >>> index 4215ffd9b11c..c40eda92a85a 100644
> >>> --- a/drivers/pmdomain/core.c
> >>> +++ b/drivers/pmdomain/core.c
> >>> @@ -184,6 +184,16 @@ static struct generic_pm_domain *dev_to_genpd(struct device *dev)
> >>>       return pd_to_genpd(dev->pm_domain);
> >>>  }
> >>>
> >>> +struct device *dev_to_genpd_dev(struct device *dev)
> >>> +{
> >>> +     struct generic_pm_domain *genpd = dev_to_genpd(dev);
> >>> +
> >>> +     if (IS_ERR(genpd))
> >>> +             return ERR_CAST(genpd);
> >>> +
> >>> +     return &genpd->dev;
> >>> +}
> >>> +
> >>>  static int genpd_stop_dev(const struct generic_pm_domain *genpd,
> >>>                         struct device *dev)
> >>>  {
> >>> diff --git a/include/linux/pm_domain.h b/include/linux/pm_domain.h
> >>> index 772d3280d35f..f24546a3d3db 100644
> >>> --- a/include/linux/pm_domain.h
> >>> +++ b/include/linux/pm_domain.h
> >>> @@ -260,6 +260,7 @@ int pm_genpd_remove_subdomain(struct generic_pm_domain *genpd,
> >>>  int pm_genpd_init(struct generic_pm_domain *genpd,
> >>>                 struct dev_power_governor *gov, bool is_off);
> >>>  int pm_genpd_remove(struct generic_pm_domain *genpd);
> >>> +struct device *dev_to_genpd_dev(struct device *dev);
> >>>  int dev_pm_genpd_set_performance_state(struct device *dev, unsigned int state);
> >>>  int dev_pm_genpd_add_notifier(struct device *dev, struct notifier_block *nb);
> >>>  int dev_pm_genpd_remove_notifier(struct device *dev);
> >>> @@ -307,6 +308,11 @@ static inline int pm_genpd_remove(struct generic_pm_domain *genpd)
> >>>       return -EOPNOTSUPP;
> >>>  }
> >>>
> >>> +static inline struct device *dev_to_genpd_dev(struct device *dev)
> >>> +{
> >>> +     return ERR_PTR(-EOPNOTSUPP);
> >>> +}
> >>> +
> >>>  static inline int dev_pm_genpd_set_performance_state(struct device *dev,
> >>>                                                    unsigned int state)
> >>>  {
> >
> >

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ