[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <206b8e55-184e-7cea-4424-4eecfd0ec189@collabora.com>
Date: Thu, 20 Jul 2023 09:36:31 +0200
From: AngeloGioacchino Del Regno
<angelogioacchino.delregno@...labora.com>
To: Nícolas F. R. A. Prado
<nfraprado@...labora.com>, Stephen Boyd <sboyd@...nel.org>
Cc: kernel@...labora.com,
Greg Kroah-Hartman <gregkh@...uxfoundation.org>,
Hsin-Hsiung Wang <hsin-hsiung.wang@...iatek.com>,
James Lo <james.lo@...iatek.com>,
Matthias Brugger <matthias.bgg@...il.com>,
linux-arm-kernel@...ts.infradead.org, linux-kernel@...r.kernel.org,
linux-mediatek@...ts.infradead.org, Chen-Yu Tsai <wenst@...gle.com>
Subject: Re: [PATCH v2] spmi: mtk-pmif: Serialize PMIF status check and
command submission
Il 19/07/23 21:23, Nícolas F. R. A. Prado ha scritto:
> Before writing the read or write command to the SPMI arbiter through the
> PMIF interface, the current status of the channel is checked to ensure
> it is idle. However, since the status only changes from idle when the
> command is written, it is possible for two concurrent calls to determine
> that the channel is idle and simultaneously send their commands. At this
> point the PMIF interface hangs, with the status register no longer being
> updated, and thus causing all subsequent operations to time out.
>
> This was observed on the mt8195-cherry-tomato-r2 machine, particularly
> after commit 46600ab142f8 ("regulator: Set PROBE_PREFER_ASYNCHRONOUS for
> drivers between 5.10 and 5.15") was applied, since then the two MT6315
> devices present on the SPMI bus would probe assynchronously and
> sometimes (during probe or at a later point) read the bus
> simultaneously, breaking the PMIF interface and consequently slowing
> down the whole system.
>
> To fix the issue at its root cause, introduce locking around the channel
> status check and the command write, so that both become an atomic
> operation, preventing race conditions between two (or more) SPMI bus
> read/write operations. A spinlock is used since this is a fast bus, as
> indicated by the usage of the atomic variant of readl_poll, and
> '.fast_io = true' being used in the mt6315 driver, so spinlocks are
> already used for the regmap access.
>
> Fixes: b45b3ccef8c0 ("spmi: mediatek: Add support for MT6873/8192")
> Signed-off-by: Nícolas F. R. A. Prado <nfraprado@...labora.com>
>
Sorry for pointing that out in a second review, but this just came to mind...
...better late than never, right?
Instead of spinlock_t, we should use raw_spinlock_t: on non-PREEMPT_RT kernels,
spinlocks are mapped to raw spinlocks but, on PREEMPT_RT, those have a different
implementation that's giving a lot of overhead, being based on rtmutex.
If we use raw_spinlock_t we can avoid that kind of overhead in PREEMPT_RT
configurations.
Can you please switch to raw_spinlock_xxxx?
There's also another comment, check below.
> ---
>
> Changes in v2:
> - Added missing spin_unlocks to error paths
> - Moved memcpy outside spinlock region in the write_cmd function
> - Reworded commit message to make clear that issue can happen at any
> point in runtime, not only during boot
>
> drivers/spmi/spmi-mtk-pmif.c | 20 ++++++++++++++++----
> 1 file changed, 16 insertions(+), 4 deletions(-)
>
> diff --git a/drivers/spmi/spmi-mtk-pmif.c b/drivers/spmi/spmi-mtk-pmif.c
> index b3c991e1ea40..78b69e0b5c81 100644
> --- a/drivers/spmi/spmi-mtk-pmif.c
> +++ b/drivers/spmi/spmi-mtk-pmif.c
> @@ -50,6 +50,7 @@ struct pmif {
> struct clk_bulk_data clks[PMIF_MAX_CLKS];
> size_t nclks;
> const struct pmif_data *data;
> + spinlock_t lock;
> };
>
> static const char * const pmif_clock_names[] = {
> @@ -314,6 +315,7 @@ static int pmif_spmi_read_cmd(struct spmi_controller *ctrl, u8 opc, u8 sid,
> struct ch_reg *inf_reg;
> int ret;
> u32 data, cmd;
> + unsigned long flags;
>
> /* Check for argument validation. */
> if (sid & ~0xf) {
> @@ -334,12 +336,14 @@ static int pmif_spmi_read_cmd(struct spmi_controller *ctrl, u8 opc, u8 sid,
> else
> return -EINVAL;
>
> + spin_lock_irqsave(&arb->lock, flags);
> /* Wait for Software Interface FSM state to be IDLE. */
> inf_reg = &arb->chan;
> ret = readl_poll_timeout_atomic(arb->base + arb->data->regs[inf_reg->ch_sta],
> data, GET_SWINF(data) == SWINF_IDLE,
> PMIF_DELAY_US, PMIF_TIMEOUT_US);
> if (ret < 0) {
> + spin_unlock_irqrestore(&arb->lock, flags);
> /* set channel ready if the data has transferred */
> if (pmif_is_fsm_vldclr(arb))
> pmif_writel(arb, 1, inf_reg->ch_rdy);
Unlock the spinlock after setting CHANNEL_READY, otherwise, we may signal that
we're ready *while* another CPU is writing data to the bus, getting back again
to the same lockup that you're trying to solve :-)
P.S.: Same for write_cmd, of course.
P.P.S.: Adding Chen-Yu to the Cc's as he is surely interested in this one as well.
Cheers!
Angelo
> @@ -350,6 +354,7 @@ static int pmif_spmi_read_cmd(struct spmi_controller *ctrl, u8 opc, u8 sid,
> /* Send the command. */
> cmd = (opc << 30) | (sid << 24) | ((len - 1) << 16) | addr;
> pmif_writel(arb, cmd, inf_reg->ch_send);
> + spin_unlock_irqrestore(&arb->lock, flags);
>
> /*
> * Wait for Software Interface FSM state to be WFVLDCLR,
> @@ -376,7 +381,8 @@ static int pmif_spmi_write_cmd(struct spmi_controller *ctrl, u8 opc, u8 sid,
> struct pmif *arb = spmi_controller_get_drvdata(ctrl);
> struct ch_reg *inf_reg;
> int ret;
> - u32 data, cmd;
> + u32 data, wdata, cmd;
> + unsigned long flags;
>
> if (len > 4) {
> dev_err(&ctrl->dev, "pmif supports 1..4 bytes per trans, but:%zu requested", len);
> @@ -394,12 +400,17 @@ static int pmif_spmi_write_cmd(struct spmi_controller *ctrl, u8 opc, u8 sid,
> else
> return -EINVAL;
>
> + /* Set the write data. */
> + memcpy(&wdata, buf, len);
> +
> + spin_lock_irqsave(&arb->lock, flags);
> /* Wait for Software Interface FSM state to be IDLE. */
> inf_reg = &arb->chan;
> ret = readl_poll_timeout_atomic(arb->base + arb->data->regs[inf_reg->ch_sta],
> data, GET_SWINF(data) == SWINF_IDLE,
> PMIF_DELAY_US, PMIF_TIMEOUT_US);
> if (ret < 0) {
> + spin_unlock_irqrestore(&arb->lock, flags);
> /* set channel ready if the data has transferred */
> if (pmif_is_fsm_vldclr(arb))
> pmif_writel(arb, 1, inf_reg->ch_rdy);
> @@ -407,13 +418,12 @@ static int pmif_spmi_write_cmd(struct spmi_controller *ctrl, u8 opc, u8 sid,
> return ret;
> }
>
> - /* Set the write data. */
> - memcpy(&data, buf, len);
> - pmif_writel(arb, data, inf_reg->wdata);
> + pmif_writel(arb, wdata, inf_reg->wdata);
>
> /* Send the command. */
> cmd = (opc << 30) | BIT(29) | (sid << 24) | ((len - 1) << 16) | addr;
> pmif_writel(arb, cmd, inf_reg->ch_send);
> + spin_unlock_irqrestore(&arb->lock, flags);
>
> return 0;
> }
> @@ -488,6 +498,8 @@ static int mtk_spmi_probe(struct platform_device *pdev)
> arb->chan.ch_send = PMIF_SWINF_0_ACC + chan_offset;
> arb->chan.ch_rdy = PMIF_SWINF_0_VLD_CLR + chan_offset;
>
> + spin_lock_init(&arb->lock);
> +
> platform_set_drvdata(pdev, ctrl);
>
> err = spmi_controller_add(ctrl);
Powered by blists - more mailing lists