[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <819f15f9-1b16-4b96-8273-3f95c1e071bb@collabora.com>
Date: Wed, 28 May 2025 10:25:59 +0500
From: Muhammad Usama Anjum <usama.anjum@...labora.com>
To: Manivannan Sadhasivam <manivannan.sadhasivam@...aro.org>
Cc: usama.anjum@...labora.com, kernel@...labora.com,
sebastian.reichel@...labora.com, Jeff Johnson
<jeff.johnson@....qualcomm.com>, Baochen Qiang <quic_bqiang@...cinc.com>,
Sumit Garg <sumit.garg@...nel.org>, mhi@...ts.linux.dev,
linux-arm-msm@...r.kernel.org, linux-kernel@...r.kernel.org,
linux-wireless@...r.kernel.org, ath11k@...ts.infradead.org,
ath12k@...ts.infradead.org, Jeff Hugo <jeff.hugo@....qualcomm.com>,
Jeff Johnson <jjohnson@...nel.org>,
Jacek Lawrynowicz <jacek.lawrynowicz@...ux.intel.com>,
Youssef Samir <quic_yabdulra@...cinc.com>,
Matthew Leung <quic_mattleun@...cinc.com>,
Carl Vanderlip <quic_carlv@...cinc.com>,
"Dr. David Alan Gilbert" <linux@...blig.org>,
Greg Kroah-Hartman <gregkh@...uxfoundation.org>,
Troy Hanson <quic_thanson@...cinc.com>, Alex Elder <elder@...nel.org>,
Yan Zhen <yanzhen@...o.com>, Kunwu Chan <chentao@...inos.cn>
Subject: Re: [PATCH v6] bus: mhi: host: don't free bhie tables during
suspend/hibernation
Soft reminder
On 5/16/25 11:49 PM, Muhammad Usama Anjum wrote:
> Fix dma_direct_alloc() failure at resume time during bhie_table
> allocation because of memory pressure. There is a report where at
> resume time, the memory from the dma doesn't get allocated and MHI
> fails to re-initialize.
>
> To fix it, don't free the memory at power down during suspend /
> hibernation. Instead, use the same allocated memory again after every
> resume / hibernation. This patch has been tested with resume and
> hibernation both.
>
> Optimize the rddm and fbc bhie allocations. The rddm is of constant
> size for a given hardware. While the fbc_image size depends on the
> firmware. If the firmware changes, we'll free and allocate new memory
> for it. This patch is motivated from the ath12k [1] and ath11k [2]
> patches. They don't free the memory and reuse the same memory if new
> size is same. The firmware caching hasn't been implemented for the
> drivers other than in the nouveau. (The changing of firmware isn't
> tested/supported for wireless drivers. But let's follow the example
> patches here.)
>
> [1] https://lore.kernel.org/all/20240419034034.2842-1-quic_bqiang@quicinc.com/
> [2] https://lore.kernel.org/all/20220506141448.10340-1-quic_akolli@quicinc.com/
>
> Tested-on: WCN6855 WLAN.HSP.1.1-03926.13-QCAHSPSWPL_V2_SILICONZ_CE-2.52297.6
> Tested-on: WCN7850 hw2.0 WLAN.HMT.1.1.c5-00284-QCAHMTSWPL_V1.0_V2.0_SILICONZ-3
>
> Acked-by: Jeff Johnson <jeff.johnson@....qualcomm.com>
> Tested-by: Baochen Qiang <quic_bqiang@...cinc.com>
> Signed-off-by: Muhammad Usama Anjum <usama.anjum@...labora.com>
> ---
> Changes since v1:
> - Don't free bhie tables during suspend/hibernation only
> - Handle fbc_image changed size correctly
> - Remove fbc_image getting set to NULL in *free_bhie_table()
>
> Changes since v2:
> - Remove the new mhi_partial_unprepare_after_power_down() and instead
> update mhi_power_down_keep_dev() to use
> mhi_power_down_unprepare_keep_dev() as suggested by Mani
> - Update all users of this API such as ath12k (previously only ath11k
> was updated)
> - Define prev_fw_sz in docs
> - Do better alignment of comments
>
> Changes since v3:
> - Fix state machine of ath12k by setting ATH12K_MHI_DEINIT with
> ATH12K_MHI_POWER_OFF_KEEP_DEV state (Thanks Sebastian for testing and
> finding the problem)
> - Use static with mhi_power_down_unprepare_keep_dev()
> - Remove crash log as it was showing that kworker wasn't able to
> allocate memory.
>
> Changes since v4:
> - Update description
> - Use __mhi_power_down_unprepare_keep_dev() in
> mhi_unprepare_after_power_down()
>
> Changes since v5:
> - Update description to don't give an impression that all bhie
> allocations are being fixed. mhi_load_image_bhie() doesn't require
> this optimization.
>
> This patch doesn't have fixes tag as we are avoiding error in case of
> memory pressure. We are just making this driver more robust by not
> freeing the memory and using the same after resuming.
> ---
> drivers/bus/mhi/host/boot.c | 15 +++++++++++----
> drivers/bus/mhi/host/init.c | 18 ++++++++++++------
> drivers/bus/mhi/host/internal.h | 2 ++
> drivers/bus/mhi/host/pm.c | 1 +
> drivers/net/wireless/ath/ath11k/mhi.c | 8 ++++----
> drivers/net/wireless/ath/ath12k/mhi.c | 14 ++++++++++----
> include/linux/mhi.h | 2 ++
> 7 files changed, 42 insertions(+), 18 deletions(-)
>
> diff --git a/drivers/bus/mhi/host/boot.c b/drivers/bus/mhi/host/boot.c
> index efa3b6dddf4d2..bc8459798bbee 100644
> --- a/drivers/bus/mhi/host/boot.c
> +++ b/drivers/bus/mhi/host/boot.c
> @@ -584,10 +584,17 @@ void mhi_fw_load_handler(struct mhi_controller *mhi_cntrl)
> * device transitioning into MHI READY state
> */
> if (fw_load_type == MHI_FW_LOAD_FBC) {
> - ret = mhi_alloc_bhie_table(mhi_cntrl, &mhi_cntrl->fbc_image, fw_sz);
> - if (ret) {
> - release_firmware(firmware);
> - goto error_fw_load;
> + if (mhi_cntrl->fbc_image && fw_sz != mhi_cntrl->prev_fw_sz) {
> + mhi_free_bhie_table(mhi_cntrl, mhi_cntrl->fbc_image);
> + mhi_cntrl->fbc_image = NULL;
> + }
> + if (!mhi_cntrl->fbc_image) {
> + ret = mhi_alloc_bhie_table(mhi_cntrl, &mhi_cntrl->fbc_image, fw_sz);
> + if (ret) {
> + release_firmware(firmware);
> + goto error_fw_load;
> + }
> + mhi_cntrl->prev_fw_sz = fw_sz;
> }
>
> /* Load the firmware into BHIE vec table */
> diff --git a/drivers/bus/mhi/host/init.c b/drivers/bus/mhi/host/init.c
> index 13e7a55f54ff4..8419ea8a5419b 100644
> --- a/drivers/bus/mhi/host/init.c
> +++ b/drivers/bus/mhi/host/init.c
> @@ -1173,8 +1173,9 @@ int mhi_prepare_for_power_up(struct mhi_controller *mhi_cntrl)
> /*
> * Allocate RDDM table for debugging purpose if specified
> */
> - mhi_alloc_bhie_table(mhi_cntrl, &mhi_cntrl->rddm_image,
> - mhi_cntrl->rddm_size);
> + if (!mhi_cntrl->rddm_image)
> + mhi_alloc_bhie_table(mhi_cntrl, &mhi_cntrl->rddm_image,
> + mhi_cntrl->rddm_size);
> if (mhi_cntrl->rddm_image) {
> ret = mhi_rddm_prepare(mhi_cntrl,
> mhi_cntrl->rddm_image);
> @@ -1200,6 +1201,14 @@ int mhi_prepare_for_power_up(struct mhi_controller *mhi_cntrl)
> }
> EXPORT_SYMBOL_GPL(mhi_prepare_for_power_up);
>
> +void __mhi_unprepare_keep_dev(struct mhi_controller *mhi_cntrl)
> +{
> + mhi_cntrl->bhi = NULL;
> + mhi_cntrl->bhie = NULL;
> +
> + mhi_deinit_dev_ctxt(mhi_cntrl);
> +}
> +
> void mhi_unprepare_after_power_down(struct mhi_controller *mhi_cntrl)
> {
> if (mhi_cntrl->fbc_image) {
> @@ -1212,10 +1221,7 @@ void mhi_unprepare_after_power_down(struct mhi_controller *mhi_cntrl)
> mhi_cntrl->rddm_image = NULL;
> }
>
> - mhi_cntrl->bhi = NULL;
> - mhi_cntrl->bhie = NULL;
> -
> - mhi_deinit_dev_ctxt(mhi_cntrl);
> + __mhi_unprepare_keep_dev(mhi_cntrl);
> }
> EXPORT_SYMBOL_GPL(mhi_unprepare_after_power_down);
>
> diff --git a/drivers/bus/mhi/host/internal.h b/drivers/bus/mhi/host/internal.h
> index ce566f7d2e924..41b3fb835880b 100644
> --- a/drivers/bus/mhi/host/internal.h
> +++ b/drivers/bus/mhi/host/internal.h
> @@ -427,4 +427,6 @@ void mhi_unmap_single_no_bb(struct mhi_controller *mhi_cntrl,
> void mhi_unmap_single_use_bb(struct mhi_controller *mhi_cntrl,
> struct mhi_buf_info *buf_info);
>
> +void __mhi_unprepare_keep_dev(struct mhi_controller *mhi_cntrl);
> +
> #endif /* _MHI_INT_H */
> diff --git a/drivers/bus/mhi/host/pm.c b/drivers/bus/mhi/host/pm.c
> index e6c3ff62bab1d..c2c09c308b9b7 100644
> --- a/drivers/bus/mhi/host/pm.c
> +++ b/drivers/bus/mhi/host/pm.c
> @@ -1263,6 +1263,7 @@ void mhi_power_down_keep_dev(struct mhi_controller *mhi_cntrl,
> bool graceful)
> {
> __mhi_power_down(mhi_cntrl, graceful, false);
> + __mhi_unprepare_keep_dev(mhi_cntrl);
> }
> EXPORT_SYMBOL_GPL(mhi_power_down_keep_dev);
>
> diff --git a/drivers/net/wireless/ath/ath11k/mhi.c b/drivers/net/wireless/ath/ath11k/mhi.c
> index acd76e9392d31..c5dc776b23643 100644
> --- a/drivers/net/wireless/ath/ath11k/mhi.c
> +++ b/drivers/net/wireless/ath/ath11k/mhi.c
> @@ -460,12 +460,12 @@ void ath11k_mhi_stop(struct ath11k_pci *ab_pci, bool is_suspend)
> * workaround, otherwise ath11k_core_resume() will timeout
> * during resume.
> */
> - if (is_suspend)
> + if (is_suspend) {
> mhi_power_down_keep_dev(ab_pci->mhi_ctrl, true);
> - else
> + } else {
> mhi_power_down(ab_pci->mhi_ctrl, true);
> -
> - mhi_unprepare_after_power_down(ab_pci->mhi_ctrl);
> + mhi_unprepare_after_power_down(ab_pci->mhi_ctrl);
> + }
> }
>
> int ath11k_mhi_suspend(struct ath11k_pci *ab_pci)
> diff --git a/drivers/net/wireless/ath/ath12k/mhi.c b/drivers/net/wireless/ath/ath12k/mhi.c
> index 08f44baf182a5..3af524ccf4a5a 100644
> --- a/drivers/net/wireless/ath/ath12k/mhi.c
> +++ b/drivers/net/wireless/ath/ath12k/mhi.c
> @@ -601,6 +601,12 @@ static int ath12k_mhi_set_state(struct ath12k_pci *ab_pci,
>
> ath12k_mhi_set_state_bit(ab_pci, mhi_state);
>
> + /* mhi_power_down_keep_dev() has been updated to DEINIT without
> + * freeing bhie tables
> + */
> + if (mhi_state == ATH12K_MHI_POWER_OFF_KEEP_DEV)
> + ath12k_mhi_set_state_bit(ab_pci, ATH12K_MHI_DEINIT);
> +
> return 0;
>
> out:
> @@ -635,12 +641,12 @@ void ath12k_mhi_stop(struct ath12k_pci *ab_pci, bool is_suspend)
> * workaround, otherwise ath12k_core_resume() will timeout
> * during resume.
> */
> - if (is_suspend)
> + if (is_suspend) {
> ath12k_mhi_set_state(ab_pci, ATH12K_MHI_POWER_OFF_KEEP_DEV);
> - else
> + } else {
> ath12k_mhi_set_state(ab_pci, ATH12K_MHI_POWER_OFF);
> -
> - ath12k_mhi_set_state(ab_pci, ATH12K_MHI_DEINIT);
> + ath12k_mhi_set_state(ab_pci, ATH12K_MHI_DEINIT);
> + }
> }
>
> void ath12k_mhi_suspend(struct ath12k_pci *ab_pci)
> diff --git a/include/linux/mhi.h b/include/linux/mhi.h
> index dd372b0123a6d..6fd218a877855 100644
> --- a/include/linux/mhi.h
> +++ b/include/linux/mhi.h
> @@ -306,6 +306,7 @@ struct mhi_controller_config {
> * if fw_image is NULL and fbc_download is true (optional)
> * @fw_sz: Firmware image data size for normal booting, used only if fw_image
> * is NULL and fbc_download is true (optional)
> + * @prev_fw_sz: Previous firmware image data size, when fbc_download is true
> * @edl_image: Firmware image name for emergency download mode (optional)
> * @rddm_size: RAM dump size that host should allocate for debugging purpose
> * @sbl_size: SBL image size downloaded through BHIe (optional)
> @@ -382,6 +383,7 @@ struct mhi_controller {
> const char *fw_image;
> const u8 *fw_data;
> size_t fw_sz;
> + size_t prev_fw_sz;
> const char *edl_image;
> size_t rddm_size;
> size_t sbl_size;
--
Regards,
Usama
Powered by blists - more mailing lists