[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <ZPsDdqt1RrXB+aTO@shell.armlinux.org.uk>
Date: Fri, 8 Sep 2023 12:20:22 +0100
From: "Russell King (Oracle)" <linux@...linux.org.uk>
To: Andrew Lunn <andrew@...n.ch>, Heiner Kallweit <hkallweit1@...il.com>
Cc: chenhao418@...wei.com, "David S. Miller" <davem@...emloft.net>,
Eric Dumazet <edumazet@...gle.com>,
Jakub Kicinski <kuba@...nel.org>, Jijie Shao <shaojijie@...wei.com>,
lanhao@...wei.com, liuyonglong@...wei.com, netdev@...r.kernel.org,
Paolo Abeni <pabeni@...hat.com>, shenjian15@...wei.com,
wangjie125@...wei.com, wangpeiyang1@...wei.com
Subject: [PATCH RFC net-next 0/7] net: phy: avoid race when erroring stopping
PHY
This series addresses a problem reported by Jijie Shao where the PHY
state machine can race with phy_stop() leading to an incorrect state.
The issue centres around phy_state_machine() dropping the phydev->lock
mutex briefly, which allows phy_stop() to get in half-way through the
state machine, and when the state machine resumes, it overwrites
phydev->state with a value incompatible with a stopped PHY. This causes
a subsequent phy_start() to issue a warning.
We address this firstly by using versions of functions that do not take
tne lock, moving them into the locked region. The only function that
this can't be done with is phy_suspend() which needs to call into the
driver without taking the lock.
For phy_suspend(), we split the state machine into two parts - the
initial part which runs under the phydev->lock, and the second part
which runs without the lock.
We finish off by using the split state machine in phy_stop() which
removes another unnecessary unlock-lock sequence from phylib.
drivers/net/phy/phy.c | 204 +++++++++++++++++++++++++++-----------------------
1 file changed, 110 insertions(+), 94 deletions(-)
--
RMK's Patch system: https://www.armlinux.org.uk/developer/patches/
FTTP is here! 80Mbps down 10Mbps up. Decent connectivity at last!
Powered by blists - more mailing lists