[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <20120228.143051.352474620462899753.davem@davemloft.net>
Date: Tue, 28 Feb 2012 14:30:51 -0500 (EST)
From: David Miller <davem@...emloft.net>
To: cascardo@...ux.vnet.ibm.com
Cc: yevgenyp@...lanox.co.il, netdev@...r.kernel.org,
linux-rdma@...r.kernel.org, roland@...estorage.com
Subject: Re: [PATCH] mlx4: prevent the device from being removed
concurrently
From: Thadeu Lima de Souza Cascardo <cascardo@...ux.vnet.ibm.com>
Date: Tue, 28 Feb 2012 15:36:16 -0300
> When a EEH happens, the catas poll code will try to restart the device,
> removing it and adding it back again. The EEH code will try to do the
> same. One of the threads ends up accessing memory that was freed by the
> other thread and we get a crash.
Stop adding bandaids to the locking.
If the EEH infrastructure doesn't synchronize parallel operations
on the same device, that is the real bug, and that's where the real
fix belongs.
I refuse to apply this patch.
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Powered by blists - more mailing lists