[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20231009085000.GAZSO+uD336B0/qGbQ@fat_crate.local>
Date: Mon, 9 Oct 2023 10:50:00 +0200
From: Borislav Petkov <bp@...en8.de>
To: "Zhuo, Qiuxu" <qiuxu.zhuo@...el.com>
Cc: "Luck, Tony" <tony.luck@...el.com>, "Li, Lili" <lili.li@...el.com>,
James Morse <james.morse@....com>,
Mauro Carvalho Chehab <mchehab@...nel.org>,
Robert Richter <rric@...nel.org>,
"linux-edac@...r.kernel.org" <linux-edac@...r.kernel.org>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>
Subject: Re: [PATCH 1/3] EDAC/igen6: Fix slab-use-after-free in
igen6_unregister_mci()
On Mon, Oct 09, 2023 at 02:39:25AM +0000, Zhuo, Qiuxu wrote:
> Rechecked the igen6_edac code, it has already done the exact opposite
> of the allocation steps on the unreg patch below:
allocation and *assignment*. And it doesn't do them in the exact
opposite way. Look again.
> In the reg path:
> igen6_probe()
> igen6_pvt = kzalloc() // Step a
> igen6_register_mci()
> mci = edac_mc_alloc(mc, ARRAY_SIZE(layers), layers, 0) // 0 means igen6 itself manages 'pvt_info'.
> mci->pvt_info = &igen6_pvt->imc[mc];
> edac_mc_add_mc(mci) // Step b
>
> In the unreg path:
> igen6_remove()
> igen6_unregister_mcis()
> edac_mc_free(mci) // Step B
> kfree(igen6_pvt) // Step A
>
> I assume that when calling edac_mc_alloc() with sz_pvt=0, 'pvt_info' is managed
> by the EDAC driver, not the EDAC core. Therefore, the EDAC core should not kfree
> 'pvt_info' unconditionally in this scenario.
That would mean that the EDAC core would have to remember whether it
allocated a private struct for the driver. Which is not needed - if the
driver has allocated it, the same driver can free it before calling
edac_mc_free().
> So the problem occurred in Step B when the EDAC core mistakenly kfreed
> 'pvt_info,' which should have been kfreed in Step A. 'pvt_info' is
> managed by the igen6_edac driver, not the EDAC core.
Your silly driver is allocating a single struct igen6_pvt and then it is
assigning pointers of the embedded struct igen6_imc elements from the
array in igen6_pvt:
mci->pvt_info = &igen6_pvt->imc[mc];
to the pvt pointer. I.e., only a *part* of that allocated memory.
Then, it is calling
edac_mc_free(mci);
on the mci which frees only that array element - not how it got
allocated and then at the end, in the remove function, you do
kfree(igen6_pvt);
where the array elements have been freed and then KASAN rightfully
complains.
So no, you're doing it wrong.
You either need to alloc the pvt in igen6_register_mci() and free it in
igen6_unregister_mcis(), *before* edac_mc_free() or ...
> A simple fix is to set mci->pvt_info = NULL just before Step B, but it
> may seem a bit magical.
... do that which is a hack.
And looking at your driver, it is doing it wrong - it seems it doesn't
understand what the point of mci->pvt_info is.
Instead of passing around
struct mem_ctl_info *mci
to all the functions that need
struct igen6_imc *imc
so that you can do
struct igen6_imc *imc = mci->pvt_info;
in them, which is exactly what that private pointer is actually for(!),
or to do to_mci() on the containing edac device, you're using that mc
index to index into the global igen6_pvt->imc[] array.
igen6_get_dimm_config() is the only function that does it right.
So this driver needs to be fixed properly to pass around that mci
pointer, like the others do. Not this bolted on hack.
> Similar issues could also occur in the following drivers where the
> EDAC core should NOT kfreed the pvt_info as these 'pvt_info' are
> managed by the specific EDAC drivers themselves (sz_pvt = 0).
>
> drivers/edac/amd8111_edac.c
> dev_info->edac_dev->pvt_info = dev_info;
That one is not even using edac_mc_free().
> drivers/edac/cpc925_edac.c
> dev_info->edac_dev->pvt_info = dev_info;
That's the wrong one - it is:
mci = edac_mc_alloc(edac_mc_idx, ARRAY_SIZE(layers), layers,
sizeof(struct cpc925_mc_pdata));
...
pdata = mci->pvt_info;
and that's handled by the EDAC core only.
> drivers/edac/x38_edac.c
> mci->pvt_info = window; // This is even an ioremap() memory and requires iounmap() to release it.
static void x38_remove_one(struct pci_dev *pdev)
...
iounmap(mci->pvt_info);
How about you slow down, take a piece of paper, draw the allocation and
assignment ordering and try to understand what exactly the idea behind
it is? Look at how the other drivers do it, there are good examples
in there.
Thx.
--
Regards/Gruss,
Boris.
https://people.kernel.org/tglx/notes-about-netiquette
Powered by blists - more mailing lists