[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <F4CC6FACFEB3C54C9141D49AD221F7F91A7A2371@lhreml503-mbx>
Date: Tue, 15 Nov 2016 15:52:46 +0000
From: Salil Mehta <salil.mehta@...wei.com>
To: Leon Romanovsky <leon@...nel.org>
CC: "dledford@...hat.com" <dledford@...hat.com>,
"Huwei (Xavier)" <xavier.huwei@...wei.com>,
oulijun <oulijun@...wei.com>,
"mehta.salil.lnk@...il.com" <mehta.salil.lnk@...il.com>,
"linux-rdma@...r.kernel.org" <linux-rdma@...r.kernel.org>,
"netdev@...r.kernel.org" <netdev@...r.kernel.org>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
Linuxarm <linuxarm@...wei.com>,
"Zhangping (ZP)" <zhangping5@...wei.com>
Subject: RE: [PATCH for-next 03/11] IB/hns: Optimize the logic of allocating
memory using APIs
> -----Original Message-----
> From: Leon Romanovsky [mailto:leon@...nel.org]
> Sent: Wednesday, November 09, 2016 7:22 AM
> To: Salil Mehta
> Cc: dledford@...hat.com; Huwei (Xavier); oulijun;
> mehta.salil.lnk@...il.com; linux-rdma@...r.kernel.org;
> netdev@...r.kernel.org; linux-kernel@...r.kernel.org; Linuxarm;
> Zhangping (ZP)
> Subject: Re: [PATCH for-next 03/11] IB/hns: Optimize the logic of
> allocating memory using APIs
>
> On Fri, Nov 04, 2016 at 04:36:25PM +0000, Salil Mehta wrote:
> > From: "Wei Hu (Xavier)" <xavier.huwei@...wei.com>
> >
> > This patch modified the logic of allocating memory using APIs in
> > hns RoCE driver. We used kcalloc instead of kmalloc_array and
> > bitmap_zero. And When kcalloc failed, call vzalloc to alloc
> > memory.
> >
> > Signed-off-by: Wei Hu (Xavier) <xavier.huwei@...wei.com>
> > Signed-off-by: Ping Zhang <zhangping5@...wei.com>
> > Signed-off-by: Salil Mehta <salil.mehta@...wei.com>
> > ---
> > drivers/infiniband/hw/hns/hns_roce_mr.c | 15 ++++++++-------
> > 1 file changed, 8 insertions(+), 7 deletions(-)
> >
> > diff --git a/drivers/infiniband/hw/hns/hns_roce_mr.c
> b/drivers/infiniband/hw/hns/hns_roce_mr.c
> > index fb87883..d3dfb5f 100644
> > --- a/drivers/infiniband/hw/hns/hns_roce_mr.c
> > +++ b/drivers/infiniband/hw/hns/hns_roce_mr.c
> > @@ -137,11 +137,12 @@ static int hns_roce_buddy_init(struct
> hns_roce_buddy *buddy, int max_order)
> >
> > for (i = 0; i <= buddy->max_order; ++i) {
> > s = BITS_TO_LONGS(1 << (buddy->max_order - i));
> > - buddy->bits[i] = kmalloc_array(s, sizeof(long),
> GFP_KERNEL);
> > - if (!buddy->bits[i])
> > - goto err_out_free;
> > -
> > - bitmap_zero(buddy->bits[i], 1 << (buddy->max_order - i));
> > + buddy->bits[i] = kcalloc(s, sizeof(long), GFP_KERNEL);
> > + if (!buddy->bits[i]) {
> > + buddy->bits[i] = vzalloc(s * sizeof(long));
>
> I wonder, why don't you use directly vzalloc instead of kcalloc
> fallback?
As we know we will have physical contiguous pages if the kcalloc
call succeeds. This will give us a chance to have better performance
over the allocations which are just virtually contiguous through the
function vzalloc(). Therefore, later has only been used as a fallback
when our memory request cannot be entertained through kcalloc.
Are you suggesting that there will not be much performance penalty
if we use just vzalloc ?
>
> > + if (!buddy->bits[i])
> > + goto err_out_free;
> > + }
> > }
Powered by blists - more mailing lists