[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <5bab650a-3c0b-cfd2-d6a7-2e39c8474514@oracle.com>
Date: Tue, 25 Oct 2022 10:44:12 -0700
From: Rohit Nair <rohit.sajan.kumar@...cle.com>
To: Leon Romanovsky <leon@...nel.org>
Cc: jgg@...pe.ca, saeedm@...dia.com, davem@...emloft.net,
edumazet@...gle.com, kuba@...nel.org, pabeni@...hat.com,
linux-rdma@...r.kernel.org, linux-kernel@...r.kernel.org,
netdev@...r.kernel.org, manjunath.b.patil@...cle.com,
rama.nichanamatlu@...cle.com,
Michael Guralnik <michaelgur@...dia.com>
Subject: Re: [External] : Re: [PATCH 1/1] IB/mlx5: Add a signature check to
received EQEs and CQEs
Hey Leon,
Please find my replies to your comments here below:
On 10/11/22 12:17 AM, Leon Romanovsky wrote:
> There is no need to ping anyone, the patch is registered in patchworks
> https://urldefense.com/v3/__https://patchwork.kernel.org/project/linux-rdma/patch/20221005174521.63619-1-rohit.sajan.kumar@oracle.com/__;!!ACWV5N9M2RV99hQ!IdRYzr4ujJ0haaWRKGd05SNbDiiW4v85yzCS233IObdO6ziwUhmEpWBC73PMs1dwbjwL5qHv9YwJrmKNtIo$
> and we will get to it.
>
> You sent the patch during merge window, no wonder that none looked on it.
>
> On Wed, Oct 05, 2022 at 10:45:20AM -0700, Rohit Nair wrote:
>> As PRM defines, the bytewise XOR of the EQE and the EQE index should be
>> 0xff. Otherwise, we can assume we have a corrupt EQE. The same is
>> applicable to CQE as well.
> I didn't find anything like this in my version of PRM.
The PRM does contain this information and can be seen on page 121 of the
reference manual. I have linked the refernece manual I consulted for
your reference.
https://network.nvidia.com/files/doc-2020/ethernet-adapters-programming-manual.pdf
Here is a short extract from the refernce manual: "Byte-wise XOR of CQE
- signature protection (see "Completion and Event Queue Elements (CQEs
and EQEs)" on page 156). CQE is valid if byte-wise XOR of entire CQE
(including signature field) and the CQE index is 0xff. For 128B CQE, the
GRH/inline_64 section is taken into account if data / GRH was written to
it (cqe_format == 2 or grh == 2)"
>
>> Adding a check to verify the EQE and CQE is valid in that aspect and if
>> not, dump the CQE and EQE to dmesg to be inspected.
> While it is nice to see prints in dmesg, you need to explain why other
> mechanisms (reporters, mlx5 events, e.t.c) are not enough.
We had recently faces an issue where the we failing to arm the CQ due to
an sn_mismatch. This issue was not reported in the logs or error events
and was the same for the corrupted CQE.
If we were able to dectect the corrupted CQE, or EQE earlier, we may
have been able to respond to the issue better and in a more timely
manner. This is our motivation behind have the error printed to dmesg.
>
>> This patch does not introduce any significant performance degradations
>> and has been tested using qperf.
> What does it mean? You made changes in kernel verbs flow, they are not
> executed through qperf.
We also conducted several extensive performance tests using our
test-suite which utilizes rds-stress and also saw no significant
performance degrdations in those results.
>> Suggested-by: Michael Guralnik <michaelgur@...dia.com>
>> Signed-off-by: Rohit Nair <rohit.sajan.kumar@...cle.com>
>> ---
>> drivers/infiniband/hw/mlx5/cq.c | 40 ++++++++++++++++++++++++++++
>> drivers/net/ethernet/mellanox/mlx5/core/eq.c | 39 +++++++++++++++++++++++++++
>> 2 files changed, 79 insertions(+)
>>
>> diff --git a/drivers/infiniband/hw/mlx5/cq.c b/drivers/infiniband/hw/mlx5/cq.c
>> index be189e0..2a6d722 100644
>> --- a/drivers/infiniband/hw/mlx5/cq.c
>> +++ b/drivers/infiniband/hw/mlx5/cq.c
>> @@ -441,6 +441,44 @@ static void mlx5_ib_poll_sw_comp(struct mlx5_ib_cq *cq, int num_entries,
>> }
>> }
>>
>> +static void verify_cqe(struct mlx5_cqe64 *cqe64, struct mlx5_ib_cq *cq)
>> +{
>> + int i = 0;
>> + u64 temp_xor = 0;
>> + struct mlx5_ib_dev *dev = to_mdev(cq->ibcq.device);
>> +
>> + u32 cons_index = cq->mcq.cons_index;
>> + u64 *eight_byte_raw_cqe = (u64 *)cqe64;
>> + u8 *temp_bytewise_xor = (u8 *)(&temp_xor);
>> + u8 cqe_bytewise_xor = (cons_index & 0xff) ^
>> + ((cons_index & 0xff00) >> 8) ^
>> + ((cons_index & 0xff0000) >> 16);
>> +
>> + for (i = 0; i < sizeof(struct mlx5_cqe64); i += 8) {
>> + temp_xor ^= *eight_byte_raw_cqe;
>> + eight_byte_raw_cqe++;
>> + }
>> +
>> + for (i = 0; i < (sizeof(u64)); i++) {
>> + cqe_bytewise_xor ^= *temp_bytewise_xor;
>> + temp_bytewise_xor++;
>> + }
>> +
>> + if (cqe_bytewise_xor == 0xff)
>> + return;
>> +
>> + dev_err(&dev->mdev->pdev->dev,
>> + "Faulty CQE - checksum failure: cqe=0x%x cqn=0x%x cqe_bytewise_xor=0x%x\n",
>> + cq->ibcq.cqe, cq->mcq.cqn, cqe_bytewise_xor);
>> + dev_err(&dev->mdev->pdev->dev,
>> + "cons_index=%u arm_sn=%u irqn=%u cqe_size=0x%x\n",
>> + cq->mcq.cons_index, cq->mcq.arm_sn, cq->mcq.irqn, cq->mcq.cqe_sz);
> mlx5_err ... and not dev_err ...
Will update dev_err to mlx5_err.
>
>> +
>> + print_hex_dump(KERN_WARNING, "", DUMP_PREFIX_OFFSET,
>> + 16, 1, cqe64, sizeof(*cqe64), false);
>> + BUG();
> No BUG() in new code.
I will remove the BUG() calls and only have it print the error msgs.
Thanks.
Best,
Rohit
Powered by blists - more mailing lists