[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CO1PR11MB50282B593D48A928AE1B11B0A0969@CO1PR11MB5028.namprd11.prod.outlook.com>
Date: Fri, 7 Apr 2023 16:12:49 +0000
From: "Mekala, SunithaX D" <sunithax.d.mekala@...el.com>
To: Michal Swiatkowski <michal.swiatkowski@...ux.intel.com>,
Leon Romanovsky <leon@...nel.org>
CC: "netdev@...r.kernel.org" <netdev@...r.kernel.org>,
"intel-wired-lan@...ts.osuosl.org" <intel-wired-lan@...ts.osuosl.org>
Subject: RE: [Intel-wired-lan] [PATCH net v2] ice: clear number of qs when
rings are free
Still observing system hung
Test 1: Upon PF reset
Applied reproducer.patch in kernel, followed by below commands
echo 1 > /sys/module/ice/parameters/ice_reproduce_panic
echo 1 > /sys/class/net/<ice_pf>/device/reset
System did not hang but the PF interface went down with dmesg to reload driver
On unloading driver, system hangs with no response.
2. On changing queues
Applied reproducer.patch in kernel, followed by below commands
echo 1 > /sys/module/ice/parameters/ice_reproduce_panic
ethtool -L $pf rx 1 tx 1
System stops responding
>
> -----Original Message-----
> From: Intel-wired-lan <intel-wired-lan-bounces@...osl.org> On Behalf Of Michal Swiatkowski
> Sent: Monday, March 20, 2023 7:59 AM
> To: Leon Romanovsky <leon@...nel.org>
> Cc: netdev@...r.kernel.org; intel-wired-lan@...ts.osuosl.org
> Subject: Re: [Intel-wired-lan] [PATCH net v2] ice: clear number of qs when rings are free
>
> On Mon, Mar 20, 2023 at 01:51:17PM +0200, Leon Romanovsky wrote:
> > On Mon, Mar 20, 2023 at 12:23:47PM +0100, Michal Swiatkowski wrote:
> > > In case rebuild fails not clearing this field can lead to call trace.
> > >
> > > [ +0.009792] BUG: kernel NULL pointer dereference, address:
> > > 0000000000000000 [ +0.000009] #PF: supervisor read access in kernel
> > > mode [ +0.000006] #PF: error_code(0x0000) - not-present page [
> > > +0.000005] PGD 0 P4D 0 [ +0.000009] Oops: 0000 [#1] PREEMPT SMP PTI
> > > [ +0.000009] CPU: 45 PID: 77867 Comm: ice-ptp-0000:60 Kdump: loaded Tainted: G S OE 6.2.0-rc6+ #110
> > > [ +0.000010] Hardware name: Dell Inc. PowerEdge R740/0JMK61, BIOS
> > > 2.11.2 004/21/2021 [ +0.000005] RIP:
> > > 0010:ice_ptp_update_cached_phctime+0xb0/0x130 [ice] [ +0.000145]
> > > Code: fa 7e 55 48 8b 93 48 01 00 00 48 8b 0c fa 48 85 c9 74 e1 8b 51
> > > 68 85 d2 75 da 66 83 b9 86 04 00 00 00 74 d0 31 d2 48 8b 71 20 <48>
> > > 8b 34 d6 48 85 f6 74 07 48 89 86 d8 00 00 00 0f b7 b1 86 04 00 [
> > > +0.000008] RSP: 0018:ffffa036cf7c7ea8 EFLAGS: 00010246 [ +0.000008]
> > > RAX: 174ab1a8ab400f43 RBX: ffff937cda2c01a0 RCX: ffff937cdca9b028 [
> > > +0.000005] RDX: 0000000000000000 RSI: 0000000000000000 RDI:
> > > 0000000000000000 [ +0.000005] RBP: ffffa036cf7c7eb8 R08:
> > > 0000000000000000 R09: 0000000000000000 [ +0.000005] R10:
> > > 0000000000000080 R11: 0000000000000001 R12: ffff937cdc971f40 [
> > > +0.000006] R13: ffff937cdc971f44 R14: 0000000000000001 R15:
> > > ffffffffc13f3210 [ +0.000005] FS: 0000000000000000(0000)
> > > GS:ffff93826f980000(0000) knlGS:0000000000000000 [ +0.000006] CS:
> > > 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ +0.000006] CR2: 0000000000000000 CR3: 00000004b7310002 CR4: 00000000007726e0 [ +0.000006] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [ +0.000004] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 [ +0.000005] PKRU: 55555554 [ +0.000004] Call Trace:
> > > [ +0.000004] <TASK>
> > > [ +0.000007] ice_ptp_periodic_work+0x2a/0x60 [ice] [ +0.000126]
> > > kthread_worker_fn+0xa6/0x250 [ +0.000014] ?
> > > __pfx_kthread_worker_fn+0x10/0x10 [ +0.000010] kthread+0xfc/0x130
> > > [ +0.000009] ? __pfx_kthread+0x10/0x10 [ +0.000010]
> > > ret_from_fork+0x29/0x50
> > >
> > > ice_ptp_update_cached_phctime() is calling ice_for_each_rxq macro,
> > > in case of rebuild fail the rx_ring is NULL and there is NULL
> > > pointer dereference.
> > >
> >> Also for future safety it is better to clear the size values for tx
> > > and rx ring when they are cleared.
> > >
> > > Fixes: 6624e780a577 ("ice: split ice_vsi_setup into smaller
> > > functions")
> > > Reported-by: Mateusz Palczewski <mateusz.palczewski@...el.com>
> > > Signed-off-by: Michal Swiatkowski
> > > <michal.swiatkowski@...ux.intel.com>
> > > ---
> > > v1 --> v2:
> > > * change subject to net and add fixes tag
> > > ---
> > > drivers/net/ethernet/intel/ice/ice_lib.c | 2 ++
> > > 1 file changed, 2 insertions(+)
Powered by blists - more mailing lists