[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <20190613203825.31049-1-saeedm@mellanox.com>
Date: Thu, 13 Jun 2019 20:39:13 +0000
From: Saeed Mahameed <saeedm@...lanox.com>
To: "David S. Miller" <davem@...emloft.net>
CC: "netdev@...r.kernel.org" <netdev@...r.kernel.org>,
Jiri Pirko <jiri@...lanox.com>,
Saeed Mahameed <saeedm@...lanox.com>
Subject: [pull request][net-next v2 00/15] Mellanox, mlx5 Firmware devlink
health and sw reset
Hi Dave,
This series provides the support for mlx5 Firmware devlink health and
sw reset.
For more information please see tag log below.
Please pull and let me know if there is any problem.
This is a re-spin of a previously sent series on 5.2 kernel
release.
v2:
- Improved mlx5 kernel documentation
- Addressed Jiri's comments:
Proper linkage to region and snapshot in devlink core.
Format trace dumps using fmsg helpers.
Thanks,
Saeed.
---
The following changes since commit a842fe1425cb20f457abd3f8ef98b468f83ca98b:
tcp: add optional per socket transmit delay (2019-06-12 13:05:43 -0700)
are available in the Git repository at:
git://git.kernel.org/pub/scm/linux/kernel/git/saeed/linux.git tags/mlx5-updates-2019-06-13
for you to fetch changes up to 06efeb555524a8c65ef429f2603885c31a5212b1:
Documentation: net: mlx5: Devlink health documentation (2019-06-13 13:23:19 -0700)
----------------------------------------------------------------
mlx5-updates-2019-06-13
Mlx5 devlink health fw reporters and sw reset support
This series provides mlx5 firmware reset support and firmware devlink health
reporters.
1) Add initial mlx5 kernel documentation and include devlink health reporters
2) Add CR-Space access and FW Crdump snapshot support via devlink region_snapshot
3) Issue software reset upon FW asserts
4) Add fw and fw_fatal devlink heath reporters to follow fw errors indication by
dump and recover procedures and enable trigger these functionality by user.
4.1) fw reporter:
The fw reporter implements diagnose and dump callbacks.
It follows symptoms of fw error such as fw syndrome by triggering
fw core dump and storing it and any other fw trace into the dump buffer.
The fw reporter diagnose command can be triggered any time by the user to check
current fw status.
4.2) fw_fatal repoter:
The fw_fatal reporter implements dump and recover callbacks.
It follows fatal errors indications by CR-space dump and recover flow.
The CR-space dump uses vsc interface which is valid even if the FW command
interface is not functional, which is the case in most FW fatal errors. The
CR-space dump is stored as a memory region snapshot to ease read by address.
The recover function runs recover flow which reloads the driver and triggers fw
reset if needed.
----------------------------------------------------------------
Alex Vesker (2):
net/mlx5: Add Vendor Specific Capability access gateway
net/mlx5: Add Crdump support
Aya Levin (1):
devlink: Hang reporter's dump method on a dumpit cb
Eran Ben Elisha (1):
net/mlx5: Move all devlink related functions calls to devlink.c
Feras Daoud (3):
net/mlx5: Handle SW reset of FW in error flow
net/mlx5: Control CR-space access by different PFs
net/mlx5: Issue SW reset on FW assert
Moshe Shemesh (7):
net/mlx5: Create FW devlink_health_reporter
net/mlx5: Add support for FW reporter dump
net/mlx5: Report devlink health on FW issues
net/mlx5: Add fw fatal devlink_health_reporter
net/mlx5: Add support for FW fatal reporter dump
net/mlx5: Report devlink health on FW fatal issues
Documentation: net: mlx5: Devlink health documentation
Saeed Mahameed (1):
Documentation: net: mlx5: Add mlx5 initial documentation
Documentation/networking/device_drivers/index.rst | 1 +
.../networking/device_drivers/mellanox/mlx5.rst | 173 +++++++
MAINTAINERS | 1 +
drivers/net/ethernet/mellanox/mlx5/core/Makefile | 3 +-
drivers/net/ethernet/mellanox/mlx5/core/devlink.c | 58 +++
drivers/net/ethernet/mellanox/mlx5/core/devlink.h | 14 +
.../net/ethernet/mellanox/mlx5/core/diag/crdump.c | 115 +++++
.../ethernet/mellanox/mlx5/core/diag/fw_tracer.c | 139 +++++
.../ethernet/mellanox/mlx5/core/diag/fw_tracer.h | 20 +
.../net/ethernet/mellanox/mlx5/core/en_selftest.c | 2 +-
drivers/net/ethernet/mellanox/mlx5/core/health.c | 569 ++++++++++++++++++---
drivers/net/ethernet/mellanox/mlx5/core/lib/mlx5.h | 3 +
.../net/ethernet/mellanox/mlx5/core/lib/pci_vsc.c | 316 ++++++++++++
.../net/ethernet/mellanox/mlx5/core/lib/pci_vsc.h | 32 ++
drivers/net/ethernet/mellanox/mlx5/core/main.c | 65 +--
.../net/ethernet/mellanox/mlx5/core/mlx5_core.h | 8 +-
include/linux/mlx5/device.h | 10 +-
include/linux/mlx5/driver.h | 13 +-
net/core/devlink.c | 118 ++++-
19 files changed, 1516 insertions(+), 144 deletions(-)
create mode 100644 Documentation/networking/device_drivers/mellanox/mlx5.rst
create mode 100644 drivers/net/ethernet/mellanox/mlx5/core/devlink.c
create mode 100644 drivers/net/ethernet/mellanox/mlx5/core/devlink.h
create mode 100644 drivers/net/ethernet/mellanox/mlx5/core/diag/crdump.c
create mode 100644 drivers/net/ethernet/mellanox/mlx5/core/lib/pci_vsc.c
create mode 100644 drivers/net/ethernet/mellanox/mlx5/core/lib/pci_vsc.h
Powered by blists - more mailing lists