[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <56a36d45-f779-0c67-2853-6ead9e8f9dc9@gmail.com>
Date: Thu, 15 Aug 2024 18:11:42 +0100
From: Edward Cree <ecree.xilinx@...il.com>
To: Jakub Kicinski <kuba@...nel.org>
Cc: Network Development <netdev@...r.kernel.org>
Subject: Per-queue stats question
I'm working on adding netdev_stat_ops support to sfc, and finding that
the expectations of the selftest around the relation between qstats
and rtnl stats are difficult for us to meet. I'm not sure whether it
is our existing rtnl stats or the qstats I'm adding that have the
wrong semantics.
sfc fills in rtnl_link_stats64 with MAC stats from the firmware (or
'vadaptor stats' if using SR-IOV). These count packets (or bytes)
since last FW boot/reset (for instance, "ethtool --reset $dev all"
clears them). (Also, for reasons I'm still investigating, while the
interface is administratively down they read as zero, then jump back
to what they were on "ip link set up".) Moreover, the counts are
updated by periodic DMA, so can be up to 1 second stale.
The queue stats, meanwhile, are maintained in software, and count
since ifup (efx_start_channels()), so that they can be reset on
reconfiguration; the base_stats count since driver probe
(efx_alloc_channels()).
Thus, as it stands, it is possible for qstats and rtstats to disagree,
in both directions. For example:
* Driver is unloaded and then loaded again. base_stats will reset,
but MAC stats won't.
* ethtool reset. MAC stats will reset, but base_stats won't.
* Traffic is passing during the test. qstats will be up to date,
whereas MAC stats, being up to 1s stale, could be far behind.
* RX filter drops (e.g. unwanted destination MAC address). These are
counted in MAC stats but since they never reach the driver they're
not counted in qstats/base_stats (and by my reading of netdev.yaml
they shouldn't be, even if we could).
Any of these will cause the stats.pkt_byte_sum selftest to fail.
Which side do I need to change, qstats or rtstats? Or is the test
being too strict?
On a related note, I notice that the stat_cmp() function within that
selftest returns the first nonzero delta it finds in the stats, so
that if (say) tx-packets goes forwards but rx-packets goes backwards,
it will return >0 causing the rx-packets delta to be ignored. Is
this intended behaviour, or should I submit a patch?
-ed
Powered by blists - more mailing lists