[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <4cd27a50ee6db33d1bfada1445bebb71@visp.net.lb>
Date: Wed, 13 Jan 2016 22:44:35 +0200
From: Denys Fedoryshchenko <nuclearcat@...learcat.com>
To: Netdev <netdev@...r.kernel.org>,
Santosh Raspatur <santosh@...lsio.com>
Subject: Re: cxgb3 Chelsio S310 tx drops and latency
More details:
[ 1.586586] cxgb3: Chelsio T3 Network Driver - version 1.1.5-ko
[ 1.766456] cxgb3 0000:05:00.0: Port 0 using 8 queue sets.
[ 1.766765] cxgb3 0000:05:00.0 eth0: Chelsio T310 10GBASE-R RNIC (rev
4) PCI Express x8 MSI-X
[ 1.767232] cxgb3: eth0: 128MB CM, 256MB PMTX, 256MB PMRX, S/N:
PT32100340
[ 10.665180] cxgb3 0000:05:00.0 eth0: link up, 10Gbps, full-duplex
HTTPS-VISP ~ # cxgbtool eth0 qset
QNUM IRQ TXQ0 TXQ1 TXQ2 RSPQ FL0 FL1 CONG LAT MODE
LRO
0 33 1024 1024 256 1024 1024 512 0 5 napi
1
1 34 1024 1024 256 1024 1024 512 0 5 napi
1
2 35 1024 1024 256 1024 1024 512 0 5 napi
1
3 36 1024 1024 256 1024 1024 512 0 5 napi
1
4 37 1024 1024 256 1024 1024 512 0 5 napi
1
5 38 1024 1024 256 1024 1024 512 0 5 napi
1
6 39 1024 1024 256 1024 1024 512 0 5 napi
1
7 40 1024 1024 256 1024 1024 512 0 5 napi
1
(i tired to change latency, to 200-300, disabling lro, doesn't make any
difference)
Card just hitting 2Gbps and ~550kpps limit and not going further.
qdisc mq 0: dev eth0 root
Sent 67010887991 bytes 146285692 pkt (dropped 2542108, overlimits 0
requeues 4785)
backlog 0b 0p requeues 4785
qdisc pfifo_fast 0: dev eth0 parent :1 bands 3 priomap 1 2 2 2 1 2 0 0
1 1 1 1 1 1 1 1
Sent 8680521052 bytes 18552537 pkt (dropped 360936, overlimits 0
requeues 603)
backlog 0b 0p requeues 603
qdisc pfifo_fast 0: dev eth0 parent :2 bands 3 priomap 1 2 2 2 1 2 0 0
1 1 1 1 1 1 1 1
Sent 8334648518 bytes 18343972 pkt (dropped 309649, overlimits 0
requeues 598)
backlog 0b 0p requeues 598
qdisc pfifo_fast 0: dev eth0 parent :3 bands 3 priomap 1 2 2 2 1 2 0 0
1 1 1 1 1 1 1 1
Sent 8493283599 bytes 18399753 pkt (dropped 335129, overlimits 0
requeues 627)
backlog 0b 0p requeues 627
qdisc pfifo_fast 0: dev eth0 parent :4 bands 3 priomap 1 2 2 2 1 2 0 0
1 1 1 1 1 1 1 1
Sent 8364872323 bytes 18106798 pkt (dropped 320538, overlimits 0
requeues 679)
backlog 0b 0p requeues 679
qdisc pfifo_fast 0: dev eth0 parent :5 bands 3 priomap 1 2 2 2 1 2 0 0
1 1 1 1 1 1 1 1
Sent 8359453687 bytes 18154455 pkt (dropped 237684, overlimits 0
requeues 593)
backlog 0b 0p requeues 593
qdisc pfifo_fast 0: dev eth0 parent :6 bands 3 priomap 1 2 2 2 1 2 0 0
1 1 1 1 1 1 1 1
Sent 8436917858 bytes 18269322 pkt (dropped 466950, overlimits 0
requeues 485)
backlog 0b 0p requeues 485
qdisc pfifo_fast 0: dev eth0 parent :7 bands 3 priomap 1 2 2 2 1 2 0 0
1 1 1 1 1 1 1 1
Sent 8093639865 bytes 18074244 pkt (dropped 214268, overlimits 0
requeues 611)
backlog 0b 0p requeues 611
qdisc pfifo_fast 0: dev eth0 parent :8 bands 3 priomap 1 2 2 2 1 2 0 0
1 1 1 1 1 1 1 1
Sent 8247552343 bytes 18384612 pkt (dropped 296954, overlimits 0
requeues 589)
backlog 0b 0p requeues 589
Statistics from Juniper
Interface: xe-0/0/10, Enabled, Link is Up
Encapsulation: Ethernet, Speed: 10000mbps
Traffic statistics: Current
delta
Input bytes: 734560246929350 (1950950784 bps)
[751556973]
Output bytes: 722638733836431 (1870245248 bps)
[727831935]
Input packets: 1552850512280 (519642 pps)
[1618384]
Output packets: 1474846114523 (495374 pps)
[1546618]
Error statistics:
Input errors: 0
[0]
Input drops: 0
[0]
Input framing errors: 0
[0]
Policed discards: 0
[0]
L3 incompletes: 0
[0]
L2 channel errors: 0
[0]
L2 mismatch timeouts: 0
[0]
Carrier transitions: 31
[0]
Output errors: 0
[0]
Output drops: 0
[0]
Aged packets: 0
[0]
Active alarms : None
Active defects: None
Input MAC/Filter statistics:
Unicast packets 1552850511569
[1618384]
Broadcast packets 711
[0]
Multicast packets 0
[0]
Oversized frames 0
[0]
Packet reject count 0
[0]
DA rejects 0
[0]
SA rejects 0
[0]
Output MAC/Filter Statistics:
Unicast packets 1474732465555
[1546497]
Broadcast packets 74339389
[85]
Multicast packets 39309579
[36]
Packet pad count 0
[0]
Packet error count 0
[0]
On 2016-01-13 11:08, Denys Fedoryshchenko wrote:
> Hi
>
> I am trying to use Chelsio S310 on haproxy balancers and noticed
> following problems:
>
> 1)Latency on load 2.5+ Gbps is going in spikes 10ms+. Comparing with
> other vendors who is going to 7-8Gbps without problems.
> 2)I see a lot of drops in qdisc queues, while comparing with other
> vendors without drops on higher loads.
>
> I suspect it might be or problem of card or drivers, because i am
> doing tests with similar cards from other vendor on same setup.
> CPU resources are fine, i am monitoring them with with mpstat -P ALL,
> and cpufreq set to performance (always max freq on CPU).
>
> Kernel 4.4 vanilla
>
> HTTPS-VISP ~ # ethtool -i eth0
> driver: cxgb3
> version: 1.1.5-ko
> firmware-version: T 7.12.0 TP 1.1.0
> expansion-rom-version:
> bus-info: 0000:05:00.0
> supports-statistics: yes
> supports-test: no
> supports-eeprom-access: yes
> supports-register-dump: yes
> supports-priv-flags: no
>
> Here is more details about my setup:
>
> As far as i see i am having 2.5GT/s x8 PCI-Express established, i
> checked and tried perftune.sh, it made no difference.
>
> 05:00.0 Ethernet controller: Chelsio Communications Inc S310-CR 10GbE
> Single Port Adapter
> Subsystem: Chelsio Communications Inc Device 0001
> Physical Slot: 785
> Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop-
> ParErr+ Stepping- SERR+ FastB2B- DisINTx+
> Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort-
> <TAbort- <MAbort- >SERR- <PERR- INTx-
> Latency: 0, Cache Line Size: 32 bytes
> Interrupt: pin A routed to IRQ 0
> Region 0: Memory at 92800000 (64-bit, non-prefetchable)
> [size=4K]
> Region 2: Memory at 92000000 (64-bit, non-prefetchable)
> [size=8M]
> Region 4: Memory at 92801000 (64-bit, non-prefetchable)
> [size=4K]
> Expansion ROM at 92880000 [disabled] [size=512K]
> Capabilities: [40] Power Management version 3
> Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA
> PME(D0+,D1-,D2-,D3hot+,D3cold-)
> Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
> Capabilities: [48] MSI: Enable- Count=1/32 Maskable- 64bit+
> Address: 0000000000000000 Data: 0000
> Capabilities: [58] Express (v2) Endpoint, MSI 00
> DevCap: MaxPayload 4096 bytes, PhantFunc 0, Latency
> L0s <64ns, L1 <1us
> ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset-
> DevCtl: Report errors: Correctable- Non-Fatal+ Fatal+
> Unsupported-
> RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+
> MaxPayload 256 bytes, MaxReadReq 512 bytes
> DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq+
> AuxPwr- TransPend-
> LnkCap: Port #0, Speed 2.5GT/s, Width x8, ASPM L0s L1,
> Exit Latency L0s unlimited, L1 unlimited
> ClockPM- Surprise- LLActRep- BwNot-
> ASPMOptComp-
> LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- CommClk+
> ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
> LnkSta: Speed 2.5GT/s, Width x8, TrErr- Train-
> SlotClk+ DLActive- BWMgmt- ABWMgmt-
> DevCap2: Completion Timeout: Range ABC, TimeoutDis-,
> LTR-, OBFF Not Supported
> DevCtl2: Completion Timeout: 50us to 50ms,
> TimeoutDis-, LTR-, OBFF Disabled
> LnkCtl2: Target Link Speed: 2.5GT/s, EnterCompliance-
> SpeedDis-
> Transmit Margin: Normal Operating Range,
> EnterModifiedCompliance- ComplianceSOS-
> Compliance De-emphasis: -6dB
> LnkSta2: Current De-emphasis Level: -6dB,
> EqualizationComplete-, EqualizationPhase1-
> EqualizationPhase2-, EqualizationPhase3-,
> LinkEqualizationRequest-
> Capabilities: [94] Vital Product Data
> Unknown small resource type 00, will not decode more.
> Capabilities: [9c] MSI-X: Enable+ Count=32 Masked-
> Vector table: BAR=4 offset=00000000
> PBA: BAR=4 offset=00000800
> Capabilities: [100 v1] Device Serial Number
> 00-00-00-01-00-00-00-01
> Capabilities: [300 v1] Advanced Error Reporting
> UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt-
> UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
> UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt+
> UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq+ ACSViol-
> UESvrt: DLP+ SDES+ TLP+ FCP+ CmpltTO+ CmpltAbrt+
> UnxCmplt+ RxOF+ MalfTLP+ ECRC+ UnsupReq- ACSViol-
> CESta: RxErr- BadTLP- BadDLLP- Rollover- Timeout-
> NonFatalErr-
> CEMsk: RxErr- BadTLP- BadDLLP- Rollover- Timeout-
> NonFatalErr+
> AERCap: First Error Pointer: 14, GenCap+ CGenEn-
> ChkCap+ ChkEn-
> Kernel driver in use: cxgb3
>
> With uptime just 10 hours, over lowest load time:
>
> BALANCER ~ # tc -s -d qdisc
> qdisc mq 0: dev eth0 root
> Sent 3445096608552 bytes 2777441550 pkt (dropped 6509380, overlimits
> 0 requeues 1156)
> backlog 0b 0p requeues 1156
> qdisc pfifo_fast 0: dev eth0 parent :1 bands 3 priomap 1 2 2 2 1 2 0
> 0 1 1 1 1 1 1 1 1
> Sent 428721781866 bytes 880919921 pkt (dropped 1403007, overlimits 0
> requeues 243)
> backlog 0b 0p requeues 243
> qdisc pfifo_fast 0: dev eth0 parent :2 bands 3 priomap 1 2 2 2 1 2 0
> 0 1 1 1 1 1 1 1 1
> Sent 429526790584 bytes 879237430 pkt (dropped 560836, overlimits 0
> requeues 102)
> backlog 0b 0p requeues 102
> qdisc pfifo_fast 0: dev eth0 parent :3 bands 3 priomap 1 2 2 2 1 2 0
> 0 1 1 1 1 1 1 1 1
> Sent 427233606632 bytes 882026985 pkt (dropped 1255127, overlimits 0
> requeues 221)
> backlog 0b 0p requeues 221
> qdisc pfifo_fast 0: dev eth0 parent :4 bands 3 priomap 1 2 2 2 1 2 0
> 0 1 1 1 1 1 1 1 1
> Sent 429138959430 bytes 882420516 pkt (dropped 914917, overlimits 0
> requeues 164)
> backlog 0b 0p requeues 164
> qdisc pfifo_fast 0: dev eth0 parent :5 bands 3 priomap 1 2 2 2 1 2 0
> 0 1 1 1 1 1 1 1 1
> Sent 435498527047 bytes 888437887 pkt (dropped 325962, overlimits 0
> requeues 60)
> backlog 0b 0p requeues 60
> qdisc pfifo_fast 0: dev eth0 parent :6 bands 3 priomap 1 2 2 2 1 2 0
> 0 1 1 1 1 1 1 1 1
> Sent 427458674980 bytes 881585256 pkt (dropped 610681, overlimits 0
> requeues 113)
> backlog 0b 0p requeues 113
> qdisc pfifo_fast 0: dev eth0 parent :7 bands 3 priomap 1 2 2 2 1 2 0
> 0 1 1 1 1 1 1 1 1
> Sent 435164189156 bytes 891730352 pkt (dropped 891579, overlimits 0
> requeues 156)
> backlog 0b 0p requeues 156
> qdisc pfifo_fast 0: dev eth0 parent :8 bands 3 priomap 1 2 2 2 1 2 0
> 0 1 1 1 1 1 1 1 1
> Sent 432354080111 bytes 886050500 pkt (dropped 547271, overlimits 0
> requeues 97)
Powered by blists - more mailing lists