Date:	Wed, 13 Jan 2016 22:44:35 +0200
From:	Denys Fedoryshchenko <nuclearcat@...learcat.com>
To:	Netdev <netdev@...r.kernel.org>,
	Santosh Raspatur <santosh@...lsio.com>
Subject: Re: cxgb3 Chelsio S310 tx drops and latency

More details:

[    1.586586] cxgb3: Chelsio T3 Network Driver - version 1.1.5-ko
[    1.766456] cxgb3 0000:05:00.0: Port 0 using 8 queue sets.
[    1.766765] cxgb3 0000:05:00.0 eth0: Chelsio T310 10GBASE-R RNIC (rev 4) PCI Express x8 MSI-X
[    1.767232] cxgb3: eth0: 128MB CM, 256MB PMTX, 256MB PMRX, S/N: PT32100340
[   10.665180] cxgb3 0000:05:00.0 eth0: link up, 10Gbps, full-duplex

HTTPS-VISP ~ # cxgbtool eth0 qset
QNUM  IRQ   TXQ0   TXQ1  TXQ2   RSPQ    FL0    FL1  CONG   LAT  MODE  LRO
    0   33   1024   1024   256   1024   1024    512     0     5  napi    1
    1   34   1024   1024   256   1024   1024    512     0     5  napi    1
    2   35   1024   1024   256   1024   1024    512     0     5  napi    1
    3   36   1024   1024   256   1024   1024    512     0     5  napi    1
    4   37   1024   1024   256   1024   1024    512     0     5  napi    1
    5   38   1024   1024   256   1024   1024    512     0     5  napi    1
    6   39   1024   1024   256   1024   1024    512     0     5  napi    1
    7   40   1024   1024   256   1024   1024    512     0     5  napi    1

(I tried changing the latency to 200-300 and disabling LRO; neither made
any difference.)

The card hits a limit of about 2Gbps and ~550kpps and does not go any further.
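
As a rough sanity check on whether the PCIe link itself could be the
ceiling, the sketch below estimates the usable data rate of the 2.5GT/s x8
link reported in the quoted lspci output. It assumes the standard 8b/10b
line coding of first-generation PCIe and ignores TLP/DLLP protocol
overhead, so the real payload rate is somewhat lower. The function name is
only illustrative.

```python
# Rough per-direction data rate of a PCIe Gen1 link.
# Assumption: 8b/10b line coding (8 data bits per 10 bits on the wire);
# packet (TLP/DLLP) overhead is NOT accounted for here.

def pcie_data_rate_bps(transfers_per_s=2_500_000_000, lanes=8):
    """Return the post-encoding data rate in bits per second."""
    raw_bps = transfers_per_s * lanes      # raw bits on the wire
    return raw_bps * 8 // 10               # strip 8b/10b coding overhead

link_bps = pcie_data_rate_bps()
observed_bps = 2_000_000_000               # the ~2Gbps ceiling reported above

print(f"link data rate : {link_bps / 1e9:.1f} Gbps per direction")
print(f"observed load  : {observed_bps / 1e9:.1f} Gbps "
      f"({100 * observed_bps / link_bps:.1f}% of the link)")
```

Under these assumptions the observed load is a small fraction of what the
link can carry, which would suggest the PCIe width and speed are not the
limiting factor by themselves.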


qdisc mq 0: dev eth0 root
  Sent 67010887991 bytes 146285692 pkt (dropped 2542108, overlimits 0 requeues 4785)
  backlog 0b 0p requeues 4785
qdisc pfifo_fast 0: dev eth0 parent :1 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
  Sent 8680521052 bytes 18552537 pkt (dropped 360936, overlimits 0 requeues 603)
  backlog 0b 0p requeues 603
qdisc pfifo_fast 0: dev eth0 parent :2 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
  Sent 8334648518 bytes 18343972 pkt (dropped 309649, overlimits 0 requeues 598)
  backlog 0b 0p requeues 598
qdisc pfifo_fast 0: dev eth0 parent :3 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
  Sent 8493283599 bytes 18399753 pkt (dropped 335129, overlimits 0 requeues 627)
  backlog 0b 0p requeues 627
qdisc pfifo_fast 0: dev eth0 parent :4 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
  Sent 8364872323 bytes 18106798 pkt (dropped 320538, overlimits 0 requeues 679)
  backlog 0b 0p requeues 679
qdisc pfifo_fast 0: dev eth0 parent :5 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
  Sent 8359453687 bytes 18154455 pkt (dropped 237684, overlimits 0 requeues 593)
  backlog 0b 0p requeues 593
qdisc pfifo_fast 0: dev eth0 parent :6 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
  Sent 8436917858 bytes 18269322 pkt (dropped 466950, overlimits 0 requeues 485)
  backlog 0b 0p requeues 485
qdisc pfifo_fast 0: dev eth0 parent :7 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
  Sent 8093639865 bytes 18074244 pkt (dropped 214268, overlimits 0 requeues 611)
  backlog 0b 0p requeues 611
qdisc pfifo_fast 0: dev eth0 parent :8 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
  Sent 8247552343 bytes 18384612 pkt (dropped 296954, overlimits 0 requeues 589)
  backlog 0b 0p requeues 589
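
To put the drop counters above into proportion, here is a minimal sketch
that parses "tc -s qdisc" style text and computes a drop ratio per qdisc.
It assumes the ratio of interest is dropped / (sent + dropped); the regular
expression and the helper name are my own and only cover the fields shown
in this output. The sample text is copied from the first two entries above.

```python
import re

# Matches one qdisc entry: header line, then its "Sent ... (dropped N," line.
# \s+ is used between tokens so that mail-wrapped lines still match.
RECORD_RE = re.compile(
    r"qdisc\s+(?P<kind>\S+)\s+\S+\s+dev\s+(?P<dev>\S+)\s+"
    r"(?P<where>root|parent\s+\S+)"
    r".*?Sent\s+(?P<bytes>\d+)\s+bytes\s+(?P<pkts>\d+)\s+pkt\s+"
    r"\(dropped\s+(?P<dropped>\d+),",
    re.DOTALL,
)

def drop_stats(text):
    """Return a list of dicts with per-qdisc packet, drop and ratio values."""
    stats = []
    for m in RECORD_RE.finditer(text):
        pkts = int(m.group("pkts"))
        dropped = int(m.group("dropped"))
        stats.append({
            "kind": m.group("kind"),
            "dev": m.group("dev"),
            "where": m.group("where"),
            "pkts": pkts,
            "dropped": dropped,
            # Assumption: "Sent" counts only packets actually dequeued.
            "ratio": dropped / (pkts + dropped),
        })
    return stats

SAMPLE = """\
qdisc mq 0: dev eth0 root
  Sent 67010887991 bytes 146285692 pkt (dropped 2542108, overlimits 0
requeues 4785)
  backlog 0b 0p requeues 4785
qdisc pfifo_fast 0: dev eth0 parent :1 bands 3 priomap  1 2 2 2 1 2 0 0
1 1 1 1 1 1 1 1
  Sent 8680521052 bytes 18552537 pkt (dropped 360936, overlimits 0
requeues 603)
  backlog 0b 0p requeues 603
"""

stats = drop_stats(SAMPLE)
for s in stats:
    print(f"{s['dev']} {s['kind']:<10} {s['where']:<10} "
          f"dropped {s['dropped']:>8} of {s['pkts'] + s['dropped']:>10} "
          f"({100 * s['ratio']:.2f}%)")
```

With these counters the root mq qdisc comes out at a little under 2% of
packets dropped, which is a lot for a link running at roughly a fifth of
its nominal speed.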


Statistics from Juniper
Interface: xe-0/0/10, Enabled, Link is Up
Encapsulation: Ethernet, Speed: 10000mbps
Traffic statistics:                                          Current delta
   Input bytes:           734560246929350 (1950950784 bps)     [751556973]
   Output bytes:          722638733836431 (1870245248 bps)     [727831935]
   Input packets:           1552850512280 (519642 pps)           [1618384]
   Output packets:          1474846114523 (495374 pps)           [1546618]
Error statistics:
   Input errors:                        0                              [0]
   Input drops:                         0                              [0]
   Input framing errors:                0                              [0]
   Policed discards:                    0                              [0]
   L3 incompletes:                      0                              [0]
   L2 channel errors:                   0                              [0]
   L2 mismatch timeouts:                0                              [0]
   Carrier transitions:                31                              [0]
   Output errors:                       0                              [0]
   Output drops:                        0                              [0]
   Aged packets:                        0                              [0]
Active alarms : None
Active defects: None
Input MAC/Filter statistics:
   Unicast packets          1552850511569                        [1618384]
   Broadcast packets                  711                              [0]
   Multicast packets                    0                              [0]
   Oversized frames                     0                              [0]
   Packet reject count                  0                              [0]
   DA rejects                           0                              [0]
   SA rejects                           0                              [0]
Output MAC/Filter Statistics:
   Unicast packets          1474732465555                        [1546497]
   Broadcast packets             74339389                             [85]
   Multicast packets             39309579                             [36]
   Packet pad count                     0                              [0]
   Packet error count                   0                              [0]
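
For reference, the bps and pps rates in the Juniper counters above can be
turned into an average packet size and a link utilisation figure. This is
only a sketch: whether the switch's bps figure includes Ethernet framing
overhead is not known here, so the sizes are approximate, and the function
names are illustrative.

```python
# Average packet size and link utilisation from interface rate counters.
# Values are copied from the Juniper "Traffic statistics" block above.

LINK_SPEED_BPS = 10_000_000_000   # "Speed: 10000mbps"

def avg_packet_bytes(bps, pps):
    """Average bytes per packet implied by a bit rate and a packet rate."""
    return bps / 8 / pps

def utilisation(bps, link_bps=LINK_SPEED_BPS):
    """Fraction of the nominal link speed in use."""
    return bps / link_bps

in_bps, in_pps = 1_950_950_784, 519_642     # switch input  = traffic sent by the server
out_bps, out_pps = 1_870_245_248, 495_374   # switch output = traffic towards the server

avg_in = avg_packet_bytes(in_bps, in_pps)
avg_out = avg_packet_bytes(out_bps, out_pps)

print(f"input : {avg_in:.0f} B/pkt, {100 * utilisation(in_bps):.1f}% of 10G")
print(f"output: {avg_out:.0f} B/pkt, {100 * utilisation(out_bps):.1f}% of 10G")
```

Both directions work out to an average packet size of roughly 470 bytes at
under 20% of the nominal link speed, so the ceiling is reached with
mid-sized packets rather than with a flood of minimum-size frames.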




On 2016-01-13 11:08, Denys Fedoryshchenko wrote:
> Hi
> 
> I am trying to use a Chelsio S310 on haproxy balancers and noticed the
> following problems:
> 
> 1) Latency under a load of 2.5+ Gbps spikes to 10ms+, whereas cards
> from other vendors reach 7-8Gbps without problems.
> 2) I see a lot of drops in the qdisc queues, whereas cards from other
> vendors show no drops at higher loads.
> 
> I suspect it is a problem with either the card or the driver, because I
> am running the tests with similar cards from another vendor on the same
> setup. CPU resources are fine; I am monitoring them with mpstat -P ALL,
> and cpufreq is set to performance (always max frequency on the CPU).
> 
> Kernel 4.4 vanilla
> 
> HTTPS-VISP ~ # ethtool -i eth0
> driver: cxgb3
> version: 1.1.5-ko
> firmware-version: T 7.12.0 TP 1.1.0
> expansion-rom-version:
> bus-info: 0000:05:00.0
> supports-statistics: yes
> supports-test: no
> supports-eeprom-access: yes
> supports-register-dump: yes
> supports-priv-flags: no
> 
> Here are more details about my setup:
> 
> As far as I can see, a 2.5GT/s x8 PCI-Express link is established. I
> also tried perftune.sh; it made no difference.
> 
> 05:00.0 Ethernet controller: Chelsio Communications Inc S310-CR 10GbE
> Single Port Adapter
>         Subsystem: Chelsio Communications Inc Device 0001
>         Physical Slot: 785
>         Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop-
> ParErr+ Stepping- SERR+ FastB2B- DisINTx+
>         Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort-
> <TAbort- <MAbort- >SERR- <PERR- INTx-
>         Latency: 0, Cache Line Size: 32 bytes
>         Interrupt: pin A routed to IRQ 0
>         Region 0: Memory at 92800000 (64-bit, non-prefetchable) 
> [size=4K]
>         Region 2: Memory at 92000000 (64-bit, non-prefetchable) 
> [size=8M]
>         Region 4: Memory at 92801000 (64-bit, non-prefetchable) 
> [size=4K]
>         Expansion ROM at 92880000 [disabled] [size=512K]
>         Capabilities: [40] Power Management version 3
>                 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA
> PME(D0+,D1-,D2-,D3hot+,D3cold-)
>                 Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
>         Capabilities: [48] MSI: Enable- Count=1/32 Maskable- 64bit+
>                 Address: 0000000000000000  Data: 0000
>         Capabilities: [58] Express (v2) Endpoint, MSI 00
>                 DevCap: MaxPayload 4096 bytes, PhantFunc 0, Latency
> L0s <64ns, L1 <1us
>                         ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset-
>                 DevCtl: Report errors: Correctable- Non-Fatal+ Fatal+
> Unsupported-
>                         RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+
>                         MaxPayload 256 bytes, MaxReadReq 512 bytes
>                 DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq+
> AuxPwr- TransPend-
>                 LnkCap: Port #0, Speed 2.5GT/s, Width x8, ASPM L0s L1,
> Exit Latency L0s unlimited, L1 unlimited
>                         ClockPM- Surprise- LLActRep- BwNot- 
> ASPMOptComp-
>                 LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- CommClk+
>                         ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
>                 LnkSta: Speed 2.5GT/s, Width x8, TrErr- Train-
> SlotClk+ DLActive- BWMgmt- ABWMgmt-
>                 DevCap2: Completion Timeout: Range ABC, TimeoutDis-,
> LTR-, OBFF Not Supported
>                 DevCtl2: Completion Timeout: 50us to 50ms,
> TimeoutDis-, LTR-, OBFF Disabled
>                 LnkCtl2: Target Link Speed: 2.5GT/s, EnterCompliance- 
> SpeedDis-
>                          Transmit Margin: Normal Operating Range,
> EnterModifiedCompliance- ComplianceSOS-
>                          Compliance De-emphasis: -6dB
>                 LnkSta2: Current De-emphasis Level: -6dB,
> EqualizationComplete-, EqualizationPhase1-
>                          EqualizationPhase2-, EqualizationPhase3-,
> LinkEqualizationRequest-
>         Capabilities: [94] Vital Product Data
>                 Unknown small resource type 00, will not decode more.
>         Capabilities: [9c] MSI-X: Enable+ Count=32 Masked-
>                 Vector table: BAR=4 offset=00000000
>                 PBA: BAR=4 offset=00000800
>         Capabilities: [100 v1] Device Serial Number 
> 00-00-00-01-00-00-00-01
>         Capabilities: [300 v1] Advanced Error Reporting
>                 UESta:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt-
> UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
>                 UEMsk:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt+
> UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq+ ACSViol-
>                 UESvrt: DLP+ SDES+ TLP+ FCP+ CmpltTO+ CmpltAbrt+
> UnxCmplt+ RxOF+ MalfTLP+ ECRC+ UnsupReq- ACSViol-
>                 CESta:  RxErr- BadTLP- BadDLLP- Rollover- Timeout- 
> NonFatalErr-
>                 CEMsk:  RxErr- BadTLP- BadDLLP- Rollover- Timeout- 
> NonFatalErr+
>                 AERCap: First Error Pointer: 14, GenCap+ CGenEn- 
> ChkCap+ ChkEn-
>         Kernel driver in use: cxgb3
> 
> With an uptime of just 10 hours, during the lowest-load period:
> 
> BALANCER ~ # tc -s -d qdisc
> qdisc mq 0: dev eth0 root
>  Sent 3445096608552 bytes 2777441550 pkt (dropped 6509380, overlimits 0 requeues 1156)
>  backlog 0b 0p requeues 1156
> qdisc pfifo_fast 0: dev eth0 parent :1 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
>  Sent 428721781866 bytes 880919921 pkt (dropped 1403007, overlimits 0 requeues 243)
>  backlog 0b 0p requeues 243
> qdisc pfifo_fast 0: dev eth0 parent :2 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
>  Sent 429526790584 bytes 879237430 pkt (dropped 560836, overlimits 0 requeues 102)
>  backlog 0b 0p requeues 102
> qdisc pfifo_fast 0: dev eth0 parent :3 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
>  Sent 427233606632 bytes 882026985 pkt (dropped 1255127, overlimits 0 requeues 221)
>  backlog 0b 0p requeues 221
> qdisc pfifo_fast 0: dev eth0 parent :4 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
>  Sent 429138959430 bytes 882420516 pkt (dropped 914917, overlimits 0 requeues 164)
>  backlog 0b 0p requeues 164
> qdisc pfifo_fast 0: dev eth0 parent :5 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
>  Sent 435498527047 bytes 888437887 pkt (dropped 325962, overlimits 0 requeues 60)
>  backlog 0b 0p requeues 60
> qdisc pfifo_fast 0: dev eth0 parent :6 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
>  Sent 427458674980 bytes 881585256 pkt (dropped 610681, overlimits 0 requeues 113)
>  backlog 0b 0p requeues 113
> qdisc pfifo_fast 0: dev eth0 parent :7 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
>  Sent 435164189156 bytes 891730352 pkt (dropped 891579, overlimits 0 requeues 156)
>  backlog 0b 0p requeues 156
> qdisc pfifo_fast 0: dev eth0 parent :8 bands 3 priomap  1 2 2 2 1 2 0 0 1 1 1 1 1 1 1 1
>  Sent 432354080111 bytes 886050500 pkt (dropped 547271, overlimits 0 requeues 97)
