lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <1c03e797a32cfe05a6269e8f219e2f62@visp.net.lb>
Date:	Sat, 07 Dec 2013 08:32:15 +0200
From:	Denys Fedoryshchenko <nuclearcat@...learcat.com>
To:	e1000-devel@...ts.sourceforge.net, netdev@...r.kernel.org,
	jesse.brandeburg@...el.com
Subject: e1000e, kernel 3.12.3, packetloss and periodic Detected Hardware
 Unit Hang

Hi

On one of clients got new hardware and i started to get periodic 
watchdog/hang messages in dmesg and packetloss on e1000e driver, while 
load on device is relatively low.

Here is lspci, ifconfig, ethtool and dmesg information


00:19.0 Ethernet controller: Intel Corporation Device 1503 (rev 04)
         Subsystem: Intel Corporation Device 2031
         Flags: bus master, fast devsel, latency 0, IRQ 43
         Memory at f7c00000 (32-bit, non-prefetchable) [size=128K]
         Memory at f7c35000 (32-bit, non-prefetchable) [size=4K]
         I/O ports at f080 [size=32]
         Capabilities: [c8] Power Management version 2
         Capabilities: [d0] MSI: Enable+ Count=1/1 Maskable- 64bit+
         Capabilities: [e0] PCI Advanced Features
         Kernel driver in use: e1000e

BALANCER-WORLDNET ~ # ethtool -i eth0
driver: e1000e
version: 2.3.2-k
firmware-version: 0.13-5
bus-info: 0000:00:19.0
supports-statistics: yes
supports-test: yes
supports-eeprom-access: yes
supports-register-dump: yes
supports-priv-flags: no

BALANCER-WORLDNET ~ # ethtool -S eth0
NIC statistics:
      rx_packets: 1500180288
      tx_packets: 1555842579
      rx_bytes: 1154556141222
      tx_bytes: 1139141840890
      rx_broadcast: 168035
      tx_broadcast: 74
      rx_multicast: 40
      tx_multicast: 0
      rx_errors: 0
      tx_errors: 0
      tx_dropped: 0
      multicast: 40
      collisions: 0
      rx_length_errors: 0
      rx_over_errors: 0
      rx_crc_errors: 0
      rx_frame_errors: 0
      rx_no_buffer_count: 0
      rx_missed_errors: 56073
      tx_aborted_errors: 0
      tx_carrier_errors: 0
      tx_fifo_errors: 0
      tx_heartbeat_errors: 0
      tx_window_errors: 0
      tx_abort_late_coll: 0
      tx_deferred_ok: 0
      tx_single_coll_ok: 0
      tx_multi_coll_ok: 0
      tx_timeout_count: 4
      tx_restart_queue: 5
      rx_long_length_errors: 0
      rx_short_length_errors: 0
      rx_align_errors: 0
      tx_tcp_seg_good: 195863453
      tx_tcp_seg_failed: 0
      rx_flow_control_xon: 0
      rx_flow_control_xoff: 0
      tx_flow_control_xon: 0
      tx_flow_control_xoff: 0
      rx_csum_offload_good: 1500000575
      rx_csum_offload_errors: 1104
      rx_header_split: 0
      alloc_rx_buff_failed: 0
      tx_smbus: 0
      rx_smbus: 0
      dropped_smbus: 0
      rx_dma_failed: 0
      tx_dma_failed: 0
      rx_hwtstamp_cleared: 0
      uncorr_ecc_errors: 0
      corr_ecc_errors: 0

BALANCER-WORLDNET ~ # ethtool -d eth0
MAC Registers
-------------
0x00000: CTRL (Device control register)  0x40100240
       Endian mode (buffers):             little
       Link reset:                        normal
       Set link up:                       1
       Invert Loss-Of-Signal:             no
       Receive flow control:              disabled
       Transmit flow control:             disabled
       VLAN mode:                         enabled
       Auto speed detect:                 disabled
       Speed select:                      1000Mb/s
       Force speed:                       no
       Force duplex:                      no
0x00008: STATUS (Device status register) 0x40080083
       Duplex:                            full
       Link up:                           link config
       TBI mode:                          disabled
       Link speed:                        1000Mb/s
       Bus type:                          PCI
       Bus speed:                         33MHz
       Bus width:                         32-bit
0x00100: RCTL (Receive control register) 0x04008002
       Receiver:                          enabled
       Store bad packets:                 disabled
       Unicast promiscuous:               disabled
       Multicast promiscuous:             disabled
       Long packet:                       disabled
       Descriptor minimum threshold size: 1/2
       Broadcast accept mode:             accept
       VLAN filter:                       disabled
       Canonical form indicator:          disabled
       Discard pause frames:              filtered
       Pass MAC control frames:           don't pass
       Receive buffer size:               2048
0x02808: RDLEN (Receive desc length)     0x00001000
0x02810: RDH   (Receive desc head)       0x000000E1
0x02818: RDT   (Receive desc tail)       0x000000D0
0x02820: RDTR  (Receive delay timer)     0x00000000
0x00400: TCTL (Transmit ctrl register)   0x3003F0FA
       Transmitter:                       enabled
       Pad short packets:                 enabled
       Software XOFF Transmission:        disabled
       Re-transmit on late collision:     disabled
0x03808: TDLEN (Transmit desc length)    0x00001000
0x03810: TDH   (Transmit desc head)      0x00000036
0x03818: TDT   (Transmit desc tail)      0x00000036
0x03820: TIDV  (Transmit delay timer)    0x00000008
PHY type:                                unknown


BALANCER-WORLDNET ~ # ethtool eth0
Settings for eth0:
         Supported ports: [ TP ]
         Supported link modes:   10baseT/Half 10baseT/Full
                                 100baseT/Half 100baseT/Full
                                 1000baseT/Full
         Supported pause frame use: No
         Supports auto-negotiation: Yes
         Advertised link modes:  10baseT/Half 10baseT/Full
                                 100baseT/Half 100baseT/Full
                                 1000baseT/Full
         Advertised pause frame use: No
         Advertised auto-negotiation: Yes
         Speed: 1000Mb/s
         Duplex: Full
         Port: Twisted Pair
         PHYAD: 2
         Transceiver: internal
         Auto-negotiation: on
         MDI-X: off (auto)
         Supports Wake-on: pumbg
         Wake-on: g
         Current message level: 0x00000007 (7)
                                drv probe link
         Link detected: yes




BALANCER-WORLDNET ~ # ethtool -e eth0
Offset          Values
------          ------
0x0000:         00 1e 8c f4 5a e6 00 08 ff ff d5 00 ff ff ff ff
0x0010:         ff ff ff ff c3 10 31 20 86 80 03 15 00 00 00 00
0x0020:         02 07 00 00 00 00 05 a5 28 30 00 1a 00 00 00 0c
0x0030:         f4 18 40 0b 43 08 13 01 02 15 ad ba 02 15 03 15
0x0040:         ad ba ad ba ad ba 02 15 00 80 90 80 00 4e 86 08
0x0050:         00 00 00 00 07 00 00 00 00 00 00 00 00 00 ff ff
0x0060:         00 01 00 40 51 13 07 40 ff ff ff ff ff ff ff ff
0x0070:         ff ff ff ff ff ff ff ff ff ff 00 01 ff ff a7 97
0x0080:         00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
0x0090:         00 00 00 00 00 00 ff ff ff ff ff ff ff ff ff ff
0x00a0:         02 34 30 00 14 02 31 00 36 38 30 00 0f 00 31 00
0x00b0:         37 38 30 00 0a 00 31 00 38 38 30 00 10 00 31 00
0x00c0:         3a 38 30 00 03 00 31 00 ae 38 30 00 18 00 31 00
0x00d0:         af 38 30 00 18 00 31 00 b0 38 30 00 18 00 31 00
0x00e0:         1a 84 32 00 4c 52 3a 00 00 00 32 00 40 60 1f 00
0x00f0:         04 d1 11 00 80 60 1f 00 00 cc 10 00 80 08 15 00
0x0100:         d5 35 13 00 00 00 1f 00 ff ff ff ff ff ff ff ff
0x0110:         ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff
(rest are mostly ff and 00 values)

eth0      Link encap:Ethernet  HWaddr 00:1E:8C:F4:5A:E6
           inet addr:10.0.254.6  Bcast:10.0.254.31  Mask:255.255.255.224
           UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
           RX packets:1492876627 errors:0 dropped:110395 overruns:0 
frame:0
           TX packets:1549628201 errors:0 dropped:0 overruns:0 carrier:0
           collisions:0 txqueuelen:1000
           RX bytes:1148839270774 (1.0 TiB)  TX bytes:1133594933258 (1.0 
TiB)
           Interrupt:20 Memory:f7c00000-f7c20000


[17929.990868] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
[17929.990868]   TDH                  <78>
[17929.990868]   TDT                  <24>
[17929.990868]   next_to_use          <24>
[17929.990868]   next_to_clean        <76>
[17929.990868] buffer_info[next_to_clean]:
[17929.990868]   time_stamp           <1010cde6a>
[17929.990868]   next_to_watch        <78>
[17929.990868]   jiffies              <1010ce2cc>
[17929.990868]   next_to_watch.status <0>
[17929.990868] MAC Status             <40080083>
[17929.990868] PHY Status             <796d>
[17929.990868] PHY 1000BASE-T Status  <3800>
[17929.990868] PHY Extended Status    <3000>
[17929.990868] PCI Status             <10>
[17931.991763] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
[17931.991763]   TDH                  <78>
[17931.991763]   TDT                  <24>
[17931.991763]   next_to_use          <24>
[17931.991763]   next_to_clean        <76>
[17931.991763] buffer_info[next_to_clean]:
[17931.991763]   time_stamp           <1010cde6a>
[17931.991763]   next_to_watch        <78>
[17931.991763]   jiffies              <1010cea9c>
[17931.991763]   next_to_watch.status <0>
[17931.991763] MAC Status             <40080083>
[17931.991763] PHY Status             <796d>
[17931.991763] PHY 1000BASE-T Status  <3800>
[17931.991763] PHY Extended Status    <3000>
[17931.991763] PCI Status             <10>
[17933.992670] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
[17933.992670]   TDH                  <78>
[17933.992670]   TDT                  <24>
[17933.992670]   next_to_use          <24>
[17933.992670]   next_to_clean        <76>
[17933.992670] buffer_info[next_to_clean]:
[17933.992670]   time_stamp           <1010cde6a>
[17933.992670]   next_to_watch        <78>
[17933.992670]   jiffies              <1010cf26c>
[17933.992670]   next_to_watch.status <0>
[17933.992670] MAC Status             <40080083>
[17933.992670] PHY Status             <796d>
[17933.992670] PHY 1000BASE-T Status  <3800>
[17933.992670] PHY Extended Status    <3000>
[17933.992670] PCI Status             <10>
[17933.995923] ------------[ cut here ]------------
[17933.996003] WARNING: CPU: 0 PID: 0 at net/sched/sch_generic.c:264 
dev_watchdog+0x14d/0x1fd()
[17933.996122] NETDEV WATCHDOG: eth0 (e1000e): transmit queue 0 timed 
out
[17933.996185] Modules linked in: xt_tcpudp xt_mark iptable_mangle 
ip_tables x_tables 8021q garp stp mrp llc
[17933.997772] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 
3.12.3-build-0006 #6
[17933.997837] Hardware name:                  /DH77KC, BIOS 
KCH7710H.86A.0069.2012.0224.1825 02/24/2012
[17933.997957]  0000000000000009 ffff88031f203d80 ffffffff8151971a 
0000000000000007
[17933.998078]  ffff88031f203dd0 ffff88031f203dc0 ffffffff810356df 
ffffffff81059db4
[17933.998197]  ffffffff814bcdb7 ffff880310768000 ffff880310339000 
0000000000000001
[17933.998316] Call Trace:
[17933.998374]  <IRQ>  [<ffffffff8151971a>] dump_stack+0x46/0x58
[17933.998447]  [<ffffffff810356df>] warn_slowpath_common+0x77/0x91
[17933.998513]  [<ffffffff81059db4>] ? update_curr+0x5a/0xa8
[17933.998576]  [<ffffffff814bcdb7>] ? dev_watchdog+0x14d/0x1fd
[17933.998640]  [<ffffffff8103578d>] warn_slowpath_fmt+0x41/0x43
[17933.998705]  [<ffffffff814bcdb7>] dev_watchdog+0x14d/0x1fd
[17933.998769]  [<ffffffff814bcc6a>] ? 
psched_ratecfg_precompute+0x61/0x61
[17933.998835]  [<ffffffff8103e1b6>] call_timer_fn.isra.27+0x25/0x7f
[17933.998905]  [<ffffffff8103e357>] run_timer_softirq+0x147/0x183
[17933.998969]  [<ffffffff810390bf>] __do_softirq+0xb7/0x16d
[17933.999034]  [<ffffffff81520abc>] call_softirq+0x1c/0x30
[17933.999100]  [<ffffffff81004392>] do_softirq+0x32/0x68
[17933.999162]  [<ffffffff81039248>] irq_exit+0x3e/0x83
[17933.999227]  [<ffffffff81025270>] smp_apic_timer_interrupt+0x40/0x4d
[17933.999292]  [<ffffffff81520447>] apic_timer_interrupt+0x67/0x70
[17933.999354]  <EOI>  [<ffffffff81479426>] ? 
cpuidle_enter_state+0x49/0xac
[17933.999427]  [<ffffffff8147941f>] ? cpuidle_enter_state+0x42/0xac
[17933.999491]  [<ffffffff8147954c>] cpuidle_idle_call+0xc3/0x10f
[17933.999557]  [<ffffffff8100a0d6>] arch_cpu_idle+0x9/0x18
[17933.999621]  [<ffffffff81064e92>] cpu_startup_entry+0xf6/0x154
[17933.999685]  [<ffffffff81513d1e>] rest_init+0x72/0x74
[17933.999750]  [<ffffffff818f1c74>] start_kernel+0x38f/0x39c
[17933.999814]  [<ffffffff818f16ed>] ? repair_env_string+0x5a/0x5a
[17933.999879]  [<ffffffff818f143e>] x86_64_start_reservations+0x2a/0x2c
[17933.999950]  [<ffffffff818f14f1>] x86_64_start_kernel+0xb1/0xb5
[17934.000013] ---[ end trace 768f97ac33fb6771 ]---
[17934.000089] e1000e 0000:00:19.0 eth0: Reset adapter unexpectedly
[17937.928779] e1000e: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow 
Control: None
[40172.244444] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
[40172.244444]   TDH                  <27>
[40172.244444]   TDT                  <12>
[40172.244444]   next_to_use          <12>
[40172.244444]   next_to_clean        <25>
[40172.244444] buffer_info[next_to_clean]:
[40172.244444]   time_stamp           <10260192d>
[40172.244444]   next_to_watch        <27>
[40172.244444]   jiffies              <102601e8c>
[40172.244444]   next_to_watch.status <0>
[40172.244444] MAC Status             <40080083>
[40172.244444] PHY Status             <796d>
[40172.244444] PHY 1000BASE-T Status  <3800>
[40172.244444] PHY Extended Status    <3000>
[40172.244444] PCI Status             <10>
[40174.245374] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
[40174.245374]   TDH                  <27>
[40174.245374]   TDT                  <12>
[40174.245374]   next_to_use          <12>
[40174.245374]   next_to_clean        <25>
[40174.245374] buffer_info[next_to_clean]:
[40174.245374]   time_stamp           <10260192d>
[40174.245374]   next_to_watch        <27>
[40174.245374]   jiffies              <10260265c>
[40174.245374]   next_to_watch.status <0>
[40174.245374] MAC Status             <40080083>
[40174.245374] PHY Status             <796d>
[40174.245374] PHY 1000BASE-T Status  <3800>
[40174.245374] PHY Extended Status    <3000>
[40174.245374] PCI Status             <10>
[40176.246288] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
[40176.246288]   TDH                  <27>
[40176.246288]   TDT                  <12>
[40176.246288]   next_to_use          <12>
[40176.246288]   next_to_clean        <25>
[40176.246288] buffer_info[next_to_clean]:
[40176.246288]   time_stamp           <10260192d>
[40176.246288]   next_to_watch        <27>
[40176.246288]   jiffies              <102602e2c>
[40176.246288]   next_to_watch.status <0>
[40176.246288] MAC Status             <40080083>
[40176.246288] PHY Status             <796d>
[40176.246288] PHY 1000BASE-T Status  <3800>
[40176.246288] PHY Extended Status    <3000>
[40176.246288] PCI Status             <10>
[40178.247156] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
[40178.247156]   TDH                  <27>
[40178.247156]   TDT                  <12>
[40178.247156]   next_to_use          <12>
[40178.247156]   next_to_clean        <25>
[40178.247156] buffer_info[next_to_clean]:
[40178.247156]   time_stamp           <10260192d>
[40178.247156]   next_to_watch        <27>
[40178.247156]   jiffies              <1026035fc>
[40178.247156]   next_to_watch.status <0>
[40178.247156] MAC Status             <40080083>
[40178.247156] PHY Status             <796d>
[40178.247156] PHY 1000BASE-T Status  <3800>
[40178.247156] PHY Extended Status    <3000>
[40178.247156] PCI Status             <10>
[40178.250476] e1000e 0000:00:19.0 eth0: Reset adapter unexpectedly
[40182.163300] e1000e: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow 
Control: None
[46621.217377] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
[46621.217377]   TDH                  <67>
[46621.217377]   TDT                  <b4>
[46621.217377]   next_to_use          <b4>
[46621.217377]   next_to_clean        <66>
[46621.217377] buffer_info[next_to_clean]:
[46621.217377]   time_stamp           <102c2716b>
[46621.217377]   next_to_watch        <67>
[46621.217377]   jiffies              <102c27a3c>
[46621.217377]   next_to_watch.status <0>
[46621.217377] MAC Status             <40080083>
[46621.217377] PHY Status             <796d>
[46621.217377] PHY 1000BASE-T Status  <3800>
[46621.217377] PHY Extended Status    <3000>
[46621.217377] PCI Status             <10>
[46623.218258] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
[46623.218258]   TDH                  <67>
[46623.218258]   TDT                  <b4>
[46623.218258]   next_to_use          <b4>
[46623.218258]   next_to_clean        <66>
[46623.218258] buffer_info[next_to_clean]:
[46623.218258]   time_stamp           <102c2716b>
[46623.218258]   next_to_watch        <67>
[46623.218258]   jiffies              <102c2820c>
[46623.218258]   next_to_watch.status <0>
[46623.218258] MAC Status             <40080083>
[46623.218258] PHY Status             <796d>
[46623.218258] PHY 1000BASE-T Status  <3800>
[46623.218258] PHY Extended Status    <3000>
[46623.218258] PCI Status             <10>
[46625.219248] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
[46625.219248]   TDH                  <67>
[46625.219248]   TDT                  <b4>
[46625.219248]   next_to_use          <b4>
[46625.219248]   next_to_clean        <66>
[46625.219248] buffer_info[next_to_clean]:
[46625.219248]   time_stamp           <102c2716b>
[46625.219248]   next_to_watch        <67>
[46625.219248]   jiffies              <102c289dc>
[46625.219248]   next_to_watch.status <0>
[46625.219248] MAC Status             <40080083>
[46625.219248] PHY Status             <796d>
[46625.219248] PHY 1000BASE-T Status  <3800>
[46625.219248] PHY Extended Status    <3000>
[46625.219248] PCI Status             <10>
[46625.222534] e1000e 0000:00:19.0 eth0: Reset adapter unexpectedly
[46629.154379] e1000e: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow 
Control: None
[59201.016664] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
[59201.016664]   TDH                  <ba>
[59201.016664]   TDT                  <a5>
[59201.016664]   next_to_use          <a5>
[59201.016664]   next_to_clean        <b8>
[59201.016664] buffer_info[next_to_clean]:
[59201.016664]   time_stamp           <103824c6c>
[59201.016664]   next_to_watch        <ba>
[59201.016664]   jiffies              <10382576c>
[59201.016664]   next_to_watch.status <0>
[59201.016664] MAC Status             <40080083>
[59201.016664] PHY Status             <796d>
[59201.016664] PHY 1000BASE-T Status  <3800>
[59201.016664] PHY Extended Status    <3000>
[59201.016664] PCI Status             <10>
[59203.017583] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
[59203.017583]   TDH                  <ba>
[59203.017583]   TDT                  <a5>
[59203.017583]   next_to_use          <a5>
[59203.017583]   next_to_clean        <b8>
[59203.017583] buffer_info[next_to_clean]:
[59203.017583]   time_stamp           <103824c6c>
[59203.017583]   next_to_watch        <ba>
[59203.017583]   jiffies              <103825f3c>
[59203.017583]   next_to_watch.status <0>
[59203.017583] MAC Status             <40080083>
[59203.017583] PHY Status             <796d>
[59203.017583] PHY 1000BASE-T Status  <3800>
[59203.017583] PHY Extended Status    <3000>
[59203.017583] PCI Status             <10>
[59205.018454] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
[59205.018454]   TDH                  <ba>
[59205.018454]   TDT                  <a5>
[59205.018454]   next_to_use          <a5>
[59205.018454]   next_to_clean        <b8>
[59205.018454] buffer_info[next_to_clean]:
[59205.018454]   time_stamp           <103824c6c>
[59205.018454]   next_to_watch        <ba>
[59205.018454]   jiffies              <10382670c>
[59205.018454]   next_to_watch.status <0>
[59205.018454] MAC Status             <40080083>
[59205.018454] PHY Status             <796d>
[59205.018454] PHY 1000BASE-T Status  <3800>
[59205.018454] PHY Extended Status    <3000>
[59205.018454] PCI Status             <10>
[59205.021788] e1000e 0000:00:19.0 eth0: Reset adapter unexpectedly
[59208.789543] e1000e: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow 
Control: None
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ