lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-ID: <CAL8zT=gNwDDXcbH+hoZApc63ArW2quzHJ1gRrPSxKP7jE04KLw@mail.gmail.com>
Date:	Wed, 18 Jan 2012 12:33:39 +0100
From:	Jean-Michel Hautbois <jhautbois@...il.com>
To:	netdev@...r.kernel.org
Subject: Interpreting perf stat on netperf and netserver

Hi all,

I am currently using netperf/netserver in order to characterize a
benet emulex network device on a machine with 2 Xeon5670.
I am using the latest linux kernel from git (3.2.0+).
I am facing several issues, and I am trying to understand the
following perf stat launched on netserver :

 Performance counter stats for process id '5043':

      15452.992135 task-clock                #    0.450 CPUs utilized
            189678 context-switches          #    0.012 M/sec
                 5 CPU-migrations            #    0.000 M/sec
               275 page-faults               #    0.000 M/sec
       48490467936 cycles                    #    3.138 GHz
       33005879963 stalled-cycles-frontend   #   68.07% frontend cycles idle
       16325855769 stalled-cycles-backend    #   33.67% backend  cycles idle
       27340520316 instructions              #    0.56  insns per cycle
                                             #    1.21  stalled cycles per insn
        4745604818 branches                  #  307.099 M/sec
          67513124 branch-misses             #    1.42% of all branches

      34.303567279 seconds time elapsed

I am trying to understand the "stalled-cycles-frontend" and
"stalled-cycles-backend" lines.
It seems that frontend is high, and in red :) but I can't say why...

The be2net driver seems to have difficulties woth IRQ affinity also,
because it always uses CPU0 even if the affinity is 0-23 !
netperf result is quite good, and perf top shows :
   PerfTop:     689 irqs/sec  kernel:99.6% us: 0.3% guest kernel: 0.0%
guest us: 0.0% exact:  0.0% [1000Hz cycles],  (all, 24 CPUs)
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

    29.28%  [kernel]          [k] csum_partial
     8.46%  [kernel]          [k] copy_user_generic_string
     3.70%  [be2net]          [k] be_poll_rx
     3.08%  [be2net]          [k] event_handle
     2.37%  [kernel]          [k] irq_entries_start
     2.21%  [be2net]          [k] be_rx_compl_get
     1.65%  [be2net]          [k] be_post_rx_frags
     1.64%  [kernel]          [k] __napi_complete
     1.50%  [kernel]          [k] ip_defrag
     1.35%  [kernel]          [k] put_page
     1.34%  [kernel]          [k] get_page_from_freelist
     1.29%  [kernel]          [k] __netif_receive_skb
     1.16%  [kernel]          [k] __alloc_pages_nodemask
     1.14%  [kernel]          [k] debug_smp_processor_id
     1.08%  [kernel]          [k] add_preempt_count
     1.06%  [kernel]          [k] sub_preempt_count
     1.03%  [be2net]          [k] get_rx_page_info
     1.01%  [kernel]          [k] alloc_pages_current

Checksum calculation seems quite complex :).
Regards,
JM
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ