lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Date:	Mon, 05 Nov 2012 08:58:39 -0800
From:	Eric Dumazet <eric.dumazet@...il.com>
To:	Vimalkumar <j.vimal@...il.com>
Cc:	davem@...emloft.net, Jamal Hadi Salim <jhs@...atatu.com>,
	netdev@...r.kernel.org
Subject: Re: [PATCH net-next V3] htb: improved accuracy at high rates

On Wed, 2012-10-31 at 09:04 -0700, Vimalkumar wrote:
> Current HTB (and TBF) uses rate table computed by the "tc"
> userspace program, which has the following issue:
> 
> The rate table has 256 entries to map packet lengths
> to token (time units).  With TSO sized packets, the
> 256 entry granularity leads to loss/gain of rate,
> making the token bucket inaccurate.
> 
> Thus, instead of relying on rate table, this patch
> explicitly computes the time and accounts for packet
> transmission times with nanosecond granularity.
> 
> This greatly improves accuracy of HTB with a wide
> range of packet sizes.
> 
> Example:
> 
> tc qdisc add dev $dev root handle 1: \
>         htb default 1
> 
> tc class add dev $dev classid 1:1 parent 1: \
>         rate 5Gbit mtu 64k
> 
> Here is an example of inaccuracy:
> 
> $ iperf -c host -t 10 -i 1
> 
> With old htb:
> eth4:   34.76 Mb/s In  5827.98 Mb/s Out -  65836.0 p/s In  481273.0 p/s Out
> [SUM]  9.0-10.0 sec   669 MBytes  5.61 Gbits/sec
> [SUM]  0.0-10.0 sec  6.50 GBytes  5.58 Gbits/sec
> 
> With new htb:
> eth4:   28.36 Mb/s In  5208.06 Mb/s Out -  53704.0 p/s In  430076.0 p/s Out
> [SUM]  9.0-10.0 sec   594 MBytes  4.98 Gbits/sec
> [SUM]  0.0-10.0 sec  5.80 GBytes  4.98 Gbits/sec
> 
> The bits per second on the wire is still 5200Mb/s with new HTB
> because qdisc accounts for packet length using skb->len, which
> is smaller than total bytes on the wire if GSO is used.  But
> that is for another patch regardless of how time is accounted.
> 
> Many thanks to Eric Dumazet for review and feedback.
> 
> Signed-off-by: Vimalkumar <j.vimal@...il.com>
> ---
>  net/sched/sch_htb.c |  128 ++++++++++++++++++++++++++++++++++++---------------
>  1 files changed, 90 insertions(+), 38 deletions(-)
> 

> @@ -684,7 +729,7 @@ static psched_time_t htb_do_events(struct htb_sched *q, int level,
>  			return cl->pq_key;
>  
>  		htb_safe_rb_erase(p, q->wait_pq + level);
> -		diff = psched_tdiff_bounded(q->now, cl->t_c, cl->mbuffer);
> +		diff = min_t(s64, q->now - cl->t_c, cl->mbuffer);
>  		htb_change_class_mode(q, cl, &diff);
>  		if (cl->cmode != HTB_CAN_SEND)
>  			htb_add_to_wait_tree(q, cl, diff);
> @@ -834,7 +879,6 @@ next:
>  	} while (cl != start);
>  
>  	if (likely(skb != NULL)) {
> -		bstats_update(&cl->bstats, skb);
>  		cl->un.leaf.deficit[level] -= qdisc_pkt_len(skb);
>  		if (cl->un.leaf.deficit[level] < 0) {
>  			cl->un.leaf.deficit[level] += cl->quantum;
> @@ -871,10 +915,10 @@ ok:
>  

Why this bstats_update() call was removed ?



--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ