[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20190110144812.mkbokbj2iyryj6lv@breakpoint.cc>
Date:   Thu, 10 Jan 2019 15:48:12 +0100
From:   Florian Westphal <fw@...len.de>
To:     Peter Zijlstra <peterz@...radead.org>
Cc:     Florian Westphal <fw@...len.de>,
        Anatol Pomozov <anatol.pomozov@...il.com>,
        Dmitry Vyukov <dvyukov@...gle.com>, paulmck@...ux.ibm.com,
        LKML <linux-kernel@...r.kernel.org>
Subject: Re: seqcount usage in xt_replace_table()
Peter Zijlstra <peterz@...radead.org> wrote:
> 	/*
> 	 * Ensure contents of newinfo are visible before assigning to
> 	 * private.
> 	 */
> 	smp_wmb();
> 	table->private = newinfo;
> 
> we have:
> 
> 	smp_store_release(&table->private, newinfo);
> 
> But what store does that second smp_wmb() order against? The comment:
> 
> 	/* make sure all cpus see new ->private value */
> 	smp_wmb();
> 
> makes no sense what so ever, no smp_*() barrier can provide such
> guarantees.
IIRC I added this at the request of a reviewer of an earlier iteration
of that patch.
IIRC the concern was that compiler/hw could re-order
smb_wmb();
table->private = newinfo;
/* wait until all cpus are done with old table */
 into:
smb_wmb();
/* wait until all cpus are done with old table */
...
table->private = newinfo;
and that (obviously) makes the wait-loop useless.
> > Only alternative I see that might work is synchronize_rcu (the
> > _do_table functions are called with rcu read lock held).
> > 
> > I guess current scheme is cheaper though.
> 
> Is performance a concern in this path? There is no comment justifying
> this 'creative' stuff.
We have to wait until all cpus are done with current iptables ruleset.
Before this 'creative' change, this relied on get_counters
synchronization.  And that caused wait times of 30 seconds or more on
busy systems.
I have no objections swapping this with a synchronize_rcu() if that
makes it easier.
(synchronize_rcu might be confusing though, as we don't use rcu
 for table->private), and one 'has to know' that all the netfilter
 hooks, including the iptables eval loop, run with rcu_read_lock
 held).
Powered by blists - more mailing lists
 
