Message-ID: <50AB3F9D.4070905@canonical.com>
Date: Tue, 20 Nov 2012 09:30:21 +0100
From: Stefan Bader <stefan.bader@...onical.com>
To: Sander Eikelenboom <linux@...elenboom.it>
CC: ANNIE LI <annie.li@...cle.com>,
Ian Campbell <Ian.Campbell@...rix.com>,
Eric Dumazet <eric.dumazet@...il.com>,
"netdev@...r.kernel.org" <netdev@...r.kernel.org>,
"Marcos E. Matsunaga" <Marcos.Matsunaga@...cle.com>,
xen-devel <xen-devel@...ts.xen.org>,
Konrad Rzeszutek Wilk <konrad@...nel.org>,
Eric Dumazet <edumazet@...gle.com>
Subject: Re: [Xen-devel] compound skb frag pages appearing in start_xmit
On 19.11.2012 16:43, Sander Eikelenboom wrote:
>
> Thursday, November 15, 2012, 3:31:42 AM, you wrote:
>
>> On 2012-10-11 18:14, Ian Campbell wrote:
>>> On Thu, 2012-10-11 at 11:05 +0100, Eric Dumazet wrote:
>>>> On Thu, 2012-10-11 at 12:00 +0200, Sander Eikelenboom wrote:
>>>>
>>>>> Probably due to the BUG_ON from the patch below, I changed it into a WARN_ON.
>>>>> And I seem to hit it, but only in one of the guests at the moment, and it triggers quite irregularly.
>>>> xennet_make_frags() is able to split the skb->head into multiple
>>>> page-size chunks.
>>>>
>>>> It should do the same for fragments.
>>> Right, I just want to be able to reproduce the issue so I can know
>>> I've fixed it properly ;-)
>> Hi Ian,
>
>> I can reproduce this BUG_ON when running a netperf/netserver test between
>> two domUs running on the same dom0. The domUs and dom0 all use v3.7-rc1.
>
>> When I tried to rebase my persistent grant netfront/netback patch on
>> the latest kernel, the netperf/netserver test never succeeded. I did some
>> tests and found that v3.6-rc7 works fine, but v3.7-rc1, v3.7-rc2 and
>> v3.7-rc4 do not pass the netperf/netserver test. So for now I keep my
>> persistent grant patch based only on v3.4-rc3.
>
>> Konrad suspected commit 6a8ed462f16b8455eec5ae00eb6014159a6721f0 in
>> v3.7-rc1, and suggested that I test your debug patch in netfront. This
>> BUG_ON happens soon after starting the netperf/netserver test case.
>
>> Thanks
>> Annie
>
> Is there any progress on this bug? (rc6 is out the door, so the release of 3.7-final seems to be imminent, and this bug completely cripples any networking with guests.)
>
+1 on that. I was testing yesterday with a PVM domU running 3.7-rc5 on Xen 4.2
(but this was also reported from EC2 running Xen 3.4.3) with one VCPU. I can
actually trigger it by just ssh'ing into the domU (from another machine) and
then running "find /". Output starts to stutter and then stops completely. When
this happens a new connection can still be made, and as long as only shorter
output is generated the ssh connection is ok. From a dump taken it looks like
user-space is waiting in some select call (without any WARN_ON I probably won't
see the tx path).
-Stefan
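
For readers following the thread: the suggestion quoted above is that
fragments should be split into page-size chunks the same way skb->head
already is. The following is only a user-space sketch of that arithmetic,
not the netfront code and not any proposed patch. The names split_frag,
struct chunk and SKETCH_PAGE_SIZE are hypothetical, and a 4 KiB page size is
assumed.

```c
#include <stddef.h>

#define SKETCH_PAGE_SIZE 4096u  /* assumption: 4 KiB pages */

/* One page-bounded piece of a fragment (hypothetical type). */
struct chunk {
    size_t page_index; /* which page of the compound page */
    size_t offset;     /* offset inside that page */
    size_t len;        /* bytes in this chunk; never crosses a page boundary */
};

/*
 * Split the byte range [offset, offset + len) of a compound page into
 * chunks that each stay within a single page.
 * Returns the number of chunks written, or -1 if out[] is too small.
 */
static int split_frag(size_t offset, size_t len,
                      struct chunk *out, int max_out)
{
    int n = 0;

    while (len > 0) {
        size_t in_page = offset % SKETCH_PAGE_SIZE;
        size_t bytes = SKETCH_PAGE_SIZE - in_page; /* room left in this page */

        if (bytes > len)
            bytes = len;
        if (n == max_out)
            return -1;

        out[n].page_index = offset / SKETCH_PAGE_SIZE;
        out[n].offset = in_page;
        out[n].len = bytes;
        n++;

        offset += bytes;
        len -= bytes;
    }
    return n;
}
```

In this sketch a fragment that starts at offset 100 and is 8192 bytes long
comes out as three chunks, because it touches three pages. A transmit path
that assumes one fragment equals one page would miss that case.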