NetBSD Problem Report #45956

From riz@wintermute.localdomain  Thu Feb  9 04:40:46 2012
Return-Path: <riz@wintermute.localdomain>
Received: from mail.netbsd.org (mail.netbsd.org [149.20.53.66])
	by www.NetBSD.org (Postfix) with ESMTP id 8BE9A63D90F
	for <gnats-bugs@gnats.NetBSD.org>; Thu,  9 Feb 2012 04:40:46 +0000 (UTC)
Message-Id: <20120209032229.416C711C974@wintermute.localdomain>
Date: Wed,  8 Feb 2012 19:22:29 -0800 (PST)
From: riz@NetBSD.org
Reply-To: riz@NetBSD.org
To: gnats-bugs@gnats.NetBSD.org
Subject: Network packet corruption on MP kernel
X-Send-Pr-Version: 3.95

>Number:         45956
>Category:       port-xen
>Synopsis:       ssh disconnects with corrupt packet with MP kernel
>Confidential:   no
>Severity:       serious
>Priority:       high
>Responsible:    port-xen-maintainer
>State:          closed
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Thu Feb 09 04:45:00 +0000 2012
>Closed-Date:    Sat May 20 19:18:54 +0000 2017
>Last-Modified:  Sat May 20 19:18:54 +0000 2017
>Originator:     Jeff Rizzo <riz@NetBSD.org>
>Release:        NetBSD 5.99.64
>Organization:

>Environment:


System: NetBSD ip-10-252-1-233.us-west-2.compute.internal 5.99.64 NetBSD 5.99.64 (XEN3PAE_DOMU) #2: Tue Feb  7 20:13:15 PST 2012  riz@wintermute:/space/build/obj.i386/sys/arch/i386/compile/XEN3PAE_DOMU i386
Architecture: i386
Machine: i386
>Description:
	With an MP kernel on a domU (I've seen it on both amd64 and i386 PAE),
	terminal output (such as a build.sh) will sometimes disconnect with
	the following message:

	Corrupted MAC on input.
	Disconnecting: Packet corrupt 

	I haven't seen this on anything but MP xen (that I'm aware of).
>How-To-Repeat:
	Run a fast build with lots of output on a -current xen domU.
>Fix:
	None given.

>Release-Note:

>Audit-Trail:
From: Manuel Bouyer <bouyer@antioche.eu.org>
To: gnats-bugs@NetBSD.org
Cc: port-xen-maintainer@NetBSD.org, gnats-admin@NetBSD.org,
        netbsd-bugs@NetBSD.org
Subject: Re: port-xen/45956: Network packet corruption on MP kernel
Date: Thu, 9 Feb 2012 15:32:14 +0100

 On Thu, Feb 09, 2012 at 04:45:00AM +0000, riz@NetBSD.org wrote:
 > 	With an MP kernel on a domU (I've seen it on both amd64 and i386 PAE),
 > 	terminal output (such as a build.sh) will sometimes disconnect with
 > 	the following message:
 > 
 > 	Corrupted MAC on input.
 > 	Disconnecting: Packet corrupt 
 > 
 > 	I haven't seen this on anything but MP xen (that I'm aware of).

 I have some problems with MP domUs, but not this one. What is your dom0 ?

 -- 
 Manuel Bouyer <bouyer@antioche.eu.org>
      NetBSD: 26 ans d'experience feront toujours la difference
 --

From: Jeff Rizzo <riz@NetBSD.org>
To: gnats-bugs@NetBSD.org
Cc: 
Subject: Re: port-xen/45956: Network packet corruption on MP kernel
Date: Thu, 09 Feb 2012 07:01:17 -0800

 On 2/9/12 6:35 AM, Manuel Bouyer wrote:
 >  > 
 >  > 	I haven't seen this on anything but MP xen (that I'm aware of).
 >  
 >  I have some problems with MP domUs, but not this one. What is your dom0 ?
 >  

 I've seen it on 3 different dom0s;  two are running xen 4.1.2, the third
 (and the one I've seen it on most recently) is Amazon EC2, which I
 believe is based on xen 3.1.

From: Manuel Bouyer <bouyer@antioche.eu.org>
To: gnats-bugs@NetBSD.org
Cc: port-xen-maintainer@NetBSD.org, gnats-admin@NetBSD.org,
        netbsd-bugs@NetBSD.org, riz@NetBSD.org
Subject: Re: port-xen/45956: Network packet corruption on MP kernel
Date: Fri, 10 Feb 2012 16:40:04 +0100

 On Thu, Feb 09, 2012 at 03:05:03PM +0000, Jeff Rizzo wrote:
 >  I've seen it on 3 different dom0s;  two are running xen 4.1.2, the third
 >  (and the one I've seen it on most recently) is Amazon EC2, which I
 >  believe is based on xen 3.1.

 What OS is are the xen 4.1.2 dom0 running ?

 -- 
 Manuel Bouyer <bouyer@antioche.eu.org>
      NetBSD: 26 ans d'experience feront toujours la difference
 --

From: Jeff Rizzo <riz@netbsd.org>
To: Manuel Bouyer <bouyer@antioche.eu.org>
Cc: gnats-bugs@NetBSD.org
Subject: Re: port-xen/45956: Network packet corruption on MP kernel
Date: Fri, 10 Feb 2012 08:24:19 -0800

 On 2/10/12 7:40 AM, Manuel Bouyer wrote:
 > What OS is are the xen 4.1.2 dom0 running ?
 >

 NetBSD - 5.99.59, possibly 5.99.64 (not sure if i've seen it since I 
 upgraded one of them.)

 +j

From: Manuel Bouyer <bouyer@antioche.eu.org>
To: Jeff Rizzo <riz@netbsd.org>
Cc: gnats-bugs@netbsd.org
Subject: Re: port-xen/45956: Network packet corruption on MP kernel
Date: Fri, 10 Feb 2012 17:27:13 +0100

 On Fri, Feb 10, 2012 at 08:24:19AM -0800, Jeff Rizzo wrote:
 > On 2/10/12 7:40 AM, Manuel Bouyer wrote:
 > >What OS is are the xen 4.1.2 dom0 running ?
 > >
 > 
 > NetBSD - 5.99.59, possibly 5.99.64 (not sure if i've seen it since I
 > upgraded one of them.)

 OK, and, just to make sure, you're not running a MULTIPROCESSOR in
 the dom0, right ? It's not supported at this time, and the backends
 are not SMP-safe.

 -- 
 Manuel Bouyer <bouyer@antioche.eu.org>
      NetBSD: 26 ans d'experience feront toujours la difference
 --

From: Jeff Rizzo <riz@netbsd.org>
To: Manuel Bouyer <bouyer@antioche.eu.org>
Cc: gnats-bugs@netbsd.org
Subject: Re: port-xen/45956: Network packet corruption on MP kernel
Date: Fri, 10 Feb 2012 08:34:03 -0800

 On 2/10/12 8:27 AM, Manuel Bouyer wrote:
 > OK, and, just to make sure, you're not running a MULTIPROCESSOR in
 > the dom0, right ? It's not supported at this time, and the backends
 > are not SMP-safe.
 >
 no, UP only.

State-Changed-From-To: open->feedback
State-Changed-By: bouyer@NetBSD.org
State-Changed-When: Sat, 25 Feb 2012 12:24:10 +0000
State-Changed-Why:
Have you been able to reproduce this with reent kernels ?
If not, OK to close the PR ?


From: "Cherry G. Mathew" <cherry.g.mathew@gmail.com>
To: gnats-bugs@netbsd.org
Cc: port-xen-maintainer@netbsd.org, gnats-admin@netbsd.org, 
	netbsd-bugs@netbsd.org, riz@netbsd.org
Subject: Re: port-xen/45956: Network packet corruption on MP kernel
Date: Sun, 11 Mar 2012 11:59:46 +0900

 On 11 February 2012 01:35, Jeff Rizzo <riz@netbsd.org> wrote:
 > The following reply was made to PR port-xen/45956; it has been noted by G=
 NATS.
 >
 > From: Jeff Rizzo <riz@netbsd.org>
 > To: Manuel Bouyer <bouyer@antioche.eu.org>
 > Cc: gnats-bugs@netbsd.org
 > Subject: Re: port-xen/45956: Network packet corruption on MP kernel
 > Date: Fri, 10 Feb 2012 08:34:03 -0800
 >
 > =A0On 2/10/12 8:27 AM, Manuel Bouyer wrote:
 > =A0> OK, and, just to make sure, you're not running a MULTIPROCESSOR in
 > =A0> the dom0, right ? It's not supported at this time, and the backends
 > =A0> are not SMP-safe.
 > =A0>
 > =A0no, UP only.
 >


 Is this still reproduceable (on -current or 6.0) ?

 --=20
 ~Cherry

State-Changed-From-To: feedback->closed
State-Changed-By: dholland@NetBSD.org
State-Changed-When: Sat, 20 May 2017 19:18:54 +0000
State-Changed-Why:
No feedback since 2012; also this problem matches 51753 fairly closely
and if so it's fixed.


>Unformatted:

NetBSD Home
NetBSD PR Database Search

(Contact us) $NetBSD: query-full-pr,v 1.39 2013/11/01 18:47:49 spz Exp $
$NetBSD: gnats_config.sh,v 1.8 2006/05/07 09:23:38 tsutsui Exp $
Copyright © 1994-2007 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.