NetBSD Problem Report #44897

From www@NetBSD.org  Fri Apr 22 17:51:22 2011
Return-Path: <www@NetBSD.org>
Received: from mail.netbsd.org (mail.netbsd.org [204.152.190.11])
	by www.NetBSD.org (Postfix) with ESMTP id A19DF63B89A
	for <gnats-bugs@gnats.NetBSD.org>; Fri, 22 Apr 2011 17:51:22 +0000 (UTC)
Message-Id: <20110422175121.C9F2763B842@www.NetBSD.org>
Date: Fri, 22 Apr 2011 17:51:21 +0000 (UTC)
From: cyber@netbsd.org
Reply-To: cyber@netbsd.org
To: gnats-bugs@NetBSD.org
Subject: bge initialization/run failures
X-Send-Pr-Version: www-1.0

>Number:         44897
>Category:       kern
>Synopsis:       bge initialization/run failures
>Confidential:   no
>Severity:       critical
>Priority:       medium
>Responsible:    msaitoh
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Fri Apr 22 17:55:00 +0000 2011
>Last-Modified:  Fri Aug 01 04:01:59 +0000 2014
>Originator:     Erik Berls
>Release:        amd64-5.1
>Organization:
>Environment:
/netbsd: NetBSD 5.1 (GENERIC) #0: Sat Nov  6 13:19:33 UTC 2010 
/netbsd: builds@b6.netbsd.org:/home/builds/ab/netbsd-5-1-RELEASE/amd64/201011061943Z-obj/home/builds/ab/netbsd-5-1-RELEASE/src/sys/arch/amd64/compile/GENERIC

>Description:
[snip]
/netbsd: bge0 at pci4 dev 9 function 0: Altima AC9100 Gigabit Ethernet
/netbsd: APC2: Picked IRQ 17 with weight 0
/netbsd: bge0: interrupting at ioapic0 pin 17
/netbsd: bge0: ASIC BCM5701 B5 (0x0105), Ethernet address 00:09:5b:60:d6:21
/netbsd: brgphy0 at bge0 phy 1: BCM5701 1000BASE-T media interface, rev. 0
/netbsd: brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
[snip]
/netbsd: bge0: watchdog timeout -- resetting 
/netbsd: bge0: block failed to stop: reg 0x4800, bit 0x00000002
/netbsd: bge0: watchdog timeout -- resetting  
/netbsd: bge0: block failed to stop: reg 0x4800, bit 0x00000002
/netbsd: bge0: block failed to stop: reg 0x3c00, bit 0x00000002
[snip]

Network never actually functions.

>How-To-Repeat:
Run a system with an unlucky bge(4).  Try to access the network.

*** This works in 5.0.2. ***


(Also, this GENERIC kernel was run with -c, "delete azalia"

>Fix:

>Release-Note:

>Audit-Trail:
From: Martin Husemann <martin@duskware.de>
To: gnats-bugs@NetBSD.org
Cc: 
Subject: Re: kern/44897: bge initialization/run failures
Date: Sat, 23 Apr 2011 14:07:29 +0200

 Current (and IIRC netbsd-5 too) have changes in this area, could you test
 both a -current and a later netbsd-5 kernel?

 Martin

From: Erik Berls <cyber@netbsd.org>
To: gnats-bugs@netbsd.org
Cc: kern-bug-people@netbsd.org, netbsd-bugs@netbsd.org
Subject: Re: kern/44897: bge initialization/run failures
Date: Mon, 4 Jul 2011 13:07:07 -0700

 On Sat, Apr 23, 2011 at 05:10, Martin Husemann <martin@duskware.de> wrote:
 [...]
 > =A0Current (and IIRC netbsd-5 too) have changes in this area, could you t=
 est
 > =A0both a -current and a later netbsd-5 kernel?

 I tried the latest on netbsd-5 (July 4, 2011) and i'm not getting any
 traffic through the interface.
 Again, I had to disable azalia or the system would not boot.


 With -current (6/28, from nyftp) I get a hang right after acpicpu1
 (it's a dual core system) regardless of azalia state.
 Disabling ACPI also causes a hang (after audio0), regardless of azalia stat=
 e.

 Specific host system is a Dell Dimension C521

From: David Holland <dholland-bugs@netbsd.org>
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: kern/44897: bge initialization/run failures
Date: Sun, 17 Jul 2011 03:03:01 +0000

 Not sent to gnats.

    ------

 From: Jukka Ruohonen <jruohonen@iki.fi>
 To: netbsd-bugs@NetBSD.org
 Cc: Erik Berls <cyber@NetBSD.org>
 Subject: Re: kern/44897: bge initialization/run failures
 Date: Mon, 4 Jul 2011 23:12:18 +0300
 Mail-Followup-To: netbsd-bugs@NetBSD.org, Erik Berls <cyber@NetBSD.org>

 On Mon, Jul 04, 2011 at 08:10:08PM +0000, Erik Berls wrote:
 >  With -current (6/28, from nyftp) I get a hang right after acpicpu1
 >  (it's a dual core system) regardless of azalia state.

 Can you verify this with more recent -curent (from July)? And if the problem
 is still present, please file a separate problem report. Thanks.


 From: Erik Berls <cyber@netbsd.org>
 To: jruohonen@iki.fi, netbsd-bugs@netbsd.org, Erik Berls <cyber@netbsd.org>
 Subject: Re: kern/44897: bge initialization/run failures
 Date: Mon, 4 Jul 2011 16:35:50 -0700

 Submitted as port-amd64/45118

 Summary:
 GENERIC, Source update as of July 4, 2011 14:30 PST:
 - booted normally has the same issue (hang).
 - booted with -12c & 'disable azalia' gets to multi user, but then
 suspicious issues with arp
 - booted with -1 gets to multi user
 - booted -with -2 hangs after audio0 at hdafg4


 Running non-smp under some nfs load managed to get -current to panic, as well.
 *sigh*
 kern/45119


 From: Erik Berls <cyber@netbsd.org>
 To: jruohonen@iki.fi, netbsd-bugs@netbsd.org, Erik Berls <cyber@netbsd.org>
 Subject: Re: kern/44897: bge initialization/run failures
 Date: Mon, 4 Jul 2011 17:08:33 -0700

 I'm now able to replicate this on 5.0.2 by pushing NFS traffic (remote
 builds) to the system.

 I get the set of 3 messages accompanied by a network hang.  after a
 minute or two it recovers. (usually about 4 sets worth)


 On a whim from the other tests, i tried running 5.0.2 with -1:
 longer times to any recovery.

 I was able to make -current panic in the whole mess (kern/45119)


Responsible-Changed-From-To: kern-bug-people->msaitoh
Responsible-Changed-By: msaitoh@NetBSD.org
Responsible-Changed-When: Fri, 01 Aug 2014 04:01:59 +0000
Responsible-Changed-Why:
mine.


>Unformatted:

NetBSD Home
NetBSD PR Database Search

(Contact us) $NetBSD: query-full-pr,v 1.39 2013/11/01 18:47:49 spz Exp $
$NetBSD: gnats_config.sh,v 1.8 2006/05/07 09:23:38 tsutsui Exp $
Copyright © 1994-2007 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.