NetBSD Problem Report #40345

From www@NetBSD.org  Thu Jan  8 21:13:21 2009
Return-Path: <www@NetBSD.org>
Received: from mail.netbsd.org (mail.netbsd.org [204.152.190.11])
	by narn.NetBSD.org (Postfix) with ESMTP id 7738163B909
	for <gnats-bugs@gnats.netbsd.org>; Thu,  8 Jan 2009 21:13:21 +0000 (UTC)
Message-Id: <20090108211321.3D81E63B8BA@narn.NetBSD.org>
Date: Thu,  8 Jan 2009 21:13:21 +0000 (UTC)
From: rkraevskiy@gmail.com
Reply-To: rkraevskiy@gmail.com
To: gnats-bugs@NetBSD.org
Subject: nfe: watchdog timeout
X-Send-Pr-Version: www-1.0

>Number:         40345
>Category:       kern
>Synopsis:       nfe: watchdog timeout
>Confidential:   no
>Severity:       critical
>Priority:       high
>Responsible:    kern-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Thu Jan 08 21:15:04 +0000 2009
>Closed-Date:    
>Last-Modified:  Mon Nov 21 05:26:17 +0000 2016
>Originator:     Roman Kraevskiy
>Release:        NetBSD 5.0_BETA 200901020002Z
>Organization:
>Environment:
NetBSD dhcppc2 5.0_BETA NetBSD 5.0_BETA (GENERIC) #0: Fri Jan  2 17:22:22 PST 2009  builds@wb37:/home/builds/ab/netbsd-5/i386/200901020002Z-obj/home/builds/ab/netbsd-5/src/sys/arch/i386/compile/GENERIC i386
>Description:

nfe is unusable: "nfe0: watchdog timeout" is logged periodically under network activity. tcpdump on this host shows all outgoing packets (for example, pings from this host) but no incoming packets (e.g. ICMP to this host). No outgoing packets appear on the wire at all.

MB: Biostar nf4st-a9 ( http://www.biostar-usa.com/mbdetails.asp?model=NF4ST-A9 ) 
Chipset: nForce4

dmesg:
nfe0 at pci0 dev 10 function 0: vendor 0x10de product 0x0057 (rev. 0xa3)
nfe0: interrupting at ioapic0 pin 21
nfe0: Ethernet address 00:14:2a:7f:ba:ea
ciphy0 at nfe0 phy 17: Cicada CS8201 10/100/1000TX PHY, rev. 3
ciphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto

lspci:
00:0a.0 Bridge: nVidia Corporation CK804 Ethernet Controller (rev a3)


>How-To-Repeat:
Try to use the nfe interface on a Biostar NF4ST-A9 board.
>Fix:

>Release-Note:

>Audit-Trail:
From: Christoph Egger <Christoph_Egger@gmx.de>
To: gnats-bugs@NetBSD.org
Cc: kern-bug-people@netbsd.org, gnats-admin@netbsd.org, 
 netbsd-bugs@netbsd.org
Subject: Re: kern/40345: nfe: watchdog timeout
Date: Thu, 08 Jan 2009 23:27:43 +0100

 > nfe is not working/usable, periodic "nfe0: watchdog timeout" on network activity.
 > tcpdump on this host shows all outgoing packets (for example ping from this host)
 > and no incoming packets (eg. icmp to this host). No outgoing packets on the wire at all.
 > 
 > MB: Biostar nf4st-a9 ( http://www.biostar-usa.com/mbdetails.asp?model=NF4ST-A9 ) 
 > Chipset: nForce4
 > 
 > dmesg:
 > nfe0 at pci0 dev 10 function 0: vendor 0x10de product 0x0057 (rev. 0xa3)
 > nfe0: interrupting at ioapic0 pin 21
 > nfe0: Ethernet address 00:14:2a:7f:ba:ea
 > ciphy0 at nfe0 phy 17: Cicada CS8201 10/100/1000TX PHY, rev. 3
 > ciphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
 > 
 > lspci:
 > 00:0a.0 Bridge: nVidia Corporation CK804 Ethernet Controller (rev a3)

 Give this a try:

 disable ciphy in userconf and check if ukphy attaches on nfe0.
 If it does, test if you still have watchdog timeouts or not.

 If you still have watchdog timeouts, then the problem is nfe0.
 If there are no more watchdog timeouts, then the problem is ciphy0.

 Christoph

From: "Roman Kraevskiy" <rkraevskiy@gmail.com>
To: Christoph_Egger@gmx.de
Cc: kern-bug-people@netbsd.org, gnats-bugs@netbsd.org, netbsd-bugs@netbsd.org
Subject: Re: kern/40345: nfe: watchdog timeout
Date: Fri, 9 Jan 2009 17:24:09 +0200

 >  Give this a try:
 >
 >  disable ciphy in userconf and check if ukphy attaches on nfe0.
 >  If it does, test if you still have watchdog timeouts or not.

 nfe0 at pci0 dev 10 function 0: vendor 0x10de product 0x0057 (rev. 0xa3)
 nfe0: interrupting at ioapic0 pin 21
 nfe0: Ethernet address 00:14:2a:7f:ba:ea
 ukphy1 at nfe0 phy 17: Generic IEEE 802.3u media interface
 ukphy1: OUI 0x00c08f, model 0x0001, rev. 3
 ukphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT,
 1000baseT-FDX, auto

 Nothing changes: "nfe0: watchdog timeout" on network activity.

 >  If you still have watchdog timeouts, then the problem is nfe0.
 >  If there are no more watchdog timeouts, then the problem is ciphy0.

From: David Young <dyoung@pobox.com>
To: Roman Kraevskiy <rkraevskiy@gmail.com>
Cc: Christoph_Egger@gmx.de, kern-bug-people@netbsd.org,
	gnats-bugs@netbsd.org, netbsd-bugs@netbsd.org
Subject: Re: kern/40345: nfe: watchdog timeout
Date: Fri, 9 Jan 2009 11:25:00 -0600

 On Fri, Jan 09, 2009 at 05:24:09PM +0200, Roman Kraevskiy wrote:
 > >  Give this a try:
 > >
 > >  disable ciphy in userconf and check if ukphy attaches on nfe0.
 > >  If it does, test if you still have watchdog timeouts or not.
 > 
 > nfe0 at pci0 dev 10 function 0: vendor 0x10de product 0x0057 (rev. 0xa3)
 > nfe0: interrupting at ioapic0 pin 21
 > nfe0: Ethernet address 00:14:2a:7f:ba:ea
 > ukphy1 at nfe0 phy 17: Generic IEEE 802.3u media interface
 > ukphy1: OUI 0x00c08f, model 0x0001, rev. 3
 > ukphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT,
 > 1000baseT-FDX, auto
 > 
 > Nothing changes: "nfe0: watchdog timeout" on network activity.
 > 
 > >  If you still have watchdog timeouts, then the problem is nfe0.
 > >  If there are no more watchdog timeouts, then the problem is ciphy0.

 nfe_intr() is bracketed by

         NFE_WRITE(sc, NFE_IRQ_MASK, 0);

 and

         NFE_WRITE(sc, NFE_IRQ_MASK, NFE_IRQ_WANTED);

 That should not be necessary.  It may be harmful.  I suggest moving
 those lines out of nfe_intr() and into nfe_init(), where they should
 bracket the nfe_intr() call.
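
 As a rough illustration of that suggestion, here is a standalone mock,
 not the actual if_nfe.c code. The register indices, the NFE_IRQ_WANTED
 value, the softc layout, and the function form of NFE_WRITE/NFE_READ
 are all placeholders invented for the sketch; only the names
 nfe_intr(), nfe_init(), NFE_IRQ_MASK and NFE_IRQ_WANTED come from the
 discussion above.

```c
#include <stdint.h>

/* Mock register file standing in for the device's MMIO space.
 * The indices and the NFE_IRQ_WANTED value are placeholders. */
enum { NFE_IRQ_STATUS, NFE_IRQ_MASK, NFE_NREGS };
#define NFE_IRQ_WANTED 0x00ffu

struct nfe_softc {
	uint32_t regs[NFE_NREGS];
	unsigned mask_writes;	/* counts writes to NFE_IRQ_MASK */
};

/* Function stand-ins for the driver's register access macros. */
static void
NFE_WRITE(struct nfe_softc *sc, int reg, uint32_t val)
{
	if (reg == NFE_IRQ_MASK)
		sc->mask_writes++;
	sc->regs[reg] = val;
}

static uint32_t
NFE_READ(struct nfe_softc *sc, int reg)
{
	return sc->regs[reg];
}

/* Interrupt handler: acknowledges pending causes and no longer
 * touches the interrupt mask. Returns 1 if it handled something. */
static int
nfe_intr(struct nfe_softc *sc)
{
	uint32_t pending = NFE_READ(sc, NFE_IRQ_STATUS);

	if (pending == 0)
		return 0;	/* not our interrupt */

	/* Mock acknowledge: simply clear the status register. */
	NFE_WRITE(sc, NFE_IRQ_STATUS, 0);
	/* RX/TX completion processing would go here. */
	return 1;
}

/* Init path: mask interrupts, drain stale events through nfe_intr(),
 * then enable the wanted interrupts. */
static void
nfe_init(struct nfe_softc *sc)
{
	NFE_WRITE(sc, NFE_IRQ_MASK, 0);
	(void)nfe_intr(sc);
	NFE_WRITE(sc, NFE_IRQ_MASK, NFE_IRQ_WANTED);
}
```

 In this arrangement the mask register is written only twice, both
 times from nfe_init(); later calls to nfe_intr() leave it alone.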

 Dave

 -- 
 David Young             OJC Technologies
 dyoung@ojctech.com      Urbana, IL * (217) 278-3933

From: Christoph Egger <cegger@netbsd.org>
To: gnats-bugs@gnats.NetBSD.org
Cc: 
Subject: PR/40345 CVS commit: src/sys/dev/pci
Date: Sun, 18 Jan 2009 11:21:06 +0000 (UTC)

 Module Name:	src
 Committed By:	cegger
 Date:		Sun Jan 18 11:21:06 UTC 2009

 Modified Files:
 	src/sys/dev/pci: if_nfe.c

 Log Message:
 Bracket IRQ masking around nfe_intr() in nfe_init() rather than inside nfe_intr().
 Proposed by dyoung@ in PR 40345. Should fix PR 40345.


 To generate a diff of this commit:
 cvs rdiff -r1.38 -r1.39 src/sys/dev/pci/if_nfe.c

 Please note that diffs are not public domain; they are subject to the
 copyright notices on the relevant files.

State-Changed-From-To: open->feedback
State-Changed-By: cegger@NetBSD.org
State-Changed-When: Sun, 18 Jan 2009 11:37:33 +0000
State-Changed-Why:
Proposed fix committed; awaiting test results.


From: Roman Kraevskiy <rkraevskiy@gmail.com>
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: PR/40345 CVS commit: src/sys/dev/pci
Date: Sun, 22 Mar 2009 21:31:17 +0200

 NetBSD dhcppc1 5.0_RC3 NetBSD 5.0_RC3 (GENERIC) #0: Sat Mar 21
 23:57:42 UTC 2009
 builds@b6.netbsd.org:/home/builds/ab/netbsd-5/i386/200903210000Z-obj/home/builds/ab/netbsd-5/src/sys/arch/i386/compile/GENERIC
 i386

 sys/dev/pci/if_nfe.c revision 1.42 pulled to RC3 on Mon Mar 2 20:46:03
 2009 UTC (2 weeks, 5 days ago)

 Result: "nfe0: watchdog timeout".

From: Christoph Egger <Christoph_Egger@gmx.de>
To: gnats-bugs@NetBSD.org
Cc: kern-bug-people@netbsd.org, gnats-admin@netbsd.org, 
 netbsd-bugs@netbsd.org, rkraevskiy@gmail.com
Subject: Re: PR/40345 CVS commit: src/sys/dev/pci
Date: Sun, 22 Mar 2009 22:29:30 +0100

 Roman Kraevskiy wrote:

 The related fix is revision 1.39 and not 1.42.
 Please test rev. 1.39.

 Christoph

From: Hauke Fath <hf@spg.tu-darmstadt.de>
To: gnats-bugs@netbsd.org
Cc: Hauke Fath <hf@spg.tu-darmstadt.de>
Subject: Re: kern/40345: nfe: watchdog timeout
Date: Wed, 20 May 2009 15:52:41 +0200

 The bug is alive and well in netbsd-5. Unfortunately, applying the change in

 cvs rdiff -r1.38 -r1.39 src/sys/dev/pci/if_nfe.c

 does not fix it, either.

 	hauke


 -- 
       The ASCII Ribbon Campaign                    Hauke Fath
 ()     No HTML/RTF in email            Institut für Nachrichtentechnik
 /\     No Word docs in email                     TU Darmstadt
       Respect for open standards              Ruf +49-6151-16-3281

State-Changed-From-To: feedback->open
State-Changed-By: dholland@NetBSD.org
State-Changed-When: Sun, 15 Nov 2009 01:57:36 +0000
State-Changed-Why:
feedback was received months ago, problem still exists


State-Changed-From-To: open->feedback
State-Changed-By: jakllsch@NetBSD.org
State-Changed-When: Fri, 25 Nov 2011 23:40:54 +0000
State-Changed-Why:
may have been fixed by if_nfe.c 1.52


From: Hauke Fath <hf@spg.tu-darmstadt.de>
To: gnats-bugs@NetBSD.org
Cc: Hauke Fath <hf@spg.tu-darmstadt.de>
Subject: Re: kern/40345: nfe: watchdog timeout 
Date: Thu, 27 Oct 2016 14:20:28 +0200

 No change:

 %  uname -a
 NetBSD Eisenhut 7.0_STABLE NetBSD 7.0_STABLE 
 (FEUERKOGEL_DOM0-$Revision$) #0: Wed May 18 13:31:12 CEST 2016  
 hf@Hochstuhl:/var/obj/netbsd-builds/7/amd64/sys/arch/amd64/compile/FEUERKOGEL_DOM0 
 amd64
 % fgrep "nfe0: watchdog timeout" /var/log/messages
 Sep 11 18:23:45 Eisenhut /netbsd: nfe0: watchdog timeout
 Sep 22 16:03:43 Eisenhut /netbsd: nfe0: watchdog timeout
 Oct 13 07:47:34 Eisenhut /netbsd: nfe0: watchdog timeout
 Oct 14 09:37:54 Eisenhut /netbsd: nfe0: watchdog timeout
 Oct 15 23:25:12 Eisenhut /netbsd: nfe0: watchdog timeout
 Oct 16 08:35:34 Eisenhut /netbsd: nfe0: watchdog timeout
 Oct 24 12:52:54 Eisenhut /netbsd: nfe0: watchdog timeout
 Oct 26 10:25:42 Eisenhut /netbsd: nfe0: watchdog timeout
 %

 Cheerio,
 hauke

State-Changed-From-To: feedback->open
State-Changed-By: dholland@NetBSD.org
State-Changed-When: Mon, 21 Nov 2016 05:26:17 +0000
State-Changed-Why:
no such luck


>Unformatted:


Copyright © 1994-2014 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.