NetBSD Problem Report #56994

From www@netbsd.org  Sun Sep  4 15:41:46 2022
Return-Path: <www@netbsd.org>
Received: from mail.netbsd.org (mail.netbsd.org [199.233.217.200])
	(using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits))
	(Client CN "mail.NetBSD.org", Issuer "mail.NetBSD.org CA" (not verified))
	by mollari.NetBSD.org (Postfix) with ESMTPS id 22A3C1A921F
	for <gnats-bugs@gnats.NetBSD.org>; Sun,  4 Sep 2022 15:41:46 +0000 (UTC)
Message-Id: <20220904154114.49DC71A9243@mollari.NetBSD.org>
Date: Sun,  4 Sep 2022 15:41:14 +0000 (UTC)
From: tnn@nygren.pp.se
Reply-To: tnn@nygren.pp.se
To: gnats-bugs@NetBSD.org
Subject: wm(4) panic on attach in interrupt_distribute, sometimes
X-Send-Pr-Version: www-1.0

>Number:         56994
>Category:       kern
>Synopsis:       wm(4) panic on attach in interrupt_distribute, sometimes
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    kern-bug-people
>State:          closed
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Sun Sep 04 15:45:00 +0000 2022
>Closed-Date:    Tue Sep 06 06:04:00 +0000 2022
>Last-Modified:  Tue Sep 06 06:04:00 +0000 2022
>Originator:     Tobias Nygren
>Release:        9.99.99
>Organization:
NetBSD
>Environment:
9.99.99 aarch64
>Description:
Sometimes it boots OK, but more than half of the time it panics while attaching wm(4).
This started sometime between 2022-07-16 and 2022-09-04.
I have reverted to the old kernel in the meantime.

addr2line -e netbsd.gdb ffffc00000113074
/usr/src/sys/dev/pci/if_wm.c:6153

ident netbsd.gdb | grep if_wm
     $NetBSD: if_wm.c,v 1.763 2022/08/12 10:59:42 riastradh Exp $

wm0 at pci4 dev 0 function 0: Intel i82574L (rev. 0x00)
panic: Trap: Data Abort (EL1): Translation Fault L0 with read access for 0000000000002000: pc ffffc0000000347c: ldr x1, [x20]

cpu0: Begin traceback...
trace fp ffffc00000fece50
fp ffffc00000fece80 vpanic() at ffffc000004de1ec netbsd:vpanic+0x17c
fp ffffc00000fecef0 panic() at ffffc000004de2f4 netbsd:panic+0x44
fp ffffc00000fecf80 data_abort_handler() at ffffc000000aaddc netbsd:data_abort_handler+0x1ec
tf ffffc00000fecff0 el1_trap() at ffffc000000ac784 netbsd:el1_vectors+0x784
---- Data Abort (EL1): trapframe 0xffffc00000fecff0 (304 bytes) ----
...
------------------------------------------------
fp ffffc00000fed320 interrupt_distribute() at ffffc0000000347c netbsd:interrupt_distribute+0x1c
fp ffffc00000fed390 wm_attach() at ffffc00000113074 netbsd:wm_attach+0x714
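
For reference, the remaining frames can be resolved the same way as the wm_attach one above. The following is only a minimal sketch in Python, assuming the traceback line format shown in this report; the names FRAME_RE and parse_frames are made up for illustration and are not part of any NetBSD tool. It extracts each frame's address and symbol offset, derives the symbol start address, and prints the matching addr2line invocation.

```python
import re

# Matches traceback frames in the format seen in this report, e.g.:
#   fp ffffc00000fed390 wm_attach() at ffffc00000113074 netbsd:wm_attach+0x714
# (hypothetical helper; the format is assumed from the output above)
FRAME_RE = re.compile(
    r"^(?:fp|tf)\s+[0-9a-f]+\s+(\w+)\(\)\s+at\s+([0-9a-f]+)"
    r"\s+\w+:(\w+)\+0x([0-9a-f]+)$"
)


def parse_frames(text):
    """Return (function, address, symbol, offset) for each traceback frame."""
    frames = []
    for line in text.splitlines():
        m = FRAME_RE.match(line.strip())
        if m:
            func, addr, sym, off = m.groups()
            frames.append((func, int(addr, 16), sym, int(off, 16)))
    return frames


TRACE = """\
fp ffffc00000fed320 interrupt_distribute() at ffffc0000000347c netbsd:interrupt_distribute+0x1c
fp ffffc00000fed390 wm_attach() at ffffc00000113074 netbsd:wm_attach+0x714
"""

for func, addr, sym, off in parse_frames(TRACE):
    # The symbol start is the frame address minus the reported offset.
    print(f"{sym}: start {addr - off:#x}, addr2line -e netbsd.gdb {addr:x}")
```

For the two frames above, this puts interrupt_distribute at ffffc00000003460 and wm_attach at ffffc00000112960.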


dmesg output from previous kernel:
wm0 at pci4 dev 0 function 0, 64-bit DMA: Intel i82574L (rev. 0x00)
wm0: for TX and RX interrupting at irq 8193 (MSI-X vec 0)
wm0: for TX and RX interrupting at irq 8194 (MSI-X vec 1)
wm0: for LINK interrupting at irq 8195 (MSI-X vec 2)
wm0: PCI-Express bus
wm0: 2048 words FLASH, version 1.8.0, Image Unique ID 0000ffff
wm0: ASPM L0s and L1 are disabled to workaround the errata.
wm0: Ethernet address 00:1b:41:01:0f:56
wm0: 0x224080<FLASH,PCIE,ASF_FIRM,WOL>
makphy0 at wm0 phy 1: Marvell 88E1149 Gigabit PHY, rev. 1
>How-To-Repeat:
Reboot a few times; the panic occurs on more than half of the boots.

>Fix:

>Release-Note:

>Audit-Trail:

State-Changed-From-To: open->feedback
State-Changed-By: skrll@NetBSD.org
State-Changed-When: Sun, 04 Sep 2022 16:06:18 +0000
State-Changed-Why:
please try with sys/arch/arm/fdt/pcihost_fdt.c:1.30


From: Tobias Nygren <tnn@nygren.pp.se>
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: kern/56994 (wm(4) panic on attach in interrupt_distribute,
 sometimes)
Date: Sun, 4 Sep 2022 18:22:45 +0200

 > please try with sys/arch/arm/fdt/pcihost_fdt.c:1.30

 Seems better now! Didn't expect it to be such a recent change. :-)
 Let me know if you're ok with keeping the revert or if you want me to
 test some different patch.

State-Changed-From-To: feedback->closed
State-Changed-By: skrll@NetBSD.org
State-Changed-When: Tue, 06 Sep 2022 06:04:00 +0000
State-Changed-Why:
Fixed.


>Unformatted:
