NetBSD Problem Report #53997

From martin@duskware.de  Thu Feb 21 12:48:08 2019
Return-Path: <martin@duskware.de>
Received: from mail.netbsd.org (mail.netbsd.org [199.233.217.200])
	(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
	(Client CN "mail.NetBSD.org", Issuer "mail.NetBSD.org CA" (not verified))
	by mollari.NetBSD.org (Postfix) with ESMTPS id 2851B7A156
	for <gnats-bugs@gnats.NetBSD.org>; Thu, 21 Feb 2019 12:48:08 +0000 (UTC)
From: martin@NetBSD.org
Reply-To: martin@NetBSD.org
To: gnats-bugs@NetBSD.org
Subject: evbarm mvsata crash
X-Send-Pr-Version: 3.95

>Number:         53997
>Category:       kern
>Synopsis:       evbarm mvsata crash
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    kern-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Thu Feb 21 12:50:00 +0000 2019
>Originator:     Martin Husemann
>Release:        NetBSD 7.2_STABLE
>Organization:
The NetBSD Foundation, Inc.
>Environment:
System: NetBSD plug.duskware.de 7.2_STABLE NetBSD 7.2_STABLE (PLUG) #24: Thu Feb 21 11:56:47 CET 2019 martin@plug.duskware.de:/nbsd/src-7/sys/arch/evbarm/compile/PLUG evbarm
Architecture: earm
Machine: evbarm
>Description:

I have a GuruPlug evbarm machine that fails to run -8 or -current.
It is rock solid when running latest netbsd-7.

What is a bit strange is that I have a second identical machine, but with
a different SATA disk / setup and that other machine is working both with
-current as with -8 pretty fine, even if I stress the disk.

This is an example of what happens with -current on the affected machine:


[ 226.2104770] Fatal kernel mode prefetch abort at 0x00000040
[ 226.2104770] trapframe: 0xcbc71f30, spsr=40000013
[ 226.2104770] r0 =00000016, r1 =00000000, r2 =00000000, r3 =c03019c4
[ 226.2104770] r4 =00000218, r5 =cbc80218, r6 =cbc80000, r7 =00000006
[ 226.2104770] r8 =00000000, r9 =00000000, r10=00000001, r11=cbc71fac
[ 226.2104770] r12=cbc71f80, ssp=cbc71f80, slr=c0210c0c, pc =00000040

Stopped in pid 0.5 (system) at  40:     address 0x40 is invalid
andeq   r0, r0, r0
db> bt
0xcbc71fac: c0210b98 is in softint_schedule (../../../../kern/kern_softint.c:476).
0xcbc71fbc: c000ca98 is in clockhandler (../../../../arch/arm/marvell/mvsoctmr.c:227).
0xcbc71fdc: c0071518 is in pic_dispatch (../../../../arch/arm/pic/pic.c:369).
0xcbc7204c: c0071824 is in pic_do_pending_ints (./arm/cpufunc.h:276).
0xcbc720c4: c000bcec
0xcbc720e4: c0086a50 is in mvsata_fix_phy_gen2 (../../../../dev/ic/mvsata.c:3782).
0xcbc72104: c00868e0 is in mvsata_hreset_port (../../../../dev/ic/mvsata.c:3321).
0xcbc72154: c008a51c is in mvsata_edma_rqq_remove (../../../../dev/ic/mvsata.c:2900).
0xcbc7218c: c008aa04 is in mvsata_bio_intr (../../../../dev/ic/mvsata.c:1315).
0xcbc721ec: c008ae0c is in mvsata_edma_handle (../../../../dev/ic/mvsata.c:2843).
0xcbc7223c: c008a528 is in mvsata_edma_rqq_remove (../../../../dev/ic/mvsata.c:2902).
0xcbc72274: c008aa04 is in mvsata_bio_intr (../../../../dev/ic/mvsata.c:1315).
0xcbc722d4: c008ae0c is in mvsata_edma_handle (../../../../dev/ic/mvsata.c:2843).
0xcbc72324: c008a528 is in mvsata_edma_rqq_remove (../../../../dev/ic/mvsata.c:2902).
0xcbc7235c: c008aa04 is in mvsata_bio_intr (../../../../dev/ic/mvsata.c:1315).
0xcbc723bc: c008ae0c is in mvsata_edma_handle (../../../../dev/ic/mvsata.c:2843).
0xcbc7240c: c008a528 is in mvsata_edma_rqq_remove (../../../../dev/ic/mvsata.c:2902).
0xcbc72444: c008aa04 is in mvsata_bio_intr (../../../../dev/ic/mvsata.c:1315).
0xcbc724a4: c008ae0c is in mvsata_edma_handle (../../../../dev/ic/mvsata.c:2843).
0xcbc724f4: c008a528 is in mvsata_edma_rqq_remove (../../../../dev/ic/mvsata.c:2902).
0xcbc7252c: c008aa04 is in mvsata_bio_intr (../../../../dev/ic/mvsata.c:1315).
0xcbc7258c: c008ae0c is in mvsata_edma_handle (../../../../dev/ic/mvsata.c:2843).
..
..

Does it overflow the kernel stack in an infinite recursion?


Full dmesg:

[   1.0000000] NetBSD/evbarm (sheevaplug) booting ...
[   1.0000000] [ Kernel symbol table missing! ]
[   1.0000000] Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
[   1.0000000]     2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017,
[   1.0000000]     2018, 2019 The NetBSD Foundation, Inc.  All rights reserved.
[   1.0000000] Copyright (c) 1982, 1986, 1989, 1991, 1993
[   1.0000000]     The Regents of the University of California.  All rights reserved.

[   1.0000000] NetBSD 8.99.34 (PLUG) #3: Thu Feb 21 10:56:24 CET 2019
[   1.0000000] 	martin@seven-days-to-the-wolves.aprisoft.de:/work/src/sys/arch/evbarm/compile/PLUG
[   1.0000000] total memory = 512 MB
[   1.0000000] avail memory = 497 MB
[   1.0000000] mainbus0 (root)
[   1.0000000] cpu0 at mainbus0 core 0: Sheeva 88SV131 rev 1 (ARM9E-S V5TE core)
[   1.0000000] cpu0: DC enabled IC enabled WB enabled LABT branch prediction enabled
[   1.0000000] cpu0: 16KB/32B 4-way L1 VIVT Instruction cache
[   1.0000000] cpu0: 16KB/32B 4-way write-back-locking-C L1 VIVT Data cache
[   1.0000000] mvsoc0 at mainbus0: Marvell 88F6281 Rev. A1  Kirkwood
[   1.0000000] mvsoc0: CPU Clock 1200.000 MHz  SysClock 400.000 MHz  TClock 200.000 MHz
[   1.0000000] mvsoctmr0 at mvsoc0 unit 0 offset 0x20300-0x203ff irq 65: Marvell SoC Timer
[   1.0000000] mvsocgpp0 at mvsoc0 unit 0 offset 0x10100-0x101ff irq 35: Marvell SoC General Purpose I/O Port Interface
[   1.0000000] mvsocgpp0: 50 gpio pins
[   1.0000000] mvsocgpp0: interrupts 96..103, intr 35
[   1.0000000] mvsocgpp0: interrupts 104..111, intr 36
[   1.0000000] mvsocgpp0: interrupts 112..119, intr 37
[   1.0000000] mvsocgpp0: interrupts 120..127, intr 38
[   1.0000000] mvsocgpp0: interrupts 128..135, intr 39
[   1.0000000] mvsocgpp0: interrupts 136..143, intr 40
[   1.0000000] mvsocgpp0: interrupts 144..151, intr 41
[   1.0000000] mvsocgpp0:   Data Out:                 	0x00000000
[   1.0000000] mvsocgpp0:   Data Out Enable Control:  	0xffffffff
[   1.0000000] mvsocgpp0:   Data Blink Enable:        	0x00000000
[   1.0000000] mvsocgpp0:   Data In Polarity:         	0x00000000
[   1.0000000] mvsocgpp0:   Data In:                  	0x00000080
[   1.0000000] mvsocgpp0:   Interrupt Cause:          	0x00000000
[   1.0000000] mvsocgpp0:   Interrupt Mask:           	0x00000000
[   1.0000000] mvsocgpp0:   Interrupt Level Mask:     	0x00000000
[   1.0000000] mvsocgpp0:   High Data Out:            	0x000f0000
[   1.0000000] mvsocgpp0:   High Data Out Enable Ctrl:	0xffffffff
[   1.0000000] mvsocgpp0:   High Blink Enable:        	0x00000000
[   1.0000000] mvsocgpp0:   High Data In Polarity:    	0x00000000
[   1.0000000] mvsocgpp0:   High Data In:             	0x0003f000
[   1.0000000] mvsocgpp0:   High Interrupt Cause:     	0x00000000
[   1.0000000] mvsocgpp0:   High Interrupt Mask:      	0x00000000
[   1.0000000] mvsocgpp0:   High Interrupt Level Mask:	0x00000000
[   1.0000000] gpio0 at mvsocgpp0: 50 pins
[   1.0000000] mvsocrtc0 at mvsoc0 unit 0 offset 0x10300-0x10317: Marvell SoC Real Time Clock
[   1.0000000] com0 at mvsoc0 unit 0 offset 0x12000-0x1201f irq 33: ns16550a, working fifo
[   1.0000000] com0: console
[   1.0000000] com1 at mvsoc0 unit 1 offset 0x12100-0x1211f irq 34: ns16550a, working fifo
[   1.0000000] ehci0 at mvsoc0 unit 0 offset 0x50000-0x50fff irq 19: Marvell USB 2.0 Interface
[   1.0000000] usb0 at ehci0: USB revision 2.0
[   1.0000000] gtidmac0 at mvsoc0 unit 0 offset 0x60000-0x60fff: Marvell IDMA Controller/XOR Engine
[   1.0000000] gtidmac0: XOR Engine 4 channels, intr 5, 6, 7, 8
[   1.0000000] gttwsi0 at mvsoc0 unit 0 offset 0x11000-0x110ff irq 29: Marvell TWSI controller
[   1.0000000] iic0 at gttwsi0: I2C bus
[   1.0000000] mvcesa0 at mvsoc0 unit 0 offset 0x3d000-0x3dfff irq 22: Marvell Cryptographic Engines and Security Accelerator
[   1.0000000] mvgbec0 at mvsoc0 unit 0 offset 0x70000-0x73fff: Marvell Gigabit Ethernet Controller
[   1.0000000] mvgbe0 at mvgbec0 port 0 irq 11
[   1.0000000] mvgbe0: Ethernet address 00:50:43:01:83:dc
[   1.0000000] makphy0 at mvgbe0 phy 0: Marvell 88E1149 Gigabit PHY, rev. 3
[   1.0000000] makphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
[   1.0000000] mvgbec1 at mvsoc0 unit 1 offset 0x74000-0x77fff: Marvell Gigabit Ethernet Controller
[   1.0000000] mvgbe1 at mvgbec1 port 0 irq 15
[   1.0000000] mvgbe1: Ethernet address 00:50:43:01:83:dd
[   1.0000000] makphy1 at mvgbe1 phy 1: Marvell 88E1149 Gigabit PHY, rev. 3
[   1.0000000] makphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
[   1.0000000] mvpex0 at mvsoc0 unit 0 offset 0x40000-0x41fff irq 9: Marvell PCI Express Interface
[   1.0000000] pci0 at mvpex0
[   1.0000000] vendor 11ab product 6281 (miscellaneous memory, revision 0x03) at pci0 dev 0 function 0 not configured
[   1.0000000] mvsata0 at mvsoc0 unit 0 offset 0x80000-0x87fff irq 21: Marvell Serial-ATA Host Controller (SATAHC)
[   1.0000000] mvsata0: GenIIe, 1hc, 2port/hc
[   1.0000000] atabus0 at mvsata0 channel 0
[   1.0000000] atabus1 at mvsata0 channel 1
[   1.0000000] mvsdio0 at mvsoc0 unit 0 offset 0x90000-0x9ffff irq 28: Marvell Secure Digital Input/Output Interface
[   1.0000000] sdmmc0 at mvsdio0
[   1.0999663] uhub0 at usb0: NetBSD (0000) EHCI root hub (0000), class 9/0, rev 2.00/1.00, addr 1
[   1.1599699] mvsata0 port 0: device present, speed: 3.0Gb/s
[   1.1699690] sdmmc0: bad CIS ptr 0
[   1.1699690] sdmmc0: couldn't read CIS
[   1.1699690] sdmmc0: i/o init failed
[   1.1699690] sdmmc0: bad CIS ptr 0
[   1.1699690] sdmmc0: couldn't read CIS
[   1.1699690] sdmmc0: i/o init failed
[   1.1699690] "Marvell, 802.11 SDIO ID: 10, " (manufacturer 0x2df, product 0x9103) at sdmmc0 function 1 not configured
[   1.1699690] "Marvell, 802.11 SDIO ID: 10, " (manufacturer 0x2df, product 0x9103) at sdmmc0 function 2 not configured
[   1.2299726] wd0 at atabus0 drive 0
[   1.2299726] wd0: <ST3000DM001-1ER166>
[   1.2299726] wd0: 2794 GB, 5814021 cyl, 16 head, 63 sec, 512 bytes/sect x 5860533168 sectors
[   1.4699741] dk0 at wd0: "Guru-Root", 5856338797 blocks at 34, type: ffs
[   1.4699741] dk1 at wd0: "d5c3eff7-a35d-41b3-9c58-76679533f5cb", 4194304 blocks at 5856338831, type: swap
[   3.3999699] uhub1 at uhub0 port 1: vendor 05e3 (0x5e3) USB2.0 Hub (0x608), class 9/0, rev 2.00/77.60, addr 2
[   3.3999699] uhub1: single transaction translator
[   4.7199705] umass0 at uhub1 port 1 configuration 1 interface 0
[   4.7199705] umass0: vendor 05e3 (0x5e3) USB Storage (0x726), rev 2.00/99.09, addr 3
[   4.8599678] scsibus0 at umass0: 2 targets, 2 luns per target
[   4.8799703] sd0 at scsibus0 target 0 lun 0: <Generic, STORAGE DEVICE, 9909> disk removable
[   4.8799703] sd0: drive offline
[   4.8999735] sd1 at scsibus0 target 0 lun 1: <Generic, STORAGE DEVICE, 9909> disk removable
[   4.9099725] sd1: fabricating a geometry
[   4.9099725] sd1: 1886 MB, 1886 cyl, 64 head, 32 sec, 512 bytes/sect x 3862528 sectors
[   4.9299736] sd1: fabricating a geometry
[   5.8499700] uhub2 at uhub1 port 2: vendor 0409 (0x409) product 005a (0x5a), class 9/0, rev 2.00/1.00, addr 4
[   5.8499700] uhub2: single transaction translator
[   7.0699710] umcs0 at uhub2 port 1: vendor 9710 (0x9710) product 7840 (0x7840), rev 2.00/0.01, addr 5
[   7.0699710] ucom0 at umcs0 portno 0
[   7.0699710] ucom1 at umcs0 portno 1
[   7.0699710] ucom2 at umcs0 portno 2
[   7.0699710] ucom3 at umcs0 portno 3
[   8.0599717] umcs1 at uhub2 port 2: vendor 9710 (0x9710) product 7840 (0x7840), rev 2.00/0.01, addr 6
[   8.0599717] ucom4 at umcs1 portno 0
[   8.0599717] ucom5 at umcs1 portno 1
[   8.0599717] ucom6 at umcs1 portno 2
[   8.0599717] ucom7 at umcs1 portno 3
[   8.0599717] WARNING: 6 errors while detecting hardware; check system log.
[   8.0599717] boot device: <unknown>
[   8.0599717] root on dk0 dumps on dk1
[   8.1199719] kern.module.path=/stand/evbarm/8.99.34/modules


>How-To-Repeat:

Boot -current on this particular machine, wait a bit while some activity
happens with the hard disk.

>Fix:
n/a

NetBSD Home
NetBSD PR Database Search

(Contact us) $NetBSD: query-full-pr,v 1.43 2018/01/16 07:36:43 maya Exp $
$NetBSD: gnats_config.sh,v 1.9 2014/08/02 14:16:04 spz Exp $
Copyright © 1994-2017 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.