NetBSD Problem Report #54211

From www@netbsd.org  Fri May 17 10:28:27 2019
Return-Path: <www@netbsd.org>
Received: from mail.netbsd.org (mail.netbsd.org [199.233.217.200])
	(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
	(Client CN "mail.NetBSD.org", Issuer "mail.NetBSD.org CA" (not verified))
	by mollari.NetBSD.org (Postfix) with ESMTPS id CB9067A187
	for <gnats-bugs@gnats.NetBSD.org>; Fri, 17 May 2019 10:28:27 +0000 (UTC)
Message-Id: <20190517102826.2008F7A1BB@mollari.NetBSD.org>
Date: Fri, 17 May 2019 10:28:26 +0000 (UTC)
From: tobiasu@tmux.org
Reply-To: tobiasu@tmux.org
To: gnats-bugs@NetBSD.org
Subject: esiop stopped detecting boot disk
X-Send-Pr-Version: www-1.0

>Number:         54211
>Category:       port-sparc64
>Synopsis:       esiop stopped detecting boot disk
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    port-sparc64-maintainer
>State:          closed
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Fri May 17 10:30:00 +0000 2019
>Closed-Date:    Tue Jan 28 10:56:24 +0000 2020
>Last-Modified:  Tue Jan 28 10:56:24 +0000 2020
>Originator:     Tobias Ulmer
>Release:        NetBSD 8.99.40 (GENERIC) #0: Wed May 15 18:29:21 UTC 2019
>Organization:
>Environment:
>Description:
I'm having problems with (from the looks of it) esiop again. Couple of days ago I
wasted Martins time with a disk problem on Ultra 60 which resolved itself magically,
now it's back on another machine. I'm getting slightly paranoid here.

Upgraded the machine from .29. Now the boot disk isn't recognized at all.
Cold boot, warm boot, doesn't matter. I can hear it spin down at
[   1.9139548] scsibus0: waiting 2 seconds for devices.

Will setup netboot tonight, but for now that's all I have.


Sun Blade 2500, No Keyboard
Copyright 1998-2003 Sun Microsystems, Inc.  All rights reserved.
OpenBoot 4.9.5, 16384 MB memory installed, Serial #58218530.
Ethernet address 0:3:ba:78:58:22, Host ID: 83785822.



Boot device: disk  File and args:
NetBSD IEEE 1275 Multi-FS Bootblock
Version $NetBSD: bootblk.fth,v 1.15 2015/08/20 05:40:08 dholland Exp $
>> NetBSD/sparc64 OpenFirmware Boot, Revision 1.20 (Sun Sep  9 02:00:29 UTC 2018)
=0xa3b098
Loading netbsd: 10727512+478800+344296 [717912+470732]=0xdebc10
[   1.0000000] Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
[   1.0000000]     2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017,
[   1.0000000]     2018, 2019 The NetBSD Foundation, Inc.  All rights reserved.
[   1.0000000] Copyright (c) 1982, 1986, 1989, 1991, 1993
[   1.0000000]     The Regents of the University of California.  All rights reserved.

[   1.0000000] NetBSD 8.99.40 (GENERIC) #0: Wed May 15 18:29:21 UTC 2019
[   1.0000000]  mkrepro@mkrepro.NetBSD.org:/usr/src/sys/arch/sparc64/compile/GENERIC
[   1.0000000] total memory = 16384 MB
[   1.0000000] avail memory = 16070 MB
[   1.0000000] running cgd selftest aes-xts-256 aes-xts-512 done
[   1.0000000] mainbus0 (root): SUNW,Sun-Blade-2500 (Sun Blade 2500): hostid 83785822
[   1.0000000] cpu0 at mainbus0: SUNW,UltraSPARC-IIIi @ 1280 MHz, CPU id 0
[   1.0000000] cpu0: manuf 3e, impl 16, mask 24
[   1.0000000] cpu0: system tick frequency 12 MHz
[   1.0000000] cpu0: 32K instruction (32 b/l), 64K data (32 b/l), 1024K external (64 b/l)
[   1.0000000] cpu1 at mainbus0: SUNW,UltraSPARC-IIIi @ 1280 MHz, CPU id 1
[   1.0000000] cpu1: manuf 3e, impl 16, mask 24
[   1.0000000] cpu1: system tick frequency 12 MHz
[   1.0000000] cpu1: 32K instruction (32 b/l), 64K data (32 b/l), 1024K external (64 b/l)
[   1.0000000] memory-controller at mainbus0 not configured
[   1.0000000] memory-controller at mainbus0 not configured
[   1.0000000] schizo0 at mainbus0: addr 4000e600000: Tomatillo, version 4, ign 700, bus A 0 to 0
[   1.0000000] schizo0:  pci0 at schizo0
[   1.0000000] bge0 at pci0 dev 3 function 0: Broadcom BCM5703 Gigabit Ethernet
[   1.0000000] bge0: interrupting at ivec 371c
[   1.0000000] bge0: HW config 00000000, 00000000, 00000000, 00000000 00000000
[   1.0000000] bge0: ASIC BCM5702/5703 A2 (0x1002), Ethernet address 00:03:ba:78:58:22
[   1.0000000] brgphy0 at bge0 phy 1: BCM5703 1000BASE-T media interface, rev. 2
[   1.0000000] brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
[   1.0000000] ppm at mainbus0 not configured
[   1.0000000] schizo1 at mainbus0: addr 4000ef00000: Tomatillo, version 4, ign 740, bus B 0 to 0
[   1.0000000] schizo1:  pci1 at schizo1
[   1.0000000] esiop0 at pci1 dev 4 function 0: Symbios Logic 53c1010-66 (ultra3-wide scsi)
[   1.0000000] esiop0: using on-board RAM
[   1.0000000] esiop0: interrupting at ivec 1f69
[   1.0000000] scsibus0 at esiop0: 16 targets, 8 luns per target
[   1.0000000] esiop1 at pci1 dev 4 function 1: Symbios Logic 53c1010-66 (ultra3-wide scsi)
[   1.0000000] esiop1: using on-board RAM
[   1.0000000] esiop1: interrupting at ivec 1f68
[   1.0000000] scsibus1 at esiop1: 16 targets, 8 luns per target
[   1.0000000] schizo2 at mainbus0: addr 4000f600000: Tomatillo, version 4, ign 780, bus A 0 to 1
[   1.0000000] schizo2:  pci2 at schizo2
[   1.0000000] ebus0 at pci2 dev 7 function 0: Acer Labs M1533 PCI-ISA Bridge, revision 0x00
[   1.0000000] flashprom at ebus0 addr 0-fffff not configured
[   1.0000000] rtc0 at ebus0 addr 70-71: mc146818 compatible time-of-day clock: m5819p
[   1.0000000] pcfiic0 at ebus0 addr 320-321 ipl 2e
[   1.0000000] iic0 at pcfiic0: I2C bus
[   1.0000000] i2c-bridge at iic0 addr 0x09 not configured
[   1.0000000] admtemp0 at iic0 addr 0x18: LM84 temperature sensor
[   1.0000000] dbcool0 at iic0 addr 0x2c
[   1.0000000] dbcool0: ADM1031 dBCool(tm) Controller (rev 0x0083)
[   1.0000000] dbcool1 at iic0 addr 0x2e
[   1.0000000] dbcool1: ADM1031 dBCool(tm) Controller (rev 0x0083)
[   1.0000000] gpio at iic0 addr 0x37 not configured
[   1.0000000] lmtemp0 at iic0 addr 0x4e: gpio LM75 Temperature Sensor
[   1.0000000] lmtemp0: unable to read Tos register
[   1.0000000] seeprom0 at iic0 addr 0x50: audio-card-fru-prom: size 8192
[   1.0000000] seeprom1 at iic0 addr 0x51: motherboard-fru-prom: size 8192
[   1.0000000] seeprom2 at iic0 addr 0x54: scsi-backplane-fru-prom: size 8192
[   1.0000000] spdmem0 at iic0 addr 0x5b
[   1.0000000] spdmem0: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
[   1.0000000] spdmem1 at iic0 addr 0x5c
[   1.0000000] spdmem1: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
[   1.0000000] spdmem2 at iic0 addr 0x5d
[   1.0000000] spdmem2: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
[   1.0000000] spdmem3 at iic0 addr 0x5e
[   1.0000000] spdmem3: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
[   1.0000000] spdmem4 at iic0 addr 0x63
[   1.0000000] spdmem4: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
[   1.0000000] spdmem5 at iic0 addr 0x64
[   1.0000000] spdmem5: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
[   1.0000000] spdmem6 at iic0 addr 0x65
[   1.0000000] spdmem6: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
[   1.0000000] spdmem7 at iic0 addr 0x66
[   1.0000000] spdmem7: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
[   1.0000000] clock-generator at iic0 addr 0x69 not configured
[   1.0000000] power at ebus0 addr 800-82f ipl 20 not configured
[   1.0000000] com0 at ebus0 addr 3f8-3ff ipl 2c: ns16550a, working fifo
[   1.0000000] com0: console
[   1.0000000] com1 at ebus0 addr 2e8-2ef ipl 2c: ns16550a, working fifo
[   1.0000000] dma at ebus0 addr 0-ffff not configured
[   1.0000000] alipm0 at pci2 dev 6 function 0: 223KHz clock
[   1.0000000] iic1 at alipm0: I2C bus
[   1.0000000] card-reader at iic1 addr 0x20 not configured
[   1.0000000] autri0 at pci2 dev 8 function 0: Acer Labs M5451 AC-Link Controller Audio Device (rev. 0x02)
[   1.0000000] autri0: interrupting at ivec 7a4
[   1.0000000] autri0: ac97: Analog Devices AD1881A codec; headphone, Analog Devices Phat Stereo
[   1.0000000] audio0 at autri0: full duplex, playback, capture, mmap, independent
[   1.0000000] audio0: slinear_be:16 -> slinear_le:16 2ch 48000Hz, blk 40ms for playback
[   1.0000000] audio0: slinear_be:16 <- slinear_le:16 2ch 48000Hz, blk 40ms for recording
[   1.0000000] spkr0 at audio0: PC Speaker (synthesized)
[   1.0000000] wsbell at spkr0 not configured
[   1.0000000] midi0 at autri0: 4DWAVE MIDI UART
[   1.0000000] ohci0 at pci2 dev 10 function 0: Acer Labs M5237 USB 1.1 Host Controller (rev. 0x03)
[   1.0000000] ohci0: interrupting at ivec 7a7
[   1.0000000] ohci0: OHCI version 1.0, legacy support
[   1.0000000] usb0 at ohci0: USB revision 1.0
[   1.0000000] ohci1 at pci2 dev 11 function 0: Acer Labs M5237 USB 1.1 Host Controller (rev. 0x03)
[   1.0000000] ohci1: interrupting at ivec 7a6
[   1.0000000] ohci1: OHCI version 1.0, legacy support
[   1.0000000] usb1 at ohci1: USB revision 1.0
[   1.0000000] aceride0 at pci2 dev 13 function 0: Acer Labs M5229 UDMA IDE Controller (rev. 0xc4)
[   1.0000000] aceride0: using ivec 1f98 for native-PCI interrupt
[   1.0000000] atabus0 at aceride0 channel 0
[   1.0000000] atabus1 at aceride0 channel 1
[   1.0000000] ppb0 at pci2 dev 4 function 0: Texas Instruments product ac23 (rev. 0x02)
[   1.0000000] pci3 at ppb0 bus 1
[   1.0000000] ohci2 at pci3 dev 8 function 0: NEC USB Host Controller (rev. 0x43)
[   1.0000000] ohci2: interrupting at ivec 794
[   1.0000000] ohci2: OHCI version 1.0
[   1.0000000] usb2 at ohci2: USB revision 1.0
[   1.0000000] ohci3 at pci3 dev 8 function 1: NEC USB Host Controller (rev. 0x43)
[   1.0000000] ohci3: interrupting at ivec 795
[   1.0000000] ohci3: OHCI version 1.0
[   1.0000000] usb3 at ohci3: USB revision 1.0
[   1.0000000] ehci0 at pci3 dev 8 function 2: NEC USB2 Host Controller (rev. 0x04)
[   1.0000000] ehci0: interrupting at ivec 796
[   1.0000000] ehci0: 2 companion controllers, 3 ports each: ohci2 ohci3
[   1.0000000] usb4 at ehci0: USB revision 2.0
[   1.0000000] fwohci0 at pci3 dev 11 function 0: Texas Instruments TSB43AA23 IEEE 1394 Host Controller (rev. 0x00)
[   1.0000000] fwohci0: interrupting at ivec 797
[   1.0000000] fwohci0: OHCI version 1.10 (ROM=1)
[   1.0000000] fwohci0: No. of Isochronous channels is 4.
[   1.0000000] fwohci0: EUI64 00:05:16:00:00:71:1e:d4
[   1.0000000] fwohci0: Phy 1394a available S400, 3 ports.
[   1.0000000] fwohci0: Link S400, max_rec 2048 bytes.
[   1.0000000] ieee1394if0 at fwohci0: IEEE1394 bus
[   1.0000000] fwip0 at ieee1394if0: IP over IEEE1394
[   1.0000000] fwohci0: Initiate bus reset
[   1.0000000] ppm at mainbus0 not configured
[   1.0000000] schizo3 at mainbus0: addr 4000ff00000: Tomatillo, version 4, ign 7c0, bus B 0 to 0
[   1.0000000] schizo3:  pci4 at schizo3
[   1.0000000] radeonfb0 at pci4 dev 2 function 0: ATI Technologies Radeon 7000/VE QY (rev. 0x00)
[   1.0000000] radeonfb0: 64 MB aperture at 0x08000000, 64 KB registers at 0x00100000
[   1.0000000] radeonfb0: display 0: initial virtual resolution 640x480 at 8 bpp
[   1.0000000] radeonfb0: using 32 MB per display
[   1.0000000] radeonfb0: port 0: physical 1024x768 60Hz
[   1.0000000] radeonfb0: port 1: physical 1024x768 60Hz
[   1.0000000] wsdisplay1 at radeonfb0 kbdmux 1
[   1.0000000] drm at radeonfb0 not configured
[   1.0000000] jbusi2c0 at mainbus0: addr 4000fc64000
[   1.0000000] iic2 at jbusi2c0: I2C bus
[   1.0000000] seeprom3 at iic2 addr 0x57: nvram: size 8192
[   1.0000000] pcons at mainbus0 not configured
[   1.0000000] fwohci0: BUS reset
[   1.0000000] fwohci0: node_id=0xc800ffc0, gen=1, CYCLEMASTER mode
[   1.0000000] ieee1394if0: 1 nodes, maxhop <= 0 cable IRM irm(0) (me)
[   1.0000000] ieee1394if0: bus manager 0
[   1.0000156] No counter-timer -- using %stick at 12MHz as system clock.
[   1.9139548] scsibus0: waiting 2 seconds for devices to settle...
[   1.9139548] scsibus1: waiting 2 seconds for devices to settle...
[   2.1300957] uhub0 at usb0: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
[   2.2400946] uhub1 at usb1: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
[   2.3447490] uhub2 at usb2: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
[   2.4400945] uhub3 at usb3: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
[   2.5527003] uhub4 at usb4: NetBSD (0000) EHCI root hub (0000), class 9/0, rev 2.00/1.00, addr 1
[   8.6200948] WARNING: 1 error while detecting hardware; check system log.
[   8.7074139] FATAL: boot device not found, check your firmware settings!
[   8.7864336] root device:
[   8.9106096] use one of: bge0 fwip0 ddb halt reboot
[   8.9678703] root device:


>How-To-Repeat:

>Fix:

>Release-Note:

>Audit-Trail:
From: Tobias Ulmer <tobiasu@tmux.org>
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: port-sparc64/54211: esiop stopped detecting boot disk
Date: Sat, 18 May 2019 03:02:29 +0200

 On Fri, May 17, 2019 at 10:30:01AM +0000, tobiasu@tmux.org wrote:
 > Will setup netboot tonight, but for now that's all I have.

 No change with latest INSTALL 8.99.41

 Dmesg below from an older INSTALL kernel I had sitting around.
 Not only is sd0 back (and working), but also atapibus0 and cd0


 [   1.0000000] NetBSD 8.99.25 (INSTALL) #0: Sun Sep  9 02:00:29 UTC 2018
 [   1.0000000]  mkrepro@mkrepro.NetBSD.org:/usr/src/sys/arch/sparc64/compile/INSTALL
 [   1.0000000] total memory = 16384 MB
 [   1.0000000] avail memory = 16062 MB
 [   1.0000000] running cgd selftest aes-xts-256 aes-xts-512 done
 [   1.0000000] mainbus0 (root): SUNW,Sun-Blade-2500 (Sun Blade 2500): hostid 83785822
 [   1.0000000] cpu0 at mainbus0: SUNW,UltraSPARC-IIIi @ 1280 MHz, CPU id 0
 [   1.0000000] cpu0: manuf 3e, impl 16, mask 24
 [   1.0000000] cpu0: system tick frequency 12 MHz
 [   1.0000000] cpu0: 32K instruction (32 b/l), 64K data (32 b/l), 1024K external (64 b/l)
 [   1.0000000] cpu1 at mainbus0: SUNW,UltraSPARC-IIIi @ 1280 MHz, CPU id 1
 [   1.0000000] cpu1: manuf 3e, impl 16, mask 24
 [   1.0000000] cpu1: system tick frequency 12 MHz
 [   1.0000000] cpu1: 32K instruction (32 b/l), 64K data (32 b/l), 1024K external (64 b/l)
 [   1.0000000] memory-controller at mainbus0 not configured
 [   1.0000000] memory-controller at mainbus0 not configured
 [   1.0000000] schizo0 at mainbus0: addr 4000e600000: Tomatillo, version 4, ign 700, bus A 0 to 0
 [   1.0000000] schizo0:  pci0 at schizo0
 [   1.0000000] bge0 at pci0 dev 3 function 0: Broadcom BCM5703 Gigabit Ethernet
 [   1.0000000] bge0: interrupting at ivec 371c
 [   1.0000000] bge0: HW config 00000000, 00000000, 00000000, 00000000 00000000
 [   1.0000000] bge0: ASIC BCM5702/5703 A2 (0x1002), Ethernet address 00:03:ba:78:58:22
 [   1.0000000] brgphy0 at bge0 phy 1: BCM5703 1000BASE-T media interface, rev. 2
 [   1.0000000] brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
 [   1.0000000] ppm at mainbus0 not configured
 [   1.0000000] schizo1 at mainbus0: addr 4000ef00000: Tomatillo, version 4, ign 740, bus B 0 to 0
 [   1.0000000] schizo1:  pci1 at schizo1
 [   1.0000000] esiop0 at pci1 dev 4 function 0: Symbios Logic 53c1010-66 (ultra3-wide scsi)
 [   1.0000000] esiop0: using on-board RAM
 [   1.0000000] esiop0: interrupting at ivec 1f69
 [   1.0000000] scsibus0 at esiop0: 16 targets, 8 luns per target
 [   1.0000000] esiop1 at pci1 dev 4 function 1: Symbios Logic 53c1010-66 (ultra3-wide scsi)
 [   1.0000000] esiop1: using on-board RAM
 [   1.0000000] esiop1: interrupting at ivec 1f68
 [   1.0000000] scsibus1 at esiop1: 16 targets, 8 luns per target
 [   1.0000000] schizo2 at mainbus0: addr 4000f600000: Tomatillo, version 4, ign 780, bus A 0 to 1
 [   1.0000000] schizo2:  pci2 at schizo2
 [   1.0000000] ebus0 at pci2 dev 7 function 0: Acer Labs M1533 PCI-ISA Bridge, revision 0x00
 [   1.0000000] flashprom at ebus0 addr 0-fffff not configured
 [   1.0000000] rtc0 at ebus0 addr 70-71: mc146818 compatible time-of-day clock: m5819p
 [   1.0000000] pcfiic0 at ebus0 addr 320-321 ipl 2e
 [   1.0000000] iic0 at pcfiic0: I2C bus
 [   1.0000000] i2c-bridge at iic0 addr 0x09 not configured
 [   1.0000000] gpio at iic0 addr 0x18 not configured
 [   1.0000000] dbcool0 at iic0 addr 0x2c
 [   1.0000000] dbcool0: ADM1031 dBCool(tm) Controller (rev 0x0083)
 [   1.0000000] dbcool1 at iic0 addr 0x2e
 [   1.0000000] dbcool1: ADM1031 dBCool(tm) Controller (rev 0x0083)
 [   1.0000000] gpio at iic0 addr 0x37 not configured
 [   1.0000000] gpio at iic0 addr 0x4e not configured
 [   1.0000000] seeprom0 at iic0 addr 0x50: audio-card-fru-prom: size 8192
 [   1.0000000] seeprom1 at iic0 addr 0x51: motherboard-fru-prom: size 8192
 [   1.0000000] seeprom2 at iic0 addr 0x54: scsi-backplane-fru-prom: size 8192
 [   1.0000000] spdmem0 at iic0 addr 0x5b
 [   1.0000000] spdmem0: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] spdmem1 at iic0 addr 0x5c
 [   1.0000000] spdmem1: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] spdmem2 at iic0 addr 0x5d
 [   1.0000000] spdmem2: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] spdmem3 at iic0 addr 0x5e
 [   1.0000000] spdmem3: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] spdmem4 at iic0 addr 0x63
 [   1.0000000] spdmem4: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] spdmem5 at iic0 addr 0x64
 [   1.0000000] spdmem5: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] spdmem6 at iic0 addr 0x65
 [   1.0000000] spdmem6: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] spdmem7 at iic0 addr 0x66
 [   1.0000000] spdmem7: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] clock-generator at iic0 addr 0x69 not configured
 [   1.0000000] power at ebus0 addr 800-82f ipl 20 not configured
 [   1.0000000] com0 at ebus0 addr 3f8-3ff ipl 2c: ns16550a, working fifo
 [   1.0000000] com0: console
 [   1.0000000] com1 at ebus0 addr 2e8-2ef ipl 2c: ns16550a, working fifo
 [   1.0000000] dma at ebus0 addr 0-ffff not configured
 [   1.0000000] alipm0 at pci2 dev 6 function 0: 223KHz clock
 [   1.0000000] iic1 at alipm0: I2C bus
 [   1.0000000] card-reader at iic1 addr 0x20 not configured
 [   1.0000000] autri0 at pci2 dev 8 function 0: Acer Labs M5451 AC-Link Controller Audio Device (rev. 0x02)
 [   1.0000000] autri0: interrupting at ivec 7a4
 [   1.0000000] autri0: ac97: Analog Devices AD1881A codec; headphone, Analog Devices Phat Stereo
 [   1.0000000] audio0 at autri0: full duplex, playback, capture, mmap, independent
 [   1.0000000] autri0: Virtual format configured - Format SLINEAR, precision 16, channels 2, frequency 48000
 [   1.0000000] autri0: Latency: 256 milliseconds
 [   1.0000000] spkr0 at audio0: PC Speaker (synthesized)
 [   1.0000000] wsbell at spkr0 not configured
 [   1.0000000] midi0 at autri0: 4DWAVE MIDI UART
 [   1.0000000] ohci0 at pci2 dev 10 function 0: Acer Labs M5237 USB 1.1 Host Controller (rev. 0x03)
 [   1.0000000] ohci0: interrupting at ivec 7a7
 [   1.0000000] ohci0: OHCI version 1.0, legacy support
 [   1.0000000] usb0 at ohci0: USB revision 1.0
 [   1.0000000] ohci1 at pci2 dev 11 function 0: Acer Labs M5237 USB 1.1 Host Controller (rev. 0x03)
 [   1.0000000] ohci1: interrupting at ivec 7a6
 [   1.0000000] ohci1: OHCI version 1.0, legacy support
 [   1.0000000] usb1 at ohci1: USB revision 1.0
 [   1.0000000] aceride0 at pci2 dev 13 function 0: Acer Labs M5229 UDMA IDE Controller (rev. 0xc4)
 [   1.0000000] aceride0: using ivec 1f98 for native-PCI interrupt
 [   1.0000000] atabus0 at aceride0 channel 0
 [   1.0000000] atabus1 at aceride0 channel 1
 [   1.0000000] ppb0 at pci2 dev 4 function 0: Texas Instruments product ac23 (rev. 0x02)
 [   1.0000000] pci3 at ppb0 bus 1
 [   1.0000000] ohci2 at pci3 dev 8 function 0: NEC USB Host Controller (rev. 0x43)
 [   1.0000000] ohci2: interrupting at ivec 794
 [   1.0000000] ohci2: OHCI version 1.0
 [   1.0000000] usb2 at ohci2: USB revision 1.0
 [   1.0000000] ohci3 at pci3 dev 8 function 1: NEC USB Host Controller (rev. 0x43)
 [   1.0000000] ohci3: interrupting at ivec 795
 [   1.0000000] ohci3: OHCI version 1.0
 [   1.0000000] usb3 at ohci3: USB revision 1.0
 [   1.0000000] ehci0 at pci3 dev 8 function 2: NEC USB2 Host Controller (rev. 0x04)
 [   1.0000000] ehci0: interrupting at ivec 796
 [   1.0000000] ehci0: 2 companion controllers, 3 ports each: ohci2 ohci3
 [   1.0000000] usb4 at ehci0: USB revision 2.0
 [   1.0000000] fwohci0 at pci3 dev 11 function 0: Texas Instruments TSB43AA23 IEEE 1394 Host Controller (rev. 0x00)
 [   1.0000000] fwohci0: interrupting at ivec 797
 [   1.0000000] fwohci0: OHCI version 1.10 (ROM=1)
 [   1.0000000] fwohci0: No. of Isochronous channels is 4.
 [   1.0000000] fwohci0: EUI64 00:05:16:00:00:71:1e:d4
 [   1.0000000] fwohci0: Phy 1394a available S400, 3 ports.
 [   1.0000000] fwohci0: Link S400, max_rec 2048 bytes.
 [   1.0000000] ieee1394if0 at fwohci0: IEEE1394 bus
 [   1.0000000] fwip0 at ieee1394if0: IP over IEEE1394
 [   1.0000000] fwohci0: Initiate bus reset
 [   1.0000000] ppm at mainbus0 not configured
 [   1.0000000] schizo3 at mainbus0: addr 4000ff00000: Tomatillo, version 4, ign 7c0, bus B 0 to 0
 [   1.0000000] schizo3:  pci4 at schizo3
 [   1.0000000] radeonfb0 at pci4 dev 2 function 0: ATI Technologies Radeon 7000/VE QY (rev. 0x00)
 [   1.0000000] radeonfb0: 64 MB aperture at 0x08000000, 64 KB registers at 0x00100000
 [   1.0000000] radeonfb0: display 0: initial virtual resolution 640x480 at 8 bpp
 [   1.0000000] radeonfb0: using 32 MB per display
 [   1.0000000] radeonfb0: port 0: physical 1024x768 60Hz
 [   1.0000000] radeonfb0: port 1: physical 1024x768 60Hz
 [   1.0000000] wsdisplay1 at radeonfb0 kbdmux 1
 [   1.0000000] drm at radeonfb0 not configured
 [   1.0000000] i2c at mainbus0 not configured
 [   1.0000000] pcons at mainbus0 not configured
 [   1.0000000] fwohci0: BUS reset
 [   1.0000000] fwohci0: node_id=0xc800ffc0, gen=1, CYCLEMASTER mode
 [   1.0000000] ieee1394if0: 1 nodes, maxhop <= 0 cable IRM irm(0) (me)
 [   1.0000000] ieee1394if0: bus manager 0
 [   1.0000188] No counter-timer -- using %stick at 12MHz as system clock.
 [   1.9139561] scsibus0: waiting 2 seconds for devices to settle...
 [   1.9139561] scsibus1: waiting 2 seconds for devices to settle...
 [   2.0739486] atapibus0 at atabus1: 2 targets
 [   2.1239487] uhub0 at usb0: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
 [   2.2239471] uhub1 at usb1: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
 [   2.3239471] uhub2 at usb2: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
 [   2.4339471] cd0 at atapibus0 drive 0: <JLMS XJ-HD166S, , D3S4> cdrom removable
 [   2.5215748] uhub3 at usb3: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
 [   2.6239471] uhub4 at usb4: NetBSD (0000) EHCI root hub (0000), class 9/0, rev 2.00/1.00, addr 1
 [   4.4839472] sd0 at scsibus0 target 0 lun 0: <ModusLnk, MXJ3147SC800600W, M108> disk fixed
 [   4.5790520] sd0: 136 GB, 78753 cyl, 5 head, 729 sec, 512 bytes/sect x 287277984 sectors
 [   4.6739470] sd0: sync (12.50ns offset 62), 16-bit (160.000MB/s) transfers, tagged queueing
 [   9.0539495] root on md0a dumps on md0b
 [   9.1039474] root file system type: ffs
 [   9.1439470] kern.module.path=/stand/sparc64/8.99.25/modules

From: Martin Husemann <martin@duskware.de>
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: port-sparc64/54211: esiop stopped detecting boot disk
Date: Sat, 18 May 2019 17:46:44 +0200

 Are you using a self build kernel or one from the auto builds?

 Martin

From: Tobias Ulmer <tobiasu@tmux.org>
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: port-sparc64/54211: esiop stopped detecting boot disk
Date: Sat, 18 May 2019 18:47:19 +0200

 On Sat, May 18, 2019 at 03:50:01PM +0000, Martin Husemann wrote:
 > The following reply was made to PR port-sparc64/54211; it has been noted by GNATS.
 > 
 > From: Martin Husemann <martin@duskware.de>
 > To: gnats-bugs@netbsd.org
 > Cc: 
 > Subject: Re: port-sparc64/54211: esiop stopped detecting boot disk
 > Date: Sat, 18 May 2019 17:46:44 +0200
 > 
 >  Are you using a self build kernel or one from the auto builds?

 auto build.

 Loading netbsd-INSTALL.gz: 9666680+9916240+344280=0x15c9870
 [   1.0000000] ksyms: Symbol table not found
 [   1.0000000] ksyms: String table not found
 [   1.0000000] ksyms: Perhaps the kernel is stripped?
 [   1.0000000] Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
 [   1.0000000]     2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017,
 [   1.0000000]     2018, 2019 The NetBSD Foundation, Inc.  All rights reserved.
 [   1.0000000] Copyright (c) 1982, 1986, 1989, 1991, 1993
 [   1.0000000]     The Regents of the University of California.  All rights reserved.

 [   1.0000000] NetBSD 8.99.41 (INSTALL) #0: Fri May 17 23:51:35 UTC 2019
 [   1.0000000]  mkrepro@mkrepro.NetBSD.org:/usr/src/sys/arch/sparc64/compile/INSTALL
 [   1.0000000] total memory = 16384 MB
 [   1.0000000] avail memory = 16062 MB
 [   1.0000000] running cgd selftest aes-xts-256 aes-xts-512 done
 [   1.0000000] mainbus0 (root): SUNW,Sun-Blade-2500 (Sun Blade 2500): hostid 83785822
 [   1.0000000] cpu0 at mainbus0: SUNW,UltraSPARC-IIIi @ 1280 MHz, CPU id 0
 [   1.0000000] cpu0: manuf 3e, impl 16, mask 24
 [   1.0000000] cpu0: system tick frequency 12 MHz
 [   1.0000000] cpu0: 32K instruction (32 b/l), 64K data (32 b/l), 1024K external (64 b/l)
 [   1.0000000] cpu1 at mainbus0: SUNW,UltraSPARC-IIIi @ 1280 MHz, CPU id 1
 [   1.0000000] cpu1: manuf 3e, impl 16, mask 24
 [   1.0000000] cpu1: system tick frequency 12 MHz
 [   1.0000000] cpu1: 32K instruction (32 b/l), 64K data (32 b/l), 1024K external (64 b/l)
 [   1.0000000] memory-controller at mainbus0 not configured
 [   1.0000000] memory-controller at mainbus0 not configured
 [   1.0000000] schizo0 at mainbus0: addr 4000e600000: Tomatillo, version 4, ign 700, bus A 0 to 0
 [   1.0000000] schizo0:  pci0 at schizo0
 [   1.0000000] bge0 at pci0 dev 3 function 0: Broadcom BCM5703 Gigabit Ethernet
 [   1.0000000] bge0: interrupting at ivec 371c
 [   1.0000000] bge0: HW config 00000000, 00000000, 00000000, 00000000 00000000
 [   1.0000000] bge0: ASIC BCM5702/5703 A2 (0x1002), Ethernet address 00:03:ba:78:58:22
 [   1.0000000] brgphy0 at bge0 phy 1: BCM5703 1000BASE-T media interface, rev. 2
 [   1.0000000] brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
 [   1.0000000] ppm at mainbus0 not configured
 [   1.0000000] schizo1 at mainbus0: addr 4000ef00000: Tomatillo, version 4, ign 740, bus B 0 to 0
 [   1.0000000] schizo1:  pci1 at schizo1
 [   1.0000000] esiop0 at pci1 dev 4 function 0: Symbios Logic 53c1010-66 (ultra3-wide scsi)
 [   1.0000000] esiop0: using on-board RAM
 [   1.0000000] esiop0: interrupting at ivec 1f69
 [   1.0000000] scsibus0 at esiop0: 16 targets, 8 luns per target
 [   1.0000000] esiop1 at pci1 dev 4 function 1: Symbios Logic 53c1010-66 (ultra3-wide scsi)
 [   1.0000000] esiop1: using on-board RAM
 [   1.0000000] esiop1: interrupting at ivec 1f68
 [   1.0000000] scsibus1 at esiop1: 16 targets, 8 luns per target
 [   1.0000000] schizo2 at mainbus0: addr 4000f600000: Tomatillo, version 4, ign 780, bus A 0 to 1
 [   1.0000000] schizo2:  pci2 at schizo2
 [   1.0000000] ebus0 at pci2 dev 7 function 0: Acer Labs M1533 PCI-ISA Bridge, revision 0x00
 [   1.0000000] flashprom at ebus0 addr 0-fffff not configured
 [   1.0000000] rtc0 at ebus0 addr 70-71: mc146818 compatible time-of-day clock: m5819p
 [   1.0000000] pcfiic0 at ebus0 addr 320-321 ipl 2e
 [   1.0000000] iic0 at pcfiic0: I2C bus
 [   1.0000000] i2c-bridge at iic0 addr 0x09 not configured
 [   1.0000000] admtemp0 at iic0 addr 0x18: LM84 temperature sensor
 [   1.0000000] dbcool0 at iic0 addr 0x2c
 [   1.0000000] dbcool0: ADM1031 dBCool(tm) Controller (rev 0x0083)
 [   1.0000000] dbcool1 at iic0 addr 0x2e
 [   1.0000000] dbcool1: ADM1031 dBCool(tm) Controller (rev 0x0083)
 [   1.0000000] gpio at iic0 addr 0x37 not configured
 [   1.0000000] lmtemp0 at iic0 addr 0x4e: gpio LM75 Temperature Sensor
 [   1.0000000] lmtemp0: unable to read Tos register
 [   1.0000000] seeprom0 at iic0 addr 0x50: audio-card-fru-prom: size 8192
 [   1.0000000] seeprom1 at iic0 addr 0x51: motherboard-fru-prom: size 8192
 [   1.0000000] seeprom2 at iic0 addr 0x54: scsi-backplane-fru-prom: size 8192
 [   1.0000000] spdmem0 at iic0 addr 0x5b
 [   1.0000000] spdmem0: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] spdmem1 at iic0 addr 0x5c
 [   1.0000000] spdmem1: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] spdmem2 at iic0 addr 0x5d
 [   1.0000000] spdmem2: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] spdmem3 at iic0 addr 0x5e
 [   1.0000000] spdmem3: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] spdmem4 at iic0 addr 0x63
 [   1.0000000] spdmem4: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] spdmem5 at iic0 addr 0x64
 [   1.0000000] spdmem5: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] spdmem6 at iic0 addr 0x65
 [   1.0000000] spdmem6: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] spdmem7 at iic0 addr 0x66
 [   1.0000000] spdmem7: DDR SDRAM (registered), data ECC, 2GB, 400MHz (PC-3200)
 [   1.0000000] clock-generator at iic0 addr 0x69 not configured
 [   1.0000000] power at ebus0 addr 800-82f ipl 20 not configured
 [   1.0000000] com0 at ebus0 addr 3f8-3ff ipl 2c: ns16550a, working fifo
 [   1.0000000] com0: console
 [   1.0000000] com1 at ebus0 addr 2e8-2ef ipl 2c: ns16550a, working fifo
 [   1.0000000] dma at ebus0 addr 0-ffff not configured
 [   1.0000000] alipm0 at pci2 dev 6 function 0: 223KHz clock
 [   1.0000000] iic1 at alipm0: I2C bus
 [   1.0000000] card-reader at iic1 addr 0x20 not configured
 [   1.0000000] autri0 at pci2 dev 8 function 0: Acer Labs M5451 AC-Link Controller Audio Device (rev. 0x02)
 [   1.0000000] autri0: interrupting at ivec 7a4
 [   1.0000000] autri0: ac97: Analog Devices AD1881A codec; headphone, Analog Devices Phat Stereo
 [   1.0000000] audio0 at autri0: full duplex, playback, capture, mmap, independent
 [   1.0000000] audio0: slinear_be:16 -> slinear_le:16 2ch 48000Hz, blk 40ms for playback
 [   1.0000000] audio0: slinear_be:16 <- slinear_le:16 2ch 48000Hz, blk 40ms for recording
 [   1.0000000] spkr0 at audio0: PC Speaker (synthesized)
 [   1.0000000] wsbell at spkr0 not configured
 [   1.0000000] midi0 at autri0: 4DWAVE MIDI UART
 [   1.0000000] ohci0 at pci2 dev 10 function 0: Acer Labs M5237 USB 1.1 Host Controller (rev. 0x03)
 [   1.0000000] ohci0: interrupting at ivec 7a7
 [   1.0000000] ohci0: OHCI version 1.0, legacy support
 [   1.0000000] usb0 at ohci0: USB revision 1.0
 [   1.0000000] ohci1 at pci2 dev 11 function 0: Acer Labs M5237 USB 1.1 Host Controller (rev. 0x03)
 [   1.0000000] ohci1: interrupting at ivec 7a6
 [   1.0000000] ohci1: OHCI version 1.0, legacy support
 [   1.0000000] usb1 at ohci1: USB revision 1.0
 [   1.0000000] aceride0 at pci2 dev 13 function 0: Acer Labs M5229 UDMA IDE Controller (rev. 0xc4)
 [   1.0000000] aceride0: using ivec 1f98 for native-PCI interrupt
 [   1.0000000] atabus0 at aceride0 channel 0
 [   1.0000000] atabus1 at aceride0 channel 1
 [   1.0000000] ppb0 at pci2 dev 4 function 0: Texas Instruments product ac23 (rev. 0x02)
 [   1.0000000] pci3 at ppb0 bus 1
 [   1.0000000] ohci2 at pci3 dev 8 function 0: NEC USB Host Controller (rev. 0x43)
 [   1.0000000] ohci2: interrupting at ivec 794
 [   1.0000000] ohci2: OHCI version 1.0
 [   1.0000000] usb2 at ohci2: USB revision 1.0
 [   1.0000000] ohci3 at pci3 dev 8 function 1: NEC USB Host Controller (rev. 0x43)
 [   1.0000000] ohci3: interrupting at ivec 795
 [   1.0000000] ohci3: OHCI version 1.0
 [   1.0000000] usb3 at ohci3: USB revision 1.0
 [   1.0000000] ehci0 at pci3 dev 8 function 2: NEC USB2 Host Controller (rev. 0x04)
 [   1.0000000] ehci0: interrupting at ivec 796
 [   1.0000000] ehci0: 2 companion controllers, 3 ports each: ohci2 ohci3
 [   1.0000000] usb4 at ehci0: USB revision 2.0
 [   1.0000000] fwohci0 at pci3 dev 11 function 0: Texas Instruments TSB43AA23 IEEE 1394 Host Controller (rev. 0x00)
 [   1.0000000] fwohci0: interrupting at ivec 797
 [   1.0000000] fwohci0: OHCI version 1.10 (ROM=1)
 [   1.0000000] fwohci0: No. of Isochronous channels is 4.
 [   1.0000000] fwohci0: EUI64 00:05:16:00:00:71:1e:d4
 [   1.0000000] fwohci0: Phy 1394a available S400, 3 ports.
 [   1.0000000] fwohci0: Link S400, max_rec 2048 bytes.
 [   1.0000000] ieee1394if0 at fwohci0: IEEE1394 bus
 [   1.0000000] fwip0 at ieee1394if0: IP over IEEE1394
 [   1.0000000] fwohci0: Initiate bus reset
 [   1.0000000] ppm at mainbus0 not configured
 [   1.0000000] schizo3 at mainbus0: addr 4000ff00000: Tomatillo, version 4, ign 7c0, bus B 0 to 0
 [   1.0000000] schizo3:  pci4 at schizo3
 [   1.0000000] radeonfb0 at pci4 dev 2 function 0: ATI Technologies Radeon 7000/VE QY (rev. 0x00)
 [   1.0000000] radeonfb0: 64 MB aperture at 0x08000000, 64 KB registers at 0x00100000
 [   1.0000000] radeonfb0: display 0: initial virtual resolution 640x480 at 8 bpp
 [   1.0000000] radeonfb0: using 32 MB per display
 [   1.0000000] radeonfb0: port 0: physical 1024x768 60Hz
 [   1.0000000] radeonfb0: port 1: physical 1024x768 60Hz
 [   1.0000000] wsdisplay1 at radeonfb0 kbdmux 1
 [   1.0000000] drm at radeonfb0 not configured
 [   1.0000000] jbusi2c0 at mainbus0: addr 4000fc64000
 [   1.0000000] iic2 at jbusi2c0: I2C bus
 [   1.0000000] seeprom3 at iic2 addr 0x57: nvram: size 8192
 [   1.0000000] pcons at mainbus0 not configured
 [   1.0000000] fwohci0: BUS reset
 [   1.0000000] fwohci0: node_id=0xc800ffc0, gen=1, CYCLEMASTER mode
 [   1.0000000] ieee1394if0: 1 nodes, maxhop <= 0 cable IRM irm(0) (me)
 [   1.0000000] ieee1394if0: bus manager 0
 [   1.0000169] No counter-timer -- using %stick at 12MHz as system clock.
 [   1.9139563] scsibus0: waiting 2 seconds for devices to settle...
 [   1.9139563] scsibus1: waiting 2 seconds for devices to settle...
 [   2.1300193] uhub0 at usb0: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
 [   2.2400183] uhub1 at usb1: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
 [   2.3458517] uhub2 at usb2: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
 [   2.4500225] uhub3 at usb3: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
 [   2.5538013] uhub4 at usb4: NetBSD (0000) EHCI root hub (0000), class 9/0, rev 2.00/1.00, addr 1
 [   8.6300186] WARNING: 1 error while detecting hardware; check system log.
 [   8.7160060] root on md0a dumps on md0b
 [   8.7607141] root file system type: ffs
 [   8.8054243] kern.module.path=/stand/sparc64/8.99.41/modules
 erase ^?, werase ^W, kill ^U, intr ^C

From: "Michael L. Hitch" <mhitch@NetBSD.org>
To: gnats-bugs@netbsd.org
Cc: port-sparc64-maintainer@netbsd.org, gnats-admin@netbsd.org, 
    netbsd-bugs@netbsd.org
Subject: Re: port-sparc64/54211: esiop stopped detecting boot disk
Date: Thu, 14 Nov 2019 11:28:54 -0700 (MST)

 On Fri, 17 May 2019, tobiasu@tmux.org wrote:

 > I'm having problems with (from the looks of it) esiop again. Couple of days ago I
 > wasted Martins time with a disk problem on Ultra 60 which resolved itself magically,
 > now it's back on another machine. I'm getting slightly paranoid here.
 >
 > Upgraded the machine from .29. Now the boot disk isn't recognized at all.
 > Cold boot, warm boot, doesn't matter. I can hear it spin down at
 > [   1.9139548] scsibus0: waiting 2 seconds for devices.

    Has there been any change in this?

    I was playing with my Ultra 45 systems again, and tried updating to 
 NetBSD 9.0_BETA, and am seeing the same kind of behaviour, except with 
 mpt.  My 8.1_STABLE system works fine (although I have to increase the 
 kernel stack size to get through the boot).

    I noticed that sparc64 switched from gcc6 to gcc7 back in Februrary, so 
 my first wild guess is something in gcc7 may be causing the problem. 
 There were no noticable changes in the mpt code between NetBSD-8 and 
 NetBSD-9, so I don't know if the problem is in the mpt code, or something 
 else it uses.

    I will probably try this again with an older tree that still has gcc6 
 and I can try with both versions).  I was doing that a year ago when I was 
 having trouble with the Ultra 45 crashing while configuring the audio 
 driver.

    I had also tried a current autobuild kernel, which fails in the same 
 way, but also does not crash in the audio driver.

 Mike

 ---
 Michael L. Hitch                        mhitch@NetBSD.org
 Operations Consulting,  University Information Technology
 Montana State University, Bozeman, MT     USA

From: "Michael L. Hitch" <mhitch@montana.edu>
To: "Michael L. Hitch" <mhitch@NetBSD.org>
Cc: gnats-bugs@netbsd.org, port-sparc64-maintainer@netbsd.org, 
    gnats-admin@netbsd.org, netbsd-bugs@netbsd.org
Subject: Re: port-sparc64/54211: esiop stopped detecting boot disk
Date: Tue, 3 Dec 2019 11:32:08 -0700 (MST)

 On Thu, 14 Nov 2019, Michael L. Hitch wrote:

 >  I noticed that sparc64 switched from gcc6 to gcc7 back in Februrary, so my 
 > first wild guess is something in gcc7 may be causing the problem. There were 
 > no noticable changes in the mpt code between NetBSD-8 and NetBSD-9, so I 
 > don't know if the problem is in the mpt code, or something else it uses.
 >
 >  I will probably try this again with an older tree that still has gcc6 and I 
 > can try with both versions).  I was doing that a year ago when I was having 
 > trouble with the Ultra 45 crashing while configuring the audio driver.

    I had time over the Thanksgiving weekend to track this down.  I did 
 determine that the switch to gcc7 did not cause this, and the commit that 
 did was a few weeks later.  The culprit is revision 1.69 of sys/i2c/i2c.c 
 (certainly not very obvious).

    That change resulted in the device probe trying to configure a 
 non-existant i2c device on the Ultra 45.  Inspecting the dmesg output in 
 this ticket shows the same thing.  A working kernel shows:

 [   1.0000000] gpio at iic0 addr 0x18 not configured

    While a failing kernel shows:

 [   1.0000000] admtemp0 at iic0 addr 0x18:  temperature sensor: id. 0xf0, 
 rev. 0xf0

    Reverting this change results in a working kernel. both for -current and 
 netbsd-9.

 Mike


 ---
 Michael L. Hitch                        mhitch@montana.edu
 Operations Consulting,  University Information Technology
 Montana State University, Bozeman, MT     USA

From: Jason Thorpe <thorpej@me.com>
To: "Michael L. Hitch" <mhitch@montana.edu>
Cc: "Michael L. Hitch" <mhitch@NetBSD.org>,
 gnats-bugs@netbsd.org,
 port-sparc64-maintainer@netbsd.org,
 gnats-admin@netbsd.org,
 netbsd-bugs@netbsd.org
Subject: Re: port-sparc64/54211: esiop stopped detecting boot disk
Date: Tue, 3 Dec 2019 11:39:53 -0800

 > On Dec 3, 2019, at 10:32 AM, Michael L. Hitch <mhitch@montana.edu> =
 wrote:
 >=20
 >  I had time over the Thanksgiving weekend to track this down.  I did =
 determine that the switch to gcc7 did not cause this, and the commit =
 that did was a few weeks later.  The culprit is revision 1.69 of =
 sys/i2c/i2c.c (certainly not very obvious).

 Rev 1.69 of i2c.c should be reverted.  It is not correct, and was =
 clearly made without an understanding of the semantics of the function =
 in questions.

 -- thorpej

From: Julian Coleman <jdc@coris.org.uk>
To: Jason Thorpe <thorpej@me.com>
Cc: "Michael L. Hitch" <mhitch@montana.edu>,
	"Michael L. Hitch" <mhitch@netbsd.org>, gnats-bugs@netbsd.org,
	port-sparc64-maintainer@netbsd.org, gnats-admin@netbsd.org,
	netbsd-bugs@netbsd.org
Subject: Re: port-sparc64/54211: esiop stopped detecting boot disk
Date: Wed, 4 Dec 2019 16:46:41 +0100

 Hi all,

 > A working kernel shows:
 > 
 > [   1.0000000] gpio at iic0 addr 0x18 not configured
 > 
 >   While a failing kernel shows:
 > 
 > [   1.0000000] admtemp0 at iic0 addr 0x18:  temperature sensor: id. 0xf0, rev.
 0xf0

 > Rev 1.69 of i2c.c should be reverted.  It is not correct, and was clearly made without an understanding of the semantics of the function in questions.

 I think that it also makes sense to either remove indirect matching or make
 admtemp more resistant to false matches.  This would also cover other cases.

 For example, dbcool.c already has such a check by calling dbcool_chip_ident()
 from dbcool_match().  I think that we could add a similar function called
 from admtemp_match() and also do the same for adm1026, where we already have
 adm1026_ident() but don't call it from adm1026_match().

 Regards,

 Julian

From: Martin Husemann <martin@duskware.de>
To: Julian Coleman <jdc@coris.org.uk>
Cc: Jason Thorpe <thorpej@me.com>, "Michael L. Hitch" <mhitch@montana.edu>,
	"Michael L. Hitch" <mhitch@netbsd.org>, gnats-bugs@netbsd.org,
	port-sparc64-maintainer@netbsd.org, gnats-admin@netbsd.org,
	netbsd-bugs@netbsd.org
Subject: Re: port-sparc64/54211: esiop stopped detecting boot disk
Date: Wed, 4 Dec 2019 19:01:05 +0100

 On Wed, Dec 04, 2019 at 04:46:41PM +0100, Julian Coleman wrote:
 > For example, dbcool.c already has such a check by calling dbcool_chip_ident()
 > from dbcool_match().  I think that we could add a similar function called
 > from admtemp_match() and also do the same for adm1026, where we already have
 > adm1026_ident() but don't call it from adm1026_match().

 Also remember that with the gcc 8 fixes for adm1026 it totaly breaks some
 machines (just like in Michaels case at hand, but earlier). You wanted
 to look into it ;-)

 Martin

From: Julian Coleman <jdc@coris.org.uk>
To: "Michael L. Hitch" <mhitch@montana.edu>
Cc: "Michael L. Hitch" <mhitch@netbsd.org>, gnats-bugs@netbsd.org,
	port-sparc64-maintainer@netbsd.org, gnats-admin@netbsd.org,
	netbsd-bugs@netbsd.org
Subject: Re: port-sparc64/54211: esiop stopped detecting boot disk
Date: Wed, 11 Dec 2019 22:40:20 +0100

 --b8GWCKCLzrXbuNet
 Content-Type: text/plain; charset=us-ascii
 Content-Disposition: inline

 Hi,

 > I think that it also makes sense to either remove indirect matching or make
 > admtemp more resistant to false matches.  This would also cover other cases.

 Can you try the attached, please?  It probes to check if the detected chip
 is an adm1021 or clone.  It should fail to match here, so restoring the old
 behaviour.

 Regards,

 Julian

 --b8GWCKCLzrXbuNet
 Content-Type: text/plain; charset=us-ascii
 Content-Disposition: attachment; filename="adm1021.c.diff"

 --- sys/dev/i2c/adm1021.c.dist	2019-12-07 15:58:05.209974319 +0100
 +++ sys/dev/i2c/adm1021.c	2019-12-10 03:06:31.239950413 +0100
 @@ -131,7 +131,13 @@
  	uint8_t sc_thermlim[ADMTEMP_NUM_SENSORS];
  };

 +static int	admtemp_ident(struct admtemp_softc *);
  int	admtemp_match(device_t, cfdata_t, void *);
 +static int	admtemp_exec(struct admtemp_softc *, i2c_op_t,
 +			uint8_t *, uint8_t *);
 +static void	admtemp_setflags(struct admtemp_softc *,
 +			struct i2c_attach_args *, uint8_t *,
 +			uint8_t *, char* );
  void	admtemp_attach(device_t, device_t, void *);
  void	admtemp_refresh(struct sysmon_envsys *, envsys_data_t *);
  void	admtemp_getlim_1021(struct sysmon_envsys *, envsys_data_t *,
 @@ -158,12 +164,34 @@
  	{ NULL,				0 }
  };

 +/* Check company ID for possible matches */
 +static int
 +admtemp_ident(struct admtemp_softc *sc)
 +{
 +	uint8_t cmd, comp;
 +	int err;
 +
 +	comp = 0;
 +	cmd = ADM1021_COMPANY;
 +	err = admtemp_exec(sc, I2C_OP_READ_WITH_STOP, &cmd, &comp);
 +	if (err)
 +		return 0;
 +	if (comp == 0 || comp == ADM1021_COMPANY_MAXIM ||
 +	    comp == ADM1021_COMPANY_GMT || comp == ADM1021_COMPANY_ADM)
 +		return 1;
 +	return 0;
 +}
 +
  int
  admtemp_match(device_t parent, cfdata_t match, void *aux)
  {
  	struct i2c_attach_args *ia = aux;
 +	struct admtemp_softc sc;	/* For chip ident */
  	int match_result;

 +	sc.sc_tag = ia->ia_tag;
 +	sc.sc_addr = ia->ia_addr;
 +
  	if (iic_use_direct_match(ia, match, compat_data, &match_result))
  		return match_result;

 @@ -171,10 +199,11 @@
  	 * Indirect config - not much we can do!
  	 * Check typical addresses.
  	 */
 -	if (((ia->ia_addr >= 0x18) && (ia->ia_addr <= 0x1a)) ||
 +	if ((((ia->ia_addr >= 0x18) && (ia->ia_addr <= 0x1a)) ||
  	    ((ia->ia_addr >= 0x29) && (ia->ia_addr <= 0x2b)) ||
 -	    ((ia->ia_addr >= 0x48) && (ia->ia_addr <= 0x4e)))
 -		return I2C_MATCH_ADDRESS_ONLY;
 +	    ((ia->ia_addr >= 0x48) && (ia->ia_addr <= 0x4e))) &&
 +	    admtemp_ident(&sc))
 +		return I2C_MATCH_ADDRESS_AND_PROBE;

  	return 0;
  }
 @@ -207,7 +236,7 @@
   */
  static void
  admtemp_setflags(struct admtemp_softc *sc, struct i2c_attach_args *ia,
 -    uint8_t* comp, uint8_t *rev, char* name)
 +    uint8_t *comp, uint8_t *rev, char* name)
  {
  	uint8_t cmd, data, tmp;
  	int i;

 --b8GWCKCLzrXbuNet--

From: Jason Thorpe <thorpej@me.com>
To: gnats-bugs@netbsd.org
Cc: port-sparc64-maintainer@netbsd.org,
 gnats-admin@netbsd.org,
 netbsd-bugs@netbsd.org,
 tobiasu@tmux.org
Subject: Re: port-sparc64/54211: esiop stopped detecting boot disk
Date: Wed, 11 Dec 2019 13:49:39 -0800

 > On Dec 11, 2019, at 1:45 PM, Julian Coleman <jdc@coris.org.uk> wrote:
 > 
 > Can you try the attached, please?  It probes to check if the detected chip
 > is an adm1021 or clone.  It should fail to match here, so restoring the old
 > behaviour.

 Do these i2c devices appear in the OFW device tree?

 -- thorpej

From: Julian Coleman <jdc@coris.org.uk>
To: Jason Thorpe <thorpej@me.com>
Cc: gnats-bugs@netbsd.org, port-sparc64-maintainer@netbsd.org,
	gnats-admin@netbsd.org, netbsd-bugs@netbsd.org, tobiasu@tmux.org
Subject: Re: port-sparc64/54211: esiop stopped detecting boot disk
Date: Wed, 18 Dec 2019 12:37:57 +0100

 Hi,

 > Do these i2c devices appear in the OFW device tree?

 I guess so, because we only used direct matching before the change to i2c.c.
 So, with that change reverted, this problem should have gone.  However, the
 change to make the admtemp matching more targetted still makes sense to me.

 Regards,

 Julian

From: "Michael L. Hitch" <mhitch@NetBSD.org>
To: Jason Thorpe <thorpej@me.com>
Cc: gnats-bugs@netbsd.org, port-sparc64-maintainer@netbsd.org, 
    gnats-admin@netbsd.org, netbsd-bugs@netbsd.org, tobiasu@tmux.org
Subject: Re: port-sparc64/54211: esiop stopped detecting boot disk
Date: Thu, 26 Dec 2019 10:09:57 -0700 (MST)

 On Wed, 11 Dec 2019, Jason Thorpe wrote:

 >> On Dec 11, 2019, at 1:45 PM, Julian Coleman <jdc@coris.org.uk> wrote:
 >>
 >> Can you try the attached, please?  It probes to check if the detected chip
 >> is an adm1021 or clone.  It should fail to match here, so restoring the old
 >> behaviour.
 >
 > Do these i2c devices appear in the OFW device tree?
 >
 > -- thorpej

    The Ultra 45 OFW device tree does not show an adm device, but the 
 GENERIC config includes an admtemp* device, so I presume there is a model 
 that does have it.

 Mike


 ---
 Michael L. Hitch                        mhitch@montana.edu
 Operations Consulting,  University Information Technology
 Montana State University, Bozeman, MT     USA

State-Changed-From-To: open->closed
State-Changed-By: martin@NetBSD.org
State-Changed-When: Tue, 28 Jan 2020 10:56:24 +0000
State-Changed-Why:
Fixed (was fallout from i2c changes that have been backed out)


>Unformatted:

NetBSD Home
NetBSD PR Database Search

(Contact us) $NetBSD: query-full-pr,v 1.46 2020/01/03 16:35:01 leot Exp $
$NetBSD: gnats_config.sh,v 1.9 2014/08/02 14:16:04 spz Exp $
Copyright © 1994-2020 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.