NetBSD Problem Report #58344

From h.fath@spg.tu-darmstadt.de  Fri Jun 14 12:05:48 2024
Return-Path: <h.fath@spg.tu-darmstadt.de>
Received: from mail.netbsd.org (mail.netbsd.org [199.233.217.200])
	(using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)
	 key-exchange X25519 server-signature RSA-PSS (2048 bits)
	 client-signature RSA-PSS (2048 bits))
	(Client CN "mail.NetBSD.org", Issuer "mail.NetBSD.org CA" (not verified))
	by mollari.NetBSD.org (Postfix) with ESMTPS id 8E0321A9238
	for <gnats-bugs@gnats.NetBSD.org>; Fri, 14 Jun 2024 12:05:48 +0000 (UTC)
Message-Id: <202406141201.45EC1xbQ016105@Gstoder.nt.e-technik.tu-darmstadt.de>
Date: Fri, 14 Jun 2024 14:01:59 +0200 (CEST)
From: Hauke Fath <hf@spg.tu-darmstadt.de>
Reply-To: Hauke Fath <hf@spg.tu-darmstadt.de>
To: gnats-bugs@NetBSD.org
Cc: Hauke Fath <hf@spg.tu-darmstadt.de>
Subject: Machine hangs at end of shutdown -r 
X-Send-Pr-Version: 3.95

>Number:         58344
>Category:       port-amd64
>Synopsis:       Machine hangs at end of shutdown -r
>Confidential:   no
>Severity:       serious
>Priority:       high
>Responsible:    port-amd64-maintainer
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Fri Jun 14 12:10:00 +0000 2024
>Last-Modified:  Fri Sep 27 17:30:01 +0000 2024
>Originator:     Hauke Fath
>Release:        NetBSD 9.4_STABLE
>Organization:
Technische Universitaet Darmstadt
>Environment:


System: NetBSD Hochstuhl 9.4_STABLE NetBSD 9.4_STABLE (RADMINDSRV) #3: Fri May  3 18:54:35 CEST 2024  hf4kh@Hochstuhl:/var/obj/netbsd-builds/9/amd64/sys/arch/amd64/compile/RADMINDSRV amd64
Architecture: x86_64
Machine: amd64
>Description:

	I have a machine which for all of netbsd-9 has been hanging at
	'shutdown -r'

[...]
[ 7564512.5113085] unmounting 0xffff87886ae02008 /var (/dev/raid0e)...
[ 7564512.5531619] unmounting 0xffff87886bcf4008 / (/dev/raid0a)...
[ 7564512.5831671] forcefully unmounting /var (/dev/raid0e)...
[ 7564512.6632000] unmounting 0xffff87886bcf4008 / (/dev/raid0a)...
[ 7564512.7032133] forcefully unmounting / (/dev/raid0a)...
[ 7564512.7932552] raid0: detached
[ 7564512.8129638] sd9: detached
[ 7564512.8306441] sd1: detached
[ 7564512.8483243] rebooting...

	instead of rebooting. It will then require a power cycle,
	which is most awkward when I happen to not be on site.

	The machine has the latest BIOS.

	dmesg output is

Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003,
    2004, 2005, 2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013,
    2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021, 2022, 2023,
    2024
    The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California.  All rights reserved.

NetBSD 9.4_STABLE (RADMINDSRV) #3: Fri May  3 18:54:35 CEST 2024
	hf4kh@Hochstuhl:/var/obj/netbsd-builds/9/amd64/sys/arch/amd64/compile/RADMINDSRV
total memory = 32750 MB
avail memory = 31774 MB
timecounter: Timecounters tick every 10.000 msec
Kernelized RAIDframe activated
running cgd selftest aes-xts-256 aes-xts-512 done
timecounter: Timecounter "i8254" frequency 1193182 Hz quality 100
Supermicro H8DCL (1234567890)
mainbus0 (root)
ACPI: RSDP 0x00000000000FA240 000024 (v02 ACPIAM)
ACPI: XSDT 0x00000000DFEA0100 00008C (v01 SMCI            20180419 MSFT 00000097)
ACPI: FACP 0x00000000DFEA0290 0000F4 (v03 041918 FACP1644 20180419 MSFT 00000097)
Firmware Warning (ACPI): 32/64X length mismatch in FADT/Gpe0Block: 64/32 (20190405/tbfadt-642)
ACPI: DSDT 0x00000000DFEA0640 005A69 (v01 1CD11  1CD11000 00000000 INTL 20051117)
ACPI: FACS 0x00000000DFEB2000 000040
ACPI: APIC 0x00000000DFEA0390 0000E4 (v01 041918 APIC1644 20180419 MSFT 00000097)
ACPI: MCFG 0x00000000DFEA0480 00003C (v01 041918 OEMMCFG  20180419 MSFT 00000097)
ACPI: OEMB 0x00000000DFEB2040 000075 (v01 041918 OEMB1644 20180419 MSFT 00000097)
ACPI: HPET 0x00000000DFEAA640 000038 (v01 041918 OEMHPET  20180419 MSFT 00000097)
ACPI: IVRS 0x00000000DFEAA680 0000C8 (v01 AMD    RD890S   00202031 AMD  00000000)
ACPI: SRAT 0x00000000DFEAA750 000108 (v02 AMD    AGESA    00000001 AMD  00000001)
ACPI: SLIT 0x00000000DFEAA860 00002D (v01 AMD    AGESA    00000001 AMD  00000001)
ACPI: SSDT 0x00000000DFEAA890 0009F6 (v01 A M I  POWERNOW 00000001 AMD  00000001)
ACPI: EINJ 0x00000000DFEAB290 000130 (v01 AMIER  AMI_EINJ 20180419 MSFT 00000097)
ACPI: BERT 0x00000000DFEAB420 000030 (v01 AMIER  AMI_BERT 20180419 MSFT 00000097)
ACPI: ERST 0x00000000DFEAB450 000210 (v01 AMIER  AMI_ERST 20180419 MSFT 00000097)
ACPI: HEST 0x00000000DFEAB660 0000A8 (v01 AMIER  ABC_HEST 20180419 MSFT 00000097)
ACPI: 2 ACPI AML tables successfully acquired and loaded
ioapic0 at mainbus0 apid 0: pa 0xfec00000, version 0x21, 24 pins
ioapic1 at mainbus0 apid 1: pa 0xfec20000, version 0x21, 32 pins
cpu0 at mainbus0 apid 16
cpu0: AMD Opteron(tm) Processor 4226                 , id 0x600f12
cpu0: package 0, core 0, smt 0
cpu1 at mainbus0 apid 17
cpu1: AMD Opteron(tm) Processor 4226                 , id 0x600f12
cpu1: package 0, core 1, smt 0
cpu2 at mainbus0 apid 18
cpu2: AMD Opteron(tm) Processor 4226                 , id 0x600f12
cpu2: package 0, core 2, smt 0
cpu3 at mainbus0 apid 19
cpu3: AMD Opteron(tm) Processor 4226                 , id 0x600f12
cpu3: package 0, core 3, smt 0
cpu4 at mainbus0 apid 20
cpu4: AMD Opteron(tm) Processor 4226                 , id 0x600f12
cpu4: package 0, core 4, smt 0
cpu5 at mainbus0 apid 21
cpu5: AMD Opteron(tm) Processor 4226                 , id 0x600f12
cpu5: package 0, core 5, smt 0
acpi0 at mainbus0: Intel ACPICA 20190405
acpi0: X/RSDT: OemId <SMCI  ,        ,20180419>, AslId <MSFT,00000097>
acpi0: MCFG: segment 0, bus 0-255, address 0x00000000e0000000
acpi0: SCI interrupting at int 9
acpi0: fixed power button present
timecounter: Timecounter "ACPI-Safe" frequency 3579545 Hz quality 900
hpet0 at acpi0: high precision event timer (mem 0xfed00000-0xfed00400)
timecounter: Timecounter "hpet0" frequency 14318180 Hz quality 2000
NMEM (PNP0C02) at acpi0 not configured
UMEM (PNP0C02) at acpi0 not configured
attimer1 at acpi0 (TMR, PNP0100): io 0x40-0x43 irq 0
pcppi1 at acpi0 (SPKR, PNP0800): io 0x61
spkr0 at pcppi1: PC Speaker
wsbell at spkr0 not configured
midi0 at pcppi1: PC speaker
sysbeep0 at pcppi1
SIOR (PNP0C02) at acpi0 not configured
OMSC (PNP0C02) at acpi0 not configured
RMSC (PNP0C02) at acpi0 not configured
UAR1 (PNP0501) at acpi0 not configured
UAR2 (PNP0501) at acpi0 not configured
PCIE (PNP0C02) at acpi0 not configured
RMEM (PNP0C01) at acpi0 not configured
acpibut0 at acpi0 (PWRB, PNP0C0C-170): ACPI Power Button
attimer1: attached to pcppi1
ipmi0 at mainbus0
pci0 at mainbus0 bus 0: configuration mode 1
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
pchb0 at pci0 dev 0 function 0: vendor 1002 product 5a10 (rev. 0x02)
vendor 1002 product 5a23 (IOMMU system) at pci0 dev 0 function 2 not configured
ppb0 at pci0 dev 3 function 0: vendor 1002 product 5a17 (rev. 0x00)
ppb0: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x8 @ 5.0GT/s
pci1 at ppb0 bus 1
pci1: i/o space, memory space enabled, rd/line, wr/inv ok
ixg0 at pci1 dev 0 function 0: Intel(R) PRO/10GbE PCI-Express Network Driver, Version - 4.0.1-k
ixg0: clearing prefetchable bit
ixg0: device X540
ixg0: NVM Image Version 4.3 ID 0x0, PHY FW Revision 4.2 ID 0x0, ETrackID 8000037c
ixg0: PBA number G54042-005
ixg0: autoconfiguration error: failed to allocate MSI-X interrupt
ixg0: interrupting at ioapic1 pin 4
ixg0: Ethernet address a0:36:9f:25:62:98
ixg0: PHY OUI 0x00aa00, model 0x0020, rev. 0
ixg0: PCI Express Bus: Speed 5.0GT/s Width x8
ixg0: feature cap 0x1780<LEGACY_TX,FDIR,MSI,MSIX,LEGACY_IRQ>
ixg0: feature ena 0x1000<LEGACY_IRQ>
ixg0: device cap 0xfffd<ALLOW_ANY_SFP,WOL_PORT0_1,WOL_PORT0,NO_CROSSTALK_WR>
ppb1 at pci0 dev 9 function 0: vendor 1002 product 5a1c (rev. 0x00)
ppb1: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x2 @ 5.0GT/s
ppb1: link is x1 @ 2.5GT/s
pci2 at ppb1 bus 2
pci2: i/o space, memory space enabled, rd/line, wr/inv ok
wm0 at pci2 dev 0 function 0, 64-bit DMA: Intel i82574L (rev. 0x00)
wm0: interrupting at ioapic1 pin 24
wm0: PCI-Express bus
wm0: 2048 words (8 address bits) SPI EEPROM, version 2.1.2, Image Unique ID ffffffff
wm0: ASPM L0s and L1 are disabled to workaround the errata.
wm0: RX packet buffer size: 20KB
wm0: Ethernet address 00:25:90:4f:1e:d6
wm0: 0x224040<SPI,PCIE,ASF_FIRM,WOL>
makphy0 at wm0 phy 1: Marvell 88E1149 Gigabit PHY, rev. 1
makphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
ppb2 at pci0 dev 10 function 0: vendor 1002 product 5a1d (rev. 0x00)
ppb2: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x2 @ 5.0GT/s
ppb2: link is x1 @ 2.5GT/s
pci3 at ppb2 bus 3
pci3: i/o space, memory space enabled, rd/line, wr/inv ok
wm1 at pci3 dev 0 function 0, 64-bit DMA: Intel i82574L (rev. 0x00)
wm1: interrupting at ioapic1 pin 23
wm1: PCI-Express bus
wm1: 2048 words (8 address bits) SPI EEPROM, version 2.1.2, Image Unique ID ffffffff
wm1: ASPM L0s and L1 are disabled to workaround the errata.
wm1: RX packet buffer size: 20KB
wm1: Ethernet address 00:25:90:4f:1e:d7
wm1: 0x224040<SPI,PCIE,ASF_FIRM,WOL>
makphy1 at wm1 phy 1: Marvell 88E1149 Gigabit PHY, rev. 1
makphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
ppb3 at pci0 dev 11 function 0: vendor 1002 product 5a1f (rev. 0x00)
ppb3: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x8 @ 5.0GT/s
ppb3: link is x2 @ 5.0GT/s
pci4 at ppb3 bus 4
pci4: i/o space, memory space enabled, rd/line, wr/inv ok
nvme0 at pci4 dev 0 function 0: vendor 8086 product 2522 (rev. 0x00)
nvme0: NVMe 1.1
nvme0: interrupting at ioapic1 pin 8
nvme0: INTEL MEMPEK1W016GA, firmware K3110300, serial PHBT712200CQ016D
ld0 at nvme0 nsid 1
ld0: 13736 MB, 6977 cyl, 64 head, 63 sec, 512 bytes/sect x 28131328 sectors
ppb4 at pci0 dev 12 function 0: vendor 1002 product 5a20 (rev. 0x00)
ppb4: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x8 @ 5.0GT/s
pci5 at ppb4 bus 5
pci5: i/o space, memory space enabled, rd/line, wr/inv ok
mpii0 at pci5 dev 0 function 0: vendor 1000 product 0087 (rev. 0x05)
mpii0: interrupting at ioapic1 pin 12
mpii0: SAS9207-4i4e, firmware 20.0.2.0, MPI 2.0
mpii0: physical device inserted in slot 12
mpii0: physical device inserted in slot 13
mpii0: physical device inserted in slot 14
mpii0: physical device inserted in slot 15
mpii0: physical device inserted in slot 16
mpii0: physical device inserted in slot 17
mpii0: physical device inserted in slot 18
mpii0: physical device inserted in slot 19
mpii0: physical device inserted in slot 20
mpii0: physical device inserted in slot 21
mpii0: physical device inserted in slot 22
mpii0: physical device inserted in slot 23
mpii0: physical device inserted in slot 24
mpii0: physical device inserted in slot 25
mpii0: physical device inserted in slot 26
mpii0: physical device inserted in slot 27
mpii0: physical device inserted in slot 28
scsibus0 at mpii0: 1024 targets, 8 luns per target
ppb5 at pci0 dev 13 function 0: vendor 1002 product 5a1e (rev. 0x00)
ppb5: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x4 @ 5.0GT/s
ppb5: link is x2 @ 5.0GT/s
pci6 at ppb5 bus 6
pci6: i/o space, memory space enabled, rd/line, wr/inv ok
nvme1 at pci6 dev 0 function 0: vendor 8086 product 2522 (rev. 0x00)
nvme1: NVMe 1.1
nvme1: interrupting at ioapic1 pin 16
nvme1: INTEL MEMPEK1W016GA, firmware K3110300, serial PHBT71220152016D
ld1 at nvme1 nsid 1
ld1: 13736 MB, 6977 cyl, 64 head, 63 sec, 512 bytes/sect x 28131328 sectors
ahcisata0 at pci0 dev 17 function 0: vendor 1002 product 4391 (rev. 0x00)
ahcisata0: 64-bit DMA
ahcisata0: ignoring broken port multiplier support
ahcisata0: ignoring broken NCQ support
ahcisata0: AHCI revision 1.10, 6 ports, 32 slots, CAP 0xb720ff85<CCCS,PSC,SSC,PMD,ISS=0x2=Gen2,SCLO,SAL,SALP,SMPS,SSNTF,S64A>
ahcisata0: interrupting at ioapic0 pin 22
atabus0 at ahcisata0 channel 0
atabus1 at ahcisata0 channel 1
atabus2 at ahcisata0 channel 2
atabus3 at ahcisata0 channel 3
atabus4 at ahcisata0 channel 4
atabus5 at ahcisata0 channel 5
ohci0 at pci0 dev 18 function 0: vendor 1002 product 4397 (rev. 0x00)
csr: 02a00117
ohci0: interrupting at ioapic0 pin 16
ohci0: OHCI version 1.0, legacy support
usb0 at ohci0: USB revision 1.0
ohci1 at pci0 dev 18 function 1: vendor 1002 product 4398 (rev. 0x00)
csr: 02a00117
ohci1: interrupting at ioapic0 pin 16
ohci1: OHCI version 1.0, legacy support
usb1 at ohci1: USB revision 1.0
ehci0 at pci0 dev 18 function 2: vendor 1002 product 4396 (rev. 0x00)
ehci0: interrupting at ioapic0 pin 17
ehci0: dropped intr workaround enabled
ehci0: BIOS has given up ownership
ehci0: EHCI version 1.0
ehci0: 2 companion controllers, 3 ports each: ohci0 ohci1
usb2 at ehci0: USB revision 2.0
ohci2 at pci0 dev 19 function 0: vendor 1002 product 4397 (rev. 0x00)
csr: 02a00117
ohci2: interrupting at ioapic0 pin 18
ohci2: OHCI version 1.0, legacy support
usb3 at ohci2: USB revision 1.0
ohci3 at pci0 dev 19 function 1: vendor 1002 product 4398 (rev. 0x00)
csr: 02a00117
ohci3: interrupting at ioapic0 pin 18
ohci3: OHCI version 1.0, legacy support
usb4 at ohci3: USB revision 1.0
ehci1 at pci0 dev 19 function 2: vendor 1002 product 4396 (rev. 0x00)
ehci1: interrupting at ioapic0 pin 19
ehci1: dropped intr workaround enabled
ehci1: EHCI version 1.0
ehci1: 2 companion controllers, 3 ports each: ohci2 ohci3
usb5 at ehci1: USB revision 2.0
piixpm0 at pci0 dev 20 function 0: vendor 1002 product 4385 (rev. 0x3d)
piixpm0: interrupting at SMI, 
iic0 at piixpm0 port 0: I2C bus
spdmem0 at iic0 addr 0x50: M393B1K70D
spdmem0: DDR3 SDRAM (registered), ECC, temp-sensor, 8GB, 1333MHz (PC3-10666)
spdmem0: 15 rows, 11 cols, 8 log. banks, 2 phys. banks, 1.500ns cycle time
spdmem0: tAA-tRCD-tRP-tRAS: 9-9-9-24
spdmem0: 1.5V 1.35V operable
spdmem1 at iic0 addr 0x52: M393B1K70DH0-YH9
spdmem1: DDR3 SDRAM (registered), ECC, temp-sensor, 8GB, 1333MHz (PC3-10666)
spdmem1: 15 rows, 11 cols, 8 log. banks, 2 phys. banks, 1.500ns cycle time
spdmem1: tAA-tRCD-tRP-tRAS: 9-9-9-24
spdmem1: 1.5V 1.35V operable
spdmem2 at iic0 addr 0x53: M393B1K70CH0-YH9
spdmem2: DDR3 SDRAM (registered), ECC, temp-sensor, 8GB, 1333MHz (PC3-10666)
spdmem2: 15 rows, 11 cols, 8 log. banks, 2 phys. banks, 1.500ns cycle time
spdmem2: tAA-tRCD-tRP-tRAS: 9-9-9-24
spdmem2: 1.5V 1.35V operable
sdtemp0 at iic0 addr 0x18: Microchip Tech MCP98243 Temp Sensor
sdtemp0: high accuracy, wider range, 0.25C resolution, event with shutdown
sdtemp0: Hardware limits: none set
sdtemp1 at iic0 addr 0x19: Microchip Tech MCP98243 Temp Sensor
sdtemp1: high accuracy, wider range, 0.25C resolution, event with shutdown
sdtemp1: Hardware limits: none set
sdtemp2 at iic0 addr 0x1a: Microchip Tech MCP98243 Temp Sensor
sdtemp2: high accuracy, wider range, 0.25C resolution, event with shutdown
sdtemp2: Hardware limits: none set
sdtemp3 at iic0 addr 0x1b: Microchip Tech MCP98243 Temp Sensor
sdtemp3: high accuracy, wider range, 0.25C resolution, event with shutdown
sdtemp3: Hardware limits: none set
pcib0 at pci0 dev 20 function 3: vendor 1002 product 439d (rev. 0x00)
ppb6 at pci0 dev 20 function 4: vendor 1002 product 4384 (rev. 0x00)
pci7 at ppb6 bus 7
pci7: i/o space, memory space enabled
vga0 at pci7 dev 4 function 0: vendor 102b product 0532 (rev. 0x0a)
wsdisplay0 at vga0 kbdmux 1
wsmux1: connecting to wsdisplay0
drm at vga0 not configured
ohci4 at pci0 dev 20 function 5: vendor 1002 product 4399 (rev. 0x00)
csr: 02a00117
ohci4: interrupting at ioapic0 pin 18
ohci4: OHCI version 1.0, legacy support
usb6 at ohci4: USB revision 1.0
pchb1 at pci0 dev 24 function 0: vendor 1022 product 1600 (rev. 0x00)
pchb2 at pci0 dev 24 function 1: vendor 1022 product 1601 (rev. 0x00)
pchb3 at pci0 dev 24 function 2: vendor 1022 product 1602 (rev. 0x00)
amdnb_misc0 at pci0 dev 24 function 3: AMD NB Misc Configuration
amdtemp0 at amdnb_misc0: AMD CPU Temperature Sensors (Family15h)
pchb4 at pci0 dev 24 function 4: vendor 1022 product 1604 (rev. 0x00)
pchb5 at pci0 dev 24 function 5: vendor 1022 product 1605 (rev. 0x00)
isa0 at pcib0
com0 at isa0 port 0x3f8-0x3ff irq 4: ns16550a, working fifo
com0: console
com1 at isa0 port 0x2f8-0x2ff irq 3: ns16550a, working fifo
pckbc0 at isa0 port 0x60-0x64
acpicpu0 at cpu0: ACPI CPU
acpicpu0: C1: HLT, lat   0 us, pow     0 mW
acpicpu0: C2: I/O, lat 100 us, pow     0 mW
acpicpu0: P0: FFH, lat   5 us, pow 14140 mW, 2700 MHz
acpicpu0: P1: FFH, lat   5 us, pow  4070 mW, 1400 MHz
acpicpu0: T0: I/O, lat   1 us, pow     0 mW, 100 %
acpicpu0: T1: I/O, lat   1 us, pow     0 mW,  88 %
acpicpu0: T2: I/O, lat   1 us, pow     0 mW,  76 %
acpicpu0: T3: I/O, lat   1 us, pow     0 mW,  64 %
acpicpu0: T4: I/O, lat   1 us, pow     0 mW,  52 %
acpicpu0: T5: I/O, lat   1 us, pow     0 mW,  40 %
acpicpu0: T6: I/O, lat   1 us, pow     0 mW,  28 %
acpicpu0: T7: I/O, lat   1 us, pow     0 mW,  16 %
acpicpu1 at cpu1: ACPI CPU
acpicpu2 at cpu2: ACPI CPU
acpicpu3 at cpu3: ACPI CPU
acpicpu4 at cpu4: ACPI CPU
acpicpu5 at cpu5: ACPI CPU
timecounter: Timecounter "clockinterrupt" frequency 100 Hz quality 0
autoconfiguration error: ERROR: 4827 cycle TSC drift observed
scsibus0: waiting 2 seconds for devices to settle...
IPsec: Initialized Security Association Processing.
uhub0 at usb2: NetBSD (0000) EHCI root hub (0000), class 9/0, rev 2.00/1.00, addr 1
uhub0: 6 ports with 6 removable, self powered
uhub1 at usb0: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
uhub1: 3 ports with 3 removable, self powered
uhub2 at usb5: NetBSD (0000) EHCI root hub (0000), class 9/0, rev 2.00/1.00, addr 1
uhub2: 6 ports with 6 removable, self powered
uhub3 at usb3: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
uhub3: 3 ports with 3 removable, self powered
uhub4 at usb1: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
uhub4: 3 ports with 3 removable, self powered
uhub5 at usb6: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
uhub5: 2 ports with 2 removable, self powered
uhub6 at usb4: NetBSD (0000) OHCI root hub (0000), class 9/0, rev 1.00/1.00, addr 1
uhub6: 3 ports with 3 removable, self powered
ehci1: handing over full speed device on port 3 to ohci2
sd0 at scsibus0 target 12 lun 0: <ATA, SAMSUNG MZ7WD480, 7W3Q> disk fixed
sd0: 447 GB, 457863 cyl, 16 head, 127 sec, 512 bytes/sect x 937703088 sectors
sd0: GPT GUID: 854d6758-bfc2-51e4-a19c-c77cbd4c08c8
dk0 at sd0: "zfs", 937686415 blocks at 256, type: <unknown>
dk1 at sd0: "a34c7679-62c5-becd-e106-9e1526d16839", 16384 blocks at 937686671, type: <unknown>
sd0: tagged queueing
sd1 at scsibus0 target 13 lun 0: <ATA, Samsung SSD 840, 5B0Q> disk fixed
sd1: 476 GB, 488387 cyl, 16 head, 127 sec, 512 bytes/sect x 1000215216 sectors
sd1: tagged queueing
sd2 at scsibus0 target 14 lun 0: <ATA, Samsung SSD 840, 5B0Q> disk fixed
sd2: 476 GB, 488387 cyl, 16 head, 127 sec, 512 bytes/sect x 1000215216 sectors
sd2: GPT GUID: 85e59a83-4195-a4e2-97b9-e5cc4b3dd9ea
dk2 at sd2: "d8b4474c-31b7-1e65-d7d9-e8c8efef3182", 1000198543 blocks at 256, type: <unknown>
autoconfiguration error: sd2: wedge named 'zfs' already existed, using 'd8b4474c-31b7-1e65-d7d9-e8c8efef3182'
dk3 at sd2: "c82514dc-7550-926e-8aac-f48d791ef035", 16384 blocks at 1000198799, type: <unknown>
umass0 at uhub0 port 5 configuration 1 interface 0
umass0: vendor 05e3 (0x5e3) USB Storage (0x719), rev 2.00/0.15, addr 2
umass0: using SCSI over Bulk-Only
sd2: tagged queueing
scsibus1 at umass0: 2 targets, 1 lun per target
sd3 at scsibus0 target 15 lun 0: <TOSHIBA, MK1001TRKB, 0106> disk fixed
sd3: 931 GB, 237280 cyl, 4 head, 2058 sec, 512 bytes/sect x 1953525168 sectors
sd3: tagged queueing
sd4 at scsibus0 target 16 lun 0: <TOSHIBA, MK1001TRKB, 0106> disk fixed
sd4: 931 GB, 237280 cyl, 4 head, 2058 sec, 512 bytes/sect x 1953525168 sectors
sd4: tagged queueing
sd5 at scsibus0 target 17 lun 0: <TOSHIBA, MK1001TRKB, 0106> disk fixed
sd5: 931 GB, 237280 cyl, 4 head, 2058 sec, 512 bytes/sect x 1953525168 sectors
sd5: tagged queueing
sd6 at scsibus0 target 18 lun 0: <TOSHIBA, MK1001TRKB, 0106> disk fixed
sd6: 931 GB, 237280 cyl, 4 head, 2058 sec, 512 bytes/sect x 1953525168 sectors
sd6: tagged queueing
uhidev0 at uhub3 port 3 configuration 1 interface 0
uhidev0: Winbond Electronics Corp (0x557) Hermon USB hidmouse Device (0x2221), rev 1.10/0.01, addr 2, iclass 3/1
sd7 at scsibus0 target 19 lun 0: <TOSHIBA, MK1001TRKB, 0106> disk fixed
sd7: 931 GB, 237280 cyl, 4 head, 2058 sec, 512 bytes/sect x 1953525168 sectors
ums0 at uhidev0: 3 buttons and Z dir
wsmouse0 at ums0 mux 0
uhidev1 at uhub3 port 3 configuration 1 interface 1
uhidev1: Winbond Electronics Corp (0x557) Hermon USB hidmouse Device (0x2221), rev 1.10/0.01, addr 2, iclass 3/1
sd7: tagged queueing
sd8 at scsibus0 target 20 lun 0: <ATA, SAMSUNG MZ7WD480, 7W3Q> disk fixed
sd8: 447 GB, 457863 cyl, 16 head, 127 sec, 512 bytes/sect x 937703088 sectors
ukbd0 at uhidev1: 8 Variable keys, 6 Array codes
sd8: GPT GUID: 3e15f05b-c46d-2462-f99d-d955eb0f1cdd
dk4 at sd8: "d3bedd33-3561-1241-ee57-c322d7621053", 937686415 blocks at 256, type: <unknown>
autoconfiguration error: sd8: wedge named 'zfs' already existed, using 'd3bedd33-3561-1241-ee57-c322d7621053'
dk5 at sd8: "af79b035-124e-27c5-8d37-dbce2529df07", 16384 blocks at 937686671, type: <unknown>
sd8: tagged queueing
sd9 at scsibus0 target 21 lun 0: <ATA, Samsung SSD 840, 5B0Q> disk fixed
sd9: 476 GB, 488387 cyl, 16 head, 127 sec, 512 bytes/sect x 1000215216 sectors
sd9: tagged queueing
sd10 at scsibus0 target 22 lun 0: <ATA, Samsung SSD 840, 5B0Q> disk fixed
sd10: 476 GB, 488387 cyl, 16 head, 127 sec, 512 bytes/sect x 1000215216 sectors
sd10: GPT GUID: 3b20c78b-138b-7f6b-c43a-d13c2efe6d77
dk6 at sd10: "a6f716af-8a9d-df40-c1bc-e3118ed5e9ed", 1000198543 blocks at 256, type: <unknown>
autoconfiguration error: sd10: wedge named 'zfs' already existed, using 'a6f716af-8a9d-df40-c1bc-e3118ed5e9ed'
dk7 at sd10: "1d60b4c7-cbee-a8c5-88b3-b8ce6959c4ef", 16384 blocks at 1000198799, type: <unknown>
wskbd0 at ukbd0 mux 1
wskbd0: connecting to wsdisplay0
sd10: tagged queueing
sd11 at scsibus0 target 23 lun 0: <TOSHIBA, MK1001TRKB, 0106> disk fixed
sd11: 931 GB, 237280 cyl, 4 head, 2058 sec, 512 bytes/sect x 1953525168 sectors
sd11: tagged queueing
sd12 at scsibus0 target 24 lun 0: <TOSHIBA, MK1001TRKB, 0106> disk fixed
sd12: 931 GB, 237280 cyl, 4 head, 2058 sec, 512 bytes/sect x 1953525168 sectors
sd12: tagged queueing
sd13 at scsibus0 target 25 lun 0: <TOSHIBA, MK1001TRKB, 0106> disk fixed
sd13: 931 GB, 237280 cyl, 4 head, 2058 sec, 512 bytes/sect x 1953525168 sectors
sd13: tagged queueing
sd14 at scsibus0 target 26 lun 0: <TOSHIBA, MK1001TRKB, 0106> disk fixed
sd14: 931 GB, 237280 cyl, 4 head, 2058 sec, 512 bytes/sect x 1953525168 sectors
sd14: tagged queueing
sd15 at scsibus0 target 27 lun 0: <TOSHIBA, MK1001TRKB, 0106> disk fixed
sd15: 931 GB, 237280 cyl, 4 head, 2058 sec, 512 bytes/sect x 1953525168 sectors
sd15: tagged queueing
ses0 at scsibus0 target 28 lun 0: <LSI CORP, SAS2X28, 0717> enclosure services fixed
ses0: SCSI-3 SES Device
ses0: tagged queueing
cd0 at scsibus1 target 0 lun 0: <Slimtype, DVD A  DS8A9SH, EP53> cdrom removable
ipmi0: version 2.0 interface KCS iobase 0xca2/0x2 spacing 1
ipmi0: ID 32.1 IPMI 2.0 Available
ipmi0: Additional Chassis IPMBGen IPMBRcv FRU SEL SDR Sensor
ipmi0: Manufacturer 0b980 Product cd80
ipmi0: Firmware 2.50
Component on: sd1e: 1000213168
   Row: 0 Column: 1 Num Rows: 1 Num Columns: 2
   Version: 2 Serial Number: 20211026 Mod Counter: 989
   Clean: Yes Status: 0
   sectPerSU: 128 SUsPerPU: 1 SUsPerRU: 1
   RAID Level: 1  blocksize: 512 numBlocks: 1000212992
   Autoconfig: Yes
   Root partition: Force
   Last configured as: raid0
Component on: sd9d: 1000215216
   Row: 0 Column: 0 Num Rows: 1 Num Columns: 2
   Version: 2 Serial Number: 20211026 Mod Counter: 989
   Clean: Yes Status: 0
   sectPerSU: 128 SUsPerPU: 1 SUsPerRU: 1
   RAID Level: 1  blocksize: 512 numBlocks: 1000212992
   Autoconfig: Yes
   Root partition: Force
   Last configured as: raid0
Found: sd9d at 0
Found: sd1e at 1
RAID autoconfigure
Configuring raid0:
Starting autoconfiguration of RAID set...
Looking for 0 in autoconfig
Found: sd9d at 0
Looking for 1 in autoconfig
Found: sd1e at 1
raid0: allocating 20 buffers of 65536 bytes.
raid0: RAID Level 1
raid0: Components: /dev/sd9d /dev/sd1e
raid0: Total Sectors: 1000212992 (488385 MB)
WARNING: 5 errors while detecting hardware; check system log.
boot device: raid0
root on raid0a dumps on raid0b
dump_misc_init: max_paddr = 0x81f000000
mountroot: trying ffs...
root file system type: ffs
kern.module.path=/stand/amd64/9.4/modules
init: copying out path `/sbin/init' 11
WARNING: ZFS on NetBSD is under development
ZFS filesystem version: 5
ixg0: link state DOWN (was UNKNOWN)
wsdisplay0: screen 1 added (80x25, vt100 emulation)
wsdisplay0: screen 2 added (80x25, vt100 emulation)
wsdisplay0: screen 3 added (80x25, vt100 emulation)
wsdisplay0: screen 4 added (80x25, vt100 emulation)
ixg0: link state UP (was DOWN)



>How-To-Repeat:


	Reboot a Supermicro H8DCL based machine running netbsd-9, find
	it doesn't.

>Fix:


	Yes, please. I guess the kernel shutdown code jumps to the
	wrong place, or somesuch.

>Audit-Trail:
From: Taylor R Campbell <riastradh@NetBSD.org>
To: Hauke Fath <hf@spg.tu-darmstadt.de>
Cc: gnats-bugs@NetBSD.org, port-amd64-maintainer@NetBSD.org,
	gnats-admin@NetBSD.org, netbsd-bugs@NetBSD.org
Subject: Re: port-amd64/58344: Machine hangs at end of shutdown -r
Date: Fri, 14 Jun 2024 12:32:30 +0000

 If you have serial console access, can you enter ddb and get a stack
 trace when this happens?

From: Hauke Fath <hf@spg.tu-darmstadt.de>
To: Taylor R Campbell <riastradh@NetBSD.org>
Cc: gnats-bugs@NetBSD.org, port-amd64-maintainer@NetBSD.org,
        gnats-admin@NetBSD.org, netbsd-bugs@NetBSD.org
Subject: Re: port-amd64/58344: Machine hangs at end of shutdown -r
Date: Fri, 14 Jun 2024 15:16:26 +0200

 On Fri, 14 Jun 2024 12:32:30 +0000, Taylor R Campbell wrote:
 > If you have serial console access, can you enter ddb and get a stack
 > trace when this happens?

 Yes (conserver), and no - it looks like the kernel has handed control
 to something else at this point.

 Cheerio,
 Hauke

 -- 
      The ASCII Ribbon Campaign                    Hauke Fath
 ()     No HTML/RTF in email            Institut für Nachrichtentechnik
 /\     No Word docs in email                     TU Darmstadt
      Respect for open standards              Ruf +49-6151-16-21344

From: "Hauke Fath (SPG)" <hf@spg.tu-darmstadt.de>
To: Taylor R Campbell <riastradh@NetBSD.org>
Cc: gnats-bugs@netbsd.org, port-amd64-maintainer@NetBSD.org,
        gnats-admin@netbsd.org, netbsd-bugs@netbsd.org
Subject: Re: port-amd64/58344: Machine hangs at end of shutdown -r
Date: Mon, 17 Jun 2024 10:43:20 +0200

 On 2024-06-14 14:32, Taylor R Campbell wrote:
 > If you have serial console access, can you enter ddb and get a stack
 > trace when this happens?

 Since I can reproduce the issue at will: Is there any debug option that 
 would give more information about the last seconds before reboot?

 Cheerio,
 Hauke

 -- 
       The ASCII Ribbon Campaign                    Hauke Fath
 ()     No HTML/RTF in email	        Institut für Nachrichtentechnik
 /\     No Word docs in email                     TU Darmstadt
       Respect for open standards              Ruf +49-6151-16-21344

From: Taylor R Campbell <riastradh@NetBSD.org>
To: "Hauke Fath (SPG)" <hf@spg.tu-darmstadt.de>
Cc: gnats-bugs@NetBSD.org, port-amd64-maintainer@NetBSD.org,
	gnats-admin@NetBSD.org, netbsd-bugs@NetBSD.org
Subject: Re: port-amd64/58344: Machine hangs at end of shutdown -r
Date: Mon, 17 Jun 2024 13:46:40 +0000

 > Date: Mon, 17 Jun 2024 10:43:20 +0200
 > From: "Hauke Fath (SPG)" <hf@spg.tu-darmstadt.de>
 > 
 > On 2024-06-14 14:32, Taylor R Campbell wrote:
 > > If you have serial console access, can you enter ddb and get a stack
 > > trace when this happens?
 > 
 > Since I can reproduce the issue at will: Is there any debug option that 
 > would give more information about the last seconds before reboot?

 Some things you could try:

 1. set cpureset_delay to 0 (e.g., enter ddb and `w cpureset_delay 0')
    before attempting reboot, and see if that makes a difference --
    maybe delay() is broken at that point somehow

 2. sprinkle printfs into x86_reset in sys/arch/x86/x86/x86_machdep.c
    and acpi_reset in sys/dev/acpi/acpi.c to see exactly what it is
    trying and where it is stopping

 3. share `acpidump -dt' output if it's hanging in acpi_reset
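
 To make the reasoning behind suggestion 1 concrete, here is a toy
 user-space model, not NetBSD kernel code. It assumes the final reboot
 step prints "rebooting...", calls delay() only when cpureset_delay is
 greater than 0, and then resets the CPU. The class name, the
 2000 ms value, and the delay_hangs flag are hypothetical and exist
 only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class RebootModel:
    """Toy model of the last step of a reboot; NOT the kernel source."""
    cpureset_delay_ms: int = 2000   # assumed value, for illustration only
    delay_hangs: bool = False       # hypothesis: delay() never returns
    log: list = field(default_factory=list)

    def delay(self, usec):
        # Stand-in for the kernel busy-wait; an exception models a hang.
        self.log.append(f"delay({usec})")
        if self.delay_hangs:
            raise RuntimeError("hang in delay()")

    def cpu_reset(self):
        self.log.append("cpu_reset")

    def reboot_tail(self):
        self.log.append("rebooting...")
        # Assumption: a value of 0 skips the delay entirely.
        if self.cpureset_delay_ms > 0:
            self.delay(self.cpureset_delay_ms * 1000)
        self.cpu_reset()


def reaches_reset(model):
    """Return True if the modelled reboot path gets as far as cpu_reset."""
    try:
        model.reboot_tail()
    except RuntimeError:
        return False
    return model.log[-1] == "cpu_reset"


if __name__ == "__main__":
    for delay_ms in (2000, 0):
        model = RebootModel(cpureset_delay_ms=delay_ms, delay_hangs=True)
        print(delay_ms, reaches_reset(model), model.log)
```

 In this model a broken delay() stops the path right after
 "rebooting..." is logged, which matches the console output in the
 report, while a cpureset_delay of 0 bypasses the broken call. The
 model only illustrates the hypothesis; it does not show why delay()
 would misbehave on this hardware.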

From: "Hauke Fath (SPG)" <hf@spg.tu-darmstadt.de>
To: gnats-bugs@netbsd.org
Cc: Taylor R Campbell <riastradh@NetBSD.org>, port-amd64-maintainer@netbsd.org,
        gnats-admin@netbsd.org
Subject: Re: port-amd64/58344: Machine hangs at end of shutdown -r
Date: Fri, 27 Sep 2024 19:29:31 +0200

 On 2024-06-17 15:50, Taylor R Campbell wrote:
 >   1. set cpureset_delay to 0 (e.g., enter ddb and `w cpureset_delay 0')
 >      before attempting reboot, and see if that makes a difference --
 >      maybe delay() is broken at that point somehow

 This helps, the machine reboots without a problem!


>Unformatted:
