NetBSD Problem Report #50828

From martin@aprisoft.de  Thu Feb 18 08:42:59 2016
Return-Path: <martin@aprisoft.de>
Received: from mail.netbsd.org (mail.NetBSD.org [199.233.217.200])
	(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
	(Client CN "mail.netbsd.org", Issuer "Postmaster NetBSD.org" (verified OK))
	by mollari.NetBSD.org (Postfix) with ESMTPS id 2CFB77ABFD
	for <gnats-bugs@gnats.NetBSD.org>; Thu, 18 Feb 2016 08:42:59 +0000 (UTC)
Message-Id: <20160218084218.5DF4DED0E4F@emmas.aprisoft.de>
Date: Thu, 18 Feb 2016 09:42:18 +0100 (CET)
From: martin@NetBSD.org
Reply-To: martin@NetBSD.org
To: gnats-bugs@NetBSD.org
Subject: current amd64 kernel crashes at boot
X-Send-Pr-Version: 3.95

>Number:         50828
>Category:       kern
>Synopsis:       current amd64 kernel crashes at boot
>Confidential:   no
>Severity:       critical
>Priority:       high
>Responsible:    riastradh
>State:          closed
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Thu Feb 18 08:45:00 +0000 2016
>Closed-Date:    Sun Aug 20 07:58:48 +0000 2023
>Last-Modified:  Sun Aug 20 07:58:48 +0000 2023
>Originator:     Martin Husemann
>Release:        NetBSD 7.99.26
>Organization:
The NetBSD Foundation, Inc.
>Environment:
System: NetBSD martins.aprisoft.de 7.99.26 NetBSD 7.99.26 (GENERIC) #36: Fri Jan 29 19:36:03 CET 2016 martin@martins.aprisoft.de:/ssd/src/sys/arch/amd64/compile/GENERIC amd64
Architecture: x86_64
Machine: amd64

This is the older kernel; the new kernel is from a cvs update a few minutes ago.

>Description:

After Taylor fixed the rndq softint issue, my machine gets further in boot,
but still not to the login prompt. Previous nouveau-enabled kernels did
work for me. I am not sure this is related to the nouveau errors.

root file system type: ffs
kern.module.path=/stand/amd64/7.99.26/modules
drm kern info: nouveau  [  DEVICE][nouveau0] BOOT0  : 0x0c4100a1
drm kern info: nouveau  [  DEVICE][nouveau0] Chipset: GF104 (NVC4)
drm kern info: nouveau  [  DEVICE][nouveau0] Family : NVC0
drm kern info: nouveau  [   VBIOS][nouveau0] checking PRAMIN for image...
drm kern info: nouveau  [   VBIOS][nouveau0] ... appears to be valid
drm kern info: nouveau  [   VBIOS][nouveau0] using image from PRAMIN
drm kern info: nouveau  [   VBIOS][nouveau0] BIT signature found
drm kern info: nouveau  [   VBIOS][nouveau0] version 70.04.2e.00.04
drm kern info: nouveau  [     PFB][nouveau0] RAM type: GDDR5
drm kern info: nouveau  [     PFB][nouveau0] RAM size: 1024 MiB
drm kern info: nouveau  [     PFB][nouveau0]    ZCOMP: 0 tags
drm kern info: nouveau  [    VOLT][nouveau0] GPU voltage: 875000uv
drm kern info: nouveau  [  PTHERM][nouveau0] FAN control: PWM
drm kern info: nouveau  [  PTHERM][nouveau0] fan management: automatic
drm kern info: nouveau  [  PTHERM][nouveau0] internal sensor: yes
drm kern info: nouveau  [     CLK][nouveau0] 03: core 50 MHz memory 135 MHz 
drm kern info: nouveau  [     CLK][nouveau0] 07: core 405 MHz memory 324 MHz 
drm kern info: nouveau  [     CLK][nouveau0] 0c: core 405 MHz memory 1800 MHz 
drm kern info: nouveau  [     CLK][nouveau0] 0f: core 675 MHz memory 1800 MHz 
drm kern info: nouveau  [     CLK][nouveau0] --: core 50 MHz memory 135 MHz 
Zone  kernel: Available graphics memory: 5734730 kiB
Zone   dma32: Available graphics memory: 2097152 kiB
drm kern info: nouveau  [     DRM] VRAM: 1024 MiB
drm kern info: nouveau  [     DRM] GART: 1048576 MiB
drm kern info: nouveau  [     DRM] TMDS table version 2.0
drm kern info: nouveau  [     DRM] DCB version 4.0
drm kern info: nouveau  [     DRM] DCB outp 00: 02000300 00000000
drm kern info: nouveau  [     DRM] DCB outp 01: 01000302 00020030
drm kern info: nouveau  [     DRM] DCB outp 02: 04011380 00000000
drm kern info: nouveau  [     DRM] DCB outp 03: 08011382 00020030
drm kern info: nouveau  [     DRM] DCB outp 04: 02022362 00020010
drm kern info: nouveau  [     DRM] DCB conn 00: 00001030
drm kern info: nouveau  [     DRM] DCB conn 01: 00010130
drm kern info: nouveau  [     DRM] DCB conn 02: 00002261
drm: Supports vblank timestamp caching Rev 2 (21.10.2013).
drm: Driver supports precise vblank timestamp query.
drm kern info: nouveau  [     DRM] MM: using COPY1 for buffer copies
nouveaufb0 at nouveau0
nouveau0: info: registered panic notifier
wsdisplay0 at nouveaufb0 kbdmux 1
drm kern error: nouveau E[   PDISP][nouveau0] INVALID_STATE [UNK0B] chid 1 mthd 0x0080 data 0x00000000
drm kern error: nouveau E[   PDISP][nouveau0] Base 0:
drm kern error: nouveau E[   PDISP][nouveau0]   0x0084: 0x00000000
drm kern error: nouveau E[   PDISP][nouveau0]   0x0088: 0x00000000
drm kern error: nouveau E[   PDISP][nouveau0]   0x008c: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x0090: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x0094: 0x00000000 -> 0xcafe0000
drm kern error: nouveau E[   PDISP][nouveau0]   0x00a0: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x00a4: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x00c0: 0x00000000 -> 0x01000003
drm kern error: nouveau E[   PDISP][nouveau0]   0x00c4: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x00c8: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x00cc: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x00e0: 0x00000000 -> 0x40000000
drm kern error: nouveau E[   PDISP][nouveau0]   0x00e4: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x00e8: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x00ec: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x00fc: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x0100: 0xfffe0000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x0104: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x0110: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x0114: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] Base 0 - Image 0:
drm kern error: nouveau E[   PDISP][nouveau0]   0x0800: 0x00000000 -> 0x00000600
drm kern error: nouveau E[   PDISP][nouveau0]   0x0804: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x0808: 0x00000000 -> 0x04380780
drm kern error: nouveau E[   PDISP][nouveau0]   0x080c: 0x00000000 -> 0x00101e00
drm kern error: nouveau E[   PDISP][nouveau0]   0x0810: 0x0000e900 -> 0x0000cf00
drm kern error: nouveau E[   PDISP][nouveau0] Base 0 - Image 1:
drm kern error: nouveau E[   PDISP][nouveau0]   0x0c00: 0x00000000 -> 0x00000600
drm kern error: nouveau E[   PDISP][nouveau0]   0x0c04: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0]   0x0c08: 0x00000000 -> 0x04380780
drm kern error: nouveau E[   PDISP][nouveau0]   0x0c0c: 0x00000000 -> 0x00101e00
drm kern error: nouveau E[   PDISP][nouveau0]   0x0c10: 0x0000e900 -> 0x0000cf00
uvm_fault(0xffffffff81264000, 0x0, 2) -> e
fatal page fault in supervisor mode
trap type 6 code 2 rip ffffffff8030af87 cs 8 rflags 10246 cr2 0 ilevel 8 rsp fffffe811d591ec8
curlwp 0xfffffe811d572240 pid 0.58 lowest kstack 0xfffffe811d58f2c0
ugrn0l: puhub0uhldev0 pt uhue= 
Stopped in pid 0.58 (system) at netbsd:usb_task_thread+0x5b:    movq    %rdx,0(%rax)
db{5}> bt
usb_task_thread() at netbsd:usb_task_thread+0x5b

It is reproducible:

drm kern error: nouveau E[   PDISP][nouveau0]   0x0c10: 0x0000e900 -> 0x0000cf00
uvm_fault(0xffffffff81264000, 0x0, 2) -> e
fatal page fault in supervisor mode
trap type 6 code 2 rip ffffffff8030af87 cs 8 rflags 10246 cr2 0 ilevel 8 rsp fffffe811d591ec8
curlwp 0xfffffe811d572240 pid 0.58 lowest kstack 0xfffffe811d58f2c0
kernel: page fault trap, code=0
Stopped in pid 0.58 (system) at netbsd:usb_task_thread+0x5b:    movq    %rdx,0(%rax)


The previous (slightly older) kernel works; dmesg below.

Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
    2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015, 2016
    The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California.  All rights reserved.

NetBSD 7.99.26 (GENERIC) #36: Fri Jan 29 19:36:03 CET 2016
	martin@martins.aprisoft.de:/ssd/src/sys/arch/amd64/compile/GENERIC
total memory = 16382 MB
avail memory = 15887 MB
timecounter: Timecounters tick every 10.000 msec
Kernelized RAIDframe activated
timecounter: Timecounter "i8254" frequency 1193182 Hz quality 100
System manufacturer System Product Name (System Version)
mainbus0 (root)
ACPI: RSDP 0x00000000000FBAD0 000024 (v02 ACPIAM)
ACPI: XSDT 0x00000000CFE90100 00005C (v01 030811 XSDT1042 20110308 MSFT 00000097)
ACPI: FACP 0x00000000CFE90290 0000F4 (v03 030811 FACP1042 20110308 MSFT 00000097)
ACPI BIOS Warning (bug): 32/64X length mismatch in FADT/Gpe0Block: 64/32 (20160108/tbfadt-651)
ACPI: DSDT 0x00000000CFE90450 00E040 (v01 A1585  A1585000 00000000 INTL 20060113)
ACPI: FACS 0x00000000CFEA8000 000040
ACPI: FACS 0x00000000CFEA8000 000040
ACPI: APIC 0x00000000CFE90390 00007C (v01 030811 APIC1042 20110308 MSFT 00000097)
ACPI: MCFG 0x00000000CFE90410 00003C (v01 030811 OEMMCFG  20110308 MSFT 00000097)
ACPI: OEMB 0x00000000CFEA8040 000072 (v01 030811 OEMB1042 20110308 MSFT 00000097)
ACPI: SRAT 0x00000000CFE9F8A0 000108 (v01 AMD    FAM_F_10 00000002 AMD  00000001)
ACPI: HPET 0x00000000CFE9F9B0 000038 (v01 030811 OEMHPET  20110308 MSFT 00000097)
ACPI: SSDT 0x00000000CFE9F9F0 000DA4 (v01 A M I  POWERNOW 00000001 AMD  00000001)
ACPI: 2 ACPI AML tables successfully acquired and loaded

ioapic0 at mainbus0 apid 6: pa 0xfec00000, version 0x21, 24 pins
cpu0 at mainbus0 apid 0
cpu0: AMD Phenom(tm) II X6 1075T Processor, id 0x100fa0
cpu1 at mainbus0 apid 1
cpu1: AMD Phenom(tm) II X6 1075T Processor, id 0x100fa0
cpu2 at mainbus0 apid 2
cpu2: AMD Phenom(tm) II X6 1075T Processor, id 0x100fa0
cpu3 at mainbus0 apid 3
cpu3: AMD Phenom(tm) II X6 1075T Processor, id 0x100fa0
cpu4 at mainbus0 apid 4
cpu4: AMD Phenom(tm) II X6 1075T Processor, id 0x100fa0
cpu5 at mainbus0 apid 5
cpu5: AMD Phenom(tm) II X6 1075T Processor, id 0x100fa0
acpi0 at mainbus0: Intel ACPICA 20160108
acpi0: X/RSDT: OemId <030811,XSDT1042,20110308>, AslId <MSFT,00000097>
acpi0: MCFG: segment 0, bus 0-255, address 0x00000000e0000000
acpi0: SCI interrupting at int 9
timecounter: Timecounter "ACPI-Fast" frequency 3579545 Hz quality 1000
hpet0 at acpi0: high precision event timer (mem 0xfed00000-0xfed00400)
timecounter: Timecounter "hpet0" frequency 14318180 Hz quality 2000
acpiec0 at acpi0 (EC0, PNP0C09)
: io 0x62,0x66
attimer1 at acpi0 (TMR, PNP0100): io 0x40-0x43 irq 0
pcppi1 at acpi0 (SPKR, PNP0800): io 0x61
midi0 at pcppi1: PC speaker
sysbeep0 at pcppi1
UAR1 (PNP0501) at acpi0 not configured
aibs0 at acpi0 (ASOC, ATK0110-16843024): ASUSTeK AI Booster
OMSC (PNP0C02) at acpi0 not configured
RMSC (PNP0C02) at acpi0 not configured
SIOR (PNP0C02) at acpi0 not configured
PCIE (PNP0C02) at acpi0 not configured
RMEM (PNP0C01) at acpi0 not configured
acpibut0 at acpi0 (PWRB, PNP0C0C-170): ACPI Power Button
acpiwmi0 at acpi0 (AOD, PNP0C14-0): ACPI WMI Interface
acpiwmibus at acpiwmi0 not configured
attimer1: attached to pcppi1
pci0 at mainbus0 bus 0: configuration mode 1
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
pchb0 at pci0 dev 0 function 0: vendor 1002 product 5957 (rev. 0x00)
ppb0 at pci0 dev 2 function 0: vendor 1002 product 5978 (rev. 0x00)
ppb0: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x16 @ 5.0GT/s
ppb0: link is x2 @ 2.5GT/s
pci1 at ppb0 bus 5
pci1: i/o space, memory space enabled, rd/line, wr/inv ok
nouveau0 at pci1 dev 0 function 0: vendor 10de product 0e22 (rev. 0xa1)
hdaudio0 at pci1 dev 0 function 1: HD Audio Controller
hdaudio0: interrupting at ioapic0 pin 19
hdafg0 at hdaudio0: vendor 10de product 0012
hdafg0: DP00 8ch: Digital Out [Jack]
hdafg0: 8ch/0ch 48000Hz PCM16*
hdafg1 at hdaudio0: vendor 10de product 0012
hdafg1: DP00 8ch: Digital Out [Jack]
hdafg1: 8ch/0ch 48000Hz PCM16*
hdafg2 at hdaudio0: vendor 10de product 0012
hdafg2: DP00 8ch: Digital Out [Jack]
hdafg2: 8ch/0ch 48000Hz PCM16*
hdafg3 at hdaudio0: vendor 10de product 0012
hdafg3: DP00 8ch: Digital Out [Jack]
hdafg3: 8ch/0ch 48000Hz PCM16*
ppb1 at pci0 dev 9 function 0: vendor 1002 product 597e (rev. 0x00)
ppb1: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x2 @ 5.0GT/s
ppb1: link is x1 @ 2.5GT/s
pci2 at ppb1 bus 4
pci2: i/o space, memory space enabled, rd/line, wr/inv ok
jmide0 at pci2 dev 0 function 0: vendor 197b product 2361 (rev. 0x10)
jmide0: 1 PATA port, 1 SATA port
jmide0: interrupting at ioapic0 pin 17
ahcisata0 at jmide0
ahcisata0: AHCI revision 1.10, 1 port, 32 slots, CAP 0xc722ff00<PSC,SSC,PMD,SPM,ISS=0x2=Gen2,SCLO,SAL,SALP,SNCQ,S64A>
atabus0 at ahcisata0 channel 0
jmide0: PCI IDE interface used
jmide0: bus-master DMA support present
jmide0: primary channel wired to native-PCI mode
jmide0: primary channel is unused
jmide0: secondary channel wired to native-PCI mode
jmide0: secondary channel is PATA
atabus1 at jmide0 channel 1
ppb2 at pci0 dev 10 function 0: vendor 1002 product 597f (rev. 0x00)
ppb2: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x2 @ 5.0GT/s
ppb2: link is x1 @ 5.0GT/s
pci3 at ppb2 bus 3
pci3: i/o space, memory space enabled, rd/line, wr/inv ok
vendor 1033 product 0194 (USB serial bus, xHCI, revision 0x03) at pci3 dev 0 function 0 not configured
ahcisata1 at pci0 dev 17 function 0: vendor 1002 product 4391 (rev. 0x40)
ahcisata1: interrupting at ioapic0 pin 19
ahcisata1: 64-bit DMA
ahcisata1: AHCI revision 1.20, 6 ports, 32 slots, CAP 0xf732ff05<PSC,SSC,PMD,SPM,ISS=0x3=Gen3,SCLO,SAL,SALP,SMPS,SSNTF,SNCQ,S64A>
atabus2 at ahcisata1 channel 0
atabus3 at ahcisata1 channel 1
atabus4 at ahcisata1 channel 2
atabus5 at ahcisata1 channel 3
atabus6 at ahcisata1 channel 4
atabus7 at ahcisata1 channel 5
ohci0 at pci0 dev 18 function 0: vendor 1002 product 4397 (rev. 0x00)
ohci0: interrupting at ioapic0 pin 18
ohci0: OHCI version 1.0, legacy support
usb0 at ohci0: USB revision 1.0
ehci0 at pci0 dev 18 function 2: vendor 1002 product 4396 (rev. 0x00)
ehci0: interrupting at ioapic0 pin 17
ehci0: dropped intr workaround enabled
ehci0: EHCI version 1.0
ehci0: companion controller, 5 ports each: ohci0
usb1 at ehci0: USB revision 2.0
ohci1 at pci0 dev 19 function 0: vendor 1002 product 4397 (rev. 0x00)
ohci1: interrupting at ioapic0 pin 18
ohci1: OHCI version 1.0, legacy support
usb2 at ohci1: USB revision 1.0
ehci1 at pci0 dev 19 function 2: vendor 1002 product 4396 (rev. 0x00)
ehci1: interrupting at ioapic0 pin 17
ehci1: dropped intr workaround enabled
ehci1: EHCI version 1.0
ehci1: companion controller, 5 ports each: ohci1
usb3 at ehci1: USB revision 2.0
piixpm0 at pci0 dev 20 function 0: vendor 1002 product 4385 (rev. 0x42)
piixpm0: polling (SB800)
iic0 at piixpm0: I2C bus
iic1 at piixpm0: I2C bus
iic2 at piixpm0: I2C bus
iic3 at piixpm0: I2C bus
hdaudio1 at pci0 dev 20 function 2: HD Audio Controller
hdaudio1: interrupting at ioapic0 pin 16
hdafg4 at hdaudio1: vendor 1106 product 0440
hdafg4: DAC00 8ch: Speaker [Jack], HP Out [Jack]
hdafg4: ADC01 2ch: Line In [Jack], Mic In [Jack]
hdafg4: HDMI02 2ch: Digital Out [Jack]
hdafg4: DIG03 2ch: SPDIF Out [Jack]
hdafg4: 8ch/2ch 48000Hz PCM16*
audio0 at hdafg4: full duplex, playback, capture, mmap, independent
pcib0 at pci0 dev 20 function 3: vendor 1002 product 439d (rev. 0x40)
ppb3 at pci0 dev 20 function 4: vendor 1002 product 4384 (rev. 0x40)
pci4 at ppb3 bus 2
pci4: i/o space, memory space enabled
fwohci0 at pci4 dev 8 function 0: vendor 1106 product 3044 (rev. 0xc0)
fwohci0: interrupting at ioapic0 pin 20
fwohci0: OHCI version 1.10 (ROM=1)
fwohci0: No. of Isochronous channels is 4.
fwohci0: EUI64 00:1f:c6:00:00:0a:ca:1a
fwohci0: Phy 1394a available S400, 2 ports.
fwohci0: Link S400, max_rec 2048 bytes.
ieee1394if0 at fwohci0: IEEE1394 bus
fwip0 at ieee1394if0: IP over IEEE1394
fwohci0: Initiate bus reset
ohci2 at pci0 dev 20 function 5: vendor 1002 product 4399 (rev. 0x00)
ohci2: interrupting at ioapic0 pin 18
ohci2: OHCI version 1.0, legacy support
usb4 at ohci2: USB revision 1.0
ppb4 at pci0 dev 21 function 0: vendor 1002 product 43a0 (rev. 0x00)
ppb4: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x1 @ 5.0GT/s
ppb4: link is x1 @ 2.5GT/s
pci5 at ppb4 bus 1
pci5: i/o space, memory space enabled, rd/line, wr/inv ok
re0 at pci5 dev 0 function 0: RealTek 8168/8111 PCIe Gigabit Ethernet (rev. 0x06)
re0: interrupting at ioapic0 pin 16
re0: Ethernet address bc:ae:c5:46:16:58
re0: using 256 tx descriptors
rgephy0 at re0 phy 7: RTL8169S/8110S/8211 1000BASE-T media interface, rev. 4
rgephy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
ohci3 at pci0 dev 22 function 0: vendor 1002 product 4397 (rev. 0x00)
ohci3: interrupting at ioapic0 pin 18
ohci3: OHCI version 1.0, legacy support
usb5 at ohci3: USB revision 1.0
ehci2 at pci0 dev 22 function 2: vendor 1002 product 4396 (rev. 0x00)
ehci2: interrupting at ioapic0 pin 17
ehci2: dropped intr workaround enabled
ehci2: EHCI version 1.0
ehci2: companion controller, 4 ports each: ohci3
usb6 at ehci2: USB revision 2.0
pchb1 at pci0 dev 24 function 0: vendor 1022 product 1200 (rev. 0x00)
pchb2 at pci0 dev 24 function 1: vendor 1022 product 1201 (rev. 0x00)
pchb3 at pci0 dev 24 function 2: vendor 1022 product 1202 (rev. 0x00)
amdnb_misc0 at pci0 dev 24 function 3: AMD NB Misc Configuration
amdtemp0 at amdnb_misc0: AMD CPU Temperature Sensors (Family10h)
pchb4 at pci0 dev 24 function 4: vendor 1022 product 1204 (rev. 0x00)
isa0 at pcib0
com0 at isa0 port 0x3f8-0x3ff irq 4: ns16550a, working fifo
com0: console
pckbc0 at isa0 port 0x60-0x64
acpicpu0 at cpu0: ACPI CPU
acpicpu0: C1: HLT, lat   0 us, pow     0 mW
acpicpu0: C2: I/O, lat  75 us, pow     1 mW
acpicpu0: P0: FFH, lat   4 us, pow 19507 mW, 3000 MHz
acpicpu0: P1: FFH, lat   4 us, pow 14500 mW, 2300 MHz
acpicpu0: P2: FFH, lat   4 us, pow 10535 mW, 1600 MHz
acpicpu0: P3: FFH, lat   4 us, pow  6210 mW,  800 MHz
acpicpu0: T0: I/O, lat   1 us, pow     0 mW, 100 %
acpicpu0: T1: I/O, lat   1 us, pow     0 mW,  88 %
acpicpu0: T2: I/O, lat   1 us, pow     0 mW,  76 %
acpicpu0: T3: I/O, lat   1 us, pow     0 mW,  64 %
acpicpu0: T4: I/O, lat   1 us, pow     0 mW,  52 %
acpicpu0: T5: I/O, lat   1 us, pow     0 mW,  40 %
acpicpu0: T6: I/O, lat   1 us, pow     0 mW,  28 %
acpicpu0: T7: I/O, lat   1 us, pow     0 mW,  16 %
acpicpu1 at cpu1: ACPI CPU
acpicpu2 at cpu2: ACPI CPU
acpicpu3 at cpu3: ACPI CPU
acpicpu4 at cpu4: ACPI CPU
acpicpu5 at cpu5: ACPI CPU
fwohci0: BUS reset
fwohci0: node_id=0xc800ffc0, gen=1, CYCLEMASTER mode
ieee1394if0: 1 nodes, maxhop <= 0 cable IRM irm(0) (me)
ieee1394if0: bus manager 0
timecounter: Timecounter "clockinterrupt" frequency 100 Hz quality 0
timecounter: Timecounter "TSC" frequency 3010203180 Hz quality 3000
IPsec: Initialized Security Association Processing.
uhub0 at usb0: vendor 1002 OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 5 ports with 5 removable, self powered
uhub1 at usb3: vendor 1002 EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub1: 5 ports with 5 removable, self powered
uhub2 at usb5: vendor 1002 OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub2: 4 ports with 4 removable, self powered
uhub3 at usb6: vendor 1002 EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub3: 4 ports with 4 removable, self powered
uhub4 at usb2: vendor 1002 OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub4: 5 ports with 5 removable, self powered
uhub5 at usb1: vendor 1002 EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub5: 5 ports with 5 removable, self powered
ahcisata1 port 2: device present, speed: 1.5Gb/s
ahcisata1 port 5: device present, speed: 6.0Gb/s
ahcisata1 port 0: device present, speed: 6.0Gb/s
ahcisata1 port 1: device present, speed: 6.0Gb/s
uhub6 at usb4: vendor 1002 OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub6: 2 ports with 2 removable, self powered
wd0 at atabus2 drive 0
wd0: <ST2000DL003-9VT166>
wd0: drive supports 16-sector PIO transfers, LBA48 addressing
wd0: 1863 GB, 3876021 cyl, 16 head, 63 sec, 512 bytes/sect x 3907029168 sectors
wd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133)
wd0(ahcisata1:0:0): using PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133) (using DMA)
wd1 at atabus3 drive 0
wd1: <Samsung SSD 840 PRO Series>
wd1: drive supports 16-sector PIO transfers, LBA48 addressing
wd1: 238 GB, 496149 cyl, 16 head, 63 sec, 512 bytes/sect x 500118192 sectors
wd1: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133)
wd1(ahcisata1:1:0): using PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133) (using DMA)
atapibus0 at atabus4: 1 targets
cd0 at atapibus0 drive 0: <HL-DT-ST BD-RE  BH10LS38, K89C2CJ1805, 1.03> cdrom removable
cd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133)
cd0(ahcisata1:2:0): using PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133) (using DMA)
wd2 at atabus7 drive 0
wd2: <Samsung SSD 840 Series>
wd2: drive supports 16-sector PIO transfers, LBA48 addressing
wd2: 111 GB, 232581 cyl, 16 head, 63 sec, 512 bytes/sect x 234441648 sectors
wd2: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133)
wd2(ahcisata1:5:0): using PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133) (using DMA)
pad0: outputs: 44100Hz, 16-bit, stereo
audio1 at pad0: half duplex, playback, capture
boot device: wd0
root on wd0a dumps on wd0b
root file system type: ffs
kern.module.path=/stand/amd64/7.99.26/modules
Zone  kernel: Available graphics memory: 5735232 kiB
Zone   dma32: Available graphics memory: 2097152 kiB
00001030
00010130
00002261
drm: Supports vblank timestamp caching Rev 2 (21.10.2013).
drm: Driver supports precise vblank timestamp query.
nouveaufb0 at nouveau0
nouveau0: info: registered panic notifier
nouveaufb0: framebuffer at 0xffff800125653000, size 1920x1080, depth 32, stride 7680
wsdisplay0 at nouveaufb0 kbdmux 1
wsmux1: connecting to wsdisplay0
ugen0 at uhub0uhidev0 at uhub4 port 2 port 1
 configuration 1ugen0: Syncrosoft eLicenser, rev 1.00/1.01, addr 2
 interface 0
uhidev0: Logitech USB Keyboard, rev 1.10/66.00, addr 2, iclass 3/1
ukbd0 at uhidev0
wskbd0 at ukbd0 mux 1
wskbd0: connecting to wsdisplay0
uhidev1 at uhub4 port 1 configuration 1 interface 1
uhidev1: Logitech USB Keyboard, rev 1.10/66.00, addr 2, iclass 3/0
uhidev1: 3 report ids
uhid0 at uhidev1 reportid 1: input=1, output=0, feature=0
uhid1 at uhidev1 reportid 2: input=1, output=0, feature=0
uhid2 at uhidev1 reportid 3: input=3, output=0, feature=0
wsdisplay0: screen 1 added (default, vt100 emulation)
wsdisplay0: screen 2 added (default, vt100 emulation)
wsdisplay0: screen 3 added (default, vt100 emulation)
wsdisplay0: screen 4 added (default, vt100 emulation)


>How-To-Repeat:
Boot current on amd64?

>Fix:
n/a

>Release-Note:

>Audit-Trail:
From: Martin Husemann <martin@NetBSD.org>
To: gnats-bugs@NetBSD.org
Cc: 
Subject: Re: kern/50828: current amd64 kernel crashes at boot
Date: Fri, 19 Feb 2016 08:39:03 +0000

 The machine boots fine if I disable nouveau* and nouveaufb*.

 Martin

Responsible-Changed-From-To: kern-bug-people->riastradh
Responsible-Changed-By: riastradh@NetBSD.org
Responsible-Changed-When: Fri, 19 Feb 2016 19:02:56 +0000
Responsible-Changed-Why:
mine


From: Martin Husemann <martin@duskware.de>
To: gnats-bugs@NetBSD.org
Cc: 
Subject: Re: kern/50828 (current amd64 kernel crashes at boot)
Date: Fri, 19 Feb 2016 21:31:46 +0100

 This patch avoids the crash:

 Martin

 Index: files.nouveau
 ===================================================================
 RCS file: /cvsroot/src/sys/external/bsd/drm2/nouveau/files.nouveau,v
 retrieving revision 1.14
 diff -u -r1.14 files.nouveau
 --- files.nouveau	11 Feb 2016 04:43:32 -0000	1.14
 +++ files.nouveau	19 Feb 2016 20:29:41 -0000
 @@ -19,8 +19,8 @@
  makeoptions	nouveau	CPPFLAGS+="-I$S/external/bsd/drm2/dist/drm/nouveau/core/include"
  makeoptions	nouveau	CPPFLAGS+="-I$S/external/bsd/drm2/nouveau"

 -makeoptions	nouveau	CPPFLAGS+="-DCONFIG_NOUVEAU_DEBUG=5"
 -makeoptions	nouveau	CPPFLAGS+="-DCONFIG_NOUVEAU_DEBUG_DEFAULT=3"
 +makeoptions	nouveau	CPPFLAGS+="-DCONFIG_NOUVEAU_DEBUG=0"
 +makeoptions	nouveau	CPPFLAGS+="-DCONFIG_NOUVEAU_DEBUG_DEFAULT=0"

  # XXX If you find a way to apply the warning flags to all Nouveau
  # sources, please apply it here and remove this stupidly gigantic list!

From: Martin Husemann <martin@duskware.de>
To: gnats-bugs@NetBSD.org
Cc: 
Subject: Re: kern/50828 (current amd64 kernel crashes at boot)
Date: Sun, 21 Feb 2016 12:12:14 +0100

 On Fri, Feb 19, 2016 at 09:31:46PM +0100, Martin Husemann wrote:
 > -makeoptions	nouveau	CPPFLAGS+="-DCONFIG_NOUVEAU_DEBUG=5"
 > -makeoptions	nouveau	CPPFLAGS+="-DCONFIG_NOUVEAU_DEBUG_DEFAULT=3"
 > +makeoptions	nouveau	CPPFLAGS+="-DCONFIG_NOUVEAU_DEBUG=0"
 > +makeoptions	nouveau	CPPFLAGS+="-DCONFIG_NOUVEAU_DEBUG_DEFAULT=0"

 The crash does not always happen (though it does most of the time; I never
 needed more than two tries), and I have never seen it with a kernel that has
 all nouveau debugging turned off.

 CONFIG_NOUVEAU_DEBUG=1 is enough to trigger it.

 When it crashes, it is in usb device discovery, i.e. cpu 1 is doing:

 bus_space_write_1() at netbsd:bus_space_write_1+0xe
 putchar() at netbsd:putchar+0x115
 kprintf() at netbsd:kprintf+0x7c7
 aprint_normal_internal() at netbsd:aprint_normal_internal+0x70
 aprint_normal_dev() at netbsd:aprint_normal_dev+0x4d
 ugen_attach() at netbsd:ugen_attach+0x8b
 config_attach_loc() at netbsd:config_attach_loc+0x16e
 usbd_attachwholedevice() at netbsd:usbd_attachwholedevice+0xa1
 usbd_probe_and_attach() at netbsd:usbd_probe_and_attach+0x12c
 usbd_new_device() at netbsd:usbd_new_device+0x41d
 uhub_explore() at netbsd:uhub_explore+0x215
 usb_discover.isra.0() at netbsd:usb_discover.isra.0+0x3e
 usb_event_thread() at netbsd:usb_event_thread+0x74

 and cpu 5 crashes on a NULL deref:

 netbsd:usb_task_thread+0x5b:    movq    %rdx,0(%rax)

 with

  rax         0

 All other cpus are idle. Note that I am using a serial console here, so
 putchar should not touch the nouveau framebuffer.

 Martin

From: Martin Husemann <martin@duskware.de>
To: gnats-bugs@NetBSD.org
Cc: 
Subject: Re: kern/50828 (current amd64 kernel crashes at boot)
Date: Sun, 21 Feb 2016 12:32:29 +0100

 ... which is:

 (gdb) list *(usb_task_thread+0x5b) 
 0xffffffff8030af87 is in usb_task_thread (../../../../dev/usb/usb.c:509).
 504                             task = TAILQ_FIRST(&taskq->tasks);
 505                     }
 506                     DPRINTFN(2,("usb_task_thread: woke up task=%p\n", task));
 507                     if (task != NULL) {
 508                             mpsafe = ISSET(task->flags, USB_TASKQ_MPSAFE);
 509                             TAILQ_REMOVE(&taskq->tasks, task, next);
 510                             task->queue = USB_NUM_TASKQS;
 511                             mutex_exit(&taskq->lock);


 Martin

State-Changed-From-To: open->feedback
State-Changed-By: riastradh@NetBSD.org
State-Changed-When: Sun, 20 Aug 2023 06:18:02 +0000
State-Changed-Why:
There have been several drm updates since this PR; is it still reproducible?


State-Changed-From-To: feedback->closed
State-Changed-By: martin@NetBSD.org
State-Changed-When: Sun, 20 Aug 2023 07:58:48 +0000
State-Changed-Why:
This works now.


>Unformatted:
