NetBSD Problem Report #51577

From www@NetBSD.org  Tue Oct 25 20:25:19 2016
Return-Path: <www@NetBSD.org>
Received: from mail.netbsd.org (mail.netbsd.org [199.233.217.200])
	(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
	(Client CN "mail.netbsd.org", Issuer "Postmaster NetBSD.org" (verified OK))
	by mollari.NetBSD.org (Postfix) with ESMTPS id 26B5E7A106
	for <gnats-bugs@gnats.NetBSD.org>; Tue, 25 Oct 2016 20:25:19 +0000 (UTC)
Message-Id: <20161025202517.4FF547A283@mollari.NetBSD.org>
Date: Tue, 25 Oct 2016 20:25:17 +0000 (UTC)
From: dhgutteridge@sympatico.ca
Reply-To: gutteridge@NetBSD.org
To: gnats-bugs@NetBSD.org
Subject: 7.99.40 (and older) i386 kernels freeze when nouveau is enabled
X-Send-Pr-Version: www-1.0

>Number:         51577
>Category:       kern
>Synopsis:       7.99.40 (and older) i386 kernels freeze when nouveau is enabled
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    riastradh
>State:          feedback
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Tue Oct 25 20:30:00 +0000 2016
>Closed-Date:    
>Last-Modified:  Tue Aug 22 22:05:01 +0000 2023
>Originator:     David H. Gutteridge
>Release:        NetBSD 7.99.40 (GENERIC.201610250000Z)
>Organization:
>Environment:
As above, off a releng build from earlier today.
>Description:
When attempting to boot an i386 GENERIC kernel from HEAD (7.99.39 or
7.99.40), I consistently get a hang when the nouveau driver tries to
attach. I'm not able to get a ddb prompt; I have to power the machine
off to get out of it.

I tried building a kernel with DEBUG enabled, and this made no
difference. No additional output was generated when it hangs.

The dmesg looks like:

total memory = 511 MB
avail memory = 483 MB
timecounter: Timecounters tick every 10.000 msec
Kernelized RAIDframe activated
timecounter: Timecounter "i8254" frequency 1193182 Hz quality 100
MICRO-STAR INTERNATIONAL CO., LTD AMD761/VIA686 ( )
mainbus0 (root)
ACPI: RSDP 0x00000000000F7330 000014 (v00 761686)
ACPI: RSDT 0x000000001FFF3000 000028 (v01 761686 MSI ACPI 42302E31 AWRD 00000000)
ACPI: FACP 0x000000001FFF3040 000074 (v01 761686 MSI ACPI 42302E31 AWRD 00000000)
ACPI: DSDT 0x000000001FFF30C0 0023FD (v01 761686 AWRDACPI 00001000 MSFT 0100000C)
ACPI: FACS 0x000000001FFF0000 000040
ACPI: 1 ACPI AML tables successfully acquired and loaded

cpu0 at mainbus0
cpu0: AMD Athlon(tm) Processor, id 0x644
acpi0 at mainbus0: Intel ACPICA 20160527
acpi0: X/RSDT: OemId <761686,MSI ACPI,42302e31>, AslId <AWRD,00000000>
LNKC: ACPI: Found matching pin for 0.11.INTA at func 0: 11
LNKA: ACPI: Found matching pin for 0.13.INTA at func 0: 5
LNKD: ACPI: Found matching pin for 0.7.INTD at func 2: 11
acpi0: SCI interrupting at int 9
timecounter: Timecounter "ACPI-Fast" frequency 3579545 Hz quality 1000
acpibut0 at acpi0 (PWRB, PNP0C0C): ACPI Power Button
acpibut1 at acpi0 (SLPB, PNP0C0E): ACPI Sleep Button
MEM (PNP0C01) at acpi0 not configured
SYSR (PNP0C02) at acpi0 not configured
attimer1 at acpi0 (TMR, PNP0100): io 0x40-0x43 irq 0
pcppi1 at acpi0 (SPKR, PNP0800): io 0x61
midi0 at pcppi1: PC speaker
sysbeep0 at pcppi1
COPR (PNP0C04) at acpi0 not configured
FDC0 (PNP0700) at acpi0 not configured
UAR1 (PNP0501) at acpi0 not configured
UAR2 (PNP0501) at acpi0 not configured
LPT1 (PNP0400) at acpi0 not configured
pckbc1 at acpi0 (PS2M, PNP0F13) (aux port): irq 12
pckbc2 at acpi0 (PS2K, PNP0303) (kbd port): io 0x60,0x64 irq 1
apm0 at acpi0: Power Management spec V1.2
attimer1: attached to pcppi1
pckbd0 at pckbc2 (kbd slot)
pckbc2: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard
pms0 at pckbc2 (aux slot)
pckbc2: using irq 12 for aux slot
wsmouse0 at pms0 mux 0
pci0 at mainbus0 bus 0: configuration mode 1
pci0: This pci host supports neither MSI nor MSI-X.
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
pchb0 at pci0 dev 0 function 0: vendor 1022 product 700e (rev. 0x13)
agp0 at pchb0: aperture at 0xd0000000, size 0x8000000
ppb0 at pci0 dev 1 function 0: vendor 1022 product 700f (rev. 0x00)
pci1 at ppb0 bus 1
nouveau0 at pci1 dev 5 function 0: vendor 10de product 0110 (rev. 0xb2)

At which point dmesg output stops and the machine is frozen.

Full dmesg generated after disabling nouveau via the bootloader:

NetBSD 7.99.40 (GENERIC.201610250000Z)
total memory = 511 MB
avail memory = 483 MB
timecounter: Timecounters tick every 10.000 msec
Kernelized RAIDframe activated
[429] nouveau* disabled
timecounter: Timecounter "i8254" frequency 1193182 Hz quality 100
MICRO-STAR INTERNATIONAL CO., LTD AMD761/VIA686 ( )
mainbus0 (root)
ACPI: RSDP 0x00000000000F7330 000014 (v00 761686)
ACPI: RSDT 0x000000001FFF3000 000028 (v01 761686 MSI ACPI 42302E31 AWRD 00000000)
ACPI: FACP 0x000000001FFF3040 000074 (v01 761686 MSI ACPI 42302E31 AWRD 00000000)
ACPI: DSDT 0x000000001FFF30C0 0023FD (v01 761686 AWRDACPI 00001000 MSFT 0100000C)
ACPI: FACS 0x000000001FFF0000 000040
ACPI: 1 ACPI AML tables successfully acquired and loaded

cpu0 at mainbus0
cpu0: AMD Athlon(tm) Processor, id 0x644
acpi0 at mainbus0: Intel ACPICA 20160527
acpi0: X/RSDT: OemId <761686,MSI ACPI,42302e31>, AslId <AWRD,00000000>
LNKC: ACPI: Found matching pin for 0.11.INTA at func 0: 11
LNKA: ACPI: Found matching pin for 0.13.INTA at func 0: 5
LNKD: ACPI: Found matching pin for 0.7.INTD at func 2: 11
acpi0: SCI interrupting at int 9
timecounter: Timecounter "ACPI-Fast" frequency 3579545 Hz quality 1000
acpibut0 at acpi0 (PWRB, PNP0C0C): ACPI Power Button
acpibut1 at acpi0 (SLPB, PNP0C0E): ACPI Sleep Button
MEM (PNP0C01) at acpi0 not configured
SYSR (PNP0C02) at acpi0 not configured
attimer1 at acpi0 (TMR, PNP0100): io 0x40-0x43 irq 0
pcppi1 at acpi0 (SPKR, PNP0800): io 0x61
midi0 at pcppi1: PC speaker
sysbeep0 at pcppi1
COPR (PNP0C04) at acpi0 not configured
FDC0 (PNP0700) at acpi0 not configured
UAR1 (PNP0501) at acpi0 not configured
UAR2 (PNP0501) at acpi0 not configured
LPT1 (PNP0400) at acpi0 not configured
pckbc1 at acpi0 (PS2M, PNP0F13) (aux port): irq 12
pckbc2 at acpi0 (PS2K, PNP0303) (kbd port): io 0x60,0x64 irq 1
apm0 at acpi0: Power Management spec V1.2
attimer1: attached to pcppi1
pckbd0 at pckbc2 (kbd slot)
pckbc2: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard
pms0 at pckbc2 (aux slot)
pckbc2: using irq 12 for aux slot
wsmouse0 at pms0 mux 0
pci0 at mainbus0 bus 0: configuration mode 1
pci0: This pci host supports neither MSI nor MSI-X.
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
pchb0 at pci0 dev 0 function 0: vendor 1022 product 700e (rev. 0x13)
agp0 at pchb0: aperture at 0xd0000000, size 0x8000000
ppb0 at pci0 dev 1 function 0: vendor 1022 product 700f (rev. 0x00)
pci1 at ppb0 bus 1
pci1: This pci host supports neither MSI nor MSI-X.
pci1: i/o space, memory space enabled
vga0 at pci1 dev 5 function 0: vendor 10de product 0110 (rev. 0xb2)
wsdisplay0 at vga0 kbdmux 1: console (80x25, vt100 emulation), using wskbd0
wsmux1: connecting to wsdisplay0
drm at vga0 not configured
pcib0 at pci0 dev 7 function 0: vendor 1106 product 0686 (rev. 0x40)
viaide0 at pci0 dev 7 function 1
viaide0: VIA Technologies VT82C686A (Apollo KX133) ATA100 controller
viaide0: bus-master DMA support present
viaide0: primary channel configured to compatibility mode
viaide0: primary channel interrupting at irq 14
atabus0 at viaide0 channel 0
viaide0: secondary channel configured to compatibility mode
viaide0: secondary channel interrupting at irq 15
atabus1 at viaide0 channel 1
uhci0 at pci0 dev 7 function 2: vendor 1106 product 3038 (rev. 0x16)
uhci0: interrupting at irq 11
usb0 at uhci0: USB revision 1.0
uhci1 at pci0 dev 7 function 3: vendor 1106 product 3038 (rev. 0x16)
uhci1: interrupting at irq 11
usb1 at uhci1: USB revision 1.0
vendor 1106 product 3057 (SMBus serial bus, revision 0x40) at pci0 dev 7 function 4 not configured
ex0 at pci0 dev 11 function 0: 3Com 3c905B-TX 10/100 Ethernet (rev. 0x30)
ex0: interrupting at irq 11
ex0: MAC address 00:01:02:ec:2c:1a
exphy0 at ex0 phy 24: 3Com internal media interface
exphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
emuxki0 at pci0 dev 13 function 0: vendor 1102 product 0002 (rev. 0x05)
emuxki0: interrupting at irq 5
emuxki0: ac97: TriTech TR28602 codec; no 3D stereo
audio0 at emuxki0: full duplex, playback, capture, mmap, independent
joy0 at pci0 dev 13 function 1: vendor 1102 product 7002 (rev. 0x05)
joy0: joystick not connected
isa0 at pcib0
lpt0 at isa0 port 0x378-0x37b irq 7
com0 at isa0 port 0x3f8-0x3ff irq 4: ns16550a, working fifo
com1 at isa0 port 0x2f8-0x2ff irq 3: ns16550a, working fifo
fdc0 at isa0 port 0x3f0-0x3f7 irq 6 drq 2
timecounter: Timecounter "clockinterrupt" frequency 100 Hz quality 0
IPsec: Initialized Security Association Processing.
uhub0 at usb0: vendor 1106 UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhub1 at usb1: vendor 1106 UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
wd0 at atabus0 drive 0
wd0: <IC35L060AVER07-0>
wd0: drive supports 16-sector PIO transfers, LBA addressing
wd0: 58644 MB, 119150 cyl, 16 head, 63 sec, 512 bytes/sect x 120103200 sectors
wd0: 32-bit data port
wd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 5 (Ultra/100)
wd1 at atabus0 drive 1
wd1: <IC35L060AVER07-0>
wd1: drive supports 16-sector PIO transfers, LBA addressing
wd1: 58644 MB, 119150 cyl, 16 head, 63 sec, 512 bytes/sect x 120103200 sectors
wd1: 32-bit data port
wd1: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 5 (Ultra/100)
wd0(viaide0:0:0): using PIO mode 4, Ultra-DMA mode 5 (Ultra/100) (using DMA)
wd1(viaide0:0:1): using PIO mode 4, Ultra-DMA mode 5 (Ultra/100) (using DMA)
atapibus0 at atabus1: 2 targets
cd0 at atapibus0 drive 0: <MATSHITADVD-ROM SR-8586, , 3Z24> cdrom removable
cd0: 32-bit data port
cd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 2 (Ultra/33)
cd1 at atapibus0 drive 1: <LG CD-RW CED-8120B, , 1.03> cdrom removable
cd1: 32-bit data port
cd1: drive supports PIO mode 4, DMA mode 2
cd0(viaide0:1:0): using PIO mode 4, Ultra-DMA mode 2 (Ultra/33) (using DMA)
cd1(viaide0:1:1): using PIO mode 4, DMA mode 2 (using DMA)
boot device: wd1
root on wd1a dumps on wd1b
root file system type: ffs
kern.module.path=/stand/i386/7.99.40/modules
wsdisplay0: screen 1 added (80x25, vt100 emulation)
wsdisplay0: screen 2 added (80x25, vt100 emulation)
wsdisplay0: screen 3 added (80x25, vt100 emulation)
wsdisplay0: screen 4 added (80x25, vt100 emulation)

This is a rather old machine with a GeForce 2 MX400 card. (I don't
actually run X on it normally, but was just trying out a current
kernel to see what happens.)

>How-To-Repeat:
Boot with nouveau drivers enabled.
>Fix:

>Release-Note:

>Audit-Trail:

Responsible-Changed-From-To: kern-bug-people->riastradh
Responsible-Changed-By: dholland@NetBSD.org
Responsible-Changed-When: Mon, 31 Oct 2016 05:16:53 +0000
Responsible-Changed-Why:
drmkms, fyi.
(maybe we should have a drmkms-bug-people?)


From: coypu@SDF.ORG
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: kern/51577: 7.99.40 (and older) i386 kernels freeze when nouveau
 is enabled
Date: Sun, 22 Jan 2017 17:44:30 +0000

 Hi,

 It seems that drm2 is doing in a lot of places allocation with WAITOK.
 I imagine that it is possible that this just hangs for some, especially
 on an older machine.

 Some of these can fail without problems, so I wonder if violently
 converting all WAITOK to NOWAIT will get you somewhere booting, if so
 we can narrow it down.

From: coypu@SDF.ORG
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: kern/51577: 7.99.40 (and older) i386 kernels freeze when nouveau
 is enabled
Date: Sun, 22 Jan 2017 20:42:12 +0000

 Keep in mind I mean just stuff in sys/external/bsd/drm2.
 (And it would be a horrible idea to use this machine as such)

State-Changed-From-To: open->feedback
State-Changed-By: mrg@NetBSD.org
State-Changed-When: Mon, 24 Dec 2018 09:15:42 +0000
State-Changed-Why:
can you test -current?  it has a higher chance of working now..


State-Changed-From-To: feedback->open
State-Changed-By: dholland@NetBSD.org
State-Changed-When: Mon, 21 Jan 2019 02:00:36 +0000
State-Changed-Why:
email bounces


State-Changed-From-To: open->feedback
State-Changed-By: dholland@NetBSD.org
State-Changed-When: Mon, 21 Jan 2019 02:01:08 +0000
State-Changed-Why:
update submitter address and try again:

can you test -current?  it has a higher chance of working now..


From: "David H. Gutteridge" <david@gutteridge.ca>
To: gnats-bugs@NetBSD.org
Cc: 
Subject: Re: kern/51577 (7.99.40 (and older) i386 kernels freeze when
 nouveau is enabled)
Date: Sun, 27 Jan 2019 21:25:57 -0500

 On Mon, 2019-01-21 at 02:01 +0000, dholland@NetBSD.org wrote:
 > can you test -current?  it has a higher chance of working now..

 There's a definite improvement with 8.99.31 vs. 8.0_STABLE. That is,
 with -current it successfully completes booting, where it simply froze
 before. The issue now is that the console is unusable: at a certain
 point in the process, it changes to a white cursor at the bottom left
 of the screen which moves back and forth a bit over time, but that's
 it. I'm able to ssh into the machine and use it remotely, though.

 Relevant dmesg excerpts from 8.99.31 are:

 [   1.0015065] nouveau0 at pci1 dev 5 function 0: vendor 10de product 0110 (rev. 0xb2)
 [   1.0015065] nouveau0: info: NVIDIA NV11 (011000b2)
 [   1.0015065] nouveau0: info: bios: version 03.11.01.24.00
 [   1.0015065] nouveau0: notice: tmr: unknown input clock freq
 [   1.0015065] nouveau0: info: fb: 32 MiB SDRAM
 [   1.0015065] Zone  kernel: Available graphics memory: 33866 kiB
 [   1.0015065] nouveau0: info: DRM: VRAM: 31 MiB
 [   1.0015065] nouveau0: info: DRM: GART: 128 MiB
 [   1.0015065] nouveau0: info: DRM: BMP version 5.20
 [   1.0015065] nouveau0: autoconfiguration error: warn: DRM: No useful DCB data in VBIOS
 [   1.0015065] nouveau0: info: DRM: Saving VGA fonts
 [   1.0015065] nouveau0: autoconfiguration error: warn: DRM: No useful DCB data in VBIOS
 [   1.0015065] kern info: [drm] Supports vblank timestamp caching Rev 2 (21.10.2013).
 [   1.0015065] kern info: [drm] Driver supports precise vblank timestamp query.
 [   1.0015065] nouveau0: info: DRM: 0xB3AC: Parsing digital output script table
 [   1.0015065] nouveau0: info: DRM: MM: using M2MF for buffer copies
 [   1.0015065] nouveaufb0 at nouveau0
 [   1.0015065] nouveaufb0: framebuffer at 0xd9ba4000, size 1024x768, depth 8, stride 1024
 [   1.0015065] wsdisplay0 at nouveaufb0 kbdmux 1: console (default, vt100 emulation), using wskbd0

 Dave


State-Changed-From-To: feedback->open
State-Changed-By: gutteridge@NetBSD.org
State-Changed-When: Sat, 02 Feb 2019 20:43:54 +0000
State-Changed-Why:
Feedback provided.

State-Changed-From-To: open->feedback
State-Changed-By: riastradh@NetBSD.org
State-Changed-When: Sun, 20 Aug 2023 06:37:15 +0000
State-Changed-Why:
Any chance you can still reproduce this one?  Lotta fixes since then.


From: "David H. Gutteridge" <david@gutteridge.ca>
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: kern/51577 (7.99.40 (and older) i386 kernels freeze when nouveau
 is enabled)
Date: Tue, 22 Aug 2023 18:00:45 -0400

 On 2023-08-20 02:37, riastradh@NetBSD.org wrote:
 > Synopsis: 7.99.40 (and older) i386 kernels freeze when nouveau is 
 > enabled
 > 
 > State-Changed-From-To: open->feedback
 > State-Changed-By: riastradh@NetBSD.org
 > State-Changed-When: Sun, 20 Aug 2023 06:37:15 +0000
 > State-Changed-Why:
 > Any chance you can still reproduce this one?  Lotta fixes since then.

 There's another significant incremental improvement. That is, the
 console now works (albeit it is weirdly faint), at least to the point I
 was interacting with it. Then I tried an X session, which also worked
 flawlessly for about a minute, then the screen got scrambled and it
 seems unable to redraw anything (mouse clicks do not result in anything
 being re-rendered, for instance).

 Initial dmesg output is:

 [   2.4670916] kern.module.path=/stand/i386/10.99.7/modules
 [   2.4670916] nouveau0: NVIDIA NV11 (011000b2)
 [   2.4970932] nouveau0: bios: version 03.11.01.24.00
 [   2.4970932] nouveau0: interrupting at irq 10 (nouveau0)
 [   2.4970932] nouveau0: notice: tmr: unknown input clock freq
 [   2.5071036] nouveau0: fb: 32 MiB SDRAM
 [   2.5071036] Zone  kernel: Available graphics memory: 27182 KiB
 [   2.5071036] nouveau0: DRM: VRAM: 31 MiB
 [   2.5071036] nouveau0: DRM: GART: 128 MiB
 [   2.5071036] nouveau0: DRM: BMP version 5.20
 [   2.5071036] nouveau0: warn: DRM: No useful DCB data in VBIOS
 [   2.5071036] nouveau0: DRM: MM: using M2MF for buffer copies
 [   2.5071036] nouveau0: DRM: Saving VGA fonts
 [   2.5470936] nouveau0: warn: DRM: No useful DCB data in VBIOS
 [   2.5470936] [drm] Supports vblank timestamp caching Rev 2 
 (21.10.2013).
 [   2.5470936] [drm] Driver supports precise vblank timestamp query.
 [   2.5870984] nouveaufb0 at nouveau0
 [   2.5870984] [drm] Initialized nouveau 1.3.1 20120801 for nouveau0 on 
 minor 0
 [   2.5870984] nouveaufb0: framebuffer at 0xd8008000, size 1920x1080, 
 depth 8, stride 2048

 At the point the screen scrambles and freezes, we then get:

 [ 170.0896665] nouveau0: autoconfiguration error: error: fifo: 
 DMA_PUSHER - ch 1 [user] get 0002b000 put 0001eda0 state 80000000 (err: 
 INVALID_CMD) push 00000000

 Last time around enabling debugging didn't yield anything extra, but I
 could try again.

 Thanks,

 Dave

>Unformatted:

NetBSD Home
NetBSD PR Database Search

(Contact us) $NetBSD: query-full-pr,v 1.47 2022/09/11 19:34:41 kim Exp $
$NetBSD: gnats_config.sh,v 1.9 2014/08/02 14:16:04 spz Exp $
Copyright © 1994-2023 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.