NetBSD Problem Report #53559

From www@NetBSD.org  Mon Aug 27 16:09:57 2018
Return-Path: <www@NetBSD.org>
Received: from mail.netbsd.org (mail.netbsd.org [199.233.217.200])
	(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
	(Client CN "mail.NetBSD.org", Issuer "mail.NetBSD.org CA" (not verified))
	by mollari.NetBSD.org (Postfix) with ESMTPS id DAFD27A16B
	for <gnats-bugs@gnats.NetBSD.org>; Mon, 27 Aug 2018 16:09:57 +0000 (UTC)
Message-Id: <20180827160955.830217A264@mollari.NetBSD.org>
Date: Mon, 27 Aug 2018 16:09:55 +0000 (UTC)
From: scole_mail@gmx.com
Reply-To: scole_mail@gmx.com
To: gnats-bugs@NetBSD.org
Subject: drm nouveau errors and lockup
X-Send-Pr-Version: www-1.0

>Number:         53559
>Category:       kern
>Synopsis:       drm nouveau errors and lockup
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    kern-bug-people
>State:          closed
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Mon Aug 27 16:10:00 +0000 2018
>Closed-Date:    Thu Sep 17 20:13:35 +0000 2020
>Last-Modified:  Thu Sep 17 20:13:35 +0000 2020
>Originator:     scole_mail
>Release:        8.0
>Organization:
none
>Environment:
NetBSD dstar 8.0_STABLE NetBSD 8.0_STABLE (GENERIC) #1: Sun Aug 26 12:25:32 PDT 2018  scole@dstar:/home/scole/nbsd/cvs/8_0/obj/sys/arch/i386/compile/GENERIC i386
>Description:
I recently installed 8.0 on my i386 machine.  I noticed there were some error messages regarding drm in the dmesg.  Here is some of the dmesg with the errors:

...
pci1: i/o space, memory space enabled, rd/line, wr/inv ok
nouveau0 at pci1 dev 0 function 0: vendor 10de product 0638 (rev. 0xa1)
drm kern info: nouveau  [  DEVICE][nouveau0] BOOT0  : 0x094c00a1
drm kern info: nouveau  [  DEVICE][nouveau0] Chipset: G94 (NV94)
drm kern info: nouveau  [  DEVICE][nouveau0] Family : NV50
drm kern info: nouveau  [   VBIOS][nouveau0] checking PRAMIN for image...
drm kern info: nouveau  [   VBIOS][nouveau0] ... appears to be valid
drm kern info: nouveau  [   VBIOS][nouveau0] using image from PRAMIN
drm kern info: nouveau  [   VBIOS][nouveau0] BIT signature found
drm kern info: nouveau  [   VBIOS][nouveau0] version 62.94.90.00.0d
nouveau0: interrupting at ioapic0 pin 16 (nouveau)
drm kern info: nouveau  [     PFB][nouveau0] RAM type: GDDR3
drm kern info: nouveau  [     PFB][nouveau0] RAM size: 768 MiB
drm kern info: nouveau  [     PFB][nouveau0]    ZCOMP: 3048 tags
drm kern info: nouveau  [    VOLT][nouveau0] GPU voltage: 1050000uv
drm kern info: nouveau  [  PTHERM][nouveau0] FAN control: none / external
drm kern info: nouveau  [  PTHERM][nouveau0] fan management: automatic
drm kern info: nouveau  [  PTHERM][nouveau0] internal sensor: yes
drm kern info: nouveau  [     CLK][nouveau0] 03: core 275 MHz shader 550 MHz memory 300 MHz
drm kern info: nouveau  [     CLK][nouveau0] 0f: core 550 MHz shader 1375 MHz memory 800 MHz
drm kern info: nouveau  [     CLK][nouveau0] --: core 275 MHz shader 550 MHz memory 300 MHz
Zone  kernel: Available graphics memory: 3808956 kiB
Zone   dma32: Available graphics memory: 2097152 kiB
drm kern info: nouveau  [     DRM] VRAM: 768 MiB
drm kern info: nouveau  [     DRM] GART: 1048576 MiB
drm kern info: nouveau  [     DRM] TMDS table version 2.0
drm kern info: nouveau  [     DRM] DCB version 4.0
drm kern info: nouveau  [     DRM] DCB outp 00: 02000300 00000028
drm kern info: nouveau  [     DRM] DCB outp 01: 01000302 00020030
drm kern info: nouveau  [     DRM] DCB outp 02: 02011386 0f200010
drm kern info: nouveau  [     DRM] DCB outp 03: 02011332 00020010
drm kern info: nouveau  [     DRM] DCB outp 04: 040223a6 0f200010
drm kern info: nouveau  [     DRM] DCB outp 05: 04022342 00020010
drm kern info: nouveau  [     DRM] DCB conn 00: 00001030
drm kern info: nouveau  [     DRM] DCB conn 01: 00050146
drm kern info: nouveau  [     DRM] DCB conn 02: 000a0246
drm: Supports vblank timestamp caching Rev 2 (21.10.2013).
drm: Driver supports precise vblank timestamp query.
drm kern info: nouveau  [     DRM] MM: using CRYPT for buffer copies
nouveaufb0 at nouveau0
nouveau0: info: registered panic notifier
nouveaufb0: framebuffer at 0xe4e3b000, size 1920x1080, depth 32, stride 7680
wsdisplay0 at nouveaufb0 kbdmux 1: console (default, vt100 emulation)
wsmux1: connecting to wsdisplay0
...
drm kern error: nouveau E[   PDISP][nouveau0] INVALID_STATE [UNK0B] chid 1 mthd 0x0080 data 0x00000000
drm kern error: nouveau E[   PDISP][nouveau0] Base 0:
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0084: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0088: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x008c: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0090: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0094: 0x00000000 -> 0xcafe0000
drm kern error: nouveau E[   PDISP][nouveau0] 	0x00a0: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x00a4: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x00c0: 0x00000000 -> 0x01000003
drm kern error: nouveau E[   PDISP][nouveau0] 	0x00c4: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x00c8: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x00cc: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x00e0: 0x00000000 -> 0x40000000
drm kern error: nouveau E[   PDISP][nouveau0] 	0x00e4: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x00e8: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x00ec: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x00fc: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0100: 0xfffe0000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0104: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0110: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0114: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] Base 0 - Image 0:
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0800: 0x00000000 -> 0x00000600
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0804: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0808: 0x00000000 -> 0x04380780
drm kern error: nouveau E[   PDISP][nouveau0] 	0x080c: 0x00000000 -> 0x00101e00
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0810: 0x0000e900 -> 0x0000cf00
drm kern error: nouveau E[   PDISP][nouveau0] Base 0 - Image 1:
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0c00: 0x00000000 -> 0x00000600
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0c04: 0x00000000              
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0c08: 0x00000000 -> 0x04380780
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0c0c: 0x00000000 -> 0x00101e00
drm kern error: nouveau E[   PDISP][nouveau0] 	0x0c10: 0x0000e900 -> 0x0000cf00
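
Only a few entries in the dump above carry a "->" marker, and they are easy to miss among the zero-valued ones. The following is a minimal sketch for pulling them out of saved dmesg text. The names `changed_methods` and `split_halves` are hypothetical helpers, not part of any NetBSD or nouveau tool. Reading the 0x0808 value as two 16-bit halves is an assumption, suggested only because the halves match the 1920x1080 framebuffer reported earlier in the dmesg.

```python
import re

# Matches dump entries such as "0x0808: 0x00000000 -> 0x04380780".
# Entries without "->" (unchanged values) are deliberately not matched.
CHANGED = re.compile(
    r"(0x[0-9a-fA-F]{4}):\s+(0x[0-9a-fA-F]{8})\s+->\s+(0x[0-9a-fA-F]{8})"
)


def changed_methods(lines):
    """Return (method, old, new) integer tuples for entries marked with '->'."""
    out = []
    for line in lines:
        m = CHANGED.search(line)
        if m:
            out.append(tuple(int(g, 16) for g in m.groups()))
    return out


def split_halves(value):
    """Split a 32-bit value into (high 16 bits, low 16 bits)."""
    return (value >> 16) & 0xFFFF, value & 0xFFFF


# Two lines copied from the dump above: one unchanged, one changed.
sample = [
    "drm kern error: nouveau E[   PDISP][nouveau0] \t0x0804: 0x00000000              ",
    "drm kern error: nouveau E[   PDISP][nouveau0] \t0x0808: 0x00000000 -> 0x04380780",
]

for mthd, old, new in changed_methods(sample):
    hi, lo = split_halves(new)
    print(f"mthd 0x{mthd:04x}: 0x{old:08x} -> 0x{new:08x} (hi={hi}, lo={lo})")
```

For the 0x0808 entry the halves come out as 1080 and 1920, which is consistent with the console framebuffer size, but that interpretation is not confirmed by anything in this report.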

The machine booted up and seemed to run fine.  It even started and ran X without a problem for hours.  But eventually, while running X, the screen froze, the mouse cursor disappeared, and I couldn't break out or enter any keyboard input.  Also, after rebooting, I noticed dmesg contained many "^G" characters, like this:

^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G
^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G
^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G
^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G
^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G
^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G
^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^G^GCopyright (c) 1996, 1997, 
1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
    2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015, 20
16, 2017,
    2018 The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California.  All rights reserved.
...
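
To read a saved copy of such a dmesg without the noise, the "^G" runs can be counted and stripped first. This is a minimal sketch, with `strip_bel` as a hypothetical helper name. It assumes the characters are either the ASCII BEL byte (0x07), which many terminals and pagers display as "^G", or the literal two-character sequence shown above. The report does not say which form was in the message buffer.

```python
import re

# ASCII BEL (0x07) or the literal caret-G pair some tools print in its place.
BEL_PATTERN = re.compile(r"\x07|\^G")


def strip_bel(text):
    """Return (cleaned_text, count) with BEL bytes and literal '^G' pairs removed."""
    return BEL_PATTERN.subn("", text)


cleaned, count = strip_bel("^G^G\x07Copyright (c) 1996, 1997,")
print(count, repr(cleaned))
```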

As a side issue, I couldn't disable nouveau with 'boot -c'.  When I booted that way, at the userconf "uc>" prompt, the cursor appeared to be blinking very fast and I couldn't type any input.  So I recompiled the GENERIC kernel without nouveau (but with PAE).

scole@dstar:~/nbsd/cvs/8_0/src/sys/arch/i386/conf> cvs diff GENERIC
Index: GENERIC
===================================================================
RCS file: /cvsroot/src/sys/arch/i386/conf/GENERIC,v
retrieving revision 1.1156.2.10
diff -r1.1156.2.10 GENERIC
32c32
< #options      PAE             # PAE mode (36 bits physical addressing)
---
> options       PAE             # PAE mode (36 bits physical addressing)
586,587c586,587
< nouveau*      at pci? dev ? function ?
< nouveaufb*    at nouveaufbbus?
---
> #nouveau*     at pci? dev ? function ?
> #nouveaufb*   at nouveaufbbus?

I've now been running X without drm for a while, without any lockups so far.
>How-To-Repeat:
Boot 8.0 i386 GENERIC kernel with NVIDIA Quadro FX 1800, run X

>Fix:

>Release-Note:

>Audit-Trail:

State-Changed-From-To: open->closed
State-Changed-By: scole@NetBSD.org
State-Changed-When: Thu, 17 Sep 2020 13:13:35 -0700
State-Changed-Why:
I don't run i386 or 8.0 on any machine anymore, and I'm not experiencing lockups with 9.0 amd64.


>Unformatted:

Copyright © 1994-2020 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.