NetBSD Problem Report #42319

From www@NetBSD.org  Sun Nov 15 08:37:33 2009
Return-Path: <www@NetBSD.org>
Received: from mail.netbsd.org (mail.netbsd.org [204.152.190.11])
	by www.NetBSD.org (Postfix) with ESMTP id E0D2B63C411
	for <gnats-bugs@gnats.netbsd.org>; Sun, 15 Nov 2009 08:37:33 +0000 (UTC)
Message-Id: <20091115083733.4E9DC63B844@www.NetBSD.org>
Date: Sun, 15 Nov 2009 08:37:33 +0000 (UTC)
From: dalibor.gudzic@gmail.com
Reply-To: dalibor.gudzic@gmail.com
To: gnats-bugs@NetBSD.org
Subject: fatal page fault on 5.0_STABLE
X-Send-Pr-Version: www-1.0

>Number:         42319
>Category:       kern
>Synopsis:       kernel crash: fatal page fault in ath driver on 5.0_STABLE
>Confidential:   no
>Severity:       critical
>Priority:       high
>Responsible:    kern-bug-people
>State:          closed
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Sun Nov 15 08:40:00 +0000 2009
>Closed-Date:    Fri Jun 30 17:31:27 +0000 2017
>Last-Modified:  Fri Jun 30 17:31:27 +0000 2017
>Originator:     Dalibor Gudzic
>Release:        5.0_STABLE
>Organization:
None
>Environment:
NetBSD 5.0_STABLE (GENERIC) #0: Fri Nov 13 20:55:43 CET 2009
      root@nbsd.dot.org.dot:/usr/obj/sys/arch/i386/compile/GENERIC

>Description:
On FS AMilo Pro V3515 I'm getting fatal page fault after cvs update on 13/11/09. Kernel build went fine, but when booting the new kernel it spits error and drops to ddb:

db{0}>trace

uvm_fault (0xc0b69240, 0, 1) -> 0xe
fatal page fault in supervisor mode
trap type 6 code 0 eip c057a491 cs 8 eflags 10246 cr2 0 ilevel 8
kernel: supervisor trap page fault code=0
Faulted in DDB; continuing

dmesg in ddm showing (sorry, a few lines shown only as I don't have other means to get the full dmesg):

...
ppb4 at pci0 dev 19 function 1: vendor 0x1106 product 0x337a (rev. 0x00)
pci5 at ppb4 bus5
pci5: i/o space, memory space enabled
ath0 at pci5 dev 1 function 0
ath0: interrupting at ioapic0 pin 18
uvm_fault (0xc0b69240, 0, 1) -> 0xe
fatal page fault in supervisor mode
trap type 6 code 0 eip 0 cs 8 eflags 10282 cr2 0 ilevel 8
uvm_fault (0xc0b69240, 0, 1) -> 0xe
fatal page fault in supervisor mode
trap type 6 code 0 eip 0 cs 8 eflags 10282 cr2 0 ilevel 8

db{0}>

Could it be an ath0 related problem? It is not functional, I've submitted the PR earlier this year:

http://www.netbsd.org/cgi-bin/query-pr-single.pl?number=40507

This is the last working dmesg (from May):

Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
2006, 2007, 2008
The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
The Regents of the University of California.  All rights reserved.

NetBSD 5.0_STABLE (GENERIC) #0: Tue May 19 15:20:13 UTC 2009
builds@b6.netbsd.org:/home/builds/ab/netbsd-5/i386/200905190000Z-obj/home/builds/ab/netbsd-5/src/sys/arch/i386/compile/GENERIC
total memory = 1406 MB
avail memory = 1370 MB
timecounter: Timecounters tick every 10.000 msec
timecounter: Timecounter "i8254" frequency 1193182 Hz quality 100
FUJITSU SIEMENS AMILO PRO V3515 (20)
mainbus0 (root)
cpu0 at mainbus0 apid 0: Intel 686-class, 1729MHz, id 0x6e8
ioapic0 at mainbus0 apid 1: pa 0xfec00000, version 3, 24 pins
ioapic1 at mainbus0 apid 2: pa 0xfecc0000, version 3, 24 pins
acpi0 at mainbus0: Intel ACPICA 20080321
acpi0: X/RSDT: OemId <FSC   ,PC      ,06040000>, AslId < LTP,00000000>
acpi0: SCI interrupting at int 10
acpi0: fixed-feature power button present
timecounter: Timecounter "ACPI-Fast" frequency 3579545 Hz quality 1000
ACPI-Fast 24-bit timer
acpibut0 at acpi0 (PWRB, PNP0C0C): ACPI Power Button
acpilid0 at acpi0 (LID, PNP0C0D): ACPI Lid Switch
npx1 at acpi0 (COPR, PNP0C04): io 0xf0-0xff irq 13
npx1: reported by CPUID; using exception 16
attimer1 at acpi0 (TIME, PNP0100): io 0x40-0x43 irq 0
pcppi1 at acpi0 (SPKR, PNP0800): io 0x61
midi0 at pcppi1: PC speaker (CPU-intensive output)
sysbeep0 at pcppi1
pckbc1 at acpi0 (PS2M, STL3842) (aux port): irq 12
pckbc2 at acpi0 (PS2K, PNP0303) (kbd port): io 0x60,0x64 irq 1
acpiacad0 at acpi0 (ACAD, ACPI0003): ACPI AC Adapter
acpibat0 at acpi0 (BAT0, PNP0C0A-1): ACPI Battery (Control Method)
acpibat0: battery info: DPK , Lion, DPK-LMXXSS6 
FAN (PNP0C0B) at acpi0 not configured
acpitz0 at acpi0 (THRM): active cooling level 0: 59.8C critical 109.8C passive 52.8C
apm0 at acpi0: Power Management spec V1.2
attimer1: attached to pcppi1
pckbd0 at pckbc2 (kbd slot)
pckbc2: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard
pms0 at pckbc2 (aux slot)
pckbc2: using irq 12 for aux slot
wsmouse0 at pms0 mux 0
pci0 at mainbus0 bus 0: configuration mode 1
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
pchb0 at pci0 dev 0 function 0
pchb0: vendor 0x1106 product 0x0364 (rev. 0x00)
agp0 at pchb0 (v3): aperture at 0xc0000000, size 0x10000000
pchb1 at pci0 dev 0 function 1
pchb1: vendor 0x1106 product 0x1364 (rev. 0x00)
pchb2 at pci0 dev 0 function 2
pchb2: vendor 0x1106 product 0x2364 (rev. 0x00)
pchb3 at pci0 dev 0 function 3
pchb3: vendor 0x1106 product 0x3364 (rev. 0x00)
pchb4 at pci0 dev 0 function 4
pchb4: vendor 0x1106 product 0x4364 (rev. 0x00)
vendor 0x1106 product 0x5364 (interrupt system, interface 0x20) at pci0 dev 0 function 5 not configured
pchb5 at pci0 dev 0 function 6
pchb5: vendor 0x1106 product 0x6364 (rev. 0x00)
pchb6 at pci0 dev 0 function 7
pchb6: vendor 0x1106 product 0x7364 (rev. 0x00)
ppb0 at pci0 dev 1 function 0: vendor 0x1106 product 0xb198 (rev. 0x00)
pci1 at ppb0 bus 1
pci1: i/o space, memory space enabled, rd/line, wr/inv ok
vga1 at pci1 dev 0 function 0: vendor 0x1106 product 0x3371 (rev. 0x01)
wsdisplay0 at vga1 kbdmux 1: console (80x25, vt100 emulation), using wskbd0
wsmux1: connecting to wsdisplay0
drm at vga1 not configured
ppb1 at pci0 dev 2 function 0: vendor 0x1106 product 0xa364 (rev. 0x80)
pci2 at ppb1 bus 2
pci2: i/o space, memory space enabled, rd/line, wr/inv ok
ppb2 at pci0 dev 3 function 0: vendor 0x1106 product 0xc364 (rev. 0x80)
pci3 at ppb2 bus 3
pci3: i/o space, memory space enabled, rd/line, wr/inv ok
viaide0 at pci0 dev 15 function 0
viaide0: VIA Technologies VT8237A SATA Controller (rev. 0x80)
viaide0: bus-master DMA support present
viaide0: primary channel configured to native-PCI mode
viaide0: using ioapic0 pin 21 for native-PCI interrupt
atabus0 at viaide0 channel 0
viaide0: secondary channel configured to native-PCI mode
atabus1 at viaide0 channel 1
viaide1 at pci0 dev 15 function 1
viaide1: VIA Technologies VT8237A ATA133 controller
viaide1: bus-master DMA support present
viaide1: primary channel configured to compatibility mode
viaide1: primary channel ignored (disabled)
viaide1: secondary channel configured to compatibility mode
viaide1: secondary channel interrupting at ioapic0 pin 15
atabus2 at viaide1 channel 1
uhci0 at pci0 dev 16 function 0: vendor 0x1106 product 0x3038 (rev. 0xa0)
uhci0: interrupting at ioapic0 pin 20
usb0 at uhci0: USB revision 1.0
uhci1 at pci0 dev 16 function 1: vendor 0x1106 product 0x3038 (rev. 0xa0)
uhci1: interrupting at ioapic0 pin 22
usb1 at uhci1: USB revision 1.0
uhci2 at pci0 dev 16 function 2: vendor 0x1106 product 0x3038 (rev. 0xa0)
uhci2: interrupting at ioapic0 pin 21
usb2 at uhci2: USB revision 1.0
uhci3 at pci0 dev 16 function 3: vendor 0x1106 product 0x3038 (rev. 0xa0)
uhci3: interrupting at ioapic0 pin 23
usb3 at uhci3: USB revision 1.0
ehci0 at pci0 dev 16 function 4: vendor 0x1106 product 0x3104 (rev. 0x86)
ehci0: interrupting at ioapic0 pin 21
ehci0: dropped intr workaround enabled
ehci0: BIOS refuses to give up ownership, using force
ehci0: EHCI version 1.0
ehci0: companion controllers, 2 ports each: uhci0 uhci1 uhci2 uhci3
usb4 at ehci0: USB revision 2.0
pcib0 at pci0 dev 17 function 0
pcib0: vendor 0x1106 product 0x3337 (rev. 0x00)
pchb7 at pci0 dev 17 function 7
pchb7: vendor 0x1106 product 0x287e (rev. 0x00)
vr0 at pci0 dev 18 function 0: VIA VT6102 (Rhine II) 10/100 Ethernet
vr0: interrupting at ioapic0 pin 23
vr0: Ethernet address: 00:14:0b:08:64:5b
ukphy0 at vr0 phy 1: Generic IEEE 802.3u media interface
ukphy0: OUI 0x0002c6, model 0x0032, rev. 10
ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
ppb3 at pci0 dev 19 function 0: vendor 0x1106 product 0x337b (rev. 0x00)
pci4 at ppb3 bus 4
pci4: i/o space, memory space enabled
azalia0 at pci4 dev 1 function 0: Generic High Definition Audio Controller
azalia0: interrupting at ioapic0 pin 17
azalia0: host: 0x1106/0x3288 (rev. 16), HDA rev. 1.0
ppb4 at pci0 dev 19 function 1: vendor 0x1106 product 0x337a (rev. 0x00)
pci5 at ppb4 bus 5
pci5: i/o space, memory space enabled
ath0 at pci5 dev 1 function 0
ath0: interrupting at ioapic0 pin 18
ath0: unable to attach hardware; HAL status 3
isa0 at pcib0
isapnp0 at isa0 port 0x279: ISA Plug 'n Play device support
isapnp0: no ISA Plug 'n Play devices found
timecounter: Timecounter "clockinterrupt" frequency 100 Hz quality 0
timecounter: Timecounter "TSC" frequency 1729223860 Hz quality 3000
acpiacad0: AC adapter online.
azalia0: codec[0]: 0x14f1/0x5045 (rev. 1.0), HDA rev. 1.0
audio0 at azalia0: full duplex, independent
uhub0 at usb0: vendor 0x1106 UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhub1 at usb1: vendor 0x1106 UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhub2 at usb2: vendor 0x1106 UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub2: 2 ports with 2 removable, self powered
uhub3 at usb3: vendor 0x1106 UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub3: 2 ports with 2 removable, self powered
uhub4 at usb4: vendor 0x1106 EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub4: 8 ports with 8 removable, self powered
viaide0 port 0: device present, speed: 1.5Gb/s
wd0 at atabus0 drive 0: <WDC WD600BEVS-07LAT0>
wd0: drive supports 16-sector PIO transfers, LBA48 addressing
wd0: 57231 MB, 116280 cyl, 16 head, 63 sec, 512 bytes/sect x 117210240 sectors
wd0: 32-bit data port
wd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133)
wd0(viaide0:0:0): using PIO mode 4, Ultra-DMA mode 6 (Ultra/133) (using DMA)
atapibus0 at atabus2: 2 targets
cd0 at atapibus0 drive 0: <HL-DT-ST DVDRAM GSA-T10N, M006BM21412, PW02> cdrom removable
cd0: 32-bit data port
cd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 2 (Ultra/33)
cd0(viaide1:1:0): using PIO mode 4, Ultra-DMA mode 2 (Ultra/33) (using DMA)
Kernelized RAIDframe activated
pad0: outputs: 44100Hz, 16-bit, stereo
audio1 at pad0: half duplex
boot device: wd0
root on wd0a dumps on wd0b
root file system type: ffs
uhidev0 at uhub0 port 2 configuration 1 interface 0
uhidev0: vendor 0x15d9 USB Mouse, rev 1.10/1.00, addr 2, iclass 3/1
ums0 at uhidev0: 3 buttons and Z dir.
wsmouse1 at ums0 mux 0
/usr/sbin/ifwatchd[146]: watching interface pppoe0
savecore: no core dump
No/etc/rc: WARNING: $mysqld is not set properly - see rc.conf(5).
/etc/rc: WARNING: $apache is not set properly - see rc.conf(5).



Before updating src with cvs I've tried the snapshot and it failed also. I think the problem started before this update, but I haven't updated this machine in a while, and I do remember some problem with booting a few months ago. I also tried to update from source on 15/11/09 but the error is still there, preventing the boot.

Pls let me know if I could provide some more output.

Thanks
>How-To-Repeat:
Get recent source and try booting the new kernel on FS Amilo Pro.
>Fix:

>Release-Note:

>Audit-Trail:
From: Manuel Bouyer <bouyer@antioche.eu.org>
To: gnats-bugs@NetBSD.org
Cc: kern-bug-people@NetBSD.org, gnats-admin@NetBSD.org, netbsd-bugs@NetBSD.org
Subject: Re: kern/42319: uvm_fault/fatal page fault on 5.0_STABLE
Date: Sun, 15 Nov 2009 22:56:15 +0100

 > ppb4 at pci0 dev 19 function 1: vendor 0x1106 product 0x337a (rev. 0x00)
 > pci5 at ppb4 bus5
 > pci5: i/o space, memory space enabled
 > ath0 at pci5 dev 1 function 0
 > ath0: interrupting at ioapic0 pin 18
 > uvm_fault (0xc0b69240, 0, 1) -> 0xe

 Looks like the problem is in ath(4). Could you try to boot -c
 at the boot prompt, and then
 disable ath
 quit

 (or alternatively rebuild a kernel without the ath driver)

 -- 
 Manuel Bouyer <bouyer@antioche.eu.org>
      NetBSD: 26 ans d'experience feront toujours la difference
 --

From: Dalibor Gudzic <dalibor.gudzic@gmail.com>
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: kern/42319: uvm_fault/fatal page fault on 5.0_STABLE
Date: Mon, 16 Nov 2009 09:38:36 +0100

 Hi Manuel,

 Actually I have not had much time when I run into a problem, so I
 tried disabling the ath driver yesterday evening. It worked, but
 unfortunatally I had electricity problems and couldn't make a reply on
 my original PR message.
 The botom line is, yes, the ath driver is the culprit. I already said
 the ath(4) driver did not work for my card, but never prevented kernel
 from booting. Pls tell me if you need some more output.

 Thanx

From: David Holland <dholland-bugs@netbsd.org>
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: kern/42319: uvm_fault/fatal page fault on 5.0_STABLE
Date: Sat, 6 Mar 2010 19:33:10 +0000

 (sent to gnats-admin instead of gnats-bugs)

    ------

 From: Dalibor Gudzic <dalibor.gudzic@gmail.com>
 To: kern-bug-people@netbsd.org, gnats-admin@netbsd.org,
 	netbsd-bugs@netbsd.org
 Subject: Re: kern/42319: uvm_fault/fatal page fault on 5.0_STABLE
 Date: Tue, 2 Mar 2010 23:12:17 +0100

 Hi,

 Apology for top-posting, but a quick update;

 I forgot to say that I'm allowed to boot using 5.0.1, and now 5.0.2.
 The error message from dmesg (as reported in pr/40507) is still there.
 Probably the code that prevents system from booting was introduced
 later in -STABLE, or it was a bug that got corrected and is now not
 present in 5.0.2.

 All best


State-Changed-From-To: open->closed
State-Changed-By: jdolecek@NetBSD.org
State-Changed-When: Fri, 30 Jun 2017 17:31:27 +0000
State-Changed-Why:
This was actually confirmed as fixed in 5.0.2 by submitter on 2 Mar 2010.
Thanks.


>Unformatted:

NetBSD Home
NetBSD PR Database Search

(Contact us) $NetBSD: query-full-pr,v 1.39 2013/11/01 18:47:49 spz Exp $
$NetBSD: gnats_config.sh,v 1.8 2006/05/07 09:23:38 tsutsui Exp $
Copyright © 1994-2014 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.