NetBSD Problem Report #50364

From Manuel.Bouyer@lip6.fr  Sat Oct 24 11:07:25 2015
Return-Path: <Manuel.Bouyer@lip6.fr>
Received: from mail.netbsd.org (mail.netbsd.org [149.20.53.66])
	(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
	(Client CN "mail.netbsd.org", Issuer "Postmaster NetBSD.org" (verified OK))
	by mollari.NetBSD.org (Postfix) with ESMTPS id 9D189A5858
	for <gnats-bugs@gnats.NetBSD.org>; Sat, 24 Oct 2015 11:07:25 +0000 (UTC)
Message-Id: <20151024110720.999D621C51E@samba.lip6.fr>
Date: Sat, 24 Oct 2015 13:07:20 +0200 (MEST)
From: Manuel.Bouyer@lip6.fr
Reply-To: Manuel.Bouyer@lip6.fr
To: gnats-bugs@gnats.NetBSD.org
Subject: sparc64 32bit GENERIC_SUN4U unstable
X-Send-Pr-Version: 3.95

>Number:         50364
>Category:       port-sparc64
>Synopsis:       sparc64 32bit GENERIC_SUN4U unstable
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    port-sparc64-maintainer
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Sat Oct 24 11:10:00 +0000 2015
>Originator:     Manuel.Bouyer@lip6.fr
>Release:        NetBSD 7.99.21, 7.0, possibly 6.x
>Organization:
>Environment:
System: NetBSD samba.lip6.fr 7.99.21 NetBSD 7.99.21 (GENERIC_SUN4U) #1: Mon Oct 19 17:19:13 CEST 2015 bouyer@houla:/dsk/l1/misc/bouyer/tmp/sparc/obj/dsk/l1/misc/bouyer/HEAD/src/sys/arch/sparc/compile/GENERIC_SUN4U sparc
Architecture: sparc
Machine: sparc
>Description:
	I have Netra T1s running NetBSD/sparc with the GENERIC_SUN4U kernel,
	doing automated pbulk builds. While this machine was rock-solid running
	NetBSD 5.x, it has been unstable since I upgraded to NetBSD 7.0 (I tried
	both a 7.0 kernel and a HEAD kernel). I also noticed such problems with
	the box running 6.x pbulk. I initially suspected a hardware issue,
	but I now think that this issue is at least as old as 6.x.
	Here are a few panics and backtraces:
cpu0: data fault: pc=16565b0 rpc=6cb8700 addr=20202000
kernel trap 30: data access exception
Stopped in pid 5646.1 (rm) at   netbsd:vcache_get+0xf0: ld [%l1 + 0x8], %g1
db{0}> tr
ufs_lookup(2f66cf78, 0, 4eba238, 1, 2, 5c00) at netbsd:ufs_lookup+0xae4
VOP_LOOKUP(4eba238, 3178bb0c, 3178bd50, 0, 0, 0) at netbsd:VOP_LOOKUP+0x30
lookup_once(3178bc40, 4eba238, 3178bba0, 3178bba4, 3178bd50, 4eba238) at netbsd:lookup_once+0x1a4
namei_tryemulroot(3178bd50, 3178bd50, 3178bd28, 3178bd50, 3d7dc10, 3178bd28) at netbsd:namei_tryemulroot+0x520
namei(3178bd28, 3d7dc10, 0, 39fe1c0, 0, 751e12c) at netbsd:namei+0x1c
fd_nameiat.isra.0(ffffff9c, 3178bd28, 3178bd24, 0, 0, ffffff9c) at netbsd:fd_nameiat.isra.0+0x4c
do_sys_unlinkat(0, ffffff9c, ff430fbc, 0, 0, 3d7c800) at netbsd:do_sys_unlinkat+0x80
syscall(3178bed0, 3178bf48, ff7049e0, 42177c0, 0, ff7049e4) at netbsd:syscall+0x3d0
?(ff430fbc, 0, 4, ff402050, ff430f60, ff430f60) at 1010d6c


cpu0: data fault: pc=15c8570 rpc=ff00080000000000 addr=0
kernel trap 30: data access exception
Stopped in pid 2456.1 (xsltproc) at     netbsd:amap_wipeout+0x90:       ld [%g1 + 0x4], %g2
db{0}> tr
uvm_unmap_detach(422e718, 0, 4227dc0, 0, 1cbde40, 422e718) at netbsd:uvm_unmap_detach+0x3c 
uvmspace_free(4227d90, 3f06810, ffffffff, 1c65c80, 0, 4227d9c) at netbsd:uvmspace_free+0xd8
exit1(41c2560, 8b, 1c9c400, 1c82400, 30a4bc00, 421fb10) at netbsd:exit1+0x154
sigexit(41c2560, 8b, 421fb10, 180000, 8b, 0) at netbsd:sigexit+0x13c
postsig(b, 0, 421fb10, 41c26f4, 30a4bd00, 41c2560) at netbsd:postsig+0xd4
lwp_userret(41c2560, 1000000, 20000, 100000, 91a0002, 421fb10) at netbsd:lwp_userret+0x164
data_access_fault(30a4bed0, c8, ff53fa0c, 1232, 0, 421fb10) at netbsd:data_access_fault+0x1ac
?(fce0ca10, a, 0, 24e30, 800, fdd4b000) at 10106cc

trap type 0x34: cpu 0, pc=15dcff4 npc=15dcff8 pstate=0xffffffffff82000e<AM,PRIV,IE>
kernel trap 34: mem address not aligned
Stopped in pid 0.37 (system) at netbsd:uvmpd_trylockowner+0x14: ld [ %g1 + %g0], %i0
db{0}> tr
uvmpdpol_balancequeue(0, 4bdcd00, 0, 673c, 38b25d0, 31509e0) at netbsd:uvmpdpol_balancequeue+0x5c
uvm_pageout(0, 1f242, 0, 0, 0, 1cbd44c) at netbsd:uvm_pageout+0x290
lwp_trampoline(f0075db8, fffa3cf8, 111800, 1106c8, fffa3df8, 0) at netbsd:lwp_trampoline+0x8



login: cpu0: data fault: pc=141320c rpc=620401032123a47 addr=0
kernel trap 30: data access exception
Stopped in pid 21062.1 (cc1plus) at     netbsd:pmap_remove_pv+0x6c:     ld [%i0 + %g0], %g3
db{0}>
db{0}> tr                              
pmap_remove(4c76040, fdece000, fdfe0000, 1caf658, 0, 17cce000) at netbsd:pmap_remove+0xd8
uvm_unmap_remove(3d80af8, 1e0000, 7481f78, 32123b74, 0, 7480e48) at netbsd:uvm_unmap_remove+0x114
uvmspace_free(3d80af8, 4ea5090, ffffffff, 1c65c80, 0, 3d80b04) at netbsd:uvmspace_free+0xc0
exit1(4ac5a80, 86, 1c9c400, 1c82400, 32123bd8, 424f0f0) at netbsd:exit1+0x154
sigexit(4ac5a80, 86, 424f0f0, 180000, 86, 0) at netbsd:sigexit+0x13c
postsig(6, 0, 424f0f0, 4ac5c14, 32123cd8, 4ac5a80) at netbsd:postsig+0xd4
lwp_userret(4ac5a80, 1000000, 20000, 100000, 91a0002, 424f0f0) at netbsd:lwp_userret+0x164
syscall(32123ed0, 32123f50, ff7ded7c, 4ac5a80, 0, 424f0f0) at netbsd:syscall+0x138
?(1, 6, 0, ff7dcce8, 2, 0) at 1010d40
db{0}>
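
The four stops above land in different parts of the kernel (vnode cache lookup, amap teardown, the page daemon, and pmap). To tabulate them from a saved console log, something like the following could be used. This is a minimal sketch, assuming the log is plain text in the ddb format shown above; `STOPPED_RE` and `summarize_stops` are illustrative names, not part of any NetBSD tool.

```python
import re

# Matches ddb stop lines such as:
#   Stopped in pid 5646.1 (rm) at   netbsd:vcache_get+0xf0: ld ...
# Assumption: the log uses exactly this "pid.lwpid (command) at netbsd:func+0xoff" layout.
STOPPED_RE = re.compile(
    r"Stopped in pid (?P<pid>\d+)\.(?P<lid>\d+) \((?P<comm>[^)]+)\) at\s+"
    r"netbsd:(?P<func>[\w.]+)\+0x(?P<off>[0-9a-f]+)"
)


def summarize_stops(console_log):
    """Return a (command, function, offset) tuple for each ddb stop line."""
    return [
        (m.group("comm"), m.group("func"), int(m.group("off"), 16))
        for m in STOPPED_RE.finditer(console_log)
    ]


# The stop lines from this report, with wrapped lines rejoined.
SAMPLE = """\
Stopped in pid 5646.1 (rm) at   netbsd:vcache_get+0xf0: ld [%l1 + 0x8], %g1
Stopped in pid 2456.1 (xsltproc) at     netbsd:amap_wipeout+0x90:       ld [%g1 + 0x4], %g2
Stopped in pid 0.37 (system) at netbsd:uvmpd_trylockowner+0x14: ld [ %g1 + %g0], %i0
Stopped in pid 21062.1 (cc1plus) at     netbsd:pmap_remove_pv+0x6c:     ld [%i0 + %g0], %g3
"""

for comm, func, off in summarize_stops(SAMPLE):
    print(f"{comm:10s} {func}+{off:#x}")
```

Lines that were wrapped by the terminal need to be rejoined before matching, since the pattern only looks at a single stop line at a time.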




>How-To-Repeat:
	Run pbulk on a NetBSD/sparc GENERIC_SUN4U system running 6.0 or 7.0.
>Fix:
	unknown. 
