NetBSD Problem Report #42985

From woods@once.weird.com  Tue Mar 16 22:12:02 2010
Return-Path: <woods@once.weird.com>
Received: from mail.netbsd.org (mail.netbsd.org [204.152.190.11])
	by www.NetBSD.org (Postfix) with ESMTP id 96DFD63C49E
	for <gnats-bugs@gnats.NetBSD.org>; Tue, 16 Mar 2010 22:12:02 +0000 (UTC)
Message-Id: <m1Nre2B-002a1EC@once.weird.com>
Date: Tue, 16 Mar 2010 17:10:39 -0400 (EDT)
From: "Greg A. Woods" <woods@planix.com>
Sender: "Greg A. Woods" <woods@once.weird.com>
Reply-To: "Greg A. Woods" <woods@planix.com>
To: gnats-bugs@gnats.NetBSD.org
Subject: 5.99.24 ataraid/ld(?) "Mutex error: lockdebug_barrier: spin lock held"
X-Send-Pr-Version: 3.95

>Number:         42985
>Category:       kern
>Synopsis:       5.99.24 ataraid/ld(?) "Mutex error: lockdebug_barrier: spin lock held"
>Confidential:   no
>Severity:       serious
>Priority:       high
>Responsible:    kern-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Tue Mar 16 22:15:01 +0000 2010
>Originator:     Greg A. Woods
>Release:        NetBSD-current 2010/03/16
>Organization:
Planix, Inc.; Toronto, Ontario; Canada
>Environment:
System: NetBSD 5.99.24 MONOLITHIC
Architecture: i386
Machine: i386
>Description:

	I have a machine with an Adaptect ATA RAID controller on its
	motherboard.  It has never worked properly with NetBSD, but
	occasionally I boot a NetBSD-current kernel on it to see how
	things are progressing.  Recently it looked like things were
	almost working again, but it seems that it still doesn't take
	very much activity on the device to trigger a lock error.

	I don't think this is the same as my old PR# 38273 any more,
	though I'm really not sure about that.  In any case I've filed
	this as a new PR -- anyone who knows better should feel free to
	merge it with the old one if they think that's appropriate.

	Also, as comments to PR# 40017 say, it may be related as well.


>How-To-Repeat:

historically# newfs /dev/rld0a
/dev/rld0a: 190782.0MB (390721536 sectors) block size 16384, fragment size 2048
        using 1035 cylinder groups of 184.34MB, 11798 blks, 23296 inodes.
super-block backups (for fsck_ffs -b #) at:
32, 377568, 755104, 1132640, 1510176, 1887712, 2265248, 2642784, 3020320, 3397856, 3775392, 4152928, 4530464, 4908000, 5285536,
......................................................Mutex error: lockdebug_barrier: spin lock held

lock address : 0x00000000cd832da8 type     :               spin
initialized  : 0x00000000c0537e13
shared holds :                  0 exclusive:                  1
shares wanted:                  0 exclusive:                  0
current cpu  :                  1 last held:                  1
current lwp  : 0x00000000ce28c560 last held: 0x00000000ce28c560
last locked  : 0x00000000c0539c2c unlocked : 0x00000000c0539c03
owner field  : 0x0000000000010600 wait/spin:                0/1

panic: LOCKDEBUG
fatal breakpoint trap in supervisor mode
trap type 1 code 0 eip c0258784 cs 8 eflags 246 cr2 cd2f0a5f ilevel 8
Stopped in pid 427.1 (newfs) at netbsd:breakpoint+0x4:  popl    %ebp
db{1}> machine cpu 0
using CPU 0
db{1}> trace
x86_pause(1,0,0,0,cc54ed20,ccb3fd0c,ccb7f314,c010762b,cce59b64,cc53fca8) at netbsd:x86_pause+0x2
intr_biglock_wrapper(cce59b64,cc53fca8,ccbc4960,c302c970,c303ed00,0,0,d4aa000,0,48) at netbsd:intr_biglock_wrapper+0x16
DDB lost frame for netbsd:Xintr_ioapic_level3+0xab, trying 0xccb24f74
Xintr_ioapic_level3() at netbsd:Xintr_ioapic_level3+0xab
--- interrupt ---
0:
db{1}> x/I 0x00000000c0537e13
netbsd:ldattach+0x23:   addl    $0x10,%esp
db{1}> x/I 0x00000000ce28c560                  
0xce28c560:     addb    %al,0(%eax)
db{1}> x/I 0x00000000c0539c2c
netbsd:cbufpool_ctor+0x3c:      xorl    %eax,%eax
db{1}> x/I 0x00000000c0539c03
netbsd:cbufpool_ctor+0x13:      movl    $0x1,0(%esp)
db{1}> reboot
Skipping crash dump on recursive panic
panic: wdc_exec_command: polled command not done
fatal breakpoint trap in supervisor mode
trap type 1 code 0 eip c0258784 cs 8 eflags 246 cr2 cd2f0a5f ilevel 8
Stopped in pid 427.1 (newfs) at netbsd:breakpoint+0x4:  popl    %ebp
db{1}> reboot
Skipping crash dump on recursive panic
panic: wdc_exec_command: polled command not done
fatal breakpoint trap in supervisor mode
trap type 1 code 0 eip c0258784 cs 8 eflags 246 cr2 cd2f0a5f ilevel 8
Stopped in pid 427.1 (newfs) at netbsd:breakpoint+0x4:  popl    %ebp
db{1}> reboot
rebooting...


>Fix:

	unknown

NetBSD Home
NetBSD PR Database Search

(Contact us) $NetBSD: query-full-pr,v 1.39 2013/11/01 18:47:49 spz Exp $
$NetBSD: gnats_config.sh,v 1.8 2006/05/07 09:23:38 tsutsui Exp $
Copyright © 1994-2007 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.