NetBSD Problem Report #45648

From hf@spg.tu-darmstadt.de  Thu Nov 24 12:25:06 2011
Return-Path: <hf@spg.tu-darmstadt.de>
Received: from mail.netbsd.org (mail.netbsd.org [204.152.190.11])
	by www.NetBSD.org (Postfix) with ESMTP id A4E7063D8E3
	for <gnats-bugs@gnats.NetBSD.org>; Thu, 24 Nov 2011 12:25:06 +0000 (UTC)
Message-Id: <201111241225.pAOCP0VX010998@venediger.nt.e-technik.tu-darmstadt.de>
Date: Thu, 24 Nov 2011 13:25:00 +0100 (CET)
From: Hauke Fath <hf@spg.tu-darmstadt.de>
Reply-To: Hauke Fath <hf@spg.tu-darmstadt.de>
To: gnats-bugs@gnats.NetBSD.org
Cc: Hauke Fath <hf@spg.tu-darmstadt.de>
Subject: Powering off unmounted SATA disk panics
X-Send-Pr-Version: 3.95

>Number:         45648
>Category:       kern
>Synopsis:       Powering off unmounted SATA disk panics
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    kern-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Thu Nov 24 12:30:01 +0000 2011
>Last-Modified:  Thu Nov 24 17:40:02 +0000 2011
>Originator:     Hauke Fath
>Release:        NetBSD 5.1_STABLE
>Organization:
TU Darmstadt
>Environment:


System: NetBSD venediger 5.1_STABLE NetBSD 5.1_STABLE (VENEDIGER) #0: Wed Feb 23 21:05:10 CET 2011 hf@Hochstuhl:/var/obj/netbsd-builds/5/i386/sys/arch/i386/compile/VENEDIGER i386
Architecture: i386
Machine: i386
>Description:

	A sata disk, powered up and connected to the machine, but not
	mounted (might not have been mounted ever, but since the
	machine had been up 150+ d, it is hard to tell) was powered
	off. A few hours later, the machine panicked with


piixide0:1:1: recal drive fault
wd0d: device fault reading fsbn 0 (wd0 bn 0; cn 0 tn 0 sn 0), retrying
NMI ... going to debugger
Stopped in pid 0.33 (system) at netbsd:lapic_delay+0x42: movl netbsd:__link_set_bufq_strats_sym_bufq_strat_dummy+0x1684,%edi
db{1}> 


	The above is from serial console - no stack trace, since the
	machine had to be brought up asap.

>How-To-Repeat:

	Power off an unused (as in: not mounted)  sata disk.

	-- I don't know if it is that easy... 

>Fix:
	Something in the kernel appears to have hung on to the disk,
	and something else appears to have considered the resulting
	error 'fatal'. Both should not have happened.

>Audit-Trail:
From: Manuel Bouyer <bouyer@antioche.eu.org>
To: gnats-bugs@NetBSD.org
Cc: kern-bug-people@NetBSD.org, gnats-admin@NetBSD.org, netbsd-bugs@NetBSD.org
Subject: Re: kern/45648: Powering off unmounted SATA disk panics
Date: Thu, 24 Nov 2011 18:11:49 +0100

 On Thu, Nov 24, 2011 at 12:30:01PM +0000, Hauke Fath wrote:
 > System: NetBSD venediger 5.1_STABLE NetBSD 5.1_STABLE (VENEDIGER) #0: Wed Feb 23 21:05:10 CET 2011 hf@Hochstuhl:/var/obj/netbsd-builds/5/i386/sys/arch/i386/compile/VENEDIGER i386
 > Architecture: i386
 > Machine: i386
 > >Description:
 > 
 > 	A sata disk, powered up and connected to the machine, but not
 > 	mounted (might not have been mounted ever, but since the
 > 	machine had been up 150+ d, it is hard to tell) was powered
 > 	off. A few hours later, the machine panicked with
 > 
 > 
 > piixide0:1:1: recal drive fault
 > wd0d: device fault reading fsbn 0 (wd0 bn 0; cn 0 tn 0 sn 0), retrying
 > NMI ... going to debugger

 NMI, this is bad. It's usually caused by hardware problems.

 >  [...]
 > >Fix:
 > 	Something in the kernel appears to have hung on to the disk,

 possibly one of the daily task, trying to read the partition table.
 After a few errors,  the controller would be reset, powering up the
 drive.

 > 	and something else appears to have considered the resulting
 > 	error 'fatal'. Both should not have happened.

 The hardware considered it fatal, as it posted an NMI.

 -- 
 Manuel Bouyer <bouyer@antioche.eu.org>
      NetBSD: 26 ans d'experience feront toujours la difference
 --

From: "Jonathan A. Kollasch" <jakllsch@kollasch.net>
To: gnats-bugs@NetBSD.org
Cc: kern-bug-people@netbsd.org, gnats-admin@netbsd.org,
	netbsd-bugs@netbsd.org
Subject: Re: kern/45648: Powering off unmounted SATA disk panics
Date: Thu, 24 Nov 2011 17:39:41 +0000

 You left a wd(4) attached to a disk that had disappeared?

 That's bound to cause trouble.

>Unformatted:

NetBSD Home
NetBSD PR Database Search

(Contact us) $NetBSD: query-full-pr,v 1.39 2013/11/01 18:47:49 spz Exp $
$NetBSD: gnats_config.sh,v 1.8 2006/05/07 09:23:38 tsutsui Exp $
Copyright © 1994-2007 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.