NetBSD Problem Report #39927

From christos@zoulas.com  Sat Nov 15 20:11:08 2008
Return-Path: <christos@zoulas.com>
Received: from mail.netbsd.org (mail.netbsd.org [204.152.190.11])
	by narn.NetBSD.org (Postfix) with ESMTP id 253DF63B966
	for <gnats-bugs@gnats.NetBSD.org>; Sat, 15 Nov 2008 20:11:08 +0000 (UTC)
Message-Id: <20081115185457.38F24284E@shiny.astron.com>
Date: Sat, 15 Nov 2008 13:54:57 -0500 (EST)
From: christos@netbsd.org
Reply-To: christos@netbsd.org
To: gnats-bugs@gnats.NetBSD.org
Subject: bad sector handling broken on wd; failed reads crash the kernel.
X-Send-Pr-Version: 3.95

>Number:         39927
>Category:       kern
>Synopsis:       Bad sector handling broken on wd
>Confidential:   no
>Severity:       critical
>Priority:       high
>Responsible:    bouyer
>State:          closed
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Sat Nov 15 20:15:00 +0000 2008
>Closed-Date:    Sun Nov 15 00:51:29 +0000 2009
>Last-Modified:  Sun Nov 15 00:51:29 +0000 2009
>Originator:     Christos Zoulas
>Release:        NetBSD 5.99.01
>Organization:
	None but trying.
>Environment:
System: NetBSD shiny.astron.com 5.99.01 NetBSD 5.99.01 (SHINY) #55: Fri Nov 14 20:27:39 EST 2008 root@:/net/shiny/src-6/NetBSD/cvsroot/src/sys/arch/i386/compile/SHINY i386
Architecture: i386
Machine: i386
>Description:
	When a drive has bad sectors, the wd code retries the transfer, but
	wdrestart is called via a softintr and also tries to ltsleep, which
	cannot work in that context, so you get an assertion failure.
	Kernels from January this year work, but once they give up
	retrying, there is a secondary failure in the filesystem code,
	where we die in ffs_freefile. This happens even when you merely
	cat or stat the file that has the bad blocks.
>How-To-Repeat:
	Get a disk with bad sectors, or simulate a read failure on a
	particular fsbn.
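	For readers without a failing disk, the idea of simulating a read
	failure on one block can be sketched in user space as below. This
	is only an illustration, not wd(4) or kernel code: struct
	fake_disk, fake_disk_read and SECTOR_SIZE are hypothetical names
	invented for the example, and the 512-byte sector size is an
	assumption.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <string.h>

#define SECTOR_SIZE 512	/* assumed sector size for this sketch */

/* Hypothetical in-memory "disk" with at most one block marked bad. */
struct fake_disk {
	unsigned char *data;		/* nsectors * SECTOR_SIZE bytes */
	unsigned long nsectors;
	unsigned long bad_blkno;	/* block that always fails */
	int has_bad;			/* non-zero if bad_blkno is in effect */
};

/*
 * Read one sector into buf. Returns 0 on success, EINVAL for an
 * out-of-range block, and EIO for the block marked bad.
 */
static int
fake_disk_read(const struct fake_disk *d, unsigned long blkno, void *buf)
{
	assert(d != NULL && buf != NULL);

	if (blkno >= d->nsectors)
		return EINVAL;
	if (d->has_bad && blkno == d->bad_blkno)
		return EIO;	/* simulated medium error */

	memcpy(buf, d->data + (size_t)blkno * SECTOR_SIZE, SECTOR_SIZE);
	return 0;
}
```

	Pointing bad_blkno at a block that belongs to a known file gives a
	repeatable way to exercise the error path of whatever code sits on
	top of the read routine.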
>Fix:
	Don't use a softintr to dispatch retries; fix the filesystem code
	so that it does not die for no reason. It is really frustrating
	to have a disk going bad and the OS dying on you.
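	The first suggestion can be modelled in user space as below: the
	soft-interrupt side only queues the failed transfer, and the retry
	itself, which may need to sleep, runs later from thread context.
	This is a sketch under stated assumptions, not the fix that was
	committed and not NetBSD kernel code; every name in it (enum ctx,
	struct xfer, restart_xfer, softint_on_error, worker_drain) is
	hypothetical.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical model of the execution context we are running in. */
enum ctx { CTX_THREAD, CTX_SOFTINT };

static enum ctx current_ctx = CTX_THREAD;

/* Hypothetical transfer record; real drivers carry much more state. */
struct xfer {
	unsigned long blkno;
	int retries_left;
	struct xfer *next;
};

/* Simple FIFO of transfers waiting to be retried. */
static struct xfer *retry_head;
static struct xfer **retry_tail = &retry_head;

/* Sleeping is only legal from thread context in this model. */
static bool
can_sleep(void)
{
	return current_ctx == CTX_THREAD;
}

/*
 * Stand-in for a restart routine that may sleep. Returns -1 if it is
 * called where sleeping is not allowed or if no retries remain, and
 * 0 after consuming one retry.
 */
static int
restart_xfer(struct xfer *x)
{
	if (!can_sleep() || x->retries_left <= 0)
		return -1;
	x->retries_left--;
	return 0;
}

/* Soft-interrupt side: never restarts directly, only queues. */
static void
softint_on_error(struct xfer *x)
{
	x->next = NULL;
	*retry_tail = x;
	retry_tail = &x->next;
}

/* Thread side: drain the queue and return how many restarts succeeded. */
static int
worker_drain(void)
{
	int restarted = 0;

	assert(can_sleep());	/* must not run from the soft interrupt */
	while (retry_head != NULL) {
		struct xfer *x = retry_head;

		retry_head = x->next;
		if (retry_head == NULL)
			retry_tail = &retry_head;
		if (restart_xfer(x) == 0)
			restarted++;
	}
	return restarted;
}
```

	In this model, calling restart_xfer while current_ctx is
	CTX_SOFTINT is refused instead of sleeping, which mirrors the
	situation described in the report; the queue moves the same work
	to a context where sleeping is permitted.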

>Release-Note:

>Audit-Trail:

Responsible-Changed-From-To: kern-bug-people->bouyer
Responsible-Changed-By: bouyer@NetBSD.org
Responsible-Changed-When: Sat, 15 Nov 2008 21:14:53 +0000
Responsible-Changed-Why:
My bug.


State-Changed-From-To: open->feedback
State-Changed-By: bouyer@NetBSD.org
State-Changed-When: Sun, 16 Nov 2008 20:40:44 +0000
State-Changed-Why:
I just committed a fix; can you see if you can still reproduce it?


State-Changed-From-To: feedback->closed
State-Changed-By: dholland@NetBSD.org
State-Changed-When: Sun, 15 Nov 2009 00:51:29 +0000
State-Changed-Why:
1-year feedback timeout


>Unformatted:
