NetBSD Problem Report #31544
From he@smistad.uninett.no Mon Oct 10 13:08:40 2005
Return-Path: <he@smistad.uninett.no>
Received: from smistad.uninett.no (smistad.uninett.no [158.38.62.77])
by narn.netbsd.org (Postfix) with ESMTP id A70DB63B878
for <gnats-bugs@gnats.NetBSD.org>; Mon, 10 Oct 2005 13:08:39 +0000 (UTC)
Message-Id: <20051010130837.E2C6621DC42@smistad.uninett.no>
Date: Mon, 10 Oct 2005 15:08:37 +0200 (CEST)
From: he@uninett.no
Reply-To: he@uninett.no
To: gnats-bugs@netbsd.org
Subject: The ffs softdep code appears to fail to write dirty bits to disk
X-Send-Pr-Version: 3.95
>Number: 31544
>Category: kern
>Synopsis: The ffs softdep code appears to fail to write dirty bits to disk
>Confidential: no
>Severity: serious
>Priority: medium
>Responsible: kern-bug-people
>State: closed
>Class: sw-bug
>Submitter-Id: net
>Arrival-Date: Mon Oct 10 13:09:00 +0000 2005
>Closed-Date: Wed Apr 01 04:08:42 +0000 2009
>Last-Modified: Wed Apr 01 04:08:42 +0000 2009
>Originator: Havard Eidnes <he@uninett.no>
>Release: NetBSD 3.99.9 19 Sep 2005
>Organization:
UNINETT AS
>Environment:
System: NetBSD bean.urc.uninett.no 3.99.9 NetBSD 3.99.9 (GENERIC) #64: Mon Sep 19 13:49:02 CEST 2005 he@quattro.urc.uninett.no:/u/build/HEAD/obj/i386/sys/arch/i386/compile/GENERIC i386
Architecture: i386
Machine: i386
>Description:
Around the 20th last month I installed NetBSD 3.99.9 as
described above on an IBM x336 system with a 3.2G Xeon, 1GB
memory and a single SCSI disk drive. I partitioned the drive
with one large /usr ffs file system covering most of the disk,
mounted softdep, and the same day I checked out src, pkgsrc
and xsrc from a CVS mirror.
For various reasons I did not get around to doing what I had
planned to do with the machine, so it was sitting there mostly
idle, basically only running its periodic cron jobs.
Two days ago we had an unexpected power outage which of course
took out the power to this machine since it is not connected to
a UPS. One would have thought or hoped that all the dirty bits
from the CVS checkout operation would have hit the platters on
the disk drive after nearly two weeks idle time, but, no, that
did not appear to be the case.
When the machine came back up again, fsck found a multitude of
problems in the /usr file system, some examples:
BAD TYPE VALUE I=209247 OWNER=he MODE=100644
SIZE=2472 MTIME=Sep 20 14:38 2005
FILE=/src/contrib
UNALLOCATED I=214598 OWNER=he MODE=0
SIZE=0 MTIME=Sep 20 14:37 2005
NAME=/src/include/CVS/Entries
I count a total of 5657 "UNALLOCATED" and 1335 "BAD TYPE VALUE"
errors in my "fsck -y" output.
Clearly, something which was supposed to happen ("trickle sync?")
did not.
>How-To-Repeat:
Mount a file system with softdep, make lots of updates, such
as checking out the NetBSD source from CVS, leave machine idle
for a few days, and *then* yank it's power. Watch near-
unending lossage related to the bits you checked out from CVS
more than 24 hours ago when the machine is brought up again.
>Fix:
Sorry, don't know.
>Release-Note:
>Audit-Trail:
From: Andrew Doran <ad@netbsd.org>
To: gnats-bugs@gnats.NetBSD.org
Cc:
Subject: PR/31544 CVS commit: src
Date: Sun, 22 Feb 2009 20:28:07 +0000 (UTC)
Module Name: src
Committed By: ad
Date: Sun Feb 22 20:28:07 UTC 2009
Modified Files:
src/doc: CHANGES
src/lib/libp2k: p2k.c
src/sbin/fsck_lfs: lfs.c
src/sbin/mount: mount.8
src/sbin/newfs_lfs: make_lfs.c
src/sbin/tunefs: tunefs.8 tunefs.c
src/sys/arch/vax/conf: VAX780
src/sys/conf: files
src/sys/kern: sys_aio.c vfs_bio.c vfs_subr.c vfs_syscalls.c
src/sys/miscfs/specfs: spec_vnops.c
src/sys/miscfs/syncfs: sync_subr.c
src/sys/modules/ffs: Makefile
src/sys/rump/fs/lib/libffs: Makefile
src/sys/rump/include/rump: rump.h
src/sys/rump/librump/rumpvfs: rump_vfs.c vm_vfs.c
src/sys/sys: buf.h vnode.h
src/sys/ufs: files.ufs
src/sys/ufs/ffs: ffs_alloc.c ffs_balloc.c ffs_extern.h ffs_inode.c
ffs_snapshot.c ffs_vfsops.c ffs_vnops.c ffs_wapbl.c
src/sys/ufs/lfs: lfs_rfw.c lfs_vfsops.c lfs_vnops.c
src/sys/ufs/ufs: inode.h ufs_dirhash.c ufs_extern.h ufs_inode.c
ufs_lookup.c ufs_readwrite.c ufs_vnops.c ufs_wapbl.c
src/sys/uvm: uvm_pager.c
Removed Files:
src/sys/rump/librump/rumpkern/opt: opt_softdep.h
src/sys/ufs/ffs: ffs_softdep.c ffs_softdep.stub.c softdep.h
Log Message:
PR kern/26878 FFSv2 + softdep = livelock (no free ram)
PR kern/16942 panic with softdep and quotas
PR kern/19565 panic: softdep_write_inodeblock: indirect pointer #1 mismatch
PR kern/26274 softdep panic: allocdirect_merge: ...
PR kern/26374 Long delay before non-root users can write to softdep partitions
PR kern/28621 1.6.x "vp != NULL" panic in ffs_softdep.c:4653 while unmounting a softdep (+quota) filesystem
PR kern/29513 FFS+Softdep panic with unfsck-able file-corruption
PR kern/31544 The ffs softdep code appears to fail to write dirty bits to disk
PR kern/31981 stopping scsi disk can cause panic (softdep)
PR kern/32116 kernel panic in softdep (assertion failure)
PR kern/32532 softdep_trackbufs deadlock
PR kern/37191 softdep: locking against myself
PR kern/40474 Kernel panic after remounting raid root with softdep
Retire softdep, pass 2. As discussed and later formally announced on the
mailing lists.
To generate a diff of this commit:
cvs rdiff -r1.1191 -r1.1192 src/doc/CHANGES
cvs rdiff -r1.8 -r1.9 src/lib/libp2k/p2k.c
cvs rdiff -r1.29 -r1.30 src/sbin/fsck_lfs/lfs.c
cvs rdiff -r1.65 -r1.66 src/sbin/mount/mount.8
cvs rdiff -r1.13 -r1.14 src/sbin/newfs_lfs/make_lfs.c
cvs rdiff -r1.37 -r1.38 src/sbin/tunefs/tunefs.8 src/sbin/tunefs/tunefs.c
cvs rdiff -r1.1 -r1.2 src/sys/arch/vax/conf/VAX780
cvs rdiff -r1.942 -r1.943 src/sys/conf/files
cvs rdiff -r1.22 -r1.23 src/sys/kern/sys_aio.c
cvs rdiff -r1.215 -r1.216 src/sys/kern/vfs_bio.c
cvs rdiff -r1.368 -r1.369 src/sys/kern/vfs_subr.c
cvs rdiff -r1.388 -r1.389 src/sys/kern/vfs_syscalls.c
cvs rdiff -r1.122 -r1.123 src/sys/miscfs/specfs/spec_vnops.c
cvs rdiff -r1.36 -r1.37 src/sys/miscfs/syncfs/sync_subr.c
cvs rdiff -r1.2 -r1.3 src/sys/modules/ffs/Makefile
cvs rdiff -r1.6 -r1.7 src/sys/rump/fs/lib/libffs/Makefile
cvs rdiff -r1.9 -r1.10 src/sys/rump/include/rump/rump.h
cvs rdiff -r1.1 -r0 src/sys/rump/librump/rumpkern/opt/opt_softdep.h
cvs rdiff -r1.12 -r1.13 src/sys/rump/librump/rumpvfs/rump_vfs.c
cvs rdiff -r1.3 -r1.4 src/sys/rump/librump/rumpvfs/vm_vfs.c
cvs rdiff -r1.110 -r1.111 src/sys/sys/buf.h
cvs rdiff -r1.200 -r1.201 src/sys/sys/vnode.h
cvs rdiff -r1.18 -r1.19 src/sys/ufs/files.ufs
cvs rdiff -r1.121 -r1.122 src/sys/ufs/ffs/ffs_alloc.c
cvs rdiff -r1.51 -r1.52 src/sys/ufs/ffs/ffs_balloc.c
cvs rdiff -r1.74 -r1.75 src/sys/ufs/ffs/ffs_extern.h
cvs rdiff -r1.102 -r1.103 src/sys/ufs/ffs/ffs_inode.c
cvs rdiff -r1.91 -r1.92 src/sys/ufs/ffs/ffs_snapshot.c
cvs rdiff -r1.116 -r0 src/sys/ufs/ffs/ffs_softdep.c
cvs rdiff -r1.23 -r0 src/sys/ufs/ffs/ffs_softdep.stub.c
cvs rdiff -r1.242 -r1.243 src/sys/ufs/ffs/ffs_vfsops.c
cvs rdiff -r1.110 -r1.111 src/sys/ufs/ffs/ffs_vnops.c
cvs rdiff -r1.11 -r1.12 src/sys/ufs/ffs/ffs_wapbl.c
cvs rdiff -r1.11 -r0 src/sys/ufs/ffs/softdep.h
cvs rdiff -r1.11 -r1.12 src/sys/ufs/lfs/lfs_rfw.c
cvs rdiff -r1.269 -r1.270 src/sys/ufs/lfs/lfs_vfsops.c
cvs rdiff -r1.219 -r1.220 src/sys/ufs/lfs/lfs_vnops.c
cvs rdiff -r1.55 -r1.56 src/sys/ufs/ufs/inode.h
cvs rdiff -r1.27 -r1.28 src/sys/ufs/ufs/ufs_dirhash.c
cvs rdiff -r1.60 -r1.61 src/sys/ufs/ufs/ufs_extern.h
cvs rdiff -r1.77 -r1.78 src/sys/ufs/ufs/ufs_inode.c
cvs rdiff -r1.100 -r1.101 src/sys/ufs/ufs/ufs_lookup.c
cvs rdiff -r1.93 -r1.94 src/sys/ufs/ufs/ufs_readwrite.c
cvs rdiff -r1.172 -r1.173 src/sys/ufs/ufs/ufs_vnops.c
cvs rdiff -r1.4 -r1.5 src/sys/ufs/ufs/ufs_wapbl.c
cvs rdiff -r1.93 -r1.94 src/sys/uvm/uvm_pager.c
Please note that diffs are not public domain; they are subject to the
copyright notices on the relevant files.
State-Changed-From-To: open->closed
State-Changed-By: dholland@NetBSD.org
State-Changed-When: Wed, 01 Apr 2009 04:08:42 +0000
State-Changed-Why:
softdep (softupdates) has been removed.
>Unformatted:
(Contact us)
$NetBSD: query-full-pr,v 1.39 2013/11/01 18:47:49 spz Exp $
$NetBSD: gnats_config.sh,v 1.8 2006/05/07 09:23:38 tsutsui Exp $
Copyright © 1994-2007
The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.