NetBSD Problem Report #28621

From woods@building.weird.com  Sun Dec 12 03:32:55 2004
Return-Path: <woods@building.weird.com>
Received: from building.weird.com (building.weird.com [204.92.254.24])
	by narn.netbsd.org (Postfix) with ESMTP id F2E61251EB0
	for <gnats-bugs@gnats.netbsd.org>; Sun, 12 Dec 2004 03:32:54 +0000 (UTC)
Message-Id: <m1CdKTe-0024gMC@building.weird.com>
Date: Sat, 11 Dec 2004 22:32:54 -0500 (EST)
From: "Greg A. Woods" <woods@weird.com>
Reply-To: "Greg A. Woods" <woods@planix.com>
To: gnats-bugs@netbsd.org
Subject: 1.6.x "vp != NULL" panic in ffs_softdep.c:4653 while unmounting a softdep (+quota) filesystem
X-Send-Pr-Version: 3.95

>Number:         28621
>Category:       kern
>Synopsis:       1.6.x "vp != NULL" panic in ffs_softdep.c:4653 while unmounting a softdep (+quota) filesystem
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    kern-bug-people
>State:          closed
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Sun Dec 12 03:33:00 +0000 2004
>Closed-Date:    Wed Apr 01 04:08:33 +0000 2009
>Last-Modified:  Wed Apr 01 04:08:33 +0000 2009
>Originator:     Greg A. Woods
>Release:        NetBSD 1.6.2_STABLE
>Organization:
Planix, Inc.; Toronto, Ontario; Canada
>Environment:
System: NetBSD 1.6.2_STABLE
Architecture: i386
Machine: i386
>Description:

	I was unmounting a filesystem after playing with quotas.  It
	was mounted with softdeps, but I had just run "quotaoff -v -a",
	so quotas should have been disabled by then.

	there hadn't been any file I/O, other than by the kernel to the
	quota file, on that filesystem for quite some time....

>How-To-Repeat:

panic: kernel diagnostic assertion "vp != NULL" failed: file "/building/work/woods/m-NetBSD-1.6/sys/ufs/ffs/ffs_softdep.c", line 4653

Stopped in pid 27332 (umount) at        cpu_Debugger+0x4:       movl    %ebp,%esp
db> where
No such command  (someday I'm going to make that an alias! :-)
db> trace
cpu_Debugger(c04ca5ed,ffffffff,e412a1d4,c025789d,e4122b64) at cpu_Debugger+0x4
panic(c062b1e0,c04ca5ed,c04cbee7,c04cbe80,122d) at panic+0xb0
__main(c04ca5ed,c04cbe80,122d,c04cbee7,e554699c) at __main
flush_inodedep_deps(c1e14000,454401,e4122c20,c029584f,e412a1f8) at flush_inodedep_deps+0x3c
softdep_sync_metadata(e4122dac,0,e4122c90,c029c9f2) at softdep_sync_metadata+0x2fb
ffs_full_fsync(e4122dac,0,e4122d10,c0251fa9,e4122dac) at ffs_full_fsync+0x260
ffs_fsync(e4122dac,20002,0,10,0) at ffs_fsync+0x3f
ffs_flushfiles(c225f400,0,e4244cac,c02c2bb1,0) at ffs_flushfiles+0xfd
softdep_flushfiles(c225f400,0,e4244cac,c02c824d,0) at softdep_flushfiles+0x56
ffs_unmount(c225f400,0,e4244cac,e4244cac,0) at ffs_unmount+0x3d
dounmount(c225f400,0,e4244cac,0,e4122f80) at dounmount+0xea
sys_unmount(e4244cac,e4122f80,e4122f78,c033dde0) at sys_unmount+0xf5
syscall_plain(1f,1f,1f,1f,bfbfcfb4) at syscall_plain+0xa7
db> 

I (dyoung@netbsd.org) see this, too, on a Soekris board. Below is the Soekris
panic.  I ran umount(8) with the attached script.  This is a very -current
kernel, 2.99.11.  I see this often.  I can provide console access to a Soekris
board where it occurs.  It is really too bad if I cannot use softdep because
it speeds up the script by several minutes. -dcy

  Stopped in pid 26781.1 (umount) at      netbsd:cpu_Debugger+0x4:        popl    %
  ebp
  db> trace/u
  cpu_Debugger(0,c385e934,2,c3978d0c,c0271fb9) at netbsd:cpu_Debugger+0x4
  panic(c02c50a0,c029f1fc,c02a206d,c02b3340,1408) at netbsd:panic+0xa9
  __assert(c029f1fc,c02b3340,1408,c02a206d,0) at netbsd:__assert+0x19
  flush_inodedep_deps(c0546800,80,0,0,0) at netbsd:flush_inodedep_deps+0x3b
  softdep_sync_metadata(c3978e1c,0,0,4,c385e934) at netbsd:softdep_sync_metadata+0
  x6b
  ffs_full_fsync(c3978e1c,c3a5a2b8,c05f7000,c3a5c04c,0) at netbsd:ffs_full_fsync+0
  x20e
  ffs_fsync(c3978e1c,c02800a0,c385e934,c2e3f0fc,1) at netbsd:ffs_fsync+0x48
  VOP_FSYNC(c385e934,c2e3f0fc,1,0,0) at netbsd:VOP_FSYNC+0x4c
  softdep_flushworklist(c05f7000,c3978ea4,c3906008,0,0) at netbsd:softdep_flushwor
  klist+0x87
  ffs_sync(c05f7000,1,c2e3f0fc,c3906008,0) at netbsd:ffs_sync+0x10e
  dounmount(c05f7000,0,c3906008,c3906008,bfbfe930) at netbsd:dounmount+0xc3
  sys_unmount(c36efe70,c3978f70,c3978f68,c02c9ce8,0) at netbsd:sys_unmount+0xf2
  syscall_plain() at netbsd:syscall_plain+0xc2
  --- syscall (number 22) ---
  ?(4807053c,bfbfedbc,0,8068000,8065d68) at 0x4807b78b
  Bad user frame pointer: 0x48078200
  db>

>Fix:

	unknown

	In my application, a very unpleasant workaround is to disable
	softdep. -dcy

>Release-Note:

>Audit-Trail:

From: David Young <dyoung@pobox.com>
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: kern/28621: 1.6.x "vp != NULL" crash in ffs_sfotdep.c:4653 while unmounting a softdep (+quota) filesystem
Date: Fri, 7 Jan 2005 22:39:55 -0600

 I (dyoung@netbsd.org) see this, too, on a Soekris board. Below is the
 Soekris panic.  I ran umount(8) with the attached script.  This is a very
 -current kernel, 2.99.11.  I see this often.  I can provide console access
 to a Soekris board where it occurs.  It is really too bad if I cannot
 use softdep because it speeds up the script by several minutes. -dcy

   Stopped in pid 26781.1 (umount) at      netbsd:cpu_Debugger+0x4:        popl  
   %
   ebp
   db> trace/u
   cpu_Debugger(0,c385e934,2,c3978d0c,c0271fb9) at netbsd:cpu_Debugger+0x4
   panic(c02c50a0,c029f1fc,c02a206d,c02b3340,1408) at netbsd:panic+0xa9
   __assert(c029f1fc,c02b3340,1408,c02a206d,0) at netbsd:__assert+0x19
   flush_inodedep_deps(c0546800,80,0,0,0) at netbsd:flush_inodedep_deps+0x3b
   softdep_sync_metadata(c3978e1c,0,0,4,c385e934) at netbsd:softdep_sync_metadata
 +0
   x6b
   ffs_full_fsync(c3978e1c,c3a5a2b8,c05f7000,c3a5c04c,0) at netbsd:ffs_full_fsync
 +0
   x20e
   ffs_fsync(c3978e1c,c02800a0,c385e934,c2e3f0fc,1) at netbsd:ffs_fsync+0x48
   VOP_FSYNC(c385e934,c2e3f0fc,1,0,0) at netbsd:VOP_FSYNC+0x4c
   softdep_flushworklist(c05f7000,c3978ea4,c3906008,0,0) at netbsd:softdep_flushw
 or
   klist+0x87
   ffs_sync(c05f7000,1,c2e3f0fc,c3906008,0) at netbsd:ffs_sync+0x10e
   dounmount(c05f7000,0,c3906008,c3906008,bfbfe930) at netbsd:dounmount+0xc3
   sys_unmount(c36efe70,c3978f70,c3978f68,c02c9ce8,0) at netbsd:sys_unmount+0xf2
   syscall_plain() at netbsd:syscall_plain+0xc2
   --- syscall (number 22) ---
   ?(4807053c,bfbfedbc,0,8068000,8065d68) at 0x4807b78b
   Bad user frame pointer: 0x48078200
   db>

 *************
 *************

 This is a representative mount(8) output from one of the Soekris boxes:

 # mount 
 /dev/wd0a on / type ffs (read-only, local)
 mfs:10 on /dev type mfs (synchronous, local)
 /etc on /permanent/etc type null (local)
 mfs:1390 on /etc type mfs (synchronous, noatime, local)
 /home on /permanent/home type null (local)
 mfs:1398 on /home type mfs (synchronous, noatime, local)
 /tmp on /permanent/tmp type null (local)
 mfs:1420 on /tmp type mfs (synchronous, noexec, nosuid, nodev, noatime, local)
 /var on /permanent/var type null (local)
 mfs:1429 on /var type mfs (synchronous, noatime, local)

 *************
 *************

 Here is the script:

 #!/bin/sh
 # $Id: upgrade 2288 2004-12-23 07:16:30Z dyoung $

 [ "$(whoami)" = "root" ] || { 
 	echo "This script is intended to be run as root on a CUW node." 1>&2; 
 	exit 1; 
 }
 [ $1 ] || { echo "Usage: $0 user@host:/path/to/tar" 1>&2; exit 1; }

 gripe () {
 	echo "$*" 1>&2
 }

 bomb () {
 	gripe "$*"
 	cd
 	if mount | grep -q "on /mnt " ; then
 		umount /mnt
 	fi
 	exit 1
 }

 attempt () {
 	eval $* || bomb "Upgrade failed on $* [$?]"
 }

 set -u

 extract_scp_format="\([^@]*\)@\([^:]*\):\(.*\)"
 user=$(echo $1 | sed -n -e "s/$extract_scp_format/\1/p")
 host=$(echo $1 | sed -n -e "s/$extract_scp_format/\2/p")
 tar=$(echo $1 | sed -n -e "s/$extract_scp_format/\3/p")

 current=$(mount | sed -n -e "s/^[^ ]*\([ae]\) on \/ .*/\1/p")
 if [ $current = 'a' ] ; then
 	setactive=1
 	dev="/dev/wd0e"
 	rdev="/dev/rwd0e"
 elif [ $current = 'e' ] ; then
 	setactive=0
 	dev="/dev/wd0a"
 	rdev="/dev/rwd0a"
 fi
 ddev="/dev/rwd0d"

 echo "Preparing for upgrade on $dev."

 attempt newfs $rdev
 attempt mount -o noatime $dev /mnt
 attempt cd /mnt

 echo "Installing the upgrade."
 attempt "ssh $user@$host cat $tar | pax -pe -r -z"

 echo "Updating etc/fstab"
 fstab=$(mktemp /var/tmp/$(basename $0).fstab.XXXXXX)
 sed -e "s|/dev/wd0[ae] / \(.*\)|$dev / \1|" /mnt/etc/fstab > $fstab
 attempt install -o root -g wheel -m 0644 $fstab /mnt/etc/fstab
 rm $fstab

 echo "Updating bootstrap"
 attempt fdisk -f -a -$setactive $ddev
 attempt mbrlabel -frw $ddev
 attempt installboot -o console=com0kbd,speed=19200 $rdev /mnt/usr/mdec/bootxx_ffsv1

 if [ -f /usr/share/cuw_config.subr ] ; then
 	. /usr/share/cuw_config.subr
 	if [ -f $cuw_conf_file ] ; then
 		echo "Copying existing $cuw_conf_file."
 		cp $cuw_conf_file /mnt/$cuw_conf_file
 	else
 		echo "No existing configuration found."
 	fi
 fi

 cd -
 umount /mnt

 echo "Upgrade complete ($dev)."

 # $Id: upgrade 2288 2004-12-23 07:16:30Z dyoung $

 -- 
 David Young             OJC Technologies
 dyoung@ojctech.com      Urbana, IL * (217) 278-3933

From: Andrew Doran <ad@netbsd.org>
To: gnats-bugs@gnats.NetBSD.org
Cc: 
Subject: PR/28621 CVS commit: src
Date: Sun, 22 Feb 2009 20:28:07 +0000 (UTC)

 Module Name:	src
 Committed By:	ad
 Date:		Sun Feb 22 20:28:07 UTC 2009

 Modified Files:
 	src/doc: CHANGES
 	src/lib/libp2k: p2k.c
 	src/sbin/fsck_lfs: lfs.c
 	src/sbin/mount: mount.8
 	src/sbin/newfs_lfs: make_lfs.c
 	src/sbin/tunefs: tunefs.8 tunefs.c
 	src/sys/arch/vax/conf: VAX780
 	src/sys/conf: files
 	src/sys/kern: sys_aio.c vfs_bio.c vfs_subr.c vfs_syscalls.c
 	src/sys/miscfs/specfs: spec_vnops.c
 	src/sys/miscfs/syncfs: sync_subr.c
 	src/sys/modules/ffs: Makefile
 	src/sys/rump/fs/lib/libffs: Makefile
 	src/sys/rump/include/rump: rump.h
 	src/sys/rump/librump/rumpvfs: rump_vfs.c vm_vfs.c
 	src/sys/sys: buf.h vnode.h
 	src/sys/ufs: files.ufs
 	src/sys/ufs/ffs: ffs_alloc.c ffs_balloc.c ffs_extern.h ffs_inode.c
 	    ffs_snapshot.c ffs_vfsops.c ffs_vnops.c ffs_wapbl.c
 	src/sys/ufs/lfs: lfs_rfw.c lfs_vfsops.c lfs_vnops.c
 	src/sys/ufs/ufs: inode.h ufs_dirhash.c ufs_extern.h ufs_inode.c
 	    ufs_lookup.c ufs_readwrite.c ufs_vnops.c ufs_wapbl.c
 	src/sys/uvm: uvm_pager.c
 Removed Files:
 	src/sys/rump/librump/rumpkern/opt: opt_softdep.h
 	src/sys/ufs/ffs: ffs_softdep.c ffs_softdep.stub.c softdep.h

 Log Message:
 PR kern/26878 FFSv2 + softdep = livelock (no free ram)
 PR kern/16942 panic with softdep and quotas
 PR kern/19565 panic: softdep_write_inodeblock: indirect pointer #1 mismatch
 PR kern/26274 softdep panic: allocdirect_merge: ...
 PR kern/26374 Long delay before non-root users can write to softdep partitions
 PR kern/28621 1.6.x "vp != NULL" panic in ffs_softdep.c:4653 while unmounting a softdep (+quota) filesystem
 PR kern/29513 FFS+Softdep panic with unfsck-able file-corruption
 PR kern/31544 The ffs softdep code appears to fail to write dirty bits to disk
 PR kern/31981 stopping scsi disk can cause panic (softdep)
 PR kern/32116 kernel panic in softdep (assertion failure)
 PR kern/32532 softdep_trackbufs deadlock
 PR kern/37191 softdep: locking against myself
 PR kern/40474 Kernel panic after remounting raid root with softdep

 Retire softdep, pass 2. As discussed and later formally announced on the
 mailing lists.


 To generate a diff of this commit:
 cvs rdiff -r1.1191 -r1.1192 src/doc/CHANGES
 cvs rdiff -r1.8 -r1.9 src/lib/libp2k/p2k.c
 cvs rdiff -r1.29 -r1.30 src/sbin/fsck_lfs/lfs.c
 cvs rdiff -r1.65 -r1.66 src/sbin/mount/mount.8
 cvs rdiff -r1.13 -r1.14 src/sbin/newfs_lfs/make_lfs.c
 cvs rdiff -r1.37 -r1.38 src/sbin/tunefs/tunefs.8 src/sbin/tunefs/tunefs.c
 cvs rdiff -r1.1 -r1.2 src/sys/arch/vax/conf/VAX780
 cvs rdiff -r1.942 -r1.943 src/sys/conf/files
 cvs rdiff -r1.22 -r1.23 src/sys/kern/sys_aio.c
 cvs rdiff -r1.215 -r1.216 src/sys/kern/vfs_bio.c
 cvs rdiff -r1.368 -r1.369 src/sys/kern/vfs_subr.c
 cvs rdiff -r1.388 -r1.389 src/sys/kern/vfs_syscalls.c
 cvs rdiff -r1.122 -r1.123 src/sys/miscfs/specfs/spec_vnops.c
 cvs rdiff -r1.36 -r1.37 src/sys/miscfs/syncfs/sync_subr.c
 cvs rdiff -r1.2 -r1.3 src/sys/modules/ffs/Makefile
 cvs rdiff -r1.6 -r1.7 src/sys/rump/fs/lib/libffs/Makefile
 cvs rdiff -r1.9 -r1.10 src/sys/rump/include/rump/rump.h
 cvs rdiff -r1.1 -r0 src/sys/rump/librump/rumpkern/opt/opt_softdep.h
 cvs rdiff -r1.12 -r1.13 src/sys/rump/librump/rumpvfs/rump_vfs.c
 cvs rdiff -r1.3 -r1.4 src/sys/rump/librump/rumpvfs/vm_vfs.c
 cvs rdiff -r1.110 -r1.111 src/sys/sys/buf.h
 cvs rdiff -r1.200 -r1.201 src/sys/sys/vnode.h
 cvs rdiff -r1.18 -r1.19 src/sys/ufs/files.ufs
 cvs rdiff -r1.121 -r1.122 src/sys/ufs/ffs/ffs_alloc.c
 cvs rdiff -r1.51 -r1.52 src/sys/ufs/ffs/ffs_balloc.c
 cvs rdiff -r1.74 -r1.75 src/sys/ufs/ffs/ffs_extern.h
 cvs rdiff -r1.102 -r1.103 src/sys/ufs/ffs/ffs_inode.c
 cvs rdiff -r1.91 -r1.92 src/sys/ufs/ffs/ffs_snapshot.c
 cvs rdiff -r1.116 -r0 src/sys/ufs/ffs/ffs_softdep.c
 cvs rdiff -r1.23 -r0 src/sys/ufs/ffs/ffs_softdep.stub.c
 cvs rdiff -r1.242 -r1.243 src/sys/ufs/ffs/ffs_vfsops.c
 cvs rdiff -r1.110 -r1.111 src/sys/ufs/ffs/ffs_vnops.c
 cvs rdiff -r1.11 -r1.12 src/sys/ufs/ffs/ffs_wapbl.c
 cvs rdiff -r1.11 -r0 src/sys/ufs/ffs/softdep.h
 cvs rdiff -r1.11 -r1.12 src/sys/ufs/lfs/lfs_rfw.c
 cvs rdiff -r1.269 -r1.270 src/sys/ufs/lfs/lfs_vfsops.c
 cvs rdiff -r1.219 -r1.220 src/sys/ufs/lfs/lfs_vnops.c
 cvs rdiff -r1.55 -r1.56 src/sys/ufs/ufs/inode.h
 cvs rdiff -r1.27 -r1.28 src/sys/ufs/ufs/ufs_dirhash.c
 cvs rdiff -r1.60 -r1.61 src/sys/ufs/ufs/ufs_extern.h
 cvs rdiff -r1.77 -r1.78 src/sys/ufs/ufs/ufs_inode.c
 cvs rdiff -r1.100 -r1.101 src/sys/ufs/ufs/ufs_lookup.c
 cvs rdiff -r1.93 -r1.94 src/sys/ufs/ufs/ufs_readwrite.c
 cvs rdiff -r1.172 -r1.173 src/sys/ufs/ufs/ufs_vnops.c
 cvs rdiff -r1.4 -r1.5 src/sys/ufs/ufs/ufs_wapbl.c
 cvs rdiff -r1.93 -r1.94 src/sys/uvm/uvm_pager.c

 Please note that diffs are not public domain; they are subject to the
 copyright notices on the relevant files.

State-Changed-From-To: open->closed
State-Changed-By: dholland@NetBSD.org
State-Changed-When: Wed, 01 Apr 2009 04:08:33 +0000
State-Changed-Why:
softdep (softupdates) has been removed.


>Unformatted:

NetBSD Home
NetBSD PR Database Search

(Contact us) $NetBSD: query-full-pr,v 1.39 2013/11/01 18:47:49 spz Exp $
$NetBSD: gnats_config.sh,v 1.8 2006/05/07 09:23:38 tsutsui Exp $
Copyright © 1994-2007 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.