NetBSD Problem Report #40017
From Manuel.Bouyer@lip6.fr Mon Nov 24 16:07:07 2008
Return-Path: <Manuel.Bouyer@lip6.fr>
Received: from mail.netbsd.org (mail.netbsd.org [204.152.190.11])
by narn.NetBSD.org (Postfix) with ESMTP id 9425363B8BD
for <gnats-bugs@gnats.NetBSD.org>; Mon, 24 Nov 2008 16:07:07 +0000 (UTC)
Message-Id: <20081124160633.073F5FF49@pop.lip6.fr>
Date: Mon, 24 Nov 2008 17:06:32 +0100 (CET)
From: Manuel.Bouyer@lip6.fr
Reply-To: Manuel.Bouyer@lip6.fr
To: gnats-bugs@gnats.NetBSD.org
Subject: ataraid(4) deadlock
X-Send-Pr-Version: 3.95
>Number: 40017
>Category: kern
>Synopsis: ataraid(4) deadlock
>Confidential: no
>Severity: serious
>Priority: medium
>Responsible: kern-bug-people
>State: open
>Class: sw-bug
>Submitter-Id: net
>Arrival-Date: Mon Nov 24 16:10:00 +0000 2008
>Last-Modified: Tue Nov 25 09:30:02 +0000 2008
>Originator: Manuel.Bouyer@lip6.fr
>Release: NetBSD 5.0_BETA
>Organization:
>Environment:
System: NetBSD pop.lip6.fr 5.0_BETA NetBSD 5.0_BETA (GENERIC) #0: Sat Nov 22 16:55:37 PST 2008 builds@wb25:/home/builds/ab/netbsd-5/i386/200811220002Z-obj/home/builds/ab/netbsd-5/src/sys/arch/i386/compile/GENERIC i386
Architecture: i386
Machine: i386
>Description:
This box has 2 CPUs and a promise pdcide(4) controller which attaches
a ataraid volume:
pdcide0 at pci0 dev 12 function 0
pdcide0: Promise Ultra100/ATA Bus Master IDE Accelerator (rev. 0x02)
pdcide0: bus-master DMA support present
pdcide0: primary channel configured to native-PCI mode
pdcide0: using ioapic0 pin 18 for native-PCI interrupt
atabus2 at pdcide0 channel 0
pdcide0: secondary channel configured to native-PCI mode
atabus3 at pdcide0 channel 1
wd0 at atabus2 drive 0: <ST340810A>
wd0: drive supports 16-sector PIO transfers, LBA addressing
wd0: 38166 MB, 77545 cyl, 16 head, 63 sec, 512 bytes/sect x 78165360 sectors
wd0: 32-bit data port
wd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 5 (Ultra/100)
wd0(pdcide0:0:0): using PIO mode 4, Ultra-DMA mode 5 (Ultra/100) (using DMA)
ataraid0: found 1 RAID volume
ld0 at ataraid0 vendtype 0 unit 0: Promise ATA RAID-0 array
ld0: 38166 MB, 4865 cyl, 255 head, 63 sec, 512 bytes/sect x 78165297 sectors
with / and /usr mounted from ld0, the system hung while extracting netbsd-5
binary sets. ddb said:
[BREAK]fatal breakpoint trap in supervisor mode
trap type 1 code 0 eip c0533acc cs 8 eflags 202 cr2 bb909000 ilevel 8
Stopped in pid 0.4 (system) at netbsd:breakpoint+0x4: popl %ebp
db{0}> tr
breakpoint(0,3f8,5,c046aeed,cb8e842c,cb9ff4a0,cb862f6c,c1fe901c,c1fea000,7f2) at netbsd:breakpoint+0x4
comintr(cb9ff394,cb652bb4,0,0,0,0,0,0,0,0) at netbsd:comintr+0x575
DDB lost frame for netbsd:Xintr_ioapic_edge7+0xa9, trying 0xcb862f74
Xintr_ioapic_edge7() at netbsd:Xintr_ioapic_edge7+0xa9
--- interrupt ---
--- switch to interrupt stack ---
mutex_vector_enter(cc2746a4,cbbf,0,c0520a3d,cb228958,1,0,c1f7d310,c1f7d310,3e00) at netbsd:mutex_vector_enter+0x135
lddone(cc274644,c22130bc,1,1,cb23f7e8,0,ffffffff,0,c0563a00,cb227780) at netbsd:lddone+0x61
ld_ataraid_iodone_raid0(c1f7d310,0,3,8,c1f7d310,c0a6a240,cb652d40,c04afd1a,3,ccb60008) at netbsd:ld_ataraid_iodone_raid0+0x1cb
biodone2(3,ccb60008,3,cb227780,cb58a29c,cb58a004,cb652d80,c046336c,0,10) at netbsd:biodone2+0x7f
biointr(0,10,30,10,10,0,b50800,c1fc6d80,0,cb652da0) at netbsd:biointr+0x3a
softint_dispatch(cb227c80,3,0,0,0,0,cb652d90,cb652ce4,c1fc6a00,18) at netbsd:softint_dispatch+0x7c
DDB lost frame for netbsd:Xsoftintr+0x3d, trying 0xcb652d88
Xsoftintr() at netbsd:Xsoftintr+0x3d
--- interrupt ---
fatal page fault in supervisor mode
trap type 6 code 0 eip c0535fa7 cs 8 eflags 10202 cr2 3b ilevel 8
kernel: supervisor trap page fault, code=0
Faulted in DDB; continuing...
db{0}> mach cpu 1
using CPU 1
db{0}> tr
_kernel_lock(1,cb227780,cc2a26dc,c04513aa,cb224ec0,7,1,cb227780,cc283aa0,cc283aa0) at netbsd:_kernel_lock+0xd5
sleepq_block(0,0,c0a05a4d,c0a6711c,c222ca04,1a8,cc2a273c,cb228970,7c,40) at netbsd:sleepq_block+0x1c0
turnstile_block(0,1,cb23f7e8,c0a6711c,c0b08376,0,0,2,c2212164,c222cc50) at netbsd:turnstile_block+0x1a5
mutex_vector_enter(cb23f7e8,0,ca2df000,4000,cc2746a6,437c,0,c1febb64,0,d43f0000) at netbsd:mutex_vector_enter+0x370
ld_ataraid_start_raid0(cc274644,c2212164,1,4000,c1ff8a00,cc2746a4,cc274648,0,c2212164,cc274644) at netbsd:ld_ataraid_start_raid0+0x2e2
ldstart(6,c2212164,0,c04bf4f7,0,101,cc2a28bc,c04bf4f7,0,c1ff8a00) at netbsd:ldstart+0x62
ldstrategy(c2212164,cc274824,0,c2212164,c2212164,0,cc2a290c,c04cc72a,c2212164,0) at netbsd:ldstrategy+0x165
bdev_strategy(c2212164,0,cc2a291c,c04ae3ac,cc280cf4,0,cc2a293c,c04c5af6,cc2a292c,c2212164) at netbsd:bdev_strategy+0x49
spec_strategy(cc2a292c,c2212164,cc2a293c,c04adf36,ccb5e22c,1,c080b0c0,cc280cf4,c2212164,c2212164) at netbsd:spec_strategy+0x5a
VOP_STRATEGY(cc280cf4,c2212164,0,4000,cc2a29a4,cc2a29b8,4000,0,0,0) at netbsd:VOP_STRATEGY+0x66
genfs_do_io(ca2df000,5000,11,1,c0408310,4,f,c0ebb508,ccb5e22c,c0ebb508) at netbsd:genfs_do_io+0x1e6
genfs_gop_write(ccb5e22c,cc2a2b14,5,11,cc2a2b18,13,3f0,c0454a57,cc2746a6,cc2a2b14) at netbsd:genfs_gop_write+0x61
genfs_do_putpages(ccb5e22c,0,0,0,0,11,0,c0451169,cb2154d0,cc283c58) at netbsd:genfs_do_putpages+0x4d2
genfs_putpages(cc2a2bbc,ccb5ef1c,cc2a2bbc,c04bf43d,ccb5e2e4,cc27e000,c080b280,ccb5e22c,0,0) at netbsd:genfs_putpages+0x3d
VOP_PUTPAGES(ccb5e22c,0,0,0,0,11,cc2a2c0c,c04bf495,cc27e000,0) at netbsd:VOP_PUTPAGES+0x7f
ffs_full_fsync(ccb5e22c,8,1,c0451169,cb2154d0,cc283c58,0,c04b6dd5,ccb5e2cc,1) at netbsd:ffs_full_fsync+0x159
ffs_fsync(cc2a2cd8,12,80b040,1,ccb5e22c,c080acc0,ccb5e22c,cb226f00,8,0) at netbsd:ffs_fsync+0x6f
VOP_FSYNC(ccb5e22c,cb226f00,8,0,0,0,0,cc283aa0,c04cdca0,cc283aa0) at netbsd:VOP_FSYNC+0x7e
sched_sync(cc283aa0,0,c01002cd,0,c01002cd,0,0,0,0,0) at netbsd:sched_sync+0x14a
>How-To-Repeat:
tar xzpvf comp.tgz on a promise ataraid
>Fix:
workaround: boot -c and disable ld
>Audit-Trail:
From: Matthias Scheler <tron@zhadum.org.uk>
To: netbsd-bugs@NetBSD.org
Cc: gnats-bugs@NetBSD.org
Subject: Re: kern/40017: ataraid(4) deadlock
Date: Mon, 24 Nov 2008 22:49:03 +0000
On Mon, Nov 24, 2008 at 04:10:01PM +0000, Manuel.Bouyer@lip6.fr wrote:
> >Number: 40017
> >Category: kern
> >Synopsis: ataraid(4) deadlock
> >Confidential: no
> >Severity: serious
> >Priority: medium
> >Responsible: kern-bug-people
> >State: open
> >Class: sw-bug
> >Submitter-Id: net
> >Arrival-Date: Mon Nov 24 16:10:00 +0000 2008
> >Originator: Manuel.Bouyer@lip6.fr
> >Release: NetBSD 5.0_BETA
> >Organization:
> >Environment:
> System: NetBSD pop.lip6.fr 5.0_BETA NetBSD 5.0_BETA (GENERIC) #0: Sat Nov 22 16:55:37 PST 2008 builds@wb25:/home/builds/ab/netbsd-5/i386/200811220002Z-obj/home/builds/ab/netbsd-5/src/sys/arch/i386/compile/GENERIC i386
> Architecture: i386
> Machine: i386
> >Description:
> This box has 2 CPUs and a promise pdcide(4) controller which attaches
> a ataraid volume:
> pdcide0 at pci0 dev 12 function 0
> pdcide0: Promise Ultra100/ATA Bus Master IDE Accelerator (rev. 0x02)
> pdcide0: bus-master DMA support present
> pdcide0: primary channel configured to native-PCI mode
> pdcide0: using ioapic0 pin 18 for native-PCI interrupt
> atabus2 at pdcide0 channel 0
> pdcide0: secondary channel configured to native-PCI mode
> atabus3 at pdcide0 channel 1
> wd0 at atabus2 drive 0: <ST340810A>
> wd0: drive supports 16-sector PIO transfers, LBA addressing
> wd0: 38166 MB, 77545 cyl, 16 head, 63 sec, 512 bytes/sect x 78165360 sectors
> wd0: 32-bit data port
> wd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 5 (Ultra/100)
> wd0(pdcide0:0:0): using PIO mode 4, Ultra-DMA mode 5 (Ultra/100) (using DMA)
> ataraid0: found 1 RAID volume
> ld0 at ataraid0 vendtype 0 unit 0: Promise ATA RAID-0 array
> ld0: 38166 MB, 4865 cyl, 255 head, 63 sec, 512 bytes/sect x 78165297 sectors
>
> with / and /usr mounted from ld0, the system hung while extracting netbsd-5
> binary sets. ddb said:
[...]
This could be a duplicate of PR kern/38273.
Kind regards
--
Matthias Scheler http://zhadum.org.uk/
From: Manuel Bouyer <bouyer@antioche.eu.org>
To: gnats-bugs@NetBSD.org
Cc: kern-bug-people@NetBSD.org, gnats-admin@NetBSD.org, netbsd-bugs@NetBSD.org
Subject: Re: kern/40017: ataraid(4) deadlock
Date: Tue, 25 Nov 2008 10:26:01 +0100
On Mon, Nov 24, 2008 at 10:50:04PM +0000, Matthias Scheler wrote:
> > with / and /usr mounted from ld0, the system hung while extracting netbsd-5
> > binary sets. ddb said:
> [...]
>
> This could be a duplicate of PR kern/38273.
It could certainly be.
--
Manuel Bouyer <bouyer@antioche.eu.org>
NetBSD: 26 ans d'experience feront toujours la difference
--
(Contact us)
$NetBSD: query-full-pr,v 1.39 2013/11/01 18:47:49 spz Exp $
$NetBSD: gnats_config.sh,v 1.8 2006/05/07 09:23:38 tsutsui Exp $
Copyright © 1994-2007
The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.