NetBSD Problem Report #21335
Received: (qmail 19307 invoked by uid 605); 27 Apr 2003 07:20:26 -0000
Message-Id: <200304270720.h3R7KKke002754@pip.kardel.de>
Date: Sun, 27 Apr 2003 09:20:20 +0200 (MEST)
From: Frank Kardel <kardel@acm.org>
Sender: gnats-bugs-owner@netbsd.org
Reply-To: kardel@acm.org
To: gnats-bugs@gnats.netbsd.org
Subject: ahc -current 20030427 driver leaves process in D state after timeout/BDR
X-Send-Pr-Version: 3.95
>Number: 21335
>Category: kern
>Synopsis: ahc -current 20030427 driver leaves process in D state after timeout/BDR
>Confidential: no
>Severity: serious
>Priority: high
>Responsible: kern-bug-people
>State: closed
>Class: sw-bug
>Submitter-Id: net
>Arrival-Date: Sun Apr 27 07:21:00 +0000 2003
>Closed-Date: Tue Feb 05 14:22:05 +0000 2019
>Last-Modified: Tue Feb 05 14:22:05 +0000 2019
>Originator: Frank Kardel
>Release: NetBSD 1.6R
>Organization:
>Environment:
System: NetBSD pip 1.6R NetBSD 1.6R (PIP) #1: Sun Apr 27 08:09:36 MEST 2003 kardel@pip:/fs/IC35L060-0-a/src/NetBSD/netbsd/sys/arch/i386/compile/obj.i386/PIP i386
Architecture: i386
Machine: i386
>Description:
Accessing a troblesome tape on a Sony DAT lead to a timeout.
The messages where:
ahc0:SCB 0xe - timed out
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
ahc0: Dumping Card State while idle, at SEQADDR 0x7
Card was paused
ACCUM = 0x45, SINDEX = 0x87, DINDEX = 0x29, ARG_2 = 0x3
HCNT = 0x0 SCBPTR = 0x0
SCSISIGI[0x0] ERROR[0x0] SCSIBUSL[0x0] LASTPHASE[0x1]
SCSISEQ[0x12] SBLKCTL[0x2] SCSIRATE[0x0] SEQCTL[0x10]
SEQ_FLAGS[0xc0] SSTAT0[0x5] SSTAT1[0xa] SSTAT2[0x0]
SSTAT3[0x0] SIMODE0[0x0] SIMODE1[0xa4] SXFRCTL0[0x80]
DFCNTRL[0x0] DFSTATUS[0x2d]
STACK: 0x0 0x153 0x194 0x3
SCB count = 16
Kernel NEXTQSCB = 15
Card NEXTQSCB = 15
QINFIFO entries:
Waiting Queue entries:
Disconnected Queue entries: 0:14
QOUTFIFO entries:
Sequencer Free SCB List: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Sequencer SCB Info:
0 SCB_CONTROL[0x4c] SCB_SCSIID[0x87]
SCB_LUN[0x0] SCB_TAG[0xe]
1 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
2 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
3 SCB_CONTROL[0x0] SCB_SCst0(ahc0:0:8:0): ahc0: no longer in timeout, status = 0
ahc0: Bus Device Reset on A:8. 1 SCBs aborted
The process accessing the tape is still in D state - kill -9 resistent.
0 994 1 0 -6 0 120 656 xscmd D ?? 0:00.00 taper pip
As the device is still open a re-boot is required for accessing the
device again.
NetBSD 1.6R (PIP) #1: Sun Apr 27 08:09:36 MEST 2003
kardel@pip:/fs/IC35L060-0-a/src/NetBSD/netbsd/sys/arch/i386/compile/obj.i386/PIP
total memory = 767 MB
avail memory = 707 MB
using 6144 buffers containing 39400 KB of memory
BIOS32 rev. 0 found at 0xf0f80
mainbus0 (root)
cpu0 at mainbus0: (uniprocessor)
cpu0: AMD Athlon Model 4 (Thunderbird) (686-class), 1210.88 MHz, id 0x642
cpu0: features c1c7f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR>
cpu0: features c1c7f9ff<PGE,MCA,CMOV,PAT,PSE36,PN,MMXX,MMX>
cpu0: features c1c7f9ff<FXSR,3DNOW2,3DNOW>
cpu0: I-cache 64 KB 64b/line 2-way, D-cache 64 KB 64b/line 2-way
cpu0: L2 cache 256 KB 64b/line 16-way
cpu0: ITLB 16 4 KB entries fully associative, 8 4 MB entries fully associative
cpu0: DTLB 24 4 KB entries fully associative, 8 4 MB entries 4-way
cpu0: 8 page colors
pci0 at mainbus0 bus 0: configuration mode 1
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
pchb0 at pci0 dev 0 function 0
pchb0: VIA Technologies VT8363 KT133 System Controller (rev. 0x03)
agp0 at pchb0: aperture at 0xe6000000, size 0x10000000
ppb0 at pci0 dev 1 function 0: VIA Technologies VT8363 KT133 PCI to AGP Bridge (rev. 0x00)
pci1 at ppb0 bus 1
pci1: i/o space, memory space enabled
vga0 at pci1 dev 0 function 0: Matrox MGA G400 AGP (rev. 0x82)
wsdisplay0 at vga0 kbdmux 1: console (80x25, vt100 emulation)
wsmux1: connecting to wsdisplay0
pcib0 at pci0 dev 4 function 0
pcib0: VIA Technologies VT82C686A (Apollo KX133) PCI-ISA Bridge (rev. 0x40)
pciide0 at pci0 dev 4 function 1: VIA Technologies VT82C686A (Apollo KX133) ATA100 controller
pciide0: bus-master DMA support present
pciide0: primary channel configured to compatibility mode
wd0 at pciide0 channel 0 drive 0: <IBM-DTTA-351010>
wd0: drive supports 16-sector PIO transfers, LBA addressing
wd0: 9671 MB, 16383 cyl, 16 head, 63 sec, 512 bytes/sect x 19807200 sectors
wd0: 32-bit data port
wd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 2 (Ultra/33)
wd1 at pciide0 channel 0 drive 1: <IC35L060AVER07-0>
wd1: drive supports 16-sector PIO transfers, LBA addressing
wd1: 58644 MB, 16383 cyl, 16 head, 63 sec, 512 bytes/sect x 120103200 sectors
wd1: 32-bit data port
wd1: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 5 (Ultra/100)
pciide0: primary channel interrupting at irq 14
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 2 (Ultra/33) (using DMA data transfers)
wd1(pciide0:0:1): using PIO mode 4, Ultra-DMA mode 5 (Ultra/100) (using DMA data transfers)
pciide0: secondary channel configured to compatibility mode
atapibus0 at pciide0 channel 1: 2 targets
cd0 at atapibus0 drive 1: <YAMAHA CRW-F1E, , 1.0c> cdrom removable
cd0: 32-bit data port
cd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 2 (Ultra/33)
pciide0: secondary channel interrupting at irq 15
cd0(pciide0:1:1): using PIO mode 4, Ultra-DMA mode 2 (Ultra/33) (using DMA data transfers)
uhci0 at pci0 dev 4 function 2: VIA Technologies VT83C572 USB Controller (rev. 0x16)
uhci0: interrupting at irq 5
usb0 at uhci0: USB revision 1.0
uhub0 at usb0
uhub0: VIA Technologies UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1 at pci0 dev 4 function 3: VIA Technologies VT83C572 USB Controller (rev. 0x16)
uhci1: interrupting at irq 5
usb1 at uhci1: USB revision 1.0
uhub1 at usb1
uhub1: VIA Technologies UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
viapm0 at pci0 dev 4 function 4
viaenv0 at viapm0: disabled
auvia0 at pci0 dev 4 function 5: VIA VT82C686A AC'97 Audio (rev 0x50)
auvia0: interrupting at irq 12
auvia0: ICEnsemble ICE1232/VT1611A codec; headphone, 18 bit DAC, 18 bit ADC, KS Waves 3D
auvia0: variable rate audio
audio0 at auvia0: full duplex, mmap, independent
ahc0 at pci0 dev 9 function 0
ahc0: interrupting at irq 5
ahc0: aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs
scsibus0 at ahc0: 16 targets, 8 luns per target
isic0 at pci0 dev 10 function 0: ELSA QuickStep 1000pro/PCI
isic0: IPAC PSB2115 Version 1.1
isic0: interrupting at irq 12
BRI 0 at isic0
ex0 at pci0 dev 11 function 0: 3Com 3c905B-TX 10/100 Ethernet (rev. 0x30)
ex0: interrupting at irq 10
ex0: MAC address 00:10:5a:d8:53:54
exphy0 at ex0 phy 24: 3Com internal media interface
exphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
ex1 at pci0 dev 12 function 0: 3Com 3c905C-TX 10/100 Ethernet with mngmt (rev. 0x6c)
ex1: interrupting at irq 11
ex1: MAC address 00:50:da:ee:ca:2e
bmtphy0 at ex1 phy 24: Broadcom 3c905C internal PHY, rev. 4
bmtphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pciide1 at pci0 dev 17 function 0: Promise Ultra100/ATA Bus Master IDE Accelerator (rev. 0x02)
pciide1: bus-master DMA support present
pciide1: primary channel configured to native-PCI mode
pciide1: using irq 10 for native-PCI interrupt
wd2 at pciide1 channel 0 drive 0: <IBM-DTLA-305040>
wd2: drive supports 16-sector PIO transfers, LBA addressing
wd2: 39266 MB, 16383 cyl, 16 head, 63 sec, 512 bytes/sect x 80418240 sectors
wd2: 32-bit data port
wd2: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 5 (Ultra/100)
wd2(pciide1:0:0): using PIO mode 4, Ultra-DMA mode 5 (Ultra/100) (using DMA data transfers)
pciide1: secondary channel configured to native-PCI mode
pciide1: disabling secondary channel (no drives)
isa0 at pcib0
com0 at isa0 port 0x3f8-0x3ff irq 4: ns16550a, working fifo
com1 at isa0 port 0x2f8-0x2ff irq 3: ns16550a, working fifo
pckbc0 at isa0 port 0x60-0x64
pckbd0 at pckbc0 (kbd slot)
pckbc0: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard, using wsdisplay0
pmsprobe: reset error 5
lpt0 at isa0 port 0x378-0x37b irq 7
pcppi0 at isa0 port 0x61
sysbeep0 at pcppi0
isapnp0 at isa0 port 0x279: ISA Plug 'n Play device support
npx0 at isa0 port 0xf0-0xff: using exception 16
fdc0 at isa0 port 0x3f0-0x3f7 irq 6 drq 2
fd0 at fdc0 drive 0: 1.44MB, 80 cyl, 2 head, 18 sec
isapnp0: no ISA Plug 'n Play devices found
apm0 at mainbus0: Power Management spec V1.2
IPsec: Initialized Security Association Processing.
scsibus0: waiting 2 seconds for devices to settle...
uhub0: port error, restarting port 1
uhub0: port error, giving up port 1
uhub1: port error, restarting port 1
uhub1: port error, giving up port 1
uhub1: port error, restarting port 2
uhub1: port error, giving up port 2
ulpt0 at uhub0 port 1 configuration 1 interface 0
ulpt0: Hewlett-Packard DeskJet 1220C, rev 1.00/1.00, addr 2, iclass 7/1
ulpt0: using uni-directional mode
uhub0: port error, restarting port 2
uhub0: port error, giving up port 2
uhub2 at uhub0 port 2
uhub2: Texas Instruments UT-USB41 hub, class 9/0, rev 1.10/1.10, addr 3
uhub2: 4 ports with 4 removable, self powered
cd1 at scsibus0 target 3 lun 0: <YAMAHA, CRW8824S, 1.00> cdrom removable
cd1: sync (50.00ns offset 15), 8-bit (20.000MB/s) transfers
st0 at scsibus0 target 8 lun 0: <SONY, SDT-10000, 0101> tape removable
st0: density code 36, 512-byte blocks, write-enabled
st0: sync (50.00ns offset 8), 16-bit (40.000MB/s) transfers
umodem0 at uhub2 port 1 configuration 2 interface 0
umodem0: Lucent Technologies, Inc. ELSA Modem Board, rev 1.00/1.00, addr 4, iclass 2/2
umodem0: data interface 1, has CM over data, has break
umodem0: status change notification available
ucom0 at umodem0
uhidev0 at uhub2 port 2 configuration 1 interface 0
uhidev0: Microsoft Microsoft IntelliMouse M-. with IntelliEye, rev 1.10/1.01, addr 5, iclass 3/1
ums0 at uhidev0: 3 buttons and Z dir.
wsmouse0 at ums0 mux 0
boot device: wd0
root on wd0a dumps on wd0b
mountroot: trying smbfs...
mountroot: trying coda...
mountroot: trying msdos...
mountroot: trying cd9660...
mountroot: trying ntfs...
mountroot: trying nfs...
mountroot: trying lfs...
mountroot: trying ext2fs...
mountroot: trying ffs...
root file system type: ffs
init: copying out flags `-s' 3
init: copying out path `/sbin/init' 11
IP Filter: v3.4.29 initialized. Default = pass all, Logging = enabled
wsdisplay0: screen 1 added (80x25, vt100 emulation)
wsdisplay0: screen 2 added (80x25, vt100 emulation)
wsdisplay0: screen 3 added (80x50, vt100 emulation)
wsdisplay0: screen 4 added (80x50, vt100 emulation)
wsdisplay0: screen 5 added (80x50, vt100 emulation)
wsdisplay0: screen 6 added (80x25, vt100 emulation)
wsdisplay0: screen 7 added (80x25, vt100 emulation)
>How-To-Repeat:
Try to access a troublesome tape with a Sony DAT leading to a timeout.
I am not sure where tat is all you need to do, though. I used to
get EIO for things like this.
>Fix:
>Release-Note:
>Audit-Trail:
From: Frank Kardel <kardel@acm.org>
To: gnats-bugs@gnats.netbsd.org
Cc:
Subject: kern/21335: additional information
Date: Sun, 27 Jul 2003 16:59:58 +0200
The problem still exists in current-20030727-064126.
dmesg buffer:
ahc0:SCB 0xe - timed out
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
ahc0: Dumping Card State while idle, at SEQADDR 0x7
Card was paused
ACCUM = 0x43, SINDEX = 0x87, DINDEX = 0x29, ARG_2 = 0x3
HCNT = 0x0 SCBPTR = 0x0
SCSISIGI[0x0] ERROR[0x0] SCSIBUSL[0x0] LASTPHASE[0x1]
SCSISEQ[0x12] SBLKCTL[0x2] SCSIRATE[0x0] SEQCTL[0x10]
SEQ_FLAGS[0xc0] SSTAT0[0x5] SSTAT1[0xa] SSTAT2[0x0]
SSTAT3[0x0] SIMODE0[0x0] SIMODE1[0xa4] SXFRCTL0[0x80]
DFCNTRL[0x0] DFSTATUS[0x2d]
STACK: 0x0 0x153 0x194 0x3
SCB count = 16
Kernel NEXTQSCB = 15
Card NEXTQSCB = 15
QINFIFO entries:
Waiting Queue entries:
Disconnected Queue entries: 0:14
QOUTFIFO entries:
Sequencer Free SCB List: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Sequencer SCB Info:
0 SCB_CONTROL[0x4c] SCB_SCSIID[0x87]
SCB_LUN[0x0] SCB_TAG[0xe]
1 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
2 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
3 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
4 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
5 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
6 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
7 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
8 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
9 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
10 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
11 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
12 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
13 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
14 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
15 SCB_CONTROL[0x0] SCB_SCSIID[0xff]
SCB_LUN[0xff] SCB_TAG[0xff]
Pending list:
14 SCB_CONTROL[0x48] SCB_SCSIID[0x87]
SCB_LUN[0x0]
Kernel Free SCB list: 13 12 11 10 9 8 7 6 5 4 3 2 1 0
Untagged Q(8): 14
>>>>>>>>>>>>>>>>>
sg[0] - Addr 0x186f06fc : Length 2308
sg[1] - Addr 0x37761000 : Length 4096
sg[2] - Addr 0x3ff0a000 : Length 4096
sg[3] - Addr 0x2986b000 : Length 4096
sg[4] - Addr 0x28b44000 : Length 4096
sg[5] - Addr 0x24d3d000 : Length 4096
sg[6] - Addr 0x315b6000 : Length 4096
sg[7] - Addr 0xaa0d000 : Length 4096
sg[8] - Addr 0x267b1000 : Length 1788
ahc0:Queuing a BDR SCB
ahc0:Bus Device Reset Message Sent
st0(ahc0:0:8:0): ahc0: no longer in timeout, status = 0
ahc0: Bus Device Reset on A:8. 1 SCBs aborted
Process Status:
0 9039 5327 0 -6 0 88 4 piperd SW ?? 0:00.00 taper pip
0 23573 11753 0 -6 0 120 4 xscmd DWX ?? 0:00.00 taper pip
Stacktrace (gdb) of taper:
(gdb) where
#0 0x480c376b in close () from /usr/lib/libc.so.12
#1 0x48077001 in tapefd_close () from /usr/pkg/lib/libamtape-2.4.2p2.so
#2 0x480773e5 in tape_rdlabel () from /usr/pkg/lib/libamtape-2.4.2p2.so
#3 0x804d4b1 in dladdr ()
#4 0x480652f8 in changer_find () from /usr/pkg/lib/libamserver-2.4.2p2.so
#5 0x804d76c in dladdr ()
#6 0x804ccf1 in dladdr ()
#7 0x804d149 in dladdr ()
#8 0x804bd72 in dladdr ()
#9 0x804a061 in dladdr ()
#10 0x8049934 in free ()
(gdb)
It seems to hang in close()... see below:
Kernel trace:
trace: pid 23573 at 0xec64ebd0
ltsleep(c21d802c,10,c04198cb,0,0) at netbsd:ltsleep+0x26b
scsipi_execute_xs(c21d802c,ec64ed38,6,1,140) at netbsd:scsipi_execute_xs+0x28c
scsi_scsipi_cmd(c1880700,ec64ed38,6,0,0) at netbsd:scsi_scsipi_cmd+0xfb
scsipi_command(c1880700,ec64ed38,6,0,0) at netbsd:scsipi_command+0x64
scsipi_prevent(c1880700,0,140,c029dce6,1) at netbsd:scsipi_prevent+0x45
stclose(e03,1,2000,e518f024,e5460428) at netbsd:stclose+0x85
spec_close(ec64ee2c,0,ec64edf0,c029ee18,e74c9d40) at netbsd:spec_close+0x178
ufsspec_close(ec64ee2c,e4c89930,3,c03fd380,1) at netbsd:ufsspec_close+0x220
VOP_CLOSE(e74c9d40,1,c22b3580,e518f024,e74c9d40) at netbsd:VOP_CLOSE+0x3b
vn_close(e74c9d40,1,c22b3580,e518f024,1) at netbsd:vn_close+0x31
vn_closefile(e53fc910,e518f024,1eee3000,ffffffff,e940dc1c) at netbsd:vn_closefile+0x1a
closef(e53fc910,e518f024,ec64ef40,c027a5cb,e94f85a8) at netbsd:closef+0x1ab
fdrelease(e518f024,4,ec64ef78,e674c198,0) at netbsd:fdrelease+0xa7
sys_close(e94f85a8,ec64ef80,ec64ef78,c030f0f8,0) at netbsd:sys_close+0x23
syscall_plain(1f,4814001f,bfbf001f,4814001f,8051290) at netbsd:syscall_plain+0xab
From: Frank Kardel <Frank.Kardel@Acrys.COM>
To: gnats-bugs@gnats.NetBSD.org
Cc:
Subject: Re: kern/21335: still present in 2.0G
Date: Tue, 07 Sep 2004 13:53:08 +0200
Hi, *
Just for the record as i am being bitten by this bug on regular basis:
this bug still exists in current kernels (2.0G).
With the backup systems amanda and bacula this means an aborted backup
due to the blocked scsi subsystem
(scsictl... scanbus just hangs, device stays busy) and a reboot to get
things working again.
Regards,
Frank
From: Reinoud Zandijk <reinoud@NetBSD.org>
To: gnats-bugs@NetBSD.org
Cc:
Subject: Re: kern/21335 ahc -current 20030427 driver leaves process in D state after timeout/BDR
Date: Thu, 14 Sep 2006 04:29:06 +0200
Dear folks,
Picking up this PR for a quick look after cries from Tracy, it seems like
the problem is not in src/sys/dev/scsipi/st.c as i first thought but in the
interaction between the scsipi framework and the ahc driver.
The ahc driver contains a bug that gets
src/sys/dev/scsipi/scsipi_base.c:scsipi_execute_xs() in an endless loop
since the failed command is not getting (xs->xs_status & XS_STS_DONE) set.
This leaves the close() call to wait for ever.
In the ahc driver src/sys/dev/ic/aic7xxx_osm.c:ahc_timeout() `magic' is
used to insert a bus reset (notification?) command at line 978 that places
the command in front of the offending command. I think though that this
code will only work when the queue is completely full; its not clear enough
I think that in this piece of code the offending timed out SCSI call
somehow gets `lost' and since we're polling on it in scsipi_execute_xs()
resulting in an endless loop.
The solution i'd see for the ahc driver is to pull over the changes made
by OpenBSD and FreeBSD to the driver's timeout code that tackels the
problem in a different way by using a QUEUE structure.
Note that the 0x0E SCB's found in this PR are `vender specific' (function
unknown) and Tracy's 0x0F SCB is `READ REVERSE'(6?) that is specified as
*optional*. It might be that the software is getting signalled there is a
write error, then wants to read a bit back to see what went wrong/get a
token. If the drive is shabby it might not understand the command and thus
fail but i doubt that is the case; it is a possibility though.
Regards,
Reinoud
From: Paul Ripke <stix@stix.id.au>
To: NetBSD gnats-bugs <gnats-bugs@NetBSD.org>
Cc:
Subject: Re: kern/21335: ahc driver leaves process in D state after timeout/BDR
Date: Thu, 18 Oct 2007 12:11:34 +1000
I can confirm this bug still exists with NetBSD 4.0 RC1.
$ ps alxw | grep xsc
0 8014 20030 368 -6 0 764 4 xscmd DW ? 0:00.09 /sbin/dump -0uat -h 0 -b 64 -f /dev/nrst0 /var
Oct 18 02:28:56 zion /netbsd: ahc0:SCB 0xf - timed out
Oct 18 02:28:56 zion /netbsd: >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
Oct 18 02:28:56 zion /netbsd: ahc0: Dumping Card State while idle, at SEQADDR 0x9
Oct 18 02:28:56 zion /netbsd: Card was paused
Oct 18 02:28:56 zion /netbsd: ACCUM = 0x4, SINDEX = 0x67, DINDEX = 0x27, ARG_2 = 0x3
Oct 18 02:28:56 zion /netbsd: HCNT = 0x0 SCBPTR = 0x0
Oct 18 02:28:56 zion /netbsd: SCSIPHASE[0x0] SCSISIGI[0x0] ERROR[0x0] SCSIBUSL[0x0]
Oct 18 02:28:56 zion /netbsd: LASTPHASE[0x1] SCSISEQ[0x12] SBLKCTL[0xa] SCSIRATE[0x0]
Oct 18 02:28:56 zion /netbsd: SEQCTL[0x10] SEQ_FLAGS[0xc0] SSTAT0[0x0] SSTAT1[0x8]
Oct 18 02:28:56 zion /netbsd: SSTAT2[0x0] SSTAT3[0x0] SIMODE0[0x8] SIMODE1[0xa4]
Oct 18 02:28:56 zion /netbsd: SXFRCTL0[0x80] DFCNTRL[0x0] DFSTATUS[0x89]
Oct 18 02:28:56 zion /netbsd: STACK: 0x0 0x16b 0x180 0x3
Oct 18 02:28:56 zion /netbsd: SCB count = 16
Oct 18 02:28:56 zion /netbsd: Kernel NEXTQSCB = 14
Oct 18 02:28:56 zion /netbsd: Card NEXTQSCB = 14
Oct 18 02:28:56 zion /netbsd: QINFIFO entries:
Oct 18 02:28:56 zion /netbsd: Waiting Queue entries:
Oct 18 02:28:56 zion /netbsd: Disconnected Queue entries: 0:15
Oct 18 02:28:56 zion /netbsd: QOUTFIFO entries:
Oct 18 02:28:56 zion /netbsd: Sequencer Free SCB List: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31
Oct 18 02:28:56 zion /netbsd: Sequencer SCB Info:
Oct 18 02:28:56 zion /netbsd: 0 SCB_CONTROL[0x44]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0x67] SCB_LUN[0x0] SCB_TAG[0xf]
Oct 18 02:28:56 zion /netbsd: 1 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 2 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 3 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 4 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 5 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 6 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 7 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 8 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 9 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 10 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 11 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 12 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 13 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 14 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 15 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 16 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 17 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 18 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 19 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 20 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 21 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 22 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 23 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 24 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 25 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 26 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 27 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 28 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 29 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 30 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 31 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: Pending list:
Oct 18 02:28:56 zion /netbsd: 15 SCB_CONTROL[0x40]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0x67] SCB_LUN[0x0]
Oct 18 02:28:56 zion /netbsd: Kernel Free SCB list: 13 12 11 10 9 8 7 6 5 4 3 2 1 0
Oct 18 02:28:56 zion /netbsd: Untagged Q(6): 15
Oct 18 02:28:56 zion /netbsd:
Oct 18 02:28:56 zion /netbsd: <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
Oct 18 02:28:56 zion /netbsd: ahc0:Queuing a BDR SCB
Oct 18 02:28:56 zion /netbsd: ahc0:Bus Device Reset Message Sent
Oct 18 02:28:56 zion /netbsd: st0(ahc0:0:6:0): ahc0: no longer in timeout, status = 0
Oct 18 02:28:56 zion /netbsd: ahc0:A:6: no active SCB for reconnecting target - issuing BUS DEVICE RESET
Oct 18 02:28:56 zion /netbsd: SAVED_SCSIID == 0x67, SAVED_LUN == 0x0, ARG_1 == 0xf ACCUM = 0x0
Oct 18 02:28:56 zion /netbsd: SEQ_FLAGS == 0xc0, SCBPTR == 0x0, BTT == 0xf, SINDEX == 0x31
Oct 18 02:28:56 zion /netbsd: SCSIID == 0x0, SCB_SCSIID == 0x67, SCB_LUN == 0x0, SCB_TAG == 0xff, SCB_CONTROL == 0x0
Oct 18 02:28:56 zion /netbsd: SCSIBUSL == 0x80, SCSISIGI == 0xe6
Oct 18 02:28:56 zion /netbsd: SXFRCTL0 == 0x88
Oct 18 02:28:56 zion /netbsd: SEQCTL == 0x10
Oct 18 02:28:56 zion /netbsd: >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
Oct 18 02:28:56 zion /netbsd: ahc0: Dumping Card State in Message-in phase, at SEQADDR 0x1ae
Oct 18 02:28:56 zion /netbsd: Card was paused
Oct 18 02:28:56 zion /netbsd: ACCUM = 0x0, SINDEX = 0x31, DINDEX = 0xe4, ARG_2 = 0x3
Oct 18 02:28:56 zion /netbsd: HCNT = 0x0 SCBPTR = 0x0
Oct 18 02:28:56 zion /netbsd: SCSIPHASE[0x8] SCSISIGI[0xe6] ERROR[0x0] SCSIBUSL[0x80]
Oct 18 02:28:56 zion /netbsd: LASTPHASE[0xe0] SCSISEQ[0x12] SBLKCTL[0xa] SCSIRATE[0x0]
Oct 18 02:28:56 zion /netbsd: SEQCTL[0x10] SEQ_FLAGS[0xc0] SSTAT0[0x2] SSTAT1[0x11]
Oct 18 02:28:56 zion /netbsd: SSTAT2[0x0] SSTAT3[0x0] SIMODE0[0x8] SIMODE1[0xac]
Oct 18 02:28:56 zion /netbsd: SXFRCTL0[0x88] DFCNTRL[0x0] DFSTATUS[0x89]
Oct 18 02:28:56 zion /netbsd: STACK: 0x147 0x0 0x16b 0x180
Oct 18 02:28:56 zion /netbsd: SCB count = 16
Oct 18 02:28:56 zion /netbsd: Kernel NEXTQSCB = 14
Oct 18 02:28:56 zion /netbsd: Card NEXTQSCB = 14
Oct 18 02:28:56 zion /netbsd: QINFIFO entries:
Oct 18 02:28:56 zion /netbsd: Waiting Queue entries:
Oct 18 02:28:56 zion /netbsd: Disconnected Queue entries:
Oct 18 02:28:56 zion /netbsd: QOUTFIFO entries:
Oct 18 02:28:56 zion /netbsd: Sequencer Free SCB List: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31
Oct 18 02:28:56 zion /netbsd: Sequencer SCB Info:
Oct 18 02:28:56 zion /netbsd: 0 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0x67] SCB_LUN[0x0] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 1 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 2 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 3 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 4 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 5 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 6 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 7 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 8 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 9 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 10 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 11 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 12 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 13 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 14 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 15 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 16 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 17 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 18 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 19 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 20 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 21 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 22 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 23 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 24 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 25 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 26 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 27 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 28 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 29 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 30 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: 31 SCB_CONTROL[0x0]
Oct 18 02:28:56 zion /netbsd: SCB_SCSIID[0xff] SCB_LUN[0xff] SCB_TAG[0xff]
Oct 18 02:28:56 zion /netbsd: Pending list:
Oct 18 02:28:56 zion /netbsd: Kernel Free SCB list: 15 13 12 11 10 9 8 7 6 5 4 3 2 1 0
Oct 18 02:28:56 zion /netbsd:
Oct 18 02:28:56 zion /netbsd: <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
State-Changed-From-To: open->closed
State-Changed-By: kardel@NetBSD.org
State-Changed-When: Tue, 05 Feb 2019 14:22:05 +0000
State-Changed-Why:
HW does not exist any more
>Unformatted:
(Contact us)
$NetBSD: query-full-pr,v 1.39 2013/11/01 18:47:49 spz Exp $
$NetBSD: gnats_config.sh,v 1.9 2014/08/02 14:16:04 spz Exp $
Copyright © 1994-2007
The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.