NetBSD Problem Report #27135

Received: (qmail 22934 invoked by uid 605); 4 Oct 2004 06:36:26 -0000
Message-Id: <20041004063620.2097154ED2@lucette.Ivy.NET>
Date: Mon,  4 Oct 2004 06:36:20 +0000 (GMT)
From: carton@Ivy.NET
Sender: gnats-bugs-owner@NetBSD.org
Reply-To: carton@Ivy.NET
To: gnats-bugs@gnats.NetBSD.org
Subject: sparc64 doesn't make kernel core dumps
X-Send-Pr-Version: 3.95

>Number:         27135
>Category:       kern
>Synopsis:       siop driver doesn't make kernel core dumps
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    port-sparc64-maintainer
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Mon Oct 04 06:37:01 +0000 2004
>Closed-Date:    
>Last-Modified:  Thu Feb 03 16:53:21 +0000 2005
>Originator:     Miles Nordin
>Release:        NetBSD 2.0_BETA 2004-08-15
>Organization:
Ivy Ministries

>Environment:


System: NetBSD lucette 2.0_BETA NetBSD 2.0_BETA (LUCETTE-$Revision: 1.1 $) #3: Sun Oct 3 23:07:46 EDT 2004 carton@castrovalva:/scratch/src/sys/arch/sparc64/compile/LUCETTE sparc64
Architecture: sparc64
Machine: sparc64
>Description:
trap type 0x34: pc=123618c npc=1236190 pstate=44820006<PRIV,IE>
kernel trap 34: mem address not aligned
Stopped at      netbsd:m_tag_find+0x14: lduh            [%o0 + 0x8], %g1
db> bt
ip_output(5dc, 3142830, 1895e00, c6c1800, 240, 0) at netbsd:ip_output+0x930
ip_forward(3258960, 0, 23, ce, 15, 15) at netbsd:ip_forward+0x2a4
ip_input(3258960, 3258960, 0, 800, 1, 1) at netbsd:ip_input+0x4f0
ipintr(1, 9080010, 80000000, 7fff0000, 47, 1750) at netbsd:ipintr+0x10c
softnet(4, 0, e0017ed0, 5, 130ffe4, 21d800) at netbsd:softnet+0x98
sparc64_ipi_flush_all(0, 0, 137c79c, 0, ffffffffffffffff, 0) at netbsd:sparc64_i
pi_flush_all+0x23c
db> reboot 0x104
Frame pointer is at 0xe00164c1
Call traceback:
1310df8(1, 2d7d900, d, f, 0, d, e0016581) fp = e0016581
11d3bdc(104, 0, 0, e0017e7c, 8, e00170f8, e0016641) fp = e0016641
11d3604(123618c, 0, ffffffffffffffff, e0016fe0, 0, 4, e0016711) fp = e0016711
11d32ec(1812cb8, 0, 2b, 8, 0, c1f83fe0, e0016871) fp = e0016871
11d70c8(1236190, 0, 0, 0, 0, 0, e0016951) fp = e0016951
131c228(0, 0, 0, 0, 6, 1000000, e0016a21) fp = e0016a21
1319258(34, e0017500, e0017390, 59, 3082078, 1, e0016ae1) fp = e0016ae1
1008b98(e0017500, 34, 123618c, 44820006, e00176d8, 1000000, e0016c51) fp = e0016c51
108f1a8(67656d310003ba0f, 3, 0, 2, e00175d0, 1000000, e0016e31) fp = e0016e31
102f00c(3303c70, 0, fffffffffffffffc, 8a05, ffffffffffff8a06, ffff, e0016ef1) fp = e0016ef1
102aab0(5dc, 3142830, 1895e00, c6c1800, 240, 0, e0017001) fp = e0017001
10289e0(3258960, 0, 23, ce, 15, 15, e00173d1) fp = e00173d1
10284d8(3258960, 3258960, 0, 800, 1, 1, e00174a1) fp = e00174a1
131007c(1, 9080010, 80000000, 7fff0000, 47, 1750, e0017561) fp = e0017561
100906c(4, 0, e0017ed0, 5, 130ffe4, 21d800, e0017621) fp = e0017621
0(0, 0, 137c79c, 0, ffffffffffffffff, 0, dc530d1) fp = dc530d1

dumping to dev 7,9 offset 1130493
dump esiop0: unable to load cmd DMA map: -1
starting dump, blkno 1130496
device not ready
rebooting

Res
LOM event: +23d+7h36m25s host reset
etting ... 

[...]
$ ls -ld /dev/sd1b <-- this is the 1.5GB configired as dump device
brw-r-----  1 root  operator        7, 9 Aug 26 04:14 /dev/sd1b

>How-To-Repeat:
make 1.5GB /dev/sd1b for a machine with 1GB physical memory.

put in fstab:
/dev/sd1b swap swap sw 0 0
/dev/sd1b swap swap dp 0 0

wait for kernel to crash

reboot 0x104

kernel doesn't dump core.

>Fix:
	 how to correct or work around the problem, if known (multiple lines)>
>Release-Note:
>Audit-Trail:

From: Martin Husemann <martin@duskware.de>
To: carton@Ivy.NET
Cc: gnats-bugs@gnats.NetBSD.org
Subject: Re: port-sparc64/27135: sparc64 doesn't make kernel core dumps
Date: Mon, 4 Oct 2004 08:41:43 +0200

 On Mon, Oct 04, 2004 at 06:36:20AM +0000, carton@Ivy.NET wrote:
 > dumping to dev 7,9 offset 1130493
 > dump esiop0: unable to load cmd DMA map: -1

 There is not much we can do about this IMHO - if the kernel subsystems are in
 a broken state and resource allocation for the disk driver fails, we can not
 dump.

 Martin

From: Miles Nordin <carton@Ivy.NET>
To: Martin Husemann <martin@duskware.de>
Cc: gnats-bugs@gnats.NetBSD.org
Subject: Re: port-sparc64/27135: sparc64 doesn't make kernel core dumps
Date: Mon, 04 Oct 2004 12:59:44 -0400

 >>>>> "mh" == Martin Husemann <martin@duskware.de> writes:

     mh> if the kernel subsystems are in a broken state and resource
     mh> allocation for the disk driver fails, we can not dump.

 as I said in the submission, alpha can dump where sparc64 cannot.
 But, just so there will be no doubt, I hit BREAK while the sparc64
 kernel was running normally, and it still failed to coredump with the
 same message:

 $ sync
 $ ~#kdb breakpoint at 131c484
 Stopped at      netbsd:cpu_Debugger+0x4:        nop
 db> reboot 0x10c
 Frame pointer is at 0xe00167a1
 Call traceback:
 1310df8(1, 2d7d900, d, f, 0, d, e0016861) fp = e0016861
 11d3b9c(10c, 0, 0, 0, 0, e00173d8, e0016921) fp = e0016921
 11d35c4(0, 0, 0, e00172c0, 0, 4, e00169f1) fp = e00169f1
 11d32ac(1812cb8, 0, 0, 0, 0, b, e0016b51) fp = e0016b51
 11d7088(131c48c, 0, 0, 0, 0, 0, e0016c31) fp = e0016c31
 131c228(0, 0, 0, 0, 6, 1000000, e0016d01) fp = e0016d01
 13193ac(101, e00177e0, 137c79c, 0, 0, 3277600, e0016dc1) fp = e0016dc1
 1008b98(e00177e0, 101, 131c484, 820006, 6, 1892800, e0016f31) fp = e0016f31
 1126a68(327b018, 2, 6ad, 1d, 4, c6, e0017111) fp = e0017111
 13100f4(2ea6200, 3, ffffffffffffffff, 30481e0, 8, 0, e0017201) fp = e0017201
 100906c(2d69d00, 1, e0017b70, 36, 13100e4, c9, e00172c1) fp = e00172c1
 10b3ffc(140a400, 1845800, 3096740, 0, 2, 18458a8, e00174a1) fp = e00174a1
 12178e4(3e5a1bafa3d81, 0, 10b3f98, 0, 1a3185c500, 1, e0017561) fp = e0017561
 100906c(e0017ed0, 0, e0017ed0, 0, 12176a4, 0, e0017621) fp = e0017621
 0(0, 0, 137c79c, 0, ffffffffffffffff, 0, e3c30d1) fp = e3c30d1

 dumping to dev 7,9 offset 1130493
 dump esiop0: unable to load cmd DMA map: -1
 starting dump, blkno 1130496
 device not ready
 halted

 Program terminated
 ok 

From: Martin Husemann <martin@duskware.de>
To: Miles Nordin <carton@Ivy.NET>
Cc: gnats-bugs@gnats.NetBSD.org
Subject: Re: port-sparc64/27135: sparc64 doesn't make kernel core dumps
Date: Mon, 4 Oct 2004 22:59:20 +0200

 On Mon, Oct 04, 2004 at 12:59:44PM -0400, Miles Nordin wrote:
 > as I said in the submission, alpha can dump where sparc64 cannot.

 I bet your alpha does not use esiop ;-)

 Martin

From: Miles Nordin <carton@Ivy.NET>
To: Martin Husemann <martin@duskware.de>
Cc: gnats-bugs@gnats.NetBSD.org
Subject: Re: port-sparc64/27135: sparc64 doesn't make kernel core dumps
Date: Mon, 04 Oct 2004 18:54:56 -0400

 >>>>> "mh" == Martin Husemann <martin@duskware.de> writes:

     mh> I bet your alpha does not use esiop ;-)

 right, it is isp.  so maybe this pr should be category kern, not
 port-sparc64.
>Unformatted:

NetBSD Home
NetBSD PR Database Search

(Contact us) $NetBSD: query-full-pr,v 1.39 2013/11/01 18:47:49 spz Exp $
$NetBSD: gnats_config.sh,v 1.8 2006/05/07 09:23:38 tsutsui Exp $
Copyright © 1994-2007 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.