NetBSD Problem Report #56463

From martin@duskware.de  Sun Oct 24 10:25:27 2021
Return-Path: <martin@duskware.de>
Received: from mail.netbsd.org (mail.netbsd.org [199.233.217.200])
	(using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits))
	(Client CN "mail.NetBSD.org", Issuer "mail.NetBSD.org CA" (not verified))
	by mollari.NetBSD.org (Postfix) with ESMTPS id B30281A9239
	for <gnats-bugs@gnats.NetBSD.org>; Sun, 24 Oct 2021 10:25:27 +0000 (UTC)
From: martin@NetBSD.org
Reply-To: martin@NetBSD.org
To: gnats-bugs@NetBSD.org
Subject: armv4 kernel/t_lock tests locks up machine
X-Send-Pr-Version: 3.95

>Number:         56463
>Category:       port-arm
>Synopsis:       armv4 kernel/t_lock tests locks up machine
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    port-arm-maintainer
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Sun Oct 24 10:30:01 +0000 2021
>Closed-Date:    
>Last-Modified:  Fri May 31 18:08:38 +0000 2024
>Originator:     Martin Husemann
>Release:        NetBSD 9.99.92
>Organization:
The NetBSD Foundation, Inc.
>Environment:
System: NetBSD night-rest.duskware.de 9.99.92 NetBSD 9.99.92 (GENERIC) #11: Sat Oct 23 12:21:03 CEST 2021 martin@martins.aprisoft.de:/usr/src/sys/arch/shark/compile/GENERIC shark
Architecture: earmv4
Machine: shark
>Description:

Running the kernel/t_lock test on armv4  immediately locks up the
machine, can't ^C the test, console is dead. Ddb works, but is not very
helpfull:

Stopped in pid 14205.14205 (t_lock) at  netbsd:cpu_Debugger+0x4:        mov     
pc, r14
db> ps
PID    LID S CPU     FLAGS       STRUCT LWP *               NAME WAIT
14205>14205 7   0         0           f1c09500             t_lock
678    678 3   0       180           f1715880               tail kqueue
1556  1556 3   0       180           f17b08c0               tcsh pause
1205  1205 3   0       180           f17155c0               sshd select
1059  1059 3   0       180           f13bccc0               sshd poll
1047  1047 3   0       180           f13bca00                tee pipe_rd
936    936 3   0       180           f1495a80         atf-report pipe_rd
1131  1131 3   0       180           f1424a40                tee pipe_rd
1133  1133 3   0       180           f1424200            atf-run poll
956    956 3   0       180           f13bc740                 sh wait
234    234 3   0       180           f13bc480                 sh wait
955    955 3   0       180           f13a6440              getty ttyraw
915    915 3   0       180           f17b0080               cron nanoslp
849    849 3   0       180           f1715b40              inetd kqueue
830    830 3   0       180           f1715300               sshd select
750    750 3   0       180           f1715040               ntpd pause
489    489 3   0       180           f14957c0            rpcbind poll
374    374 3   0       180           f1495500            syslogd kqueue
313    313 3   0       180           f1424d00             dhcpcd poll
277    277 3   0       180           f1495240             dhcpcd poll
280    280 3   0       180           f1424780             dhcpcd poll
1        1 3   0       180           f13a6180               init wait
0     17775 5   0       200           f1b39c00           (zombie)
0       82 3   0       200           f13a6700            physiod physiod
0       96 3   0       200           f13bc1c0          pooldrain pooldrain
0       56 3   0       200           f13a6c80            ioflush syncer
0       55 3   0       200           f13a69c0           pgdaemon pgdaemon
0       51 3   0       200           f1303c40            rt_free rt_free
0       50 3   0       200           f1303980              unpgc unpgc
0       49 3   0       200           f1302100    icmp6_wqinput/0 icmp6_wqinput
0       31 3   0       200           f13023c0          nd6_timer nd6_timer
0       30 3   0       200           f1302680    carp6_wqinput/0 carp6_wqinput
0       29 3   0       200           f1302940     carp_wqinput/0 carp_wqinput
0       28 3   0       200           f1302c00     icmp_wqinput/0 icmp_wqinput
0       27 3   0       200           f1303140           rt_timer rt_timer
0       26 3   0       200           f1303400        vmem_rehash vmem_rehash
0       25 3   0       200           f13036c0          entbutler entropy
0       16 3   0       240           f12dcbc0            atabus0 atath
0       15 3   0       200           f12dc900               iic0 iicintr
0       14 3   0       200           f12dc640         pmfsuspend pmfsuspend
0       13 3   0       200           f12dc380           pmfevent pmfevent
0       12 3   0       200           f12dc0c0         sopendfree sopendfr
0       11 3   0       200           f120fb80            iflnkst iflnkst
0       10 3   0       200           f120f8c0           nfssilly nfssilly
0        9 3   0       240           f120f600             vdrain vdrain
0        8 3   0       240           f120f340          modunload mod_unld
0        7 3   0       200           f120f080            xcall/0 xcall
0        6 1   0       200           f120db40          softser/0
0        5 1   0     40200           f120d880          softclk/0
0        4 1   0     40200           f120d5c0          softbio/0
0        3 1   0     40200           f120d300          softnet/0
0        2 1   0       201           f120d040             idle/0
0        0 3   0       200           f039e1c0            swapper uvm
db> trace/a f1c09500
trace: pid 14205 lid 14205 at 0xf6eaaea4
0xf6eaaea4: f6eaaeb0
Bad frame pointer: 0xf023e2e8


>How-To-Repeat:

cd /usr/tests/kernel && atf-run t_lock

>Fix:
n/a

>Release-Note:

>Audit-Trail:
From: Rin Okuyama <rokuyama.rk@gmail.com>
To: gnats-bugs@netbsd.org, Martin Husemann <martin@NetBSD.org>
Cc: 
Subject: Re: port-arm/56463: armv4 kernel/t_lock tests locks up machine
Date: Tue, 26 Oct 2021 11:49:03 +0900

 For Jornada 728 (hpcarm/SA-1110), kernel/t_lock successfully passes:

 ----
 j728# sysctl hw.model
 hw.model = SA-1110 step B-5 (SA-1 V4 core)
 j728# uname -a
 NetBSD j728 9.99.92 NetBSD 9.99.92 (J728) #55: Mon Oct 25 08:44:45 JST 2021  rin@latipes:/sys/arch/hpcarm/compile/J728 hpcarm
 j728# cd /usr/tests/kernel && atf-run t_lock | atf-report
 Tests root: /usr/tests/kernel

 t_lock (1/1): 1 test cases
      lock: [1.256586s] Passed.
 [1.286225s]

 Summary for 1 test programs:
      1 passed test cases.
      0 failed test cases.
      0 expected failed test cases.
      0 skipped test cases.
 ----

 Full dmesg is provided here:

 https://dmesgd.nycbug.org/index.cgi?do=view&id=6334

 Unlike modern evbarm targets, MD parts for classic ARM machines differ
 significantly between ports/machines/CPUs. I guess that this problem is
 specific to shark or SA-110 rather than armv4.

 Thanks,
 rin

State-Changed-From-To: open->feedback
State-Changed-By: skrll@NetBSD.org
State-Changed-When: Fri, 31 May 2024 07:25:38 +0000
State-Changed-Why:
Still a problem?


State-Changed-From-To: feedback->open
State-Changed-By: martin@NetBSD.org
State-Changed-When: Fri, 31 May 2024 18:08:38 +0000
State-Changed-Why:
Still hangs hard on shark


>Unformatted:

NetBSD Home
NetBSD PR Database Search

(Contact us) $NetBSD: query-full-pr,v 1.47 2022/09/11 19:34:41 kim Exp $
$NetBSD: gnats_config.sh,v 1.9 2014/08/02 14:16:04 spz Exp $
Copyright © 1994-2024 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.