NetBSD Problem Report #56463
From martin@duskware.de Sun Oct 24 10:25:27 2021
Return-Path: <martin@duskware.de>
Received: from mail.netbsd.org (mail.netbsd.org [199.233.217.200])
(using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits))
(Client CN "mail.NetBSD.org", Issuer "mail.NetBSD.org CA" (not verified))
by mollari.NetBSD.org (Postfix) with ESMTPS id B30281A9239
for <gnats-bugs@gnats.NetBSD.org>; Sun, 24 Oct 2021 10:25:27 +0000 (UTC)
From: martin@NetBSD.org
Reply-To: martin@NetBSD.org
To: gnats-bugs@NetBSD.org
Subject: armv4 kernel/t_lock tests locks up machine
X-Send-Pr-Version: 3.95
>Number: 56463
>Category: port-arm
>Synopsis: armv4 kernel/t_lock tests locks up machine
>Confidential: no
>Severity: serious
>Priority: medium
>Responsible: port-arm-maintainer
>State: open
>Class: sw-bug
>Submitter-Id: net
>Arrival-Date: Sun Oct 24 10:30:01 +0000 2021
>Closed-Date:
>Last-Modified: Fri May 31 18:08:38 +0000 2024
>Originator: Martin Husemann
>Release: NetBSD 9.99.92
>Organization:
The NetBSD Foundation, Inc.
>Environment:
System: NetBSD night-rest.duskware.de 9.99.92 NetBSD 9.99.92 (GENERIC) #11: Sat Oct 23 12:21:03 CEST 2021 martin@martins.aprisoft.de:/usr/src/sys/arch/shark/compile/GENERIC shark
Architecture: earmv4
Machine: shark
>Description:
Running the kernel/t_lock test on armv4 immediately locks up the
machine: the test can't be interrupted with ^C and the console is dead.
Ddb works, but is not very helpful:
Stopped in pid 14205.14205 (t_lock) at netbsd:cpu_Debugger+0x4: mov pc, r14
db> ps
PID LID S CPU FLAGS STRUCT LWP * NAME WAIT
14205>14205 7 0 0 f1c09500 t_lock
678 678 3 0 180 f1715880 tail kqueue
1556 1556 3 0 180 f17b08c0 tcsh pause
1205 1205 3 0 180 f17155c0 sshd select
1059 1059 3 0 180 f13bccc0 sshd poll
1047 1047 3 0 180 f13bca00 tee pipe_rd
936 936 3 0 180 f1495a80 atf-report pipe_rd
1131 1131 3 0 180 f1424a40 tee pipe_rd
1133 1133 3 0 180 f1424200 atf-run poll
956 956 3 0 180 f13bc740 sh wait
234 234 3 0 180 f13bc480 sh wait
955 955 3 0 180 f13a6440 getty ttyraw
915 915 3 0 180 f17b0080 cron nanoslp
849 849 3 0 180 f1715b40 inetd kqueue
830 830 3 0 180 f1715300 sshd select
750 750 3 0 180 f1715040 ntpd pause
489 489 3 0 180 f14957c0 rpcbind poll
374 374 3 0 180 f1495500 syslogd kqueue
313 313 3 0 180 f1424d00 dhcpcd poll
277 277 3 0 180 f1495240 dhcpcd poll
280 280 3 0 180 f1424780 dhcpcd poll
1 1 3 0 180 f13a6180 init wait
0 17775 5 0 200 f1b39c00 (zombie)
0 82 3 0 200 f13a6700 physiod physiod
0 96 3 0 200 f13bc1c0 pooldrain pooldrain
0 56 3 0 200 f13a6c80 ioflush syncer
0 55 3 0 200 f13a69c0 pgdaemon pgdaemon
0 51 3 0 200 f1303c40 rt_free rt_free
0 50 3 0 200 f1303980 unpgc unpgc
0 49 3 0 200 f1302100 icmp6_wqinput/0 icmp6_wqinput
0 31 3 0 200 f13023c0 nd6_timer nd6_timer
0 30 3 0 200 f1302680 carp6_wqinput/0 carp6_wqinput
0 29 3 0 200 f1302940 carp_wqinput/0 carp_wqinput
0 28 3 0 200 f1302c00 icmp_wqinput/0 icmp_wqinput
0 27 3 0 200 f1303140 rt_timer rt_timer
0 26 3 0 200 f1303400 vmem_rehash vmem_rehash
0 25 3 0 200 f13036c0 entbutler entropy
0 16 3 0 240 f12dcbc0 atabus0 atath
0 15 3 0 200 f12dc900 iic0 iicintr
0 14 3 0 200 f12dc640 pmfsuspend pmfsuspend
0 13 3 0 200 f12dc380 pmfevent pmfevent
0 12 3 0 200 f12dc0c0 sopendfree sopendfr
0 11 3 0 200 f120fb80 iflnkst iflnkst
0 10 3 0 200 f120f8c0 nfssilly nfssilly
0 9 3 0 240 f120f600 vdrain vdrain
0 8 3 0 240 f120f340 modunload mod_unld
0 7 3 0 200 f120f080 xcall/0 xcall
0 6 1 0 200 f120db40 softser/0
0 5 1 0 40200 f120d880 softclk/0
0 4 1 0 40200 f120d5c0 softbio/0
0 3 1 0 40200 f120d300 softnet/0
0 2 1 0 201 f120d040 idle/0
0 0 3 0 200 f039e1c0 swapper uvm
db> trace/a f1c09500
trace: pid 14205 lid 14205 at 0xf6eaaea4
0xf6eaaea4: f6eaaeb0
Bad frame pointer: 0xf023e2e8
>How-To-Repeat:
cd /usr/tests/kernel && atf-run t_lock
>Fix:
n/a
>Release-Note:
>Audit-Trail:
From: Rin Okuyama <rokuyama.rk@gmail.com>
To: gnats-bugs@netbsd.org, Martin Husemann <martin@NetBSD.org>
Cc:
Subject: Re: port-arm/56463: armv4 kernel/t_lock tests locks up machine
Date: Tue, 26 Oct 2021 11:49:03 +0900
On Jornada 728 (hpcarm/SA-1110), kernel/t_lock passes:
----
j728# sysctl hw.model
hw.model = SA-1110 step B-5 (SA-1 V4 core)
j728# uname -a
NetBSD j728 9.99.92 NetBSD 9.99.92 (J728) #55: Mon Oct 25 08:44:45 JST 2021 rin@latipes:/sys/arch/hpcarm/compile/J728 hpcarm
j728# cd /usr/tests/kernel && atf-run t_lock | atf-report
Tests root: /usr/tests/kernel
t_lock (1/1): 1 test cases
lock: [1.256586s] Passed.
[1.286225s]
Summary for 1 test programs:
1 passed test cases.
0 failed test cases.
0 expected failed test cases.
0 skipped test cases.
----
Full dmesg is provided here:
https://dmesgd.nycbug.org/index.cgi?do=view&id=6334
Unlike modern evbarm targets, the machine-dependent (MD) code for classic
ARM machines differs significantly between ports, machines, and CPUs. I
suspect that this problem is specific to shark or SA-110 rather than armv4.
Thanks,
rin
State-Changed-From-To: open->feedback
State-Changed-By: skrll@NetBSD.org
State-Changed-When: Fri, 31 May 2024 07:25:38 +0000
State-Changed-Why:
Still a problem?
State-Changed-From-To: feedback->open
State-Changed-By: martin@NetBSD.org
State-Changed-When: Fri, 31 May 2024 18:08:38 +0000
State-Changed-Why:
Still hangs hard on shark
>Unformatted: