NetBSD Problem Report #54840
From gson@gson.org Tue Jan 7 09:49:04 2020
Return-Path: <gson@gson.org>
Received: from mail.netbsd.org (mail.netbsd.org [199.233.217.200])
(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
(Client CN "mail.NetBSD.org", Issuer "mail.NetBSD.org CA" (not verified))
by mollari.NetBSD.org (Postfix) with ESMTPS id 25BB87A154
for <gnats-bugs@gnats.NetBSD.org>; Tue, 7 Jan 2020 09:49:04 +0000 (UTC)
Message-Id: <20200107094859.69FCD253F3E@guava.gson.org>
Date: Tue, 7 Jan 2020 11:48:59 +0200 (EET)
From: gson@gson.org (Andreas Gustafsson)
Reply-To: gson@gson.org (Andreas Gustafsson)
To: gnats-bugs@NetBSD.org
Subject: Some ATF test runs leave h_ps_strings2 process running
X-Send-Pr-Version: 3.95
>Number: 54840
>Category: kern
>Synopsis: Some ATF test runs leave h_ps_strings2 process running
>Confidential: no
>Severity: serious
>Priority: medium
>Responsible: kern-bug-people
>State: open
>Class: sw-bug
>Submitter-Id: net
>Arrival-Date: Tue Jan 07 09:50:00 +0000 2020
>Last-Modified: Tue Jan 07 11:15:01 +0000 2020
>Originator: Andreas Gustafsson
>Release: NetBSD-current
>Organization:
>Environment:
System: NetBSD
Architecture: i386
Machine: i386
>Description:
Several recent ATF test runs on my i386 testbed have left an
h_ps_strings2 process running after the tests have completed.
The first occurrence was using sources from 2019.12.24.23.41.05,
and in 2020 it has happened in 5 runs out of 11 so far.
This is on real 8-core hardware. I have not seen it happen in qemu.
The console log output from the first occurrence is here:
http://www.gson.org/netbsd/bugs/build/i386-baremetal/2019/2019.12.24.23.41.05/test.log
Search for "arg0000" to find the process in question.
>How-To-Repeat:
>Fix:
>Audit-Trail:
From: Martin Husemann <martin@duskware.de>
To: gnats-bugs@netbsd.org
Cc:
Subject: Re: kern/54840: Some ATF test runs leave h_ps_strings2 process running
Date: Tue, 7 Jan 2020 10:51:57 +0100
Duplicate of 54819? [modulo the typo there]
Martin
From: Andreas Gustafsson <gson@gson.org>
To: Martin Husemann <martin@duskware.de>
Cc: gnats-bugs@netbsd.org
Subject: Re: kern/54840: Some ATF test runs leave h_ps_strings2 process running
Date: Tue, 7 Jan 2020 12:42:22 +0200
Martin Husemann wrote:
> Duplicate of 54819? [modulo the typo there]
Probably related, yes.
Strangely, I can't reproduce the problem by running just the
t_ps_strings test program manually, even if I run it 1000 times
in a row:
# cd /usr/tests/kernel
# seq 1000 | while read i && atf-run t_ps_strings; do true; done >log
# ps -glaxw|grep h_
0 11323 531 0 85 0 3776 388 pipe_rd O+ tty00 0:00.00 grep h_
#
This was run booted from the same disk image that showed a leftover
h_ps_strings2 process after a full ATF run here:
http://www.gson.org/netbsd/bugs/build/i386-baremetal/2020/2020.01.06.21.04.58/test.log
--
Andreas Gustafsson, gson@gson.org
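
A minimal sketch of a variant of the reproduction loop in the message above, which checks for a leftover process after every iteration instead of once at the end. The helper name check_leftover is hypothetical and not part of ATF, and the sketch assumes pgrep(1) is available on the test machine. Unlike the original loop, it does not stop when the command itself fails; it stops only when a process with the given name is still present.

```shell
#!/bin/sh
# Hypothetical helper (not part of ATF): run a command repeatedly and stop
# at the first iteration that leaves a process with the given name behind.
# Usage: check_leftover COUNT PROCESS_NAME COMMAND [ARGS...]
check_leftover() {
    count=$1
    name=$2
    shift 2
    i=1
    while [ "$i" -le "$count" ]; do
        # Run the command under test; its output and exit status are ignored.
        "$@" >/dev/null 2>&1
        # pgrep -x matches the process name exactly and exits 0 on a match.
        if pgrep -x "$name" >/dev/null 2>&1; then
            echo "leftover $name after iteration $i"
            return 1
        fi
        i=$((i + 1))
    done
    echo "no leftover $name after $count iterations"
    return 0
}
```

Assuming the same setup as in the message above, it could be invoked as:
cd /usr/tests/kernel && check_leftover 1000 h_ps_strings2 atf-run t_ps_strings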
From: Martin Husemann <martin@duskware.de>
To: Andreas Gustafsson <gson@gson.org>
Cc: gnats-bugs@netbsd.org
Subject: Re: kern/54840: Some ATF test runs leave h_ps_strings2 process running
Date: Tue, 7 Jan 2020 12:10:13 +0100
It also does not happen every time for me (sounds like a race, and could
also be a libpthread issue).
Martin
Copyright © 1994-2020
The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.