NetBSD Problem Report #54840

From gson@gson.org  Tue Jan  7 09:49:04 2020
Return-Path: <gson@gson.org>
Received: from mail.netbsd.org (mail.netbsd.org [199.233.217.200])
	(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
	(Client CN "mail.NetBSD.org", Issuer "mail.NetBSD.org CA" (not verified))
	by mollari.NetBSD.org (Postfix) with ESMTPS id 25BB87A154
	for <gnats-bugs@gnats.NetBSD.org>; Tue,  7 Jan 2020 09:49:04 +0000 (UTC)
Message-Id: <20200107094859.69FCD253F3E@guava.gson.org>
Date: Tue,  7 Jan 2020 11:48:59 +0200 (EET)
From: gson@gson.org (Andreas Gustafsson)
Reply-To: gson@gson.org (Andreas Gustafsson)
To: gnats-bugs@NetBSD.org
Subject: Some ATF test runs leave h_ps_strings2 process running
X-Send-Pr-Version: 3.95

>Number:         54840
>Category:       kern
>Synopsis:       Some ATF test runs leave h_ps_strings2 process running
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    kern-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Tue Jan 07 09:50:00 +0000 2020
>Last-Modified:  Tue Jan 07 11:15:01 +0000 2020
>Originator:     Andreas Gustafsson
>Release:        NetBSD-current
>Organization:
>Environment:
System: NetBSD
Architecture: i386
Machine: i386
>Description:

Several recent ATF test runs on my i386 testbed have left an
h_ps_strings2 process running after the tests have completed.

The first occurrence was using sources from 2019.12.24.23.41.05,
and in 2020 it has happened in 5 runs out of 11 so far.

This is on real 8-core hardware.  I have not seen it happen in qemu.

The console log output from the first occurrence is here:

  http://www.gson.org/netbsd/bugs/build/i386-baremetal/2019/2019.12.24.23.41.05/test.log

Search for "arg0000" to find the process in question.

>How-To-Repeat:

>Fix:

>Audit-Trail:
From: Martin Husemann <martin@duskware.de>
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: kern/54840: Some ATF test runs leave h_ps_strings2 process
 running
Date: Tue, 7 Jan 2020 10:51:57 +0100

 Duplicate of 54819? [modulo the typo there]

 Martin

From: Andreas Gustafsson <gson@gson.org>
To: Martin Husemann <martin@duskware.de>
Cc: gnats-bugs@netbsd.org
Subject: Re: kern/54840: Some ATF test runs leave h_ps_strings2 process running
Date: Tue, 7 Jan 2020 12:42:22 +0200

 Martin Husemann wrote:
 >  Duplicate of 54819? [modulo the typo there]

 Probably related, yes.

 Strangely, I can't reproduce the problem by running just the
 t_ps_strings test program manually, even if I run it 1000 times
 in a row:

   # cd /usr/tests/kernel
   # seq 1000 | while read i && atf-run t_ps_strings; do true; done >log
   # ps -glaxw|grep h_
     0 11323  531    0  85  0  3776   388 pipe_rd O+   tty00 0:00.00 grep h_ 
   #

 This was run booted from the same disk image that showed a leftover
 h_ps_strings2 process after a full ATF run here:

   http://www.gson.org/netbsd/bugs/build/i386-baremetal/2020/2020.01.06.21.04.58/test.log
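
The manual loop above checks for a leftover helper only once, after all
1000 iterations. A variant that checks after every iteration could show
which run, if any, leaves the process behind. The following is a minimal
sketch, not something taken from this report: the function names
check_leftover and repeat_test and the LOG variable are hypothetical, and
it assumes that pgrep(1) is available and that the stuck helper is still
visible under the process name h_ps_strings2 (pgrep matches the process
name by default, not the argument vector that the helper rewrites).

```shell
#!/bin/sh
# check_leftover NAME: succeed if a process named exactly NAME exists.
check_leftover() {
    pgrep -x "$1" >/dev/null 2>&1
}

# repeat_test COUNT NAME CMD...: run CMD up to COUNT times, appending its
# output to $LOG (default: ./log).  Stop early if CMD fails or if a
# process named NAME is still present after a run.
# Returns 0 if every run was clean, 1 if CMD failed, 2 on a leftover.
repeat_test() {
    count=$1
    name=$2
    shift 2
    i=1
    while [ "$i" -le "$count" ]; do
        if ! "$@" >>"${LOG:-log}" 2>&1; then
            echo "run $i: test failed"
            return 1
        fi
        if check_leftover "$name"; then
            echo "run $i: leftover $name"
            return 2
        fi
        i=$((i + 1))
    done
    echo "no leftover $name after $count runs"
    return 0
}

# Example, run as root from /usr/tests/kernel:
#   repeat_test 1000 h_ps_strings2 atf-run t_ps_strings
```

Since the problem has so far only shown up during full ATF runs, a clean
result from this loop would not rule out the race; it only narrows down
whether t_ps_strings in isolation can trigger it.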

 -- 
 Andreas Gustafsson, gson@gson.org

From: Martin Husemann <martin@duskware.de>
To: Andreas Gustafsson <gson@gson.org>
Cc: gnats-bugs@netbsd.org
Subject: Re: kern/54840: Some ATF test runs leave h_ps_strings2 process
 running
Date: Tue, 7 Jan 2020 12:10:13 +0100

 It also does not happen every time for me (sounds like a race, and could
 also be a libpthread issue).

 Martin
