NetBSD Problem Report #47923
From gson@gson.org Thu Jun 13 06:46:59 2013
Return-Path: <gson@gson.org>
Received: from mail.netbsd.org (mail.netbsd.org [149.20.53.66])
(using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits))
(Client CN "mail.NetBSD.org", Issuer "Postmaster NetBSD.org" (verified OK))
by mollari.NetBSD.org (Postfix) with ESMTPS id 43C7770933
for <gnats-bugs@gnats.NetBSD.org>; Thu, 13 Jun 2013 06:46:59 +0000 (UTC)
Message-Id: <20130613064654.3672475E14@guava.gson.org>
Date: Thu, 13 Jun 2013 09:46:54 +0300 (EEST)
From: gson@gson.org (Andreas Gustafsson)
Reply-To: gson@gson.org (Andreas Gustafsson)
To: gnats-bugs@gnats.NetBSD.org
Subject: ATF ffs tests leave processes running on sparc
X-Send-Pr-Version: 3.95
>Number: 47923
>Category: misc
>Synopsis: ATF ffs tests leave processes running on sparc
>Confidential: no
>Severity: serious
>Priority: medium
>Responsible: misc-bug-people
>State: open
>Class: sw-bug
>Submitter-Id: net
>Arrival-Date: Thu Jun 13 06:50:00 +0000 2013
>Closed-Date:
>Last-Modified: Sun Apr 27 09:15:01 +0000 2014
>Originator: Andreas Gustafsson
>Release: NetBSD-current >= 2012.09.30.21.26.58
>Organization:
>Environment:
System: NetBSD
Architecture: sparc
Machine: sparc
>Description:
When running the tests in /usr/tests/fs/ffs on sparc, after the tests
have completed, one or more h_ffs_server processes are still
running in the background. See for example the "ps-post-test" lines
near the end of
http://releng.netbsd.org/b5reports/sparc/build/2013.06.12.17.13.05/test.log
This has been happening for some time. According to the log files on
babylon5, it started with the following commits in September 2012:
2012.09.30.21.26.57 bouyer src/tests/fs/ffs/h_quota2_tests.c 1.4
2012.09.30.21.26.58 bouyer src/tests/fs/ffs/t_miscquota.sh 1.7
>How-To-Repeat:
To reproduce by hand:
cd /usr/tests/fs/ffs
atf-run
ps -glaxw | grep ffs_server
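To get a count rather than eyeballing the ps listing, the last step can be wrapped in a small helper. This is only a sketch: the function name count_leftover is hypothetical and not part of the test suite, and it assumes the stray helpers appear in the ps output under the name h_ffs_server, as in the description above.

```sh
#!/bin/sh
# count_leftover (hypothetical helper): read ps output on stdin and
# print the number of lines that mention h_ffs_server.
# The bracket in the pattern keeps the grep process itself from
# matching if it happens to appear in the ps listing.
# grep -c exits non-zero when the count is 0, so "|| true" keeps the
# helper usable under "set -e".
count_leftover() {
    grep -c '[h]_ffs_server' || true
}
```

After running atf-run in /usr/tests/fs/ffs, `ps -glaxw | count_leftover` should then print 0 on a system that is not affected.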
>Fix:
>Release-Note:
>Audit-Trail:
State-Changed-From-To: open->closed
State-Changed-By: gson@NetBSD.org
State-Changed-When: Wed, 15 Jan 2014 08:10:08 +0000
State-Changed-Why:
Problem disappeared with the commit of src/tests/fs/ffs/ffs_common.sh 1.2
on 2013.07.29.13.15.24. Thanks skrll.
From: matthew green <mrg@eterna.com.au>
To: gnats-bugs@NetBSD.org
Cc: misc-bug-people@netbsd.org, netbsd-bugs@netbsd.org,
gnats-admin@netbsd.org, gson@NetBSD.org,
gson@gson.org (Andreas Gustafsson)
Subject: re: misc/47923 (ATF ffs tests leave processes running on sparc)
Date: Wed, 15 Jan 2014 19:47:48 +1100
> Synopsis: ATF ffs tests leave processes running on sparc
>
> State-Changed-From-To: open->closed
> State-Changed-By: gson@NetBSD.org
> State-Changed-When: Wed, 15 Jan 2014 08:10:08 +0000
> State-Changed-Why:
> Problem disappeared with the commit of src/tests/fs/ffs/ffs_common.sh 1.2
> on 2013.07.29.13.15.24. Thanks skrll.
i still see this on real hardware. it's much worse on SMP.
there are often over 50 processes left at the end of the run in
tests/fs/ffs.
.mrg.
State-Changed-From-To: closed->open
State-Changed-By: gson@NetBSD.org
State-Changed-When: Wed, 15 Jan 2014 08:51:18 +0000
State-Changed-Why:
mrg is still seeing this on real hardware.
From: Andreas Gustafsson <gson@NetBSD.org>
To: matthew green <mrg@eterna.com.au>
Cc: gnats-bugs@NetBSD.org,
misc-bug-people@netbsd.org,
netbsd-bugs@netbsd.org,
gnats-admin@netbsd.org
Subject: re: misc/47923 (ATF ffs tests leave processes running on sparc)
Date: Wed, 15 Jan 2014 11:01:40 +0200
matthew green wrote:
> i still see this on real hardware. it's much worse on SMP.
Could you describe your hardware in more detail?
Does further increasing the timeout in the test_case_root() function
in ffs_common.sh fix the problem for you?
--
Andreas Gustafsson, gson@NetBSD.org
From: matthew green <mrg@eterna.com.au>
To: Andreas Gustafsson <gson@NetBSD.org>
Cc: gnats-bugs@NetBSD.org, misc-bug-people@netbsd.org,
netbsd-bugs@netbsd.org, gnats-admin@netbsd.org
Subject: re: misc/47923 (ATF ffs tests leave processes running on sparc)
Date: Wed, 15 Jan 2014 20:46:59 +1100
> matthew green wrote:
> > i still see this on real hardware. it's much worse on SMP.
>
> Could you describe your hardware in more detail?
SS20 with two SM70 (75mhz) cpus (i think that's right.) 384MB ram.
> Does further increasing the timeout in the test_case_root() function
> in ffs_common.sh fix the problem for you?
i'll try and see. (i hard hung this box just a short time ago. ;-)
State-Changed-From-To: open->feedback
State-Changed-By: dholland@NetBSD.org
State-Changed-When: Sat, 26 Apr 2014 05:55:16 +0000
State-Changed-Why:
is this fixed now? some similar stuff was supposedly fixed by a recent atf
update.
From: matthew green <mrg@eterna.com.au>
To: gnats-bugs@NetBSD.org
Cc: misc-bug-people@netbsd.org, netbsd-bugs@netbsd.org,
gnats-admin@netbsd.org, dholland@NetBSD.org,
gson@gson.org (Andreas Gustafsson)
Subject: re: misc/47923 (ATF ffs tests leave processes running on sparc)
Date: Sat, 26 Apr 2014 16:44:52 +1000
pretty sure this still fails, but my ss20 is running something that
may or may not be with the latest atf, so i'm running a build and
will see what happens.
.mrg.
State-Changed-From-To: feedback->open
State-Changed-By: mrg@NetBSD.org
State-Changed-When: Sat, 26 Apr 2014 23:48:05 +0000
State-Changed-Why:
this still fails for me.
> Does further increasing the timeout in the test_case_root() function
> in ffs_common.sh fix the problem for you?
i tried this, increasing the timeouts by 10x and it does not help.
.mrg.
From: Martin Husemann <martin@duskware.de>
To: gnats-bugs@NetBSD.org
Cc:
Subject: Re: misc/47923 (ATF ffs tests leave processes running on sparc)
Date: Sun, 27 Apr 2014 11:10:42 +0200
FWIW: I see very similar effects on my evbarm test runs
(http://www.netbsd.org/~martin/evbarm-atf/), but not on the shark
runs. My bet is on something MD in mutex/pthread land.
Martin
>Unformatted: