NetBSD Problem Report #57345

From www@netbsd.org  Wed Apr 12 17:28:53 2023
Return-Path: <www@netbsd.org>
Received: from mail.netbsd.org (mail.netbsd.org [199.233.217.200])
	(using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits))
	(Client CN "mail.NetBSD.org", Issuer "mail.NetBSD.org CA" (not verified))
	by mollari.NetBSD.org (Postfix) with ESMTPS id 488331A9239
	for <gnats-bugs@gnats.NetBSD.org>; Wed, 12 Apr 2023 17:28:53 +0000 (UTC)
Message-Id: <20230412172851.2A5A51A923A@mollari.NetBSD.org>
Date: Wed, 12 Apr 2023 17:28:51 +0000 (UTC)
From: jspath55@gmail.com
Reply-To: jspath55@gmail.com
To: gnats-bugs@NetBSD.org
Subject: Automated test kernel/kqueue/t_empty fails intermittently on an amd64 machine
X-Send-Pr-Version: www-1.0

>Number:         57345
>Category:       kern
>Synopsis:       Automated test kernel/kqueue/t_empty fails intermittently on an amd64 machine
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    kern-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Wed Apr 12 17:30:01 +0000 2023
>Originator:     Jim Spath
>Release:        10.0_BETA
>Organization:
>Environment:
NetBSD hi.me 10.0_BETA NetBSD 10.0_BETA (GENERIC) #0: Sun Feb 12 12:39:37 UTC 2023  mkrepro@mkrepro.NetBSD.org:/usr/src/sys/arch/amd64/compile/GENERIC amd64 x86_64 AMD 686-class NetBSD
>Description:
With several full automated test framework passes done, I found some tests that fail (or succeed) intermittently. When this test fails for me, the output is:

tps-count: 1
tp-start: 1680663191.765848, kernel/kqueue/t_empty, 1
tc-start: 1680663191.765881, sock_tcp
tc-end: 1680663191.777728, sock_tcp, failed, /usr/src/tests/kernel/kqueue/t_empty.c:167: (readsock = accept(readsock, (struct sockaddr *)&sin, &slen)) != -1 not met
tp-end: 1680663191.778046, kernel/kqueue/t_empty
info: time.end, Wed Apr 5 02:53:11 UTC 2023

I next ran loops of 10, 20 and 30 runs, finding failures about half the time (but not getting a pass first, then a fail, and then another pass).

With 30 repetitions, one result showed 19 passes (and 11 failures):
$ grep passed  ~/tests.t_empty_x30.txt | wc -l
      19

Then I set up frequent runs driven by a cron job, for just over 24 hours. Pass/fail summary:
$ grep tc-end t*log | awk '{print $1, $3, $4, $5}' | sort | uniq -c
 812 tc-end: sock_tcp, failed, /usr/src/tests/kernel/kqueue/t_empty.c:167:
1018 tc-end: sock_tcp, passed 

I also rebooted the machine, shut down optional applications but still see failures.
>How-To-Repeat:
The test run command is:

cd /usr/tests
atf-run kernel/kqueue/t_empty 
>Fix:
Unknown root cause.

NetBSD Home
NetBSD PR Database Search

(Contact us) $NetBSD: query-full-pr,v 1.47 2022/09/11 19:34:41 kim Exp $
$NetBSD: gnats_config.sh,v 1.9 2014/08/02 14:16:04 spz Exp $
Copyright © 1994-2023 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.