NetBSD Problem Report #52926
From martin@duskware.de Mon Jan 15 17:13:22 2018
Return-Path: <martin@duskware.de>
Received: from mail.netbsd.org (mail.netbsd.org [199.233.217.200])
(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
(Client CN "mail.NetBSD.org", Issuer "mail.NetBSD.org CA" (not verified))
by mollari.NetBSD.org (Postfix) with ESMTPS id 754C17A18F
for <gnats-bugs@gnats.NetBSD.org>; Mon, 15 Jan 2018 17:13:22 +0000 (UTC)
From: martin@NetBSD.org
Reply-To: martin@NetBSD.org
To: gnats-bugs@NetBSD.org
Subject: rump server locks up in ld.elf_so
X-Send-Pr-Version: 3.95
>Number: 52926
>Category: lib
>Synopsis: rump server locks up in ld.elf_so
>Confidential: no
>Severity: critical
>Priority: high
>Responsible: lib-bug-people
>State: open
>Class: sw-bug
>Submitter-Id: net
>Arrival-Date: Mon Jan 15 17:15:00 +0000 2018
>Last-Modified: Tue Jan 16 12:40:01 +0000 2018
>Originator: Martin Husemann
>Release: NetBSD 8.99.12
>Organization:
The NetBSD Foundation, Inc.
>Environment:
System: NetBSD sunset-glow.duskware.de 8.99.12 NetBSD 8.99.12 (GENERIC.MP) #1: Mon Jan 15 10:55:45 CET 2018 martin@night-owl.duskware.de:/usr/src/sys/arch/sparc/compile/GENERIC.MP sparc
Architecture: sparc
Machine: sparc
>Description:
On an SMP sparc system the tests/lib/libc/net/t_hostent test reproducably
locks up in ld.elf_so. The rump server process used to run a nameserver
is idle with a lockup like:
(gdb) thread 11
[Switching to thread 11 (LWP 1 of process 18657)]
#0 0xededc8e0 in ___lwp_park60 () from /usr/libexec/ld.elf_so
(gdb) bt
#0 0xededc8e0 in ___lwp_park60 () from /usr/libexec/ld.elf_so
#1 0xeded32f8 in _rtld_exclusive_enter (mask=mask@entry=0xefffef00) at /usr/src/libexec/ld.elf_so/rtld.c:1714
#2 0xededb7f8 in _rtld_tls_allocate () at /usr/src/libexec/ld.elf_so/tls.c:160
#3 0xedc0abec in pthread_create (thread=thread@entry=0xefffefe0, attr=attr@entry=0xefffefe4,
startfunc=startfunc@entry=0xedda9aac <threadbouncer>, arg=arg@entry=0xed315288) at /usr/src/lib/libpthread/pthread.c:524
#4 0xedc87998 in rumpuser_thread_create (f=0xedda9aac <threadbouncer>, arg=arg@entry=0xed315288, thrname=thrname@entry=0xeffff070 "pdaemon",
joinable=0, priority=priority@entry=126, cpuidx=<optimized out>, ptcookie=ptcookie@entry=0xed3397a4)
at /usr/src/lib/librumpuser/rumpuser_pth.c:77
#5 0xedda9e80 in kthread_create (pri=126, flags=<optimized out>, ci=0x0, func=<optimized out>, arg=<optimized out>,
newlp=0xede013bc <rumpns_uvm+12>, fmt=0xeddc3b90 "pdaemon") at /usr/src/lib/librump/../../sys/rump/librump/rumpkern/threads.c:211
#6 0xeddaf3d4 in rump_init () at /usr/src/lib/librump/../../sys/rump/librump/rumpkern/rump.c:415
#7 0x0001177c in main (argc=<optimized out>, argv=0xeffff798) at /usr/src/usr.bin/rump_server/../rump_allserver/rump_allserver.c:401
and no progress is ever made. The rump process will never exit.
>How-To-Repeat:
cd /usr/tests/lib/libc/net
atf-run t_hostent
>Fix:
n/a
>Audit-Trail:
From: Martin Husemann <martin@netbsd.org>
To: gnats-bugs@NetBSD.org
Cc: lib-bug-people@netbsd.org, gnats-admin@netbsd.org,
netbsd-bugs@netbsd.org
Subject: Re: lib/52926: rump server locks up in ld.elf_so
Date: Tue, 16 Jan 2018 12:36:53 +0000
I can also reproduce this on a sparc64 two CPU machine running sparc
userland under netbsd32 emulation.
This is how it looks there:
(gdb) info thread
Id Target Id Frame
* 1 LWP 11 of process 295 _rtld_shared_enter () at /work/src/libexec/ld.elf_so/rtld.c:1659
2 LWP 10 of process 295 _rtld_shared_enter () at /work/src/libexec/ld.elf_so/rtld.c:1659
3 LWP 9 of process 295 pthread_cond_timedwait (cond=0x20404b00, mutex=0x204187c0, abstime=0x0)
at /work/src/lib/libpthread/pthread_cond.c:173
4 LWP 8 of process 295 pthread_cond_timedwait (cond=0x20404ae0, mutex=0x20418740, abstime=0x0)
at /work/src/lib/libpthread/pthread_cond.c:173
5 LWP 7 of process 295 pthread_cond_timedwait (cond=0x20404b00, mutex=0x204187c0, abstime=0x0)
at /work/src/lib/libpthread/pthread_cond.c:173
6 LWP 6 of process 295 pthread_cond_timedwait (cond=0x20404ae0, mutex=0x20418740, abstime=0x0)
at /work/src/lib/libpthread/pthread_cond.c:173
7 LWP 5 of process 295 _rtld_shared_enter () at /work/src/libexec/ld.elf_so/rtld.c:1659
8 LWP 4 of process 295 pthread_cond_timedwait (cond=0x20404ae0, mutex=0x20418740, abstime=0x0)
at /work/src/lib/libpthread/pthread_cond.c:173
9 LWP 3 of process 295 pthread_cond_timedwait (cond=0x20404b00, mutex=0x204187c0, abstime=0x0)
at /work/src/lib/libpthread/pthread_cond.c:173
10 LWP 2 of process 295 pthread_cond_timedwait (cond=0x20404ae0, mutex=0x20418740, abstime=0x0)
at /work/src/lib/libpthread/pthread_cond.c:173
11 LWP 1 of process 295 _rtld_shared_enter () at /work/src/libexec/ld.elf_so/rtld.c:1659
Martin
(Contact us)
$NetBSD: query-full-pr,v 1.39 2013/11/01 18:47:49 spz Exp $
$NetBSD: gnats_config.sh,v 1.8 2006/05/07 09:23:38 tsutsui Exp $
Copyright © 1994-2014
The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.