NetBSD Problem Report #52926

From martin@duskware.de  Mon Jan 15 17:13:22 2018
Return-Path: <martin@duskware.de>
Received: from mail.netbsd.org (mail.netbsd.org [199.233.217.200])
	(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
	(Client CN "mail.NetBSD.org", Issuer "mail.NetBSD.org CA" (not verified))
	by mollari.NetBSD.org (Postfix) with ESMTPS id 754C17A18F
	for <gnats-bugs@gnats.NetBSD.org>; Mon, 15 Jan 2018 17:13:22 +0000 (UTC)
From: martin@NetBSD.org
Reply-To: martin@NetBSD.org
To: gnats-bugs@NetBSD.org
Subject: rump server locks up in ld.elf_so
X-Send-Pr-Version: 3.95

>Number:         52926
>Category:       lib
>Synopsis:       rump server locks up in ld.elf_so
>Confidential:   no
>Severity:       critical
>Priority:       high
>Responsible:    lib-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Mon Jan 15 17:15:00 +0000 2018
>Last-Modified:  Tue Jan 16 12:40:01 +0000 2018
>Originator:     Martin Husemann
>Release:        NetBSD 8.99.12
>Organization:
The NetBSD Foundation, Inc.
>Environment:
System: NetBSD sunset-glow.duskware.de 8.99.12 NetBSD 8.99.12 (GENERIC.MP) #1: Mon Jan 15 10:55:45 CET 2018 martin@night-owl.duskware.de:/usr/src/sys/arch/sparc/compile/GENERIC.MP sparc
Architecture: sparc
Machine: sparc
>Description:

On an SMP sparc system the tests/lib/libc/net/t_hostent test reproducably
locks up in ld.elf_so. The rump server process used to run a nameserver
is idle with a lockup like:

(gdb) thread 11
[Switching to thread 11 (LWP 1 of process 18657)]
#0  0xededc8e0 in ___lwp_park60 () from /usr/libexec/ld.elf_so
(gdb) bt
#0  0xededc8e0 in ___lwp_park60 () from /usr/libexec/ld.elf_so
#1  0xeded32f8 in _rtld_exclusive_enter (mask=mask@entry=0xefffef00) at /usr/src/libexec/ld.elf_so/rtld.c:1714
#2  0xededb7f8 in _rtld_tls_allocate () at /usr/src/libexec/ld.elf_so/tls.c:160
#3  0xedc0abec in pthread_create (thread=thread@entry=0xefffefe0, attr=attr@entry=0xefffefe4, 
    startfunc=startfunc@entry=0xedda9aac <threadbouncer>, arg=arg@entry=0xed315288) at /usr/src/lib/libpthread/pthread.c:524
#4  0xedc87998 in rumpuser_thread_create (f=0xedda9aac <threadbouncer>, arg=arg@entry=0xed315288, thrname=thrname@entry=0xeffff070 "pdaemon", 
    joinable=0, priority=priority@entry=126, cpuidx=<optimized out>, ptcookie=ptcookie@entry=0xed3397a4)
    at /usr/src/lib/librumpuser/rumpuser_pth.c:77
#5  0xedda9e80 in kthread_create (pri=126, flags=<optimized out>, ci=0x0, func=<optimized out>, arg=<optimized out>, 
    newlp=0xede013bc <rumpns_uvm+12>, fmt=0xeddc3b90 "pdaemon") at /usr/src/lib/librump/../../sys/rump/librump/rumpkern/threads.c:211
#6  0xeddaf3d4 in rump_init () at /usr/src/lib/librump/../../sys/rump/librump/rumpkern/rump.c:415
#7  0x0001177c in main (argc=<optimized out>, argv=0xeffff798) at /usr/src/usr.bin/rump_server/../rump_allserver/rump_allserver.c:401

and no progress is ever made. The rump process will never exit.

>How-To-Repeat:

cd /usr/tests/lib/libc/net
atf-run t_hostent

>Fix:
n/a

>Audit-Trail:
From: Martin Husemann <martin@netbsd.org>
To: gnats-bugs@NetBSD.org
Cc: lib-bug-people@netbsd.org, gnats-admin@netbsd.org,
	netbsd-bugs@netbsd.org
Subject: Re: lib/52926: rump server locks up in ld.elf_so
Date: Tue, 16 Jan 2018 12:36:53 +0000

 I can also reproduce this on a sparc64 two CPU machine running sparc
 userland under netbsd32 emulation.

 This is how it looks there:

 (gdb) info thread
   Id   Target Id         Frame 
 * 1    LWP 11 of process 295 _rtld_shared_enter () at /work/src/libexec/ld.elf_so/rtld.c:1659
   2    LWP 10 of process 295 _rtld_shared_enter () at /work/src/libexec/ld.elf_so/rtld.c:1659
   3    LWP 9 of process 295 pthread_cond_timedwait (cond=0x20404b00, mutex=0x204187c0, abstime=0x0)
     at /work/src/lib/libpthread/pthread_cond.c:173
   4    LWP 8 of process 295 pthread_cond_timedwait (cond=0x20404ae0, mutex=0x20418740, abstime=0x0)
     at /work/src/lib/libpthread/pthread_cond.c:173
   5    LWP 7 of process 295 pthread_cond_timedwait (cond=0x20404b00, mutex=0x204187c0, abstime=0x0)
     at /work/src/lib/libpthread/pthread_cond.c:173
   6    LWP 6 of process 295 pthread_cond_timedwait (cond=0x20404ae0, mutex=0x20418740, abstime=0x0)
     at /work/src/lib/libpthread/pthread_cond.c:173
   7    LWP 5 of process 295 _rtld_shared_enter () at /work/src/libexec/ld.elf_so/rtld.c:1659
   8    LWP 4 of process 295 pthread_cond_timedwait (cond=0x20404ae0, mutex=0x20418740, abstime=0x0)
     at /work/src/lib/libpthread/pthread_cond.c:173
   9    LWP 3 of process 295 pthread_cond_timedwait (cond=0x20404b00, mutex=0x204187c0, abstime=0x0)
     at /work/src/lib/libpthread/pthread_cond.c:173
   10   LWP 2 of process 295 pthread_cond_timedwait (cond=0x20404ae0, mutex=0x20418740, abstime=0x0)
     at /work/src/lib/libpthread/pthread_cond.c:173
   11   LWP 1 of process 295 _rtld_shared_enter () at /work/src/libexec/ld.elf_so/rtld.c:1659


 Martin

NetBSD Home
NetBSD PR Database Search

(Contact us) $NetBSD: query-full-pr,v 1.39 2013/11/01 18:47:49 spz Exp $
$NetBSD: gnats_config.sh,v 1.8 2006/05/07 09:23:38 tsutsui Exp $
Copyright © 1994-2014 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.