NetBSD Problem Report #49075

From www@NetBSD.org  Tue Aug  5 18:32:07 2014
Return-Path: <www@NetBSD.org>
Received: from mail.netbsd.org (mail.netbsd.org [149.20.53.66])
	(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
	(Client CN "mail.netbsd.org", Issuer "Postmaster NetBSD.org" (verified OK))
	by mollari.NetBSD.org (Postfix) with ESMTPS id 58C45A955B
	for <gnats-bugs@gnats.NetBSD.org>; Tue,  5 Aug 2014 18:32:07 +0000 (UTC)
Message-Id: <20140805183205.D06ADA958A@mollari.NetBSD.org>
Date: Tue,  5 Aug 2014 18:32:05 +0000 (UTC)
From: brad@anduin.eldar.org
Reply-To: brad@anduin.eldar.org
To: gnats-bugs@NetBSD.org
Subject: OpenJDK7 hangs in lwp_park50 in Xen PVs
X-Send-Pr-Version: www-1.0

>Number:         49075
>Category:       port-xen
>Synopsis:       OpenJDK7 hangs in lwp_park50 in Xen PVs
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    port-xen-maintainer
>State:          closed
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Tue Aug 05 18:35:00 +0000 2014
>Closed-Date:    Fri Jul 17 11:06:07 +0000 2020
>Last-Modified:  Fri Jul 17 11:06:07 +0000 2020
>Originator:     Brad Spencer
>Release:        NetBSD 6.1.4_PATCH
>Organization:
>Environment:
NetBSD bradrun2 6.1.4_PATCH NetBSD 6.1.4_PATCH (XEN3_DOMU) #0: Mon Jun 30 22:24:35 EDT 2014  brad@bradnetbsd:/usr/obj/sys/arch/amd64/compile/XEN3_DOMU amd64

>Description:
It appears that some Java applications can get stuck in the lwp_park50 system call in DOMUs.  A ktrace of a hung application shows the following repeating forever:

   781     10 java     CALL  ___lwp_park50(0x7f7fef7ffaf0,0,0x7f7ff7b4f440,0x7f7ff7b4f440)
   781     10 java     RET   ___lwp_park50 -1 errno 60 Connection timed out
   781     10 java     CALL  __clock_gettime50(3,0x7f7fef7ffbc0)
   781     10 java     RET   __clock_gettime50 0
   781     10 java     CALL  __gettimeofday50(0x7f7fef7ffb20,0)
   781     10 java     RET   __gettimeofday50 0
   781     10 java     CALL  __clock_gettime50(3,0x7f7fef7ffbc0)
   781     10 java     RET   __clock_gettime50 0
   781     10 java     CALL  __gettimeofday50(0x7f7fef7ffae0,0)
   781     10 java     RET   __gettimeofday50 0
   781     10 java     CALL  ___lwp_park50(0x7f7fef7ffaf0,0,0x7f7ff7b4f440,0x7f7ff7b4f440)
   781     10 java     RET   ___lwp_park50 -1 errno 60 Connection timed out
   781     10 java     CALL  __clock_gettime50(3,0x7f7fef7ffbc0)
   781     10 java     RET   __clock_gettime50 0
   781     10 java     CALL  __gettimeofday50(0x7f7fef7ffb20,0)
   781     10 java     RET   __gettimeofday50 0
   781     10 java     CALL  __clock_gettime50(3,0x7f7fef7ffbc0)
   781     10 java     RET   __clock_gettime50 0
   781     10 java     CALL  __gettimeofday50(0x7f7fef7ffae0,0)
   781     10 java     RET   __gettimeofday50 0

Further, if a CTRL-C is attempted against the java process, something goes into a tight loop such that the PV guest becomes completely unresponsive and has to be destroyed.

Also note that if I run this particular Java application in DOM0 using the same JVM it does not hang at all.

>How-To-Repeat:
I can't provide the application that trips this for me, but the same basic problem can be seen with the jar command.  Executing the following:

openjdk7-jar t /usr/pkg/java/openjdk7/jre/lib/rt.jar

will produce the same output in ktrace as the stuck java application.  The difference is that a CTRL-C will kill the jar process and not cause it to run away and DOM0 in this case exhibits the same problem as a DOMU.

Also:

openjdk7-1.7.51nb2  Open-source implementation of the Java Platform, Standard Edition

>Fix:
Not known.  I tried a DOMU kernel without any of the COMPAT stuff complied in just to see and tried a DOMU without MULTIPROCESSOR enabled.  Neither helped.

>Release-Note:

>Audit-Trail:

State-Changed-From-To: open->feedback
State-Changed-By: jdolecek@NetBSD.org
State-Changed-When: Sun, 24 Jun 2018 20:43:37 +0000
State-Changed-Why:
Can you please try if this still happens with NetBSD 8.0_RC1?


State-Changed-From-To: feedback->closed
State-Changed-By: jdolecek@NetBSD.org
State-Changed-When: Fri, 17 Jul 2020 11:06:07 +0000
State-Changed-Why:
Feedback timeout. Thanks for report.


>Unformatted:

NetBSD Home
NetBSD PR Database Search

(Contact us) $NetBSD: query-full-pr,v 1.46 2020/01/03 16:35:01 leot Exp $
$NetBSD: gnats_config.sh,v 1.9 2014/08/02 14:16:04 spz Exp $
Copyright © 1994-2020 The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.