NetBSD Problem Report #49075
From www@NetBSD.org Tue Aug 5 18:32:07 2014
Return-Path: <www@NetBSD.org>
Received: from mail.netbsd.org (mail.netbsd.org [149.20.53.66])
(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
(Client CN "mail.netbsd.org", Issuer "Postmaster NetBSD.org" (verified OK))
by mollari.NetBSD.org (Postfix) with ESMTPS id 58C45A955B
for <gnats-bugs@gnats.NetBSD.org>; Tue, 5 Aug 2014 18:32:07 +0000 (UTC)
Message-Id: <20140805183205.D06ADA958A@mollari.NetBSD.org>
Date: Tue, 5 Aug 2014 18:32:05 +0000 (UTC)
From: brad@anduin.eldar.org
Reply-To: brad@anduin.eldar.org
To: gnats-bugs@NetBSD.org
Subject: OpenJDK7 hangs in lwp_park50 in Xen PVs
X-Send-Pr-Version: www-1.0
>Number: 49075
>Category: port-xen
>Synopsis: OpenJDK7 hangs in lwp_park50 in Xen PVs
>Confidential: no
>Severity: serious
>Priority: medium
>Responsible: port-xen-maintainer
>State: closed
>Class: sw-bug
>Submitter-Id: net
>Arrival-Date: Tue Aug 05 18:35:00 +0000 2014
>Closed-Date: Fri Jul 17 11:06:07 +0000 2020
>Last-Modified: Fri Jul 17 11:06:07 +0000 2020
>Originator: Brad Spencer
>Release: NetBSD 6.1.4_PATCH
>Organization:
>Environment:
NetBSD bradrun2 6.1.4_PATCH NetBSD 6.1.4_PATCH (XEN3_DOMU) #0: Mon Jun 30 22:24:35 EDT 2014 brad@bradnetbsd:/usr/obj/sys/arch/amd64/compile/XEN3_DOMU amd64
>Description:
It appears that some Java applications can get stuck in the lwp_park50 system call in DOMUs. A ktrace of a hung application shows the following repeating forever:
781 10 java CALL ___lwp_park50(0x7f7fef7ffaf0,0,0x7f7ff7b4f440,0x7f7ff7b4f440)
781 10 java RET ___lwp_park50 -1 errno 60 Connection timed out
781 10 java CALL __clock_gettime50(3,0x7f7fef7ffbc0)
781 10 java RET __clock_gettime50 0
781 10 java CALL __gettimeofday50(0x7f7fef7ffb20,0)
781 10 java RET __gettimeofday50 0
781 10 java CALL __clock_gettime50(3,0x7f7fef7ffbc0)
781 10 java RET __clock_gettime50 0
781 10 java CALL __gettimeofday50(0x7f7fef7ffae0,0)
781 10 java RET __gettimeofday50 0
781 10 java CALL ___lwp_park50(0x7f7fef7ffaf0,0,0x7f7ff7b4f440,0x7f7ff7b4f440)
781 10 java RET ___lwp_park50 -1 errno 60 Connection timed out
781 10 java CALL __clock_gettime50(3,0x7f7fef7ffbc0)
781 10 java RET __clock_gettime50 0
781 10 java CALL __gettimeofday50(0x7f7fef7ffb20,0)
781 10 java RET __gettimeofday50 0
781 10 java CALL __clock_gettime50(3,0x7f7fef7ffbc0)
781 10 java RET __clock_gettime50 0
781 10 java CALL __gettimeofday50(0x7f7fef7ffae0,0)
781 10 java RET __gettimeofday50 0
Further, if a CTRL-C is attempted against the java process, something goes into a tight loop such that the PV guest becomes completely unresponsive and has to be destroyed.
Also note that if I run this particular Java application in DOM0 using the same JVM it does not hang at all.
>How-To-Repeat:
I can't provide the application that trips this for me, but the same basic problem can be seen with the jar command. Executing the following:
openjdk7-jar t /usr/pkg/java/openjdk7/jre/lib/rt.jar
will produce the same output in ktrace as the stuck java application. The difference is that a CTRL-C will kill the jar process and not cause it to run away and DOM0 in this case exhibits the same problem as a DOMU.
Also:
openjdk7-1.7.51nb2 Open-source implementation of the Java Platform, Standard Edition
>Fix:
Not known. I tried a DOMU kernel without any of the COMPAT stuff complied in just to see and tried a DOMU without MULTIPROCESSOR enabled. Neither helped.
>Release-Note:
>Audit-Trail:
State-Changed-From-To: open->feedback
State-Changed-By: jdolecek@NetBSD.org
State-Changed-When: Sun, 24 Jun 2018 20:43:37 +0000
State-Changed-Why:
Can you please try if this still happens with NetBSD 8.0_RC1?
State-Changed-From-To: feedback->closed
State-Changed-By: jdolecek@NetBSD.org
State-Changed-When: Fri, 17 Jul 2020 11:06:07 +0000
State-Changed-Why:
Feedback timeout. Thanks for report.
>Unformatted:
(Contact us)
$NetBSD: query-full-pr,v 1.46 2020/01/03 16:35:01 leot Exp $
$NetBSD: gnats_config.sh,v 1.9 2014/08/02 14:16:04 spz Exp $
Copyright © 1994-2020
The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.