NetBSD Problem Report #54489

From www@netbsd.org  Mon Aug 26 15:12:52 2019
Return-Path: <www@netbsd.org>
Received: from mail.netbsd.org (mail.netbsd.org [199.233.217.200])
	(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
	(Client CN "mail.NetBSD.org", Issuer "mail.NetBSD.org CA" (not verified))
	by mollari.NetBSD.org (Postfix) with ESMTPS id AB8977A14B
	for <gnats-bugs@gnats.NetBSD.org>; Mon, 26 Aug 2019 15:12:52 +0000 (UTC)
Message-Id: <20190826151251.DE8657A1BF@mollari.NetBSD.org>
Date: Mon, 26 Aug 2019 15:12:51 +0000 (UTC)
From: tnn@nygren.pp.se
Reply-To: tnn@nygren.pp.se
To: gnats-bugs@NetBSD.org
Subject: panic: assertion "rcr4() & CR4_SMAP" failed on HP DL380 Gen10
X-Send-Pr-Version: www-1.0

>Number:         54489
>Category:       port-amd64
>Synopsis:       panic: assertion "rcr4() & CR4_SMAP" failed on HP DL380 Gen10
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    port-amd64-maintainer
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Mon Aug 26 15:15:00 +0000 2019
>Closed-Date:    
>Last-Modified:  Tue Sep 08 08:35:01 +0000 2020
>Originator:     Tobias Nygren
>Release:        9.99.10
>Organization:
>Environment:
N/A
>Description:
Tried to boot a -current GENERIC kernel on a dual socket HP DL380 Gen10.

It failed with assertion "rcr4() & CR4_SMAP" at arch/x86/x86/patch.c line 299 during multiprocessor hatch.

The machine has:
Slot 0: 16 core "Intel(R) Xeon(R) Gold 6130 CPU @ 2.10GHz"
Slot 1: 16 core "Intel(R) Xeon(R) Gold 6130 CPU @ 2.10GHz"
Memory: 384 GB RAM.

I tried to #if 0 the SMAP code but then the kernel asserted later with "ci->ci_tlbstate != TLBSTATE_VALID" in x86/pmap.c line 2792.
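
For reference, a minimal sketch of the bit tests behind the failing assertion. This is not the code in patch.c. The bit positions (CR4 bit 21, CPUID leaf 7 subleaf 0 EBX bit 20) are taken from the Intel SDM, not from this report, and every sketch_* name is hypothetical. The values are passed in as plain integers so the logic can be checked in userland.

```c
#include <stdbool.h>
#include <stdint.h>

/* Assumed bit positions, per the Intel SDM (not from this report). */
#define SKETCH_CR4_SMAP        (UINT64_C(1) << 21) /* CR4.SMAP */
#define SKETCH_CPUID7_EBX_SMAP (UINT32_C(1) << 20) /* CPUID.(EAX=7,ECX=0):EBX.SMAP */

/* True if the CPU advertises SMAP in CPUID leaf 7, subleaf 0, EBX. */
bool
sketch_cpu_has_smap(uint32_t cpuid7_ebx)
{
	return (cpuid7_ebx & SKETCH_CPUID7_EBX_SMAP) != 0;
}

/* True if SMAP is enabled in the given CR4 value. */
bool
sketch_cr4_smap_enabled(uint64_t cr4)
{
	return (cr4 & SKETCH_CR4_SMAP) != 0;
}

/*
 * True if the CPU advertises SMAP but the given CR4 value does not have
 * it enabled, i.e. the state in which an assertion of the form
 * "rcr4() & CR4_SMAP" would fail.
 */
bool
sketch_smap_mismatch(uint32_t cpuid7_ebx, uint64_t cr4)
{
	return sketch_cpu_has_smap(cpuid7_ebx) &&
	    !sketch_cr4_smap_enabled(cr4);
}
```

Feeding these helpers the per-CPU CPUID and CR4 values (for example from ddb or a debug printf) would show which CPU reaches the check with SMAP advertised but not enabled.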
>How-To-Repeat:

>Fix:

>Release-Note:

>Audit-Trail:
From: Tobias Nygren <tnn@nygren.pp.se>
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: port-amd64/54489: panic: assertion "rcr4() & CR4_SMAP" failed on
 HP DL380 Gen10
Date: Tue, 27 Aug 2019 10:48:04 +0200

 CPU info from Linux:

 # lscpu
 Architecture:          x86_64
 CPU op-mode(s):        32-bit, 64-bit
 Byte Order:            Little Endian
 CPU(s):                64
 On-line CPU(s) list:   0-63
 Thread(s) per core:    2
 Core(s) per socket:    16
 Socket(s):             2
 NUMA node(s):          2
 Vendor ID:             GenuineIntel
 CPU family:            6
 Model:                 85
 Model name:            Intel(R) Xeon(R) Gold 6130 CPU @ 2.10GHz
 Stepping:              4
 CPU MHz:               1045.129
 CPU max MHz:           3700.0000
 CPU min MHz:           1000.0000
 BogoMIPS:              4200.00
 Virtualization:        VT-x
 L1d cache:             32K
 L1i cache:             32K
 L2 cache:              1024K
 L3 cache:              22528K
 NUMA node0 CPU(s):     0-15,32-47
 NUMA node1 CPU(s):     16-31,48-63
 Flags:                 fpu vme de pse tsc msr pae mce cx8 apic sep mtrr 
 pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe 
 syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts 
 rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq 
 dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid 
 dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx 
 f16c rdrand lahf_lm abm 3dnowprefetch epb cat_l3 cdp_l3 intel_ppin 
 intel_pt ssbd mba ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid 
 fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm mpx 
 rdt_a avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw 
 avx512vl xsaveopt xsavec xgetbv1 cqm_llc cqm_occup_llc cqm_mbm_total 
 cqm_mbm_local dtherm ida arat pln pts hwp hwp_act_window hwp_pkg_req pku 
 ospke md_clear spec_ctrl intel_stibp flush_l1d

State-Changed-From-To: open->feedback
State-Changed-By: maxv@NetBSD.org
State-Changed-When: Tue, 29 Oct 2019 10:20:06 +0000
State-Changed-Why:
On which CPU does the SMAP KASSERT fire?


From: Takahiro Kambe <taca@back-street.net>
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: port-amd64/54489: panic: assertion "rcr4() & CR4_SMAP" failed
 on HP DL380 Gen10
Date: Thu, 13 Aug 2020 00:03:14 +0900 (JST)

 Hi,

 In message <20190826151500.50B6F7A1DC@mollari.NetBSD.org>
 	on Mon, 26 Aug 2019 15:15:00 +0000 (UTC),
 	tnn@nygren.pp.se wrote:
 >>Number:         54489
 >>Category:       port-amd64
 >>Synopsis:       panic: assertion "rcr4() & CR4_SMAP" failed on HP DL380 Gen10
 >>Confidential:   no
 >>Severity:       serious
 >>Priority:       medium
 >>Responsible:    port-amd64-maintainer
 >>State:          open
 >>Class:          sw-bug
 >>Submitter-Id:   net
 >>Arrival-Date:   Mon Aug 26 15:15:00 +0000 2019
 >>Originator:     Tobias Nygren
 >>Release:        9.99.10
 I hit the same problem on a machine at my office:

 HPE ProLiant MicroServer Gen10 Plus

 The release is 9.99.70, and I have several photos of the screen.

 >>Description:
 > Tried to boot a -current GENERIC kernel on a dual socket HP DL380 Gen10.
 > 
 > It failed with assertion "rcr4() & CR4_SMAP" at arch/x86/x86/patch.c line 299 during multiprocessor hatch.
 The same failure.

 > The machine has:
 > Slot 0: 16 core "Intel(R) Xeon(R) Gold 6130 CPU @ 2.10GHz"
 > Slot 1: 16 core "Intel(R) Xeon(R) Gold 6130 CPU @ 2.10GHz"
 > Memory: 384 GB RAM.
 Intel(R) Xeon(R) E-2224 CPU @ 3.40GHz
 Memory: 16GB

 > I tried to #if 0 the SMAP code but then the kernel asserted later with "ci->ci_tlbstate != TLBSTATE_VALID" in x86/pmap.c line 2792.
 I've confirmed the same behavior here.

 > State-Changed-From-To: open->feedback
 > State-Changed-By: maxv@NetBSD.org
 > State-Changed-When: Tue, 29 Oct 2019 10:20:06 +0000
 > State-Changed-Why:
 > On which CPU does the SMAP KASSERT fire?
 "cpu0" in my case.

 -- 
 Takahiro Kambe <taca@back-street.net>

State-Changed-From-To: feedback->open
State-Changed-By: tnn@NetBSD.org
State-Changed-When: Thu, 13 Aug 2020 09:53:45 +0000
State-Changed-Why:
Feedback was provided.


From: Takahiro Kambe <taca@back-street.net>
To: gnats-bugs@netbsd.org
Cc: 
Subject: Re: port-amd64/54489 (panic: assertion "rcr4() & CR4_SMAP" failed
 on HP DL380 Gen10)
Date: Tue, 08 Sep 2020 17:30:10 +0900 (JST)

 Is there anyone working on this?

 -- 
 Takahiro Kambe <taca@back-street.net>

>Unformatted:
