NetBSD Problem Report #59309
From www@netbsd.org Thu Apr 17 06:41:28 2025
Return-Path: <www@netbsd.org>
Received: from mail.netbsd.org (mail.netbsd.org [199.233.217.200])
(using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)
key-exchange X25519 server-signature RSA-PSS (2048 bits)
client-signature RSA-PSS (2048 bits))
(Client CN "mail.NetBSD.org", Issuer "mail.NetBSD.org CA" (not verified))
by mollari.NetBSD.org (Postfix) with ESMTPS id 1B49B1A9239
for <gnats-bugs@gnats.NetBSD.org>; Thu, 17 Apr 2025 06:41:28 +0000 (UTC)
Message-Id: <20250417064126.DAB591A923D@mollari.NetBSD.org>
Date: Thu, 17 Apr 2025 06:41:26 +0000 (UTC)
From: campbell+netbsd@mumble.net
Reply-To: campbell+netbsd@mumble.net
To: gnats-bugs@NetBSD.org
Subject: optimize fpu_kern_enter in kthread_fpu_enter/exit
X-Send-Pr-Version: www-1.0
>Number: 59309
>Category: port-amd64
>Synopsis: optimize fpu_kern_enter in kthread_fpu_enter/exit
>Confidential: no
>Severity: serious
>Priority: medium
>Responsible: port-amd64-maintainer
>State: open
>Class: sw-bug
>Submitter-Id: net
>Arrival-Date: Thu Apr 17 06:45:01 +0000 2025
>Originator: Taylor R Campbell
>Release: current
>Organization:
The x86FPU kFoundation
>Environment:
>Description:
On x86, fpu_kern_enter currently always saves FPU state, toggles CR0.TS back and forth to disable and re-enable the FPu, and then restores a safe FPU state from memory. This is expensive, and should be unnecessary when running in a kernel thread during kthread_fpu_enter/exit. On aarch64, this improved cgd(4) throughput by ~20%, if memory serves.
My first attempts to take advantage of kthread_fpu_enter/exit on x86 met with failure:
https://mail-index.netbsd.org/source-changes/2020/08/01/msg120050.html
https://mail-index.netbsd.org/source-changes/2023/02/25/msg143550.html
The symptom of the latter failure was described in PR kern/57258: kthread_fpu_enter/exit problem <https://gnats.NetBSD.org/57258>.
It's possible this is related to an undersized union savefpu, which should be addressed by this patch series once it lands: https://mail-index.NetBSD.org/port-amd64/2025/04/11/msg003748.html
It's possible this is related to the order in which LW_SYSTEM_FPU is set/cleared and kthread_fpu_enter/exit_md is called by kthread_fpu_enter/exit, particularly if an interrupt happens in the intervening time.
Not sure but we should figure it out!
>How-To-Repeat:
get frustrated at seeing cgd(4) threads taking too much CPU time under heavy disk I/O
>Fix:
Yes, please!
(Contact us)
$NetBSD: query-full-pr,v 1.47 2022/09/11 19:34:41 kim Exp $
$NetBSD: gnats_config.sh,v 1.9 2014/08/02 14:16:04 spz Exp $
Copyright © 1994-2025
The NetBSD Foundation, Inc. ALL RIGHTS RESERVED.