[llvm] [AMDGPU] Automatic conversion from wave32 to wave64 (PR #137376)

Matt Arsenault via llvm-commits llvm-commits at lists.llvm.org
Mon May 5 09:34:08 PDT 2025


================
@@ -0,0 +1,321 @@
+//===- SIConvertWaveSize.cpp - Automatically converts wave32 kernels to wave64
+//---------===//
+//
+// Part of the LLVM Project, under the Apache License v2.0 WITH LLVM Exceptions.
+// See https://llvm.org/LICENSE.txt for license information.
+// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
+//
+//===----------------------------------------------------------------------===//
+//
+/// \file
+// Small short living kernels may become waveslot limited.
+// To work around the problem an optimization is proposed to convert such
+// kernels from wave32 to wave64 automatically.These kernels shall conform to a
+// strict set of limitations and satisfy profitability conditions.
+//
+// 1. A kernel shall have no function calls as we cannot analyze call stack
+// requirements (nor will it fall into a category of short living kernels
+// anyway).
----------------
arsenm wrote:

The call itself isn't necessarily expensive, but the attributor would also take care of most of these queries even within a single kernel 

https://github.com/llvm/llvm-project/pull/137376


More information about the llvm-commits mailing list