[llvm] [AMDGPU][CodeGen][True16] Track waitcnt of vgpr32 instead of vgpr16 for 16bit reg in GFX11 (PR #157795)

Joe Nash via llvm-commits llvm-commits at lists.llvm.org
Mon Sep 15 07:20:41 PDT 2025


================
@@ -586,6 +586,12 @@ def FeatureRealTrue16Insts : SubtargetFeature<"real-true16",
   "Use true 16-bit registers"
 >;
 
+def Feature16bitD16HWBug : SubtargetFeature<"d16-hw-bug",
+  "Enable16bitD16HWBug",
+  "true",
+  "D16 for 16 bit data type interfere the other half in true16 mode"
----------------
Sisyph wrote:

```suggestion
  "D16 instructions potentially have 32-bit data dependencies"
```


https://github.com/llvm/llvm-project/pull/157795

