[llvm] [AMDGPU] SIPeepholeSDWA: Handle V_CNDMASK_B32_e64 (PR #137930)
Frederik Harwath via llvm-commits
llvm-commits at lists.llvm.org
Mon May 5 05:40:29 PDT 2025
================
@@ -0,0 +1,41 @@
+# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py UTC_ARGS: --version 5
+# RUN: llc -mtriple=amdgcn-amd-amdhsa -run-pass=si-peephole-sdwa -mcpu=gfx803 -o - %s | FileCheck -check-prefix=GFX8 %s
+# RUN: llc -mtriple=amdgcn-amd-amdhsa -run-pass=si-peephole-sdwa -mcpu=gfx1100 -o - %s | FileCheck -check-prefix=GFX11 %s
+
+---
----------------
frederik-h wrote:
> This test should merge into the other 2. This also should really have a separate wave32 and wave64 version, and it's a bug you need to use vcc in wave32

OK, I will merge the tests. Concerning the bug: do you mean having to specify `implicit $vcc` in `%4:vgpr_32 = V_CNDMASK_B32_e32 killed %2, killed %3, implicit $exec, implicit $vcc` with "wave32"?
https://github.com/llvm/llvm-project/pull/137930