[llvm] [AMDGPU] Eliminate unnecessary packing in wider f16 vectors for sdwa/opsel-able instruction (PR #137137)
Vikash Gupta via llvm-commits
llvm-commits at lists.llvm.org
Tue Apr 29 00:37:27 PDT 2025
vg0204 wrote:
> Precommit test is merged into this patch so it kind of defeats the purpose of precommiting the tests I'd also like to see MIR tests that test the specific patterns you're looking for, along with some edge cases that can break the code to test for infinite loops, crashes and so on.
Added basic MIR test specifying the targeted pattern (generated via ISEL just before si-peephole-sdwa pass for intrinsics). Yet to add non-conservative and edge test!
https://github.com/llvm/llvm-project/pull/137137
More information about the llvm-commits
mailing list