[llvm] [X86] canCreateUndefOrPoisonForTargetNode - add GF2P8AFFINEINVQB / GF2P8AFFINEQB / GF2P8MULB handling (PR #155409)
Simon Pilgrim via llvm-commits
llvm-commits at lists.llvm.org
Tue Aug 26 05:55:40 PDT 2025
https://github.com/RKSimon created https://github.com/llvm/llvm-project/pull/155409
All 3 instructions are well defined bit twiddling operations - they do not introduce undef/poison with well defined inputs.
Fixes regressions in #152107
>From 30b9bd136dd6d0e91e8aebf81fe68bfd90f94430 Mon Sep 17 00:00:00 2001
From: Simon Pilgrim <llvm-dev at redking.me.uk>
Date: Tue, 26 Aug 2025 13:54:25 +0100
Subject: [PATCH] [X86] canCreateUndefOrPoisonForTargetNode - add
GF2P8AFFINEINVQB / GF2P8AFFINEQB / GF2P8MULB handling
All 3 instructions are well defined bit twiddling operations - they do not introduce undef/poison with well defined inputs.
Fixes regressions in #152107
---
llvm/lib/Target/X86/X86ISelLowering.cpp | 5 +++++
llvm/test/CodeGen/X86/combine-gfni.ll | 9 +++------
2 files changed, 8 insertions(+), 6 deletions(-)
diff --git a/llvm/lib/Target/X86/X86ISelLowering.cpp b/llvm/lib/Target/X86/X86ISelLowering.cpp
index 19131fbd4102b..dacbda6d7eb10 100644
--- a/llvm/lib/Target/X86/X86ISelLowering.cpp
+++ b/llvm/lib/Target/X86/X86ISelLowering.cpp
@@ -45167,6 +45167,11 @@ bool X86TargetLowering::canCreateUndefOrPoisonForTargetNode(
// SSE signbit extraction.
case X86ISD::MOVMSK:
return false;
+ // GFNI instructions.
+ case X86ISD::GF2P8AFFINEINVQB:
+ case X86ISD::GF2P8AFFINEQB:
+ case X86ISD::GF2P8MULB:
+ return false;
case ISD::INTRINSIC_WO_CHAIN:
switch (Op->getConstantOperandVal(0)) {
case Intrinsic::x86_sse2_pmadd_wd:
diff --git a/llvm/test/CodeGen/X86/combine-gfni.ll b/llvm/test/CodeGen/X86/combine-gfni.ll
index 21ea17a20e0fd..b105cdf7ea895 100644
--- a/llvm/test/CodeGen/X86/combine-gfni.ll
+++ b/llvm/test/CodeGen/X86/combine-gfni.ll
@@ -24,8 +24,7 @@ define <16 x i8> @gf2p8affineqb_freeze(<16 x i8> %a0, <16 x i8> %a1, <16 x i8> %
; AVX512-LABEL: gf2p8affineqb_freeze:
; AVX512: # %bb.0:
; AVX512-NEXT: vpmovb2m %xmm2, %k1
-; AVX512-NEXT: vgf2p8affineqb $11, %xmm1, %xmm1, %xmm1
-; AVX512-NEXT: vmovdqu8 %xmm1, %xmm0 {%k1}
+; AVX512-NEXT: vgf2p8affineqb $11, %xmm1, %xmm1, %xmm0 {%k1}
; AVX512-NEXT: retq
%i = icmp slt <16 x i8> %a2, zeroinitializer
%g = call <16 x i8> @llvm.x86.vgf2p8affineqb.128(<16 x i8> %a1, <16 x i8> %a1, i8 11)
@@ -55,8 +54,7 @@ define <16 x i8> @gf2p8affineinvqb_freeze(<16 x i8> %a0, <16 x i8> %a1, <16 x i8
; AVX512-LABEL: gf2p8affineinvqb_freeze:
; AVX512: # %bb.0:
; AVX512-NEXT: vpmovb2m %xmm2, %k1
-; AVX512-NEXT: vgf2p8affineinvqb $11, %xmm1, %xmm1, %xmm1
-; AVX512-NEXT: vmovdqu8 %xmm1, %xmm0 {%k1}
+; AVX512-NEXT: vgf2p8affineinvqb $11, %xmm1, %xmm1, %xmm0 {%k1}
; AVX512-NEXT: retq
%i = icmp slt <16 x i8> %a2, zeroinitializer
%g = call <16 x i8> @llvm.x86.vgf2p8affineinvqb.128(<16 x i8> %a1, <16 x i8> %a1, i8 11)
@@ -86,8 +84,7 @@ define <16 x i8> @gf2p8mulb_freeze(<16 x i8> %a0, <16 x i8> %a1, <16 x i8> %a2)
; AVX512-LABEL: gf2p8mulb_freeze:
; AVX512: # %bb.0:
; AVX512-NEXT: vpmovb2m %xmm2, %k1
-; AVX512-NEXT: vgf2p8mulb %xmm1, %xmm1, %xmm1
-; AVX512-NEXT: vmovdqu8 %xmm1, %xmm0 {%k1}
+; AVX512-NEXT: vgf2p8mulb %xmm1, %xmm1, %xmm0 {%k1}
; AVX512-NEXT: retq
%i = icmp slt <16 x i8> %a2, zeroinitializer
%g = call <16 x i8> @llvm.x86.vgf2p8mulb.128(<16 x i8> %a1, <16 x i8> %a1)
More information about the llvm-commits
mailing list