[clang] [llvm] [HLSL] Implement elementwise firstbitlow builtin (PR #116858)

Ashley Coleman via llvm-commits llvm-commits at lists.llvm.org
Mon Dec 16 12:52:38 PST 2024


================
@@ -3166,109 +3171,228 @@ bool SPIRVInstructionSelector::selectFirstBitHigh32(Register ResVReg,
       .constrainAllUses(TII, TRI, RBI);
 }
 
-bool SPIRVInstructionSelector::selectFirstBitHigh64(Register ResVReg,
-                                                    const SPIRVType *ResType,
-                                                    MachineInstr &I,
-                                                    bool IsSigned) const {
-  Register OpReg = I.getOperand(2).getReg();
-  // 1. split our int64 into 2 pieces using a bitcast
-  unsigned count = GR.getScalarOrVectorComponentCount(ResType);
-  SPIRVType *baseType = GR.retrieveScalarOrVectorIntType(ResType);
+bool SPIRVInstructionSelector::selectFirstBitSet64(
+    Register ResVReg, const SPIRVType *ResType, MachineInstr &I,
+    Register SrcReg, unsigned BitSetOpcode, bool SwapPrimarySide) const {
+  unsigned ComponentCount = GR.getScalarOrVectorComponentCount(ResType);
+  SPIRVType *BaseType = GR.retrieveScalarOrVectorIntType(ResType);
+  bool ZeroAsNull = STI.isOpenCLEnv();
+  Register ConstIntZero =
+      GR.getOrCreateConstInt(0, I, BaseType, TII, ZeroAsNull);
+  Register ConstIntOne =
+      GR.getOrCreateConstInt(1, I, BaseType, TII, ZeroAsNull);
+
+  // SPIRV doesn't support vectors with more than 4 components. Since the
+  // algoritm below converts i64 -> i32x2 and i64x4 -> i32x8 it can only
+  // operate on vectors with 2 or less components. When largers vectors are
+  // seen. Split them, recurse, then recombine them.
+  if (ComponentCount > 2) {
+    unsigned LeftComponentCount = ComponentCount / 2;
----------------
V-FEXrt wrote:

Can you give an example? I'm pretty sure the splitting will never create a vector too large (but the merging back together certainly can)

Example:

Given `u64x12` the call stack becomes

```c++
selectFirstBitSet64(u64x12); // Top
selectFirstBitSet64Overflow(u64x12); // Top
  selectFirstBitSet64(u64x6); // Top.Left
  selectFirstBitSet64Overflow(u64x6);   // Top.Left
    selectFirstBitSet64(u64x3);  // Top.Left.Left
    selectFirstBitSet64Overflow(u64x3); // Top.Left.Left
      selectFirstBitSet64(u64);  // Top.Left.Left.Left
      selectFirstBitSet64(u64x2);  // Top.Left.Left.Right
    selectFirstBitSet64Overflow(u64x3); // Top.Left.Right
      selectFirstBitSet64(u64);  // Top.Left.Right.Left
      selectFirstBitSet64(u64x2);  // Top.Left.Right.Right
  selectFirstBitSet64Overflow(u64x6);   // Top.Right
    selectFirstBitSet64(u64x3);  // Top.Right.Left
    selectFirstBitSet64Overflow(u64x3); // Top.Right.Left
      selectFirstBitSet64(u64);  // Top.Right.Left.Left
      selectFirstBitSet64(u64x2);  // Top.Right.Left.Right
    selectFirstBitSet64Overflow(u64x3); // Top.Right.Right
      selectFirstBitSet64(u64);  // Top.Right.Right.Left
      selectFirstBitSet64(u64x2);  // Top.Right.Right.Right
```

https://github.com/llvm/llvm-project/pull/116858


More information about the llvm-commits mailing list