[PATCH] D79100: [LV][TTI] Emit new IR intrinsic llvm.get.active.mask for tail-folded loops

Eli Friedman via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue May 19 14:54:29 PDT 2020


efriedma added inline comments.


================
Comment at: llvm/docs/LangRef.rst:16198
+active/inactive vector lanes. This mask can e.g. be used in the masked
+load/store instructions.
+
----------------
Is this semantically equivalent to icmp ule?  If it is, you should probably make that more clear, and explain that it's a hint to the backend.  If not, this needs a much more thorough explanation.


================
Comment at: llvm/include/llvm/IR/Intrinsics.td:1240
+  Intrinsic<[llvm_anyvector_ty],
+            [llvm_anyvector_ty, llvm_anyvector_ty], [IntrNoDuplicate]>;
 
----------------
Is IntrNoDuplicate here actually semantically significant?  The LangRef explanation doesn't really indicate why it needs to be noduplicate.

Please use LLVMMatchType/LLVMScalarOrSameVectorWidth to ensure the argument/result types match.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D79100/new/

https://reviews.llvm.org/D79100





More information about the llvm-commits mailing list