[PATCH] D104205: [X86] Schedule-model second (mask) output of GATHER instruction

Mon Jun 14 05:53:42 PDT 2021

lebedev.ri added inline comments.

================
Comment at: llvm/lib/Target/X86/X86SchedBroadwell.td:210
+// Model the effect of clobbering the read-write mask operand of the GATHER operation.
+// Does not cost anything by itself, only has latency, matching that of the WriteLoad,
+defm : X86WriteRes<WriteGatherMaskWriteback, [], 5, [], 0>;
----------------
RKSimon wrote:
> Why does it match a scalar load latency and not a vector load latency?
As you can see in `llvm/lib/Target/X86/X86InstrAVX512.td`/`llvm/lib/Target/X86/X86InstrSSE.td` changes,
for gather, `WriteLoad` is used for some reason. This simply mimics that.
I do guess it should be `WriteVecLoadX`/`WriteVecLoadY`, but that seems like a separate change?

Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D104205/new/

https://reviews.llvm.org/D104205