[PATCH] D147408: [AMDGPU] Enable AMDGPU Atomic Optimizer Pass by default.
Ruiling, Song via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Apr 3 23:35:02 PDT 2023
ruiling added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUAtomicOptimizer.cpp:463
+
+ for (unsigned LaneIdx = 0; LaneIdx < WaveFrontSize; LaneIdx++) {
+ // Iterate over all the lanes of a wavefront to compute the partial sum. If
----------------
Why do we unroll the loop over the full wavefront size? I think this makes the sp3 assembly hard to read. Wouldn't a loop over only the active lanes work just as well?
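To make the question concrete, here is a minimal host-side C++ sketch of the two loop shapes. It is not the pass's code and does not emit IR; it only simulates a wavefront with a 64-bit execution mask. The names `LaneValues`, `ScanResult`, `lowestSetLane`, `scanAllLanes` and `scanActiveLanes` are hypothetical, and the 64-lane wavefront is an assumption made for the example.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical stand-ins for illustration; not taken from the pass itself.
constexpr unsigned WaveFrontSize = 64; // assumed wave64
using LaneValues = std::array<uint32_t, WaveFrontSize>;

struct ScanResult {
  LaneValues Prefix{};     // exclusive prefix sum, written for active lanes only
  uint32_t Total = 0;      // sum over all active lanes
  unsigned Iterations = 0; // number of loop iterations executed
};

// Index of the lowest set bit in a non-zero mask.
unsigned lowestSetLane(uint64_t Mask) {
  assert(Mask != 0 && "mask must have at least one active lane");
  unsigned Idx = 0;
  while ((Mask & 1) == 0) {
    Mask >>= 1;
    ++Idx;
  }
  return Idx;
}

// Shape 1: visit every lane slot of the wavefront and skip inactive ones.
// Fully unrolling this loop yields WaveFrontSize copies of the body.
ScanResult scanAllLanes(const LaneValues &V, uint64_t Exec) {
  ScanResult R;
  for (unsigned LaneIdx = 0; LaneIdx < WaveFrontSize; LaneIdx++) {
    ++R.Iterations;
    if (((Exec >> LaneIdx) & 1) == 0)
      continue; // inactive lane contributes nothing
    R.Prefix[LaneIdx] = R.Total;
    R.Total += V[LaneIdx];
  }
  return R;
}

// Shape 2: visit only the active lanes by peeling off the lowest set bit
// of the mask on each iteration.
ScanResult scanActiveLanes(const LaneValues &V, uint64_t Exec) {
  ScanResult R;
  for (uint64_t Remaining = Exec; Remaining != 0; Remaining &= Remaining - 1) {
    ++R.Iterations;
    unsigned LaneIdx = lowestSetLane(Remaining);
    R.Prefix[LaneIdx] = R.Total;
    R.Total += V[LaneIdx];
  }
  return R;
}
```

Both shapes produce the same prefix values and total; the second runs one iteration per active lane and keeps a single copy of the loop body, which is the readability point raised above.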
Repository:
  rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D147408/new/
https://reviews.llvm.org/D147408