[PATCH] D65644: [AMDGPU] gfx10 atomic optimizer changes.
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Aug 5 07:33:15 PDT 2019
arsenm added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUAtomicOptimizer.cpp:289
+ B, Op, V,
+ B.CreateIntrinsic(Intrinsic::amdgcn_update_dpp, Ty,
+ {Identity, V, B.getInt32(DPP::ROW_SHR0 | 1 << Idx),
----------------
I think it would end up being shorter/less line wrapping if you separately got the declaration for the update_dpp intrinsic and reused it in all of these places
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUAtomicOptimizer.cpp:293
+ }
+ if (ST->getGeneration() < AMDGPUSubtarget::GFX10) {
+ // GFX9 has DPP row broadcast operations.
----------------
I'm trying to avoid explicit getGeneration checks everywhere, and restricting them to all be in the Subtarget.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D65644/new/
https://reviews.llvm.org/D65644
More information about the llvm-commits
mailing list