[PATCH] D92483: AMDGPU - Use MUBUF instructions for global address space access
Tony Tye via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Dec 23 10:58:09 PST 2020
t-tye added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUSubtarget.cpp:93
  if (isAmdHsaOS()) // Turn on FlatForGlobal for HSA.
    FullFS += "+flat-for-global,+unaligned-access-mode,+trap-handler,";
----------------
pvellien wrote:
> t-tye wrote:
> > It seems +flat-for-global should not be added for SI. I suggest pulling the later logic up to here and combining the two.
> But the Gen member variable is set to the correct generation only after the call to ParseSubtargetFeatures, so it would be incorrect to use getGeneration() at this point. I think it would be better to make the selection conditionally later.
But could that earlier code be using some other means to determine if the target supports flat operations? Is there no target property for that?
If not, then leaving it "as is" is fine with me. It would just be nice if the logic for each aspect were kept together in one place rather than spread over several, which makes it hard to figure out what is happening.
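
To make the ordering concern concrete, here is a minimal standalone sketch of the "default first, fix up after parsing" approach, with both steps kept in one function. It does not use the real LLVM classes: SketchSubtarget, defaultFeatures, applyFeatures and initSketchSubtarget are hypothetical names, and the HasFlatInsts flag is an assumed stand-in for whatever target property would indicate flat support.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for the subtarget state discussed above.
// None of these names are the real AMDGPU subtarget API.
struct SketchSubtarget {
  bool HasFlatInsts = false;  // assumed to be known only after feature parsing
  bool FlatForGlobal = false; // the setting under discussion
};

// Defaults that depend only on the OS, which is known before parsing.
std::string defaultFeatures(bool IsAmdHsa) {
  std::string FS;
  if (IsAmdHsa) // Turn on FlatForGlobal for HSA.
    FS += "+flat-for-global,+unaligned-access-mode,+trap-handler,";
  return FS;
}

// Very reduced stand-in for feature parsing: only looks for one feature.
void applyFeatures(SketchSubtarget &ST, const std::string &FS) {
  ST.FlatForGlobal = FS.find("+flat-for-global") != std::string::npos;
}

// Keeps the whole flat-for-global decision in one place:
// 1. build OS-based defaults, 2. parse, 3. drop the setting again if the
// target turns out not to support flat instructions.
SketchSubtarget initSketchSubtarget(bool IsAmdHsa, bool HasFlatInsts) {
  SketchSubtarget ST;
  applyFeatures(ST, defaultFeatures(IsAmdHsa));
  ST.HasFlatInsts = HasFlatInsts; // in this sketch, resolved after parsing
  if (!ST.HasFlatInsts)
    ST.FlatForGlobal = false;
  return ST;
}
```

With this shape, an HSA target without flat instructions ends up with FlatForGlobal cleared, and the default and the later correction sit next to each other instead of in separate places.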
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D92483/new/
https://reviews.llvm.org/D92483