[PATCH] D92483: AMDGPU - Use MUBUF instructions for global address space access

praveen velliengiri via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Dec 23 11:19:12 PST 2020


pvellien added a comment.

If this is fine, please land it. I don't have commit access yet.



================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUSubtarget.cpp:93
   if (isAmdHsaOS()) // Turn on FlatForGlobal for HSA.
     FullFS += "+flat-for-global,+unaligned-access-mode,+trap-handler,";
 
----------------
t-tye wrote:
> pvellien wrote:
> > t-tye wrote:
> > > It seems +flat-for-global should not be added for SI. Suggest pulling the later logic up to here and combining the two.
> > But the Gen member variable is set to the correct generation only after the call to ParseSubtargetFeatures, so it would be incorrect to use getGeneration() at this point. I think it would be better to do the conditional selection later.
> But could that earlier code be using some other means to determine if the target supports flat operations? Is there no target property for that?
> 
> If not then leaving "as is" is fine with me. Just would be nice if the logic for each aspect was logically organized together rather than spread over different places making it hard to figure out what is happening.
As far as I know, there is no target property for that. I agree that it would be much clearer to have a common place for this mechanism.
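
To make the ordering issue discussed above concrete, here is a minimal, self-contained sketch. It is not the LLVM code: `ToySubtarget`, `parseFeatures`, `initialize`, and `checkExamples` are hypothetical names invented for illustration, and the parser is a toy. It only shows why the generation cannot be consulted while the feature string is still being built, and why the feature is therefore cleared after parsing.

```cpp
#include <cassert>
#include <sstream>
#include <string>

// Hypothetical stand-in for a subtarget; this is NOT LLVM's GCNSubtarget.
enum class Generation { SouthernIslands, SeaIslands, VolcanicIslands };

struct ToySubtarget {
  // Placeholder default; only meaningful once parseFeatures() has run.
  Generation Gen = Generation::SouthernIslands;
  bool FlatForGlobal = false;

  // Toy feature-string parser (assumed behaviour, for illustration only).
  void parseFeatures(const std::string &FS) {
    std::stringstream SS(FS);
    std::string Tok;
    while (std::getline(SS, Tok, ',')) {
      if (Tok == "+flat-for-global")
        FlatForGlobal = true;
      else if (Tok == "-flat-for-global")
        FlatForGlobal = false;
      else if (Tok == "+sea-islands")
        Gen = Generation::SeaIslands;
      else if (Tok == "+volcanic-islands")
        Gen = Generation::VolcanicIslands;
    }
  }

  void initialize(bool IsHsa, const std::string &FS) {
    std::string FullFS;
    // Gen still holds its default here, so it cannot guide this choice.
    if (IsHsa)
      FullFS += "+flat-for-global,";
    FullFS += FS;
    parseFeatures(FullFS);
    // Only now is Gen known: drop the feature where flat is unsupported.
    if (Gen == Generation::SouthernIslands)
      FlatForGlobal = false;
  }
};

// Small usage check of the sketch above.
inline void checkExamples() {
  ToySubtarget SI;
  SI.initialize(/*IsHsa=*/true, "");
  assert(!SI.FlatForGlobal); // cleared after parsing

  ToySubtarget CI;
  CI.initialize(/*IsHsa=*/true, "+sea-islands");
  assert(CI.FlatForGlobal); // kept on a later generation
}
```

The cost of this arrangement is the one raised in the review: the decision about the feature ends up split between the place where the string is built and the place where it is corrected.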


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D92483/new/

https://reviews.llvm.org/D92483


