[llvm] [AMDGPU] Add intrinsic readanylane (PR #115696)
    via llvm-commits 
    llvm-commits at lists.llvm.org
       
    Mon Nov 11 17:57:30 PST 2024
    
    
  
ruiling wrote:
> The first part "if x is uniform" means that it is uniform at the place where it is used, not at the place where it is defined. In your example that means that all the dark red cells must have the same value (42) but other green cells could be different.

I'd suggest we state clearly that readanylane is defined only if its input is uniform both where it is defined and at the readanylane call that uses it. We really want the intrinsic to be non-convergent so that the related middle-end optimizations remain possible.

By the way, I think that for the existing uniformity analysis we can say: "if a value is uniform at its use, then it is also uniform at its definition", right? Currently the only mismatch we have is a value that is uniform at its definition but divergent at its use.
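
To make the two readings concrete, here is a toy lane-level model in Python. It is only a sketch for discussion, not LLVM code: the function names (`is_uniform`, `read_any_lane`, `read_any_lane_strict`), the lane sets, and the use of `None` to stand for "undefined" are all assumptions made up for this illustration.

```python
from typing import Optional, Sequence, Set


def is_uniform(values: Sequence[int], active: Set[int]) -> bool:
    """True if every active lane holds the same value (vacuously true if none)."""
    return len({values[lane] for lane in active}) <= 1


def read_any_lane(values: Sequence[int], use_active: Set[int]) -> Optional[int]:
    """'Uniform at use' reading: defined iff the lanes active at the call agree.

    None stands in for an undefined result (an assumption of this model).
    """
    if not use_active or not is_uniform(values, use_active):
        return None
    # Any active lane would do; pick the lowest one to stay deterministic.
    return values[min(use_active)]


def read_any_lane_strict(values: Sequence[int],
                         def_active: Set[int],
                         use_active: Set[int]) -> Optional[int]:
    """Stricter reading: the input must be uniform at its definition as well."""
    if not is_uniform(values, def_active):
        return None
    return read_any_lane(values, use_active)


# Four lanes are active where x is defined, but only lanes 0 and 1 reach the use.
x = [42, 42, 7, 7]
def_lanes = {0, 1, 2, 3}
use_lanes = {0, 1}

print(read_any_lane(x, use_lanes))                     # 42 (uniform at use)
print(read_any_lane_strict(x, def_lanes, use_lanes))   # None (divergent at def)
print(read_any_lane_strict([42] * 4, def_lanes, use_lanes))  # 42
```

One way to read the motivation for the stricter rule: in this model, if the value is uniform over the lanes active at its definition, it is uniform over any subset of those lanes, so the result does not depend on where the call is placed between definition and use. That is the property a non-convergent intrinsic would need if optimizations are free to move the call.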
https://github.com/llvm/llvm-project/pull/115696
    
    