[llvm] [AMDGPU] Fix code sequence for barrier start in GFX10+ CU Mode (PR #160501)
    Pierre van Houtryve via llvm-commits 
    llvm-commits at lists.llvm.org
       
    Thu Sep 25 02:01:17 PDT 2025
    
    
  
Pierre-vh wrote:
> Quite frankly, I'd rather admit that trying to reduce CU-mode release fences to only VM_VSRC waits was a mistake, and go back to using the same waits there as for WGP-mode.
We never emitted a `vm_vsrc(0)` wait for workgroup release fences in CU mode.
> This pessimizes LDS-only barriers too much.
Which LDS-only barriers ? The check only includes `S_BARRIER`. 
Do you think we should instead pessimize all workgroup release fences in CU mode so they have a wait on storecnt?
https://github.com/llvm/llvm-project/pull/160501
    
    
More information about the llvm-commits
mailing list