[llvm] [CodeGen] Use 128bits for LaneBitmask. (PR #111157)
    Matt Arsenault via llvm-commits 
    llvm-commits at lists.llvm.org
       
    Fri Oct  4 10:20:25 PDT 2024
    
    
  
arsenm wrote:
> I'm not really sure what AMDGPU does that is different or how it encodes the information more efficiently. Are there any lane masks in the table I shared above that you believe use unnecessary regunits?
This table seems to have one bit for every subregister index. I would expect overlapping tuples to use multiple bits of mask. e.g. AMDGPU has this:
```
hi16 :   L0000000000000001 EMPTY
lo16 :   L0000000000000002 EMPTY
sub0 :   L0000000000000003 EMPTY
sub0_sub1 :   L000000000000000F EMPTY
sub0_sub1_sub2 :   L000000000000003F EMPTY
...
```
https://github.com/llvm/llvm-project/pull/111157
    
    
More information about the llvm-commits
mailing list