[llvm] [CodeGen] Use 128bits for LaneBitmask. (PR #111157)

Matt Arsenault via llvm-commits llvm-commits at lists.llvm.org
Fri Oct 4 10:20:25 PDT 2024


arsenm wrote:

> I'm not really sure what AMDGPU does that is different or how it encodes the information more efficiently. Are there any lane masks in the table I shared above that you believe use unnecessary regunits?

This table seems to have one bit for every subregister index. I would expect overlapping tuples to use multiple bits of mask. e.g. AMDGPU has this:

```
hi16 :   L0000000000000001 EMPTY
lo16 :   L0000000000000002 EMPTY
sub0 :   L0000000000000003 EMPTY
sub0_sub1 :   L000000000000000F EMPTY
sub0_sub1_sub2 :   L000000000000003F EMPTY
...

```

https://github.com/llvm/llvm-project/pull/111157


More information about the llvm-commits mailing list