[llvm] [CodeGen] Use 128bits for LaneBitmask. (PR #111157)

Matt Arsenault via llvm-commits llvm-commits at lists.llvm.org
Fri Oct 4 08:03:29 PDT 2024


arsenm wrote:

> I think this is mostly because defining register tuples (2x, 3x and 4x) replicates the regunits. When I define the top bits and do some post-processing of the table in AArch64GenRegisterInfo.inc, I get the following lane masks:

This doesn't sound right. AMDGPU nearly exclusively uses register tuples, and we get one regunit per lane (well, one for each 16-bit half of each lane). 

https://github.com/llvm/llvm-project/pull/111157


More information about the llvm-commits mailing list