[llvm] [AMDGPU] Automatic conversion from wave32 to wave64 (PR #137376)
Stanislav Mekhanoshin via llvm-commits
llvm-commits at lists.llvm.org
Mon Apr 28 10:09:06 PDT 2025
rampitec wrote:
> > I am not sure what do you mean, but it sounds you are referring to intrinsics accessing wave size since these need such mangling. That shall not be allowed.
>
> @llvm.read_register.i32( some regular reg) is allowed as it is not convergent and does not read/write exec or vcc. We should use llvm.read_register.i64 instead if convert the caller kernel to wave64, otherwise it only will read a low half of the register.
You cannot do it and keep the semantics. Just bail on any use of these intrinsics regardless of a register accessed.
https://github.com/llvm/llvm-project/pull/137376
More information about the llvm-commits
mailing list