[clang] AMDGPU: Rename and add bf16 support for global_load_tr builtins (PR #86202)
Changpeng Fang via cfe-commits
cfe-commits at lists.llvm.org
Thu Mar 21 16:29:48 PDT 2024
changpeng wrote:
> > > Do you want to rename intrinsics as well? Because now intrinsic names do not match builtin names.
> >
> >
> > Do we have to match builtins with intrinsics? Renaming intrinsics here means we will have to duplicate the intrinsics.
>
> Is that because of the mangling?
Right. It was originally suggested to use a single instrinsic "load_lr". But eventually we use global_load_tr to indicate this is in global address space. If we want to rename intrinsics here, it should be global_load_tr_b64 and global_load_tr_b128.
We should rename intrinsic if users can use intrinsics directly. I think use-friendly is more important.
https://github.com/llvm/llvm-project/pull/86202
More information about the cfe-commits
mailing list