[all-commits] [llvm/llvm-project] 22a2d7: [NVPTX] Emit ld.v4.b16 for loading <4 x bfloat> (#...
Ahmed S. Taei via All-commits
all-commits at lists.llvm.org
Tue Sep 17 21:07:08 PDT 2024
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: 22a2d74c0c6dbde6b3503ec51486d6cf5d0c83f1
https://github.com/llvm/llvm-project/commit/22a2d74c0c6dbde6b3503ec51486d6cf5d0c83f1
Author: Ahmed S. Taei <asaadaldien at users.noreply.github.com>
Date: 2024-09-17 (Tue, 17 Sep 2024)
Changed paths:
M llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp
M llvm/test/CodeGen/NVPTX/vector-loads.ll
Log Message:
-----------
[NVPTX] Emit ld.v4.b16 for loading <4 x bfloat> (#109069)
This PR enables emitting a single load instruction for <4 x bfloat>,
otherwise, 2 ld.b32 loads are generated.
To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications
More information about the All-commits
mailing list