[all-commits] [llvm/llvm-project] a6f494: AMDGPU: Optimize outgoing workitem ID based on req...
Matt Arsenault via All-commits
all-commits at lists.llvm.org
Thu Jan 13 09:08:32 PST 2022
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: a6f49423c1ecad4b414c204822d26d9025da2599
https://github.com/llvm/llvm-project/commit/a6f49423c1ecad4b414c204822d26d9025da2599
Author: Matt Arsenault <Matthew.Arsenault at amd.com>
Date: 2022-01-13 (Thu, 13 Jan 2022)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPUCallLowering.cpp
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/test/CodeGen/AMDGPU/GlobalISel/irtranslator-call-implicit-args.ll
M llvm/test/CodeGen/AMDGPU/call-reqd-group-size.ll
Log Message:
-----------
AMDGPU: Optimize outgoing workitem ID based on reqd_work_group_size
If we know we we aren't using a component from the kernel, we can save
a few bit packing instructions.
We're still enabling the VGPR input to the kernel though.
Commit: 59994c25f9df9db598c48fd33d8c3089b45184cd
https://github.com/llvm/llvm-project/commit/59994c25f9df9db598c48fd33d8c3089b45184cd
Author: Matt Arsenault <Matthew.Arsenault at amd.com>
Date: 2022-01-13 (Thu, 13 Jan 2022)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.workitem.id.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.workitem.id.ll
Log Message:
-----------
AMDGPU: Select workitem ID intrinsics to 0 with req_work_group_size
Shockingly we weren't doing this already. We should probably have this
be done earlier in the IR too, but it's still helpful to have the
lowering guarantee it so that we can modify the ABI implicit inputs
based on it.
Compare: https://github.com/llvm/llvm-project/compare/c719a8596d01...59994c25f9df
More information about the All-commits
mailing list