[all-commits] [llvm/llvm-project] 2d8106: [Clang] Add width handling for <gpuintrin.h> shuff...
Joseph Huber via All-commits
all-commits at lists.llvm.org
Wed Feb 5 10:39:11 PST 2025
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: 2d8106cb5a505326d1da0f4461708ed44a0ac761
https://github.com/llvm/llvm-project/commit/2d8106cb5a505326d1da0f4461708ed44a0ac761
Author: Joseph Huber <huberjn at outlook.com>
Date: 2025-02-05 (Wed, 05 Feb 2025)
Changed paths:
M clang/lib/Headers/amdgpuintrin.h
M clang/lib/Headers/gpuintrin.h
M clang/lib/Headers/nvptxintrin.h
M libc/src/__support/GPU/utils.h
M libc/test/integration/src/__support/GPU/CMakeLists.txt
A libc/test/integration/src/__support/GPU/shuffle.cpp
Log Message:
-----------
[Clang] Add width handling for <gpuintrin.h> shuffle helper (#125896)
Summary:
The CUDA implementation has long supported the `width` argument on its
shuffle instructions, which makes it more difficult to replace those
uses with this helper. This patch implements that argument correctly for
AMDGPU and NVPTX so that the helper is equivalent to `__shfl_sync` in
CUDA. This will ease porting.
Fortunately, the extra width handling gets optimized out correctly when
a known width is passed in.
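
For readers unfamiliar with the `width` argument, the following is a
host-side sketch of the lane selection that CUDA documents for
`__shfl_sync`: the warp is split into segments of `width` lanes, and each
lane reads from index `idx` (taken modulo `width`) inside its own segment.
It is not the code from this patch. The names `kWarpSize`,
`shuffle_source_lane` and `simulate_shuffle` are hypothetical, and a
32-lane warp with a power-of-two `width` is assumed.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Assumption: a 32-lane warp, as on NVPTX. AMDGPU wavefronts may have 64 lanes.
constexpr uint32_t kWarpSize = 32;

// Hypothetical helper: returns the lane that `lane` reads from when every
// lane calls shuffle(idx, width). `width` must be a power of two.
inline uint32_t shuffle_source_lane(uint32_t lane, uint32_t idx,
                                    uint32_t width) {
  assert(width != 0 && width <= kWarpSize && (width & (width - 1)) == 0);
  // First lane of the width-sized segment that contains `lane`.
  uint32_t segment_base = lane & ~(width - 1);
  // The requested index wraps modulo `width`, so it stays in the segment.
  return segment_base + (idx & (width - 1));
}

// Hypothetical helper: applies the shuffle to a whole warp of values on the
// host, so the segment behaviour can be inspected without a GPU.
inline std::array<uint32_t, kWarpSize>
simulate_shuffle(const std::array<uint32_t, kWarpSize> &values, uint32_t idx,
                 uint32_t width) {
  std::array<uint32_t, kWarpSize> out{};
  for (uint32_t lane = 0; lane < kWarpSize; ++lane)
    out[lane] = values[shuffle_source_lane(lane, idx, width)];
  return out;
}
```

With `width` equal to the full warp size, `segment_base` is always zero
and the index mask is a no-op for in-range indices, which is consistent
with the extra arithmetic folding away for known widths.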