[all-commits] [llvm/llvm-project] 2d8106: [Clang] Add width handling for <gpuintrin.h> shuff...

Joseph Huber via All-commits all-commits at lists.llvm.org
Wed Feb 5 10:39:11 PST 2025


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 2d8106cb5a505326d1da0f4461708ed44a0ac761
      https://github.com/llvm/llvm-project/commit/2d8106cb5a505326d1da0f4461708ed44a0ac761
  Author: Joseph Huber <huberjn at outlook.com>
  Date:   2025-02-05 (Wed, 05 Feb 2025)

  Changed paths:
    M clang/lib/Headers/amdgpuintrin.h
    M clang/lib/Headers/gpuintrin.h
    M clang/lib/Headers/nvptxintrin.h
    M libc/src/__support/GPU/utils.h
    M libc/test/integration/src/__support/GPU/CMakeLists.txt
    A libc/test/integration/src/__support/GPU/shuffle.cpp

  Log Message:
  -----------
  [Clang] Add width handling for <gpuintrin.h> shuffle helper (#125896)

Summary:
The CUDA implementation has long supported the `width` argument on its
shuffle instructions, which makes it more difficult to replace those
uses with this helper. This patch implements `width` handling for
AMDGPU and NVPTX so that the helper is equivalent to `__shfl_sync` in
CUDA, which will ease porting.

Fortunately, the extra width handling is optimized out correctly when
a known width is passed in.
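
For readers unfamiliar with the `width` argument, the following is a minimal
host-side sketch of the lane-selection rule that CUDA documents for
`__shfl_sync`: the lanes are split into segments of `width` lanes, and each
lane reads from lane `idx` modulo `width` inside its own segment. This is
not the code from the patch. The names `kLanes`, `shuffle_source_lane`, and
`simulate_shuffle` are hypothetical, a 32-lane group is assumed, and the
lane mask is ignored.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Assumption for illustration: a 32-lane group, as on NVPTX.
constexpr uint32_t kLanes = 32;

// Hypothetical helper: which lane does `lane` read from when shuffling
// with index `idx` and segment size `width`?
inline uint32_t shuffle_source_lane(uint32_t lane, uint32_t idx,
                                    uint32_t width) {
  // `width` must be a power of two no larger than the lane count.
  assert(width != 0 && width <= kLanes && (width & (width - 1)) == 0);
  // First lane of the segment that contains `lane`.
  uint32_t segment_base = lane & ~(width - 1);
  // The index wraps around inside the segment.
  return segment_base + (idx & (width - 1));
}

// Hypothetical helper: apply the rule to every lane of a simulated group.
inline std::array<uint32_t, kLanes>
simulate_shuffle(const std::array<uint32_t, kLanes> &values, uint32_t idx,
                 uint32_t width) {
  std::array<uint32_t, kLanes> out{};
  for (uint32_t lane = 0; lane < kLanes; ++lane)
    out[lane] = values[shuffle_source_lane(lane, idx, width)];
  return out;
}
```

In this sketch, when `width` equals the full lane count the segment base is
always zero, so the computation reduces to a plain index lookup.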


