[all-commits] [llvm/llvm-project] b989c7: [flang][cuda] Add interfaces for __drcp_rX (#153681)

Oliver Hunt via All-commits all-commits at lists.llvm.org
Thu Aug 14 15:58:44 PDT 2025


  Branch: refs/heads/users/ojhunt/configurable-block-descriptor-ptrauth
  Home:   https://github.com/llvm/llvm-project
  Commit: b989c7c2e099211b549b2866cc4e6c6775775510
      https://github.com/llvm/llvm-project/commit/b989c7c2e099211b549b2866cc4e6c6775775510
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add interfaces for __drcp_rX (#153681)


  Commit: defbbf0129d5c1ac360c6ffd9eaa2f07ab07d622
      https://github.com/llvm/llvm-project/commit/defbbf0129d5c1ac360c6ffd9eaa2f07ab07d622
  Author: Craig Topper <craig.topper at sifive.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Target/RISCV/RISCVMoveMerger.cpp
    A llvm/test/CodeGen/RISCV/pr153598.mir

  Log Message:
  -----------
  [RISCV][MoveMerge] Don't copy kill flag when moving past an instruction that reads the register. (#153644)

If we're moving the second copy before another instruction that reads
the copied register, we need to clear the kill flag on the combined
move.

Fixes #153598.


  Commit: cb2f0d0a5f14c183e7182aba0f0e54a518de9e3f
      https://github.com/llvm/llvm-project/commit/cb2f0d0a5f14c183e7182aba0f0e54a518de9e3f
  Author: DeanSturtevant1 <dsturtevant at google.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M utils/bazel/llvm-project-overlay/mlir/BUILD.bazel

  Log Message:
  -----------
  [bazel] Fix mlir/BUILD.bazel for VectorToXeGPU. (#153696)


  Commit: df15c0d716fa0d84104b05937854becacbda6629
      https://github.com/llvm/llvm-project/commit/df15c0d716fa0d84104b05937854becacbda6629
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-device-proc.cuf
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add interfaces for __dsqrt_rn and __dsqrt_rz (#153624)


  Commit: a1529cd85a9f99a263de72e16ed2f7567c519456
      https://github.com/llvm/llvm-project/commit/a1529cd85a9f99a263de72e16ed2f7567c519456
  Author: Andy Kaylor <akaylor at nvidia.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M clang/include/clang/CIR/Dialect/Builder/CIRBaseBuilder.h
    M clang/include/clang/CIR/Dialect/IR/CIRAttrs.td
    M clang/include/clang/CIR/Dialect/IR/CIRDataLayout.h
    M clang/lib/CIR/CodeGen/CIRGenBuilder.cpp
    M clang/lib/CIR/CodeGen/CIRGenBuilder.h
    M clang/lib/CIR/CodeGen/CIRGenExprConstant.cpp
    M clang/lib/CIR/Dialect/IR/CIRDataLayout.cpp
    M clang/lib/CIR/Lowering/DirectToLLVM/LowerToLLVM.cpp
    M clang/test/CIR/CodeGen/globals.cpp

  Log Message:
  -----------
  [CIR] Add index support for global_view (#153254)

The #cir.global_view attribute was initially added without support for
the optional index list. This change adds index list support. This is
used when the address of an array or structure member is used as an
initializer.

This patch does not include support for taking the address of a
structure or class member. That will be added later.


  Commit: 57c1e01e484019084cfa975a4f41643d05ae327f
      https://github.com/llvm/llvm-project/commit/57c1e01e484019084cfa975a4f41643d05ae327f
  Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin at amd.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M clang/test/CodeGenHIP/hip-cumode.hip
    M clang/test/Driver/hip-macros.hip
    M llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp
    M llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
    M llvm/lib/Target/AMDGPU/GCNSubtarget.h
    M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUTargetStreamer.cpp
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
    M llvm/test/CodeGen/AMDGPU/GlobalISel/mul.ll
    M llvm/test/CodeGen/AMDGPU/global-load-xcnt.ll
    M llvm/test/CodeGen/AMDGPU/memory-legalizer-local-workgroup.ll
    M llvm/test/CodeGen/AMDGPU/packed-fp32.ll
    M llvm/test/MC/AMDGPU/hsa-diag-v4.s

  Log Message:
  -----------
  [AMDGPU] Don't allow wgp mode on gfx1250 (#153680)

- gfx1250 only supports cu mode


  Commit: fc4df2c9173a110102005af937a26df7f29d24e1
      https://github.com/llvm/llvm-project/commit/fc4df2c9173a110102005af937a26df7f29d24e1
  Author: Daniel Paoliello <danpao at microsoft.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M clang/test/Headers/mm3dnow.c
    M clang/test/Headers/pmmintrin.c
    M clang/test/Headers/x86-intrinsics-headers.c
    M clang/test/Headers/x86intrin.c
    M clang/test/Headers/x86intrin.cpp

  Log Message:
  -----------
  [win][arm64ec] XFAIL x64 intrinsic tests on Arm64EC (#153474)

Clang defines the x64 preprocessor macro (`__x86_64__`) when building
Arm64EC, however the tests for x64 built-ins and intrinsics are
currently failing since the relevant functions don't exist, resulting in
errors like:

```
Line 165: invalid conversion between vector type '__v2di' (vector of 2 'long long' values) and integer type 'int' of different size
```

(Clang doesn't know the intrinsics being called, and so treats it like
an undefined function, which makes it assume the return type is `int`)

For now, expect these tests to fail until someone decides to implement
these intrinsics.


  Commit: ca9ddd54b77020322a94a8032276b46cc1046522
      https://github.com/llvm/llvm-project/commit/ca9ddd54b77020322a94a8032276b46cc1046522
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add interfaces for __ll2float_rX (#153694)


  Commit: d56fa965243bcdc115b8f262ce79cf2547500c6d
      https://github.com/llvm/llvm-project/commit/d56fa965243bcdc115b8f262ce79cf2547500c6d
  Author: joaosaffran <126493771+joaosaffran at users.noreply.github.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/include/llvm/BinaryFormat/DXContainer.h
    M llvm/include/llvm/Support/DXILABI.h
    M llvm/lib/Analysis/DXILResource.cpp
    M llvm/lib/BinaryFormat/DXContainer.cpp
    M llvm/lib/Frontend/HLSL/HLSLRootSignature.cpp
    M llvm/lib/Frontend/HLSL/RootSignatureMetadata.cpp
    M llvm/lib/Support/CMakeLists.txt
    A llvm/lib/Support/DXILABI.cpp
    M llvm/lib/Target/DirectX/DXContainerGlobals.cpp
    M llvm/lib/Target/DirectX/DXILOpLowering.cpp
    M llvm/lib/Target/DirectX/DXILPostOptimizationValidation.cpp
    M llvm/lib/Target/DirectX/DXILRootSignature.h
    M llvm/test/Analysis/DXILResource/buffer-frombinding.ll
    M llvm/test/CodeGen/DirectX/llc-pipeline.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-cbuffer-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-descriptor-table-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-root-descriptor-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-sampler.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-static-sampler-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation.ll

  Log Message:
  -----------
  [DirectX] Add Range Overlap validation (#152229)

As part of the Root Signature Spec, we need to validate if Root
Signatures are not defining overlapping ranges.
Closes: https://github.com/llvm/llvm-project/issues/126645

---------

Co-authored-by: joaosaffran <joao.saffran at microsoft.com>
Co-authored-by: Joao Saffran <{ID}+{username}@users.noreply.github.com>
Co-authored-by: Joao Saffran <jderezende at microsoft.com>


  Commit: 2775c79c4fd06b4654a664c5182d9eb86f3e72c1
      https://github.com/llvm/llvm-project/commit/2775c79c4fd06b4654a664c5182d9eb86f3e72c1
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add interfaces for __float2ll_rX (#153702)


  Commit: a629119c7544e62fa031c85f87f507f9a90a070f
      https://github.com/llvm/llvm-project/commit/a629119c7544e62fa031c85f87f507f9a90a070f
  Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin at amd.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Target/AMDGPU/AMDGPURemoveIncompatibleFunctions.cpp
    M llvm/lib/Target/AMDGPU/GCNSubtarget.h
    A llvm/test/CodeGen/AMDGPU/pal-metadata-3.0.gfx1250.ll
    A llvm/test/CodeGen/AMDGPU/remove-incompatible-wave64-feature.ll

  Log Message:
  -----------
  [AMDGPU] Remove wave64 functions (#153690)

gfx1250 only supports wave32.


  Commit: 602f308d4fcc84d90d49fb708f757589bca68d4e
      https://github.com/llvm/llvm-project/commit/602f308d4fcc84d90d49fb708f757589bca68d4e
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add interface for __saturatef (#153705)


  Commit: 5ca6d1d28c4e174b75c4fee6a7b7e39d56914077
      https://github.com/llvm/llvm-project/commit/5ca6d1d28c4e174b75c4fee6a7b7e39d56914077
  Author: Oliver Hunt <oliver at apple.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M clang/include/clang/CIR/Dialect/Builder/CIRBaseBuilder.h
    M clang/include/clang/CIR/Dialect/IR/CIRAttrs.td
    M clang/include/clang/CIR/Dialect/IR/CIRDataLayout.h
    M clang/lib/CIR/CodeGen/CIRGenBuilder.cpp
    M clang/lib/CIR/CodeGen/CIRGenBuilder.h
    M clang/lib/CIR/CodeGen/CIRGenExprConstant.cpp
    M clang/lib/CIR/Dialect/IR/CIRDataLayout.cpp
    M clang/lib/CIR/Lowering/DirectToLLVM/LowerToLLVM.cpp
    M clang/test/CIR/CodeGen/globals.cpp
    M clang/test/CodeGenHIP/hip-cumode.hip
    M clang/test/Driver/hip-macros.hip
    M clang/test/Headers/mm3dnow.c
    M clang/test/Headers/pmmintrin.c
    M clang/test/Headers/x86-intrinsics-headers.c
    M clang/test/Headers/x86intrin.c
    M clang/test/Headers/x86intrin.cpp
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-device-proc.cuf
    M flang/test/Lower/CUDA/cuda-libdevice.cuf
    M llvm/include/llvm/BinaryFormat/DXContainer.h
    M llvm/include/llvm/Support/DXILABI.h
    M llvm/lib/Analysis/DXILResource.cpp
    M llvm/lib/BinaryFormat/DXContainer.cpp
    M llvm/lib/Frontend/HLSL/HLSLRootSignature.cpp
    M llvm/lib/Frontend/HLSL/RootSignatureMetadata.cpp
    M llvm/lib/Support/CMakeLists.txt
    A llvm/lib/Support/DXILABI.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp
    M llvm/lib/Target/AMDGPU/AMDGPURemoveIncompatibleFunctions.cpp
    M llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
    M llvm/lib/Target/AMDGPU/GCNSubtarget.h
    M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUTargetStreamer.cpp
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
    M llvm/lib/Target/DirectX/DXContainerGlobals.cpp
    M llvm/lib/Target/DirectX/DXILOpLowering.cpp
    M llvm/lib/Target/DirectX/DXILPostOptimizationValidation.cpp
    M llvm/lib/Target/DirectX/DXILRootSignature.h
    M llvm/lib/Target/RISCV/RISCVMoveMerger.cpp
    M llvm/test/Analysis/DXILResource/buffer-frombinding.ll
    M llvm/test/CodeGen/AMDGPU/GlobalISel/mul.ll
    M llvm/test/CodeGen/AMDGPU/global-load-xcnt.ll
    M llvm/test/CodeGen/AMDGPU/memory-legalizer-local-workgroup.ll
    M llvm/test/CodeGen/AMDGPU/packed-fp32.ll
    A llvm/test/CodeGen/AMDGPU/pal-metadata-3.0.gfx1250.ll
    A llvm/test/CodeGen/AMDGPU/remove-incompatible-wave64-feature.ll
    M llvm/test/CodeGen/DirectX/llc-pipeline.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-cbuffer-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-descriptor-table-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-root-descriptor-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-sampler.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-static-sampler-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation.ll
    A llvm/test/CodeGen/RISCV/pr153598.mir
    M llvm/test/MC/AMDGPU/hsa-diag-v4.s
    M utils/bazel/llvm-project-overlay/mlir/BUILD.bazel

  Log Message:
  -----------
  Merge branch 'main' into users/ojhunt/configurable-block-descriptor-ptrauth


Compare: https://github.com/llvm/llvm-project/compare/a5a56f131b47...5ca6d1d28c4e

To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications


More information about the All-commits mailing list