[all-commits] [llvm/llvm-project] 421d29: [CIR] Lower sret returns in CallConvLowering (#201...
Amir Ayupov via All-commits
all-commits at lists.llvm.org
Thu Jun 11 23:46:13 PDT 2026
Branch: refs/heads/users/aaupov/spr/main.bolt-split-out-setparsingbuf
Home: https://github.com/llvm/llvm-project
Commit: 421d296893ed173846c0aceaee67cdafa67da07b
https://github.com/llvm/llvm-project/commit/421d296893ed173846c0aceaee67cdafa67da07b
Author: adams381 <adams at nvidia.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M clang/lib/CIR/Dialect/Transforms/TargetLowering/CIRABIRewriteContext.cpp
M clang/lib/CIR/Dialect/Transforms/TargetLowering/CIRABIRewriteContext.h
A clang/test/CIR/Transforms/abi-lowering/indirect-return-sret.cir
Log Message:
-----------
[CIR] Lower sret returns in CallConvLowering (#201716)
Functions that return an aggregate by value classify their return as
ArgKind::Indirect, but CallConvLowering reached an errorNYI for that
case, so the whole CallConv pass refused to lower any struct-returning
function.
rewriteFunctionDefinition now recognizes an Indirect return: the wire
return type becomes void, a hidden sret pointer is prepended as block
argument 0, and every cir.return is routed through that pointer. Rather
than storing the loaded return value through the sret pointer (a
byte-copy that breaks non-trivially-copyable types -- libstdc++'s SSO
std::string keeps a _M_p pointer into its own _M_local_buf, so a
byte-copy leaves the destination aliasing the source's dying stack
storage), insertSRetStores rewires the __retval alloca to the sret
pointer so construction flows directly into the caller's slot, matching
classic CodeGen's "construct into %agg.result" pattern. CIRGen emits one
cir.load __retval / cir.return pair per return statement, all reading the
single __retval alloca, so the alloca is rewired once and every return is
collapsed to a bare return. That cir.return (cir.load <alloca>) shape is
treated as an invariant and asserted with cast<> rather than guarded by a
fallback. The sret parameter carries sret(T) align A writable
dead_on_unwind, plus noalias on definitions.
rewriteCallSite prepends the return slot, makes the call return void,
and reads the result back. When the result has a single store-into-dest
use whose destination dominates the call, it reuses that destination as
the sret slot and drops the redundant store, so the callee writes
straight into the local with no copy; otherwise it allocates a fresh slot
and loads the value out. The slot's
per-argument attributes go through the same updateArgAttrs path as the
non-sret case, so sret composes with Extend (signext/zeroext) and Ignore
arguments.
byval indirect arguments and Expand are still errorNYI.
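
The byte-copy hazard described above can be reproduced outside the compiler. The following is a minimal sketch, not code from the patch: `SelfRef`, `makeInto`, and the helper functions are hypothetical names, and `SelfRef` only imitates an SSO string by keeping a pointer into its own buffer. It compares constructing directly in a caller-provided slot (the sret shape) with byte-copying a finished local.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <new>

// Hypothetical stand-in for an SSO string: `data` points into the object's
// own inline buffer, much like _M_p pointing at _M_local_buf.
struct SelfRef {
  char buf[16];
  char *data;
  SelfRef() : data(buf) { std::strcpy(buf, "hello"); }
  SelfRef(const SelfRef &other) : data(buf) {
    std::memcpy(buf, other.buf, sizeof buf);
  }
};

// Hand-written analogue of the lowered signature: void return, with the
// caller-provided result slot passed as a hidden first parameter.
void makeInto(void *sretSlot) { new (sretSlot) SelfRef(); }

// Reads the `data` member out of raw storage that holds a SelfRef.
char *readDataPointer(const unsigned char *storage) {
  char *p;
  std::memcpy(&p, storage + offsetof(SelfRef, data), sizeof p);
  return p;
}

// Constructing directly in the caller's slot keeps `data` pointing at the
// slot's own buffer.
bool constructInPlaceKeepsInvariant() {
  alignas(SelfRef) unsigned char slot[sizeof(SelfRef)];
  makeInto(slot);
  char *slotBuf = reinterpret_cast<char *>(slot) + offsetof(SelfRef, buf);
  return readDataPointer(slot) == slotBuf;
}

// A byte-copy duplicates the pointer value, so the copy's `data` still
// points at the source object's buffer.
bool byteCopyAliasesSource() {
  SelfRef local;
  alignas(SelfRef) unsigned char slot[sizeof(SelfRef)];
  std::memcpy(slot, &local, sizeof(SelfRef));
  return readDataPointer(slot) == local.buf;
}
```

Both functions return true: the in-place result is self-consistent, while the byte-copied result still aliases `local`, which would be dead stack storage once the callee returned.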
---------
Co-authored-by: Cursor <cursoragent at cursor.com>
Commit: 3397635cb50abdc9413246a3810f7d1701211f0b
https://github.com/llvm/llvm-project/commit/3397635cb50abdc9413246a3810f7d1701211f0b
Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin at amd.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
M llvm/lib/Target/AMDGPU/VOP3PInstructions.td
M llvm/test/CodeGen/AMDGPU/packed-fp64.ll
M llvm/test/MC/AMDGPU/gfx1251_asm_vop3p.s
M llvm/test/MC/AMDGPU/gfx1251_err.s
M llvm/test/MC/Disassembler/AMDGPU/gfx1251_dasm_vop3p.txt
Log Message:
-----------
[AMDGPU] Add gfx1251 V_PK_MUL_F64 (#203410)
Commit: 673ec226f6fae99ab58cb1cff90a795b03a17f80
https://github.com/llvm/llvm-project/commit/673ec226f6fae99ab58cb1cff90a795b03a17f80
Author: Jason Molenda <jmolenda at apple.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M lldb/test/API/macosx/simulator/TestSimulatorPlatform.py
Log Message:
-----------
[lldb][macosx] Pare down the SimulatorPlatform tests, don't run on Intel (#203417)
TestSimulatorPlatform.py is flaky on the Intel Mac CI: it runs
'platform process list' and never gets a reply. The issue only occurs
on the old Intel Mac CI, not on the newer arm64 CIs, and it is not
important, so I am skipping this test on the Intel Mac CI.
I'm also removing test runs that try to build 32-bit binaries (i386,
armv7k) and the tests that explicitly check for a legacy build load
command (e.g. `LC_VERSION_MIN_IPHONEOS`); we no longer support building
for OS versions old enough to use those.
Commit: bd1de35f75bf1b08c02939fc9c4b67c45630d902
https://github.com/llvm/llvm-project/commit/bd1de35f75bf1b08c02939fc9c4b67c45630d902
Author: Xyloflake <xyloflake at gmail.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M llvm/lib/TargetParser/Host.cpp
Log Message:
-----------
Add Apple M5 host detection (#200616)
The values have been taken from `mach/machine.h` in the Xcode 26.5 SDK.
Commit: bfca0db5bd17ab116cd70464d7e1099b07a5872b
https://github.com/llvm/llvm-project/commit/bfca0db5bd17ab116cd70464d7e1099b07a5872b
Author: Helena Kotas <hekotas at microsoft.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M llvm/include/llvm/Frontend/HLSL/CBuffer.h
M llvm/lib/Frontend/HLSL/CBuffer.cpp
M llvm/lib/Target/DirectX/DXILCBufferAccess.cpp
M llvm/lib/Target/SPIRV/SPIRVCBufferAccess.cpp
Log Message:
-----------
[NFC][DirectX][SPIRV] Remove helper from HLSLFrontend (#203401)
Remove the helper function `removeCBufferGlobalsFromUseList`, which
duplicates the existing `llvm::removeFromUsedLists`. Modify the passes
that use it to call `llvm::removeFromUsedLists` instead.
Follow-up on
https://github.com/llvm/llvm-project/pull/202745#discussion_r3398119720
Commit: 10836c2234c480ac943850fec9359c83dba2d45d
https://github.com/llvm/llvm-project/commit/10836c2234c480ac943850fec9359c83dba2d45d
Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin at amd.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/lib/Target/AMDGPU/SIInstrInfo.td
M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
M llvm/lib/Target/AMDGPU/VOP3PInstructions.td
M llvm/test/CodeGen/AMDGPU/packed-fp64.ll
M llvm/test/MC/AMDGPU/gfx1251_asm_vop3p.s
M llvm/test/MC/AMDGPU/gfx1251_err.s
M llvm/test/MC/Disassembler/AMDGPU/gfx1251_dasm_vop3p.txt
Log Message:
-----------
[AMDGPU] Add gfx1251 V_PK_FMA_F64 (#203418)
Commit: 2e708e7c90e611a57f7e748347acc4dbd540e43a
https://github.com/llvm/llvm-project/commit/2e708e7c90e611a57f7e748347acc4dbd540e43a
Author: William Tran-Viet <wtranviet at proton.me>
Date: 2026-06-12 (Fri, 12 Jun 2026)
Changed paths:
M libcxx/include/optional
M libcxx/test/std/utilities/optional/optional.syn/optional_in_place_t.verify.cpp
M libcxx/test/std/utilities/optional/optional.syn/optional_nullopt_t.verify.cpp
Log Message:
-----------
[libc++] Reimplement `optional<T&>` as a partial specialization (#201983)
- Break out the `optional<T&>` implementation that was interwoven with
the regular interface
- This should simplify the implementation
- And add a constraint that had been missed for the `std::swap` specialization.
Commit: 9f1e08fa8ed7bcf4b7cfaf9eaaa7c23a2d3ed347
https://github.com/llvm/llvm-project/commit/9f1e08fa8ed7bcf4b7cfaf9eaaa7c23a2d3ed347
Author: Ziqing Luo <ziqing_luo at apple.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M clang/lib/ScalableStaticAnalysisFramework/Analyses/PointerFlow/PointerFlowExtractor.cpp
M clang/lib/ScalableStaticAnalysisFramework/Analyses/SSAFAnalysesCommon.h
M clang/lib/ScalableStaticAnalysisFramework/Analyses/UnsafeBufferUsage/UnsafeBufferUsageExtractor.cpp
M clang/unittests/ScalableStaticAnalysisFramework/Analyses/PointerFlow/PointerFlowTest.cpp
M clang/unittests/ScalableStaticAnalysisFramework/Analyses/UnsafeBufferUsage/UnsafeBufferUsageTest.cpp
Log Message:
-----------
[SSAF][Extractor] Make hard errors in PointerFlow and UnsafeBufferUsage Extractors quiet (#201953)
Hard errors were used in extractors during development to quickly
identify unsupported language constructs. This commit converts them to
DEBUG_WITH_TYPE so that these errors are silenced in release builds.
In addition, translating unsupported language constructs now silently
results in an empty EntityPointerLevelSet. The PointerFlowExtractor will
skip empty sets for either the source or the destination when building
edges to avoid an ill-formed edge set data structure.
rdar://178747892
---------
Co-authored-by: Balázs Benics <benicsbalazs at gmail.com>
Commit: 48813264376be9868bec78fa1653bc9003db512f
https://github.com/llvm/llvm-project/commit/48813264376be9868bec78fa1653bc9003db512f
Author: SeongJaePark <111725324+SeongjaeP at users.noreply.github.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M mlir/lib/Dialect/Vector/Transforms/LowerVectorContract.cpp
M mlir/test/Dialect/Vector/vector-contract-to-parallel-arith-transforms.mlir
Log Message:
-----------
[mlir][vector][NFC] Document and simplify vector contract reshape helpers (#200544)
Replace the bare `// TODO` placeholders above `reshapeLoad` and `reshapeStore`
in `LowerVectorContract.cpp` with doc comments describing each helper's
semantics (the `index == -1` / `index == 0` / `index > 0` cases) and short
before/after IR examples.
Also drop the redundant `VectorType` parameter from both helpers; the type is
now inferred from the value operand (`val` for `reshapeLoad`, `result` for
`reshapeStore`).
Add a LIT test that exercises the recursive `reshapeLoad` unroll path by
mapping a parallel iterator to a non-leading dimension, which falls through
to the generic `lowerParallel` / `reshapeLoad` path.
No functional change.
---------
Co-authored-by: sjae.park <dev at opt-ai.kr>
Commit: 7597c0c938b364306cbee21dd03835ff5b699c74
https://github.com/llvm/llvm-project/commit/7597c0c938b364306cbee21dd03835ff5b699c74
Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin at amd.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.wmma.gfx1250.w32.fmt-error.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.wmma.gfx1250.w32.ll
M llvm/test/MC/AMDGPU/gfx1250_asm_wmma_w32.s
M llvm/test/MC/AMDGPU/gfx1250_asm_wmma_w32_err.s
M llvm/test/MC/Disassembler/AMDGPU/gfx1250_dasm_wmma_w32.txt
Log Message:
-----------
[AMDGPU] Validate WMMA scale/format combination (#203074)
Only some combinations are listed as supported.
Fixes: https://github.com/ROCm/llvm-project/issues/2634
Commit: 3ef102be08a0b9c3bd948e651d70f36dc73cc5ae
https://github.com/llvm/llvm-project/commit/3ef102be08a0b9c3bd948e651d70f36dc73cc5ae
Author: Ziqing Luo <ziqing_luo at apple.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M clang/lib/ScalableStaticAnalysisFramework/Analyses/PointerFlow/PointerFlowExtractor.cpp
M clang/unittests/ScalableStaticAnalysisFramework/Analyses/PointerFlow/PointerFlowTest.cpp
Log Message:
-----------
[SSAF][PointerFlowExtractor] Handle empty initializer lists for scalars and unions (#201968)
Empty initializer lists for scalars and unions are corner cases that
were previously missed by the PointerFlowExtractor. This commit adds
support for these cases. Because the initializing values are implicitly
constant 0 (or nullptr), they do not bridge entity pointer levels.
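
For reference, the two corner cases look like this in source form. This is a small sketch with hypothetical names (`PtrOrBits` and the two functions), showing only the language behavior the message relies on: an empty initializer list gives a scalar pointer, or the first member of a union, a null value.

```cpp
#include <cassert>

// Hypothetical union whose first member is a pointer.
union PtrOrBits {
  int *ptr;
  long bits;
};

// An empty initializer list value-initializes a scalar, so the pointer is null.
bool emptyInitScalarIsNull() {
  int *p = {};
  return p == nullptr;
}

// For a union, the empty list initializes the first member, here to nullptr.
bool emptyInitUnionIsNull() {
  PtrOrBits u = {};
  return u.ptr == nullptr;
}
```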
rdar://178856689
---------
Co-authored-by: Balázs Benics <benicsbalazs at gmail.com>
Commit: d583701b350c8b8767cc8e6c17f03aa6336cfc1d
https://github.com/llvm/llvm-project/commit/d583701b350c8b8767cc8e6c17f03aa6336cfc1d
Author: Prabhu Rajasekaran <prabhukr at google.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M libc/config/baremetal/config.json
Log Message:
-----------
[libc] Enable baremetal printf float320 (#203421)
For memory-constrained baremetal devices, using float320 seems a
reasonable default.
Commit: d543c49ad6bf02434abc5db85332d5ab45e6e3ca
https://github.com/llvm/llvm-project/commit/d543c49ad6bf02434abc5db85332d5ab45e6e3ca
Author: yueshe <yueshe1996 at gmail.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M clang/lib/Driver/ToolChains/Darwin.cpp
M clang/test/Driver/darwin-objc-selector-stubs.m
Log Message:
-----------
[clang][Darwin] Disable ObjC class selector stubs when using LLD (#203388)
LLD does not support ObjC class selector stubs yet (which requires
synthesizing `objc_msgSendClass$...` stubs). This change disables
`-fobjc-msgsend-class-selector-stubs` by default when the linker is LLD.
Ref: https://github.com/llvm/llvm-project/issues/203385
Commit: 95746377678925adfe39c83081b4144d92c5d3b8
https://github.com/llvm/llvm-project/commit/95746377678925adfe39c83081b4144d92c5d3b8
Author: SiHuaN <liyongtai at iscas.ac.cn>
Date: 2026-06-12 (Fri, 12 Jun 2026)
Changed paths:
M clang/lib/Headers/riscv_packed_simd.h
M clang/test/CodeGen/RISCV/rvp-intrinsics.c
M cross-project-tests/intrinsic-header-tests/riscv_packed_simd.c
Log Message:
-----------
[Clang][RISCV] packed comparison intrinsics (#203191)
Add header wrappers for pmseq/pmsne/pmslt[u]/pmsgt[u]/pmsge[u]/pmsle[u]
as element-wise vector comparisons cast to the unsigned result type.
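
The idea of "an element-wise vector comparison cast to the unsigned result type" can be sketched with the GCC/Clang vector extensions. The type names and `sketchPackedEq` below are hypothetical, not the names used in `riscv_packed_simd.h`, and the all-ones encoding of a true lane is the behavior of the vector extension, assumed here rather than taken from the header.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical 4 x 8-bit packed types built on the GCC/Clang vector extension.
typedef int8_t v4i8 __attribute__((vector_size(4)));
typedef uint8_t v4u8 __attribute__((vector_size(4)));

// Element-wise equality: each lane of the comparison is -1 (all bits set)
// when the lanes match and 0 otherwise; the cast reinterprets the signed
// mask as the unsigned result type, so a true lane reads as 0xFF.
v4u8 sketchPackedEq(v4i8 a, v4i8 b) { return (v4u8)(a == b); }
```

With `a = {1, 2, 3, 4}` and `b = {1, 0, 3, 0}`, lanes 0 and 2 of the result are 0xFF and lanes 1 and 3 are 0.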
Commit: 7dcd1d2ad104c3f9748370a42dc775cd6e7e34dc
https://github.com/llvm/llvm-project/commit/7dcd1d2ad104c3f9748370a42dc775cd6e7e34dc
Author: Ziqing Luo <ziqing_luo at apple.com>
Date: 2026-06-12 (Fri, 12 Jun 2026)
Changed paths:
M clang/lib/ScalableStaticAnalysisFramework/Analyses/PointerFlow/PointerFlowExtractor.cpp
M clang/lib/ScalableStaticAnalysisFramework/Analyses/SSAFAnalysesCommon.h
M clang/lib/ScalableStaticAnalysisFramework/Analyses/UnsafeBufferUsage/UnsafeBufferUsageExtractor.cpp
M clang/unittests/ScalableStaticAnalysisFramework/Analyses/PointerFlow/PointerFlowTest.cpp
M clang/unittests/ScalableStaticAnalysisFramework/Analyses/UnsafeBufferUsage/UnsafeBufferUsageTest.cpp
Log Message:
-----------
Revert "[SSAF][Extractor] Make hard errors in PointerFlow and UnsafeBufferUsage Extractors quiet (#201953)" (#203432)
This reverts commit 9f1e08fa8ed7bcf4b7cfaf9eaaa7c23a2d3ed347.
It causes build error:
https://lab.llvm.org/buildbot/#/builders/2/builds/53597.
The use of 'setCurrentDebugType' should be guarded by '#ifndef NDEBUG'
Commit: 8433cf6d6ccc49a0fd99ed86c7d94a1fbb2fd1be
https://github.com/llvm/llvm-project/commit/8433cf6d6ccc49a0fd99ed86c7d94a1fbb2fd1be
Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin at amd.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
M llvm/lib/Target/AMDGPU/AMDGPUInstructionSelector.cpp
M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/lib/Target/AMDGPU/SIInstructions.td
M llvm/test/CodeGen/AMDGPU/packed-fp64.ll
Log Message:
-----------
[AMDGPU] Make v2f64 fneg legal on gfx1251 (#203427)
Commit: b9704de13ca5ce3158c6681839162a791042c6f9
https://github.com/llvm/llvm-project/commit/b9704de13ca5ce3158c6681839162a791042c6f9
Author: David Zbarsky <dzbarsky at gmail.com>
Date: 2026-06-12 (Fri, 12 Jun 2026)
Changed paths:
M llvm/lib/Passes/PassBuilder.cpp
Log Message:
-----------
[PassBuilder] Table-drive pass name printing (#202656)
Replace the macro-expanded raw_ostream operations in
PassBuilder::printPassNames with static pass-name arrays and two shared
noinline printing loops. Preserve the generated category order and the
exact spelling of parameterized pass names.
The change only executes when a client requests the pass-name listing;
normal pipeline parsing and optimization do not access the new tables or
helpers.
A stripped opt binary shrinks from 115,493,720 to 115,394,640 bytes,
saving 99,080 bytes. The linked __TEXT section shrinks by 98,304 bytes.
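
The table-driven shape can be illustrated in isolation. This is a minimal sketch under assumptions: the pass names, category headers, and function names are made up, `std::ostream` stands in for `raw_ostream`, and the real change uses two shared loops where this uses one. The point is that the names live in static arrays and a single non-inlined loop prints them, instead of one stream expression per name.

```cpp
#include <cassert>
#include <cstddef>
#include <ostream>
#include <sstream>
#include <string>

#if defined(__GNUC__)
#define SKETCH_NOINLINE __attribute__((noinline))
#else
#define SKETCH_NOINLINE
#endif

// Hypothetical name tables; the real ones would be generated from the
// pass registry.
static const char *const kModulePassNames[] = {"alpha", "beta"};
static const char *const kFunctionPassNames[] = {"gamma", "delta<opt>"};

// One shared loop prints any table, so code size no longer grows with the
// number of names.
SKETCH_NOINLINE static void printNames(std::ostream &os, const char *header,
                                       const char *const *names,
                                       std::size_t count) {
  os << header << ":\n";
  for (std::size_t i = 0; i < count; ++i)
    os << "  " << names[i] << "\n";
}

// Prints the categories in a fixed order and returns the listing.
std::string listPassNames() {
  std::ostringstream os;
  printNames(os, "Module passes", kModulePassNames,
             sizeof(kModulePassNames) / sizeof(kModulePassNames[0]));
  printNames(os, "Function passes", kFunctionPassNames,
             sizeof(kFunctionPassNames) / sizeof(kFunctionPassNames[0]));
  return os.str();
}
```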
Work towards #202616
AI tool disclosure: Co-authored with OpenAI Codex.
Commit: 15fdc79231b9d32a6bef89f5fc89105b8d3f9363
https://github.com/llvm/llvm-project/commit/15fdc79231b9d32a6bef89f5fc89105b8d3f9363
Author: Jordan Rupprecht <rupprecht at google.com>
Date: 2026-06-12 (Fri, 12 Jun 2026)
Changed paths:
M utils/bazel/llvm-project-overlay/llvm/BUILD.bazel
Log Message:
-----------
[bazel][DirectX] Fix 2bccbf23edddf216ef060d34443f60f644d0fb06 (#203442)
Add new dep on MC
Commit: 56f8fbb0149146f50aa8c905467b9384e8cb7bad
https://github.com/llvm/llvm-project/commit/56f8fbb0149146f50aa8c905467b9384e8cb7bad
Author: TelGome <93700071+TelGome at users.noreply.github.com>
Date: 2026-06-12 (Fri, 12 Jun 2026)
Changed paths:
M clang/include/clang/Basic/BuiltinsRISCV.td
M clang/lib/CodeGen/TargetBuiltins/RISCV.cpp
M clang/lib/Headers/riscv_packed_simd.h
M clang/test/CodeGen/RISCV/rvp-intrinsics.c
M cross-project-tests/intrinsic-header-tests/riscv_packed_simd.c
M llvm/include/llvm/IR/IntrinsicsRISCV.td
M llvm/lib/Target/RISCV/RISCVISelLowering.cpp
M llvm/lib/Target/RISCV/RISCVInstrInfoP.td
M llvm/test/CodeGen/RISCV/rvp-simd-32.ll
M llvm/test/CodeGen/RISCV/rvp-simd-64.ll
Log Message:
-----------
[RISCV][P-ext] Support Packed Averaging Addition and Subtraction intrinsics (#203147)
Commit: 69371e6c313063fac7fb580c126cb17996d71a01
https://github.com/llvm/llvm-project/commit/69371e6c313063fac7fb580c126cb17996d71a01
Author: Jordan Rupprecht <rupprecht at google.com>
Date: 2026-06-12 (Fri, 12 Jun 2026)
Changed paths:
M utils/bazel/llvm-project-overlay/libc/BUILD.bazel
Log Message:
-----------
[bazel][libc] Fix 8acfc364e9f788367ff0beab5c76a3527a689a0b (#203443)
Add extra htons yaml deps
Commit: 0591eef767078a0a079469656658c4a2fc017bbe
https://github.com/llvm/llvm-project/commit/0591eef767078a0a079469656658c4a2fc017bbe
Author: Nikolas Klauser <nikolasklauser at berlin.de>
Date: 2026-06-12 (Fri, 12 Jun 2026)
Changed paths:
M llvm/lib/Transforms/InstCombine/InstCombineCalls.cpp
M llvm/test/Transforms/InstCombine/assume.ll
Log Message:
-----------
[InstCombine] Move noundef assume bundles on loads into metadata (#203395)
Commit: beb2614e3ec915069380107409e88d002404aee9
https://github.com/llvm/llvm-project/commit/beb2614e3ec915069380107409e88d002404aee9
Author: Jordan Rupprecht <rupprecht at google.com>
Date: 2026-06-12 (Fri, 12 Jun 2026)
Changed paths:
M utils/bazel/llvm-project-overlay/libc/BUILD.bazel
Log Message:
-----------
[bazel][libc] Fix 582643f1ec62d0c81d97afcf1b741babb3152728 (#203449)
Add dep for dyadic float -> attributes
Commit: 2a6cfc5643053117b3baa301b91afaf9fc5edede
https://github.com/llvm/llvm-project/commit/2a6cfc5643053117b3baa301b91afaf9fc5edede
Author: Aiden Grossman <aidengrossman at google.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M compiler-rt/test/fuzzer/features_dir.test
M compiler-rt/test/fuzzer/merge-posix.test
Log Message:
-----------
[Fuzzer] Make two tests compatible with the internal shell. (#203448)
1. Remove redundant parentheses that broke the internal shell's parsing
logic.
2. Use env when specifying environment variables.
3. Rewrite a bash one-line loop in python.
Commit: 62847ab7593df144a5708406e6240ec5e8a3a101
https://github.com/llvm/llvm-project/commit/62847ab7593df144a5708406e6240ec5e8a3a101
Author: Jiahao Guo <eoonguo at gmail.com>
Date: 2026-06-12 (Fri, 12 Jun 2026)
Changed paths:
M clang/include/clang/CIR/Dialect/IR/CIROps.td
M clang/lib/CIR/Lowering/DirectToLLVM/LowerToLLVM.cpp
Log Message:
-----------
[CIR] Support zero/one result trivial operation lower via tablegen (#203183)
### summary
Lowering of zero-result operations was already supported in this PR:
https://github.com/llvm/llvm-project/pull/202273
In this PR, trivial operations with zero or one result are lowered
automatically via TableGen. This helps reduce the size of the file
`clang/lib/CIR/Lowering/DirectToLLVM/LowerToLLVM.cpp`
#### test
I believe the existing lowering tests cover this PR, so I didn't add
more tests.
Assisted-by: Claude Opus 4.8
Commit: c1991da69a5f6bd7117224c68f2f66546264a743
https://github.com/llvm/llvm-project/commit/c1991da69a5f6bd7117224c68f2f66546264a743
Author: Srinivasa Ravi <srinivasar at nvidia.com>
Date: 2026-06-12 (Fri, 12 Jun 2026)
Changed paths:
M mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
M mlir/lib/Dialect/LLVMIR/IR/NVVMDialect.cpp
M mlir/test/Conversion/NVVMToLLVM/nvvm-to-llvm.mlir
M mlir/test/Target/LLVMIR/nvvm/barrier.mlir
Log Message:
-----------
[MLIR][NVVM] Update nvvm.barrier.arrive Op (#202608)
This change updates the `nvvm.barrier.arrive` Op to lower using
intrinsics instead of inline PTX. It also adds a new `aligned` attribute
to the Op to lower to both aligned and unaligned forms.
PTX Spec Reference:
https://docs.nvidia.com/cuda/parallel-thread-execution/#parallel-synchronization-and-communication-instructions-bar
Commit: f3f7317aaf39928329dd453a6a340e6ddd027b81
https://github.com/llvm/llvm-project/commit/f3f7317aaf39928329dd453a6a340e6ddd027b81
Author: Changpeng Fang <changpeng.fang at amd.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M llvm/lib/Target/AMDGPU/VOP1Instructions.td
M llvm/test/CodeGen/AMDGPU/bf16-math.ll
M llvm/test/MC/AMDGPU/gfx1250_asm_vop1_err.s
M llvm/test/MC/AMDGPU/gfx1250_asm_vop3_from_vop1-fake16.s
M llvm/test/MC/AMDGPU/gfx1250_asm_vop3_from_vop1.s
M llvm/test/MC/Disassembler/AMDGPU/gfx1250_dasm_vop3_from_vop1.txt
Log Message:
-----------
[AMDGPU] Add MC clamp support for bf16 trans instructions (#203433)
Based on recent gfx1250 sp3 update. Refer to DEGFXSP3-664
Commit: 8679ab6267aeae9971cae4ed7eff9966e813e28d
https://github.com/llvm/llvm-project/commit/8679ab6267aeae9971cae4ed7eff9966e813e28d
Author: Hoàng Minh Thiên <hoangminhthien05022009 at gmail.com>
Date: 2026-06-12 (Fri, 12 Jun 2026)
Changed paths:
M libc/test/src/math/RoundToIntegerTest.h
Log Message:
-----------
[libc] [math] Fix build bot failure introduced by unit test in PR #201154 (#203457)
The root cause is that the unit test
`libc/test/src/math/RoundToIntegerTest.h` includes `<cfenv>`, which
requires the macro `__GLIBC_PREREQ` to be defined, and that macro does
not appear to be defined in the riscv32 runtime.
The test works fine without the include, so removing it resolves the
failure.
Commit: 89a5c690e51a0ea33c962842f90a8ec89965afa3
https://github.com/llvm/llvm-project/commit/89a5c690e51a0ea33c962842f90a8ec89965afa3
Author: Craig Topper <craig.topper at sifive.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M clang/lib/CodeGen/Targets/RISCV.cpp
Log Message:
-----------
[RISCV] Return the type from detectVLSCCEligibleStruct instead of using an output argument. NFC (#203423)
We can replace the previous bool return with the type and use nullptr for
false.
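
The refactoring pattern is the usual one of folding a bool result and an output argument into a single nullable pointer. A small sketch with hypothetical names follows (`SketchType` and both functions are illustrative, not the signatures in `RISCV.cpp`):

```cpp
#include <cassert>

// Hypothetical stand-in for a type object.
struct SketchType {
  int numElements;
};

// Before: the bool says whether `out` was set.
bool detectEligibleOld(SketchType *candidate, SketchType *&out) {
  if (candidate == nullptr || candidate->numElements <= 0)
    return false;
  out = candidate;
  return true;
}

// After: the returned pointer carries both facts; nullptr means "not eligible".
SketchType *detectEligible(SketchType *candidate) {
  if (candidate == nullptr || candidate->numElements <= 0)
    return nullptr;
  return candidate;
}
```

Callers can then write `if (SketchType *ty = detectEligible(c))` without declaring a separate output variable.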
Commit: 452f59c8dc30a3bc16c691f5862bc5b39d97319a
https://github.com/llvm/llvm-project/commit/452f59c8dc30a3bc16c691f5862bc5b39d97319a
Author: Madhur Amilkanthwar <madhura at nvidia.com>
Date: 2026-06-12 (Fri, 12 Jun 2026)
Changed paths:
M llvm/lib/Transforms/Utils/LoopUnroll.cpp
A llvm/test/Transforms/LoopUnroll/runtime-unroll-reductions-min-max.ll
Log Message:
-----------
Reapply "[LoopUnroll] Support parallel reductions for minmax" (#201010)
Reapplies 1e79ea1f5b3e (#182473) reverted by 56ccbc253150 (#200892). The
revert was due to a profcheck failure: prof-verify reported "select
annotation missing" on the combine select createMinMaxOp emits for FP
fcmp+select min/max.
This patch fixes it by marking the branch weights of newly inserted
selects as explicitly unknown.
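
What a parallel min/max reduction means after unrolling can be written out by hand. This is a source-level sketch only, using integers rather than the FP fcmp+select form mentioned above, with hypothetical function names; the final `std::min` of the two accumulators corresponds to the combine step.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Serial reduction: every iteration depends on the previous accumulator.
int minSerial(const std::vector<int> &v, int init) {
  int acc = init;
  for (std::size_t i = 0; i < v.size(); ++i)
    acc = std::min(acc, v[i]);
  return acc;
}

// Unrolled by two with independent accumulators, which shortens the
// dependency chain; the partial results are combined once at the end.
int minUnrolledBy2(const std::vector<int> &v, int init) {
  int acc0 = init, acc1 = init;
  std::size_t i = 0;
  for (; i + 1 < v.size(); i += 2) {
    acc0 = std::min(acc0, v[i]);
    acc1 = std::min(acc1, v[i + 1]);
  }
  for (; i < v.size(); ++i) // remainder iteration for odd lengths
    acc0 = std::min(acc0, v[i]);
  return std::min(acc0, acc1); // combine step
}
```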
Commit: efb038f38f23ee201ac872ae98668c2ef922f0fa
https://github.com/llvm/llvm-project/commit/efb038f38f23ee201ac872ae98668c2ef922f0fa
Author: ShashwathiNavada <shashwathinavada at gmail.com>
Date: 2026-06-12 (Fri, 12 Jun 2026)
Changed paths:
M flang/lib/Lower/OpenMP/OpenMP.cpp
M flang/test/Lower/OpenMP/Todo/error.f90
A flang/test/Lower/OpenMP/nothing.f90
Log Message:
-----------
[Flang][Openmp] Prevent TODO abort on nothing directive (#202679)
Since `nothing` is a no-op directive (OpenMP 5.2, 8.4), handle it during
lowering instead of falling through to the generic unimplemented
utility-directive path and triggering a TODO abort.
Commit: 06c7de45221621a0773d125ff7d016df87eb3d62
https://github.com/llvm/llvm-project/commit/06c7de45221621a0773d125ff7d016df87eb3d62
Author: Madhur Amilkanthwar <madhura at nvidia.com>
Date: 2026-06-12 (Fri, 12 Jun 2026)
Changed paths:
M llvm/lib/Transforms/Scalar/LoopFuse.cpp
Log Message:
-----------
[LoopFusion][NFC] Avoid copying fusion candidates per pair (#203461)
`fuseCandidates()` copied both candidates (each holding two
`SmallVector<Instruction *, 16>`) for every adjacent pair examined, even
pairs rejected by an early continue. Bind them by const reference; they
are only read before being erased from the list, and performFusion runs
before the erases.
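
The cost being removed can be made visible with a copy counter. The sketch below uses a hypothetical `Candidate` type (two vectors plus an id, loosely mirroring the description above) and counts copy-constructor calls while walking adjacent pairs, once binding by value and once by const reference.

```cpp
#include <cassert>
#include <iterator>
#include <list>
#include <vector>

// Hypothetical candidate that counts how often it is copied.
struct Candidate {
  static int copies;
  std::vector<int> memReads, memWrites;
  int id;
  explicit Candidate(int id) : id(id) {}
  Candidate(const Candidate &o)
      : memReads(o.memReads), memWrites(o.memWrites), id(o.id) {
    ++copies;
  }
};
int Candidate::copies = 0;

std::list<Candidate> makeCandidates(int n) {
  std::list<Candidate> l;
  for (int i = 0; i < n; ++i)
    l.emplace_back(i); // constructed in place, no copy
  return l;
}

// Binds each adjacent pair by value: two copies per pair examined.
int copiesWhenBoundByValue(const std::list<Candidate> &l) {
  int before = Candidate::copies, sum = 0;
  for (auto it = l.begin(); it != l.end() && std::next(it) != l.end(); ++it) {
    Candidate a = *it;
    Candidate b = *std::next(it);
    sum += a.id + b.id;
  }
  (void)sum;
  return Candidate::copies - before;
}

// Binds each adjacent pair by const reference: no copies at all.
int copiesWhenBoundByReference(const std::list<Candidate> &l) {
  int before = Candidate::copies, sum = 0;
  for (auto it = l.begin(); it != l.end() && std::next(it) != l.end(); ++it) {
    const Candidate &a = *it;
    const Candidate &b = *std::next(it);
    sum += a.id + b.id;
  }
  (void)sum;
  return Candidate::copies - before;
}
```

For three candidates there are two adjacent pairs, so the by-value loop makes four copies and the by-reference loop makes none. The references stay valid only as long as the elements are not erased, which is why the ordering noted above (reads and performFusion before the erases) matters.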
Commit: 15e7f2e8f2a14e5d574eb8894a4cbba8546291ca
https://github.com/llvm/llvm-project/commit/15e7f2e8f2a14e5d574eb8894a4cbba8546291ca
Author: Amir Ayupov <aaupov at fb.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M bolt/include/bolt/Profile/DataAggregator.h
M bolt/lib/Profile/DataAggregator.cpp
M bolt/test/X86/pre-aggregated-perf.test
Log Message:
-----------
[BOLT] Propagate DataAggregator parse errors
Propagate perf/preaggregated input parsing errors through DataAggregator
instead of terminating from per-input aggregation jobs.
This lets multi-input aggregation report failed inputs as warnings when
at least one input succeeds while returning errors when all inputs fail.
It also converts pre-aggregated parsing diagnostics to returned Error
values and removes worker-path exits from perf setup and parsing.
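
The reporting policy described above (failures become warnings if any input succeeded, errors if none did) can be sketched independently of BOLT. All names here are hypothetical, plain strings stand in for `llvm::Error`, and treating an empty input list as "not failed" is an assumption of this sketch.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical result of one per-input aggregation job.
struct InputResult {
  std::string name;
  bool ok;
  std::string message; // empty when ok
};

struct Outcome {
  bool failed;
  std::vector<std::string> warnings;
  std::vector<std::string> errors;
};

// Jobs return their status instead of exiting; the caller decides how to
// report once every job has finished.
Outcome summarize(const std::vector<InputResult> &results) {
  Outcome out{false, {}, {}};
  std::vector<std::string> failures;
  std::size_t successes = 0;
  for (const InputResult &r : results) {
    if (r.ok)
      ++successes;
    else
      failures.push_back(r.name + ": " + r.message);
  }
  if (successes == 0 && !failures.empty()) {
    out.failed = true; // every input failed: report errors
    out.errors = failures;
  } else {
    out.warnings = failures; // at least one input succeeded: warn only
  }
  return out;
}
```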
Test Plan: updated pre-aggregated-perf.test
Reviewers: maksfb, rafaelauler, ayermolo, yozhu, yavtuk
Reviewed By: rafaelauler
Pull Request: https://github.com/llvm/llvm-project/pull/200476
Commit: 2b04d746524f6970523ceda9a6a9489caa4f2337
https://github.com/llvm/llvm-project/commit/2b04d746524f6970523ceda9a6a9489caa4f2337
Author: Amir Ayupov <aaupov at fb.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M bolt/include/bolt/Profile/DataAggregator.h
M bolt/include/bolt/Profile/DataReader.h
M bolt/lib/Profile/DataAggregator.cpp
M bolt/test/X86/pre-aggregated-perf.test
Log Message:
-----------
[BOLT] Buffer DataAggregator diagnostics
To avoid interleaved error messages in the multi-perf case, provide a
diagnostics buffer and stream for each aggregator job.
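
The effect of per-job buffering can be shown deterministically without threads. In this sketch (hypothetical names; the interleaving is simulated by alternating between two jobs), writing straight to a shared stream mixes the jobs' lines, while giving each job its own buffer and flushing the buffers afterwards keeps each job's messages together.

```cpp
#include <cassert>
#include <cstddef>
#include <ostream>
#include <sstream>
#include <string>
#include <vector>

// Runs two simulated jobs that alternate, each emitting two messages.
// With bufferPerJob set, each job writes to its own buffer and the buffers
// are flushed to the shared stream one job at a time.
std::string runInterleaved(bool bufferPerJob) {
  const std::vector<std::string> jobs = {"perf1", "perf2"};
  std::ostringstream shared;
  std::vector<std::ostringstream> buffers(jobs.size());
  for (int step = 0; step < 2; ++step) {
    for (std::size_t j = 0; j < jobs.size(); ++j) {
      std::ostream &out = bufferPerJob ? buffers[j] : shared;
      out << jobs[j] << ": message " << step << "\n";
    }
  }
  for (std::ostringstream &b : buffers)
    shared << b.str();
  return shared.str();
}
```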
Test Plan: updated pre-aggregated-perf.test
Reviewers: yavtuk, maksfb, rafaelauler, ayermolo, paschalis-mpeis, yozhu
Reviewed By: yavtuk
Pull Request: https://github.com/llvm/llvm-project/pull/203464
Commit: 6c9232d47820859e77deb25169e55d53ba823093
https://github.com/llvm/llvm-project/commit/6c9232d47820859e77deb25169e55d53ba823093
Author: Amir Ayupov <aaupov at fb.com>
Date: 2026-06-11 (Thu, 11 Jun 2026)
Changed paths:
M bolt/include/bolt/Profile/DataAggregator.h
M bolt/include/bolt/Profile/DataReader.h
M bolt/lib/Profile/DataAggregator.cpp
M bolt/test/X86/pre-aggregated-perf.test
M bolt/unittests/Core/MemoryMaps.cpp
M clang/include/clang/Basic/BuiltinsRISCV.td
M clang/include/clang/CIR/Dialect/IR/CIROps.td
M clang/lib/CIR/Dialect/Transforms/TargetLowering/CIRABIRewriteContext.cpp
M clang/lib/CIR/Dialect/Transforms/TargetLowering/CIRABIRewriteContext.h
M clang/lib/CIR/Lowering/DirectToLLVM/LowerToLLVM.cpp
M clang/lib/CodeGen/TargetBuiltins/RISCV.cpp
M clang/lib/CodeGen/Targets/RISCV.cpp
M clang/lib/Driver/ToolChains/Darwin.cpp
M clang/lib/Headers/riscv_packed_simd.h
M clang/lib/ScalableStaticAnalysisFramework/Analyses/PointerFlow/PointerFlowExtractor.cpp
A clang/test/CIR/Transforms/abi-lowering/indirect-return-sret.cir
M clang/test/CodeGen/RISCV/rvp-intrinsics.c
M clang/test/Driver/darwin-objc-selector-stubs.m
M clang/unittests/ScalableStaticAnalysisFramework/Analyses/PointerFlow/PointerFlowTest.cpp
M compiler-rt/test/fuzzer/features_dir.test
M compiler-rt/test/fuzzer/merge-posix.test
M cross-project-tests/intrinsic-header-tests/riscv_packed_simd.c
M flang/lib/Lower/OpenMP/OpenMP.cpp
M flang/test/Lower/OpenMP/Todo/error.f90
A flang/test/Lower/OpenMP/nothing.f90
M libc/config/baremetal/config.json
M libc/test/src/math/RoundToIntegerTest.h
M libcxx/include/optional
M libcxx/test/std/utilities/optional/optional.syn/optional_in_place_t.verify.cpp
M libcxx/test/std/utilities/optional/optional.syn/optional_nullopt_t.verify.cpp
M lldb/test/API/macosx/simulator/TestSimulatorPlatform.py
M llvm/include/llvm/Frontend/HLSL/CBuffer.h
M llvm/include/llvm/IR/IntrinsicsRISCV.td
M llvm/lib/Frontend/HLSL/CBuffer.cpp
M llvm/lib/Passes/PassBuilder.cpp
M llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
M llvm/lib/Target/AMDGPU/AMDGPUInstructionSelector.cpp
M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp
M llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/lib/Target/AMDGPU/SIInstrInfo.td
M llvm/lib/Target/AMDGPU/SIInstructions.td
M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
M llvm/lib/Target/AMDGPU/VOP1Instructions.td
M llvm/lib/Target/AMDGPU/VOP3PInstructions.td
M llvm/lib/Target/DirectX/DXILCBufferAccess.cpp
M llvm/lib/Target/RISCV/RISCVISelLowering.cpp
M llvm/lib/Target/RISCV/RISCVInstrInfoP.td
M llvm/lib/Target/SPIRV/SPIRVCBufferAccess.cpp
M llvm/lib/TargetParser/Host.cpp
M llvm/lib/Transforms/InstCombine/InstCombineCalls.cpp
M llvm/lib/Transforms/Scalar/LoopFuse.cpp
M llvm/lib/Transforms/Utils/LoopUnroll.cpp
M llvm/test/CodeGen/AMDGPU/bf16-math.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.wmma.gfx1250.w32.fmt-error.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.wmma.gfx1250.w32.ll
M llvm/test/CodeGen/AMDGPU/packed-fp64.ll
M llvm/test/CodeGen/RISCV/rvp-simd-32.ll
M llvm/test/CodeGen/RISCV/rvp-simd-64.ll
M llvm/test/MC/AMDGPU/gfx1250_asm_vop1_err.s
M llvm/test/MC/AMDGPU/gfx1250_asm_vop3_from_vop1-fake16.s
M llvm/test/MC/AMDGPU/gfx1250_asm_vop3_from_vop1.s
M llvm/test/MC/AMDGPU/gfx1250_asm_wmma_w32.s
M llvm/test/MC/AMDGPU/gfx1250_asm_wmma_w32_err.s
M llvm/test/MC/AMDGPU/gfx1251_asm_vop3p.s
M llvm/test/MC/AMDGPU/gfx1251_err.s
M llvm/test/MC/Disassembler/AMDGPU/gfx1250_dasm_vop3_from_vop1.txt
M llvm/test/MC/Disassembler/AMDGPU/gfx1250_dasm_wmma_w32.txt
M llvm/test/MC/Disassembler/AMDGPU/gfx1251_dasm_vop3p.txt
M llvm/test/Transforms/InstCombine/assume.ll
A llvm/test/Transforms/LoopUnroll/runtime-unroll-reductions-min-max.ll
M mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
M mlir/lib/Dialect/LLVMIR/IR/NVVMDialect.cpp
M mlir/lib/Dialect/Vector/Transforms/LowerVectorContract.cpp
M mlir/test/Conversion/NVVMToLLVM/nvvm-to-llvm.mlir
M mlir/test/Dialect/Vector/vector-contract-to-parallel-arith-transforms.mlir
M mlir/test/Target/LLVMIR/nvvm/barrier.mlir
M utils/bazel/llvm-project-overlay/libc/BUILD.bazel
M utils/bazel/llvm-project-overlay/llvm/BUILD.bazel
Log Message:
-----------
[𝘀𝗽𝗿] changes introduced through rebase
Created using spr 1.3.8-beta.1
[skip ci]
Compare: https://github.com/llvm/llvm-project/compare/22c487ab7e45...6c9232d47820