[all-commits] [llvm/llvm-project] e27e4f: [flang][cuda] Add interfaces for __uint2float_rX f...

Fri Aug 15 08:01:21 PDT 2025

  Branch: refs/heads/users/alexey-bataev/spr/slp-prefer-copyable-vectorization-over-alternate-opcodes
  Home:   https://github.com/llvm/llvm-project
  Commit: e27e4f3a99911675215ca79eb95290a3ab77fe74
      https://github.com/llvm/llvm-project/commit/e27e4f3a99911675215ca79eb95290a3ab77fe74
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add interfaces for __uint2float_rX functions (#153620)

Also add bind name for __uint2double_rn

  Commit: ff0ce74be84bb1300dc5790842c85ad1f3d88174
      https://github.com/llvm/llvm-project/commit/ff0ce74be84bb1300dc5790842c85ad1f3d88174
  Author: Florian Hahn <flo at fhahn.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

  Log Message:
  -----------
  [VPlan] Replace scalar preheader with VPIRBB at single place (NFC).

Replace the scalar preheader VPBB with a VPIRBB wrapping the IR basic
block created by createVectorizedLoopSkeleton.

  Commit: c202d2f5157256208dcd2e46d910e633f52b74ad
      https://github.com/llvm/llvm-project/commit/c202d2f5157256208dcd2e46d910e633f52b74ad
  Author: Min-Yih Hsu <min.hsu at sifive.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/CodeGen/InterleavedAccessPass.cpp
    M llvm/test/CodeGen/RISCV/rvv/fixed-vectors-interleaved-access.ll

  Log Message:
  -----------
  [IA][RISCV] Recognizing gap masks assembled from bitwise AND (#153324)

For a deinterleaved masked.load / vp.load, if its mask, `%c`, is
synthesized by the following snippet:
```
%m = shufflevector %s, poison, <0, 0, 0, 1, 1, 1, 2, 2, 2, 3, 3, 3>
%g = <1, 1, 0, 1, 1, 0, 1, 1, 0, 1, 1, 0>
%c = and %m, %g
```
Then we know that `%g` is the gap mask and `%s` is the mask for each
field / component. This patch teaches the InterleavedAccess pass to
recognize such patterns.
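
The relationship between `%s`, `%g` and `%c` can be modeled outside of LLVM. The following is a minimal Python sketch, not the pass's implementation: the helper names `build_combined_mask` and `gap_mask_to_active_fields` are hypothetical, and masks are plain lists of 0/1 values.

```python
def build_combined_mask(field_mask, factor, gap_mask):
    """Model %m = shufflevector (each lane repeated `factor` times), then %c = and %m, %g."""
    widened = [bit for bit in field_mask for _ in range(factor)]
    if len(widened) != len(gap_mask):
        raise ValueError("gap mask length must equal len(field_mask) * factor")
    return [m & g for m, g in zip(widened, gap_mask)]


def gap_mask_to_active_fields(gap_mask, factor):
    """Return the per-field pattern if the gap mask repeats every `factor` lanes, else None."""
    pattern = gap_mask[:factor]
    for start in range(0, len(gap_mask), factor):
        if gap_mask[start:start + factor] != pattern:
            return None
    return pattern


s = [1, 0, 1, 1]      # per-segment mask (%s), hypothetical values
g = [1, 1, 0] * 4     # gap mask (%g): the third field is always skipped
c = build_combined_mask(s, 3, g)
```

With a factor of 3, a repeating `[1, 1, 0]` gap mask means only the first two fields of each segment are ever accessed.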

  Commit: f5d284309fe4f4e83f879f08356d149000bbc762
      https://github.com/llvm/llvm-project/commit/f5d284309fe4f4e83f879f08356d149000bbc762
  Author: Mircea Trofin <mtrofin at google.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/include/llvm/ProfileData/InstrProf.h
    M llvm/lib/Transforms/Scalar/JumpTableToSwitch.cpp
    M llvm/test/Transforms/JumpTableToSwitch/basic.ll

  Log Message:
  -----------
  [JTS] Propagate profile info (#153305)

If the indirect call target being recognized as a jump table has profile info, we can accurately synthesize the branch weights of the switch that replaces the indirect call.

Otherwise we insert the "unknown" `MD_prof` to indicate this is the best we can do here.
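
As a rough illustration of the idea (not the code in JumpTableToSwitch.cpp), the sketch below assumes the profile is available as a mapping from callee name to call count; the function name `synthesize_switch_weights` and the data layout are hypothetical.

```python
def synthesize_switch_weights(jump_table, value_profile):
    """Return one weight per switch case, or None when no profile exists.

    jump_table: list of callee names, indexed by the switch case value.
    value_profile: dict mapping callee name to its observed call count.
    """
    if not value_profile:
        return None  # stands in for the "unknown" profile annotation
    return [value_profile.get(target, 0) for target in jump_table]


table = ["f", "g", "h"]
weights = synthesize_switch_weights(table, {"f": 90, "h": 10})
```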

Part of Issue #147390

  Commit: 37cc010b91e0aaeced53d9261c1f4b511f603871
      https://github.com/llvm/llvm-project/commit/37cc010b91e0aaeced53d9261c1f4b511f603871
  Author: Thurston Dang <thurston at google.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M compiler-rt/test/asan/TestCases/Posix/fakestack_alignment.cpp

  Log Message:
  -----------
  [asan] Fix-forward undefined type in test from #153142 (#153636)

Fix Mac build breakage (reported by aeubanks in
https://github.com/llvm/llvm-project/pull/153142#issuecomment-3189202274)
by including stdint.h and using uintptr_t

  Commit: 98728d9dc8f38703bc3ebbbf32d45567acaa75cc
      https://github.com/llvm/llvm-project/commit/98728d9dc8f38703bc3ebbbf32d45567acaa75cc
  Author: Jianhui Li <jian.hui.li at intel.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M mlir/include/mlir/Dialect/XeGPU/Utils/XeGPUUtils.h
    M mlir/lib/Conversion/VectorToXeGPU/CMakeLists.txt
    M mlir/lib/Conversion/VectorToXeGPU/VectorToXeGPU.cpp
    M mlir/lib/Dialect/XeGPU/IR/XeGPUOps.cpp
    M mlir/lib/Dialect/XeGPU/Utils/CMakeLists.txt
    M mlir/lib/Dialect/XeGPU/Utils/XeGPUUtils.cpp
    M mlir/test/Conversion/VectorToXeGPU/transfer-read-to-xegpu.mlir
    M mlir/test/Conversion/VectorToXeGPU/transfer-write-to-xegpu.mlir
    M mlir/test/Dialect/XeGPU/invalid.mlir

  Log Message:
  -----------
  [MLIR][XeGPU] Add lowering from transfer_read/transfer_write to load_gather/store_scatter  (#152429)

Lower transfer_read/transfer_write to load_gather/store_scatter when
the target uArch doesn't support load_nd/store_nd. The high-level
steps:
  1. compute Strides;
  2. compute Offsets;
  3. collapseMemrefTo1D;
  4. create load_gather or store_scatter op
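
The four steps can be sketched in plain Python on a flat list. This is only a model under simplifying assumptions (a contiguous row-major buffer, no masking or out-of-bounds handling); the function names are hypothetical and are not the MLIR helpers.

```python
from itertools import product


def row_major_strides(shape):
    """Step 1: strides (in elements) of a contiguous row-major buffer."""
    strides = [1] * len(shape)
    for i in range(len(shape) - 2, -1, -1):
        strides[i] = strides[i + 1] * shape[i + 1]
    return strides


def gather_offsets(strides, start, tile_shape):
    """Step 2: linear offset of every element of the tile that begins at `start`."""
    base = sum(i * s for i, s in zip(start, strides))
    return [
        base + sum(i * s for i, s in zip(idx, strides))
        for idx in product(*(range(n) for n in tile_shape))
    ]


def load_gather(flat, offsets):
    """Steps 3 and 4: read from the 1-D view of the buffer at the given offsets."""
    return [flat[o] for o in offsets]


flat = list(range(4 * 8))               # a 4x8 buffer, already collapsed to 1-D
strides = row_major_strides([4, 8])
offsets = gather_offsets(strides, start=[1, 2], tile_shape=[2, 3])
tile = load_gather(flat, offsets)
```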

  Commit: db98ac43ec13805e1876c389933eb7155ee1f073
      https://github.com/llvm/llvm-project/commit/db98ac43ec13805e1876c389933eb7155ee1f073
  Author: Florian Hahn <flo at fhahn.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
    M llvm/test/Transforms/LoopVectorize/AArch64/conditional-branches-cost.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/divs-with-scalable-vfs.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/eliminate-tail-predication.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/gather-do-not-vectorize-addressing.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/induction-costs-sve.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/interleave-with-gaps.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/interleaving-reduction.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/low_trip_count_predicates.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/masked-call.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/optsize_minsize.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/outer_loop_prefer_scalable.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-chained.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-dot-product-mixed.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-dot-product.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-sub.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/pr60831-sve-inv-store-crash.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/reduction-recurrence-costs-sve.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/scalable-avoid-scalarization.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/scalable-reduction-inloop-cond.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/scalable-strict-fadd.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/simple_early_exit.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/single-early-exit-interleave.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/store-costs-sve.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-epilog-vect-inloop-reductions.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-epilog-vect-reductions.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-epilog-vect-strict-reductions.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-epilog-vect.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-fneg.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-inductions-unusual-types.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-inv-store.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-live-out-pointer-induction.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-multiexit.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-runtime-check-size-based-threshold.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-tail-folding-forced.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-tail-folding-reductions.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-tail-folding-unroll.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-tail-folding.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-vscale-based-trip-counts.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-widen-extractvalue.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-widen-gep.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/tail-folding-styles.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/dead-ops-cost.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/fminimumnum.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/inloop-reduction.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/partial-reduce-dot-product.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/riscv-vector-reverse.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/scalable-tailfold.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-bin-unary-ops-args.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-call-intrinsics.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-cast-intrinsics.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-cond-reduction.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-div.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-fixed-order-recurrence.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-gather-scatter.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-inloop-reduction.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-interleave.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-intermediate-store.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-iv32.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-masked-loadstore.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-ordered-reduction.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-reduction.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-reverse-load-store.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-safe-dep-distance.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/uniform-load-store.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/vectorize-vp-intrinsics.ll
    M llvm/test/Transforms/LoopVectorize/outer_loop_scalable.ll
    M llvm/test/Transforms/LoopVectorize/scalable-assume.ll
    M llvm/test/Transforms/LoopVectorize/scalable-first-order-recurrence.ll
    M llvm/test/Transforms/LoopVectorize/scalable-iv-outside-user.ll
    M llvm/test/Transforms/LoopVectorize/scalable-lifetime.ll
    M llvm/test/Transforms/LoopVectorize/scalable-loop-unpredicated-body-scalar-tail.ll
    M llvm/test/Transforms/LoopVectorize/scalable-predication.ll
    M llvm/test/Transforms/LoopVectorize/scalable-reduction-inloop.ll
    M llvm/test/Transforms/LoopVectorize/scalable-trunc-min-bitwidth.ll
    M llvm/test/Transforms/LoopVectorize/vectorize-force-tail-with-evl.ll
    M llvm/test/Transforms/PhaseOrdering/AArch64/sve-interleave-vectorization.ll

  Log Message:
  -----------
  [LV] Use shl for ((VF * Step) * vscale) in createStepForVF. (#153495)

Directly emit shl instead of a multiply if VF * Step is a power-of-2. The
main motivation here is to prepare the code and test for directly
generating and expanding a SCEV expression of the minimum iteration
count. SCEVExpander will directly emit shl for multiplies with
powers-of-2.

InstCombine also performs this combine, so end-to-end this should
effectively be NFC.
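
The arithmetic behind the change is the usual multiply-to-shift identity. A small Python sketch, where `step_for_vf` is a hypothetical stand-in and not the LLVM function:

```python
def step_for_vf(vscale, vf, step):
    """Compute (vf * step) * vscale, using a shift when vf * step is a power of two."""
    factor = vf * step
    if factor > 0 and factor & (factor - 1) == 0:
        # factor == 2**k, so multiplying by it equals shifting left by k.
        return vscale << (factor.bit_length() - 1)
    return vscale * factor
```

Both branches return the same value as the plain product; only the emitted operation differs.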

PR: https://github.com/llvm/llvm-project/pull/153495

  Commit: 20a829937cc8cd69170b75c0bb7f31ad9ba19677
      https://github.com/llvm/llvm-project/commit/20a829937cc8cd69170b75c0bb7f31ad9ba19677
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add interfaces for __expf and __exp10f (#153633)

  Commit: b62b65a95f2b5e79e90f3f957e7a52ec50c5fe31
      https://github.com/llvm/llvm-project/commit/b62b65a95f2b5e79e90f3f957e7a52ec50c5fe31
  Author: Jonas Devlieghere <jonas at devlieghere.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M lldb/source/Plugins/ScriptInterpreter/Python/ScriptInterpreterPython.cpp

  Log Message:
  -----------
  [lldb] Use (only) PyImport_AppendInittab to patch readline (#153329)

The current implementation tries to (1) patch the existing readline
module definition if it's already present in the inittab and (2) append
our patched readline module to the inittab. The former (1) uses the
non-stable Python API and I can't find a situation where this is
necessary. 

We do this work before initialization, so for the readline
module to exist, it either needs to be added by Python itself (which
doesn't seem to be the case), or someone would have had to have added it
without initializing.

  Commit: 016c301d30fc9e5d8a3e42fb9a5cd990bf2b164f
      https://github.com/llvm/llvm-project/commit/016c301d30fc9e5d8a3e42fb9a5cd990bf2b164f
  Author: Mircea Trofin <mtrofin at google.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Transforms/Scalar/JumpTableToSwitch.cpp

  Log Message:
  -----------
  [NFC] Use `[[maybe_unused]]` for variable used in assertion (#153639)

  Commit: cbfc22c06b88d1f0176559a5f41ddc1f003b4c6f
      https://github.com/llvm/llvm-project/commit/cbfc22c06b88d1f0176559a5f41ddc1f003b4c6f
  Author: Kaitlin Peng <kaitlinpeng at microsoft.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M clang/lib/Headers/hlsl/hlsl_alias_intrinsics.h

  Log Message:
  -----------
  Fix typo in `step` intrinsic comment (#153642)

`y` should be the first argument and `x` should be the second, otherwise
the formula is wrong. This also matches the documentation
[here](https://learn.microsoft.com/en-us/windows/win32/direct3dhlsl/dx-graphics-hlsl-step).
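
For reference, the corrected formula with `y` as the first argument can be written as a scalar Python sketch (the HLSL intrinsic also applies componentwise to vectors, which this sketch does not model):

```python
def step(y, x):
    """Return 1.0 when x >= y, otherwise 0.0."""
    return 1.0 if x >= y else 0.0
```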

  Commit: 4f007041a87b1b0c2686d1871ea6e35394545865
      https://github.com/llvm/llvm-project/commit/4f007041a87b1b0c2686d1871ea6e35394545865
  Author: Erick Velez <erickvelez7 at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M clang-tools-extra/clang-doc/HTMLMustacheGenerator.cpp
    M clang-tools-extra/clang-doc/JSONGenerator.cpp
    M clang-tools-extra/test/clang-doc/basic-project.mustache.test
    M clang-tools-extra/test/clang-doc/json/class-requires.cpp
    M clang-tools-extra/test/clang-doc/json/class-specialization.cpp
    M clang-tools-extra/test/clang-doc/json/class-template.cpp
    M clang-tools-extra/test/clang-doc/json/class.cpp
    M clang-tools-extra/test/clang-doc/json/compound-constraints.cpp
    M clang-tools-extra/test/clang-doc/json/concept.cpp
    M clang-tools-extra/test/clang-doc/json/function-requires.cpp
    M clang-tools-extra/test/clang-doc/json/function-specifiers.cpp
    M clang-tools-extra/test/clang-doc/json/method-template.cpp
    M clang-tools-extra/test/clang-doc/json/namespace.cpp
    M clang-tools-extra/test/clang-doc/json/nested-namespace.cpp
    M clang-tools-extra/test/clang-doc/mustache-index.cpp
    M clang-tools-extra/test/clang-doc/mustache-separate-namespace.cpp

  Log Message:
  -----------
  [clang-doc] place HTML/JSON output inside their own directories (#150655)

Instead of just outputting everything into the designated root folder,
HTML and JSON output will be placed in html/ and json/ directories.

  Commit: 2912c9c249350bae3178ba7fad605e1f269c32e7
      https://github.com/llvm/llvm-project/commit/2912c9c249350bae3178ba7fad605e1f269c32e7
  Author: Abhinav Gaba <abhinav.gaba at intel.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M offload/test/mapping/data_member_ref.cpp
    M offload/test/mapping/declare_mapper_nested_default_mappers.cpp
    M offload/test/mapping/declare_mapper_nested_mappers.cpp
    M offload/test/mapping/ptr_and_obj_motion.c
    M offload/test/mapping/target_derefence_array_pointrs.cpp
    M offload/test/mapping/target_has_device_addr.c

  Log Message:
  -----------
  [NFC][Offload] Add missing maps to OpenMP offloading tests. (#153103)

A few tests were only mapping a pointee, like: `map(pp[0][0])`, on an
`int** pp`, but expecting the pointers, like `pp`, `pp[0]` to also be
mapped, which is incorrect.

This change fixes six such tests.

  Commit: a508ea2ad7d9ab8bad0ec8fa3db3397985d57841
      https://github.com/llvm/llvm-project/commit/a508ea2ad7d9ab8bad0ec8fa3db3397985d57841
  Author: Mircea Trofin <mtrofin at google.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Transforms/Scalar/CMakeLists.txt
    M llvm/lib/Transforms/Scalar/JumpTableToSwitch.cpp

  Log Message:
  -----------
  Add dependency on `ProfileData` from ScalarOpts (#153651)

Fixing buildbot failures after PR #153305, e.g.
https://lab.llvm.org/buildbot/#/builders/203/builds/19861

Analysis already depends on `ProfileData`, so the transitive closure of
the dependencies of `ScalarOpts` doesn't change.

Also avoided an extra (and very unnecessary) dependency on
`Instrumentation`. The API previously used doesn't need to live in
Instrumentation to begin with, but that's something to address in a
follow-up.

  Commit: eeada0d30f41b2d680073e49246ac0388e6c5ec7
      https://github.com/llvm/llvm-project/commit/eeada0d30f41b2d680073e49246ac0388e6c5ec7
  Author: Matheus Izvekov <mizvekov at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M clang/include/clang/AST/TypeLoc.h
    M clang/test/AST/ast-dump-templates.cpp

  Log Message:
  -----------
  [clang] fix source range computation for DeducedTemplateSpecializationType (#153646)

This was a regression introduced in
https://github.com/llvm/llvm-project/pull/147835

Since this regression was never released, there are no release notes.

Fixes https://github.com/llvm/llvm-project/issues/153540

  Commit: 334a046a3cd31aaec00c4e4bb8866242c6e38c98
      https://github.com/llvm/llvm-project/commit/334a046a3cd31aaec00c4e4bb8866242c6e38c98
  Author: Michael Berg <93234525+mcberg2021 at users.noreply.github.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Analysis/LoopAccessAnalysis.cpp
    M llvm/lib/Transforms/Scalar/LoopDistribute.cpp
    A llvm/test/Transforms/LoopDistribute/cross-partition-access.ll

  Log Message:
  -----------
  [LoopDist] Consider reads and writes together for runtime checks (#145623)

Emit safety guards for ptr accesses when cross partition loads exist
which have a corresponding store to the same address in a different
partition. This will emit the necessary ptr checks for these accesses.

The test case was obtained from SuperTest, which SiFive runs regularly.
We enabled LoopDistribution by default in our downstream compiler, and
this change was part of that enablement.

  Commit: 49f20934776cccbed82ccdca657d9111bf550286
      https://github.com/llvm/llvm-project/commit/49f20934776cccbed82ccdca657d9111bf550286
  Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin at amd.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/docs/AMDGPUUsage.rst
    M llvm/lib/Target/AMDGPU/AMDGPU.td
    M llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUFeatures.td
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
    M llvm/test/CodeGen/AMDGPU/extra-lds-size.ll
    A llvm/test/CodeGen/AMDGPU/lds-limit-diagnostics-gfx1250.ll
    A llvm/test/CodeGen/AMDGPU/lds-size-hsa-gfx1250.ll
    A llvm/test/CodeGen/AMDGPU/lds-size-pal-gfx1250.ll

  Log Message:
  -----------
  [AMDGPU] Increase LDS to 320K on gfx1250 (#153645)

  Commit: 5479b7ed4200a85de3aa9335110883a86715811d
      https://github.com/llvm/llvm-project/commit/5479b7ed4200a85de3aa9335110883a86715811d
  Author: CatherineMoore <catmoore at amd.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M openmp/runtime/src/kmp_settings.cpp

  Log Message:
  -----------
  [OpenMP] Update printf stmt in  kmp_settings.cpp (#152800)

Remove extraneous argument from printf statement

---------

Co-authored-by: Joachim <protze at rz.rwth-aachen.de>

  Commit: aa4805a09052c1b6298718eeb6d30c33dd0d695f
      https://github.com/llvm/llvm-project/commit/aa4805a09052c1b6298718eeb6d30c33dd0d695f
  Author: Bill Wendling <morbo at google.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M clang/include/clang/AST/Type.h
    M clang/include/clang/Basic/Attr.td
    M clang/include/clang/Basic/AttrDocs.td
    M clang/lib/AST/ASTContext.cpp
    M clang/lib/AST/Type.cpp
    M clang/lib/AST/TypePrinter.cpp
    M clang/lib/CodeGen/CodeGenFunction.cpp
    M clang/lib/CodeGen/CodeGenModule.cpp
    M clang/lib/CodeGen/CodeGenModule.h
    M clang/lib/Sema/SemaType.cpp
    A clang/test/CodeGen/cfi-salt.c
    M clang/test/Misc/pragma-attribute-supported-attributes-list.test
    A clang/test/Sema/attr-cfi-salt.c

  Log Message:
  -----------
  [Clang][attr] Add 'cfi_salt' attribute (#141846)

The 'cfi_salt' attribute specifies a string literal that is used as a
"salt" for Control-Flow Integrity (CFI) checks to distinguish between
functions with the same type signature. This attribute can be applied
to function declarations, function definitions, and function pointer
typedefs.

This attribute prevents function pointers from being replaced with
pointers to functions that have a compatible type, which can be a CFI
bypass vector.

The attribute affects type compatibility during compilation and CFI
hash generation during code generation.

  Attribute syntax: [[clang::cfi_salt("<salt_string>")]]
  GNU-style syntax: __attribute__((cfi_salt("<salt_string>")))

- The attribute takes a single string of non-NULL ASCII characters.
- It only applies to function types; using it on a non-function type
  will generate an error.
- All function declarations and the function definition must include
  the attribute and use identical salt values.

Example usage:

  // Header file:
  #define __cfi_salt(S) __attribute__((cfi_salt(S)))

  // Convenient typedefs to avoid nested declarator syntax.
  typedef int (*fp_unsalted_t)(void);
  typedef int (*fp_salted_t)(void) __cfi_salt("pepper");

  struct widget_ops {
    fp_unsalted_t init;     // Regular CFI.
    fp_salted_t exec;       // Salted CFI.
    fp_unsalted_t teardown; // Regular CFI.
  };

  // bar.c file:
  static int bar_init(void) { ... }
  static int bar_salted_exec(void) __cfi_salt("pepper") { ... }
  static int bar_teardown(void) { ... }

  static struct widget_ops _generator = {
    .init = bar_init,
    .exec = bar_salted_exec,
    .teardown = bar_teardown,
  };

  struct widget_ops *widget_gen = &_generator;

  // 2nd .c file:
  int generate_a_widget(void) {
    int ret;

    // Called with non-salted CFI.
    ret = widget_gen->init();
    if (ret)
      return ret;

    // Called with salted CFI.
    ret = widget_gen->exec();
    if (ret)
      return ret;

    // Called with non-salted CFI.
    return widget_gen->teardown();
  }
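
To see why a salt separates functions that share a signature, here is a toy Python model. It assumes the CFI type identifier is some hash of the function type and simply mixes the salt into the hashed string; SHA-256 and the names `type_id` and `indirect_call_allowed` are stand-ins for illustration, not Clang's actual hashing scheme.

```python
import hashlib


def type_id(signature, salt=None):
    """Toy CFI type identifier: hash of the signature, plus the salt when present."""
    data = signature if salt is None else signature + "." + salt
    return hashlib.sha256(data.encode()).hexdigest()[:8]


def indirect_call_allowed(pointer_type_id, target_type_id):
    """A CFI check passes only when both identifiers match."""
    return pointer_type_id == target_type_id


unsalted = type_id("int (void)")
salted = type_id("int (void)", "pepper")
```

In this model a pointer of the salted type cannot be pointed at an unsalted `int (void)` function without failing the check, even though the plain signatures are identical.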

Link: https://github.com/ClangBuiltLinux/linux/issues/1736
Link: https://github.com/KSPP/linux/issues/365

---------

Signed-off-by: Bill Wendling <morbo at google.com>
Co-authored-by: Aaron Ballman <aaron at aaronballman.com>

  Commit: e5e3e4bdb57efbfc0b2709d198d4ced752794bcd
      https://github.com/llvm/llvm-project/commit/e5e3e4bdb57efbfc0b2709d198d4ced752794bcd
  Author: Erich Keane <ekeane at nvidia.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M mlir/include/mlir/Dialect/OpenACC/OpenACCOps.td
    M mlir/lib/Dialect/OpenACC/IR/OpenACC.cpp

  Log Message:
  -----------
  [OpenACC] Add firstprivate recipe helper methods to ACC dialect (#153604)

Like we did for the 'private' clause, this adds an easier to use helper
function to add the 'firstprivate' clause + recipe to the Parallel and
Serial ops.

  Commit: 6b316ecb5f843926bbd62b18fd8e0c5cf1cee622
      https://github.com/llvm/llvm-project/commit/6b316ecb5f843926bbd62b18fd8e0c5cf1cee622
  Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin at amd.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Target/AMDGPU/SIInstrFormats.td

  Log Message:
  -----------
  [AMDGPU] Encode NV bit in VIMAGE/VSAMPLE. NFC (#153654)

This is NFC as this target does not have it.

  Commit: 8a0c7e9b3266051803e3919954866e23498ca4d0
      https://github.com/llvm/llvm-project/commit/8a0c7e9b3266051803e3919954866e23498ca4d0
  Author: Florian Hahn <flo at fhahn.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/test/Transforms/LoopVectorize/X86/consecutive-ptr-uniforms.ll
    M llvm/test/Transforms/LoopVectorize/dbg-outer-loop-vect.ll
    M llvm/test/Transforms/LoopVectorize/pr45525.ll
    M llvm/test/Transforms/LoopVectorize/runtime-check-readonly.ll

  Log Message:
  -----------
  [LV] Regenerate some more tests.

  Commit: 52c9489d1d9bd80454a5b9647393ca41c65e2b57
      https://github.com/llvm/llvm-project/commit/52c9489d1d9bd80454a5b9647393ca41c65e2b57
  Author: Jonas Devlieghere <jonas at devlieghere.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M lldb/cmake/modules/LLDBConfig.cmake

  Log Message:
  -----------
  [lldb] Use the Python limited API with SWIG 4.2 or later (#153119) (#153472)

Use the Python limited API when building with SWIG 4.2 or later.

  Commit: bad3df476457db2fd7dd36729c47fb1d2bddad3a
      https://github.com/llvm/llvm-project/commit/bad3df476457db2fd7dd36729c47fb1d2bddad3a
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add bind names for __double2ll_rX interfaces (#153660)

  Commit: 76dd742f7b32e4d3acf50fab1dbbd897f215837e
      https://github.com/llvm/llvm-project/commit/76dd742f7b32e4d3acf50fab1dbbd897f215837e
  Author: Zhaoxuan Jiang <jiangzhaoxuan94 at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/include/llvm/CGData/CodeGenData.h
    M llvm/include/llvm/CGData/CodeGenData.inc
    M llvm/include/llvm/CGData/StableFunctionMap.h
    M llvm/include/llvm/CGData/StableFunctionMapRecord.h
    M llvm/lib/CGData/CodeGenData.cpp
    M llvm/lib/CGData/CodeGenDataReader.cpp
    M llvm/lib/CGData/StableFunctionMap.cpp
    M llvm/lib/CGData/StableFunctionMapRecord.cpp
    M llvm/lib/CodeGen/GlobalMergeFunctions.cpp
    M llvm/test/ThinLTO/AArch64/cgdata-merge-write.ll
    M llvm/test/tools/llvm-cgdata/empty.test
    M llvm/test/tools/llvm-cgdata/error.test
    M llvm/test/tools/llvm-cgdata/merge-combined-funcmap-hashtree.test
    M llvm/test/tools/llvm-cgdata/merge-funcmap-archive.test
    M llvm/test/tools/llvm-cgdata/merge-funcmap-concat.test
    M llvm/test/tools/llvm-cgdata/merge-funcmap-double.test
    M llvm/test/tools/llvm-cgdata/merge-funcmap-single.test
    M llvm/tools/llvm-cgdata/Opts.td
    M llvm/tools/llvm-cgdata/llvm-cgdata.cpp
    M llvm/unittests/CGData/StableFunctionMapTest.cpp

  Log Message:
  -----------
  [CGData] Lazy loading support for stable function map (#151660)

The stable function map could be huge for a large application. Fully
loading it is slow and consumes a significant amount of memory, which is
unnecessary and drastically slows down compilation especially for
non-LTO and distributed-ThinLTO setups. This patch introduces an opt-in
lazy loading support for the stable function map. The detailed changes
are:

- `StableFunctionMap`
  - The map now stores entries in an `EntryStorage` struct, which includes
    offsets for serialized entries and a `std::once_flag` for thread-safe
    lazy loading.
  - The underlying map type is changed from `DenseMap` to
    `std::unordered_map` for compatibility with `std::once_flag`.
  - `contains()`, `size()` and `at()` are implemented to only load
    requested entries on demand.

- Lazy Loading Mechanism
  - When reading indexed codegen data, if the newly-introduced
    `-indexed-codegen-data-lazy-loading` flag is set, the stable function
    map is not fully deserialized up front. The binary format for the stable
    function map now includes offsets and sizes to support lazy loading.
  - The safety of lazy loading is guarded by the once flag per function
    hash. This guarantees that even in a multi-threaded environment, the
    deserialization for a given function hash will happen exactly once. The
    first thread to request it performs the load, and subsequent threads
    will wait for it to complete before using the data. For single-threaded
    builds, the overhead is negligible (a single check on the once flag).
    For multi-threaded scenarios, users can omit the flag to retain the
    previous eager-loading behavior.
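
The once-per-entry pattern described above can be sketched in Python. This is a model of the idea only: Python has no `std::once_flag`, so a lock plus a flag stands in for it, and the class names, the `(offset, size)` index layout and the `decode` callback are all hypothetical.

```python
import threading


class LazyEntry:
    """One serialized entry, decoded at most once on first access."""

    def __init__(self, offset, size):
        self.offset = offset
        self.size = size
        self._lock = threading.Lock()   # stand-in for std::once_flag
        self._loaded = False
        self._value = None

    def get(self, blob, decode):
        if not self._loaded:
            with self._lock:
                # Re-check under the lock so only the first thread decodes.
                if not self._loaded:
                    raw = blob[self.offset:self.offset + self.size]
                    self._value = decode(raw)
                    self._loaded = True
        return self._value


class LazyFunctionMap:
    """Maps a function hash to an entry that is deserialized on demand."""

    def __init__(self, blob, index, decode):
        self._blob = blob
        self._decode = decode
        self._entries = {h: LazyEntry(off, size) for h, (off, size) in index.items()}

    def contains(self, function_hash):
        return function_hash in self._entries

    def size(self):
        return len(self._entries)

    def at(self, function_hash):
        return self._entries[function_hash].get(self._blob, self._decode)
```

Note that `contains()` and `size()` only consult the index, so they never trigger a decode; only `at()` touches the serialized bytes.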

  Commit: 1e9fc8edd03781ef4836d601e6ff672df3901d88
      https://github.com/llvm/llvm-project/commit/1e9fc8edd03781ef4836d601e6ff672df3901d88
  Author: Bill Wendling <morbo at google.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M clang/test/Sema/attr-cfi-salt.c

  Log Message:
  -----------
  [Clang][attr] Add '-std=c11' to allow for typedef redefinition

  Commit: e56ae9651b2e51cbf18e33627165c76b31ccc86b
      https://github.com/llvm/llvm-project/commit/e56ae9651b2e51cbf18e33627165c76b31ccc86b
  Author: Morris Hafner <mmha at users.noreply.github.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M clang/include/clang/CIR/MissingFeatures.h
    M clang/lib/CIR/CodeGen/CIRGenExpr.cpp
    M clang/lib/CIR/CodeGen/CIRGenFunction.cpp
    M clang/lib/CIR/CodeGen/CIRGenFunction.h
    M clang/lib/CIR/CodeGen/CIRGenStmt.cpp

  Log Message:
  -----------
  [CIR][NFC] Add Symbol Table to CIRGenFunction (#153625)

This patch adds a symbol table to CIRGenFunction plus scopes and
insertions to the table where we were missing them previously.

  Commit: 5836bae463ce68e834e83231e443007d324ed89a
      https://github.com/llvm/llvm-project/commit/5836bae463ce68e834e83231e443007d324ed89a
  Author: David Green <david.green at arm.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp
    M llvm/test/Analysis/CostModel/AArch64/arith-fp.ll
    M llvm/test/Analysis/CostModel/AArch64/sve-arith-fp.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/f128-fmuladd-reduction.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/veclib-intrinsic-calls.ll
    M llvm/test/Transforms/SLPVectorizer/AArch64/reused-scalar-repeated-in-node.ll
    M llvm/test/Transforms/SLPVectorizer/AArch64/vec3-reorder-reshuffle.ll
    M llvm/test/Transforms/SLPVectorizer/insertelement-postpone.ll

  Log Message:
  -----------
  [AArch64] Change the cost of fma and fmuladd to match fmul. (#152963)

As fmul and fmadd are so similar, their performance characteristics tend
to be the same on most platforms, at least in terms of reciprocal
throughputs. Processors capable of performing a given number of fmul per
cycle can usually perform the same number of fma, with the extra add
being relatively simple on top. This patch makes the scores of the two
operations the same, which brings the throughput cost of a fma/fmuladd
to 2, and the latency to 3, which are the defaults for fmul.

Note that we might also want to change the throughput cost of a fmul to
1, as most processors have ample bandwidth for them, but the two costs
should still stay in line with one another.

  Commit: 06590444f53ee855a6b321895594382f79d6b1d8
      https://github.com/llvm/llvm-project/commit/06590444f53ee855a6b321895594382f79d6b1d8
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add bind names for __double2ull_rX interfaces (#153678)

  Commit: db5f7dc374fdd70d39857d3402d42878139cbb4e
      https://github.com/llvm/llvm-project/commit/db5f7dc374fdd70d39857d3402d42878139cbb4e
  Author: Alex Bradbury <asb at igalia.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
    M llvm/test/Transforms/SLPVectorizer/AArch64/alternate-vectorization-split-node.ll
    M llvm/test/Transforms/SLPVectorizer/X86/load-merge-inseltpoison.ll
    M llvm/test/Transforms/SLPVectorizer/X86/load-merge.ll

  Log Message:
  -----------
  Revert "[SLP]Support LShr as base for copyable elements"

This reverts commit ca4ebf95172d24f8c47655709b2c9eb85bda5cb2.

Causes compile-time crashes for some inputs with RVV zvl512b/zvl1024b
configurations. See here for a minimal reproducer:
https://github.com/llvm/llvm-project/pull/153393#issuecomment-3189898813

  Commit: 4e63d704e8f8bbceffeb19ffd62e0b5ed926dbe6
      https://github.com/llvm/llvm-project/commit/4e63d704e8f8bbceffeb19ffd62e0b5ed926dbe6
  Author: DeanSturtevant1 <dsturtevant at google.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M utils/bazel/llvm-project-overlay/mlir/BUILD.bazel

  Log Message:
  -----------
  Fix mlir/BUILD.bazel for XeGPUUtils. (#153689)

  Commit: b989c7c2e099211b549b2866cc4e6c6775775510
      https://github.com/llvm/llvm-project/commit/b989c7c2e099211b549b2866cc4e6c6775775510
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add interfaces for __drcp_rX (#153681)

  Commit: defbbf0129d5c1ac360c6ffd9eaa2f07ab07d622
      https://github.com/llvm/llvm-project/commit/defbbf0129d5c1ac360c6ffd9eaa2f07ab07d622
  Author: Craig Topper <craig.topper at sifive.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Target/RISCV/RISCVMoveMerger.cpp
    A llvm/test/CodeGen/RISCV/pr153598.mir

  Log Message:
  -----------
  [RISCV][MoveMerge] Don't copy kill flag when moving past an instruction that reads the register. (#153644)

If we're moving the second copy before another instruction that reads
the copied register, we need to clear the kill flag on the combined
move.

Fixes #153598.

  Commit: cb2f0d0a5f14c183e7182aba0f0e54a518de9e3f
      https://github.com/llvm/llvm-project/commit/cb2f0d0a5f14c183e7182aba0f0e54a518de9e3f
  Author: DeanSturtevant1 <dsturtevant at google.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M utils/bazel/llvm-project-overlay/mlir/BUILD.bazel

  Log Message:
  -----------
  [bazel] Fix mlir/BUILD.bazel for VectorToXeGPU. (#153696)

  Commit: df15c0d716fa0d84104b05937854becacbda6629
      https://github.com/llvm/llvm-project/commit/df15c0d716fa0d84104b05937854becacbda6629
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-device-proc.cuf
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add interfaces for __dsqrt_rn and __dsqrt_rz (#153624)

  Commit: a1529cd85a9f99a263de72e16ed2f7567c519456
      https://github.com/llvm/llvm-project/commit/a1529cd85a9f99a263de72e16ed2f7567c519456
  Author: Andy Kaylor <akaylor at nvidia.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M clang/include/clang/CIR/Dialect/Builder/CIRBaseBuilder.h
    M clang/include/clang/CIR/Dialect/IR/CIRAttrs.td
    M clang/include/clang/CIR/Dialect/IR/CIRDataLayout.h
    M clang/lib/CIR/CodeGen/CIRGenBuilder.cpp
    M clang/lib/CIR/CodeGen/CIRGenBuilder.h
    M clang/lib/CIR/CodeGen/CIRGenExprConstant.cpp
    M clang/lib/CIR/Dialect/IR/CIRDataLayout.cpp
    M clang/lib/CIR/Lowering/DirectToLLVM/LowerToLLVM.cpp
    M clang/test/CIR/CodeGen/globals.cpp

  Log Message:
  -----------
  [CIR] Add index support for global_view (#153254)

The #cir.global_view attribute was initially added without support for
the optional index list. This change adds index list support. This is
used when the address of an array or structure member is used as an
initializer.

This patch does not include support for taking the address of a
structure or class member. That will be added later.

  Commit: 57c1e01e484019084cfa975a4f41643d05ae327f
      https://github.com/llvm/llvm-project/commit/57c1e01e484019084cfa975a4f41643d05ae327f
  Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin at amd.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M clang/test/CodeGenHIP/hip-cumode.hip
    M clang/test/Driver/hip-macros.hip
    M llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp
    M llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
    M llvm/lib/Target/AMDGPU/GCNSubtarget.h
    M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUTargetStreamer.cpp
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
    M llvm/test/CodeGen/AMDGPU/GlobalISel/mul.ll
    M llvm/test/CodeGen/AMDGPU/global-load-xcnt.ll
    M llvm/test/CodeGen/AMDGPU/memory-legalizer-local-workgroup.ll
    M llvm/test/CodeGen/AMDGPU/packed-fp32.ll
    M llvm/test/MC/AMDGPU/hsa-diag-v4.s

  Log Message:
  -----------
  [AMDGPU] Don't allow wgp mode on gfx1250 (#153680)

- gfx1250 only supports cu mode

  Commit: fc4df2c9173a110102005af937a26df7f29d24e1
      https://github.com/llvm/llvm-project/commit/fc4df2c9173a110102005af937a26df7f29d24e1
  Author: Daniel Paoliello <danpao at microsoft.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M clang/test/Headers/mm3dnow.c
    M clang/test/Headers/pmmintrin.c
    M clang/test/Headers/x86-intrinsics-headers.c
    M clang/test/Headers/x86intrin.c
    M clang/test/Headers/x86intrin.cpp

  Log Message:
  -----------
  [win][arm64ec] XFAIL x64 intrinsic tests on Arm64EC (#153474)

Clang defines the x64 preprocessor macro (`__x86_64__`) when building
Arm64EC, however the tests for x64 built-ins and intrinsics are
currently failing since the relevant functions don't exist, resulting in
errors like:

```
Line 165: invalid conversion between vector type '__v2di' (vector of 2 'long long' values) and integer type 'int' of different size
```

(Clang doesn't know the intrinsics being called, and so treats it like
an undefined function, which makes it assume the return type is `int`)

For now, expect these tests to fail until someone decides to implement
these intrinsics.

  Commit: ca9ddd54b77020322a94a8032276b46cc1046522
      https://github.com/llvm/llvm-project/commit/ca9ddd54b77020322a94a8032276b46cc1046522
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add interfaces for __ll2float_rX (#153694)

  Commit: d56fa965243bcdc115b8f262ce79cf2547500c6d
      https://github.com/llvm/llvm-project/commit/d56fa965243bcdc115b8f262ce79cf2547500c6d
  Author: joaosaffran <126493771+joaosaffran at users.noreply.github.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/include/llvm/BinaryFormat/DXContainer.h
    M llvm/include/llvm/Support/DXILABI.h
    M llvm/lib/Analysis/DXILResource.cpp
    M llvm/lib/BinaryFormat/DXContainer.cpp
    M llvm/lib/Frontend/HLSL/HLSLRootSignature.cpp
    M llvm/lib/Frontend/HLSL/RootSignatureMetadata.cpp
    M llvm/lib/Support/CMakeLists.txt
    A llvm/lib/Support/DXILABI.cpp
    M llvm/lib/Target/DirectX/DXContainerGlobals.cpp
    M llvm/lib/Target/DirectX/DXILOpLowering.cpp
    M llvm/lib/Target/DirectX/DXILPostOptimizationValidation.cpp
    M llvm/lib/Target/DirectX/DXILRootSignature.h
    M llvm/test/Analysis/DXILResource/buffer-frombinding.ll
    M llvm/test/CodeGen/DirectX/llc-pipeline.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-cbuffer-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-descriptor-table-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-root-descriptor-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-sampler.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-static-sampler-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation.ll

  Log Message:
  -----------
  [DirectX] Add Range Overlap validation (#152229)

As part of the Root Signature Spec, we need to validate if Root
Signatures are not defining overlapping ranges.
Closes: https://github.com/llvm/llvm-project/issues/126645

---------

Co-authored-by: joaosaffran <joao.saffran at microsoft.com>
Co-authored-by: Joao Saffran <{ID}+{username}@users.noreply.github.com>
Co-authored-by: Joao Saffran <jderezende at microsoft.com>

  Commit: 2775c79c4fd06b4654a664c5182d9eb86f3e72c1
      https://github.com/llvm/llvm-project/commit/2775c79c4fd06b4654a664c5182d9eb86f3e72c1
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add interfaces for __float2ll_rX (#153702)

  Commit: a629119c7544e62fa031c85f87f507f9a90a070f
      https://github.com/llvm/llvm-project/commit/a629119c7544e62fa031c85f87f507f9a90a070f
  Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin at amd.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Target/AMDGPU/AMDGPURemoveIncompatibleFunctions.cpp
    M llvm/lib/Target/AMDGPU/GCNSubtarget.h
    A llvm/test/CodeGen/AMDGPU/pal-metadata-3.0.gfx1250.ll
    A llvm/test/CodeGen/AMDGPU/remove-incompatible-wave64-feature.ll

  Log Message:
  -----------
  [AMDGPU] Remove wave64 functions (#153690)

gfx1250 only supports wave32.

  Commit: 602f308d4fcc84d90d49fb708f757589bca68d4e
      https://github.com/llvm/llvm-project/commit/602f308d4fcc84d90d49fb708f757589bca68d4e
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add interface for __saturatef (#153705)

  Commit: 47bc6acf861e9b97b13ecbc4313001ee5d50adc1
      https://github.com/llvm/llvm-project/commit/47bc6acf861e9b97b13ecbc4313001ee5d50adc1
  Author: LLVM GN Syncbot <llvmgnsyncbot at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/utils/gn/secondary/llvm/lib/Support/BUILD.gn

  Log Message:
  -----------
  [gn build] Port d56fa965243b

  Commit: ffe48704721ec0710c7622d7f4170a9d03e77dbd
      https://github.com/llvm/llvm-project/commit/ffe48704721ec0710c7622d7f4170a9d03e77dbd
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add interfaces for __float2int_rX and __float2uint_rX (#153691)

  Commit: 8bce10ac6d3358d67d8ce3d111b455aca6b35654
      https://github.com/llvm/llvm-project/commit/8bce10ac6d3358d67d8ce3d111b455aca6b35654
  Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin at amd.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Target/AMDGPU/AMDGPUPreloadKernArgProlog.cpp
    M llvm/lib/Target/AMDGPU/GCNSubtarget.h
    M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.cvt.sat.pk.ll
    M llvm/test/CodeGen/AMDGPU/preload-implicit-kernargs.ll
    M llvm/test/CodeGen/AMDGPU/preload-kernargs.ll

  Log Message:
  -----------
  [AMDGPU] Enable kernarg preload on gfx1250 (#153686)

  Commit: abe92a50005a2a4998e4a006417fdfe6ff36eb6b
      https://github.com/llvm/llvm-project/commit/abe92a50005a2a4998e4a006417fdfe6ff36eb6b
  Author: Min-Yih Hsu <min.hsu at sifive.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
    A llvm/test/CodeGen/RISCV/rvv/incorrect-extract-subvector-combine.ll

  Log Message:
  -----------
  [DAGCombine] Fix an incorrect folding of extract_subvector (#153709)

Reported from
https://github.com/llvm/llvm-project/pull/153393#issuecomment-3189898813

During DAGCombine, an intermediate extract_subvector sequence was
generated:
```
  t8: v9i16 = extract_subvector t3, Constant:i64<9>
t24: v8i16 = extract_subvector t8, Constant:i64<0>
```
One of the DAGCombine rules, which turns `(extract_subvector
(extract_subvector X, C), 0)` into `(extract_subvector X, C)`, then
kicked in and turned that into `v8i16 = extract_subvector t3,
Constant:i64<9>`. But it did not check whether the extracted index is a
multiple of the minimum vector length of the result type, hence the crash.

This patch fixes this by adding an additional check.
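
The missing legality condition can be sketched in isolation. This is a
minimal sketch, not the code from DAGCombiner.cpp: `canFoldNestedExtract`
and its parameters are hypothetical names, and it only models the
index-alignment rule described above.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not an LLVM API). Models the fold
//   (extract_subvector (extract_subvector X, innerIdx), outerIdx)
//     -> (extract_subvector X, innerIdx)
// resultMinElts is the minimum element count of the final result type
// (8 for v8i16 in the example above).
bool canFoldNestedExtract(uint64_t innerIdx, uint64_t outerIdx,
                          unsigned resultMinElts) {
  assert(resultMinElts > 0 && "result type must have at least one element");
  // The rule only applies when the outer extract starts at element 0.
  if (outerIdx != 0)
    return false;
  // The check described in the commit message: the surviving index must be
  // a multiple of the result type's minimum vector length.
  return innerIdx % resultMinElts == 0;
}
```

With the values from the reproducer, `canFoldNestedExtract(9, 0, 8)` is
false because 9 is not a multiple of 8, so the fold has to be skipped.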

  Commit: 3bc4d660823e72ef4c78d96a473818efc06c9e32
      https://github.com/llvm/llvm-project/commit/3bc4d660823e72ef4c78d96a473818efc06c9e32
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-libdevice.cuf

  Log Message:
  -----------
  [flang][cuda] Add interfaces for __int2float_rX (#153708)

  Commit: 07d3a73d70cac6e58ca9002c98e31423c26cc735
      https://github.com/llvm/llvm-project/commit/07d3a73d70cac6e58ca9002c98e31423c26cc735
  Author: Kyungwoo Lee <kyulee at meta.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/include/llvm/CGData/CodeGenData.h
    M llvm/include/llvm/CGData/CodeGenData.inc
    M llvm/include/llvm/CGData/StableFunctionMap.h
    M llvm/include/llvm/CGData/StableFunctionMapRecord.h
    M llvm/lib/CGData/CodeGenData.cpp
    M llvm/lib/CGData/CodeGenDataReader.cpp
    M llvm/lib/CGData/StableFunctionMap.cpp
    M llvm/lib/CGData/StableFunctionMapRecord.cpp
    M llvm/lib/CodeGen/GlobalMergeFunctions.cpp
    M llvm/test/ThinLTO/AArch64/cgdata-merge-write.ll
    M llvm/test/tools/llvm-cgdata/empty.test
    M llvm/test/tools/llvm-cgdata/error.test
    M llvm/test/tools/llvm-cgdata/merge-combined-funcmap-hashtree.test
    M llvm/test/tools/llvm-cgdata/merge-funcmap-archive.test
    M llvm/test/tools/llvm-cgdata/merge-funcmap-concat.test
    M llvm/test/tools/llvm-cgdata/merge-funcmap-double.test
    M llvm/test/tools/llvm-cgdata/merge-funcmap-single.test
    M llvm/tools/llvm-cgdata/Opts.td
    M llvm/tools/llvm-cgdata/llvm-cgdata.cpp
    M llvm/unittests/CGData/StableFunctionMapTest.cpp

  Log Message:
  -----------
  Revert "[CGData] Lazy loading support for stable function map (#151660)"

This reverts commit 76dd742f7b32e4d3acf50fab1dbbd897f215837e.

  Commit: 769a9058c8d04fc920994f6a5bbb03c8a4fbcd05
      https://github.com/llvm/llvm-project/commit/769a9058c8d04fc920994f6a5bbb03c8a4fbcd05
  Author: Matt Arsenault <Matthew.Arsenault at amd.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/benchmarks/CMakeLists.txt
    A llvm/benchmarks/RuntimeLibcalls.cpp
    M llvm/include/llvm/IR/RuntimeLibcalls.h
    M llvm/lib/IR/RuntimeLibcalls.cpp
    M llvm/lib/Object/IRSymtab.cpp
    M llvm/test/TableGen/RuntimeLibcallEmitter.td
    M llvm/unittests/IR/CMakeLists.txt
    A llvm/unittests/IR/RuntimeLibcallsTest.cpp
    M llvm/utils/TableGen/Basic/RuntimeLibcallsEmitter.cpp

  Log Message:
  -----------
  TableGen: Emit statically generated hash table for runtime libcalls (#150192)

a96121089b9c94e08c6632f91f2dffc73c0ffa28 reverted a change
to use a binary search on the string name table because it
was too slow. This replaces it with a static string hash
table based on the known set of libcall names. Microbenchmarking
shows this is similarly fast to using DenseMap. It's possibly
slightly slower than using StringSet, though these aren't an
exact comparison. This also saves on the one time use construction
of the map, so it could be better in practice.

This search isn't a simple set check, since it does find the
range of possible matches with the same name. There's also
an additional check for whether the current target supports
the name. The runtime constructed set doesn't require this,
since it only adds the symbols live for the target.

Followed algorithm from this post
http://0x80.pl/notesen/2023-04-30-lookup-in-strings.html

I'm also thinking the 2 special case global symbols should
just be added to RuntimeLibcalls. There are also other global
references emitted in the backend that aren't tracked; we probably
should just use this as a centralized database for all compiler
selected symbols.
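
To illustrate the general idea of a lookup table fixed at build time, here
is a minimal sketch. It is not the TableGen-emitted code: it assumes C++17,
uses `constexpr` evaluation in place of a generator, picks FNV-1a with
linear probing as an arbitrary scheme, and does not model the range of
matches sharing one name. All identifiers (`kNames`, `lookupName`,
`isLibcallForTarget`) are hypothetical.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <string_view>

// Illustrative name set only; the real table covers every known libcall.
constexpr std::array<std::string_view, 4> kNames = {"memcpy", "memset",
                                                    "sqrtf", "__divdi3"};
constexpr std::size_t kTableSize = 8; // power of two, larger than kNames

// FNV-1a, chosen here for brevity (an assumption, not the hash LLVM uses).
constexpr std::uint32_t fnv1a(std::string_view s) {
  std::uint32_t h = 2166136261u;
  for (char c : s) {
    h ^= static_cast<unsigned char>(c);
    h *= 16777619u;
  }
  return h;
}

// Builds the table during compilation: each slot holds an index into
// kNames, or -1 when empty. Collisions use linear probing.
constexpr std::array<int, kTableSize> buildTable() {
  std::array<int, kTableSize> table{};
  for (int &slot : table)
    slot = -1;
  for (std::size_t i = 0; i < kNames.size(); ++i) {
    std::size_t pos = fnv1a(kNames[i]) & (kTableSize - 1);
    while (table[pos] != -1)
      pos = (pos + 1) & (kTableSize - 1);
    table[pos] = static_cast<int>(i);
  }
  return table;
}

constexpr std::array<int, kTableSize> kTable = buildTable();

// Returns the index of name in kNames, or -1 if it is not a known name.
constexpr int lookupName(std::string_view name) {
  std::size_t pos = fnv1a(name) & (kTableSize - 1);
  for (std::size_t probes = 0; probes < kTableSize; ++probes) {
    int idx = kTable[pos];
    if (idx == -1)
      return -1;
    if (kNames[static_cast<std::size_t>(idx)] == name)
      return idx;
    pos = (pos + 1) & (kTableSize - 1);
  }
  return -1;
}

// Second step from the message above: a name may be in the static table
// yet unavailable on the current target, so availability is checked
// separately.
constexpr bool
isLibcallForTarget(std::string_view name,
                   const std::array<bool, kNames.size()> &available) {
  int idx = lookupName(name);
  return idx != -1 && available[static_cast<std::size_t>(idx)];
}
```

Because the table is a constant, no map has to be constructed at startup;
the per-target availability array stands in for the extra support check.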

  Commit: 7e46f5db21d40dd386aced6586e5e1fa6892092f
      https://github.com/llvm/llvm-project/commit/7e46f5db21d40dd386aced6586e5e1fa6892092f
  Author: Steven Wu <stevenwu at apple.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/include/llvm/Support/FileSystem.h
    M llvm/lib/Support/Unix/Path.inc
    M llvm/lib/Support/Windows/Path.inc
    M llvm/unittests/Support/Path.cpp

  Log Message:
  -----------
  [Support] Add mapped_file_region::sync(), equivalent to msync (#153632)

  Commit: 7ec2096cd3a9f14d36fd7e69471d0be9e52de1c9
      https://github.com/llvm/llvm-project/commit/7ec2096cd3a9f14d36fd7e69471d0be9e52de1c9
  Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin at amd.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Target/AMDGPU/SIRegisterInfo.td

  Log Message:
  -----------
  [AMDGPU] Fix the comment wrt SSrc_* RCs. NFC. (#153711)

  Commit: fb14f91305dd512f28927ee17766c1b00ae114dc
      https://github.com/llvm/llvm-project/commit/fb14f91305dd512f28927ee17766c1b00ae114dc
  Author: Oliver Hunt <oliver at apple.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M clang/include/clang/Basic/Features.def
    M clang/include/clang/Basic/LangOptions.def
    M clang/include/clang/Driver/Options.td
    M clang/lib/Frontend/CompilerInvocation.cpp
    M clang/test/CodeGenObjC/ptrauth-block-descriptor-pointer.m
    M clang/test/CodeGenObjC/ptrauth-block-isa.m

  Log Message:
  -----------
  [clang][Obj-C][PAC] Make block descriptor pointer signing configurable (#153700)

Pointer auth protection of the block descriptor pointer is only
supported in some constrained environments so we do actually need it to
be configurable.

We had made it non configurable in the first PR to protect block
metadata because we believed that was an option but subsequently
realised it does need to remain configurable.

This PR revives the flags that permit this.

  Commit: 0f64ec83f62e4689c886ca3df54ebd49f1169c2c
      https://github.com/llvm/llvm-project/commit/0f64ec83f62e4689c886ca3df54ebd49f1169c2c
  Author: Min-Yih Hsu <min.hsu at sifive.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/test/CodeGen/RISCV/rvv/incorrect-extract-subvector-combine.ll

  Log Message:
  -----------
  [RISCV][NFC] Make the pointer in the test case for #153709 non-null

The snippet was originally from llvm-reduce but we probably shouldn't use a null
pointer in the actual test case.

NFC.

  Commit: 0226e9449176f476c0a989ac66762f45e9db9f82
      https://github.com/llvm/llvm-project/commit/0226e9449176f476c0a989ac66762f45e9db9f82
  Author: LLVM GN Syncbot <llvmgnsyncbot at gmail.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/utils/gn/secondary/llvm/unittests/IR/BUILD.gn

  Log Message:
  -----------
  [gn build] Port 769a9058c8d0

  Commit: cb1228fbd535b8f9fe78505a15292b0ba23b17de
      https://github.com/llvm/llvm-project/commit/cb1228fbd535b8f9fe78505a15292b0ba23b17de
  Author: Matt Arsenault <Matthew.Arsenault at amd.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/benchmarks/RuntimeLibcalls.cpp
    M llvm/include/llvm/CodeGen/TargetLowering.h
    M llvm/include/llvm/IR/RuntimeLibcalls.h
    M llvm/lib/CodeGen/PreISelIntrinsicLowering.cpp
    M llvm/lib/LTO/LTO.cpp
    M llvm/lib/Target/WebAssembly/WebAssemblyRuntimeLibcallSignatures.cpp
    M llvm/lib/Transforms/Utils/DeclareRuntimeLibcalls.cpp
    M llvm/unittests/IR/RuntimeLibcallsTest.cpp

  Log Message:
  -----------
  RuntimeLibcalls: Return StringRef for libcall names (#153209)

Does not yet fully propagate this down into the TargetLowering
uses, many of which are relying on null checks on the returned
value.

  Commit: f2a6fcd31167cb1949498236d0e2b40525ef2be9
      https://github.com/llvm/llvm-project/commit/f2a6fcd31167cb1949498236d0e2b40525ef2be9
  Author: Shoreshen <372660931 at qq.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/tools/opt/optdriver.cpp

  Log Message:
  -----------
  [AMDGPU] Delete amdgpu-unify-metadata in optdriver.cpp (#153717)

Fix up for https://github.com/llvm/llvm-project/pull/153548, which is
from https://github.com/llvm/llvm-project/issues/153150.

  Commit: 9465916a61e6307bd51b2cabe447b93604679488
      https://github.com/llvm/llvm-project/commit/9465916a61e6307bd51b2cabe447b93604679488
  Author: Craig Topper <craig.topper at sifive.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Target/RISCV/RISCVMoveMerger.cpp

  Log Message:
  -----------
  [RISCV] Stop passing the merge opcode around in RISCVMoveMerger. NFC (#153687)

What most code wants to know is the direction and we have to decode the
opcode to figure that out. Instead pass the direction around as a bool
and convert to opcode when we create the merge instruction.

  Commit: 793a6b4fd402c929e04f9da407fe4491d5d9d22e
      https://github.com/llvm/llvm-project/commit/793a6b4fd402c929e04f9da407fe4491d5d9d22e
  Author: Ami-zhang <zhanglimin at loongson.cn>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    A clang/test/CodeGen/LoongArch/targetattr-lasx.c

  Log Message:
  -----------
  [NFC] Precommit test for issue #149512 (#153541)

  Commit: e261f2895f158632ca279f6e63bd2b9aa6ef7846
      https://github.com/llvm/llvm-project/commit/e261f2895f158632ca279f6e63bd2b9aa6ef7846
  Author: Luke Lau <luke at igalia.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/lib/Target/RISCV/MCTargetDesc/RISCVBaseInfo.h
    M llvm/lib/Target/RISCV/RISCVInstrFormats.td
    M llvm/lib/Target/RISCV/RISCVInstrInfoV.td
    M llvm/lib/Target/RISCV/RISCVInstrInfoXRivos.td
    M llvm/lib/Target/RISCV/RISCVInstrInfoXSf.td
    M llvm/lib/Target/RISCV/RISCVInstrInfoXSfmm.td
    M llvm/lib/Target/RISCV/RISCVVLOptimizer.cpp

  Log Message:
  -----------
  [RISCV] Add TSFlag for reading past VL behaviour. NFCI (#149704)

Currently we have a switch statement that checks if a vector instruction
may read elements past VL. However it currently doesn't account for
instructions in vendor extensions.

Handling all possible vendor instructions will result in quite a lot of
opcodes being added, so I've created a new TSFlag that we can declare in
TableGen, and added it to the existing instruction definitions.

I've tried to be as conservative as possible here: All SiFive vendor vector
instructions should be covered by the flag, as well as all of
XRivosVizip, and ri.vextract from XRivosVisni.

For now this should be NFC because coincidentally, these instructions
aren't handled in getOperandInfo, so RISCVVLOptimizer should currently
avoid touching them despite them being liberally handled in
getMinimumVLForUser.

However in an upcoming patch we'll need to also bail in
getMinimumVLForUser, so this prepares for it.
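
The difference between an opcode switch and a per-instruction flag can be
sketched as follows. This is only an illustration: the bit position, the
`InstrDesc` struct and the helper names are hypothetical and do not
reflect the actual RISCVII layout.

```cpp
#include <cstdint>

// Hypothetical bit position, chosen for illustration only.
constexpr uint64_t kReadsPastVLShift = 5;
constexpr uint64_t kReadsPastVLMask = uint64_t{1} << kReadsPastVLShift;

// Hypothetical stand-in for an instruction description carrying
// target-specific flags.
struct InstrDesc {
  uint64_t TSFlags;
};

// One bit test replaces a switch that would have to list every opcode,
// including those from vendor extensions.
constexpr bool readsPastVL(const InstrDesc &desc) {
  return (desc.TSFlags & kReadsPastVLMask) != 0;
}

// An optimization that shrinks VL can bail out on any flagged user.
constexpr bool canReduceVLForUser(const InstrDesc &user) {
  return !readsPastVL(user);
}
```

New vendor instructions then only need the flag set in their definition,
with no change to the optimizer itself.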

  Commit: c22ec9cde3708e0c7afd0909508a67ef9625aa4c
      https://github.com/llvm/llvm-project/commit/c22ec9cde3708e0c7afd0909508a67ef9625aa4c
  Author: Shafik Yaghmour <shafik.yaghmour at intel.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M clang/lib/Sema/SemaOverload.cpp

  Log Message:
  -----------
  [Clang][Sema] Add nullptr check in IsFunctionConversion (#153710)

Static analysis flagged this code because ToFPT could be nullptr, but we
were not checking it even though the previous if statement did. It
looks like this was a mistaken refactor from:

https://github.com/llvm/llvm-project/pull/135836

In the older code, ToFPT was set using a cast, which would have asserted,
but the new code no longer does.

  Commit: 10d9e7b1b7913586bb2663cba57c6e25f68a404f
      https://github.com/llvm/llvm-project/commit/10d9e7b1b7913586bb2663cba57c6e25f68a404f
  Author: Jasmine Tang <jjasmine at igalia.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Analysis/ConstantFolding.cpp
    A llvm/test/Transforms/InstSimplify/ConstProp/WebAssembly/dot.ll

  Log Message:
  -----------
  Reapply "[WebAssembly] Constant fold wasm.dot" (#153070)

In #149619, for the test `@dot_follow_modulo_spec_2`, constant folding
the addition of two i32 values of 1073741824 overflows from 2^31 to
-2^31 = -2147483648, which triggers the UB sanitizer. This PR reapplies
the previous PR, explicitly casting the addition operands to int64_t
before performing the addition, and then producing an i32 number via
`Constant *C = get(cast<IntegerType>(Ty->getScalarType()), V, isSigned)`
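
The widening step can be shown standalone. This is a minimal sketch
rather than the ConstantFolding.cpp code; `addWrappingI32` is a
hypothetical name, and the final narrowing stands in for building the
i32 constant.

```cpp
#include <cstdint>

// Adds two i32 values without signed overflow: the sum is computed in
// 64 bits, where it always fits, and only then wrapped to 32 bits.
int32_t addWrappingI32(int32_t a, int32_t b) {
  int64_t wide = static_cast<int64_t>(a) + static_cast<int64_t>(b);
  // Converting to uint32_t is well-defined modular arithmetic. The final
  // conversion to int32_t is modular in C++20 and on mainstream compilers
  // before that (where it is formally implementation-defined).
  return static_cast<int32_t>(static_cast<uint32_t>(wide));
}
```

For the failing test's inputs, `addWrappingI32(1073741824, 1073741824)`
yields `INT32_MIN` without signed overflow in the intermediate sum.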

  Commit: a1b6e7ff393533a5c4f3bdfd4efe5da106e2de2b
      https://github.com/llvm/llvm-project/commit/a1b6e7ff393533a5c4f3bdfd4efe5da106e2de2b
  Author: Ami-zhang <zhanglimin at loongson.cn>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M clang/lib/Basic/Targets/LoongArch.cpp
    M clang/test/CodeGen/LoongArch/targetattr-lasx.c

  Log Message:
  -----------
  [clang][LoongArch] Ensure `target("lasx")` implies LSX support (#153542)

Currently, `__attribute__((target("lasx")))` does not automatically
enable LSX support, causing Clang to fail with `-mno-lsx`. Since
LASX depends on LSX, enabling LASX should implicitly enable LSX to
avoid the Clang error.

Fixes #149512.

Depends on #153541

  Commit: 3f797a8342c3dbe4a260b26f948d8776ff490431
      https://github.com/llvm/llvm-project/commit/3f797a8342c3dbe4a260b26f948d8776ff490431
  Author: Chenguang Wang <w3cing at gmail.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M mlir/include/mlir/Dialect/SPIRV/Interfaces/SPIRVImageInterfaces.h

  Log Message:
  -----------
  [mlir][spirv] Add missing #include in SPIRVImageInterfaces.h (#153727)

SPIRVImageInterfaces.h.inc uses some types, e.g. mlir::TypedValue,
without including the necessary headers. This is fine most of the time,
but we did run into a weird case where bazel fails to compile
//mlir:SPIRVImageInterfaces on clang19 for ChromiumOS when parse_headers
(see [1]) is specified.

[1]: https://bazel.build/docs/bazel-and-cpp#toolchain-features

  Commit: bf2f2414585239b0cffc0a866f1f646d2f4b65d9
      https://github.com/llvm/llvm-project/commit/bf2f2414585239b0cffc0a866f1f646d2f4b65d9
  Author: Alexey Bataev <a.bataev at outlook.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
    M llvm/test/Transforms/SLPVectorizer/AArch64/alternate-vectorization-split-node.ll
    M llvm/test/Transforms/SLPVectorizer/X86/load-merge-inseltpoison.ll
    M llvm/test/Transforms/SLPVectorizer/X86/load-merge.ll

  Log Message:
  -----------
  [SLP]Support LShr as base for copyable elements

Added support for LShr instructions as base for copyable elements. Also,
added simple analysis for best base instruction selection, if multiple
candidates are available.

Fixed scheduling after cancellation

Reviewers: hiraditya, RKSimon

Reviewed By: RKSimon

Pull Request: https://github.com/llvm/llvm-project/pull/153393

  Commit: 93d24b6b7b148c47a2fa228a4ef31524fa1d9f3f
      https://github.com/llvm/llvm-project/commit/93d24b6b7b148c47a2fa228a4ef31524fa1d9f3f
  Author: Mircea Trofin <mtrofin at google.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/include/llvm/Transforms/Instrumentation/PGOInstrumentation.h
    M llvm/lib/Transforms/Instrumentation/PGOCtxProfFlattening.cpp
    M llvm/lib/Transforms/Instrumentation/PGOInstrumentation.cpp
    M llvm/lib/Transforms/Instrumentation/PGOMemOPSizeOpt.cpp

  Log Message:
  -----------
  [NFC][PGO] Drop unused `Module` parameter in `setProfMetadata` (#153733)

  Commit: 1761b80a4aa19829e37a2b81e7b891c1dfa31f10
      https://github.com/llvm/llvm-project/commit/1761b80a4aa19829e37a2b81e7b891c1dfa31f10
  Author: Luke Lau <luke at igalia.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/docs/ReleaseNotes.md

  Log Message:
  -----------
  [RISCV] Add a release note about tail folding being enabled. NFC (#153535)

It's probably useful for users to know how to get the old scalar
epilogue back if they need it.

  Commit: 4f15debe13af14e0fd668ee4861aff6dbc5283df
      https://github.com/llvm/llvm-project/commit/4f15debe13af14e0fd668ee4861aff6dbc5283df
  Author: Aiden Grossman <aidengrossman at google.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M utils/bazel/llvm-project-overlay/libc/BUILD.bazel

  Log Message:
  -----------
  [bazel] Reformat libc build

Buildifier flags some issues, so run it over everything and have it
automatically reformat the files.

  Commit: 0923aafcf9159735934fb936f9ba36121a40fe4d
      https://github.com/llvm/llvm-project/commit/0923aafcf9159735934fb936f9ba36121a40fe4d
  Author: Kazu Hirata <kazu at google.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/tools/llvm-c-test/debuginfo.c

  Log Message:
  -----------
  [llvm-c-test] Fix a warning

This patch fixes:

  llvm/tools/llvm-c-test/debuginfo.c:447:27: error: unused variable
  'ME' [-Werror,-Wunused-variable]

  Commit: 3b4775d31d8b5f4005e6d32539834bec990dd481
      https://github.com/llvm/llvm-project/commit/3b4775d31d8b5f4005e6d32539834bec990dd481
  Author: Mircea Trofin <mtrofin at google.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/include/llvm/IR/ProfDataUtils.h
    M llvm/include/llvm/Transforms/Utils/Instrumentation.h
    M llvm/lib/IR/ProfDataUtils.cpp
    M llvm/lib/Transforms/Instrumentation/PGOInstrumentation.cpp

  Log Message:
  -----------
  [NFC][PGO] Factor downscaling of branch weights out of `Instrumentation` into `ProfileData` (#153735)

The logic isn’t instrumentation-specific, and the refactoring allows users to avoid a dependency on `Instrumentation` and just take one on `ProfileData` (which is a fairly low-level dependency).
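
For readers unfamiliar with the idea, one common way to downscale 64-bit
counts into 32-bit branch weights is sketched below. This is an
assumption about the general technique, not a copy of the function that
was moved; `downscaleToU32` is a hypothetical name.

```cpp
#include <algorithm>
#include <cstdint>
#include <limits>
#include <vector>

// Divides every count by a common scale so that the largest one fits in
// uint32_t, roughly preserving the ratios between counts.
std::vector<uint32_t> downscaleToU32(const std::vector<uint64_t> &counts) {
  const uint64_t limit = std::numeric_limits<uint32_t>::max();
  uint64_t maxCount = 0;
  for (uint64_t c : counts)
    maxCount = std::max(maxCount, c);
  // A scale of 1 leaves values untouched when they already fit.
  const uint64_t scale = maxCount <= limit ? 1 : maxCount / limit + 1;
  std::vector<uint32_t> weights;
  weights.reserve(counts.size());
  for (uint64_t c : counts)
    weights.push_back(static_cast<uint32_t>(c / scale));
  return weights;
}
```

Nothing in this sketch depends on instrumentation, which is the kind of
logic that can live in a lower-level library.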

  Commit: 16314eb7312dab38d721c70f247f2117e9800704
      https://github.com/llvm/llvm-project/commit/16314eb7312dab38d721c70f247f2117e9800704
  Author: David Green <david.green at arm.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/test/CodeGen/AArch64/fadd-combines.ll
    M llvm/test/CodeGen/AArch64/vecreduce-fadd-legalization-strict.ll
    M llvm/test/CodeGen/AArch64/vecreduce-fadd-legalization.ll
    M llvm/test/CodeGen/AArch64/vecreduce-fadd.ll

  Log Message:
  -----------
  [AArch64][GlobalISel] Add additional vecreduce.fadd and fadd 0.0 tests. NFC

  Commit: 8da1ce559e63a16ad05c22787d01112859cf4d89
      https://github.com/llvm/llvm-project/commit/8da1ce559e63a16ad05c22787d01112859cf4d89
  Author: Mircea Trofin <mtrofin at google.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/IR/ProfDataUtils.cpp

  Log Message:
  -----------
  Fix after #153735 (#153749)

Example failure
<https://lab.llvm.org/buildbot/#/builders/105/builds/11073>

Seems compiler-dependent.

  Commit: 45e6951ba7b305dbad218e88d64f27c1f2c4b7cf
      https://github.com/llvm/llvm-project/commit/45e6951ba7b305dbad218e88d64f27c1f2c4b7cf
  Author: Mircea Trofin <mtrofin at google.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/IR/ProfDataUtils.cpp

  Log Message:
  -----------
  Use `uint32_t` rather than `unsigned` in `downscaleWeights` (#153750)

  Commit: e2eaea412a379742146c5c94fbf328499699b139
      https://github.com/llvm/llvm-project/commit/e2eaea412a379742146c5c94fbf328499699b139
  Author: Craig Topper <craig.topper at sifive.com>
  Date:   2025-08-14 (Thu, 14 Aug 2025)

  Changed paths:
    M llvm/lib/Target/RISCV/RISCVInstrInfoP.td
    M llvm/test/MC/RISCV/rv32p-invalid.s
    M llvm/test/MC/RISCV/rv32p-valid.s
    M llvm/test/MC/RISCV/rv64p-invalid.s
    M llvm/test/MC/RISCV/rv64p-valid.s

  Log Message:
  -----------
  [RISCV] Add MC support for more P extension instructions. (#153629)

This implements pages 10-14 from
https://jhauser.us/RISCV/ext-P/RVP-instrEncodings-015.pdf

Test cases copied from #123271 with a couple mistakes fixed.

Co-authored-by: realqhc <caiqihan021 at hotmail.com>

  Commit: b9e33fd49386a4be569e7d579c24e0e2a9607943
      https://github.com/llvm/llvm-project/commit/b9e33fd49386a4be569e7d579c24e0e2a9607943
  Author: Kareem Ergawy <kareem.ergawy at amd.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M flang/lib/Lower/Bridge.cpp
    A flang/test/Lower/do_concurrent_loop_in_nested_block.f90

  Log Message:
  -----------
  [flang] Do not re-localize loop ivs when nested inside `block`s (#153350)

Consider the following example:
```fortran
  implicit none
  integer :: i, j

  do concurrent (i=1:10) local(j)
    block
      do j=1,20
      end do
    end block
  end do
```

Without the fix introduced in this PR, the compiler would "re-localize"
the `j` variable inside the `fir.do_concurrent` loop:
```mlir
    fir.do_concurrent {
      %7 = fir.alloca i32 {bindc_name = "j"}
      %8:2 = hlfir.declare %7 {uniq_name = "_QFloop_in_nested_blockEj"} : (!fir.ref<i32>) -> (!fir.ref<i32>, !fir.ref<i32>)
      ...
      fir.do_concurrent.loop (%arg0) = (%5) to (%6) step (%c1) local(@_QFloop_in_nested_blockEj_private_i32 %4#0 -> %arg1 : !fir.ref<i32>) {
        %12:2 = hlfir.declare %arg1 {uniq_name = "_QFloop_in_nested_blockEj"} : (!fir.ref<i32>) -> (!fir.ref<i32>, !fir.ref<i32>)
        ...
        %17:2 = fir.do_loop %arg2 = %14 to %15 step %c1_1 iter_args(%arg3 = %16) -> (index, i32) {
          fir.store %arg3 to %8#0 : !fir.ref<i32>
          ...
        }
      }
    }
```

This happened because we did a shallow look-up of `j` and since the loop
is nested inside a `block`, the look-up failed and we re-created a local
allocation for `j` inside the parent `fir.do_concurrent` loop. This
means that we ended up not using the actual localized symbol which is
passed as a region argument to the `fir.do_concurrent.loop` op.

In the case of `j`, we do not need to do a shallow look-up. The shallow
look-up is only needed if a symbol is an OpenMP private one or an
iteration variable of a `do concurrent` loop, and neither applies to
`j`.

With the fix, `j` is properly resolved to the `local` region argument:
```mlir
    fir.do_concurrent {
      ...
      fir.do_concurrent.loop (%arg0) = (%5) to (%6) step (%c1) local(@_QFloop_in_nested_blockEj_private_i32 %4#0 -> %arg1 : !fir.ref<i32>) {
        ...
        %10:2 = hlfir.declare %arg1 {uniq_name = "_QFloop_in_nested_blockEj"} : (!fir.ref<i32>) -> (!fir.ref<i32>, !fir.ref<i32>)
        ...
        %15:2 = fir.do_loop %arg2 = %12 to %13 step %c1_1 iter_args(%arg3 = %14) -> (index, i32) {
          fir.store %arg3 to %10#0 : !fir.ref<i32>
          ...
        }
      }
    }
```
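
The difference between a shallow and a deep look-up can be sketched
outside the compiler. This is only an illustration: `Scope`,
`lookupShallow` and `lookupDeep` are hypothetical names, and the scope
stack below is an assumed stand-in, not flang's actual symbol map.

```cpp
#include <optional>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical model of nested scopes: each scope maps a symbol name to the
// id of the value bound to it. This is not flang's actual symbol map.
using Scope = std::unordered_map<std::string, int>;

// Shallow look-up: consult only the innermost scope.
std::optional<int> lookupShallow(const std::vector<Scope>& scopes,
                                 const std::string& name) {
  if (scopes.empty()) return std::nullopt;
  auto it = scopes.back().find(name);
  if (it == scopes.back().end()) return std::nullopt;
  return it->second;
}

// Deep look-up: walk from the innermost scope outwards.
std::optional<int> lookupDeep(const std::vector<Scope>& scopes,
                              const std::string& name) {
  for (auto s = scopes.rbegin(); s != scopes.rend(); ++s) {
    auto it = s->find(name);
    if (it != s->end()) return it->second;
  }
  return std::nullopt;
}
```

With `j` bound in an outer scope and an empty `block` scope pushed on
top, the shallow look-up finds nothing while the deep look-up still
reaches the outer binding.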

  Commit: cbbf303ff51b61315f82b0f87bb52db2bedf2b78
      https://github.com/llvm/llvm-project/commit/cbbf303ff51b61315f82b0f87bb52db2bedf2b78
  Author: Nikolas Klauser <nikolasklauser at berlin.de>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M libcxx/docs/ReleaseNotes/22.rst
    M libcxx/include/__hash_table
    M libcxx/include/unordered_map
    M libcxx/include/unordered_set
    M libcxx/test/std/containers/unord/unord.map/unord.map.cnstr/assign_copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.map/unord.map.cnstr/copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.map/unord.map.cnstr/copy_alloc.pass.cpp
    M libcxx/test/std/containers/unord/unord.multimap/unord.multimap.cnstr/assign_copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.multimap/unord.multimap.cnstr/copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.multimap/unord.multimap.cnstr/copy_alloc.pass.cpp
    M libcxx/test/std/containers/unord/unord.multiset/unord.multiset.cnstr/assign_copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.multiset/unord.multiset.cnstr/copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.multiset/unord.multiset.cnstr/copy_alloc.pass.cpp
    M libcxx/test/std/containers/unord/unord.set/unord.set.cnstr/assign_copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.set/unord.set.cnstr/copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.set/unord.set.cnstr/copy_alloc.pass.cpp

  Log Message:
  -----------
  [libc++] Optimize __hash_table copy constructors and assignment (#151951)

```
----------------------------------------------------------------------------------------------------------------------
Benchmark                                                                                             old          new
----------------------------------------------------------------------------------------------------------------------
std::unordered_set<int>::ctor(const&)/0                                                           15.4 ns      14.6 ns
std::unordered_set<int>::ctor(const&)/32                                                           686 ns       322 ns
std::unordered_set<int>::ctor(const&)/1024                                                       35839 ns     21490 ns
std::unordered_set<int>::ctor(const&)/8192                                                      385790 ns    280270 ns
std::unordered_set<int>::operator=(const&) (into cleared Container)/0                             15.1 ns      15.9 ns
std::unordered_set<int>::operator=(const&) (into cleared Container)/32                            1077 ns       333 ns
std::unordered_set<int>::operator=(const&) (into cleared Container)/1024                         31296 ns      9984 ns
std::unordered_set<int>::operator=(const&) (into cleared Container)/8192                        266776 ns    109418 ns
std::unordered_set<int>::operator=(const&) (into partially populated Container)/0                 15.1 ns      16.3 ns
std::unordered_set<int>::operator=(const&) (into partially populated Container)/32                 962 ns       320 ns
std::unordered_set<int>::operator=(const&) (into partially populated Container)/1024             31713 ns     10128 ns
std::unordered_set<int>::operator=(const&) (into partially populated Container)/8192            266113 ns    108525 ns
std::unordered_set<int>::operator=(const&) (into populated Container)/0                          0.990 ns      2.03 ns
std::unordered_set<int>::operator=(const&) (into populated Container)/32                           963 ns       263 ns
std::unordered_set<int>::operator=(const&) (into populated Container)/1024                       27600 ns      7793 ns
std::unordered_set<int>::operator=(const&) (into populated Container)/8192                      235295 ns     66248 ns
std::unordered_set<std::string>::ctor(const&)/0                                                   16.0 ns      15.0 ns
std::unordered_set<std::string>::ctor(const&)/32                                                  2950 ns      1277 ns
std::unordered_set<std::string>::ctor(const&)/1024                                              246935 ns     73762 ns
std::unordered_set<std::string>::ctor(const&)/8192                                             3310895 ns   2468608 ns
std::unordered_set<std::string>::operator=(const&) (into cleared Container)/0                     16.1 ns      15.8 ns
std::unordered_set<std::string>::operator=(const&) (into cleared Container)/32                    5856 ns      1039 ns
std::unordered_set<std::string>::operator=(const&) (into cleared Container)/1024                170436 ns     74836 ns
std::unordered_set<std::string>::operator=(const&) (into cleared Container)/8192               1574235 ns   1096891 ns
std::unordered_set<std::string>::operator=(const&) (into partially populated Container)/0         16.0 ns      16.3 ns
std::unordered_set<std::string>::operator=(const&) (into partially populated Container)/32        5571 ns      1064 ns
std::unordered_set<std::string>::operator=(const&) (into partially populated Container)/1024    199220 ns     75462 ns
std::unordered_set<std::string>::operator=(const&) (into partially populated Container)/8192   1552465 ns   1116094 ns
std::unordered_set<std::string>::operator=(const&) (into populated Container)/0                   1.70 ns      2.14 ns
std::unordered_set<std::string>::operator=(const&) (into populated Container)/32                  2562 ns       645 ns
std::unordered_set<std::string>::operator=(const&) (into populated Container)/1024              228608 ns     39100 ns
std::unordered_set<std::string>::operator=(const&) (into populated Container)/8192             2013723 ns    390401 ns
```

Fixes #77657

  Commit: d9bc548facf3929b45a68d0d8ae44778afe66d8f
      https://github.com/llvm/llvm-project/commit/d9bc548facf3929b45a68d0d8ae44778afe66d8f
  Author: Nikolas Klauser <nikolasklauser at berlin.de>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M libcxx/docs/ReleaseNotes/22.rst
    M libcxx/include/__tree
    M libcxx/test/std/containers/associative/multimap/multimap.ops/find.pass.cpp

  Log Message:
  -----------
  [libc++] Optimize __tree::find and __tree::__erase_unique (#152370)

This patch changes `__tree::find` to return as soon as it has found any
equal element, instead of returning the lower bound of the equal
elements. For `map` and `set` there is no observable difference, since
the keys are unique. For their `multi` versions, however, this can mean
a change in behaviour, since it is no longer guaranteed that `find` will
return the first element.
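
Code that needs the first of several equal elements in a `multimap` can
ask for it explicitly with `lower_bound`. A minimal sketch, assuming
that is what the caller wants; `find_first` is a hypothetical helper
name and not part of libc++:

```cpp
#include <map>

// Hypothetical helper (not part of libc++): returns an iterator to the first
// element whose key is equivalent to `key`, or `m.end()` if there is none.
// `lower_bound` yields the first element that is not ordered before `key`.
template <class Key, class T, class Compare, class Alloc>
typename std::multimap<Key, T, Compare, Alloc>::const_iterator
find_first(const std::multimap<Key, T, Compare, Alloc>& m, const Key& key) {
  auto it = m.lower_bound(key);
  // lower_bound may point at a larger key; confirm equivalence with key_comp.
  if (it != m.end() && !m.key_comp()(key, it->first))
    return it;
  return m.end();
}
```

The same pattern applies to `multiset`, comparing against `*it` instead
of `it->first`.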

```
------------------------------------------------------------------------------------------
Benchmark                                                                  old         new
------------------------------------------------------------------------------------------
std::map<int, int>::erase(key) (existent)/0                           24.4 ns      24.9 ns
std::map<int, int>::erase(key) (existent)/32                          39.8 ns      32.1 ns
std::map<int, int>::erase(key) (existent)/1024                        83.8 ns      52.5 ns
std::map<int, int>::erase(key) (existent)/8192                        91.4 ns      66.4 ns
std::map<int, int>::erase(key) (non-existent)/0                      0.511 ns     0.328 ns
std::map<int, int>::erase(key) (non-existent)/32                      9.12 ns      5.62 ns
std::map<int, int>::erase(key) (non-existent)/1024                    26.6 ns      11.3 ns
std::map<int, int>::erase(key) (non-existent)/8192                    37.0 ns      16.9 ns
std::map<int, int>::find(key) (existent)/0                           0.007 ns     0.007 ns
std::map<int, int>::find(key) (existent)/32                           6.02 ns      4.32 ns
std::map<int, int>::find(key) (existent)/1024                         13.6 ns      8.35 ns
std::map<int, int>::find(key) (existent)/8192                         30.3 ns      12.8 ns
std::map<int, int>::find(key) (non-existent)/0                       0.299 ns     0.545 ns
std::map<int, int>::find(key) (non-existent)/32                       8.78 ns      4.60 ns
std::map<int, int>::find(key) (non-existent)/1024                     26.1 ns      21.8 ns
std::map<int, int>::find(key) (non-existent)/8192                     36.2 ns      27.9 ns
std::map<std::string, int>::erase(key) (existent)/0                   74.1 ns      76.7 ns
std::map<std::string, int>::erase(key) (existent)/32                   161 ns       114 ns
std::map<std::string, int>::erase(key) (existent)/1024                 196 ns       126 ns
std::map<std::string, int>::erase(key) (existent)/8192                 207 ns       160 ns
std::map<std::string, int>::erase(key) (non-existent)/0              0.754 ns     0.328 ns
std::map<std::string, int>::erase(key) (non-existent)/32              47.3 ns      40.7 ns
std::map<std::string, int>::erase(key) (non-existent)/1024             122 ns      96.1 ns
std::map<std::string, int>::erase(key) (non-existent)/8192             168 ns       123 ns
std::map<std::string, int>::find(key) (existent)/0                   0.059 ns     0.058 ns
std::map<std::string, int>::find(key) (existent)/32                   54.3 ns      34.6 ns
std::map<std::string, int>::find(key) (existent)/1024                  125 ns      64.5 ns
std::map<std::string, int>::find(key) (existent)/8192                  159 ns      79.2 ns
std::map<std::string, int>::find(key) (non-existent)/0               0.311 ns     0.299 ns
std::map<std::string, int>::find(key) (non-existent)/32               44.0 ns      42.7 ns
std::map<std::string, int>::find(key) (non-existent)/1024              120 ns      92.6 ns
std::map<std::string, int>::find(key) (non-existent)/8192              189 ns       124 ns
std::set<int>::erase(key) (existent)/0                                25.1 ns      25.1 ns
std::set<int>::erase(key) (existent)/32                               42.1 ns      33.1 ns
std::set<int>::erase(key) (existent)/1024                             73.8 ns      55.5 ns
std::set<int>::erase(key) (existent)/8192                              101 ns      68.8 ns
std::set<int>::erase(key) (non-existent)/0                           0.511 ns     0.328 ns
std::set<int>::erase(key) (non-existent)/32                           9.60 ns      4.67 ns
std::set<int>::erase(key) (non-existent)/1024                         26.5 ns      11.2 ns
std::set<int>::erase(key) (non-existent)/8192                         46.2 ns      16.8 ns
std::set<int>::find(key) (existent)/0                                0.008 ns     0.007 ns
std::set<int>::find(key) (existent)/32                                5.87 ns      4.51 ns
std::set<int>::find(key) (existent)/1024                              14.3 ns      8.69 ns
std::set<int>::find(key) (existent)/8192                              30.2 ns      12.8 ns
std::set<int>::find(key) (non-existent)/0                            0.531 ns     0.530 ns
std::set<int>::find(key) (non-existent)/32                            8.77 ns      4.64 ns
std::set<int>::find(key) (non-existent)/1024                          26.1 ns      21.7 ns
std::set<int>::find(key) (non-existent)/8192                          36.3 ns      27.8 ns
std::set<std::string>::erase(key) (existent)/0                        93.2 ns      70.2 ns
std::set<std::string>::erase(key) (existent)/32                        164 ns       116 ns
std::set<std::string>::erase(key) (existent)/1024                      161 ns       136 ns
std::set<std::string>::erase(key) (existent)/8192                      231 ns       140 ns
std::set<std::string>::erase(key) (non-existent)/0                   0.532 ns     0.326 ns
std::set<std::string>::erase(key) (non-existent)/32                   43.4 ns      40.1 ns
std::set<std::string>::erase(key) (non-existent)/1024                  122 ns      99.5 ns
std::set<std::string>::erase(key) (non-existent)/8192                  168 ns       125 ns
std::set<std::string>::find(key) (existent)/0                        0.059 ns     0.059 ns
std::set<std::string>::find(key) (existent)/32                        53.1 ns      35.5 ns
std::set<std::string>::find(key) (existent)/1024                       124 ns      61.2 ns
std::set<std::string>::find(key) (existent)/8192                       154 ns      73.9 ns
std::set<std::string>::find(key) (non-existent)/0                    0.532 ns     0.301 ns
std::set<std::string>::find(key) (non-existent)/32                    44.4 ns      39.5 ns
std::set<std::string>::find(key) (non-existent)/1024                   120 ns      95.5 ns
std::set<std::string>::find(key) (non-existent)/8192                   193 ns       119 ns
std::multimap<int, int>::erase(key) (existent)/0                       26.5 ns     26.6 ns
std::multimap<int, int>::erase(key) (existent)/32                      33.5 ns     32.9 ns
std::multimap<int, int>::erase(key) (existent)/1024                    55.5 ns     58.0 ns
std::multimap<int, int>::erase(key) (existent)/8192                    67.4 ns     70.0 ns
std::multimap<int, int>::erase(key) (non-existent)/0                  0.523 ns    0.532 ns
std::multimap<int, int>::erase(key) (non-existent)/32                  5.08 ns     5.09 ns
std::multimap<int, int>::erase(key) (non-existent)/1024                13.0 ns     12.9 ns
std::multimap<int, int>::erase(key) (non-existent)/8192                19.6 ns     19.8 ns
std::multimap<int, int>::find(key) (existent)/0                       0.015 ns    0.037 ns
std::multimap<int, int>::find(key) (existent)/32                       7.07 ns     3.85 ns
std::multimap<int, int>::find(key) (existent)/1024                     22.0 ns     7.44 ns
std::multimap<int, int>::find(key) (existent)/8192                     37.6 ns     12.0 ns
std::multimap<int, int>::find(key) (non-existent)/0                   0.297 ns    0.305 ns
std::multimap<int, int>::find(key) (non-existent)/32                   8.79 ns     4.59 ns
std::multimap<int, int>::find(key) (non-existent)/1024                 26.0 ns     11.2 ns
std::multimap<int, int>::find(key) (non-existent)/8192                 36.4 ns     16.8 ns
std::multimap<std::string, int>::erase(key) (existent)/0               93.4 ns     84.5 ns
std::multimap<std::string, int>::erase(key) (existent)/32               101 ns      101 ns
std::multimap<std::string, int>::erase(key) (existent)/1024             118 ns      126 ns
std::multimap<std::string, int>::erase(key) (existent)/8192             108 ns      124 ns
std::multimap<std::string, int>::erase(key) (non-existent)/0           2.39 ns     2.43 ns
std::multimap<std::string, int>::erase(key) (non-existent)/32          44.4 ns     49.7 ns
std::multimap<std::string, int>::erase(key) (non-existent)/1024         108 ns      103 ns
std::multimap<std::string, int>::erase(key) (non-existent)/8192         140 ns      125 ns
std::multimap<std::string, int>::find(key) (existent)/0               0.059 ns    0.058 ns
std::multimap<std::string, int>::find(key) (existent)/32               52.3 ns     32.6 ns
std::multimap<std::string, int>::find(key) (existent)/1024              122 ns     58.9 ns
std::multimap<std::string, int>::find(key) (existent)/8192              160 ns     72.7 ns
std::multimap<std::string, int>::find(key) (non-existent)/0           0.524 ns    0.494 ns
std::multimap<std::string, int>::find(key) (non-existent)/32           43.8 ns     38.9 ns
std::multimap<std::string, int>::find(key) (non-existent)/1024          123 ns     90.8 ns
std::multimap<std::string, int>::find(key) (non-existent)/8192          190 ns      126 ns
std::multiset<int>::erase(key) (existent)/0                            27.1 ns     26.8 ns
std::multiset<int>::erase(key) (existent)/32                           33.3 ns     34.1 ns
std::multiset<int>::erase(key) (existent)/1024                         58.5 ns     58.8 ns
std::multiset<int>::erase(key) (existent)/8192                         66.7 ns     64.1 ns
std::multiset<int>::erase(key) (non-existent)/0                       0.318 ns    0.325 ns
std::multiset<int>::erase(key) (non-existent)/32                       5.15 ns     5.25 ns
std::multiset<int>::erase(key) (non-existent)/1024                     12.9 ns     12.7 ns
std::multiset<int>::erase(key) (non-existent)/8192                     20.3 ns     20.3 ns
std::multiset<int>::find(key) (existent)/0                            0.043 ns    0.015 ns
std::multiset<int>::find(key) (existent)/32                            6.94 ns     4.22 ns
std::multiset<int>::find(key) (existent)/1024                          21.4 ns     8.23 ns
std::multiset<int>::find(key) (existent)/8192                          37.4 ns     12.6 ns
std::multiset<int>::find(key) (non-existent)/0                        0.515 ns    0.300 ns
std::multiset<int>::find(key) (non-existent)/32                        8.52 ns     4.62 ns
std::multiset<int>::find(key) (non-existent)/1024                      25.5 ns     11.3 ns
std::multiset<int>::find(key) (non-existent)/8192                      36.5 ns     27.0 ns
std::multiset<std::string>::erase(key) (existent)/0                    81.9 ns     77.5 ns
std::multiset<std::string>::erase(key) (existent)/32                    113 ns      129 ns
std::multiset<std::string>::erase(key) (existent)/1024                  132 ns      148 ns
std::multiset<std::string>::erase(key) (existent)/8192                  114 ns      165 ns
std::multiset<std::string>::erase(key) (non-existent)/0                2.33 ns     2.32 ns
std::multiset<std::string>::erase(key) (non-existent)/32               44.4 ns     42.0 ns
std::multiset<std::string>::erase(key) (non-existent)/1024             97.3 ns     95.1 ns
std::multiset<std::string>::erase(key) (non-existent)/8192              132 ns      123 ns
std::multiset<std::string>::find(key) (existent)/0                    0.058 ns    0.059 ns
std::multiset<std::string>::find(key) (existent)/32                    48.3 ns     34.4 ns
std::multiset<std::string>::find(key) (existent)/1024                   121 ns     61.9 ns
std::multiset<std::string>::find(key) (existent)/8192                   155 ns     77.7 ns
std::multiset<std::string>::find(key) (non-existent)/0                0.524 ns    0.306 ns
std::multiset<std::string>::find(key) (non-existent)/32                44.1 ns     40.4 ns
std::multiset<std::string>::find(key) (non-existent)/1024               121 ns     96.3 ns
std::multiset<std::string>::find(key) (non-existent)/8192               193 ns      121 ns
```

  Commit: ae90b60fc8661956250a0b8185be37e8fb92c0d7
      https://github.com/llvm/llvm-project/commit/ae90b60fc8661956250a0b8185be37e8fb92c0d7
  Author: Doug Gregor <dgregor at apple.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M clang/docs/APINotes.rst
    M clang/include/clang/APINotes/Types.h
    M clang/lib/APINotes/APINotesFormat.h
    M clang/lib/APINotes/APINotesReader.cpp
    M clang/lib/APINotes/APINotesWriter.cpp
    M clang/lib/APINotes/APINotesYAMLCompiler.cpp
    M clang/lib/Sema/SemaAPINotes.cpp
    M clang/test/APINotes/Inputs/Headers/SwiftImportAs.apinotes
    M clang/test/APINotes/Inputs/Headers/SwiftImportAs.h
    M clang/test/APINotes/swift-import-as.cpp

  Log Message:
  -----------
  [APINotes] Add SwiftDestroyOp API note to map to the "destroy" function (#153261)

Like retain/release for reference types, "destroy" lets us specify an
operation that is used to deinitialize an instance of a noncopyable
type.

  Commit: 393c21137e345626136630d9679cbccb6e1e96f8
      https://github.com/llvm/llvm-project/commit/393c21137e345626136630d9679cbccb6e1e96f8
  Author: Cullen Rhodes <cullen.rhodes at arm.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/lib/CodeGen/SelectionDAG/StatepointLowering.cpp
    A llvm/test/CodeGen/AArch64/pr142314.ll
    M llvm/test/CodeGen/X86/pr33010.ll

  Log Message:
  -----------
  [StatepointLowering] Use FrameIndex instead of TargetFrameIndex (#153555)

TargetFrameIndex shouldn't be used as an operand to a
target-independent node such as a load. This causes ISel issues.

#81635 fixed a similar issue with this code using a TargetConstant,
instead of a Constant.

Fixes #142314.

  Commit: b8d74ad2b302ff4ee73a6e4530042658b4177ff8
      https://github.com/llvm/llvm-project/commit/b8d74ad2b302ff4ee73a6e4530042658b4177ff8
  Author: Mircea Trofin <mtrofin at google.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/lib/Transforms/Scalar/JumpTableToSwitch.cpp
    M llvm/test/Transforms/JumpTableToSwitch/basic.ll

  Log Message:
  -----------
  [JTS] Use common branch weight downscaling (#153738)

This also fixes a bug introduced accidentally in #153651, whereby
`JumpTableToSwitch` would convert all the branch weights to 0 except
for one. It didn't trip the test because `update_test_checks` wasn't
run with `-check-globals`. It is now. This also made it noticeable that
the direct calls promoted from the indirect call inherited the `VP`
metadata, which should be dropped because it no longer makes sense.
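
For readers unfamiliar with branch weight downscaling, the general idea
can be sketched as follows. This is a hypothetical standalone function
(`downscale` is an assumed name), not the LLVM helper used by the patch:
it divides every 64-bit count by one common factor so that the largest
fits in 32 bits and the ratios stay roughly intact.

```cpp
#include <algorithm>
#include <cstdint>
#include <limits>
#include <vector>

// Hypothetical sketch, not the LLVM helper: scale 64-bit counts down so that
// the largest fits in 32 bits, keeping the ratios roughly intact.
std::vector<uint32_t> downscale(const std::vector<uint64_t>& weights) {
  const uint64_t max32 = std::numeric_limits<uint32_t>::max();
  uint64_t maxWeight = 0;
  for (uint64_t w : weights) maxWeight = std::max(maxWeight, w);
  // One common divisor for all weights; it is 1 when every weight is
  // strictly below the 32-bit maximum.
  const uint64_t scale = maxWeight / max32 + 1;
  std::vector<uint32_t> out;
  out.reserve(weights.size());
  for (uint64_t w : weights) out.push_back(static_cast<uint32_t>(w / scale));
  return out;
}
```

Because the same divisor is applied to every weight, a non-zero weight
only becomes 0 when it is smaller than the divisor.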

  Commit: fdd2d4df1212ef6b7c8e0dfbba8f2a24343d2d9d
      https://github.com/llvm/llvm-project/commit/fdd2d4df1212ef6b7c8e0dfbba8f2a24343d2d9d
  Author: Gaëtan Bossu <gaetan.bossu at arm.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/lib/Target/AArch64/AArch64ExpandPseudoInsts.cpp
    M llvm/lib/Target/AArch64/AArch64InstrFormats.td
    M llvm/lib/Target/AArch64/AArch64InstrInfo.h
    M llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td
    M llvm/lib/Target/AArch64/AArch64SchedA320.td
    M llvm/lib/Target/AArch64/AArch64SchedA510.td
    M llvm/lib/Target/AArch64/AArch64SchedNeoverseN2.td
    M llvm/lib/Target/AArch64/AArch64SchedNeoverseN3.td
    M llvm/lib/Target/AArch64/AArch64SchedNeoverseV1.td
    M llvm/lib/Target/AArch64/AArch64SchedNeoverseV2.td
    M llvm/lib/Target/AArch64/SVEInstrFormats.td
    A llvm/test/CodeGen/AArch64/expand-constructive-zzi.mir

  Log Message:
  -----------
  [AArch64] Define constructive EXT_ZZI pseudo instruction (#152552)

It will get expanded into MOVPRFX_ZZ and EXT_ZZI by the
AArch64ExpandPseudo pass. This instruction takes a single Z register as
input, as opposed to the existing destructive EXT_ZZI instruction.

Note that this patch only defines the pseudo; it isn't used in any ISel
pattern yet. It will later be used for vector.extract.

  Commit: ac005e16f617451ad2dc0c794661159cb8111f72
      https://github.com/llvm/llvm-project/commit/ac005e16f617451ad2dc0c794661159cb8111f72
  Author: Diana Picus <Diana-Magda.Picus at amd.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/include/llvm/IR/IntrinsicsAMDGPU.td
    M llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp
    M llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
    M llvm/lib/IR/Verifier.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUCallLowering.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUSearchableTables.td
    M llvm/test/Analysis/UniformityAnalysis/AMDGPU/intrinsics.ll
    A llvm/test/CodeGen/AMDGPU/amdgcn-call-whole-wave.ll
    M llvm/test/CodeGen/AMDGPU/irtranslator-whole-wave-functions.ll
    M llvm/test/CodeGen/AMDGPU/isel-whole-wave-functions.ll
    M llvm/test/CodeGen/AMDGPU/whole-wave-functions.ll
    A llvm/test/Verifier/AMDGPU/intrinsic-amdgcn-call-whole-wave.ll

  Log Message:
  -----------
  Reapply "[AMDGPU] Intrinsic for launching whole wave functions" (#153584)

This reverts commit 14cd1339318b16e08c1363ec6896bd7d1e4ae281. The
buildbot failure seems to have been a cmake issue which has been
discussed in more detail in this Discourse post:

https://discourse.llvm.org/t/cmake-doesnt-regenerate-all-tablegen-target-files/87901

If any buildbots fail to select arbitrary intrinsics with this patch,
it's worth considering using clean builds with ccache instead of
incremental builds, as recommended here:

https://llvm.org/docs/HowToAddABuilder.html#:~:text=Use%20CCache%20and%20NOT%20incremental%20builds

The original commit message for this patch:
Add the llvm.amdgcn.call.whole.wave intrinsic for calling whole wave
functions. This will take as its first argument the callee with the
amdgpu_gfx_whole_wave calling convention, followed by the call
parameters which must match the signature of the callee except for the
first function argument (the i1 original EXEC mask, which doesn't need
to be passed in). Indirect calls are not allowed.

Make direct calls to amdgpu_gfx_whole_wave functions a verifier error.

Tail calls are handled in a future patch.

  Commit: cfdbbb8fcc62f75dcf0b251dfab63353f3921227
      https://github.com/llvm/llvm-project/commit/cfdbbb8fcc62f75dcf0b251dfab63353f3921227
  Author: Dan Blackwell <dan_blackwell at apple.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M compiler-rt/lib/sanitizer_common/sanitizer_redefine_builtins.h

  Log Message:
  -----------
  [sanitizer_common] Disable sanitizer_redefine_builtins on Apple platforms (#153381)

This currently generates many linker warnings of this form, due to
defining mem(cpy|move|set) in every object file:
```
ld: warning: '.../build/projects/compiler-rt/lib/interception/CMakeFiles/RTInterception.ios.dir/interception_linux.cpp.o' has malformed LC_DYSYMTAB, expected 6 undefined symbols to start at index 1, found 3 undefined symbols starting at index 1
```

In order for this to actually replace these symbols on Mach-O, they
would need a leading underscore, e.g. `.set _memcpy,
___sanitizer_internal_memcpy`. However, doing so does not fix the
warnings, and furthermore it ends up replacing `REAL(memcpy)` calls with
`__sanitizer_internal_memcpy` in places such as
`__asan::Allocator::Reallocate`. There is no way on Apple platforms to
recreate the intended behaviour, so let's just disable this on them to
reduce warning noise.

rdar://123771479

  Commit: 36be0bba2a8e45c48ae4336eff73abbcc77ee1f5
      https://github.com/llvm/llvm-project/commit/36be0bba2a8e45c48ae4336eff73abbcc77ee1f5
  Author: Florian Hahn <flo at fhahn.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/lib/Analysis/ScalarEvolution.cpp
    M llvm/test/Transforms/LoopVectorize/first-order-recurrence-dead-instructions.ll

  Log Message:
  -----------
  [SCEV] Check if predicate is known false for predicated AddRecs. (#151134)

Similarly to https://github.com/llvm/llvm-project/pull/131538, we can
also try and check if a predicate is known to wrap given the backedge
taken count.

For now, this just checks directly when we try to create predicated
AddRecs. This both helps to avoid spending compile-time on optimizations
where we know the predicate is false, and can also help to allow
additional vectorization (e.g. by deciding to scalarize memory accesses
when otherwise we would try to create a predicated AddRec with a
predicate that's always false).
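
To illustrate what "known to wrap given the backedge taken count" means,
here is a sketch on plain integers. It is not SCEV's logic: the function
`knownToWrapUnsigned` and the 32-bit recurrence `{start,+,step}` are
assumptions made for the example.

```cpp
#include <cstdint>

// Hypothetical illustration: does the 32-bit recurrence {start,+,step}
// wrap (unsigned) at some point during `btc` backedge-taken iterations?
// The final value start + step * btc is computed in 64 bits, where it cannot
// overflow, and compared against the 32-bit maximum.
bool knownToWrapUnsigned(uint32_t start, uint32_t step, uint32_t btc) {
  const uint64_t last = uint64_t{start} + uint64_t{step} * uint64_t{btc};
  return last > UINT32_MAX;
}
```

When such a check returns true, a "does not wrap" predicate on that
recurrence can never hold, so there is no point in building code guarded
by it.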

The initial version is quite restricted, but can be extended in
follow-ups to cover more cases.

PR: https://github.com/llvm/llvm-project/pull/151134

  Commit: 598562077af801632ff2547b46219720fb78c689
      https://github.com/llvm/llvm-project/commit/598562077af801632ff2547b46219720fb78c689
  Author: Nikita Popov <npopov at redhat.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/tools/llvm-c-test/debuginfo.c

  Log Message:
  -----------
  [llvm-c] Fix memory leak in test

  Commit: 30c79511360de82c57cf9a78fff9fb10a8ccc58a
      https://github.com/llvm/llvm-project/commit/30c79511360de82c57cf9a78fff9fb10a8ccc58a
  Author: Ross Brunton <ross at codeplay.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M offload/liboffload/API/APIDefs.td
    M offload/liboffload/API/Queue.td
    M offload/liboffload/src/OffloadImpl.cpp
    M offload/plugins-nextgen/amdgpu/src/rtl.cpp
    M offload/plugins-nextgen/common/include/PluginInterface.h
    M offload/plugins-nextgen/common/src/PluginInterface.cpp
    M offload/plugins-nextgen/cuda/src/rtl.cpp
    M offload/plugins-nextgen/host/src/rtl.cpp
    M offload/unittests/OffloadAPI/CMakeLists.txt
    A offload/unittests/OffloadAPI/queue/olLaunchHostFunction.cpp

  Log Message:
  -----------
  [Offload] `olLaunchHostFunction` (#152482)

Add an `olLaunchHostFunction` method that allows enqueueing host work
to the stream.

  Commit: b014d10ed7a1ec9864016a9db0e7323857bc2a3f
      https://github.com/llvm/llvm-project/commit/b014d10ed7a1ec9864016a9db0e7323857bc2a3f
  Author: Simon Pilgrim <llvm-dev at redking.me.uk>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M clang/test/CodeGen/X86/avx512cd-builtins.c
    M clang/test/CodeGen/X86/avx512vlcd-builtins.c

  Log Message:
  -----------
  [X86] avx512cd-builtins.c + avx512vlcd-builtins.c - add C/C++ and 32/64-bit test coverage

  Commit: 6bb8f6f2d0ed672217e0a0521afc5b86913b717e
      https://github.com/llvm/llvm-project/commit/6bb8f6f2d0ed672217e0a0521afc5b86913b717e
  Author: Ferdinand Lemaire <flscminecraft at gmail.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M mlir/include/mlir/InitAllTranslations.h
    A mlir/include/mlir/Target/Wasm/WasmBinaryEncoding.h
    A mlir/include/mlir/Target/Wasm/WasmImporter.h
    M mlir/lib/Target/CMakeLists.txt
    A mlir/lib/Target/Wasm/CMakeLists.txt
    A mlir/lib/Target/Wasm/TranslateFromWasm.cpp
    A mlir/lib/Target/Wasm/TranslateRegistration.cpp
    M mlir/test/CMakeLists.txt
    A mlir/test/Target/Wasm/bad_wasm_version.yaml
    A mlir/test/Target/Wasm/function_export_out_of_scope.yaml
    A mlir/test/Target/Wasm/import.mlir
    A mlir/test/Target/Wasm/inputs/import.yaml.wasm
    A mlir/test/Target/Wasm/inputs/memory_min_eq_max.yaml.wasm
    A mlir/test/Target/Wasm/inputs/memory_min_max.yaml.wasm
    A mlir/test/Target/Wasm/inputs/memory_min_no_max.yaml.wasm
    A mlir/test/Target/Wasm/inputs/stats.yaml.wasm
    A mlir/test/Target/Wasm/inputs/table.yaml.wasm
    A mlir/test/Target/Wasm/invalid_function_type_index.yaml
    A mlir/test/Target/Wasm/memory_min_eq_max.mlir
    A mlir/test/Target/Wasm/memory_min_max.mlir
    A mlir/test/Target/Wasm/memory_min_no_max.mlir
    A mlir/test/Target/Wasm/missing_header.yaml
    A mlir/test/Target/Wasm/stats.mlir

  Log Message:
  -----------
  [MLIR][WASM] Introduce an importer for Wasm binaries (#152131)

First step in introducing the wasm-import target to mlir-translate.
This is the first PR to introduce the pass. With this PR, there is very
little support for the actual WebAssembly language; it is mostly there to
introduce the skeleton of the importer. A follow-up will come with
support for a wider range of operators. It was split to make it easier
to review, since it's a good chunk of work.

---------

Co-authored-by: Luc Forget <dev at alias.lforget.fr>
Co-authored-by: Ferdinand Lemaire <ferdinand.lemaire at woven-planet.global>
Co-authored-by: Jessica Paquette <jessica.paquette at woven-planet.global>
Co-authored-by: Luc Forget <luc.forget at woven.toyota>

  Commit: 21b607adbeb4326c10f899fb293d057ee1199794
      https://github.com/llvm/llvm-project/commit/21b607adbeb4326c10f899fb293d057ee1199794
  Author: Matthias Springer <me at m-sp.org>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M mlir/include/mlir/Dialect/SCF/IR/SCFOps.td
    M mlir/lib/Conversion/SCFToControlFlow/SCFToControlFlow.cpp
    M mlir/lib/Conversion/SCFToEmitC/SCFToEmitC.cpp
    M mlir/lib/Conversion/SCFToSPIRV/SCFToSPIRV.cpp
    M mlir/lib/Dialect/Linalg/Transforms/HoistPadding.cpp
    M mlir/lib/Dialect/Linalg/Transforms/Hoisting.cpp
    M mlir/lib/Dialect/SCF/IR/SCF.cpp
    M mlir/lib/Dialect/SCF/Transforms/BufferizableOpInterfaceImpl.cpp
    M mlir/lib/Dialect/SCF/Transforms/ForToWhile.cpp
    M mlir/lib/Dialect/SCF/Transforms/LoopPipelining.cpp
    M mlir/lib/Dialect/SCF/Transforms/LoopSpecialization.cpp
    M mlir/lib/Dialect/SCF/Transforms/StructuralTypeConversions.cpp
    M mlir/lib/Dialect/SCF/Transforms/TileUsingInterface.cpp
    M mlir/lib/Dialect/SCF/Utils/Utils.cpp
    M mlir/lib/Dialect/SparseTensor/Transforms/SparseVectorization.cpp
    M mlir/lib/Dialect/Vector/Transforms/VectorDistribute.cpp
    M mlir/test/Conversion/SCFToControlFlow/convert-to-cfg.mlir
    M mlir/test/Conversion/SCFToSPIRV/for.mlir
    M mlir/test/Dialect/SCF/ops.mlir

  Log Message:
  -----------
  [mlir][SCF] `scf.for`: Add support for unsigned integer comparison (#153379)

Add a new unit attribute to allow for unsigned integer comparison.

Example:
```mlir
scf.for unsigned %iv_32 = %lb_32 to %ub_32 step %step_32 : i32 {
  // body
}
```

Discussion:
https://discourse.llvm.org/t/scf-should-scf-for-support-unsigned-comparison/84655
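
To illustrate why the comparison kind matters, here is a minimal Python sketch (not MLIR code) that models a 32-bit loop bound under both interpretations. The helper names `as_signed`, `as_unsigned` and `trip_count` are hypothetical, the step is assumed to be positive, and induction-variable overflow is ignored.

```python
def as_unsigned(value: int, bits: int = 32) -> int:
    """Keep only the low `bits` bits, i.e. read the pattern as unsigned."""
    return value & ((1 << bits) - 1)


def as_signed(value: int, bits: int = 32) -> int:
    """Read the low `bits` bits as a two's-complement signed integer."""
    value = as_unsigned(value, bits)
    return value - (1 << bits) if value >= 1 << (bits - 1) else value


def trip_count(lb: int, ub: int, step: int, unsigned: bool, bits: int = 32) -> int:
    """Number of iterations of `for iv = lb; iv < ub; iv += step`.

    `unsigned` selects how the bit patterns of lb and ub are compared.
    Assumes step > 0 and ignores wrap-around of the induction variable.
    """
    interpret = as_unsigned if unsigned else as_signed
    lo, hi = interpret(lb, bits), interpret(ub, bits)
    if lo >= hi:
        return 0
    return (hi - lo + step - 1) // step


# The same bit pattern 0x80000000 is a negative bound when compared as
# signed and a large positive bound when compared as unsigned.
signed_trips = trip_count(0, 0x80000000, 0x40000000, unsigned=False)
unsigned_trips = trip_count(0, 0x80000000, 0x40000000, unsigned=True)
```

With a signed comparison the loop body never runs for this upper bound, while the unsigned comparison yields two iterations.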

  Commit: 6b16a276ef8fe6b68330ecb57f7c1ee985880967
      https://github.com/llvm/llvm-project/commit/6b16a276ef8fe6b68330ecb57f7c1ee985880967
  Author: William Huynh <William.Huynh at arm.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M libc/startup/baremetal/arm/start.cpp

  Log Message:
  -----------
  [libc] Add startup code for ARM v7-A, ARM v7-R variants (#153576)

These variants require a different exception table that requires a bit
of initialisation.

This allows us to enable testing for these variants downstream.

  Commit: 8582025f1fb9485ced594efe0661ed4a4a80d5c9
      https://github.com/llvm/llvm-project/commit/8582025f1fb9485ced594efe0661ed4a4a80d5c9
  Author: Markus Böck <markus.boeck02 at gmail.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M flang/include/flang/Optimizer/CodeGen/FIROpPatterns.h
    M mlir/include/mlir/Conversion/LLVMCommon/Pattern.h
    M mlir/include/mlir/Transforms/DialectConversion.h
    M mlir/lib/Transforms/Utils/DialectConversion.cpp
    M mlir/test/Transforms/test-legalizer.mlir
    M mlir/test/lib/Dialect/Test/TestPatterns.cpp

  Log Message:
  -----------
  [mlir][Transforms] Turn 1:N -> 1:1 dispatch fatal error into match failure (#153605)

Prior to this PR, the default behaviour of a conversion pattern which
receives operands of a 1:N type conversion was to abort the compilation. This has
historically been useful when the 1:N type conversion got merged into
the dialect conversion as it allowed us to easily find patterns that
should be capable of handling 1:N type conversions but didn't.

However, this behaviour has the disadvantage of being non-composable:
While the pattern in question cannot handle the 1:N type conversion,
another pattern in the set might, but it doesn't get the chance as
compilation is aborted.

This PR fixes this behaviour by failing to match instead of
aborting, giving other patterns the chance to legalize an op. The
implementation uses a reusable function called `dispatchTo1To1` to allow
derived conversion patterns to also implement the behaviour.
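
The composability argument can be sketched in a few lines of Python. This is a toy model, not the MLIR API: `one_to_one_only`, `one_to_n_aware` and `legalize` are hypothetical names, and each original operand is represented as the list of values it was converted to.

```python
from typing import Callable, List, Optional, Sequence

# One entry per original operand; each entry lists its converted values.
Operands = Sequence[Sequence[str]]
Pattern = Callable[[Operands], Optional[str]]


def one_to_one_only(operands: Operands) -> Optional[str]:
    """A pattern that only understands 1:1 conversions.

    Returning None models a match failure (the new behaviour) rather
    than aborting the whole compilation (the old behaviour).
    """
    if any(len(values) != 1 for values in operands):
        return None
    return "lowered-1:1(" + ", ".join(values[0] for values in operands) + ")"


def one_to_n_aware(operands: Operands) -> Optional[str]:
    """A pattern that can consume any number of values per operand."""
    flat: List[str] = [value for values in operands for value in values]
    return "lowered-1:N(" + ", ".join(flat) + ")"


def legalize(operands: Operands, patterns: Sequence[Pattern]) -> Optional[str]:
    """Try each pattern in order and return the first successful result."""
    for pattern in patterns:
        result = pattern(operands)
        if result is not None:
            return result
    return None


patterns = [one_to_one_only, one_to_n_aware]
plain = legalize([["a"], ["b"]], patterns)
split = legalize([["a"], ["b0", "b1"]], patterns)
```

Because the first pattern reports a match failure for the split operand, the second pattern still gets its turn; with an abort in the first pattern it would never run.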

  Commit: 56681c94f366c936acffc048a5a32dfa19bade1a
      https://github.com/llvm/llvm-project/commit/56681c94f366c936acffc048a5a32dfa19bade1a
  Author: Sergei Barannikov <barannikov88 at gmail.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/utils/TableGen/DecoderEmitter.cpp

  Log Message:
  -----------
  [TableGen][DecoderEmitter] Compute bit attribute once (NFC) (#153530)

Pull the logic to compute bit attributes from `filterProcessor()` to its
caller to avoid recomputing them on the second call.

  Commit: dab971ed23064c915941b463d4ffaf2356418726
      https://github.com/llvm/llvm-project/commit/dab971ed23064c915941b463d4ffaf2356418726
  Author: Pavel Labath <pavel at labath.sk>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/include/llvm/Object/SFrameParser.h
    M llvm/lib/Object/SFrameParser.cpp
    A llvm/test/tools/llvm-readobj/ELF/sframe-reloc.test
    M llvm/tools/llvm-readobj/ELFDumper.cpp

  Log Message:
  -----------
  [llvm-readobj] Dump SFrame relocations as well (#153161)

If there is a relocation for a particular FDE, print it as well. This is
mainly meant for human consumption (otherwise, there's no way to tell
which function a given (relocatable) FDE refers to). For testing of
relocation generation, I'd still recommend using the regular relocation
dumper, as this code will not detect (e.g.) any superfluous relocations.

I've considered handling relocations inside the SFrameParser class, but
I couldn't find an elegant way to do that. Right now, I don't have a use
case for resolving relocations there as lldb (my other use case for
SFrameParser) will always operate on linked objects.

  Commit: 3db17429da63b0e33fb5dda0706d3bd024ecfe2a
      https://github.com/llvm/llvm-project/commit/3db17429da63b0e33fb5dda0706d3bd024ecfe2a
  Author: Nikita Popov <npopov at redhat.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/lib/Target/Mips/MipsCCState.cpp
    M llvm/test/CodeGen/Mips/llvm.frexp.ll
    M llvm/test/CodeGen/Mips/llvm.sincos.ll

  Log Message:
  -----------
  [Mips] Add frexpl and sincosl to f128 libcall list

  Commit: 3b27d50cc792a49b4b1b6fe399f9d9085e536b55
      https://github.com/llvm/llvm-project/commit/3b27d50cc792a49b4b1b6fe399f9d9085e536b55
  Author: Yanzuo Liu <zwuis at outlook.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M clang-tools-extra/docs/ReleaseNotes.rst
    A clang-tools-extra/docs/ReleaseNotesTemplate.txt
    M clang/docs/ReleaseNotes.rst
    A clang/docs/ReleaseNotesTemplate.txt
    M flang/docs/ReleaseNotes.md
    A flang/docs/ReleaseNotesTemplate.txt
    M lld/docs/ReleaseNotes.rst
    A lld/docs/ReleaseNotesTemplate.txt
    M llvm/docs/HowToReleaseLLVM.rst
    M llvm/docs/ReleaseNotes.md
    A llvm/docs/ReleaseNotesTemplate.txt
    A llvm/utils/release/clear-release-notes.py

  Log Message:
  -----------
  [LLVM][utils] Add script which clears release notes (#153593)

The script copies `ReleaseNotesTemplate.txt` to corresponding
`ReleaseNotes.rst`/`.md` to clear release notes.

The suffix of `ReleaseNotesTemplate.txt` must be `.txt`. If it is
`.rst`/`.md`, it will be treated as a documentation source file when
building documentation.
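
As a rough illustration of the copy step described above, the following Python sketch overwrites the release notes in a directory with the template next to them. It is not the contents of `clear-release-notes.py`; the function name `clear_release_notes` and the directory layout are assumptions made for the example.

```python
import shutil
from pathlib import Path
import tempfile


def clear_release_notes(docs_dir: Path) -> Path:
    """Overwrite ReleaseNotes.rst or ReleaseNotes.md with the template.

    The template keeps a .txt suffix so that a documentation build which
    collects .rst/.md files does not pick it up as a source file.
    """
    template = docs_dir / "ReleaseNotesTemplate.txt"
    for suffix in (".rst", ".md"):
        target = docs_dir / f"ReleaseNotes{suffix}"
        if target.exists():
            shutil.copyfile(template, target)
            return target
    raise FileNotFoundError(f"no ReleaseNotes.rst or ReleaseNotes.md in {docs_dir}")


# Small demonstration in a throwaway directory.
with tempfile.TemporaryDirectory() as tmp:
    docs = Path(tmp)
    (docs / "ReleaseNotesTemplate.txt").write_text("Release notes template\n")
    (docs / "ReleaseNotes.md").write_text("Notes from the previous release\n")
    cleared = clear_release_notes(docs)
    cleared_name = cleared.name
    cleared_text = cleared.read_text()
```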

  Commit: d803a93f5555310c3c9413844586bc5331686a78
      https://github.com/llvm/llvm-project/commit/d803a93f5555310c3c9413844586bc5331686a78
  Author: Tobias Stadler <mail at stadler-tobias.de>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/lib/Transforms/IPO/Inliner.cpp
    M llvm/lib/Transforms/IPO/ModuleInliner.cpp
    A llvm/test/Transforms/Inline/optimization-remarks-passed-deleted-callee-yaml.ll

  Log Message:
  -----------
  [Inliner] Report inlining decision before deleting Callee contents (#153616)

Call `recordInliningWithCalleeDeleted` before dropping the contents of
the Callee. Otherwise the handlers don't have access to e.g. the
DebugLoc, so the Callee DebugLoc was missing in inlining remarks for
functions with internal linkage.

The test is the same as `optimization-remarks-passed-yaml.ll` except
that the function `foo` has internal linkage instead of external linkage.

  Commit: a9ff15d89344c7fccc7972f97be0cade93767013
      https://github.com/llvm/llvm-project/commit/a9ff15d89344c7fccc7972f97be0cade93767013
  Author: Simon Pilgrim <llvm-dev at redking.me.uk>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/test/CodeGen/X86/select-smin-smax.ll

  Log Message:
  -----------
  [X86] select-smin-smax.ll - add 32-bit test coverage (#153780)

Inspired by #151893

  Commit: 6ad39bc8669845ddee758d2a43744fe17e7194f6
      https://github.com/llvm/llvm-project/commit/6ad39bc8669845ddee758d2a43744fe17e7194f6
  Author: Simon Pilgrim <llvm-dev at redking.me.uk>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M clang/test/CodeGen/X86/avx512ifma-builtins.c
    M clang/test/CodeGen/X86/avx512ifmavl-builtins.c
    M clang/test/CodeGen/X86/avxifma-builtins.c

  Log Message:
  -----------
  [X86] avxifma-builtins.c / avx512ifma-builtins.c / avx512ifmavl-builtins.c - add C/C++ and 32/64-bit test coverage

  Commit: ce0bc3aa70bb3fa58473058985f4e5cd1ac45b5a
      https://github.com/llvm/llvm-project/commit/ce0bc3aa70bb3fa58473058985f4e5cd1ac45b5a
  Author: ⭐️NINIKA⭐️ <moslike6 at gmail.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M lldb/docs/use/map.rst

  Log Message:
  -----------
  [lldb][docs] document an analogue for `info proc mappings` (#153559)

  Commit: 69e105beec651a791426f4c79affc23f403d2855
      https://github.com/llvm/llvm-project/commit/69e105beec651a791426f4c79affc23f403d2855
  Author: Gaëtan Bossu <gaetan.bossu at arm.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/test/CodeGen/AArch64/named-vector-shuffles-sve.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-extract-subvector.ll

  Log Message:
  -----------
  [AArch64][ISel] Add unary vector_splice tests (NFC) (#152553)

They use extract shuffles for fixed vectors, and
llvm.vector.splice intrinsics for scalable vectors.

In the previous tests using ld+extract+st, the extract was optimized
away and replaced by a smaller load at the right offset. This meant
we didn't really test the vector_splice ISD node.

  Commit: 4f65345ab5f2787a4704efb5828657c50be6d65a
      https://github.com/llvm/llvm-project/commit/4f65345ab5f2787a4704efb5828657c50be6d65a
  Author: David Spickett <david.spickett at linaro.org>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M lldb/test/API/tools/lldb-dap/launch/TestDAP_launch.py

  Log Message:
  -----------
  [lldb][lldb-dap][test] Disable part of TestDAP_launch on Arm 32-bit

This test has been flakey on our bot:
https://lab.llvm.org/buildbot/#/builders/18/builds/20410

```
======================================================================
FAIL: test_extra_launch_commands (TestDAP_launch.TestDAP_launch)
    Tests the "launchCommands" with extra launching settings
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/home/tcwg-buildbot/worker/lldb-arm-ubuntu/llvm-project/lldb/test/API/tools/lldb-dap/launch/TestDAP_launch.py", line 482, in test_extra_launch_commands
    self.verify_commands("stopCommands", output, stopCommands)
  File "/home/tcwg-buildbot/worker/lldb-arm-ubuntu/llvm-project/lldb/packages/Python/lldbsuite/test/tools/lldb-dap/lldbdap_testcase.py", line 228, in verify_commands
    self.assertTrue(
AssertionError: False is not true : verify 'frame variable' found in console output for 'stopCommands'
Config=arm-/home/tcwg-buildbot/worker/lldb-arm-ubuntu/build/bin/clang
----------------------------------------------------------------------
```

Likely a timing issue waiting for the command output on a slower
machine.

General tracking issue - https://github.com/llvm/llvm-project/issues/137660

  Commit: 7640645f797bbc9f727d0c6fa73a31425b3eb219
      https://github.com/llvm/llvm-project/commit/7640645f797bbc9f727d0c6fa73a31425b3eb219
  Author: Mehdi Amini <joker.eph at gmail.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M mlir/lib/Target/Wasm/TranslateFromWasm.cpp
    R mlir/test/Target/Wasm/stats.mlir

  Log Message:
  -----------
  [MLIR][Wasm] Remove statistics as they depend on global ctors (#153795)

Use a debug log instead for now.

  Commit: 0fca1e4e06445c98352fb3a034bf197bc7990f36
      https://github.com/llvm/llvm-project/commit/0fca1e4e06445c98352fb3a034bf197bc7990f36
  Author: David Spickett <david.spickett at linaro.org>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M lldb/test/API/tools/lldb-dap/launch/TestDAP_launch.py

  Log Message:
  -----------
  [lldb][lldb-dap][test] Correct skip in TestDAP_launch

Fixes 4f65345ab5f2787a4704efb5828657c50be6d65a

Yet again I forgot it's skip[I]f.

  Commit: 69453d702108a6570a664869c903fb6dda44c856
      https://github.com/llvm/llvm-project/commit/69453d702108a6570a664869c903fb6dda44c856
  Author: Mehdi Amini <joker.eph at gmail.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M mlir/lib/Target/Wasm/TranslateFromWasm.cpp

  Log Message:
  -----------
  [MLIR] Fix memory leak in importWebAssemblyToModule when it fails to import (#153794)

  Commit: 61caab7789eab30df73afa1bec411d6b36ebb74e
      https://github.com/llvm/llvm-project/commit/61caab7789eab30df73afa1bec411d6b36ebb74e
  Author: Erick Ochoa Lopez <erick.ochoalopez at amd.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M mlir/include/mlir/Dialect/LLVMIR/LLVMIntrinsicOps.td
    M mlir/include/mlir/Dialect/LLVMIR/LLVMOpBase.td
    M mlir/lib/Dialect/LLVMIR/IR/LLVMDialect.cpp
    M mlir/test/Target/LLVMIR/Import/intrinsic.ll
    M mlir/test/Target/LLVMIR/llvmir-intrinsics.mlir

  Log Message:
  -----------
  [mlir][llvm] Add `align` attribute to `llvm.intr.masked.{expandload,compressstore}` (#153063)

* Add `requiresArgsAndResultsAttr` to `LLVM_OneResultIntrOp`
* Add `args_attrs` to `llvm.intr.masked.{expandload,compressstore}`

The LLVM intrinsics
[`llvm.intr.masked.expandload`](https://llvm.org/docs/LangRef.html#llvm-masked-expandload-intrinsics)
and
[`llvm.intr.masked.compressstore`](https://llvm.org/docs/LangRef.html#llvm-masked-compressstore-intrinsics)
both allow an optional align parameter attribute to be set which
defaults to one.

Inlining the documentation below for
[`llvm.intr.masked.expandload`'s](https://llvm.org/docs/LangRef.html#id1522) and
[`llvm.intr.masked.compressstore`'s](https://llvm.org/docs/LangRef.html#id1522)
arguments respectively:

> The `align` parameter attribute can be provided for the first
argument. The pointer alignment defaults to 1.

> The `align` parameter attribute can be provided for the second
argument. The pointer alignment defaults to 1.

  Commit: 263e45827357f416435e42e99c7323ca723549be
      https://github.com/llvm/llvm-project/commit/263e45827357f416435e42e99c7323ca723549be
  Author: Simon Pilgrim <llvm-dev at redking.me.uk>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/test/CodeGen/X86/select-smin-smax.ll

  Log Message:
  -----------
  [X86] select-smin-smax.ll - add i8/i16 test coverage (#153788)

Pulled out of #151893 to show 32/64-bit target coverage

  Commit: bcb4984a0b75862c43d7603124efcf76def1a5f3
      https://github.com/llvm/llvm-project/commit/bcb4984a0b75862c43d7603124efcf76def1a5f3
  Author: Simon Pilgrim <llvm-dev at redking.me.uk>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
    M llvm/test/CodeGen/X86/select-smin-smax.ll

  Log Message:
  -----------
  [X86] select-smin-smax.ll - add i128 tests

Helps check quality of legality codegen (all we had was x86 i64 handling)

  Commit: bc216b057d8bba98c99f82ddeb27da63759d20ad
      https://github.com/llvm/llvm-project/commit/bc216b057d8bba98c99f82ddeb27da63759d20ad
  Author: Stephen Tozer <stephen.tozer at sony.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/lib/Transforms/Utils/Debugify.cpp
    A llvm/test/tools/llvm-original-di-preservation/Inputs/expected-origin-reduced.html
    M llvm/test/tools/llvm-original-di-preservation/Inputs/expected-origin.html
    M llvm/test/tools/llvm-original-di-preservation/Inputs/origin.json
    M llvm/test/tools/llvm-original-di-preservation/basic.test
    M llvm/utils/llvm-original-di-preservation.py

  Log Message:
  -----------
  [Debugify] Improve reduction of debugify coverage build output (#150212)

In current DebugLoc coverage builds, the output for any reasonably large
build can become very large if any missing DebugLocs are present; this
happens because single errors in LLVM may result in many errors being
reported in the output report. The main cause of this is that the empty
locations attached to instructions may be propagated to other
instructions in later passes, which will each be reported as new errors.
This patch prevents this by adding an "unknown" annotation to
instructions after reporting them once, ensuring that any other
DebugLocs copied or derived from the original empty location will not be
marked as new errors.

As a separate but related change, this patch updates the report
generation script to deduplicate results using the recorded stacktraces
if they are available, instead of the pass+instruction combination. This
reduces the size of the reduction, but makes the reduction highly
reliable, as the stacktrace allows us to very precisely identify when
two bugs have originated from the same place.
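
The deduplication rule can be sketched as follows. This is an illustrative Python model rather than the code in `llvm-original-di-preservation.py`; the record fields `stacktrace`, `pass` and `instr` are hypothetical names chosen for the example.

```python
from typing import Dict, Hashable, List, Optional

Bug = Dict[str, Optional[str]]


def dedup_key(bug: Bug) -> Hashable:
    """Prefer the recorded stacktrace as identity; otherwise fall back
    to the pass+instruction combination."""
    trace = bug.get("stacktrace")
    if trace:
        return ("stacktrace", trace)
    return ("pass+instr", bug["pass"], bug["instr"])


def deduplicate(bugs: List[Bug]) -> List[Bug]:
    """Keep the first report for every distinct key, preserving order."""
    seen = set()
    unique = []
    for bug in bugs:
        key = dedup_key(bug)
        if key not in seen:
            seen.add(key)
            unique.append(bug)
    return unique


reports = [
    # Same origin reached through two different instructions: one bug.
    {"pass": "PassA", "instr": "add", "stacktrace": "frameX;frameY"},
    {"pass": "PassA", "instr": "mul", "stacktrace": "frameX;frameY"},
    # No stacktrace recorded: fall back to pass+instruction.
    {"pass": "PassB", "instr": "load", "stacktrace": None},
    {"pass": "PassB", "instr": "load", "stacktrace": None},
]
unique_reports = deduplicate(reports)
```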

  Commit: 649762cb04c35e14f61c1226d05a9aee73d3501d
      https://github.com/llvm/llvm-project/commit/649762cb04c35e14f61c1226d05a9aee73d3501d
  Author: David Green <david.green at arm.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/test/CodeGen/AArch64/fadd-combines.ll
    M llvm/test/CodeGen/AArch64/vecreduce-fadd-legalization-strict.ll
    M llvm/test/CodeGen/AArch64/vecreduce-fadd-legalization.ll
    M llvm/test/CodeGen/AArch64/vecreduce-fadd.ll

  Log Message:
  -----------
  Revert "[AArch64][GlobalISel] Add additional vecreduce.fadd and fadd 0.0 tests. NFC"

This reverts commit 16314eb7312dab38d721c70f247f2117e9800704 as the test cases
are failing under EXPENSIVE_CHECKS. Scalar vecreduce.fadd are not valid in
GISel.

  Commit: 98287456610c0fd17742fc6c8d23ed8a44b79b0c
      https://github.com/llvm/llvm-project/commit/98287456610c0fd17742fc6c8d23ed8a44b79b0c
  Author: Gaëtan Bossu <gaetan.bossu at arm.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td
    M llvm/test/CodeGen/AArch64/named-vector-shuffles-sve.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-extract-subvector.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-fp-to-int.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-int-extends.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-int-rem.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-int-to-fp.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-limit-duplane.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-masked-loads.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-partial-reduce.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-shuffles.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-vector-llrint.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-vector-lrint.ll
    M llvm/test/CodeGen/AArch64/sve-pr92779.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-fp-extend-trunc.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-fp-to-int.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-int-extends.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-int-rem.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-int-to-fp.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-limit-duplane.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-masked-load.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-masked-store.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-reductions.ll

  Log Message:
  -----------
  [AArch64][ISel] Select constructive EXT_ZZI pseudo instruction (#152554)

The patch adds patterns to select the EXT_ZZI_CONSTRUCTIVE pseudo
instead of the EXT_ZZI destructive instruction for vector_splice. This
only works when the two inputs to vector_splice are identical.

Given that registers aren't tied anymore, this gives the register
allocator more freedom and a lot of MOVs get replaced with MOVPRFX.

In some cases however, we could have just chosen the same input and
output register, but regalloc preferred not to. This means we end up
with some test cases now having more instructions: there is now a
MOVPRFX while no MOV was previously needed.

  Commit: 15d7a95ea990214f77d9d99df8c2fc53799e5161
      https://github.com/llvm/llvm-project/commit/15d7a95ea990214f77d9d99df8c2fc53799e5161
  Author: Erich Keane <ekeane at nvidia.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M clang/lib/CIR/CodeGen/CIRGenFunction.h
    M clang/lib/CIR/CodeGen/CIRGenOpenACCClause.cpp

  Log Message:
  -----------
  [CIR] Refactor recipe init generation, cleanup after init (#153610)

In preparation for the firstprivate implementation, this separates out
some functions to make it easier to read.

Additionally, it cleans up the VarDecl->alloca relationship, which will
prevent issues if we have to re-use the same vardecl for a future
generated recipe (and causes concerns in firstprivate later).

  Commit: af92cabdef72164341a7108a8c099d1c12cf283c
      https://github.com/llvm/llvm-project/commit/af92cabdef72164341a7108a8c099d1c12cf283c
  Author: Guray Ozen <guray.ozen at gmail.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
    M mlir/test/Dialect/LLVMIR/nvvm.mlir
    M mlir/test/Target/LLVMIR/nvvmir.mlir

  Log Message:
  -----------
  [MLIR][NVVM] Combine griddepcontrol Ops  (#152525)

We have 2 ops:
1. nvvm.griddepcontrol.wait
2. nvvm.griddepcontrol.launch_dependents

They are both related to Grid Dependent Launch (or programmatic dependent
launch in CUDA) and express the same concept. This PR unifies both ops
into a single one.

  Commit: 4c389178eeb73618ec80f88d306300949480e40f
      https://github.com/llvm/llvm-project/commit/4c389178eeb73618ec80f88d306300949480e40f
  Author: Guray Ozen <guray.ozen at gmail.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M mlir/include/mlir/Dialect/LLVMIR/BasicPtxBuilderInterface.h
    M mlir/lib/Conversion/NVVMToLLVM/NVVMToLLVM.cpp

  Log Message:
  -----------
  [MLIR][NVVM] Print readable modifier (NFC) (#153779)

Currently, the modifier is printed as an address, so it is not readable
and not useful. This PR adds readable printing for it.

---------

Co-authored-by: Copilot <175728472+Copilot at users.noreply.github.com>

  Commit: 17dd57b00e9c97876900fc56514c71cb78364459
      https://github.com/llvm/llvm-project/commit/17dd57b00e9c97876900fc56514c71cb78364459
  Author: Simon Pilgrim <llvm-dev at redking.me.uk>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M clang/test/CodeGen/X86/avxvnni-builtins.c
    M clang/test/CodeGen/X86/avxvnniint16-builtins.c
    M clang/test/CodeGen/X86/avxvnniint8-builtins.c

  Log Message:
  -----------
  [X86] avxvnni-builtins.c / avxvnniint8-builtins.c / avxvnniint16-builtins.c - add C/C++ and 32/64-bit test coverage

  Commit: f4bc3151bb716c434a6c47c80b4e53e8f773bc85
      https://github.com/llvm/llvm-project/commit/f4bc3151bb716c434a6c47c80b4e53e8f773bc85
  Author: Kazu Hirata <kazu at google.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M mlir/lib/Target/Wasm/TranslateFromWasm.cpp

  Log Message:
  -----------
  [mlir] Fix warnings

This patch fixes:

  mlir/lib/Target/Wasm/TranslateFromWasm.cpp:82:1: error: unused
  variable 'wasmSectionName<(anonymous
  namespace)::WasmSectionType::DATACOUNT>'
  [-Werror,-Wunused-const-variable]

  mlir/lib/Target/Wasm/TranslateFromWasm.cpp:100:5: error: unused
  variable 'valueTypesEncodings' [-Werror,-Wunused-const-variable]

  mlir/lib/Target/Wasm/TranslateFromWasm.cpp:735:13: error: unused
  function 'buildLiteralType<unsigned int>'
  [-Werror,-Wunused-function]

  mlir/lib/Target/Wasm/TranslateFromWasm.cpp:740:13: error: unused
  function 'buildLiteralType<unsigned long>'
  [-Werror,-Wunused-function]

  mlir/lib/Target/Wasm/TranslateFromWasm.cpp:292:33: error: private
  field 'symbols' is not used [-Werror,-Wunused-private-field]

  Commit: 8c0914d826a633191a042cdd8a82032bcc30de30
      https://github.com/llvm/llvm-project/commit/8c0914d826a633191a042cdd8a82032bcc30de30
  Author: Benjamin Chetioui <3920784+bchetioui at users.noreply.github.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M utils/bazel/llvm-project-overlay/mlir/BUILD.bazel

  Log Message:
  -----------
  [mlir][bazel] Fix Bazel build after 6bb8f6f2d0ed672217e0a0521afc5b86913b717e (#153811)

  Commit: 9a14b1d254a43dc0d4445c3ffa3d393bca007ba3
      https://github.com/llvm/llvm-project/commit/9a14b1d254a43dc0d4445c3ffa3d393bca007ba3
  Author: Matt Arsenault <Matthew.Arsenault at amd.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/include/llvm/IR/RuntimeLibcalls.h
    M llvm/test/TableGen/RuntimeLibcallEmitter.td
    M llvm/utils/TableGen/Basic/RuntimeLibcallsEmitter.cpp

  Log Message:
  -----------
  RuntimeLibcalls: Generate table of libcall name lengths (#153210)

Avoids strlen when constructing the returned StringRef. We were emitting
these in the libcall name lookup anyway, so split out the offsets for
general use.

Currently emitted as a separate table, not sure if it would be better
to change the string offset table to store pairs of offset and width
instead.
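
The trade-off between the two lookups can be sketched in Python. This models a NUL-separated string table with parallel offset and length tables; it is an assumption-laden illustration, not the code generated by `RuntimeLibcallsEmitter.cpp`, and all function names are hypothetical.

```python
from typing import List, Tuple


def build_tables(names: List[str]) -> Tuple[bytes, List[int], List[int]]:
    """Pack names into one NUL-separated blob plus offset and length tables."""
    blob = bytearray()
    offsets: List[int] = []
    lengths: List[int] = []
    for name in names:
        offsets.append(len(blob))
        lengths.append(len(name))
        blob += name.encode("ascii") + b"\0"
    return bytes(blob), offsets, lengths


def name_via_scan(blob: bytes, offset: int) -> str:
    """strlen-style lookup: search for the terminating NUL on every call."""
    end = blob.index(b"\0", offset)
    return blob[offset:end].decode("ascii")


def name_via_length(blob: bytes, offset: int, length: int) -> str:
    """Length-table lookup: slice directly, with no scan for the terminator."""
    return blob[offset:offset + length].decode("ascii")


names = ["memcpy", "sinf", "__addtf3"]
blob, offsets, lengths = build_tables(names)
scanned = [name_via_scan(blob, off) for off in offsets]
sliced = [name_via_length(blob, off, n) for off, n in zip(offsets, lengths)]
```

Storing `(offset, length)` pairs in a single table, the alternative mentioned above, would carry the same information in one array instead of two.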

  Commit: 6602d6c7a7e2fb893433496efa149fe6311f4eac
      https://github.com/llvm/llvm-project/commit/6602d6c7a7e2fb893433496efa149fe6311f4eac
  Author: Krishna Pandey <kpandey81930 at gmail.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M libc/docs/headers/math/index.rst

  Log Message:
  -----------
  [libc][math][docs] Add documentation for BFloat16 type (#153475)

Signed-off-by: Krishna Pandey <kpandey81930 at gmail.com>

  Commit: 09267f6720d9bfd7f01bdf9ddd9429dd6e5a9157
      https://github.com/llvm/llvm-project/commit/09267f6720d9bfd7f01bdf9ddd9429dd6e5a9157
  Author: Simon Pilgrim <llvm-dev at redking.me.uk>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M clang/test/CodeGen/X86/avx512vlvp2intersect-builtins.c
    M clang/test/CodeGen/X86/avx512vp2intersect-builtins.c

  Log Message:
  -----------
  [X86] avx512vp2intersect-builtins.c / avx512vlvp2intersect-builtins.c - add C/C++ and 32/64-bit test coverage

  Commit: 1fd1d634630754cc9b9c4b5526961d5856f64ff9
      https://github.com/llvm/llvm-project/commit/1fd1d634630754cc9b9c4b5526961d5856f64ff9
  Author: Akash Banerjee <akash.banerjee at amd.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M flang/include/flang/Optimizer/OpenMP/Passes.td
    A flang/lib/Optimizer/OpenMP/AutomapToTargetData.cpp
    M flang/lib/Optimizer/OpenMP/CMakeLists.txt
    M flang/lib/Optimizer/Passes/Pipelines.cpp
    A flang/test/Transforms/omp-automap-to-target-data.fir
    A offload/test/offloading/fortran/declare-target-automap.f90

  Log Message:
  -----------
  [MLIR][OpenMP] Add a new AutomapToTargetData conversion pass in FIR (#153048)

Add a new AutomapToTargetData pass. This gathers the declare target
enter variables which have the AUTOMAP modifier and adds
omp.declare_target_enter/exit mapping directives for fir.alloca and
fir.free operations on the AUTOMAP-enabled variables.

Automap Ref: OpenMP 6.0 section 7.9.7.

  Commit: d7199544afd0ddff8ed5fa9d79643e91c35bd010
      https://github.com/llvm/llvm-project/commit/d7199544afd0ddff8ed5fa9d79643e91c35bd010
  Author: Mikhail R. Gadelha <mikhail at igalia.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M libc/test/src/wchar/mbrtowc_test.cpp

  Log Message:
  -----------
  [libc] Fix mbrtowc test (#153721)

Previously, we were trying to memset a pointer that wasn't being
initialized, and the test would randomly fail.

This PR replaces the pointers with actual objects.

  Commit: 144f3c4cbf7164938eb1b6fea1688ac428ead9c6
      https://github.com/llvm/llvm-project/commit/144f3c4cbf7164938eb1b6fea1688ac428ead9c6
  Author: David Green <david.green at arm.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/lib/Target/AArch64/AArch64SchedA510.td
    M llvm/test/CodeGen/AArch64/sve-bf16-converts.ll
    M llvm/test/CodeGen/AArch64/sve-fptosi-sat.ll
    M llvm/test/CodeGen/AArch64/sve-fptoui-sat.ll
    M llvm/test/CodeGen/AArch64/sve-llrint.ll
    M llvm/test/CodeGen/AArch64/sve-lrint.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-fp-compares.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-ptest.ll
    M llvm/test/CodeGen/AArch64/sve2-bf16-converts.ll
    M llvm/test/tools/llvm-mca/AArch64/Cortex/A510-sve-instructions.s

  Log Message:
  -----------
  [AArch64] Adjust the scheduling info of SVE FCMP on Cortex-A510. (#153810)

According to the SWOG, these have a lower throughput than other
instructions. Mark them as taking multiple cycles to model that.

  Commit: f24d91eb2c97b8b2808404caf1ddd94e5f7a14c0
      https://github.com/llvm/llvm-project/commit/f24d91eb2c97b8b2808404caf1ddd94e5f7a14c0
  Author: Phoebe Wang <phoebe.wang at intel.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M clang/lib/Headers/emmintrin.h

  Log Message:
  -----------
  [Headers][X86] Remove duplicate __v8hu, NFCI (#153734)

Newly added in xmmintrin.h by c8312bdd1665225c585dd2b0bff5e46d569edd45

  Commit: 13b54f7dc14f2c40a06cad34fba4df7ad7b5b1fd
      https://github.com/llvm/llvm-project/commit/13b54f7dc14f2c40a06cad34fba4df7ad7b5b1fd
  Author: Alexey Bataev <a.bataev at outlook.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
    A llvm/test/Transforms/SLPVectorizer/X86/schedule-control-deps-after-copyable.ll

  Log Message:
  -----------
  [SLP] Recalculate dependencies for potential control dependencies if cleared

If the control dependencies are cleared after calcellation of the
copyables, we need to recalculate them unconditionally.

Fixes #153754 #153676

  Commit: 2b1e06598fc058ed6a9c73c6c615559919cc6d56
      https://github.com/llvm/llvm-project/commit/2b1e06598fc058ed6a9c73c6c615559919cc6d56
  Author: Florian Hahn <flo at fhahn.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M llvm/test/Transforms/LoopVectorize/X86/pr51366-sunk-instruction-used-outside-of-loop.ll
    M llvm/test/Transforms/LoopVectorize/noalias-scope-decl.ll
    M llvm/test/Transforms/LoopVectorize/preserve-dbg-loc-and-loop-metadata.ll
    M llvm/test/Transforms/LoopVectorize/vplan-widen-select-instruction.ll

  Log Message:
  -----------
  [LV] Regenerate some more check lines. (NFC)

  Commit: efe2e4d48475188ac63d1af5f2893de5ab98fb1b
      https://github.com/llvm/llvm-project/commit/efe2e4d48475188ac63d1af5f2893de5ab98fb1b
  Author: Alexey Bataev <a.bataev at outlook.com>
  Date:   2025-08-15 (Fri, 15 Aug 2025)

  Changed paths:
    M clang-tools-extra/clang-doc/HTMLMustacheGenerator.cpp
    M clang-tools-extra/clang-doc/JSONGenerator.cpp
    M clang-tools-extra/docs/ReleaseNotes.rst
    A clang-tools-extra/docs/ReleaseNotesTemplate.txt
    M clang-tools-extra/test/clang-doc/basic-project.mustache.test
    M clang-tools-extra/test/clang-doc/json/class-requires.cpp
    M clang-tools-extra/test/clang-doc/json/class-specialization.cpp
    M clang-tools-extra/test/clang-doc/json/class-template.cpp
    M clang-tools-extra/test/clang-doc/json/class.cpp
    M clang-tools-extra/test/clang-doc/json/compound-constraints.cpp
    M clang-tools-extra/test/clang-doc/json/concept.cpp
    M clang-tools-extra/test/clang-doc/json/function-requires.cpp
    M clang-tools-extra/test/clang-doc/json/function-specifiers.cpp
    M clang-tools-extra/test/clang-doc/json/method-template.cpp
    M clang-tools-extra/test/clang-doc/json/namespace.cpp
    M clang-tools-extra/test/clang-doc/json/nested-namespace.cpp
    M clang-tools-extra/test/clang-doc/mustache-index.cpp
    M clang-tools-extra/test/clang-doc/mustache-separate-namespace.cpp
    M clang/docs/APINotes.rst
    M clang/docs/ReleaseNotes.rst
    A clang/docs/ReleaseNotesTemplate.txt
    M clang/include/clang/APINotes/Types.h
    M clang/include/clang/AST/Type.h
    M clang/include/clang/AST/TypeLoc.h
    M clang/include/clang/Basic/Attr.td
    M clang/include/clang/Basic/AttrDocs.td
    M clang/include/clang/Basic/Features.def
    M clang/include/clang/Basic/LangOptions.def
    M clang/include/clang/CIR/Dialect/Builder/CIRBaseBuilder.h
    M clang/include/clang/CIR/Dialect/IR/CIRAttrs.td
    M clang/include/clang/CIR/Dialect/IR/CIRDataLayout.h
    M clang/include/clang/CIR/MissingFeatures.h
    M clang/include/clang/Driver/Options.td
    M clang/lib/APINotes/APINotesFormat.h
    M clang/lib/APINotes/APINotesReader.cpp
    M clang/lib/APINotes/APINotesWriter.cpp
    M clang/lib/APINotes/APINotesYAMLCompiler.cpp
    M clang/lib/AST/ASTContext.cpp
    M clang/lib/AST/Type.cpp
    M clang/lib/AST/TypePrinter.cpp
    M clang/lib/Basic/Targets/LoongArch.cpp
    M clang/lib/CIR/CodeGen/CIRGenBuilder.cpp
    M clang/lib/CIR/CodeGen/CIRGenBuilder.h
    M clang/lib/CIR/CodeGen/CIRGenExpr.cpp
    M clang/lib/CIR/CodeGen/CIRGenExprConstant.cpp
    M clang/lib/CIR/CodeGen/CIRGenFunction.cpp
    M clang/lib/CIR/CodeGen/CIRGenFunction.h
    M clang/lib/CIR/CodeGen/CIRGenOpenACCClause.cpp
    M clang/lib/CIR/CodeGen/CIRGenStmt.cpp
    M clang/lib/CIR/Dialect/IR/CIRDataLayout.cpp
    M clang/lib/CIR/Lowering/DirectToLLVM/LowerToLLVM.cpp
    M clang/lib/CodeGen/CodeGenFunction.cpp
    M clang/lib/CodeGen/CodeGenModule.cpp
    M clang/lib/CodeGen/CodeGenModule.h
    M clang/lib/Frontend/CompilerInvocation.cpp
    M clang/lib/Headers/emmintrin.h
    M clang/lib/Headers/hlsl/hlsl_alias_intrinsics.h
    M clang/lib/Sema/SemaAPINotes.cpp
    M clang/lib/Sema/SemaOverload.cpp
    M clang/lib/Sema/SemaType.cpp
    M clang/test/APINotes/Inputs/Headers/SwiftImportAs.apinotes
    M clang/test/APINotes/Inputs/Headers/SwiftImportAs.h
    M clang/test/APINotes/swift-import-as.cpp
    M clang/test/AST/ast-dump-templates.cpp
    M clang/test/CIR/CodeGen/globals.cpp
    A clang/test/CodeGen/LoongArch/targetattr-lasx.c
    M clang/test/CodeGen/X86/avx512cd-builtins.c
    M clang/test/CodeGen/X86/avx512ifma-builtins.c
    M clang/test/CodeGen/X86/avx512ifmavl-builtins.c
    M clang/test/CodeGen/X86/avx512vlcd-builtins.c
    M clang/test/CodeGen/X86/avx512vlvp2intersect-builtins.c
    M clang/test/CodeGen/X86/avx512vp2intersect-builtins.c
    M clang/test/CodeGen/X86/avxifma-builtins.c
    M clang/test/CodeGen/X86/avxvnni-builtins.c
    M clang/test/CodeGen/X86/avxvnniint16-builtins.c
    M clang/test/CodeGen/X86/avxvnniint8-builtins.c
    A clang/test/CodeGen/cfi-salt.c
    M clang/test/CodeGenHIP/hip-cumode.hip
    M clang/test/CodeGenObjC/ptrauth-block-descriptor-pointer.m
    M clang/test/CodeGenObjC/ptrauth-block-isa.m
    M clang/test/Driver/hip-macros.hip
    M clang/test/Headers/mm3dnow.c
    M clang/test/Headers/pmmintrin.c
    M clang/test/Headers/x86-intrinsics-headers.c
    M clang/test/Headers/x86intrin.c
    M clang/test/Headers/x86intrin.cpp
    M clang/test/Misc/pragma-attribute-supported-attributes-list.test
    A clang/test/Sema/attr-cfi-salt.c
    M compiler-rt/lib/sanitizer_common/sanitizer_redefine_builtins.h
    M compiler-rt/test/asan/TestCases/Posix/fakestack_alignment.cpp
    M flang/docs/ReleaseNotes.md
    A flang/docs/ReleaseNotesTemplate.txt
    M flang/include/flang/Optimizer/CodeGen/FIROpPatterns.h
    M flang/include/flang/Optimizer/OpenMP/Passes.td
    M flang/lib/Lower/Bridge.cpp
    A flang/lib/Optimizer/OpenMP/AutomapToTargetData.cpp
    M flang/lib/Optimizer/OpenMP/CMakeLists.txt
    M flang/lib/Optimizer/Passes/Pipelines.cpp
    M flang/module/cudadevice.f90
    M flang/test/Lower/CUDA/cuda-device-proc.cuf
    M flang/test/Lower/CUDA/cuda-libdevice.cuf
    A flang/test/Lower/do_concurrent_loop_in_nested_block.f90
    A flang/test/Transforms/omp-automap-to-target-data.fir
    M libc/docs/headers/math/index.rst
    M libc/startup/baremetal/arm/start.cpp
    M libc/test/src/wchar/mbrtowc_test.cpp
    M libcxx/docs/ReleaseNotes/22.rst
    M libcxx/include/__hash_table
    M libcxx/include/__tree
    M libcxx/include/unordered_map
    M libcxx/include/unordered_set
    M libcxx/test/std/containers/associative/multimap/multimap.ops/find.pass.cpp
    M libcxx/test/std/containers/unord/unord.map/unord.map.cnstr/assign_copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.map/unord.map.cnstr/copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.map/unord.map.cnstr/copy_alloc.pass.cpp
    M libcxx/test/std/containers/unord/unord.multimap/unord.multimap.cnstr/assign_copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.multimap/unord.multimap.cnstr/copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.multimap/unord.multimap.cnstr/copy_alloc.pass.cpp
    M libcxx/test/std/containers/unord/unord.multiset/unord.multiset.cnstr/assign_copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.multiset/unord.multiset.cnstr/copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.multiset/unord.multiset.cnstr/copy_alloc.pass.cpp
    M libcxx/test/std/containers/unord/unord.set/unord.set.cnstr/assign_copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.set/unord.set.cnstr/copy.pass.cpp
    M libcxx/test/std/containers/unord/unord.set/unord.set.cnstr/copy_alloc.pass.cpp
    M lld/docs/ReleaseNotes.rst
    A lld/docs/ReleaseNotesTemplate.txt
    M lldb/cmake/modules/LLDBConfig.cmake
    M lldb/docs/use/map.rst
    M lldb/source/Plugins/ScriptInterpreter/Python/ScriptInterpreterPython.cpp
    M lldb/test/API/tools/lldb-dap/launch/TestDAP_launch.py
    M llvm/benchmarks/CMakeLists.txt
    A llvm/benchmarks/RuntimeLibcalls.cpp
    M llvm/docs/AMDGPUUsage.rst
    M llvm/docs/HowToReleaseLLVM.rst
    M llvm/docs/ReleaseNotes.md
    A llvm/docs/ReleaseNotesTemplate.txt
    M llvm/include/llvm/BinaryFormat/DXContainer.h
    M llvm/include/llvm/CodeGen/TargetLowering.h
    M llvm/include/llvm/IR/IntrinsicsAMDGPU.td
    M llvm/include/llvm/IR/ProfDataUtils.h
    M llvm/include/llvm/IR/RuntimeLibcalls.h
    M llvm/include/llvm/Object/SFrameParser.h
    M llvm/include/llvm/ProfileData/InstrProf.h
    M llvm/include/llvm/Support/DXILABI.h
    M llvm/include/llvm/Support/FileSystem.h
    M llvm/include/llvm/Transforms/Instrumentation/PGOInstrumentation.h
    M llvm/include/llvm/Transforms/Utils/Instrumentation.h
    M llvm/lib/Analysis/ConstantFolding.cpp
    M llvm/lib/Analysis/DXILResource.cpp
    M llvm/lib/Analysis/LoopAccessAnalysis.cpp
    M llvm/lib/Analysis/ScalarEvolution.cpp
    M llvm/lib/BinaryFormat/DXContainer.cpp
    M llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp
    M llvm/lib/CodeGen/InterleavedAccessPass.cpp
    M llvm/lib/CodeGen/PreISelIntrinsicLowering.cpp
    M llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
    M llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
    M llvm/lib/CodeGen/SelectionDAG/StatepointLowering.cpp
    M llvm/lib/Frontend/HLSL/HLSLRootSignature.cpp
    M llvm/lib/Frontend/HLSL/RootSignatureMetadata.cpp
    M llvm/lib/IR/ProfDataUtils.cpp
    M llvm/lib/IR/RuntimeLibcalls.cpp
    M llvm/lib/IR/Verifier.cpp
    M llvm/lib/LTO/LTO.cpp
    M llvm/lib/Object/IRSymtab.cpp
    M llvm/lib/Object/SFrameParser.cpp
    M llvm/lib/Support/CMakeLists.txt
    A llvm/lib/Support/DXILABI.cpp
    M llvm/lib/Support/Unix/Path.inc
    M llvm/lib/Support/Windows/Path.inc
    M llvm/lib/Target/AArch64/AArch64ExpandPseudoInsts.cpp
    M llvm/lib/Target/AArch64/AArch64InstrFormats.td
    M llvm/lib/Target/AArch64/AArch64InstrInfo.h
    M llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td
    M llvm/lib/Target/AArch64/AArch64SchedA320.td
    M llvm/lib/Target/AArch64/AArch64SchedA510.td
    M llvm/lib/Target/AArch64/AArch64SchedNeoverseN2.td
    M llvm/lib/Target/AArch64/AArch64SchedNeoverseN3.td
    M llvm/lib/Target/AArch64/AArch64SchedNeoverseV1.td
    M llvm/lib/Target/AArch64/AArch64SchedNeoverseV2.td
    M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp
    M llvm/lib/Target/AArch64/SVEInstrFormats.td
    M llvm/lib/Target/AMDGPU/AMDGPU.td
    M llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUCallLowering.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUFeatures.td
    M llvm/lib/Target/AMDGPU/AMDGPUPreloadKernArgProlog.cpp
    M llvm/lib/Target/AMDGPU/AMDGPURemoveIncompatibleFunctions.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUSearchableTables.td
    M llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
    M llvm/lib/Target/AMDGPU/GCNSubtarget.h
    M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUTargetStreamer.cpp
    M llvm/lib/Target/AMDGPU/SIInstrFormats.td
    M llvm/lib/Target/AMDGPU/SIRegisterInfo.td
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
    M llvm/lib/Target/DirectX/DXContainerGlobals.cpp
    M llvm/lib/Target/DirectX/DXILOpLowering.cpp
    M llvm/lib/Target/DirectX/DXILPostOptimizationValidation.cpp
    M llvm/lib/Target/DirectX/DXILRootSignature.h
    M llvm/lib/Target/Mips/MipsCCState.cpp
    M llvm/lib/Target/RISCV/MCTargetDesc/RISCVBaseInfo.h
    M llvm/lib/Target/RISCV/RISCVInstrFormats.td
    M llvm/lib/Target/RISCV/RISCVInstrInfoP.td
    M llvm/lib/Target/RISCV/RISCVInstrInfoV.td
    M llvm/lib/Target/RISCV/RISCVInstrInfoXRivos.td
    M llvm/lib/Target/RISCV/RISCVInstrInfoXSf.td
    M llvm/lib/Target/RISCV/RISCVInstrInfoXSfmm.td
    M llvm/lib/Target/RISCV/RISCVMoveMerger.cpp
    M llvm/lib/Target/RISCV/RISCVVLOptimizer.cpp
    M llvm/lib/Target/WebAssembly/WebAssemblyRuntimeLibcallSignatures.cpp
    M llvm/lib/Transforms/IPO/Inliner.cpp
    M llvm/lib/Transforms/IPO/ModuleInliner.cpp
    M llvm/lib/Transforms/Instrumentation/PGOCtxProfFlattening.cpp
    M llvm/lib/Transforms/Instrumentation/PGOInstrumentation.cpp
    M llvm/lib/Transforms/Instrumentation/PGOMemOPSizeOpt.cpp
    M llvm/lib/Transforms/Scalar/CMakeLists.txt
    M llvm/lib/Transforms/Scalar/JumpTableToSwitch.cpp
    M llvm/lib/Transforms/Scalar/LoopDistribute.cpp
    M llvm/lib/Transforms/Utils/Debugify.cpp
    M llvm/lib/Transforms/Utils/DeclareRuntimeLibcalls.cpp
    M llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
    M llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
    M llvm/test/Analysis/CostModel/AArch64/arith-fp.ll
    M llvm/test/Analysis/CostModel/AArch64/sve-arith-fp.ll
    M llvm/test/Analysis/DXILResource/buffer-frombinding.ll
    M llvm/test/Analysis/UniformityAnalysis/AMDGPU/intrinsics.ll
    A llvm/test/CodeGen/AArch64/expand-constructive-zzi.mir
    M llvm/test/CodeGen/AArch64/named-vector-shuffles-sve.ll
    A llvm/test/CodeGen/AArch64/pr142314.ll
    M llvm/test/CodeGen/AArch64/sve-bf16-converts.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-extract-subvector.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-fp-to-int.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-int-extends.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-int-rem.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-int-to-fp.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-limit-duplane.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-masked-loads.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-partial-reduce.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-length-shuffles.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-vector-llrint.ll
    M llvm/test/CodeGen/AArch64/sve-fixed-vector-lrint.ll
    M llvm/test/CodeGen/AArch64/sve-fptosi-sat.ll
    M llvm/test/CodeGen/AArch64/sve-fptoui-sat.ll
    M llvm/test/CodeGen/AArch64/sve-llrint.ll
    M llvm/test/CodeGen/AArch64/sve-lrint.ll
    M llvm/test/CodeGen/AArch64/sve-pr92779.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-fp-compares.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-fp-extend-trunc.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-fp-to-int.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-int-extends.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-int-rem.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-int-to-fp.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-limit-duplane.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-masked-load.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-masked-store.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-ptest.ll
    M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-reductions.ll
    M llvm/test/CodeGen/AArch64/sve2-bf16-converts.ll
    M llvm/test/CodeGen/AMDGPU/GlobalISel/mul.ll
    A llvm/test/CodeGen/AMDGPU/amdgcn-call-whole-wave.ll
    M llvm/test/CodeGen/AMDGPU/extra-lds-size.ll
    M llvm/test/CodeGen/AMDGPU/global-load-xcnt.ll
    M llvm/test/CodeGen/AMDGPU/irtranslator-whole-wave-functions.ll
    M llvm/test/CodeGen/AMDGPU/isel-whole-wave-functions.ll
    A llvm/test/CodeGen/AMDGPU/lds-limit-diagnostics-gfx1250.ll
    A llvm/test/CodeGen/AMDGPU/lds-size-hsa-gfx1250.ll
    A llvm/test/CodeGen/AMDGPU/lds-size-pal-gfx1250.ll
    M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.cvt.sat.pk.ll
    M llvm/test/CodeGen/AMDGPU/memory-legalizer-local-workgroup.ll
    M llvm/test/CodeGen/AMDGPU/packed-fp32.ll
    A llvm/test/CodeGen/AMDGPU/pal-metadata-3.0.gfx1250.ll
    M llvm/test/CodeGen/AMDGPU/preload-implicit-kernargs.ll
    M llvm/test/CodeGen/AMDGPU/preload-kernargs.ll
    A llvm/test/CodeGen/AMDGPU/remove-incompatible-wave64-feature.ll
    M llvm/test/CodeGen/AMDGPU/whole-wave-functions.ll
    M llvm/test/CodeGen/DirectX/llc-pipeline.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-cbuffer-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-descriptor-table-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-root-descriptor-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-sampler.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation-fail-static-sampler-range.ll
    A llvm/test/CodeGen/DirectX/rootsignature-validation.ll
    M llvm/test/CodeGen/Mips/llvm.frexp.ll
    M llvm/test/CodeGen/Mips/llvm.sincos.ll
    A llvm/test/CodeGen/RISCV/pr153598.mir
    M llvm/test/CodeGen/RISCV/rvv/fixed-vectors-interleaved-access.ll
    A llvm/test/CodeGen/RISCV/rvv/incorrect-extract-subvector-combine.ll
    M llvm/test/CodeGen/X86/pr33010.ll
    M llvm/test/CodeGen/X86/select-smin-smax.ll
    M llvm/test/MC/AMDGPU/hsa-diag-v4.s
    M llvm/test/MC/RISCV/rv32p-invalid.s
    M llvm/test/MC/RISCV/rv32p-valid.s
    M llvm/test/MC/RISCV/rv64p-invalid.s
    M llvm/test/MC/RISCV/rv64p-valid.s
    M llvm/test/TableGen/RuntimeLibcallEmitter.td
    A llvm/test/Transforms/Inline/optimization-remarks-passed-deleted-callee-yaml.ll
    A llvm/test/Transforms/InstSimplify/ConstProp/WebAssembly/dot.ll
    M llvm/test/Transforms/JumpTableToSwitch/basic.ll
    A llvm/test/Transforms/LoopDistribute/cross-partition-access.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/conditional-branches-cost.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/divs-with-scalable-vfs.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/eliminate-tail-predication.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/f128-fmuladd-reduction.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/gather-do-not-vectorize-addressing.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/induction-costs-sve.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/interleave-with-gaps.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/interleaving-reduction.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/low_trip_count_predicates.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/masked-call.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/optsize_minsize.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/outer_loop_prefer_scalable.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-chained.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-dot-product-mixed.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-dot-product.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-sub.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/pr60831-sve-inv-store-crash.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/reduction-recurrence-costs-sve.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/scalable-avoid-scalarization.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/scalable-reduction-inloop-cond.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/scalable-strict-fadd.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/simple_early_exit.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/single-early-exit-interleave.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/store-costs-sve.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-epilog-vect-inloop-reductions.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-epilog-vect-reductions.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-epilog-vect-strict-reductions.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-epilog-vect.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-fneg.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-inductions-unusual-types.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-inv-store.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-live-out-pointer-induction.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-multiexit.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-runtime-check-size-based-threshold.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-tail-folding-forced.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-tail-folding-reductions.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-tail-folding-unroll.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-tail-folding.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-vscale-based-trip-counts.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-widen-extractvalue.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/sve-widen-gep.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/tail-folding-styles.ll
    M llvm/test/Transforms/LoopVectorize/AArch64/veclib-intrinsic-calls.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/dead-ops-cost.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/fminimumnum.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/inloop-reduction.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/partial-reduce-dot-product.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/riscv-vector-reverse.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/scalable-tailfold.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-bin-unary-ops-args.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-call-intrinsics.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-cast-intrinsics.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-cond-reduction.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-div.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-fixed-order-recurrence.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-gather-scatter.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-inloop-reduction.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-interleave.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-intermediate-store.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-iv32.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-masked-loadstore.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-ordered-reduction.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-reduction.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-reverse-load-store.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-safe-dep-distance.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/uniform-load-store.ll
    M llvm/test/Transforms/LoopVectorize/RISCV/vectorize-vp-intrinsics.ll
    M llvm/test/Transforms/LoopVectorize/X86/consecutive-ptr-uniforms.ll
    M llvm/test/Transforms/LoopVectorize/X86/pr51366-sunk-instruction-used-outside-of-loop.ll
    M llvm/test/Transforms/LoopVectorize/dbg-outer-loop-vect.ll
    M llvm/test/Transforms/LoopVectorize/first-order-recurrence-dead-instructions.ll
    M llvm/test/Transforms/LoopVectorize/noalias-scope-decl.ll
    M llvm/test/Transforms/LoopVectorize/outer_loop_scalable.ll
    M llvm/test/Transforms/LoopVectorize/pr45525.ll
    M llvm/test/Transforms/LoopVectorize/preserve-dbg-loc-and-loop-metadata.ll
    M llvm/test/Transforms/LoopVectorize/runtime-check-readonly.ll
    M llvm/test/Transforms/LoopVectorize/scalable-assume.ll
    M llvm/test/Transforms/LoopVectorize/scalable-first-order-recurrence.ll
    M llvm/test/Transforms/LoopVectorize/scalable-iv-outside-user.ll
    M llvm/test/Transforms/LoopVectorize/scalable-lifetime.ll
    M llvm/test/Transforms/LoopVectorize/scalable-loop-unpredicated-body-scalar-tail.ll
    M llvm/test/Transforms/LoopVectorize/scalable-predication.ll
    M llvm/test/Transforms/LoopVectorize/scalable-reduction-inloop.ll
    M llvm/test/Transforms/LoopVectorize/scalable-trunc-min-bitwidth.ll
    M llvm/test/Transforms/LoopVectorize/vectorize-force-tail-with-evl.ll
    M llvm/test/Transforms/LoopVectorize/vplan-widen-select-instruction.ll
    M llvm/test/Transforms/PhaseOrdering/AArch64/sve-interleave-vectorization.ll
    M llvm/test/Transforms/SLPVectorizer/AArch64/reused-scalar-repeated-in-node.ll
    M llvm/test/Transforms/SLPVectorizer/AArch64/vec3-reorder-reshuffle.ll
    A llvm/test/Transforms/SLPVectorizer/X86/schedule-control-deps-after-copyable.ll
    M llvm/test/Transforms/SLPVectorizer/insertelement-postpone.ll
    A llvm/test/Verifier/AMDGPU/intrinsic-amdgcn-call-whole-wave.ll
    M llvm/test/tools/llvm-mca/AArch64/Cortex/A510-sve-instructions.s
    A llvm/test/tools/llvm-original-di-preservation/Inputs/expected-origin-reduced.html
    M llvm/test/tools/llvm-original-di-preservation/Inputs/expected-origin.html
    M llvm/test/tools/llvm-original-di-preservation/Inputs/origin.json
    M llvm/test/tools/llvm-original-di-preservation/basic.test
    A llvm/test/tools/llvm-readobj/ELF/sframe-reloc.test
    M llvm/tools/llvm-c-test/debuginfo.c
    M llvm/tools/llvm-readobj/ELFDumper.cpp
    M llvm/tools/opt/optdriver.cpp
    M llvm/unittests/IR/CMakeLists.txt
    A llvm/unittests/IR/RuntimeLibcallsTest.cpp
    M llvm/unittests/Support/Path.cpp
    M llvm/utils/TableGen/Basic/RuntimeLibcallsEmitter.cpp
    M llvm/utils/TableGen/DecoderEmitter.cpp
    M llvm/utils/gn/secondary/llvm/lib/Support/BUILD.gn
    M llvm/utils/gn/secondary/llvm/unittests/IR/BUILD.gn
    M llvm/utils/llvm-original-di-preservation.py
    A llvm/utils/release/clear-release-notes.py
    M mlir/include/mlir/Conversion/LLVMCommon/Pattern.h
    M mlir/include/mlir/Dialect/LLVMIR/BasicPtxBuilderInterface.h
    M mlir/include/mlir/Dialect/LLVMIR/LLVMIntrinsicOps.td
    M mlir/include/mlir/Dialect/LLVMIR/LLVMOpBase.td
    M mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
    M mlir/include/mlir/Dialect/OpenACC/OpenACCOps.td
    M mlir/include/mlir/Dialect/SCF/IR/SCFOps.td
    M mlir/include/mlir/Dialect/SPIRV/Interfaces/SPIRVImageInterfaces.h
    M mlir/include/mlir/Dialect/XeGPU/Utils/XeGPUUtils.h
    M mlir/include/mlir/InitAllTranslations.h
    A mlir/include/mlir/Target/Wasm/WasmBinaryEncoding.h
    A mlir/include/mlir/Target/Wasm/WasmImporter.h
    M mlir/include/mlir/Transforms/DialectConversion.h
    M mlir/lib/Conversion/NVVMToLLVM/NVVMToLLVM.cpp
    M mlir/lib/Conversion/SCFToControlFlow/SCFToControlFlow.cpp
    M mlir/lib/Conversion/SCFToEmitC/SCFToEmitC.cpp
    M mlir/lib/Conversion/SCFToSPIRV/SCFToSPIRV.cpp
    M mlir/lib/Conversion/VectorToXeGPU/CMakeLists.txt
    M mlir/lib/Conversion/VectorToXeGPU/VectorToXeGPU.cpp
    M mlir/lib/Dialect/LLVMIR/IR/LLVMDialect.cpp
    M mlir/lib/Dialect/Linalg/Transforms/HoistPadding.cpp
    M mlir/lib/Dialect/Linalg/Transforms/Hoisting.cpp
    M mlir/lib/Dialect/OpenACC/IR/OpenACC.cpp
    M mlir/lib/Dialect/SCF/IR/SCF.cpp
    M mlir/lib/Dialect/SCF/Transforms/BufferizableOpInterfaceImpl.cpp
    M mlir/lib/Dialect/SCF/Transforms/ForToWhile.cpp
    M mlir/lib/Dialect/SCF/Transforms/LoopPipelining.cpp
    M mlir/lib/Dialect/SCF/Transforms/LoopSpecialization.cpp
    M mlir/lib/Dialect/SCF/Transforms/StructuralTypeConversions.cpp
    M mlir/lib/Dialect/SCF/Transforms/TileUsingInterface.cpp
    M mlir/lib/Dialect/SCF/Utils/Utils.cpp
    M mlir/lib/Dialect/SparseTensor/Transforms/SparseVectorization.cpp
    M mlir/lib/Dialect/Vector/Transforms/VectorDistribute.cpp
    M mlir/lib/Dialect/XeGPU/IR/XeGPUOps.cpp
    M mlir/lib/Dialect/XeGPU/Utils/CMakeLists.txt
    M mlir/lib/Dialect/XeGPU/Utils/XeGPUUtils.cpp
    M mlir/lib/Target/CMakeLists.txt
    A mlir/lib/Target/Wasm/CMakeLists.txt
    A mlir/lib/Target/Wasm/TranslateFromWasm.cpp
    A mlir/lib/Target/Wasm/TranslateRegistration.cpp
    M mlir/lib/Transforms/Utils/DialectConversion.cpp
    M mlir/test/CMakeLists.txt
    M mlir/test/Conversion/SCFToControlFlow/convert-to-cfg.mlir
    M mlir/test/Conversion/SCFToSPIRV/for.mlir
    M mlir/test/Conversion/VectorToXeGPU/transfer-read-to-xegpu.mlir
    M mlir/test/Conversion/VectorToXeGPU/transfer-write-to-xegpu.mlir
    M mlir/test/Dialect/LLVMIR/nvvm.mlir
    M mlir/test/Dialect/SCF/ops.mlir
    M mlir/test/Dialect/XeGPU/invalid.mlir
    M mlir/test/Target/LLVMIR/Import/intrinsic.ll
    M mlir/test/Target/LLVMIR/llvmir-intrinsics.mlir
    M mlir/test/Target/LLVMIR/nvvmir.mlir
    A mlir/test/Target/Wasm/bad_wasm_version.yaml
    A mlir/test/Target/Wasm/function_export_out_of_scope.yaml
    A mlir/test/Target/Wasm/import.mlir
    A mlir/test/Target/Wasm/inputs/import.yaml.wasm
    A mlir/test/Target/Wasm/inputs/memory_min_eq_max.yaml.wasm
    A mlir/test/Target/Wasm/inputs/memory_min_max.yaml.wasm
    A mlir/test/Target/Wasm/inputs/memory_min_no_max.yaml.wasm
    A mlir/test/Target/Wasm/inputs/stats.yaml.wasm
    A mlir/test/Target/Wasm/inputs/table.yaml.wasm
    A mlir/test/Target/Wasm/invalid_function_type_index.yaml
    A mlir/test/Target/Wasm/memory_min_eq_max.mlir
    A mlir/test/Target/Wasm/memory_min_max.mlir
    A mlir/test/Target/Wasm/memory_min_no_max.mlir
    A mlir/test/Target/Wasm/missing_header.yaml
    M mlir/test/Transforms/test-legalizer.mlir
    M mlir/test/lib/Dialect/Test/TestPatterns.cpp
    M offload/liboffload/API/APIDefs.td
    M offload/liboffload/API/Queue.td
    M offload/liboffload/src/OffloadImpl.cpp
    M offload/plugins-nextgen/amdgpu/src/rtl.cpp
    M offload/plugins-nextgen/common/include/PluginInterface.h
    M offload/plugins-nextgen/common/src/PluginInterface.cpp
    M offload/plugins-nextgen/cuda/src/rtl.cpp
    M offload/plugins-nextgen/host/src/rtl.cpp
    M offload/test/mapping/data_member_ref.cpp
    M offload/test/mapping/declare_mapper_nested_default_mappers.cpp
    M offload/test/mapping/declare_mapper_nested_mappers.cpp
    M offload/test/mapping/ptr_and_obj_motion.c
    M offload/test/mapping/target_derefence_array_pointrs.cpp
    M offload/test/mapping/target_has_device_addr.c
    A offload/test/offloading/fortran/declare-target-automap.f90
    M offload/unittests/OffloadAPI/CMakeLists.txt
    A offload/unittests/OffloadAPI/queue/olLaunchHostFunction.cpp
    M openmp/runtime/src/kmp_settings.cpp
    M utils/bazel/llvm-project-overlay/libc/BUILD.bazel
    M utils/bazel/llvm-project-overlay/mlir/BUILD.bazel

  Log Message:
  -----------
  Rebase

Created using spr 1.3.5

Compare: https://github.com/llvm/llvm-project/compare/cef7e3275b9a...efe2e4d48475
