[all-commits] [llvm/llvm-project] 431213: [mlir][linalg] Implement patterns for reducing ran...
Vitaly Buka via All-commits
all-commits at lists.llvm.org
Tue Jun 25 10:04:24 PDT 2024
Branch: refs/heads/users/vitalybuka/spr/tsan-lockunlock-allocator-and-stacks-on-fork
Home: https://github.com/llvm/llvm-project
Commit: 431213c99d7707114d8e7956073a057cf1607160
https://github.com/llvm/llvm-project/commit/431213c99d7707114d8e7956073a057cf1607160
Author: srcarroll <50210727+srcarroll at users.noreply.github.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M mlir/include/mlir/Dialect/Linalg/Transforms/Transforms.h
M mlir/lib/Dialect/Linalg/Transforms/DropUnitDims.cpp
A mlir/test/Dialect/Linalg/rank-reduce-contraction-ops.mlir
M mlir/test/lib/Dialect/Linalg/CMakeLists.txt
A mlir/test/lib/Dialect/Linalg/TestLinalgRankReduceContractionOps.cpp
M mlir/tools/mlir-opt/mlir-opt.cpp
Log Message:
-----------
[mlir][linalg] Implement patterns for reducing rank of named linalg contraction ops (#95710)
This patch introduces pattern rewrites for reducing the rank of named
linalg contraction ops with unit spatial dim(s) to other named
contraction ops. For example `linalg.batch_matmul` with batch size 1 ->
`linalg.matmul` and `linalg.matmul` with unit LHS spatial dim ->
`linalg.vecmat`, etc. These patterns don't support reducing the rank
along reduction dimension as those don't convert to other named
contraction ops.
Commit: 962d7ac7fcdf6f0cb43e36dec2a6a67cadd7c46c
https://github.com/llvm/llvm-project/commit/962d7ac7fcdf6f0cb43e36dec2a6a67cadd7c46c
Author: Quentin Michaud <128144579+mh4ck-Thales at users.noreply.github.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M clang/include/clang/Basic/LangOptions.h
M clang/include/clang/Driver/Options.td
M clang/lib/Driver/ToolChains/WebAssembly.cpp
Log Message:
-----------
Add flag to opt out of wasm-opt (#95208)
This PR fixes #55781 by adding the `--no-wasm-opt` and `--wasm-opt`
flags in clang to disable/enable the `wasm-opt` optimizations. The
default is to enable `wasm-opt` as before in order to not break existing
workflows.
I think that adding a warning when no flag or the `--wasm-opt` flag is
given but `wasm-opt` wasn't found in the path may be relevant here. It
allows people using `wasm-opt` to be aware of if it have been used on
their produced binary or not. The only downside I see to this is that
people already using the toolchain with the `-O` and `-Werror` flags but
without `wasm-opt` in the path will see their toolchain break (with an
easy fix: either adding `--no-wasm-opt` or add `wasm-opt` to the path).
I haven't implemented this here because I haven't figured out how to add
such a warning, and I don't know if this warning should be added here or
in another PR.
CC @sunfishcode that proposed in the associated issue to review this
patch.
Commit: 0ab81986be7c2df8dc784e9e950c906274b51fd4
https://github.com/llvm/llvm-project/commit/0ab81986be7c2df8dc784e9e950c906274b51fd4
Author: Fred Grim <fgrim at apple.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/test/tools/llvm-readobj/ELF/dynamic-tags.test
M llvm/tools/llvm-readobj/ELFDumper.cpp
Log Message:
-----------
[llvm-readobj][ELF] Implement JSON output for --dynamic-table (#95976)
When printing JSON output with --dynamic-table I noticed that the output
is invalid JSON. This patch overrides the printDynamicTable() function
in the JSONELFDumper to return a list of dictionaries for the
DynamicSection value.
Before the output was:
```
{
"FileSummary": {
"File": "bin/llvm-readelf",
"Format": "elf64-x86-64",
"Arch": "x86_64",
"AddressSize": "64bit",
"LoadName": "<Not found>"
}DynamicSection [ (35 entries)
Tag Type Name/Value
0x000000000000001D RUNPATH Library runpath: [$ORIGIN/../lib:]
0x0000000000000001 NEEDED Shared library: [libm.so.6]
0x0000000000000001 NEEDED Shared library: [libz.so.1]
0x0000000000000001 NEEDED Shared library: [libzstd.so.1]
```
Now the output looks like:
```
"DynamicSection": [
{
"Tag": 29,
"Type": "RUNPATH",
"Value": 6322,
"Path": [
"$ORIGIN/../lib"
]
},
{
"Tag": 1,
"Type": "NEEDED",
"Value": 6109,
"Library": "libm.so.6"
},
```
Commit: 58cb0e651fee277777c7121465d488dba43adbd6
https://github.com/llvm/llvm-project/commit/58cb0e651fee277777c7121465d488dba43adbd6
Author: Jonas Devlieghere <jonas at devlieghere.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M .github/CODEOWNERS
Log Message:
-----------
[.github] Add myself as a reviewer for DWARFLinker & friends (#96531)
Commit: 0534953253725a0f3aa5a83ba5e09836c768860e
https://github.com/llvm/llvm-project/commit/0534953253725a0f3aa5a83ba5e09836c768860e
Author: Keith Smiley <keithbsmiley at gmail.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M utils/bazel/llvm-project-overlay/lldb/source/Plugins/BUILD.bazel
Log Message:
-----------
[bazel] Port #95312 (#96533)
Commit: 33a9c57b89c3ea901a057c3fcc9c9160eaf5a625
https://github.com/llvm/llvm-project/commit/33a9c57b89c3ea901a057c3fcc9c9160eaf5a625
Author: Miro Bucko <mbucko at meta.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M lldb/test/API/python_api/find_in_memory/address_ranges_helper.py
Log Message:
-----------
[lldb] Fix failing TestFind(Ranges)InMemory.py tests. (#96511)
This is to unblock #95007. Will investigate why the assertion is failing
on some arch.
Commit: 2f69e9a212bb3b965c4d79ff8a12e1e63d679587
https://github.com/llvm/llvm-project/commit/2f69e9a212bb3b965c4d79ff8a12e1e63d679587
Author: Fangrui Song <i at maskray.me>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/test/DebugInfo/COFF/fission-sections.ll
Log Message:
-----------
[COFF,test] Improve fission.ll
Commit: b7b337fb91f9b0538fcc4467ffca7c6c71192bc9
https://github.com/llvm/llvm-project/commit/b7b337fb91f9b0538fcc4467ffca7c6c71192bc9
Author: Kazu Hirata <kazu at google.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M mlir/lib/Dialect/Affine/Transforms/LoopFusion.cpp
M mlir/lib/Dialect/GPU/Transforms/NVVMAttachTarget.cpp
M mlir/lib/Dialect/GPU/Transforms/ROCDLAttachTarget.cpp
M mlir/lib/Dialect/GPU/Transforms/SPIRVAttachTarget.cpp
M mlir/lib/Dialect/Transform/Interfaces/MatchInterfaces.cpp
M mlir/lib/IR/AffineMap.cpp
M mlir/lib/Tools/mlir-pdll-lsp-server/PDLLServer.cpp
Log Message:
-----------
[mlir] Use llvm::unique (NFC) (#96415)
Commit: d6a3bd1325c5c54ef59e8a612451757c86186355
https://github.com/llvm/llvm-project/commit/d6a3bd1325c5c54ef59e8a612451757c86186355
Author: Aaron Ballman <aaron at aaronballman.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M clang/www/get_involved.html
Log Message:
-----------
Update Clang extension criteria (#96532)
This updates Clang's extension criteria to explicitly mention impacts on
other projects within the monorepo.
These changes were discussed in the following RFC:
https://discourse.llvm.org/t/rfc-require-discussion-of-impact-to-monorepo-stakeholders-when-adding-new-clang-extensions/79613
Commit: 2ae09052477e1a966afbc5482d88585f95694c53
https://github.com/llvm/llvm-project/commit/2ae09052477e1a966afbc5482d88585f95694c53
Author: Aaron Ballman <aaron at aaronballman.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
A clang/test/C/C2x/n2819.c
M clang/www/c_status.html
Log Message:
-----------
[C23] Claim we do not conform to N2819
This paper clarified the lifetime of compound literal objects in odd
scopes, such as use at function prototype scope.
We do not currently implement this paper, as the new test demonstrates.
Commit: 62baf21daa377c4ec1a641b26931063c1117d262
https://github.com/llvm/llvm-project/commit/62baf21daa377c4ec1a641b26931063c1117d262
Author: Sander de Smalen <sander.desmalen at arm.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
M llvm/lib/Target/AArch64/AArch64InstrInfo.cpp
M llvm/lib/Target/AArch64/AArch64InstrInfo.td
M llvm/lib/Target/AArch64/AArch64SMEInstrInfo.td
M llvm/test/CodeGen/AArch64/aarch64-sme2-asm.ll
M llvm/test/CodeGen/AArch64/sme-intrinsics-add.ll
M llvm/test/CodeGen/AArch64/sme-intrinsics-loads.ll
M llvm/test/CodeGen/AArch64/sme-intrinsics-mopa.ll
M llvm/test/CodeGen/AArch64/sme-intrinsics-mops.ll
M llvm/test/CodeGen/AArch64/sme-intrinsics-stores.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-add-sub-za16.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-add.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-cvt.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-cvtl.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-cvtn.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-extract-mova.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-fmlas.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-fmlas16.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-fp-dots.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-frint.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-insert-mova.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-int-dots.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-ld1.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-ldnt1.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti2-lane-x2.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti2-lane-x4.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti2-lane.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti4-lane-x2.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti4-lane-x4.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti4-lane.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-max.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-min.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-mlall.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-mlals.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-mop.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-mopa.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-qcvt.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-qcvtn.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-qrshr.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-rshl.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-select-sme-tileslice.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-sqdmulh.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-sub.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-vdot.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-zero-zt.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-zt0.ll
M llvm/test/CodeGen/AArch64/sme2p1-intrinsics-zero.ll
M llvm/test/CodeGen/AArch64/sve-callee-save-restore-pairs.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-contiguous-prefetches.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-conversion.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-counting-bits.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-counting-elems.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-fp-converts.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-ldN-sret-reg+imm-addr-mode.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-ldN-sret-reg+reg-addr-mode.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-logical.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-pred-creation.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-pred-operations.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-pred-testing.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-reinterpret.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-reversal.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-sel.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-sqdec.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-sqinc.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-st1-addressing-mode-reg-imm.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-st1-addressing-mode-reg-reg.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-st1.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-stN-reg-imm-addr-mode.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-stN-reg-reg-addr-mode.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-stores.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-uqdec.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-uqinc.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-while.ll
M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-reshuffle.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-binary-narrowing-add-sub.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-binary-narrowing-shr.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-complex-dot.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-contiguous-conflict-detection.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-fp-converts.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-fp-int-binary-logarithm.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-fp-widening-mul-acc.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-int-mul-lane.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-non-widening-pairwise-arith.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-polynomial-arithmetic.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-psel.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-revd.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-unary-narrowing.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-uniform-complex-arith.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-while-reversed.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-while.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-widening-complex-int-arith.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-widening-dsp.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-widening-pairwise-arith.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-cntp.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-dots.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-fclamp.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-fp-reduce.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-int-reduce.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-loads.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-predicate-as-counter.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-sclamp.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-selx2.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-selx4.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-stores.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-uclamp.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-unpk.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-uzpx2.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-uzpx4.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-while-pn.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-zipx2.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-zipx4.ll
Log Message:
-----------
[AArch64] Check for streaming mode in HasSME* features. (#96302)
This also fixes up some asserts in copyPhysReg, loadRegFromStackSlot and
storeRegToStackSlot.
Commit: 09c0337a581dfd8f39df131786cfc7f675adf483
https://github.com/llvm/llvm-project/commit/09c0337a581dfd8f39df131786cfc7f675adf483
Author: Sander de Smalen <sander.desmalen at arm.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M clang/include/clang/Basic/arm_sme.td
M clang/include/clang/Basic/arm_sve.td
M clang/include/clang/Basic/arm_sve_sme_incl.td
M clang/test/Sema/aarch64-sve-intrinsics/acle_sve_bfloat.cpp
M clang/test/Sema/aarch64-sve2-intrinsics/acle_sve2_bfloat.cpp
M clang/utils/TableGen/SveEmitter.cpp
Log Message:
-----------
[Clang][SveEmitter] Split up TargetGuard into SVE and SME component. (#96482)
One reason to want to split this up is to simplify the code added in
#93802, where it checks the SME streaming-mode requirements for a
builtin by checking for the absence of SVE. If the target guards are
separate, we can generate a table and make the Sema code to verify the
runtime mode simpler.
Another reason is to avoid an issue with a check in SveEmitter.cpp where
it ensures that the 'VerifyRuntimeMode' is set correctly for functions
that have both SVE and SME target guards:
if (!Def->isFlagSet(VerifyRuntimeMode) &&
Def->getGuard().contains("sve") &&
Def->getGuard().contains("sme"))
llvm_unreachable("Missing VerifyRuntimeMode flag");
However, if we ever add a new feature with "sme" in the name, even
though it is unrelated to FEAT_SME, then this code no longer works.
Note that the arm_sve.td and arm_sme.td files could do with a bit of
restructuring after this but it seems better to follow that up in an NFC
patch.
Commit: f2d3d829b97a221c9ce3a3467a20ea51bb29ecbd
https://github.com/llvm/llvm-project/commit/f2d3d829b97a221c9ce3a3467a20ea51bb29ecbd
Author: Matthias Springer <me at m-sp.org>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M mlir/lib/Dialect/Linalg/TransformOps/LinalgTransformOps.cpp
Log Message:
-----------
[mlir][linalg][Transform] Fix use-after-free in `SplitOp::apply` (#96390)
Detected with ASAN. `Operation::getLoc()` was called after erasing the
operation.
Reverts 48cf6b6bbe7a22bfcd98f82dc7afd21c9decd22f, which attempted to fix
the use-after-free. (But the use-after-free is still there when the
`hasFailed` branch is taken.)
Commit: d30b082fd4aeba0a3a99c3f17dbffe6691f859cc
https://github.com/llvm/llvm-project/commit/d30b082fd4aeba0a3a99c3f17dbffe6691f859cc
Author: Terry Wilmarth <terry.l.wilmarth at intel.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M openmp/runtime/src/dllexports
M openmp/runtime/src/kmp.h
M openmp/runtime/src/kmp_csupport.cpp
M openmp/runtime/src/kmp_runtime.cpp
A openmp/runtime/test/parallel/omp_parallel_num_threads_list.c
A openmp/runtime/test/parallel/omp_parallel_num_threads_strict.c
Log Message:
-----------
[OpenMP] Add num_threads clause list format and strict modifier support (#85466)
Add support to the runtime for 6.0 spec features that allow num_threads
clause to take a list, and also make use of the strict modifier.
Provides new compiler interface functions for these features.
Commit: 0555afd02401551c10dbf7f646a98501929b27e2
https://github.com/llvm/llvm-project/commit/0555afd02401551c10dbf7f646a98501929b27e2
Author: Arthur Eubanks <aeubanks at google.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/lib/Analysis/MLInlineAdvisor.cpp
Log Message:
-----------
[NFC][MLInliner] Rename LastSCC -> CurSCC (#96546)
The passed SCC is the current SCC we're working on.
Commit: b1a93dbaba87119556c1138b64cde6aa5ee8a854
https://github.com/llvm/llvm-project/commit/b1a93dbaba87119556c1138b64cde6aa5ee8a854
Author: Keith Smiley <keithbsmiley at gmail.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M utils/bazel/llvm-project-overlay/llvm/BUILD.bazel
M utils/bazel/llvm-project-overlay/mlir/BUILD.bazel
Log Message:
-----------
[bazel] Export distributable lldb files (#96549)
If you're building and vendoring lldb, you might need to also vendor
these files.
Commit: b097018fdafe61f1fe10337a71f56e5386930d54
https://github.com/llvm/llvm-project/commit/b097018fdafe61f1fe10337a71f56e5386930d54
Author: Mike Rice <michael.p.rice at intel.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M clang/lib/Sema/SemaOpenMP.cpp
M clang/test/OpenMP/Inputs/nesting_of_regions.cpp
Log Message:
-----------
[clang][OpenMP] Fix teams nesting of region check (#94806)
The static verifier flagged dead code in the check since the loop will
only execute once and never reach the iterator increment.
The loop needs to iterate twice to correctly diagnose when a statement
is after the teams.
Since there are two iterations again, reset the iterator to the first
teams directive when the double teams case is seen so the diagnostic can
report both locations.
Commit: 313b1a8250d3cd742356cc71c36402b10d8b2e16
https://github.com/llvm/llvm-project/commit/313b1a8250d3cd742356cc71c36402b10d8b2e16
Author: Mircea Trofin <mtrofin at google.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/include/llvm/Analysis/ReleaseModeModelRunner.h
M llvm/lib/Analysis/MLInlineAdvisor.cpp
M llvm/unittests/Analysis/MLModelRunnerTest.cpp
Log Message:
-----------
[mlgo] Support composite AOT-ed models (#96276)
This applies to the AOT case where we embed models in the compiler. The
change adds support for multiple models for the same agent, and allows
the user select one via a command line flag. "agent" refers to e.g. the
inline advisor or the register allocator eviction advisor.
To avoid build setup complexity, the support is delegated to the saved
model. Since saved models define computational graphs, we can generate a
composite model (this happens prior to building and embedding it in LLVM
and is not shown in this change) that exposes an extra feature with a
predefined name: `_model_selector`. The model, then, delegates
internally to contained models based on that feature value.
Model selection is expected to happen at model instantiation, there is
no current scenario for switching them afterwards.
If the model doesn't expose such a feature but the user passes one, we
report error.
If the model exposes such a feature but the user doesn't pass one, we
also report an error.
Invalid model selector values are expected to be handled by the saved
model.
Internally, the model uses a pair of uint64 values - the high and low of
the MD5 hash of the name.
A tool composing models would, then, need to:
- expose the extra feature, `_model_selector`, shape (2,), uint64 data
type
- test its value (`tf.cond` or `tf.case` in Tensorflow) against the MD5
hash, in the [high, low] order, of contained models based on a
user-specified name (which the user will then use as flag value to the
compiler)
Agents just need to add a flag to capture the name of a model and pass
it to `ReleaseModeModelRunner` at construction. This can be passed in
all cases without checking - the case where the model is not composite
and we pass an empty name, everything works as before.
This change also factors out the string flags we pass to the
`ReleaseModeModelRunner` for better maintainability (we risk confusing
parameters that are strings otherwise)
Commit: 5413a2bb843a3c71e0891aa5984afd63cd580dea
https://github.com/llvm/llvm-project/commit/5413a2bb843a3c71e0891aa5984afd63cd580dea
Author: Mike Rice <michael.p.rice at intel.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M clang/include/clang/Basic/DiagnosticParseKinds.td
M clang/lib/Parse/ParseOpenMP.cpp
M clang/test/OpenMP/declare_variant_clauses_messages.cpp
Log Message:
-----------
[clang][OpenMP] Fix error handling of the adjust_args clause (#94696)
Static verifier noticed the current code has logically dead code parsing
the clause where IsComma is assigned. Fix this and improve the error
message received when a bad adjust-op is specified.
This will now be handled like 'map' where a nice diagnostic is given
with the correct values, then parsing continues on the next clause
reducing unhelpful diagnostics.
Commit: 3aef525aa4b9a5395b6ac4ae771e28e64b27a126
https://github.com/llvm/llvm-project/commit/3aef525aa4b9a5395b6ac4ae771e28e64b27a126
Author: vangthao95 <vang.thao at amd.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
M llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.h
M llvm/lib/Target/AMDGPU/AMDGPUInstructionSelector.cpp
M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
M llvm/test/CodeGen/AMDGPU/GlobalISel/inst-select-load-constant.mir
M llvm/test/CodeGen/AMDGPU/GlobalISel/smrd.ll
M llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-smem.ll
M llvm/test/CodeGen/AMDGPU/gfx12_scalar_subword_loads.ll
M llvm/test/CodeGen/AMDGPU/global-saddr-load.ll
M llvm/test/CodeGen/AMDGPU/llvm.prefetch.ll
M llvm/test/CodeGen/AMDGPU/smrd.ll
Log Message:
-----------
[AMDGPU] Fix negative immediate offset for unbuffered smem loads (#89165)
For unbuffered smem loads, it is illegal for the immediate offset to be
negative if the resulting IOFFSET + (SGPR[Offset] or M0 or zero) is
negative.
New PR of https://github.com/llvm/llvm-project/pull/79553.
Commit: d2864d1716b5e5b5eabb3004b04a1d7f67803f0d
https://github.com/llvm/llvm-project/commit/d2864d1716b5e5b5eabb3004b04a1d7f67803f0d
Author: Louis Dionne <ldionne.2 at gmail.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M libcxx/CMakeLists.txt
M libcxx/lib/abi/powerpc-ibm-aix.libcxxabi.v1.stable.exceptions.nonew.abilist
M libcxx/lib/abi/powerpc64-ibm-aix.libcxxabi.v1.stable.exceptions.nonew.abilist
M libcxx/lib/abi/x86_64-unknown-freebsd.libcxxabi.v1.stable.exceptions.nonew.abilist
M libcxx/lib/abi/x86_64-unknown-linux-gnu.libcxxabi.v1.stable.exceptions.nonew.abilist
M libcxx/lib/abi/x86_64-unknown-linux-gnu.libcxxabi.v1.stable.noexceptions.nonew.abilist
Log Message:
-----------
[libc++] Build with -fsized-deallocation (#96217)
This patch makes libc++ build with -fsized-deallocation. That flag is
enabled by default in recent versions of Clang, so this patch will make
libc++ forward-compatible with ToT Clang.
Commit: bea7f3d8739a12c599525c18c151f37b1293a080
https://github.com/llvm/llvm-project/commit/bea7f3d8739a12c599525c18c151f37b1293a080
Author: Nick Desaulniers (paternity leave) <nickdesaulniers at users.noreply.github.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M libc/include/llvm-libc-macros/CMakeLists.txt
A libc/include/llvm-libc-macros/link-macros.h
M libc/startup/linux/CMakeLists.txt
M libc/startup/linux/do_start.cpp
Log Message:
-----------
[libc][startup] create header for ElfW and use in startup (#96510)
This is necessary for 32b platforms such as ARM and i386.
Link: #94128
Commit: dd8d978b7642b410f3a4895aa20f0aada4a0f7e4
https://github.com/llvm/llvm-project/commit/dd8d978b7642b410f3a4895aa20f0aada4a0f7e4
Author: Keith Smiley <keithbsmiley at gmail.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M utils/bazel/llvm-project-overlay/lldb/BUILD.bazel
Log Message:
-----------
[bazel] Switch mach_gen to apple_genrule (#96551)
mig is a tool vendored with Xcode. Using apple_genrule makes sure that
the bazel selected version of Xcode is preferred, and that the action is
invalidated when that version changes.
Commit: a030c8bc87be0599e861f5d8fe8254ac8d6e84ba
https://github.com/llvm/llvm-project/commit/a030c8bc87be0599e861f5d8fe8254ac8d6e84ba
Author: Keith Smiley <keithbsmiley at gmail.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
A utils/bazel/llvm-project-overlay/lldb/tools/lldb-dap/BUILD.bazel
Log Message:
-----------
[bazel] Add support for building lldb-dap (#96556)
Commit: 687d6fbfac6205e6f53ea58584278c09738ea708
https://github.com/llvm/llvm-project/commit/687d6fbfac6205e6f53ea58584278c09738ea708
Author: Akshay Deodhar <adeodhar at nvidia.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/include/llvm/IR/IntrinsicsNVVM.td
M llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
M llvm/lib/Target/NVPTX/NVPTXLowerArgs.cpp
M llvm/lib/Target/NVPTX/NVPTXUtilities.cpp
M llvm/lib/Target/NVPTX/NVPTXUtilities.h
A llvm/test/CodeGen/NVPTX/lower-args-gridconstant.ll
Log Message:
-----------
[NVPTX] Basic support for "grid_constant" (#96125)
- Adds a helper function for checking whether an argument is a
[grid_constant](https://docs.nvidia.com/cuda/nvvm-ir-spec/index.html#supported-properties).
- Adds support for cvta.param using changes from
https://github.com/llvm/llvm-project/pull/95289
- Supports escaped grid_constant pointers conservatively, by casting all
uses to the generic address space with cvta.param.
Commit: 5ae50698a0d6a3022af2e79d405a7eb6c8c790f0
https://github.com/llvm/llvm-project/commit/5ae50698a0d6a3022af2e79d405a7eb6c8c790f0
Author: Ramkumar Ramachandra <ramkumar.ramachandra at codasip.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/lib/Analysis/LoopAccessAnalysis.cpp
M llvm/test/Analysis/LoopAccessAnalysis/symbolic-stride.ll
Log Message:
-----------
LAA: strip unnecessary getUniqueCastUse (#92119)
733b8b2 ([LAA] Simplify identification of speculatable strides [nfc])
refactored getStrideFromPointer() to compute directly on SCEVs, and
return an SCEV expression instead of a Value. However, it left behind a
call to getUniqueCastUse(), which is completely unnecessary. Remove
this, showing a positive test update, and simplify the surrounding
program logic.
Commit: 16903ace180755b7558234ff2b2e8d89b00dcb88
https://github.com/llvm/llvm-project/commit/16903ace180755b7558234ff2b2e8d89b00dcb88
Author: lntue <35648136+lntue at users.noreply.github.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M libc/config/darwin/arm/entrypoints.txt
M libc/config/linux/aarch64/entrypoints.txt
M libc/config/linux/arm/entrypoints.txt
M libc/config/linux/riscv/entrypoints.txt
M libc/docs/math/index.rst
M libc/src/__support/FPUtil/double_double.h
M libc/src/__support/FPUtil/dyadic_float.h
M libc/src/__support/macros/optimization.h
M libc/src/math/generic/CMakeLists.txt
A libc/src/math/generic/range_reduction_double_common.h
A libc/src/math/generic/range_reduction_double_fma.h
A libc/src/math/generic/range_reduction_double_nofma.h
A libc/src/math/generic/sin.cpp
A libc/src/math/generic/sincos_eval.h
M libc/src/math/x86_64/CMakeLists.txt
R libc/src/math/x86_64/sin.cpp
M libc/test/src/math/sin_test.cpp
M libc/test/src/math/smoke/CMakeLists.txt
A libc/test/src/math/smoke/sin_test.cpp
Log Message:
-----------
[libc][math] Implement double precision sin correctly rounded to all rounding modes. (#95736)
- Algorithm:
- Step 1 - Range reduction: for a double precision input `x`, return `k`
and `u` such that
- k is an integer
- u = x - k * pi / 128, and |u| < pi/256
- Step 2 - Calculate `sin(u)` and `cos(u)` in double-double using Taylor
polynomials with errors < 2^-70 with FMA or < 2^-66 w/o FMA.
- Step 3 - Calculate `sin(x) = sin(k*pi/128) * cos(u) + cos(k*pi/128) *
sin(u)` using look-up table for `sin(k*pi/128)` and `cos(k*pi/128)`.
- Step 4 - Use Ziv's rounding test to decide if the result is correctly
rounded.
- Step 4' - If the Ziv's rounding test failed, redo step 1-3 using
128-bit precision.
- Currently, without FMA instructions, the large range reduction only
works correctly for the default rounding mode (FE_TONEAREST).
- Provide `LIBC_MATH` flag so that users can set `LIBC_MATH =
LIBC_MATH_SKIP_ACCURATE_PASS` to build the `sin` function without step 4
and 4'.
Commit: a32b7199f0c15ea1c6c9490b6166c019c9d4bd2b
https://github.com/llvm/llvm-project/commit/a32b7199f0c15ea1c6c9490b6166c019c9d4bd2b
Author: Chelsea Cassanova <chelsea_cassanova at apple.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M lldb/bindings/python/python-typemaps.swig
M lldb/include/lldb/API/SBProcess.h
M lldb/include/lldb/Core/AddressRangeListImpl.h
M lldb/include/lldb/Target/Process.h
M lldb/source/API/SBProcess.cpp
M lldb/source/Target/Process.cpp
R lldb/test/API/python_api/find_in_memory/Makefile
R lldb/test/API/python_api/find_in_memory/TestFindInMemory.py
R lldb/test/API/python_api/find_in_memory/TestFindRangesInMemory.py
R lldb/test/API/python_api/find_in_memory/address_ranges_helper.py
R lldb/test/API/python_api/find_in_memory/main.cpp
Log Message:
-----------
Revert commits that add `TestFind(Ranges)InMemory.py` (#96560)
Reverting to unblock macOS buildbots which are currently failing on
these tests.
https://green.lab.llvm.org/job/llvm.org/view/LLDB/job/as-lldb-cmake/6377/
Commit: 75ac887a3033c6e4eb8e423a78490c8d4bf7d5b5
https://github.com/llvm/llvm-project/commit/75ac887a3033c6e4eb8e423a78490c8d4bf7d5b5
Author: shaw young <58664393+shawbyoung at users.noreply.github.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M bolt/docs/CommandLineArgumentReference.md
Log Message:
-----------
[BOLT][NFC] Sync CommandLineArgumentReference with options (#96563)
Commit: a3a44bfbdfefe0928124f9e40d242507f75b87f4
https://github.com/llvm/llvm-project/commit/a3a44bfbdfefe0928124f9e40d242507f75b87f4
Author: Paul Kirth <paulkirth at google.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/include/llvm/IR/ProfDataUtils.h
M llvm/lib/IR/Instructions.cpp
M llvm/lib/IR/ProfDataUtils.cpp
M llvm/lib/IR/Verifier.cpp
Log Message:
-----------
[llvm][ProfDataUtils] Provide getNumBranchWeights API (#90146)
As suggested in
https://github.com/llvm/llvm-project/pull/86609/files#r1556689262
an API for getting the number of branch weights directly from the MD
node would be useful in a variety of checks, and keeps the logic within
ProfDataUtils.
Commit: 5e097c79d820683b72e2bac8e56df93801ad85ec
https://github.com/llvm/llvm-project/commit/5e097c79d820683b72e2bac8e56df93801ad85ec
Author: shaw young <58664393+shawbyoung at users.noreply.github.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M bolt/docs/CommandLineArgumentReference.md
M bolt/lib/Profile/YAMLProfileReader.cpp
M bolt/lib/Rewrite/RewriteInstance.cpp
M bolt/lib/Utils/CommandLineOpts.cpp
A bolt/test/X86/hashing-based-function-matching.test
Log Message:
-----------
[BOLT] Hash-based function matching (#95821)
Using the hashes of binary and profiled functions
to recover functions with changed names.
Test Plan: added
hashing-based-function-matching.test.
Commit: b3c668b0055717633503ed26787037d9e3499781
https://github.com/llvm/llvm-project/commit/b3c668b0055717633503ed26787037d9e3499781
Author: Jon Roelofs <jonathan_roelofs at apple.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M clang/docs/LanguageExtensions.rst
Log Message:
-----------
[clang][docs] '#pragma clang section' is supported on Mach-O. NFC
This was added back in 7f6e3316456f939a062aad0eeaac983251a1747c, but I forgot
to update the docs that referenced it.
Commit: 32e4906c28952d2dd8ed193cfd37856b98233ef5
https://github.com/llvm/llvm-project/commit/32e4906c28952d2dd8ed193cfd37856b98233ef5
Author: shaw young <58664393+shawbyoung at users.noreply.github.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M bolt/docs/CommandLineArgumentReference.md
M bolt/lib/Profile/YAMLProfileReader.cpp
M bolt/lib/Rewrite/RewriteInstance.cpp
M bolt/lib/Utils/CommandLineOpts.cpp
R bolt/test/X86/hashing-based-function-matching.test
Log Message:
-----------
Revert "[BOLT] Hash-based function matching" (#96568)
Reverts llvm/llvm-project#95821
Commit: 0d4da0df166ea7512c6e97e182b21cd706293eaa
https://github.com/llvm/llvm-project/commit/0d4da0df166ea7512c6e97e182b21cd706293eaa
Author: Miro Bucko <mbucko at meta.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M lldb/bindings/python/python-typemaps.swig
M lldb/include/lldb/API/SBProcess.h
M lldb/include/lldb/Core/AddressRangeListImpl.h
M lldb/include/lldb/Target/Process.h
M lldb/source/API/SBProcess.cpp
M lldb/source/Target/Process.cpp
A lldb/test/API/python_api/find_in_memory/Makefile
A lldb/test/API/python_api/find_in_memory/TestFindInMemory.py
A lldb/test/API/python_api/find_in_memory/TestFindRangesInMemory.py
A lldb/test/API/python_api/find_in_memory/address_ranges_helper.py
A lldb/test/API/python_api/find_in_memory/main.cpp
Log Message:
-----------
[lldb][API] Add Find(Ranges)InMemory() to Process SB API (#96569)
This is a second attempt to land #95007
Test Plan:
llvm-lit
llvm-project/lldb/test/API/python_api/find_in_memory/TestFindInMemory.py
llvm-project/lldb/test/API/python_api/find_in_memory/TestFindRangesInMemory.py
Reviewers: clayborg
Tasks: lldb
Commit: dc27ff10497ea0c7d93de028badffa62707d8b6c
https://github.com/llvm/llvm-project/commit/dc27ff10497ea0c7d93de028badffa62707d8b6c
Author: Joseph Huber <huberjn at outlook.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M libc/test/src/__support/CMakeLists.txt
Log Message:
-----------
[libc] Disable freelist test on NVPTX temporarily
Summary:
This test fails due to alignment issues, it's likely that it's
misaligned on other targets too and they just don't crash on it.
@PiJoules maybe we should run this with ubsan?
Commit: 361543e4100defe14334cfb11481be44a977e627
https://github.com/llvm/llvm-project/commit/361543e4100defe14334cfb11481be44a977e627
Author: Jacob Lalonde <jalalonde at fb.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M lldb/source/Plugins/ObjectFile/Minidump/MinidumpFileBuilder.cpp
M lldb/source/Plugins/ObjectFile/Minidump/MinidumpFileBuilder.h
Log Message:
-----------
[LLDB][Minidump] Change expected directories to the correct type; size_t (#96564)
In #95312 I incorrectly set `m_expected_directories` to uint, this broke
the windows build and is the incorrect type.
`size_t` is more accurate because this value only ever represents the
expected upper bound of the directory vector.
Commit: a2d340ba161fe48ee4ff736c6e7877038a7388cd
https://github.com/llvm/llvm-project/commit/a2d340ba161fe48ee4ff736c6e7877038a7388cd
Author: David Truby <david.truby at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/include/clang/Driver/Driver.h
M clang/include/clang/Driver/Options.td
M clang/lib/Driver/Driver.cpp
R flang/test/Driver/print-resource-dir.F90
Log Message:
-----------
Revert "[Flang][Driver] Add -print-resource-dir command line flag to emit Flang's resource directory" (#96557)
Reverts llvm/llvm-project#90886
These changes broke linking to compiler-rt on Windows
Commit: 954b692bd74b4e7571bb4a8045f4b488d504a6ba
https://github.com/llvm/llvm-project/commit/954b692bd74b4e7571bb4a8045f4b488d504a6ba
Author: David Truby <david.truby at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M flang/lib/Evaluate/intrinsics.cpp
M flang/unittests/Evaluate/intrinsics.cpp
Log Message:
-----------
[flang] Allow derf as alternate spelling for erf (#95784)
This patch adds derf as an alternate spelling for the erf intrinsic.
This spelling is supported by multiple other compilers and used by WRF.
Commit: 8ad32ce7382bb8d630a3d4d93d1e1f00f8e2bd0e
https://github.com/llvm/llvm-project/commit/8ad32ce7382bb8d630a3d4d93d1e1f00f8e2bd0e
Author: Shengchen Kan <shengchen.kan at intel.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/include/clang/Driver/Options.td
M clang/lib/Basic/Targets/X86.cpp
M clang/lib/Basic/Targets/X86.h
M clang/lib/Driver/ToolChains/Arch/X86.cpp
M clang/test/Driver/x86-target-features.c
M clang/test/Preprocessor/x86_target_features.c
M llvm/include/llvm/TargetParser/X86TargetParser.def
M llvm/lib/Target/X86/X86.td
M llvm/lib/TargetParser/X86TargetParser.cpp
Log Message:
-----------
[X86] Add sub-feature zu (zero upper) for APX
This is a follow-up patch for #74199
Commit: 7ea63b9db4198688873036f3b0b81f9124076f7a
https://github.com/llvm/llvm-project/commit/7ea63b9db4198688873036f3b0b81f9124076f7a
Author: Weining Lu <luweining at loongson.cn>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/CodeGen/LoongArch/psabi-restricted-scheduling.ll
Log Message:
-----------
[LoongArch][test] Remove the FIXME in psabi-restricted-scheduling.ll which has been addressed by #76555
Commit: b0e9b00ce7d623175c5e60e82afe24e7f8a200be
https://github.com/llvm/llvm-project/commit/b0e9b00ce7d623175c5e60e82afe24e7f8a200be
Author: weiwei chen <weiwei.chen at modular.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
A llvm/test/CodeGen/NVPTX/mma-no-sink-after-laneid-check.ll
Log Message:
-----------
[NVPTX] Make nvptx mma instructions convergent. (#96521)
We are running into NVPTX backend generating wrong code for an input:
```
%0 = llvm.nvvm.mma.m?n?k?.row.col.??? (...)
if laneid == 0:
ret
else:
store %0
```
The backend reorder the instruction (as an effect of `MachineSink` pass)
to
```
if laneid == 0:
ret
else:
%0 = llvm.nvvm.mma.m?n?k?.row.col.??? (...)
store %0
```
This is incorrect because `mma` is a warp instruction which needs all
threads to sync before performing the operation instead of being guarded
by a specific thread id. It should be similar as the shuffle instruction
`shfl` in terms of warp level sync, and `shfl` is marked as
`isConvergent = true`.
Apply `isConvergent = true` to `mma` instructions.
Commit: 4e0a0eae58f7a6998866719f7eb970096a2a52e9
https://github.com/llvm/llvm-project/commit/4e0a0eae58f7a6998866719f7eb970096a2a52e9
Author: Shengchen Kan <shengchen.kan at intel.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/TargetParser/X86TargetParser.def
Log Message:
-----------
[X86] Fix test Clang::CodeGen/builtin-cpu-supports.c failure
The test failed after
https://github.com/llvm/llvm-project/commit/8ad32ce7382bb8d630a3d4d93d1e1f00f8e2bd0e
In https://github.com/gcc-mirror/gcc/blob/master/gcc/common/config/i386/i386-cpuinfo.h
FEATURE_AVX512CD = 23 and FEATURE_AVX512VBMI = 26, we should only add 2
features between them. New features should be inserted at the end.
Commit: 9267f8f19a2e502ef5a216c0d52b352b3699d399
https://github.com/llvm/llvm-project/commit/9267f8f19a2e502ef5a216c0d52b352b3699d399
Author: sstwcw <su3e8a96kzlver at posteo.net>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/docs/ClangFormatStyleOptions.rst
M clang/docs/ReleaseNotes.rst
M clang/include/clang/Format/Format.h
M clang/lib/Format/Format.cpp
M clang/lib/Format/UnwrappedLineFormatter.cpp
M clang/unittests/Format/ConfigParseTest.cpp
M clang/unittests/Format/FormatTest.cpp
Log Message:
-----------
[clang-format] Add option to remove leading blank lines (#91221)
The options regarding which blank lines are kept are also aggregated.
The new option is `KeepEmptyLines`.
Commit: 437366b668cebefbf3cb7bd354fa4bfb5d90ccbb
https://github.com/llvm/llvm-project/commit/437366b668cebefbf3cb7bd354fa4bfb5d90ccbb
Author: Alexander Smarus <bender at readdle.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M clang/lib/CodeGen/CMakeLists.txt
M clang/lib/Tooling/Inclusions/Stdlib/CMakeLists.txt
Log Message:
-----------
Adjust MSVC version range for ARM64 build performance regression (#90731)
This is follow up for #65215
Mentioned regression was fixed in MSVC 19.39 (VS 17.9.0), so it makes
sense to not apply fix for that (and newer) compiler versions.
Same as original change, this patch is narrowly scoped to not affect any
other compiler.
Commit: c393121778d877661f6b50cc3b3c582ac1654437
https://github.com/llvm/llvm-project/commit/c393121778d877661f6b50cc3b3c582ac1654437
Author: Louis Dionne <ldionne.2 at gmail.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M libcxx/test/CMakeLists.txt
M libcxx/test/configs/llvm-libc++-shared-clangcl.cfg.in
M libcxx/test/configs/llvm-libc++-shared-no-vcruntime-clangcl.cfg.in
M libcxx/test/configs/llvm-libc++-static-clangcl.cfg.in
Log Message:
-----------
[libc++] Remove Windows-specific configuration from libcxx/test/CMakeLists.txt (#96330)
This is essentially a revert of 9853e9b which tried removing duplication
in the Windows config files by moving it to the CMake. However, we want
to decouple the CMake and the test suite as much as possible, so
encoding additional (non-official) Lit parameters in the CMake only as a
code reuse mechanism is not an approach we want to take.
Commit: 8348d720ef913b0ff92b468be2eb9f4ea273cb5a
https://github.com/llvm/llvm-project/commit/8348d720ef913b0ff92b468be2eb9f4ea273cb5a
Author: Clement Courbet <courbet at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang-tools-extra/clang-tidy/utils/DeclRefExprUtils.cpp
M clang-tools-extra/docs/ReleaseNotes.rst
M clang-tools-extra/test/clang-tidy/checkers/performance/unnecessary-copy-initialization.cpp
Log Message:
-----------
[clang-tidy] Fix assert in performance-unnecessary-copy-init. (#96506)
`GetDirectCallee` can be null.
Fixes #96498.
Commit: bd488c1247715f8d698b9b26611ff7f869c824cc
https://github.com/llvm/llvm-project/commit/bd488c1247715f8d698b9b26611ff7f869c824cc
Author: Craig Topper <craig.topper at sifive.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/lib/TargetParser/RISCVISAInfo.cpp
Log Message:
-----------
[RISCV] Use StringRef::starts_with(char) instead of starts_with(StringRef). NFC
Commit: 8599629d390270b21c97f8b7ad2d97a344211d05
https://github.com/llvm/llvm-project/commit/8599629d390270b21c97f8b7ad2d97a344211d05
Author: paperchalice <liujunchang97 at outlook.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/CodeGen/MachinePostDominators.h
M llvm/include/llvm/Passes/MachinePassRegistry.def
M llvm/lib/CodeGen/MachinePostDominators.cpp
M llvm/lib/Passes/PassBuilder.cpp
Log Message:
-----------
[CodeGen][NewPM] Port machine post dominator tree analysis to new pass manager (#96378)
Follows #95879.
Commit: 62d44fbd50e28a6cfe479fc07164b8c970f049c6
https://github.com/llvm/llvm-project/commit/62d44fbd50e28a6cfe479fc07164b8c970f049c6
Author: Fangrui Song <i at maskray.me>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
M llvm/lib/MC/MCDwarf.cpp
Log Message:
-----------
[MC] Remove setUseAssemblerInfoForParsing(false) workarounds
This reverts commit 245491a9f384e4c53421196533c2a2b693efaf8d ("[MC] Disable MCAssembler based constant folding for DwarfDebug")
and cb09b5f3d53e5b7b4452bb3db78dca79fc9b3f17 ("[MC] Disable MCAssembler based constant folding for compact unwind and emitJumpTableEntry").
Checking the relative order of FA and FB is now faster due to
de19f7b6d46f1c38e10e604154f0fdaaffde9ebd ("[MC] Replace fragment ilist with singly-linked lists").
Commit: f1e0657d144f5a3cfef4b625d0f875f4dacd21d1
https://github.com/llvm/llvm-project/commit/f1e0657d144f5a3cfef4b625d0f875f4dacd21d1
Author: Matthias Springer <me at m-sp.org>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M mlir/docs/DialectConversion.md
M mlir/include/mlir/Transforms/DialectConversion.h
M mlir/lib/Transforms/Utils/DialectConversion.cpp
M mlir/test/Transforms/test-legalize-type-conversion.mlir
Log Message:
-----------
[mlir][Transforms] Dialect conversion: Simplify handling of dropped arguments (#96207)
This commit simplifies the handling of dropped arguments and updates
some dialect conversion documentation that is outdated.
When converting a block signature, a `BlockTypeConversionRewrite` object
and potentially multiple `ReplaceBlockArgRewrite` are created. During
the "commit" phase, uses of the old block arguments are replaced with
the new block arguments, but the old implementation was written in an
inconsistent way: some block arguments were replaced in
`BlockTypeConversionRewrite::commit` and some were replaced in
`ReplaceBlockArgRewrite::commit`. The new
`BlockTypeConversionRewrite::commit` implementation is much simpler and
no longer modifies any IR; that is done only in `ReplaceBlockArgRewrite`
now. The `ConvertedArgInfo` data structure is no longer needed.
To that end, materializations of dropped arguments are now built in
`applySignatureConversion` instead of `materializeLiveConversions`; the
latter function no longer has to deal with dropped arguments.
Other minor improvements:
- Improve variable name: `origOutputType` -> `origArgType`. Add an
assertion to check that this field is only used for argument
materializations.
- Add more comments to `applySignatureConversion`.
Note: Error messages around failed materializations for dropped basic
block arguments changed slightly. That is because those materializations
are now built in `legalizeUnresolvedMaterialization` instead of
`legalizeConvertedArgumentTypes`.
This commit is in preparation of decoupling argument/source/target
materializations from the dialect conversion.
Commit: 41f8e6d3ea57315e619ab9f71e1663095f4ed59d
https://github.com/llvm/llvm-project/commit/41f8e6d3ea57315e619ab9f71e1663095f4ed59d
Author: Craig Topper <craig.topper at sifive.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/test/CodeGen/RISCV/GlobalISel/instruction-select/fp-arith.mir
Log Message:
-----------
[RISCV][GISel] Fix test case order in fp-arith.mir. NFC
The fadd_f64 test was in the middle of some f32 tests.
Commit: 4c91b49bab0728d4bc136aa33c4aeb4e8ea37d01
https://github.com/llvm/llvm-project/commit/4c91b49bab0728d4bc136aa33c4aeb4e8ea37d01
Author: Jie Fu <jiefu at tencent.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang-tools-extra/clangd/Format.cpp
Log Message:
-----------
[clangd] Fix the build broken (NFC)
/llvm-project/clang-tools-extra/clangd/Format.cpp:284:11:
error: no member named 'KeepEmptyLinesAtTheStartOfBlocks' in 'clang::format::FormatStyle'
Style.KeepEmptyLinesAtTheStartOfBlocks = true;
~~~~~ ^
1 error generated.
Commit: efa8463ab90147aacd4647eb7715763978235890
https://github.com/llvm/llvm-project/commit/efa8463ab90147aacd4647eb7715763978235890
Author: David Green <david.green at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Vectorize/VectorCombine.cpp
M llvm/test/Transforms/PhaseOrdering/AArch64/interleavevectorization.ll
M llvm/test/Transforms/VectorCombine/AArch64/shuffletoidentity-concat.ll
M llvm/test/Transforms/VectorCombine/AArch64/shuffletoidentity.ll
Log Message:
-----------
[VectorCombine] Add free concats to shuffleToIdentity. (#94954)
This is another relatively small adjustment to shuffleToIdentity, which
has had a few knock-one effects to need a few more changes. It attempts
to detect free concats, that will be legalized to multiple vector
operations. For example if the lanes are '[a[0], a[1], b[0], b[1]]' and
a and b are v2f64 under aarch64.
In order to do this:
- isFreeConcat detects whether the input has piece-wise identities from
multiple inputs that can become a concat.
- A tree of concat shuffles is created to concatenate the input values
into a single vector. This is a little different to most other inputs as
there are created from multiple values that are being combined together,
and we cannot rely on the Lane0 insert location always being valid.
- The insert location is changed to the original location instead of
updating per item, which ensure it is valid due to the order that we
visit and create items.
Commit: f01958137f358f4fd3e71636b187db14a37a5853
https://github.com/llvm/llvm-project/commit/f01958137f358f4fd3e71636b187db14a37a5853
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/ADT/SmallPtrSet.h
M llvm/unittests/ADT/SmallPtrSetTest.cpp
Log Message:
-----------
[SmallPtrSet] Add remove_if() method (#96468)
Add remove_if() method, similar to the one already present on SetVector.
It is intended to replace the following pattern:
for (Foo *Ptr : Set)
if (Pred(Ptr))
Set.erase(Ptr);
With:
Set.remove_if(Pred);
This pattern is commonly used for set intersection, where `Pred` is
something like `!OtherSet.contains(Ptr)`.
The implementation provided here is a bit more efficient than the naive
loop, because it does not require looking up the bucket during the
erase() operation again.
However, my actual motivation for this is to have a way to perform this
operation without relying on the current `std::set`-style guarantee that
erase() does not invalidate iterators. I'd like to stop making use of
tombstones in the small regime, which will make insertion operations a
good bit more efficient. However, this will invalidate iterators during
erase().
Commit: fa20184a8f336e4154f2ffeeeb8a538dc9462d9a
https://github.com/llvm/llvm-project/commit/fa20184a8f336e4154f2ffeeeb8a538dc9462d9a
Author: Chuanqi Xu <yedeng.yd at linux.alibaba.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/lib/Serialization/ASTWriter.cpp
A clang/test/Modules/no-external-identifier-id.cppm
A clang/test/Modules/no-external-type-id.cppm
Log Message:
-----------
[C++20] [Modules] [Serialization] Don't reuse type ID and identifier ID from imported modules
To support no-transitive-change model for named modules, we can't reuse
type ID and identifier ID from imported modules arbitrarily. Since the
theory for no-transitive-change model is,
for a user of a named module, the user can only access the
indirectly imported decls via the directly imported module. So that it is
possible to control what matters to the users when writing the module.
And it will be unsafe to do so if the users can reuse the type IDs and
identifier IDs from the indirectly imported modules not via the directly
imported modules.
So in this patch, we don't reuse the type ID and identifier ID in the
AST writer to avoid the problematic case.
Commit: 8153773b23032177546944ec2524dce131b8a46e
https://github.com/llvm/llvm-project/commit/8153773b23032177546944ec2524dce131b8a46e
Author: Timm Bäder <tbaeder at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/lib/AST/Interp/Pointer.cpp
M clang/test/AST/Interp/literals.cpp
Log Message:
-----------
[clang][Interp] Fix returning primitive non-blockpointers
We can't deref() them, so return false here.
Commit: 174f80c6030f9bc96df6ae8daeb4d6bce3f36fbb
https://github.com/llvm/llvm-project/commit/174f80c6030f9bc96df6ae8daeb4d6bce3f36fbb
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/Support/GenericDomTreeConstruction.h
Log Message:
-----------
[DomTree] Avoid duplicate hash lookups in runDFS() (NFCI) (#96460)
runDFS() currently performs three hash table lookups. One in the main
loop, one when checking whether a successor has already been visited and
another when adding parent and reverse children to the successor.
We can avoid the two additional lookups by making the parent number part
of the stack, and then making the parent / reverse children update part
of the main loop.
The main loop already has a check for already visited nodes, so we don't
have to check this in advance -- we can simply push the node to the
worklist and skip it later.
Commit: 01fb5290eb86f056e6ba8757011273bcb7f4f2ee
https://github.com/llvm/llvm-project/commit/01fb5290eb86f056e6ba8757011273bcb7f4f2ee
Author: Oleksandr "Alex" Zinenko <git at ozinenko.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M utils/bazel/llvm-project-overlay/mlir/BUILD.bazel
Log Message:
-----------
Fix bazel after 18cf1cd92b554ba0b870c6a2223ea4d0d3c6dd21
mlir-config.h is included but not listed in dependencies
Commit: 7f1a74429dfd62a410d4b51d2e75d3677429a51a
https://github.com/llvm/llvm-project/commit/7f1a74429dfd62a410d4b51d2e75d3677429a51a
Author: Björn Pettersson <bjorn.a.pettersson at ericsson.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/CodeGen/MachineSSAUpdater.cpp
A llvm/test/CodeGen/AArch64/taildup-ssa-update-pr62712.mir
Log Message:
-----------
[TailDup][MachineSSAUpdater] Let RewriteUse insert a COPY when needed (#95553)
When running early-tailduplication we've seen problems with machine
verifier errors due to register class mismatches after doing the machine
SSA updates.
Typical scenario is that there is a PHI node and another instruction
that is using the same vreg:
%othervreg:otherclass = PHI %vreg:origclass, %bb
MInstr %vreg:origclass
but then after TailDuplicator::tailDuplicateAndUpdate we get
%othervreg:otherclass = PHI %vreg:origclass, %bb, ...
MInstr %othervreg:otherclass
Such rewrites are only valid if 'otherclass' is equal to (or a subclass
of) 'origclass'.
The solution here is based on adding a COPY instruction to make sure we
satisfy constraints given by 'MInstr' in the example. So if 'otherclass'
isn't equal to (or a subclass of) 'origclass' we insert a copy after the
PHI like this:
%othervreg:otherclass = PHI %vreg:origclass, %bb, ...
%newvreg:origclass = COPY %othervreg:otherclass
MInstr %newvreg:origclass
A special case is when it is possible to constrain the register class
instead of inserting a COPY. We currently prefer to constrain the
register class instead of inserting a COPY, even if it is a bit unclear
if that always is better (considering register pressure for the
constrained class etc.).
Fixes: https://github.com/llvm/llvm-project/issues/62712
Commit: 11e12bd82b0358de053cfd0dc762e9b30948c8a1
https://github.com/llvm/llvm-project/commit/11e12bd82b0358de053cfd0dc762e9b30948c8a1
Author: Mitch Phillips <31459023+hctim at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M compiler-rt/lib/gwp_asan/guarded_pool_allocator.cpp
M compiler-rt/lib/gwp_asan/platform_specific/guarded_pool_allocator_fuchsia.cpp
M compiler-rt/lib/gwp_asan/platform_specific/guarded_pool_allocator_posix.cpp
M compiler-rt/lib/gwp_asan/utilities.h
Log Message:
-----------
[NFC] [GWP-ASan] Rename Check() to check() (#96605)
Change this function to be LLVM-style in name.
Commit: d78211912391b724a064ef017dbf239eff10380a
https://github.com/llvm/llvm-project/commit/d78211912391b724a064ef017dbf239eff10380a
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/IR/Analysis.h
Log Message:
-----------
[Analysis] Use SmallPtrSet::remove_if (NFC)
Commit: 2d84e0ffefda62da6193d339633dbb55654f7b5d
https://github.com/llvm/llvm-project/commit/2d84e0ffefda62da6193d339633dbb55654f7b5d
Author: Anton Sidorenko <anton.sidorenko at syntacore.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/RISCV/RISCV.td
M llvm/lib/Target/RISCV/RISCVProcessors.td
A llvm/lib/Target/RISCV/RISCVSchedSyntacoreSCR3.td
A llvm/test/tools/llvm-mca/RISCV/SyntacoreSCR/SCR3-ALU.s
A llvm/test/tools/llvm-mca/RISCV/SyntacoreSCR/SCR3-LSU.s
Log Message:
-----------
[RISCV] Add scheduling model for Syntacore SCR3 (#95427)
Syntacore SCR3 is a microcontroller-class processor core. Overview:
https://syntacore.com/products/scr3
Co-authored-by: Dmitrii Petrov <dmitrii.petrov at syntacore.com>
Commit: 8395f9cecd34af8a79c96e661e46a80d0d471fb1
https://github.com/llvm/llvm-project/commit/8395f9cecd34af8a79c96e661e46a80d0d471fb1
Author: Pavel Labath <pavel at labath.sk>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M lldb/source/Plugins/SymbolFile/DWARF/DWARFASTParserClang.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARF.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARF.h
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDebugMap.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDebugMap.h
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDwo.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDwo.h
Log Message:
-----------
[lldb/DWARF] Remove parsing recursion when searching for definition DIEs (#96484)
If ParseStructureLikeDIE (or ParseEnum) encountered a declaration DIE,
it would call FindDefinitionTypeForDIE. This returned a fully formed
type, which it achieved by recursing back into ParseStructureLikeDIE
with the definition DIE.
This obscured the control flow and caused us to repeat some work (e.g.
the UniqueDWARFASTTypeMap lookup), but it mostly worked until we tried
to delay the definition search in #90663. After this patch, the two
ParseStructureLikeDIE calls were no longer recursive, but rather the
second call happened as a part of the CompleteType() call. This opened
the door to inconsistencies, as the second ParseStructureLikeDIE call
was not aware it was called to process a definition die for an existing
type.
To make that possible, this patch removes the recusive type resolution
from this function, and leaves just the "find definition die"
functionality. After finding the definition DIE, we just go back to the
original ParseStructureLikeDIE call, and have it finish the parsing
process with the new DIE.
While this patch is motivated by the work on delaying the definition
searching, I believe it is also useful on its own.
Commit: f6aa50873463ebd9a459b7ccd4989460175a6e7f
https://github.com/llvm/llvm-project/commit/f6aa50873463ebd9a459b7ccd4989460175a6e7f
Author: Vyacheslav Levytskyy <vyacheslav.levytskyy at intel.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/SPIRV/SPIRVBuiltins.cpp
M llvm/lib/Target/SPIRV/SPIRVGlobalRegistry.cpp
M llvm/lib/Target/SPIRV/SPIRVGlobalRegistry.h
M llvm/lib/Target/SPIRV/SPIRVInstructionSelector.cpp
M llvm/lib/Target/SPIRV/SPIRVUtils.cpp
A llvm/test/CodeGen/SPIRV/var-uniform-const.ll
Log Message:
-----------
[SPIR-V]: Fix creation of constants of array types in SPIRV Backend (#96514)
This PR fixes https://github.com/llvm/llvm-project/issues/96513.
The way of creation of array type constant was incorrect: instead of
creating [1, 1, 1] or [1, 1, 1, 1, 1, ....] constants, the same [1]
constant was always created, substituting original composite constants.
This in its turn led to a situation when only one of constants might
exist in the code without emitting invalid code, the second constant
would be eventually rewritten to the first constant, because a key to
address both was an array of a single element (like [1]).
This PR fixes the issue and purges from the code unneeded copy/pasted
clone of the function that creates an array constant.
Commit: 0bd9c49a295829ed44e838c4d54cc905662a1afa
https://github.com/llvm/llvm-project/commit/0bd9c49a295829ed44e838c4d54cc905662a1afa
Author: Lukacma <Marian.Lukac at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp
A llvm/test/Transforms/InstCombine/AArch64/sve-intrinsic-comb-no-active-lanes-loads.ll
Log Message:
-----------
[AArch64][SVE] optimisation for SVE load intrinsics with no active lanes (#95269)
This patch extends #73964 and adds optimisation of load SVE intrinsics
when predicate is zero.
Commit: 919b1ecafc010379eff88368b050068223a01f99
https://github.com/llvm/llvm-project/commit/919b1ecafc010379eff88368b050068223a01f99
Author: Abid Qadeer <haqadeer at amd.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M flang/lib/Optimizer/Transforms/DebugTypeGenerator.cpp
A flang/test/Integration/debug-ptr-type.f90
A flang/test/Transforms/debug-ptr-type.fir
Log Message:
-----------
[flang][debug] Support pointer type. (#96153)
The handling of `PointerType` is similar to `HeapType`. The only
difference is that allocated flag is generated for `HeapType` and
associated flag for `PointerType`. The tests for pointer to allocatable
strings are disabled for now. I will enable them once #95906 is merged.
The debugging in GDB looks like this:
integer, pointer :: par2(:)
integer, target, allocatable :: ar2(:)
integer, target :: sc
integer, pointer :: psc
allocate(ar2(4))
par2 => ar2
psc => sc
19 par2 => ar2
(gdb) p par2
$3 = <not associated>
(gdb) n
20 do i=1,5
(gdb) p par2
$4 = (0, 0, 0, 0)
(gdb) ptype par2
type = integer (4)
(gdb) p sc
$5 = 3
(gdb) p psc
$6 = (PTR TO -> ( integer )) 0x7fffffffda24
(gdb) p *psc
$7 = 3
Commit: 5feb32ba929f9e517c530217cabb09d1d734a763
https://github.com/llvm/llvm-project/commit/5feb32ba929f9e517c530217cabb09d1d734a763
Author: Vikram Hegde <115221833+vikramRH at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/lib/CodeGen/CGBuiltin.cpp
M clang/test/CodeGenOpenCL/builtins-amdgcn.cl
M llvm/docs/AMDGPUUsage.rst
M llvm/include/llvm/IR/IntrinsicsAMDGPU.td
M llvm/lib/Target/AMDGPU/AMDGPUAtomicOptimizer.cpp
M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp
M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.h
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/lib/Target/AMDGPU/SIInstructions.td
M llvm/lib/Target/AMDGPU/VOP1Instructions.td
M llvm/lib/Target/AMDGPU/VOP2Instructions.td
M llvm/test/Analysis/UniformityAnalysis/AMDGPU/intrinsics.ll
M llvm/test/CodeGen/AMDGPU/GlobalISel/atomic_optimizations_mul_one.ll
M llvm/test/CodeGen/AMDGPU/atomic_optimization_split_dt_update.ll
A llvm/test/CodeGen/AMDGPU/convergence-laneops.ll
M llvm/test/CodeGen/AMDGPU/global-atomic-scan.ll
M llvm/test/CodeGen/AMDGPU/global_atomic_optimizer_fp_rtn.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_iterative_scan.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_iterative_scan_fp.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_optimizer_fp_no_rtn.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readfirstlane.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readfirstlane.ptr.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readlane.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readlane.ptr.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.writelane.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.writelane.ptr.ll
M llvm/test/Transforms/InstCombine/AMDGPU/amdgcn-intrinsics.ll
Log Message:
-----------
[AMDGPU] Extend readlane, writelane and readfirstlane intrinsic lowering for generic types (#89217)
This patch is intended to be the first of a series with end goal to
adapt atomic optimizer pass to support i64 and f64 operations (along
with removing all unnecessary bitcasts). This legalizes 64 bit readlane,
writelane and readfirstlane ops pre-ISel
---------
Co-authored-by: vikramRH <vikhegde at amd.com>
Commit: f9795f34a6b358bdf02389ba01765924a2ad0b9f
https://github.com/llvm/llvm-project/commit/f9795f34a6b358bdf02389ba01765924a2ad0b9f
Author: Serge Pavlov <sepavloff at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/CodeGen/GlobalISel/MachineIRBuilder.h
M llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp
M llvm/unittests/CodeGen/GlobalISel/MachineIRBuilderTest.cpp
Log Message:
-----------
[GlobalISel] Add build methods for FP environment intrinsics (#96607)
This change adds methods like buildGetFPEnv and similar for opcodes that
represent manipulation on floating-point state.
Commit: 16d02cd1c5df57ae317b115333cf75b347c90d1d
https://github.com/llvm/llvm-project/commit/16d02cd1c5df57ae317b115333cf75b347c90d1d
Author: Nikolas Klauser <nikolasklauser at berlin.de>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/include/__type_traits/is_nothrow_destructible.h
M libcxx/test/std/utilities/meta/meta.unary/meta.unary.prop/is_nothrow_destructible.pass.cpp
Log Message:
-----------
[libc++] Use __is_nothrow_destructible (#95766)
This changes the behaviour in C++03 mode because we'll now use the
builtin on Clang, but I don't think that's much of a problem.
Commit: 29f4a0561a03f4760a876e39d443c64ea7ee9e22
https://github.com/llvm/llvm-project/commit/29f4a0561a03f4760a876e39d443c64ea7ee9e22
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/ADT/SetOperations.h
Log Message:
-----------
[SetOperations] clang-format header (NFC)
This header used three-space indentation in a number of places.
Reformat it completely.
Commit: f09b02484b865e4e6e2243ee2ff59d70b60d28e4
https://github.com/llvm/llvm-project/commit/f09b02484b865e4e6e2243ee2ff59d70b60d28e4
Author: Haojian Wu <hokein.wu at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/lib/Sema/SemaTemplate.cpp
Log Message:
-----------
[clang] Remove a stale FIXME
This FIXME has already been addressed in #89358
Commit: 9d45077df92a64ff8db5b249ac2f2d45e82ab73c
https://github.com/llvm/llvm-project/commit/9d45077df92a64ff8db5b249ac2f2d45e82ab73c
Author: Florian Hahn <flo at fhahn.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Vectorize/LoopVectorizationPlanner.h
M llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
M llvm/lib/Transforms/Vectorize/VPlan.h
Log Message:
-----------
[VPlan] Iterate over VPlans to get VFs to compute cost for (NFCI).
Instead for iterating over all VFs when computing costs, simply iterate
over the VFs available in the created VPlans.
Split off from https://github.com/llvm/llvm-project/pull/92555.
This also prepares for moving the check if any vector instructions will
be generated to be based on VPlan, to unblock recommitting
https://github.com/llvm/llvm-project/pull/92555.
Commit: eeb0884e6696ec618feb2181a432d10f66d4e840
https://github.com/llvm/llvm-project/commit/eeb0884e6696ec618feb2181a432d10f66d4e840
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Utils/LoopUnrollRuntime.cpp
M llvm/test/Transforms/LoopUnroll/ARM/multi-blocks.ll
M llvm/test/Transforms/LoopUnroll/runtime-exit-phi-scev-invalidation.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop-at-most-two-exits.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop-multiple-exits.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop5.ll
M llvm/test/Transforms/LoopUnroll/runtime-multiexit-heuristic.ll
M llvm/test/Transforms/LoopUnroll/runtime-unroll-remainder.ll
M llvm/test/Transforms/LoopUnroll/tripcount-overflow.ll
M llvm/test/Transforms/LoopUnrollAndJam/unroll-and-jam.ll
M llvm/test/Transforms/LoopVectorize/X86/float-induction-x86.ll
Log Message:
-----------
[LoopUnroll] Use poison instead of undef for preheader value
Commit: a2e915704ffb040e544b13bf565ecad3aa661d98
https://github.com/llvm/llvm-project/commit/a2e915704ffb040e544b13bf565ecad3aa661d98
Author: Florian Hahn <flo at fhahn.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/Transforms/LoopVectorize/create-induction-resume.ll
Log Message:
-----------
[LV] Make create-induction-resume.ll more robust by adding store.
Without the store, the vector loop body is empty. Add a store to avoid
that, while not impacting the induction resume values that are created.
Commit: 37c736e0359770b6a2d2139d84f9df400f4b1049
https://github.com/llvm/llvm-project/commit/37c736e0359770b6a2d2139d84f9df400f4b1049
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Utils/LoopUnrollRuntime.cpp
M llvm/test/DebugInfo/unrolled-loop-remainder.ll
M llvm/test/Transforms/LoopUnroll/runtime-exit-phi-scev-invalidation.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop-multiple-exits.ll
M llvm/test/Transforms/LoopUnroll/tripcount-overflow.ll
Log Message:
-----------
[LoopUnroll] Use poison instead of undef for another preheader value
Commit: cc8fdd60c2f791dba70c9e458173640d7da836e3
https://github.com/llvm/llvm-project/commit/cc8fdd60c2f791dba70c9e458173640d7da836e3
Author: Andrei Safronov <safronov at espressif.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/Xtensa/XtensaAsmPrinter.cpp
M llvm/lib/Target/Xtensa/XtensaAsmPrinter.h
M llvm/lib/Target/Xtensa/XtensaISelLowering.cpp
M llvm/lib/Target/Xtensa/XtensaISelLowering.h
M llvm/lib/Target/Xtensa/XtensaInstrInfo.td
M llvm/lib/Target/Xtensa/XtensaOperators.td
A llvm/test/CodeGen/Xtensa/blockaddress.ll
A llvm/test/CodeGen/Xtensa/brcc.ll
A llvm/test/CodeGen/Xtensa/indirectbr.ll
A llvm/test/CodeGen/Xtensa/jumpt.ll
Log Message:
-----------
[Xtensa] Lower GlobalAddress/BlockAddress/JumpTable (#95256)
This patch implements lowering of the GlobalAddress, BlockAddress,
JumpTable and BR_JT. Also patch adds legal support of the BR_CC
operation for i32 type.
Commit: 4acc8eed94e3f4e21bcf1dedca021cb6a086720c
https://github.com/llvm/llvm-project/commit/4acc8eed94e3f4e21bcf1dedca021cb6a086720c
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/Transforms/SCCP/2009-01-14-IPSCCP-Invoke.ll
M llvm/test/Transforms/SCCP/apint-ipsccp2.ll
M llvm/test/Transforms/SCCP/ipsccp-clear-returned.ll
M llvm/test/Transforms/SCCP/ipsccp-noundef.ll
M llvm/test/Transforms/SCCP/ipsccp-preserve-analysis.ll
M llvm/test/Transforms/SCCP/range-and-ip.ll
M llvm/test/Transforms/SCCP/solve-after-each-resolving-undefs-for-function.ll
M llvm/test/Transforms/SCCP/struct-arg-resolve-undefs.ll
Log Message:
-----------
[SCCP] Generate test checks (NFC)
Some of these are just old, while others previously did not use
UTC due to missing features that have since been implemented
(such as signature matching).
Commit: 16bb8c16aab32e2ee623a2b64d976548be247180
https://github.com/llvm/llvm-project/commit/16bb8c16aab32e2ee623a2b64d976548be247180
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/IPO/SCCP.cpp
M llvm/test/Transforms/FunctionSpecialization/track-return.ll
M llvm/test/Transforms/SCCP/2009-01-14-IPSCCP-Invoke.ll
M llvm/test/Transforms/SCCP/apint-ipsccp2.ll
M llvm/test/Transforms/SCCP/arg-count-mismatch.ll
M llvm/test/Transforms/SCCP/indirectbr.ll
M llvm/test/Transforms/SCCP/ip-constant-ranges.ll
M llvm/test/Transforms/SCCP/ip-ranges-phis.ll
M llvm/test/Transforms/SCCP/ip-ranges-select.ll
M llvm/test/Transforms/SCCP/ipsccp-basic.ll
M llvm/test/Transforms/SCCP/ipsccp-clear-returned.ll
M llvm/test/Transforms/SCCP/ipsccp-noundef.ll
M llvm/test/Transforms/SCCP/ipsccp-preserve-analysis.ll
M llvm/test/Transforms/SCCP/issue59602-assume-like-call-users.ll
M llvm/test/Transforms/SCCP/metadata.ll
M llvm/test/Transforms/SCCP/musttail-call.ll
M llvm/test/Transforms/SCCP/range-and-ip.ll
M llvm/test/Transforms/SCCP/range-attribute.ll
M llvm/test/Transforms/SCCP/remove-call-inst.ll
M llvm/test/Transforms/SCCP/resolvedundefsin-tracked-fn.ll
M llvm/test/Transforms/SCCP/return-constant.ll
M llvm/test/Transforms/SCCP/solve-after-each-resolving-undefs-for-function.ll
M llvm/test/Transforms/SCCP/struct-arg-resolve-undefs.ll
Log Message:
-----------
[SCCP] Use poison instead of undef when zapping returns
Commit: 35eef9f97f077a7cfa2beb9df857aa1713e4faa6
https://github.com/llvm/llvm-project/commit/35eef9f97f077a7cfa2beb9df857aa1713e4faa6
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Scalar/Reassociate.cpp
Log Message:
-----------
[Reassociate] Use poison instead of undef for dummy operands (NFCI)
These will be replaced later.
Commit: 65f9601fb1479507820ca8381bb472ec4e8a2eee
https://github.com/llvm/llvm-project/commit/65f9601fb1479507820ca8381bb472ec4e8a2eee
Author: Daniil Kovalev <dkovalev at accesssoftek.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M lld/ELF/InputFiles.cpp
M lld/ELF/InputFiles.h
Log Message:
-----------
[NFC][lld][ELF] Remove unused `sec` param of `ObjFile<ELFT>::getRelocTarget` (#96500)
Commit: bfad8757159a3def7c07d4f5ef23ed76fafdf441
https://github.com/llvm/llvm-project/commit/bfad8757159a3def7c07d4f5ef23ed76fafdf441
Author: hev <wangrui at loongson.cn>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/LoongArch/LoongArchISelLowering.cpp
M llvm/lib/Target/LoongArch/LoongArchISelLowering.h
M llvm/lib/Target/LoongArch/LoongArchInstrInfo.td
M llvm/test/CodeGen/LoongArch/machinelicm-address-pseudos.ll
M llvm/test/CodeGen/LoongArch/psabi-restricted-scheduling.ll
Log Message:
-----------
[LoongArch] Ensure PseudoLA* can be hoisted (#94723)
Since we mark the pseudos as mayLoad but do not provide any MMOs,
isSafeToMove conservatively returns false, stopping MachineLICM from
hoisting the instructions. PseudoLA_TLS_{LD,GD} does not actually expand
to a load, so stop marking that as mayLoad to allow it to be hoisted,
and for the others make sure to add MMOs during lowering to indicate
they're GOT loads and thus can be freely moved.
Commit: 9952e00af6c93e9807c3dfe076e99e7f5a77f675
https://github.com/llvm/llvm-project/commit/9952e00af6c93e9807c3dfe076e99e7f5a77f675
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/unittests/Transforms/Vectorize/VPlanTest.cpp
Log Message:
-----------
[VPlanTest] Use poison instead of undef for dummy values (NFC)
Commit: 68efc506f58776fdfcfc60707f8fe52584736d8a
https://github.com/llvm/llvm-project/commit/68efc506f58776fdfcfc60707f8fe52584736d8a
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/unittests/IR/VectorBuilderTest.cpp
Log Message:
-----------
[VectorBuilderTest] Use poison instead of undef for dummy values (NFC)
Commit: 9acb533c38be833ec1d8daa06e127a9de8f0a5ef
https://github.com/llvm/llvm-project/commit/9acb533c38be833ec1d8daa06e127a9de8f0a5ef
Author: Alex Voicu <alexandru.voicu at amd.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/include/clang/Basic/Cuda.h
M clang/lib/Basic/Cuda.cpp
M clang/lib/Basic/Targets/NVPTX.cpp
M clang/lib/CodeGen/CGOpenMPRuntimeGPU.cpp
M clang/lib/CodeGen/CodeGenModule.cpp
M clang/lib/Driver/Driver.cpp
M clang/lib/Driver/ToolChains/Clang.cpp
M clang/lib/Driver/ToolChains/HIPAMD.cpp
M clang/lib/Driver/ToolChains/HIPAMD.h
M clang/test/Driver/cuda-arch-translation.cu
M clang/test/Frontend/embed-bitcode.ll
M clang/test/Misc/target-invalid-cpu-note.c
M llvm/lib/Bitcode/Writer/BitcodeWriter.cpp
Log Message:
-----------
[clang][Driver] Add HIPAMD Driver support for AMDGCN flavoured SPIR-V (#95061)
This patch augments the HIPAMD driver to allow it to target AMDGCN
flavoured SPIR-V compilation. It's mostly straightforward, as we re-use
some of the existing SPIRV infra, however there are a few notable
additions:
- we introduce an `amdgcnspirv` offload arch, rather than relying on
using `generic` (this is already fairly overloaded) or simply using
`spirv` or `spirv64` (we'll want to use these to denote unflavoured
SPIRV, once we bring up that capability)
- initially it is won't be possible to mix-in SPIR-V and concrete AMDGPU
targets, as it would require some relatively intrusive surgery in the
HIPAMD Toolchain and the Driver to deal with two triples
(`spirv64-amd-amdhsa` and `amdgcn-amd-amdhsa`, respectively)
- in order to retain user provided compiler flags and have them
available at JIT time, we rely on embedding the command line via
`-fembed-bitcode=marker`, which the bitcode writer had previously not
implemented for SPIRV; we only allow it conditionally for AMDGCN
flavoured SPIRV, and it is handled correctly by the Translator (it ends
up as a string literal)
Once the SPIRV BE is no longer experimental we'll switch to using that
rather than the translator. There's some additional work that'll come
via a separate PR around correctly piping through AMDGCN's
implementation of `printf`, for now we merely handle its flags
correctly.
Commit: a46a2c2b7d8daf68c793b6b96f1c5bc009aea62b
https://github.com/llvm/llvm-project/commit/a46a2c2b7d8daf68c793b6b96f1c5bc009aea62b
Author: Simon Pilgrim <llvm-dev at redking.me.uk>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/X86/X86ISelLowering.cpp
M llvm/lib/Target/X86/X86TargetTransformInfo.cpp
M llvm/test/Analysis/CostModel/X86/arith-int-codesize.ll
M llvm/test/Analysis/CostModel/X86/arith-int-latency.ll
M llvm/test/Analysis/CostModel/X86/arith-int-sizelatency.ll
M llvm/test/Analysis/CostModel/X86/arith-int.ll
M llvm/test/Analysis/CostModel/X86/rem-codesize.ll
M llvm/test/Analysis/CostModel/X86/rem-latency.ll
M llvm/test/Analysis/CostModel/X86/rem-sizelatency.ll
M llvm/test/Analysis/CostModel/X86/rem.ll
M llvm/test/Analysis/CostModel/X86/slm-arith-costs.ll
M llvm/test/CodeGen/X86/avx2-arith.ll
M llvm/test/CodeGen/X86/midpoint-int-vec-128.ll
M llvm/test/CodeGen/X86/midpoint-int-vec-256.ll
M llvm/test/CodeGen/X86/min-legal-vector-width.ll
M llvm/test/CodeGen/X86/pmul.ll
M llvm/test/CodeGen/X86/prefer-avx256-wide-mul.ll
M llvm/test/CodeGen/X86/vector-shuffle-combining-sse41.ll
Log Message:
-----------
[X86] Lower vXi8 multiplies using PMADDUBSW on SSSE3+ targets (#95690)
Extends https://github.com/llvm/llvm-project/pull/95403 to handle non-constant cases - we can avoid unpacks/extensions from vXi8 to vXi16 by using PMADDUBSW instead and truncating the vXi16 results back together.
Most targets benefit from performing this for non-constant cases - its just Intel Core/SandyBridge era CPUs that might experience additional Port0/15 contention (but lower instruction count).
Fixes https://github.com/llvm/llvm-project/issues/90748
Commit: 5ed5d723db58f7556782427444642d0571cdf649
https://github.com/llvm/llvm-project/commit/5ed5d723db58f7556782427444642d0571cdf649
Author: Benjamin Maxwell <benjamin.maxwell at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M mlir/lib/Dialect/ArmSME/Transforms/VectorLegalization.cpp
M mlir/test/Dialect/ArmSME/vector-legalization.mlir
M mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/multi-tile-matmul.mlir
Log Message:
-----------
[mlir][ArmSME] Lower multi-tile stores to a single loop (#96187)
This adds a new pattern that can legalize a multi-tile transfer_write as
a single store loop. This is done as part of type decomposition as at
this level we know each tile write is disjoint, but that information is
lost after decomposition (without analysis to reconstruct it).
Example (pseudo-MLIR):
```
vector.transfer_write %vector, %dest[%y, %x], %mask
: vector<[16]x[8]xi16>, memref<?x?xi16>
```
Is rewritten to:
```
scf.for %slice_idx = %c0 to %c8_vscale step %c1 {
%upper_slice_mask = vector.extract %mask[%slice_idx] ─┐
: vector<[8]xi1> from vector<[16]x[8]xi1> |
%upper_slice = vector.extract %upper_tile[%slice_idx] |- Store upper tile
: vector<[8]xi16> from vector<[8]x[8]xi16> |
vector.transfer_write %upper_slice, |
%dest[%slice_idx + %y, %x], %upper_slice_mask |
: vector<[8]xi16>, memref<?x?xi16> ┘
%lower_slice_idx = %slice_idx + %c8_vscale ─┐
%lower_slice_mask = vector.extract %mask[%lower_slice_idx] |
: vector<[8]xi1> from vector<[16]x[8]xi1> |
%lower_slice = vector.extract %lower_tile[%slice_idx] |- Store lower
: vector<[8]xi16> from vector<[8]x[8]xi16> | tile
vector.transfer_write %lower_slice, |
%dest[%lower_slice_idx + %y, %x], %lower_slice_mask |
: vector<[8]xi16>, memref<?x?xi16> ┘
}
```
Commit: 6c9916d0d8a40034c5e0dfc157f146855e42520e
https://github.com/llvm/llvm-project/commit/6c9916d0d8a40034c5e0dfc157f146855e42520e
Author: Joseph Huber <huberjn at outlook.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libc/cmake/modules/prepare_libc_gpu_build.cmake
M llvm/runtimes/CMakeLists.txt
Log Message:
-----------
[libc] Configure CMAKE_REQUIRED_FLAGS so the GPU can use flag checks (#95424)
Summary:
This patch adds `CMAKE_REQUIRED_FLAGS` for the GPU build so checks like
`check_cxx_compiler_flags` work as expected. This is required because we
need to hack around the potential lack of `nvlink` and `ptxas` for NVPTX
targets and the fact that the AMDGPU target needs `-nogpulib` to avoid
errors on lack of ROCm. This makes a few of the checks pass and also
allows us to just check `-mcpu=native` for architecture detection
instead of finding the tools manually.
Commit: 0f111ba790a28056d42a0ae376371d7b694454ae
https://github.com/llvm/llvm-project/commit/0f111ba790a28056d42a0ae376371d7b694454ae
Author: Ramkumar Ramachandra <ramkumar.ramachandra at codasip.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/Analysis/LoopInfo.h
M llvm/lib/Analysis/LoopAccessAnalysis.cpp
M llvm/lib/Analysis/LoopInfo.cpp
M llvm/lib/Transforms/Scalar/LoopDistribute.cpp
M llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
A llvm/test/Analysis/LoopAccessAnalysis/debug-loc.ll
M llvm/test/Analysis/LoopAccessAnalysis/print-order.ll
M llvm/test/Analysis/LoopAccessAnalysis/scalable-vector-regression-tests.ll
A llvm/test/Transforms/LoopDistribute/debug-loc.ll
A llvm/test/Transforms/LoopDistribute/debug-print.ll
M llvm/test/Transforms/LoopVectorize/ARM/mve-hoist-runtime-checks.ll
M llvm/test/Transforms/LoopVectorize/runtime-checks-hoist.ll
Log Message:
-----------
LoopInfo: introduce Loop::getLocStr; unify debug output (#93051)
Introduce a Loop::getLocStr stolen from LoopVectorize's static function
getDebugLocString in order to have uniform debug output headers across
LoopVectorize, LoopAccessAnalysis, and LoopDistribute. The motivation
for this change is to have UpdateTestChecks recognize the headers and
automatically generate CHECK lines for debug output, with minimal
special-casing.
Commit: 8263bec5331863113c6163afdc3f93e063f20a07
https://github.com/llvm/llvm-project/commit/8263bec5331863113c6163afdc3f93e063f20a07
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
Log Message:
-----------
[SLP] Use poison instead of undef in reorderScalars() (#96619)
-1 mask elements are specified to return poison rather than undef
nowadays , so update the reorderScalars() implementation to match.
Commit: 952bdaaf79c1e5d7364160b21de0cd1295cdfbd8
https://github.com/llvm/llvm-project/commit/952bdaaf79c1e5d7364160b21de0cd1295cdfbd8
Author: Leandro Lupori <leandro.lupori at linaro.org>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M flang/include/flang/Lower/AbstractConverter.h
M flang/lib/Lower/Bridge.cpp
M flang/lib/Lower/OpenMP/ClauseProcessor.cpp
A flang/test/Lower/OpenMP/copyprivate2.f90
Log Message:
-----------
[flang][OpenMP] Fix copyprivate allocatable/pointer lowering (#95975)
The lowering of copyprivate clauses with allocatable or pointer
variables was incorrect. This happened because the values passed to
copyVar() are always wrapped in SymbolBox::Intrinsic, which
resulted in allocatable/pointer variables being handled as regular
ones.
This is fixed by providing to copyVar() the attributes of the
variables being copied, to make it possible to detect and handle
allocatable/pointer variables correctly.
Fixes #95801
Commit: c43664931378c589091fb169a6d3840474f5c324
https://github.com/llvm/llvm-project/commit/c43664931378c589091fb169a6d3840474f5c324
Author: Sander de Smalen <sander.desmalen at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AArch64/AArch64ISelDAGToDAG.cpp
M llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
M llvm/lib/Target/AArch64/AArch64InstrInfo.cpp
M llvm/lib/Target/AArch64/AArch64Subtarget.h
M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp
Log Message:
-----------
[AArch64] Remove all instances of the 'hasSVEorSME' interfaces. (#96543)
I've not added any new tests for these, because the original conditions
were wrong (they did not consider streaming mode) and we have tests for
the positive cases.
Commit: c5aa983f91f0543b43926a32d2f70bb53f9945a2
https://github.com/llvm/llvm-project/commit/c5aa983f91f0543b43926a32d2f70bb53f9945a2
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Analysis/InstructionSimplify.cpp
M llvm/test/Transforms/InstCombine/shift.ll
M llvm/test/Transforms/InstSimplify/phi.ll
M llvm/test/Transforms/LoopDeletion/update-scev.ll
Log Message:
-----------
[InstSimplify] Fold all poison phi to poison instead of undef
Commit: 4780dc3d7fd63016ebb871544287fedc83b1006a
https://github.com/llvm/llvm-project/commit/4780dc3d7fd63016ebb871544287fedc83b1006a
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/Transforms/InstCombine/phi.ll
Log Message:
-----------
[InstCombine] Add poison variant to phi test (NFC)
And rename an argument to avoid an upper/lowercase clash.
Commit: abc8c4be3bb70755328051c99e1b196663140c66
https://github.com/llvm/llvm-project/commit/abc8c4be3bb70755328051c99e1b196663140c66
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/Transforms/LoopVectorize/uniform-blend.ll
Log Message:
-----------
[LoopVectorize] Generate test checks (NFC)
Commit: edbe698eadaefca31564e3207074c8ed4b10bc12
https://github.com/llvm/llvm-project/commit/edbe698eadaefca31564e3207074c8ed4b10bc12
Author: OverMighty <its.overmighty at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libc/config/linux/aarch64/entrypoints.txt
M libc/config/linux/x86_64/entrypoints.txt
M libc/docs/math/index.rst
M libc/spec/stdc.td
M libc/src/__support/FPUtil/CMakeLists.txt
M libc/src/__support/FPUtil/dyadic_float.h
M libc/src/__support/FPUtil/generic/CMakeLists.txt
A libc/src/__support/FPUtil/generic/div.h
M libc/src/math/CMakeLists.txt
A libc/src/math/f16divf.h
M libc/src/math/generic/CMakeLists.txt
A libc/src/math/generic/f16divf.cpp
M libc/test/src/math/CMakeLists.txt
A libc/test/src/math/DivTest.h
A libc/test/src/math/f16divf_test.cpp
M libc/test/src/math/smoke/CMakeLists.txt
A libc/test/src/math/smoke/DivTest.h
A libc/test/src/math/smoke/f16divf_test.cpp
M libc/utils/MPFRWrapper/MPFRUtils.cpp
M libc/utils/MPFRWrapper/MPFRUtils.h
Log Message:
-----------
[libc][math][c23] Add f16divf C23 math function (#96131)
Part of #93566.
Commit: eabe091cdba859890fd169bd51924f34fd754bbf
https://github.com/llvm/llvm-project/commit/eabe091cdba859890fd169bd51924f34fd754bbf
Author: Benjamin Kramer <benny.kra at googlemail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M utils/bazel/llvm-project-overlay/mlir/BUILD.bazel
Log Message:
-----------
[bazel] Add missing dependency for 5ed5d723db58f7556782427444642d0571cdf649
Commit: 70cfece24d6cbb57e35dd961cc97eb2a6bf1e387
https://github.com/llvm/llvm-project/commit/70cfece24d6cbb57e35dd961cc97eb2a6bf1e387
Author: sstwcw <su3e8a96kzlver at posteo.net>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/docs/ClangFormatStyleOptions.rst
M clang/docs/ReleaseNotes.rst
M clang/include/clang/Format/Format.h
M clang/lib/Format/Format.cpp
M clang/lib/Format/UnwrappedLineFormatter.cpp
M clang/unittests/Format/ConfigParseTest.cpp
M clang/unittests/Format/FormatTest.cpp
Log Message:
-----------
Revert "[clang-format] Add option to remove leading blank lines (#91221)"
This reverts commit 9267f8f19a2e502ef5a216c0d52b352b3699d399.
I changed a formatter option. I forgot to update other components that
depend on the formatter when the option name changed.
Commit: c69ea04fb9738db283263eb350669e00b77ee4fd
https://github.com/llvm/llvm-project/commit/c69ea04fb9738db283263eb350669e00b77ee4fd
Author: Jie Fu <jiefu at tencent.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang-tools-extra/clangd/Format.cpp
Log Message:
-----------
Revert "[clangd] Fix the build broken (NFC)"
This reverts commit 4c91b49bab0728d4bc136aa33c4aeb4e8ea37d01 after 70cfece24d6cbb57e35dd961cc97eb2a6bf1e387.
Commit: fd62906ddb252298f6ed63fe85e146d477acdaed
https://github.com/llvm/llvm-project/commit/fd62906ddb252298f6ed63fe85e146d477acdaed
Author: Louis Dionne <ldionne.2 at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/include/__numeric/gcd_lcm.h
M libcxx/test/std/numerics/numeric.ops/numeric.ops.lcm/lcm.pass.cpp
Log Message:
-----------
[libc++] Fix incorrect overflow checking in std::lcm (#96310)
We should have been using __builtin_mul_overflow from the start instead
of adding a manual (and error-prone) check for overflow.
Fixes #96196
Commit: 44c9a26756192228fd9dcd33c238ec8858d40329
https://github.com/llvm/llvm-project/commit/44c9a26756192228fd9dcd33c238ec8858d40329
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/Transforms/Mem2Reg/preserve-nonnull-load-metadata.ll
Log Message:
-----------
[Mem2Reg] Add tests with !noundef (NFC)
Commit: c9f083a9940d1d62f77c39f05bb0fc186cc4832c
https://github.com/llvm/llvm-project/commit/c9f083a9940d1d62f77c39f05bb0fc186cc4832c
Author: Shilei Tian <i at tianshilei.me>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/include/clang/Basic/BuiltinsAMDGPU.def
M clang/lib/CodeGen/CGBuiltin.cpp
A clang/test/CodeGenOpenCL/builtins-amdgcn-raw-buffer-store.cl
A clang/test/SemaOpenCL/builtins-amdgcn-raw-buffer-store-error.cl
Log Message:
-----------
[Clang][AMDGPU] Add builtins for instrinsic `llvm.amdgcn.raw.ptr.buffer.store` (#94576)
Depends on https://github.com/llvm/llvm-project/pull/96313.
Commit: 1462605ab0a121fc2f652f178c7c13dc0179f10e
https://github.com/llvm/llvm-project/commit/1462605ab0a121fc2f652f178c7c13dc0179f10e
Author: Kazu Hirata <kazu at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/Analysis/CFGPrinter.h
M llvm/lib/Analysis/CallGraph.cpp
M llvm/lib/Analysis/ConstraintSystem.cpp
M llvm/lib/Analysis/LoopAccessAnalysis.cpp
M llvm/lib/Analysis/PHITransAddr.cpp
M llvm/lib/Analysis/ScalarEvolution.cpp
M llvm/lib/Analysis/TargetTransformInfo.cpp
Log Message:
-----------
[Analysis] Use range-based for loops (NFC) (#96587)
Commit: 2dd4167a09cfe1a1e09852e99b4984ddb3e89cf7
https://github.com/llvm/llvm-project/commit/2dd4167a09cfe1a1e09852e99b4984ddb3e89cf7
Author: David Sherwood <david.sherwood at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp
M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.h
M llvm/test/Analysis/CostModel/AArch64/masked_ldst.ll
M llvm/test/Analysis/CostModel/AArch64/sve-gather.ll
M llvm/test/Analysis/CostModel/AArch64/sve-ldst.ll
M llvm/test/Analysis/CostModel/AArch64/sve-scatter.ll
M llvm/test/Transforms/LoopVectorize/AArch64/scalable-reductions.ll
M llvm/test/Transforms/LoopVectorize/AArch64/sve-illegal-type.ll
A llvm/test/Transforms/LoopVectorize/AArch64/sve-invalid-costs.ll
Log Message:
-----------
[LoopVectorize][AArch64] Add limited support for scalable vectorisation of i1 types (#95920)
Previously isElementTypeLegalForScalableVector returned false for i1
types, which also prevented vectorisation of loops with i1 reductions.
This is overkill - we only need to disable vectorisation for loads
and/or stores of i1 types. I've added i1 as a legal type, but changed
the cost model to return an invalid cost for loads and stores.
Commit: 54b61adc0cbefb7f923ef43c407704ba9f9d8b69
https://github.com/llvm/llvm-project/commit/54b61adc0cbefb7f923ef43c407704ba9f9d8b69
Author: Mike Rice <michael.p.rice at intel.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/lib/Lex/LiteralSupport.cpp
Log Message:
-----------
[NFC][clang] Replace unreachable code in literal processing with assert (#96579)
Address static verifier concerns about dead code in DoubleUnderscore
check. Replace it with an assert.
Commit: eb12d9b5cb6d15ce151a63ad32b4e0e5823ddb87
https://github.com/llvm/llvm-project/commit/eb12d9b5cb6d15ce151a63ad32b4e0e5823ddb87
Author: Louis Dionne <ldionne.2 at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/docs/ReleaseNotes/19.rst
M libcxx/include/__memory/allocator.h
M libcxx/include/__memory/shared_ptr.h
R libcxx/test/libcxx/containers/sequences/vector/const_T.compile.pass.cpp
R libcxx/test/libcxx/containers/sequences/vector/const_value_type.pass.cpp
M libcxx/test/libcxx/diagnostics/memory.nodiscard.verify.cpp
M libcxx/test/libcxx/memory/allocator_void.trivial.compile.pass.cpp
M libcxx/test/libcxx/memory/allocator_volatile.verify.cpp
M libcxx/test/std/concepts/concepts.lang/concept.default.init/default_initializable.compile.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.ctor.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.dtor.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.members/allocate.constexpr.size.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.members/allocate.size.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.deprecated_in_cxx17.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.deprecated_in_cxx23.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.removed_in_cxx20.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.removed_in_cxx26.verify.cpp
M libcxx/test/std/utilities/memory/specialized.algorithms/specialized.construct/construct_at.pass.cpp
Log Message:
-----------
[libc++] Remove the allocator<const T> extension (#96319)
This effort has quite a history:
- This was first attempted in 2022 via bed3240bf7d1, which broke
std::shared_ptr<T const> and caused the change to be reverted in
9138666f5464.
- We then re-attempted landing the change in 276ca87382b8 after fixing
std::shared_ptr, but reports were made that this broke code en masse
within Google. This led to the patch being reverted again in
a54d028895c9 with the goal to land this again with a migration path for
vendors.
This patch re-lands the removal while providing a migration path for
vendors by providing the `_LIBCPP_ENABLE_REMOVED_ALLOCATOR_CONST` macro.
This macro will be honored for the LLVM 19 release and will be removed
after that, at which point allocator<const T> will be removed
unconditionally.
Fixes #73665
Commit: d6c74102626ead8c4e2f7e638fdfea1fcd3d0bc7
https://github.com/llvm/llvm-project/commit/d6c74102626ead8c4e2f7e638fdfea1fcd3d0bc7
Author: Nicolai Hähnle <nicolai.haehnle at amd.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.cpp
Log Message:
-----------
AMDGPU: Remove an outdated TODO (#96446)
We have a fixed calling convention for stack pointer and frame pointer,
we shouldn't try to shift anything around.
Commit: 7e9b49f6b86c8616e6211ec02dbccc3ebb615e79
https://github.com/llvm/llvm-project/commit/7e9b49f6b86c8616e6211ec02dbccc3ebb615e79
Author: Nicolai Hähnle <nicolai.haehnle at amd.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPUArgumentUsageInfo.cpp
M llvm/lib/Target/AMDGPU/AMDGPUArgumentUsageInfo.h
M llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp
M llvm/lib/Target/AMDGPU/AMDGPUSubtarget.cpp
M llvm/lib/Target/AMDGPU/GCNSubtarget.h
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.cpp
M llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.h
Log Message:
-----------
AMDGPU: Add plumbing for private segment size argument (#96445)
The actual size of scratch/private is determined at dispatch time, so
add more plumbing to request it. Will be used in subsequent change.
Commit: 13ed349c44ca44f470e2e6181888f14b6b22a44a
https://github.com/llvm/llvm-project/commit/13ed349c44ca44f470e2e6181888f14b6b22a44a
Author: Ivan Kosarev <ivan.kosarev at amd.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp
M llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUMCExpr.cpp
M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUMCExpr.h
Log Message:
-----------
[AMDGPU][NFC] Rename AMDGPUVariadicMCExpr to AMDGPUMCExpr. (#96618)
Some of our custom expressions are not variadic and there seems to be
little benefit in mentioning the variadic nature of expression nodes in
the name anyway.
Commit: fb463e154e7fe1200b9d91f82fafa532331ce764
https://github.com/llvm/llvm-project/commit/fb463e154e7fe1200b9d91f82fafa532331ce764
Author: Aaron Ballman <aaron at aaronballman.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/www/c_status.html
Log Message:
-----------
[C23] Remove WG14 N2379 from the C status page
This paper was adopted at the Oct 2019 meeting in Ithaca, but the
changes from the paper were later removed (editorially) by the C
Floating Point study group due to being incorrect and unnecessary.
Commit: c6973ade4c46e28b1be9e1f5d873ca9a59197418
https://github.com/llvm/llvm-project/commit/c6973ade4c46e28b1be9e1f5d873ca9a59197418
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Utils/PromoteMemoryToRegister.cpp
M llvm/test/Transforms/Mem2Reg/preserve-nonnull-load-metadata.ll
Log Message:
-----------
[Mem2Reg] Generate non-terminator unreachable for !noundef undef (#96639)
When performing a load from uninitialized memory using !noundef, insert
a non-terminator unreachable instruction, which will be converted to a
proper unreachable by SimplifyCFG. This way we retain the fact that UB
occurred on this code path.
Commit: de528ffb17ebce96e0bc4dde1749146c41ca1d0d
https://github.com/llvm/llvm-project/commit/de528ffb17ebce96e0bc4dde1749146c41ca1d0d
Author: Mats Petersson <mats.petersson at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M flang/include/flang/Optimizer/Transforms/Passes.h
M flang/include/flang/Optimizer/Transforms/Passes.td
M flang/include/flang/Tools/CLOptions.inc
M flang/lib/Optimizer/Transforms/CMakeLists.txt
A flang/lib/Optimizer/Transforms/ConstantArgumentGlobalisation.cpp
M flang/test/Fir/boxproc.fir
M flang/test/Lower/character-local-variables.f90
A flang/test/Transforms/constant-argument-globalisation-2.fir
A flang/test/Transforms/constant-argument-globalisation.fir
Log Message:
-----------
[Flang] Extracting internal constants from scalar literals (#73829)
Constants actual arguments in function/subroutine calls are currently
lowered as allocas + store. This can sometimes inhibit LTO and the
constant will not be propagated to the called function. Particularly in
cases where the function/subroutine call happens inside a condition.
This patch changes the lowering of these constant actual arguments to a
global constant + fir.address_of_op. This lowering makes it easier for
LTO to propagate the constant.
The optimization must be enabled explicitly to run. Use -mmlir
--enable-constant-argument-globalisation to enable.
---------
Co-authored-by: Dmitriy Smirnov <dmitriy.smirnov at arm.com>
Commit: 2cf197506f59b6151c542faec594a7e1e97d9c64
https://github.com/llvm/llvm-project/commit/2cf197506f59b6151c542faec594a7e1e97d9c64
Author: harishch4 <harishcse44 at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M mlir/include/mlir/Dialect/OpenMP/OpenMPClauseOperands.h
M mlir/include/mlir/Dialect/OpenMP/OpenMPEnums.td
M mlir/include/mlir/Dialect/OpenMP/OpenMPOps.td
M mlir/lib/Dialect/OpenMP/IR/OpenMPDialect.cpp
M mlir/test/Dialect/OpenMP/invalid.mlir
M mlir/test/Dialect/OpenMP/ops.mlir
Log Message:
-----------
[MLIR][OpenMP]Add order-modifier support to Order clause (#93805)
This adds order-modifier (reproducible|unconstrained) support to Order
clause.
Commit: a4045299d3024a69746c4fd9517e27e5cef4785e
https://github.com/llvm/llvm-project/commit/a4045299d3024a69746c4fd9517e27e5cef4785e
Author: bwlodarcz <bertrand.wlodarczyk at intel.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/SPIRV/MCTargetDesc/SPIRVBaseInfo.cpp
M llvm/lib/Target/SPIRV/MCTargetDesc/SPIRVBaseInfo.h
M llvm/lib/Target/SPIRV/SPIRVBuiltins.td
Log Message:
-----------
[SPIRV] Add definitions for NonSemantic debug info (#95530)
This commit adds basic types and definitions for
NonSemantic.Shader.DebugInfo.100 standard for SPIRV.
Full implementation of the standard will allow SPIRV backend to emit
files with debug info included.
Link to standard:
https://github.com/KhronosGroup/SPIRV-Registry/blob/main/nonsemantic/NonSemantic.Shader.DebugInfo.100.html
Commit: ec9ce89a0823165e245be8fc095da8a88fb4cf79
https://github.com/llvm/llvm-project/commit/ec9ce89a0823165e245be8fc095da8a88fb4cf79
Author: David Sherwood <david.sherwood at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/Transforms/LoopVectorize/AArch64/sve-invalid-costs.ll
Log Message:
-----------
[LoopVectorize] Fix build issue caused by #95920 (#96647)
Commit: 2274c66e6faaaf29ad693b1ae3e5a7b0228a1950
https://github.com/llvm/llvm-project/commit/2274c66e6faaaf29ad693b1ae3e5a7b0228a1950
Author: Nikolas Klauser <nikolasklauser at berlin.de>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/include/__type_traits/conditional.h
Log Message:
-----------
[libc++] Use _If for conditional_t (#96193)
This avoids different instantiations when the if and else types are
different, resulting in reduced memory use by the compiler.
Commit: ed1273d4ddee160f847cbee28714c79353022a3c
https://github.com/llvm/llvm-project/commit/ed1273d4ddee160f847cbee28714c79353022a3c
Author: RichardLuo <CommAdama at outlook.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/modules/CMakeLists.txt.in
Log Message:
-----------
[libc++] change the visibility of libc++ header to public in libcxx module (#91240)
This PR addresses a problem that headers may not be able to be found if
`#include` is used with std modules.
Consider the following file:
#include <boost/json.hpp>
import std;
int main(int, const char **) { }
Boost will include something from libc++, but we are using -nostdinc++
at [1] so the compiler can not find any default std header. Therefore
the locally built header needs to be public.
[1]: https://github.com/RichardLuo0/llvm-project/blob/15fdd47c4b110b64dc61f636e42e0484bf8bdbe0/libcxx/modules/CMakeLists.txt.in#L52
Commit: bb5ab1ffe719f5e801ef08ac08be975546aa3266
https://github.com/llvm/llvm-project/commit/bb5ab1ffe719f5e801ef08ac08be975546aa3266
Author: shawbyoung <shawbyoung at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M bolt/docs/CommandLineArgumentReference.md
M bolt/lib/Profile/YAMLProfileReader.cpp
M bolt/lib/Rewrite/RewriteInstance.cpp
M bolt/lib/Utils/CommandLineOpts.cpp
A bolt/test/X86/hashing-based-function-matching.test
Log Message:
-----------
[𝘀𝗽𝗿] initial version
Created using spr 1.3.4
Commit: 020f69cd10a2ff1233cc28088989319e5a58b116
https://github.com/llvm/llvm-project/commit/020f69cd10a2ff1233cc28088989319e5a58b116
Author: shawbyoung <shawbyoung at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M bolt/lib/Rewrite/RewriteInstance.cpp
Log Message:
-----------
Added opts::Lite to RewriteInstance
Commit: c097e643efc2a2ee5170fbcdb5e8c644a7207452
https://github.com/llvm/llvm-project/commit/c097e643efc2a2ee5170fbcdb5e8c644a7207452
Author: shawbyoung <shawbyoung at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M bolt/lib/Rewrite/RewriteInstance.cpp
Log Message:
-----------
Revert "Added opts::Lite to RewriteInstance"
This reverts commit 020f69cd10a2ff1233cc28088989319e5a58b116.
Commit: 0e11a7e7179a2eb066e5f63f25b9277aa63f38eb
https://github.com/llvm/llvm-project/commit/0e11a7e7179a2eb066e5f63f25b9277aa63f38eb
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/Transforms/EarlyCSE/flags.ll
Log Message:
-----------
[EarlyCSE] Add test with noundef load of undef (NFC)
Commit: 79e8a5952366eacd92201a8d6472726fc14e00fd
https://github.com/llvm/llvm-project/commit/79e8a5952366eacd92201a8d6472726fc14e00fd
Author: Hui <hui.xie1990 at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/include/__memory/allocator_traits.h
M libcxx/include/deque
M libcxx/include/forward_list
M libcxx/include/list
M libcxx/include/map
M libcxx/include/set
M libcxx/include/string
M libcxx/include/unordered_map
M libcxx/include/unordered_set
M libcxx/include/vector
Log Message:
-----------
[libc++] Move allocator assertion into allocator_traits (#94750)
There is code duplication in all containers that static_assert the
allocator matches the allocator requirements in the spec. This check can
be moved into a more centralised place.
Commit: 7f10ed637e53c68ce62b756a3be8546a3dccf751
https://github.com/llvm/llvm-project/commit/7f10ed637e53c68ce62b756a3be8546a3dccf751
Author: Vitaly Buka <vitalybuka at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M compiler-rt/lib/tsan/rtl/tsan_interceptors_posix.cpp
Log Message:
-----------
[tsan] Fix dead lock when starting StackDepot thread (#96456)
Sometime tsan runtimes calls, like
`__tsan_mutex_create ()`, need to store a stack
in the StackDepot, and the Depot may need to start
and maintenance thread.
Example:
```
__sanitizer::FutexWait ()
__sanitizer::Semaphore::Wait ()
__sanitizer::Mutex::Lock ()
__tsan::SlotLock ()
__tsan::SlotLocker::SlotLocker ()
__tsan::Acquire ()
__tsan::CallUserSignalHandler ()
__tsan::ProcessPendingSignalsImpl ()
__tsan::ProcessPendingSignals ()
__tsan::ScopedInterceptor::~ScopedInterceptor ()
___interceptor_mmap ()
pthread_create ()
__sanitizer::internal_start_thread ()
__sanitizer::(anonymous namespace)::CompressThread::NewWorkNotify ()
__sanitizer::StackDepotNode::store ()
__sanitizer::StackDepotBase<__sanitizer::StackDepotNode, 1, 20>::Put ()
__tsan::CurrentStackId ()
__tsan::MutexCreate ()
__tsan_mutex_create ()
```
pthread_create() implementation may hit other
interceptors recursively, which may invoke
ProcessPendingSignals, which deadlocks.
Alternative solution could be block interceptors
closer to TSAN runtime API function, like
`__tsan_mutex_create`, or just before
`StackDepotPut``, but it's not needed for most
calls, only when new thread is created using
`real_pthread_create`.
I don't see a reasonable way to create a
regression test.
Commit: dddef9d1c9b7b4e8a96bd68d44cd3ed6dc85f758
https://github.com/llvm/llvm-project/commit/dddef9d1c9b7b4e8a96bd68d44cd3ed6dc85f758
Author: Craig Topper <craig.topper at sifive.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/RISCV/GISel/RISCVLegalizerInfo.cpp
M llvm/lib/Target/RISCV/GISel/RISCVRegisterBankInfo.cpp
A llvm/test/CodeGen/RISCV/GlobalISel/instruction-select/fp-arith-f16.mir
A llvm/test/CodeGen/RISCV/GlobalISel/legalizer/legalize-fp-arith-f16.mir
A llvm/test/CodeGen/RISCV/GlobalISel/regbankselect/fp-arith-f16.mir
Log Message:
-----------
[RISCV] Add FPR16 regbank and start legalizing f16 operations for Zfh. (#96582)
Commit: 8a46bbbc22a51db57f05beb0026772b899a785b9
https://github.com/llvm/llvm-project/commit/8a46bbbc22a51db57f05beb0026772b899a785b9
Author: Lukacma <Marian.Lukac at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/include/clang/Basic/DiagnosticSemaKinds.td
M clang/lib/Sema/SemaType.cpp
M clang/test/Sema/arm-vector-types-support.c
R clang/test/SemaCUDA/neon-attrs.cu
M clang/utils/TableGen/NeonEmitter.cpp
Log Message:
-----------
[Clang] Remove preprocessor guards and global feature checks for NEON (#95224)
To enable function multi-versioning (FMV), current checks which rely on
cmd line options or global macros to see if target feature is present
need to be removed. This patch removes those for NEON and also
implements changes to NEON header file as proposed in
[ACLE](https://github.com/ARM-software/acle/pull/321).
Commit: 8c11d3788c26cd85f102959068109d3e498e8498
https://github.com/llvm/llvm-project/commit/8c11d3788c26cd85f102959068109d3e498e8498
Author: Xiaoyang Liu <siujoeng.lau at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/include/__mdspan/extents.h
M libcxx/test/std/containers/views/mdspan/extents/ctad.pass.cpp
Log Message:
-----------
[libc++] P3029R1: Better `mdspan`'s CTAD - `std::extents` (#89015)
This patch implements an improvement introduced in P3029R1 that was
missed in #87873. It adds a deduction of static extents if
integral_constant-like constants are passed to `std::extents`.
Commit: 902952ae04afc2dfe28805b949a1e2218affe65e
https://github.com/llvm/llvm-project/commit/902952ae04afc2dfe28805b949a1e2218affe65e
Author: shawbyoung <shawbyoung at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M bolt/docs/CommandLineArgumentReference.md
M bolt/lib/Profile/YAMLProfileReader.cpp
M bolt/lib/Rewrite/RewriteInstance.cpp
M bolt/lib/Utils/CommandLineOpts.cpp
R bolt/test/X86/hashing-based-function-matching.test
Log Message:
-----------
Revert "[𝘀𝗽𝗿] initial version"
This reverts commit bb5ab1ffe719f5e801ef08ac08be975546aa3266.
Commit: 731db06a878f5c8cb29b36d526a54493677ea89f
https://github.com/llvm/llvm-project/commit/731db06a878f5c8cb29b36d526a54493677ea89f
Author: Nikolas Klauser <nikolasklauser at berlin.de>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/include/__atomic/atomic_init.h
M libcxx/src/barrier.cpp
M libcxx/src/filesystem/operations.cpp
M libcxx/src/filesystem/path.cpp
M libcxx/src/filesystem/path_parser.h
M libcxx/src/locale.cpp
M runtimes/cmake/Modules/WarningFlags.cmake
Log Message:
-----------
[libc++] Get the GCC build mostly clean of warnings (#96604)
The GCC build has gotten to the point where it's often hard to find the
actual error in the build log. We should look into enabling these
warnings again in the future, but it looks like a lot of them are
bogous.
Commit: 2604830aacdd563715da030d0396b565e912436f
https://github.com/llvm/llvm-project/commit/2604830aacdd563715da030d0396b565e912436f
Author: Akira Hatanaka <ahatanak at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/docs/LanguageExtensions.rst
M clang/include/clang/AST/Expr.h
M clang/include/clang/Basic/Builtins.td
M clang/include/clang/Basic/DiagnosticSemaKinds.td
M clang/include/clang/CodeGen/ModuleBuilder.h
M clang/lib/AST/ExprConstant.cpp
M clang/lib/CodeGen/CGBuiltin.cpp
M clang/lib/CodeGen/CGDebugInfo.cpp
M clang/lib/CodeGen/CGDebugInfo.h
M clang/lib/Sema/SemaChecking.cpp
A clang/test/CodeGenCXX/debug-info-verbose-trap.cpp
A clang/test/SemaCXX/verbose-trap.cpp
Log Message:
-----------
Add support for __builtin_verbose_trap (#79230)
The builtin causes the program to stop its execution abnormally and
shows a human-readable description of the reason for the termination
when a debugger is attached or in a symbolicated crash log.
The motivation for the builtin is explained in the following RFC:
https://discourse.llvm.org/t/rfc-adding-builtin-verbose-trap-string-literal/75845
clang's CodeGen lowers the builtin to `llvm.trap` and emits debugging
information that represents an artificial inline frame whose name
encodes the category and reason strings passed to the builtin.
Commit: bb075eeb892374a209953ad20e02c1324e272679
https://github.com/llvm/llvm-project/commit/bb075eeb892374a209953ad20e02c1324e272679
Author: Jakub Mazurkiewicz <mazkuba3 at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
A libcxx/test/std/containers/sequences/array/array.overview/nttp.equivalence.compile.pass.cpp
A libcxx/test/std/containers/sequences/array/array.overview/nttp.verify.cpp
A libcxx/test/std/utilities/utility/pairs/pairs.pair/nttp.equivalence.compile.pass.cpp
A libcxx/test/std/utilities/utility/pairs/pairs.pair/nttp.verify.cpp
Log Message:
-----------
[libc++] LWG3382: NTTP for `pair` and `array` (#85811)
Mark LWG3382 as "Nothing To Do" and add tests.
Commit: 889f3c5741e78ce90b4dcc643bb15d992225a67c
https://github.com/llvm/llvm-project/commit/889f3c5741e78ce90b4dcc643bb15d992225a67c
Author: Matt Arsenault <Matthew.Arsenault at amd.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPU.td
M llvm/lib/Target/AMDGPU/BUFInstructions.td
M llvm/lib/Target/AMDGPU/GCNSubtarget.h
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/test/CodeGen/AMDGPU/buffer-fat-pointer-atomicrmw-fadd.ll
Log Message:
-----------
AMDGPU: Handle legal v2bf16 atomicrmw fadd for gfx12 (#95930)
Annoyingly gfx90a/940 support this for global/flat but not buffer.
Commit: b7768c5485844306cd3c8afeef609ddf9d79696a
https://github.com/llvm/llvm-project/commit/b7768c5485844306cd3c8afeef609ddf9d79696a
Author: Timm Bäder <tbaeder at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/lib/AST/Interp/ByteCodeExprGen.cpp
Log Message:
-----------
[clang][Interp][NFC] Use delegate() to delegate to only initlist item
Commit: e951bd0f51f8b077296f09d9c60ddf150048042f
https://github.com/llvm/llvm-project/commit/e951bd0f51f8b077296f09d9c60ddf150048042f
Author: Vy Nguyen <vyng at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M lldb/include/lldb/API/SBDebugger.h
M lldb/include/lldb/Symbol/TypeSystem.h
M lldb/source/API/SBDebugger.cpp
M lldb/source/Symbol/TypeSystem.cpp
M lldb/tools/lldb-dap/DAP.cpp
M lldb/tools/lldb-dap/DAP.h
M lldb/tools/lldb-dap/lldb-dap.cpp
Log Message:
-----------
Reapply PR/87550 (again) (#95571)
New fixes:
- properly init the `std::optional<std::vector>` to an empty vector as
opposed to `{}` (which was effectively `std::nullopt`).
---------
Co-authored-by: Vy Nguyen <oontvoo at users.noreply.github.com>
Commit: dca49d739de07b1755ad65aa26dacd2e2c22af20
https://github.com/llvm/llvm-project/commit/dca49d739de07b1755ad65aa26dacd2e2c22af20
Author: Nick Desaulniers (paternity leave) <nickdesaulniers at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libc/config/linux/app.h
M libc/src/__support/threads/thread.h
M libc/startup/linux/do_start.cpp
Log Message:
-----------
[libc][arm32] define argc type and stack alignment (#96367)
https://github.com/ARM-software/abi-aa/blob/main/aapcs32/aapcs32.rst#6212stack-constraints-at-a-public-interface
mentions that the stack on ARM32 is double word aligned.
Remove confused comments around ArgcType. argc is always an int, passed on the
stack, so we need to store a pointer to it (regardless of ILP32 or LP64).
Commit: fb07afedbebb0c9f2647d02c6f254245cdb84085
https://github.com/llvm/llvm-project/commit/fb07afedbebb0c9f2647d02c6f254245cdb84085
Author: yonghong-song <yhs at fb.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/BPF/BPFRegisterInfo.cpp
Log Message:
-----------
[BPF] Avoid potential long compilation time without -g (#96575)
Alastair Robertson reported a huge compilation time increase without -g
for bpf target when comparing to x86 ([1]). In my setup, with '-O0', for
x86, a large basic block compilation takes 0.19s while bpf target takes
2.46s. The top function which contributes to the compile time is
eliminateFrameIndex().
Such long compilation time without -g is caused by commit
05de2e481811 ("[bpf] error when BPF stack size exceeds 512 bytes")
The compiler tries to get some debug loc by iterating all insns in the
basic block which will be used when compiler warns larger-than-512 stack
size. Even without -g, such iterating also happens which cause
unnecessary compile time increase.
To fix the issue, let us move the related code when the compiler is
about to warn stack limit violation. This fixed the compile time
regression, and on my system, the compile time is reduced from 2.46s to
0.35s.
[1] https://github.com/bpftrace/bpftrace/issues/3257
Co-authored-by: Yonghong Song <yonghong.song at linux.dev>
Commit: ac1e22f3053f761e4e2ef832b92de15876e68335
https://github.com/llvm/llvm-project/commit/ac1e22f3053f761e4e2ef832b92de15876e68335
Author: Stanley Winata <68087699+raikonenfnu at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp
M mlir/test/Dialect/Vector/fold-arith-extf-into-vector-contract.mlir
Log Message:
-----------
[mlir][vector] Generalize folding of ext-contractionOp to other types. (#96593)
Many state of the art models and quantization operations are now
directly working on vector.contract on integers.
This commit enables generalizes ext-contraction folding S.T we can emit
more performant vector.contracts on codegen pipelines.
Signed-off-by: Stanley Winata <stanley.winata at amd.com>
Commit: 580343d96f18e3c2fa8d7a8e25a175aaa2e20522
https://github.com/llvm/llvm-project/commit/580343d96f18e3c2fa8d7a8e25a175aaa2e20522
Author: Timm Bäder <tbaeder at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/lib/AST/Interp/Descriptor.cpp
Log Message:
-----------
[clang][Interp][NFC] Destroy InitMap when moving contents to DeadBlock
Commit: c9529f76017f0b517dca1e89eecdf6bbd97c3e84
https://github.com/llvm/llvm-project/commit/c9529f76017f0b517dca1e89eecdf6bbd97c3e84
Author: Max191 <44243577+Max191 at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M mlir/lib/Dialect/Tensor/IR/TensorOps.cpp
M mlir/test/Dialect/Tensor/fold-tensor-subset-ops-into-vector-transfers.mlir
M mlir/test/Dialect/Tensor/fold-tensor-subset-ops.mlir
Log Message:
-----------
[mlir] Drop outermost dims in slice rank reduction inference (#95020)
The `getDroppedDims` utility function does not follow the convention of
dropping outermost unit dimensions first when inferring a rank reduction
mask for a slice. This PR updates the implementation to match this
convention.
Commit: 05ca20744159a33349271a96b90959a0a3807133
https://github.com/llvm/llvm-project/commit/05ca20744159a33349271a96b90959a0a3807133
Author: Aaron Ballman <aaron at aaronballman.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/www/c_status.html
Log Message:
-----------
[C23] Update status page regarding FLT_MAX_EXP
N2843 was subsumed by N2882; we could probably consider removing
subsumed entries, but I've been leaving them to help folks looking at
the editor's report from various working drafts and wondering about the
changes.
Commit: d7dd778cde84110e38521a6b55dfeb4e1c649ec2
https://github.com/llvm/llvm-project/commit/d7dd778cde84110e38521a6b55dfeb4e1c649ec2
Author: PeterChou1 <peter.chou at mail.utoronto.ca>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang-tools-extra/clang-doc/tool/CMakeLists.txt
M clang-tools-extra/clang-doc/tool/ClangDocMain.cpp
Log Message:
-----------
[clang-doc] update install path to share/clang-doc instead of share/clang (#96555)
Updates the install path for clang-doc to share/clang-doc instead
share/clang to avoid confusion
Commit: dbd5c7805bae510c79b51c2c7700a590f2df446d
https://github.com/llvm/llvm-project/commit/dbd5c7805bae510c79b51c2c7700a590f2df446d
Author: PeterChou1 <peter.chou at mail.utoronto.ca>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang-tools-extra/test/clang-doc/Inputs/basic-project/src/Calculator.cpp
M clang-tools-extra/test/clang-doc/basic-project.test
Log Message:
-----------
[clang-doc] Remove stdexecpt from clang-doc test (#96552)
Removes stdexecpt from clang-doc test introduced in
https://github.com/llvm/llvm-project/pull/93928
since it violates the rule that test must be freestanding
Commit: f0f774ebf09b1f1ae8129074801342eeadf5495b
https://github.com/llvm/llvm-project/commit/f0f774ebf09b1f1ae8129074801342eeadf5495b
Author: Vitaly Buka <vitalybuka at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M compiler-rt/lib/asan/asan_interceptors.cpp
M compiler-rt/lib/hwasan/hwasan_interceptors.cpp
M compiler-rt/lib/lsan/lsan_interceptors.cpp
M compiler-rt/lib/memprof/memprof_interceptors.cpp
M compiler-rt/lib/msan/msan_interceptors.cpp
M compiler-rt/lib/sanitizer_common/sanitizer_common_libcdep.cpp
M compiler-rt/lib/sanitizer_common/sanitizer_linux.cpp
M compiler-rt/lib/sanitizer_common/sanitizer_posix.h
M compiler-rt/lib/tsan/rtl/tsan_interceptors_posix.cpp
Log Message:
-----------
[sanitizer] Rename DEFINE_REAL_PTHREAD_FUNCTIONS (#96527)
We use REAL() calls in interceptors, but
DEFINE_REAL_PTHREAD_FUNCTIONS has nothing to do
with them and only used for internal maintenance
threads.
This is done to avoid confusion like in #96456.
Commit: de7c1396f29b9bf7011912e7cfea9edad1efb492
https://github.com/llvm/llvm-project/commit/de7c1396f29b9bf7011912e7cfea9edad1efb492
Author: Han-Kuan Chen <hankuan.chen at sifive.com>
Date: 2024-06-26 (Wed, 26 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
Log Message:
-----------
[SLP] NFC. Refactor and add getAltInstrMask help function. (#94709)
Co-authored-by: Alexey Bataev <a.bataev at gmx.com>
Commit: aaf50bf34f3a2007221c644384d238666cfc2bc3
https://github.com/llvm/llvm-project/commit/aaf50bf34f3a2007221c644384d238666cfc2bc3
Author: Jay Foad <jay.foad at amd.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-smem.ll
Log Message:
-----------
[AMDGPU] Disallow negative s_load offsets in isLegalAddressingMode (#91327)
Commit: 5e2beed9a1643cd6358fb8b43feb893543d90bf3
https://github.com/llvm/llvm-project/commit/5e2beed9a1643cd6358fb8b43feb893543d90bf3
Author: Aaron Ballman <aaron at aaronballman.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/www/c_status.html
Log Message:
-----------
[C23] Move WG14 N2931 to the TS18661 section
This paper only matters for TS18661-3 integration.
Commit: 0b049ce64653a8cbdeedaa2bfe6cc79fc6af5d40
https://github.com/llvm/llvm-project/commit/0b049ce64653a8cbdeedaa2bfe6cc79fc6af5d40
Author: Vitaly Buka <vitalybuka at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M compiler-rt/lib/tsan/rtl/tsan_rtl.cpp
A compiler-rt/test/tsan/Darwin/fork_deadlock.cpp
R compiler-rt/test/tsan/Linux/fork_deadlock.cpp
Log Message:
-----------
[tsan] Test `__tsan_test_only_on_fork` only on Mac (#96597)
According to https://reviews.llvm.org/D114250
this was to handle Mac specific issue, however
the test is Linux only.
The test effectively prevents to lock main allocator
on fork, but we do that on Linux for other
sanitizers for years, and need to do the same
for TSAN to avoid deadlocks.
Commit: 4c87212d63c3b45e9371434239553ef007216106
https://github.com/llvm/llvm-project/commit/4c87212d63c3b45e9371434239553ef007216106
Author: Nick Desaulniers (paternity leave) <nickdesaulniers at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libc/src/__support/OSUtil/linux/arm/syscall.h
Log Message:
-----------
[libc][thumb] support syscalls from thumb mode (#96558)
r7 is reserved in thumb2 (typically for the frame pointer, as opposed to r11 in
ARM mode), so assigning to a variable with explicit register storage in r7 will
produce an error.
But r7 is where the Linux kernel expects the syscall number to be placed. We
can use a temporary to get the register allocator to pick a temporary, which we
save+restore the previous value of r7 in.
Fixes: #93738
Commit: cd2bac81a9ae961f8299198750a8910dc389491e
https://github.com/llvm/llvm-project/commit/cd2bac81a9ae961f8299198750a8910dc389491e
Author: Vitaly Buka <vitalybuka at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M compiler-rt/lib/tsan/rtl/tsan_mman.cpp
M compiler-rt/lib/tsan/rtl/tsan_mman.h
M compiler-rt/lib/tsan/rtl/tsan_rtl.cpp
Log Message:
-----------
[nfc][tsan] Better name for locking functions (#96598)
These functions used only for `fork`.
Unused parameter `child` will be used in followup patches.
Commit: 0258a60cd97c5d44b150a6c954f992b39a6c4f2f
https://github.com/llvm/llvm-project/commit/0258a60cd97c5d44b150a6c954f992b39a6c4f2f
Author: Vitaly Buka <vitalybuka at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M compiler-rt/lib/tsan/rtl/tsan_mman.cpp
Log Message:
-----------
[nfc][tsan] Clang format includes (#96599)
Commit: 557b5ad32da8d0e63fbd8cabaf836662dc751b15
https://github.com/llvm/llvm-project/commit/557b5ad32da8d0e63fbd8cabaf836662dc751b15
Author: Vitaly Buka <vitalybuka at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M .github/CODEOWNERS
M bolt/docs/CommandLineArgumentReference.md
M clang-tools-extra/clang-doc/tool/CMakeLists.txt
M clang-tools-extra/clang-doc/tool/ClangDocMain.cpp
M clang-tools-extra/clang-tidy/utils/DeclRefExprUtils.cpp
M clang-tools-extra/docs/ReleaseNotes.rst
M clang-tools-extra/test/clang-doc/Inputs/basic-project/src/Calculator.cpp
M clang-tools-extra/test/clang-doc/basic-project.test
M clang-tools-extra/test/clang-tidy/checkers/performance/unnecessary-copy-initialization.cpp
M clang/docs/LanguageExtensions.rst
M clang/include/clang/AST/Expr.h
M clang/include/clang/Basic/Builtins.td
M clang/include/clang/Basic/BuiltinsAMDGPU.def
M clang/include/clang/Basic/Cuda.h
M clang/include/clang/Basic/DiagnosticParseKinds.td
M clang/include/clang/Basic/DiagnosticSemaKinds.td
M clang/include/clang/Basic/LangOptions.h
M clang/include/clang/Basic/arm_sme.td
M clang/include/clang/Basic/arm_sve.td
M clang/include/clang/Basic/arm_sve_sme_incl.td
M clang/include/clang/CodeGen/ModuleBuilder.h
M clang/include/clang/Driver/Driver.h
M clang/include/clang/Driver/Options.td
M clang/lib/AST/ExprConstant.cpp
M clang/lib/AST/Interp/ByteCodeExprGen.cpp
M clang/lib/AST/Interp/Descriptor.cpp
M clang/lib/AST/Interp/Pointer.cpp
M clang/lib/Basic/Cuda.cpp
M clang/lib/Basic/Targets/NVPTX.cpp
M clang/lib/Basic/Targets/X86.cpp
M clang/lib/Basic/Targets/X86.h
M clang/lib/CodeGen/CGBuiltin.cpp
M clang/lib/CodeGen/CGDebugInfo.cpp
M clang/lib/CodeGen/CGDebugInfo.h
M clang/lib/CodeGen/CGOpenMPRuntimeGPU.cpp
M clang/lib/CodeGen/CMakeLists.txt
M clang/lib/CodeGen/CodeGenModule.cpp
M clang/lib/Driver/Driver.cpp
M clang/lib/Driver/ToolChains/Arch/X86.cpp
M clang/lib/Driver/ToolChains/Clang.cpp
M clang/lib/Driver/ToolChains/HIPAMD.cpp
M clang/lib/Driver/ToolChains/HIPAMD.h
M clang/lib/Driver/ToolChains/WebAssembly.cpp
M clang/lib/Lex/LiteralSupport.cpp
M clang/lib/Parse/ParseOpenMP.cpp
M clang/lib/Sema/SemaChecking.cpp
M clang/lib/Sema/SemaOpenMP.cpp
M clang/lib/Sema/SemaTemplate.cpp
M clang/lib/Sema/SemaType.cpp
M clang/lib/Serialization/ASTWriter.cpp
M clang/lib/Tooling/Inclusions/Stdlib/CMakeLists.txt
M clang/test/AST/Interp/literals.cpp
A clang/test/C/C2x/n2819.c
A clang/test/CodeGenCXX/debug-info-verbose-trap.cpp
A clang/test/CodeGenOpenCL/builtins-amdgcn-raw-buffer-store.cl
M clang/test/CodeGenOpenCL/builtins-amdgcn.cl
M clang/test/Driver/cuda-arch-translation.cu
M clang/test/Driver/x86-target-features.c
M clang/test/Frontend/embed-bitcode.ll
M clang/test/Misc/target-invalid-cpu-note.c
A clang/test/Modules/no-external-identifier-id.cppm
A clang/test/Modules/no-external-type-id.cppm
M clang/test/OpenMP/Inputs/nesting_of_regions.cpp
M clang/test/OpenMP/declare_variant_clauses_messages.cpp
M clang/test/Preprocessor/x86_target_features.c
M clang/test/Sema/aarch64-sve-intrinsics/acle_sve_bfloat.cpp
M clang/test/Sema/aarch64-sve2-intrinsics/acle_sve2_bfloat.cpp
M clang/test/Sema/arm-vector-types-support.c
R clang/test/SemaCUDA/neon-attrs.cu
A clang/test/SemaCXX/verbose-trap.cpp
A clang/test/SemaOpenCL/builtins-amdgcn-raw-buffer-store-error.cl
M clang/utils/TableGen/NeonEmitter.cpp
M clang/utils/TableGen/SveEmitter.cpp
M clang/www/c_status.html
M clang/www/get_involved.html
M compiler-rt/lib/gwp_asan/guarded_pool_allocator.cpp
M compiler-rt/lib/gwp_asan/platform_specific/guarded_pool_allocator_fuchsia.cpp
M compiler-rt/lib/gwp_asan/platform_specific/guarded_pool_allocator_posix.cpp
M compiler-rt/lib/gwp_asan/utilities.h
M flang/include/flang/Lower/AbstractConverter.h
M flang/include/flang/Optimizer/Transforms/Passes.h
M flang/include/flang/Optimizer/Transforms/Passes.td
M flang/include/flang/Tools/CLOptions.inc
M flang/lib/Evaluate/intrinsics.cpp
M flang/lib/Lower/Bridge.cpp
M flang/lib/Lower/OpenMP/ClauseProcessor.cpp
M flang/lib/Optimizer/Transforms/CMakeLists.txt
A flang/lib/Optimizer/Transforms/ConstantArgumentGlobalisation.cpp
M flang/lib/Optimizer/Transforms/DebugTypeGenerator.cpp
R flang/test/Driver/print-resource-dir.F90
M flang/test/Fir/boxproc.fir
A flang/test/Integration/debug-ptr-type.f90
A flang/test/Lower/OpenMP/copyprivate2.f90
M flang/test/Lower/character-local-variables.f90
A flang/test/Transforms/constant-argument-globalisation-2.fir
A flang/test/Transforms/constant-argument-globalisation.fir
A flang/test/Transforms/debug-ptr-type.fir
M flang/unittests/Evaluate/intrinsics.cpp
M libc/cmake/modules/prepare_libc_gpu_build.cmake
M libc/config/darwin/arm/entrypoints.txt
M libc/config/linux/aarch64/entrypoints.txt
M libc/config/linux/app.h
M libc/config/linux/arm/entrypoints.txt
M libc/config/linux/riscv/entrypoints.txt
M libc/config/linux/x86_64/entrypoints.txt
M libc/docs/math/index.rst
M libc/include/llvm-libc-macros/CMakeLists.txt
A libc/include/llvm-libc-macros/link-macros.h
M libc/spec/stdc.td
M libc/src/__support/FPUtil/CMakeLists.txt
M libc/src/__support/FPUtil/double_double.h
M libc/src/__support/FPUtil/dyadic_float.h
M libc/src/__support/FPUtil/generic/CMakeLists.txt
A libc/src/__support/FPUtil/generic/div.h
M libc/src/__support/OSUtil/linux/arm/syscall.h
M libc/src/__support/macros/optimization.h
M libc/src/__support/threads/thread.h
M libc/src/math/CMakeLists.txt
A libc/src/math/f16divf.h
M libc/src/math/generic/CMakeLists.txt
A libc/src/math/generic/f16divf.cpp
A libc/src/math/generic/range_reduction_double_common.h
A libc/src/math/generic/range_reduction_double_fma.h
A libc/src/math/generic/range_reduction_double_nofma.h
A libc/src/math/generic/sin.cpp
A libc/src/math/generic/sincos_eval.h
M libc/src/math/x86_64/CMakeLists.txt
R libc/src/math/x86_64/sin.cpp
M libc/startup/linux/CMakeLists.txt
M libc/startup/linux/do_start.cpp
M libc/test/src/__support/CMakeLists.txt
M libc/test/src/math/CMakeLists.txt
A libc/test/src/math/DivTest.h
A libc/test/src/math/f16divf_test.cpp
M libc/test/src/math/sin_test.cpp
M libc/test/src/math/smoke/CMakeLists.txt
A libc/test/src/math/smoke/DivTest.h
A libc/test/src/math/smoke/f16divf_test.cpp
A libc/test/src/math/smoke/sin_test.cpp
M libc/utils/MPFRWrapper/MPFRUtils.cpp
M libc/utils/MPFRWrapper/MPFRUtils.h
M libcxx/CMakeLists.txt
M libcxx/docs/ReleaseNotes/19.rst
M libcxx/include/__atomic/atomic_init.h
M libcxx/include/__mdspan/extents.h
M libcxx/include/__memory/allocator.h
M libcxx/include/__memory/allocator_traits.h
M libcxx/include/__memory/shared_ptr.h
M libcxx/include/__numeric/gcd_lcm.h
M libcxx/include/__type_traits/conditional.h
M libcxx/include/__type_traits/is_nothrow_destructible.h
M libcxx/include/deque
M libcxx/include/forward_list
M libcxx/include/list
M libcxx/include/map
M libcxx/include/set
M libcxx/include/string
M libcxx/include/unordered_map
M libcxx/include/unordered_set
M libcxx/include/vector
M libcxx/lib/abi/powerpc-ibm-aix.libcxxabi.v1.stable.exceptions.nonew.abilist
M libcxx/lib/abi/powerpc64-ibm-aix.libcxxabi.v1.stable.exceptions.nonew.abilist
M libcxx/lib/abi/x86_64-unknown-freebsd.libcxxabi.v1.stable.exceptions.nonew.abilist
M libcxx/lib/abi/x86_64-unknown-linux-gnu.libcxxabi.v1.stable.exceptions.nonew.abilist
M libcxx/lib/abi/x86_64-unknown-linux-gnu.libcxxabi.v1.stable.noexceptions.nonew.abilist
M libcxx/modules/CMakeLists.txt.in
M libcxx/src/barrier.cpp
M libcxx/src/filesystem/operations.cpp
M libcxx/src/filesystem/path.cpp
M libcxx/src/filesystem/path_parser.h
M libcxx/src/locale.cpp
M libcxx/test/CMakeLists.txt
M libcxx/test/configs/llvm-libc++-shared-clangcl.cfg.in
M libcxx/test/configs/llvm-libc++-shared-no-vcruntime-clangcl.cfg.in
M libcxx/test/configs/llvm-libc++-static-clangcl.cfg.in
R libcxx/test/libcxx/containers/sequences/vector/const_T.compile.pass.cpp
R libcxx/test/libcxx/containers/sequences/vector/const_value_type.pass.cpp
M libcxx/test/libcxx/diagnostics/memory.nodiscard.verify.cpp
M libcxx/test/libcxx/memory/allocator_void.trivial.compile.pass.cpp
M libcxx/test/libcxx/memory/allocator_volatile.verify.cpp
M libcxx/test/std/concepts/concepts.lang/concept.default.init/default_initializable.compile.pass.cpp
A libcxx/test/std/containers/sequences/array/array.overview/nttp.equivalence.compile.pass.cpp
A libcxx/test/std/containers/sequences/array/array.overview/nttp.verify.cpp
M libcxx/test/std/containers/views/mdspan/extents/ctad.pass.cpp
M libcxx/test/std/numerics/numeric.ops/numeric.ops.lcm/lcm.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.ctor.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.dtor.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.members/allocate.constexpr.size.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.members/allocate.size.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.deprecated_in_cxx17.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.deprecated_in_cxx23.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.removed_in_cxx20.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.removed_in_cxx26.verify.cpp
M libcxx/test/std/utilities/memory/specialized.algorithms/specialized.construct/construct_at.pass.cpp
M libcxx/test/std/utilities/meta/meta.unary/meta.unary.prop/is_nothrow_destructible.pass.cpp
A libcxx/test/std/utilities/utility/pairs/pairs.pair/nttp.equivalence.compile.pass.cpp
A libcxx/test/std/utilities/utility/pairs/pairs.pair/nttp.verify.cpp
M lld/ELF/InputFiles.cpp
M lld/ELF/InputFiles.h
M lldb/include/lldb/API/SBDebugger.h
M lldb/include/lldb/Symbol/TypeSystem.h
M lldb/source/API/SBDebugger.cpp
M lldb/source/Plugins/ObjectFile/Minidump/MinidumpFileBuilder.cpp
M lldb/source/Plugins/ObjectFile/Minidump/MinidumpFileBuilder.h
M lldb/source/Plugins/SymbolFile/DWARF/DWARFASTParserClang.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARF.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARF.h
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDebugMap.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDebugMap.h
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDwo.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDwo.h
M lldb/source/Symbol/TypeSystem.cpp
M lldb/test/API/python_api/find_in_memory/TestFindInMemory.py
M lldb/test/API/python_api/find_in_memory/address_ranges_helper.py
M lldb/test/API/python_api/find_in_memory/main.cpp
M lldb/tools/lldb-dap/DAP.cpp
M lldb/tools/lldb-dap/DAP.h
M lldb/tools/lldb-dap/lldb-dap.cpp
M llvm/docs/AMDGPUUsage.rst
M llvm/include/llvm/ADT/SetOperations.h
M llvm/include/llvm/ADT/SmallPtrSet.h
M llvm/include/llvm/Analysis/CFGPrinter.h
M llvm/include/llvm/Analysis/LoopInfo.h
M llvm/include/llvm/Analysis/ReleaseModeModelRunner.h
M llvm/include/llvm/CodeGen/GlobalISel/MachineIRBuilder.h
M llvm/include/llvm/CodeGen/MachinePostDominators.h
M llvm/include/llvm/IR/Analysis.h
M llvm/include/llvm/IR/IntrinsicsAMDGPU.td
M llvm/include/llvm/IR/IntrinsicsNVVM.td
M llvm/include/llvm/IR/ProfDataUtils.h
M llvm/include/llvm/Passes/MachinePassRegistry.def
M llvm/include/llvm/Support/GenericDomTreeConstruction.h
M llvm/include/llvm/TargetParser/X86TargetParser.def
M llvm/lib/Analysis/CallGraph.cpp
M llvm/lib/Analysis/ConstraintSystem.cpp
M llvm/lib/Analysis/InstructionSimplify.cpp
M llvm/lib/Analysis/LoopAccessAnalysis.cpp
M llvm/lib/Analysis/LoopInfo.cpp
M llvm/lib/Analysis/MLInlineAdvisor.cpp
M llvm/lib/Analysis/PHITransAddr.cpp
M llvm/lib/Analysis/ScalarEvolution.cpp
M llvm/lib/Analysis/TargetTransformInfo.cpp
M llvm/lib/Bitcode/Writer/BitcodeWriter.cpp
M llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
M llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp
M llvm/lib/CodeGen/MachinePostDominators.cpp
M llvm/lib/CodeGen/MachineSSAUpdater.cpp
M llvm/lib/IR/Instructions.cpp
M llvm/lib/IR/ProfDataUtils.cpp
M llvm/lib/IR/Verifier.cpp
M llvm/lib/MC/MCDwarf.cpp
M llvm/lib/Passes/PassBuilder.cpp
M llvm/lib/Target/AArch64/AArch64ISelDAGToDAG.cpp
M llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
M llvm/lib/Target/AArch64/AArch64InstrInfo.cpp
M llvm/lib/Target/AArch64/AArch64InstrInfo.td
M llvm/lib/Target/AArch64/AArch64SMEInstrInfo.td
M llvm/lib/Target/AArch64/AArch64Subtarget.h
M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp
M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.h
M llvm/lib/Target/AMDGPU/AMDGPU.td
M llvm/lib/Target/AMDGPU/AMDGPUArgumentUsageInfo.cpp
M llvm/lib/Target/AMDGPU/AMDGPUArgumentUsageInfo.h
M llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp
M llvm/lib/Target/AMDGPU/AMDGPUAtomicOptimizer.cpp
M llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
M llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.h
M llvm/lib/Target/AMDGPU/AMDGPUInstructionSelector.cpp
M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp
M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.h
M llvm/lib/Target/AMDGPU/AMDGPUSubtarget.cpp
M llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
M llvm/lib/Target/AMDGPU/BUFInstructions.td
M llvm/lib/Target/AMDGPU/GCNSubtarget.h
M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUMCExpr.cpp
M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUMCExpr.h
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/lib/Target/AMDGPU/SIInstructions.td
M llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.cpp
M llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.h
M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
M llvm/lib/Target/AMDGPU/VOP1Instructions.td
M llvm/lib/Target/AMDGPU/VOP2Instructions.td
M llvm/lib/Target/BPF/BPFRegisterInfo.cpp
M llvm/lib/Target/LoongArch/LoongArchISelLowering.cpp
M llvm/lib/Target/LoongArch/LoongArchISelLowering.h
M llvm/lib/Target/LoongArch/LoongArchInstrInfo.td
M llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
M llvm/lib/Target/NVPTX/NVPTXLowerArgs.cpp
M llvm/lib/Target/NVPTX/NVPTXUtilities.cpp
M llvm/lib/Target/NVPTX/NVPTXUtilities.h
M llvm/lib/Target/RISCV/GISel/RISCVLegalizerInfo.cpp
M llvm/lib/Target/RISCV/GISel/RISCVRegisterBankInfo.cpp
M llvm/lib/Target/RISCV/RISCV.td
M llvm/lib/Target/RISCV/RISCVProcessors.td
A llvm/lib/Target/RISCV/RISCVSchedSyntacoreSCR3.td
M llvm/lib/Target/SPIRV/MCTargetDesc/SPIRVBaseInfo.cpp
M llvm/lib/Target/SPIRV/MCTargetDesc/SPIRVBaseInfo.h
M llvm/lib/Target/SPIRV/SPIRVBuiltins.cpp
M llvm/lib/Target/SPIRV/SPIRVBuiltins.td
M llvm/lib/Target/SPIRV/SPIRVGlobalRegistry.cpp
M llvm/lib/Target/SPIRV/SPIRVGlobalRegistry.h
M llvm/lib/Target/SPIRV/SPIRVInstructionSelector.cpp
M llvm/lib/Target/SPIRV/SPIRVUtils.cpp
M llvm/lib/Target/X86/X86.td
M llvm/lib/Target/X86/X86ISelLowering.cpp
M llvm/lib/Target/X86/X86TargetTransformInfo.cpp
M llvm/lib/Target/Xtensa/XtensaAsmPrinter.cpp
M llvm/lib/Target/Xtensa/XtensaAsmPrinter.h
M llvm/lib/Target/Xtensa/XtensaISelLowering.cpp
M llvm/lib/Target/Xtensa/XtensaISelLowering.h
M llvm/lib/Target/Xtensa/XtensaInstrInfo.td
M llvm/lib/Target/Xtensa/XtensaOperators.td
M llvm/lib/TargetParser/RISCVISAInfo.cpp
M llvm/lib/TargetParser/X86TargetParser.cpp
M llvm/lib/Transforms/IPO/SCCP.cpp
M llvm/lib/Transforms/Scalar/LoopDistribute.cpp
M llvm/lib/Transforms/Scalar/Reassociate.cpp
M llvm/lib/Transforms/Utils/LoopUnrollRuntime.cpp
M llvm/lib/Transforms/Utils/PromoteMemoryToRegister.cpp
M llvm/lib/Transforms/Vectorize/LoopVectorizationPlanner.h
M llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
M llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
M llvm/lib/Transforms/Vectorize/VPlan.h
M llvm/lib/Transforms/Vectorize/VectorCombine.cpp
M llvm/runtimes/CMakeLists.txt
M llvm/test/Analysis/CostModel/AArch64/masked_ldst.ll
M llvm/test/Analysis/CostModel/AArch64/sve-gather.ll
M llvm/test/Analysis/CostModel/AArch64/sve-ldst.ll
M llvm/test/Analysis/CostModel/AArch64/sve-scatter.ll
M llvm/test/Analysis/CostModel/X86/arith-int-codesize.ll
M llvm/test/Analysis/CostModel/X86/arith-int-latency.ll
M llvm/test/Analysis/CostModel/X86/arith-int-sizelatency.ll
M llvm/test/Analysis/CostModel/X86/arith-int.ll
M llvm/test/Analysis/CostModel/X86/rem-codesize.ll
M llvm/test/Analysis/CostModel/X86/rem-latency.ll
M llvm/test/Analysis/CostModel/X86/rem-sizelatency.ll
M llvm/test/Analysis/CostModel/X86/rem.ll
M llvm/test/Analysis/CostModel/X86/slm-arith-costs.ll
A llvm/test/Analysis/LoopAccessAnalysis/debug-loc.ll
M llvm/test/Analysis/LoopAccessAnalysis/print-order.ll
M llvm/test/Analysis/LoopAccessAnalysis/scalable-vector-regression-tests.ll
M llvm/test/Analysis/LoopAccessAnalysis/symbolic-stride.ll
M llvm/test/Analysis/UniformityAnalysis/AMDGPU/intrinsics.ll
M llvm/test/CodeGen/AArch64/aarch64-sme2-asm.ll
M llvm/test/CodeGen/AArch64/sme-intrinsics-add.ll
M llvm/test/CodeGen/AArch64/sme-intrinsics-loads.ll
M llvm/test/CodeGen/AArch64/sme-intrinsics-mopa.ll
M llvm/test/CodeGen/AArch64/sme-intrinsics-mops.ll
M llvm/test/CodeGen/AArch64/sme-intrinsics-stores.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-add-sub-za16.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-add.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-cvt.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-cvtl.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-cvtn.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-extract-mova.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-fmlas.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-fmlas16.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-fp-dots.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-frint.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-insert-mova.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-int-dots.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-ld1.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-ldnt1.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti2-lane-x2.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti2-lane-x4.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti2-lane.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti4-lane-x2.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti4-lane-x4.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti4-lane.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-max.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-min.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-mlall.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-mlals.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-mop.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-mopa.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-qcvt.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-qcvtn.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-qrshr.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-rshl.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-select-sme-tileslice.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-sqdmulh.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-sub.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-vdot.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-zero-zt.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-zt0.ll
M llvm/test/CodeGen/AArch64/sme2p1-intrinsics-zero.ll
M llvm/test/CodeGen/AArch64/sve-callee-save-restore-pairs.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-contiguous-prefetches.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-conversion.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-counting-bits.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-counting-elems.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-fp-converts.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-ldN-sret-reg+imm-addr-mode.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-ldN-sret-reg+reg-addr-mode.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-logical.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-pred-creation.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-pred-operations.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-pred-testing.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-reinterpret.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-reversal.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-sel.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-sqdec.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-sqinc.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-st1-addressing-mode-reg-imm.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-st1-addressing-mode-reg-reg.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-st1.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-stN-reg-imm-addr-mode.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-stN-reg-reg-addr-mode.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-stores.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-uqdec.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-uqinc.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-while.ll
M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-reshuffle.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-binary-narrowing-add-sub.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-binary-narrowing-shr.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-complex-dot.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-contiguous-conflict-detection.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-fp-converts.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-fp-int-binary-logarithm.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-fp-widening-mul-acc.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-int-mul-lane.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-non-widening-pairwise-arith.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-polynomial-arithmetic.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-psel.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-revd.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-unary-narrowing.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-uniform-complex-arith.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-while-reversed.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-while.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-widening-complex-int-arith.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-widening-dsp.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-widening-pairwise-arith.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-cntp.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-dots.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-fclamp.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-fp-reduce.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-int-reduce.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-loads.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-predicate-as-counter.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-sclamp.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-selx2.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-selx4.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-stores.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-uclamp.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-unpk.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-uzpx2.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-uzpx4.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-while-pn.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-zipx2.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-zipx4.ll
A llvm/test/CodeGen/AArch64/taildup-ssa-update-pr62712.mir
M llvm/test/CodeGen/AMDGPU/GlobalISel/atomic_optimizations_mul_one.ll
M llvm/test/CodeGen/AMDGPU/GlobalISel/inst-select-load-constant.mir
M llvm/test/CodeGen/AMDGPU/GlobalISel/smrd.ll
M llvm/test/CodeGen/AMDGPU/atomic_optimization_split_dt_update.ll
M llvm/test/CodeGen/AMDGPU/buffer-fat-pointer-atomicrmw-fadd.ll
M llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-smem.ll
A llvm/test/CodeGen/AMDGPU/convergence-laneops.ll
M llvm/test/CodeGen/AMDGPU/gfx12_scalar_subword_loads.ll
M llvm/test/CodeGen/AMDGPU/global-atomic-scan.ll
M llvm/test/CodeGen/AMDGPU/global-saddr-load.ll
M llvm/test/CodeGen/AMDGPU/global_atomic_optimizer_fp_rtn.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_iterative_scan.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_iterative_scan_fp.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_optimizer_fp_no_rtn.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readfirstlane.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readfirstlane.ptr.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readlane.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readlane.ptr.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.writelane.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.writelane.ptr.ll
M llvm/test/CodeGen/AMDGPU/llvm.prefetch.ll
M llvm/test/CodeGen/AMDGPU/smrd.ll
M llvm/test/CodeGen/LoongArch/machinelicm-address-pseudos.ll
M llvm/test/CodeGen/LoongArch/psabi-restricted-scheduling.ll
A llvm/test/CodeGen/NVPTX/lower-args-gridconstant.ll
A llvm/test/CodeGen/NVPTX/mma-no-sink-after-laneid-check.ll
A llvm/test/CodeGen/RISCV/GlobalISel/instruction-select/fp-arith-f16.mir
M llvm/test/CodeGen/RISCV/GlobalISel/instruction-select/fp-arith.mir
A llvm/test/CodeGen/RISCV/GlobalISel/legalizer/legalize-fp-arith-f16.mir
A llvm/test/CodeGen/RISCV/GlobalISel/regbankselect/fp-arith-f16.mir
A llvm/test/CodeGen/SPIRV/var-uniform-const.ll
M llvm/test/CodeGen/X86/avx2-arith.ll
M llvm/test/CodeGen/X86/midpoint-int-vec-128.ll
M llvm/test/CodeGen/X86/midpoint-int-vec-256.ll
M llvm/test/CodeGen/X86/min-legal-vector-width.ll
M llvm/test/CodeGen/X86/pmul.ll
M llvm/test/CodeGen/X86/prefer-avx256-wide-mul.ll
M llvm/test/CodeGen/X86/vector-shuffle-combining-sse41.ll
A llvm/test/CodeGen/Xtensa/blockaddress.ll
A llvm/test/CodeGen/Xtensa/brcc.ll
A llvm/test/CodeGen/Xtensa/indirectbr.ll
A llvm/test/CodeGen/Xtensa/jumpt.ll
M llvm/test/DebugInfo/COFF/fission-sections.ll
M llvm/test/DebugInfo/unrolled-loop-remainder.ll
M llvm/test/Transforms/EarlyCSE/flags.ll
M llvm/test/Transforms/FunctionSpecialization/track-return.ll
A llvm/test/Transforms/InstCombine/AArch64/sve-intrinsic-comb-no-active-lanes-loads.ll
M llvm/test/Transforms/InstCombine/AMDGPU/amdgcn-intrinsics.ll
M llvm/test/Transforms/InstCombine/phi.ll
M llvm/test/Transforms/InstCombine/shift.ll
M llvm/test/Transforms/InstSimplify/phi.ll
M llvm/test/Transforms/LoopDeletion/update-scev.ll
A llvm/test/Transforms/LoopDistribute/debug-loc.ll
A llvm/test/Transforms/LoopDistribute/debug-print.ll
M llvm/test/Transforms/LoopUnroll/ARM/multi-blocks.ll
M llvm/test/Transforms/LoopUnroll/runtime-exit-phi-scev-invalidation.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop-at-most-two-exits.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop-multiple-exits.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop5.ll
M llvm/test/Transforms/LoopUnroll/runtime-multiexit-heuristic.ll
M llvm/test/Transforms/LoopUnroll/runtime-unroll-remainder.ll
M llvm/test/Transforms/LoopUnroll/tripcount-overflow.ll
M llvm/test/Transforms/LoopUnrollAndJam/unroll-and-jam.ll
M llvm/test/Transforms/LoopVectorize/AArch64/scalable-reductions.ll
M llvm/test/Transforms/LoopVectorize/AArch64/sve-illegal-type.ll
A llvm/test/Transforms/LoopVectorize/AArch64/sve-invalid-costs.ll
M llvm/test/Transforms/LoopVectorize/ARM/mve-hoist-runtime-checks.ll
M llvm/test/Transforms/LoopVectorize/X86/float-induction-x86.ll
M llvm/test/Transforms/LoopVectorize/create-induction-resume.ll
M llvm/test/Transforms/LoopVectorize/runtime-checks-hoist.ll
M llvm/test/Transforms/LoopVectorize/uniform-blend.ll
M llvm/test/Transforms/Mem2Reg/preserve-nonnull-load-metadata.ll
M llvm/test/Transforms/PhaseOrdering/AArch64/interleavevectorization.ll
M llvm/test/Transforms/SCCP/2009-01-14-IPSCCP-Invoke.ll
M llvm/test/Transforms/SCCP/apint-ipsccp2.ll
M llvm/test/Transforms/SCCP/arg-count-mismatch.ll
M llvm/test/Transforms/SCCP/indirectbr.ll
M llvm/test/Transforms/SCCP/ip-constant-ranges.ll
M llvm/test/Transforms/SCCP/ip-ranges-phis.ll
M llvm/test/Transforms/SCCP/ip-ranges-select.ll
M llvm/test/Transforms/SCCP/ipsccp-basic.ll
M llvm/test/Transforms/SCCP/ipsccp-clear-returned.ll
M llvm/test/Transforms/SCCP/ipsccp-noundef.ll
M llvm/test/Transforms/SCCP/ipsccp-preserve-analysis.ll
M llvm/test/Transforms/SCCP/issue59602-assume-like-call-users.ll
M llvm/test/Transforms/SCCP/metadata.ll
M llvm/test/Transforms/SCCP/musttail-call.ll
M llvm/test/Transforms/SCCP/range-and-ip.ll
M llvm/test/Transforms/SCCP/range-attribute.ll
M llvm/test/Transforms/SCCP/remove-call-inst.ll
M llvm/test/Transforms/SCCP/resolvedundefsin-tracked-fn.ll
M llvm/test/Transforms/SCCP/return-constant.ll
M llvm/test/Transforms/SCCP/solve-after-each-resolving-undefs-for-function.ll
M llvm/test/Transforms/SCCP/struct-arg-resolve-undefs.ll
M llvm/test/Transforms/VectorCombine/AArch64/shuffletoidentity-concat.ll
M llvm/test/Transforms/VectorCombine/AArch64/shuffletoidentity.ll
A llvm/test/tools/llvm-mca/RISCV/SyntacoreSCR/SCR3-ALU.s
A llvm/test/tools/llvm-mca/RISCV/SyntacoreSCR/SCR3-LSU.s
M llvm/test/tools/llvm-readobj/ELF/dynamic-tags.test
M llvm/tools/llvm-readobj/ELFDumper.cpp
M llvm/unittests/ADT/SmallPtrSetTest.cpp
M llvm/unittests/Analysis/MLModelRunnerTest.cpp
M llvm/unittests/CodeGen/GlobalISel/MachineIRBuilderTest.cpp
M llvm/unittests/IR/VectorBuilderTest.cpp
M llvm/unittests/Transforms/Vectorize/VPlanTest.cpp
M mlir/docs/DialectConversion.md
M mlir/include/mlir/Dialect/Linalg/Transforms/Transforms.h
M mlir/include/mlir/Dialect/OpenMP/OpenMPClauseOperands.h
M mlir/include/mlir/Dialect/OpenMP/OpenMPEnums.td
M mlir/include/mlir/Dialect/OpenMP/OpenMPOps.td
M mlir/include/mlir/Transforms/DialectConversion.h
M mlir/lib/Dialect/Affine/Transforms/LoopFusion.cpp
M mlir/lib/Dialect/ArmSME/Transforms/VectorLegalization.cpp
M mlir/lib/Dialect/GPU/Transforms/NVVMAttachTarget.cpp
M mlir/lib/Dialect/GPU/Transforms/ROCDLAttachTarget.cpp
M mlir/lib/Dialect/GPU/Transforms/SPIRVAttachTarget.cpp
M mlir/lib/Dialect/Linalg/TransformOps/LinalgTransformOps.cpp
M mlir/lib/Dialect/Linalg/Transforms/DropUnitDims.cpp
M mlir/lib/Dialect/OpenMP/IR/OpenMPDialect.cpp
M mlir/lib/Dialect/Tensor/IR/TensorOps.cpp
M mlir/lib/Dialect/Transform/Interfaces/MatchInterfaces.cpp
M mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp
M mlir/lib/IR/AffineMap.cpp
M mlir/lib/Tools/mlir-pdll-lsp-server/PDLLServer.cpp
M mlir/lib/Transforms/Utils/DialectConversion.cpp
M mlir/test/Dialect/ArmSME/vector-legalization.mlir
A mlir/test/Dialect/Linalg/rank-reduce-contraction-ops.mlir
M mlir/test/Dialect/OpenMP/invalid.mlir
M mlir/test/Dialect/OpenMP/ops.mlir
M mlir/test/Dialect/Tensor/fold-tensor-subset-ops-into-vector-transfers.mlir
M mlir/test/Dialect/Tensor/fold-tensor-subset-ops.mlir
M mlir/test/Dialect/Vector/fold-arith-extf-into-vector-contract.mlir
M mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/multi-tile-matmul.mlir
M mlir/test/Transforms/test-legalize-type-conversion.mlir
M mlir/test/lib/Dialect/Linalg/CMakeLists.txt
A mlir/test/lib/Dialect/Linalg/TestLinalgRankReduceContractionOps.cpp
M mlir/tools/mlir-opt/mlir-opt.cpp
M openmp/runtime/src/dllexports
M openmp/runtime/src/kmp.h
M openmp/runtime/src/kmp_csupport.cpp
M openmp/runtime/src/kmp_runtime.cpp
A openmp/runtime/test/parallel/omp_parallel_num_threads_list.c
A openmp/runtime/test/parallel/omp_parallel_num_threads_strict.c
M runtimes/cmake/Modules/WarningFlags.cmake
M utils/bazel/llvm-project-overlay/lldb/BUILD.bazel
M utils/bazel/llvm-project-overlay/lldb/source/Plugins/BUILD.bazel
A utils/bazel/llvm-project-overlay/lldb/tools/lldb-dap/BUILD.bazel
M utils/bazel/llvm-project-overlay/llvm/BUILD.bazel
M utils/bazel/llvm-project-overlay/mlir/BUILD.bazel
Log Message:
-----------
[𝘀𝗽𝗿] changes introduced through rebase
Created using spr 1.3.4
[skip ci]
Commit: c7058d67f34bb67ef1c0a1581773036223a57c2e
https://github.com/llvm/llvm-project/commit/c7058d67f34bb67ef1c0a1581773036223a57c2e
Author: Vitaly Buka <vitalybuka at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M .github/CODEOWNERS
M bolt/docs/CommandLineArgumentReference.md
M clang-tools-extra/clang-doc/tool/CMakeLists.txt
M clang-tools-extra/clang-doc/tool/ClangDocMain.cpp
M clang-tools-extra/clang-tidy/utils/DeclRefExprUtils.cpp
M clang-tools-extra/docs/ReleaseNotes.rst
M clang-tools-extra/test/clang-doc/Inputs/basic-project/src/Calculator.cpp
M clang-tools-extra/test/clang-doc/basic-project.test
M clang-tools-extra/test/clang-tidy/checkers/performance/unnecessary-copy-initialization.cpp
M clang/docs/LanguageExtensions.rst
M clang/include/clang/AST/Expr.h
M clang/include/clang/Basic/Builtins.td
M clang/include/clang/Basic/BuiltinsAMDGPU.def
M clang/include/clang/Basic/Cuda.h
M clang/include/clang/Basic/DiagnosticParseKinds.td
M clang/include/clang/Basic/DiagnosticSemaKinds.td
M clang/include/clang/Basic/LangOptions.h
M clang/include/clang/Basic/arm_sme.td
M clang/include/clang/Basic/arm_sve.td
M clang/include/clang/Basic/arm_sve_sme_incl.td
M clang/include/clang/CodeGen/ModuleBuilder.h
M clang/include/clang/Driver/Driver.h
M clang/include/clang/Driver/Options.td
M clang/lib/AST/ExprConstant.cpp
M clang/lib/AST/Interp/ByteCodeExprGen.cpp
M clang/lib/AST/Interp/Descriptor.cpp
M clang/lib/AST/Interp/Pointer.cpp
M clang/lib/Basic/Cuda.cpp
M clang/lib/Basic/Targets/NVPTX.cpp
M clang/lib/Basic/Targets/X86.cpp
M clang/lib/Basic/Targets/X86.h
M clang/lib/CodeGen/CGBuiltin.cpp
M clang/lib/CodeGen/CGDebugInfo.cpp
M clang/lib/CodeGen/CGDebugInfo.h
M clang/lib/CodeGen/CGOpenMPRuntimeGPU.cpp
M clang/lib/CodeGen/CMakeLists.txt
M clang/lib/CodeGen/CodeGenModule.cpp
M clang/lib/Driver/Driver.cpp
M clang/lib/Driver/ToolChains/Arch/X86.cpp
M clang/lib/Driver/ToolChains/Clang.cpp
M clang/lib/Driver/ToolChains/HIPAMD.cpp
M clang/lib/Driver/ToolChains/HIPAMD.h
M clang/lib/Driver/ToolChains/WebAssembly.cpp
M clang/lib/Lex/LiteralSupport.cpp
M clang/lib/Parse/ParseOpenMP.cpp
M clang/lib/Sema/SemaChecking.cpp
M clang/lib/Sema/SemaOpenMP.cpp
M clang/lib/Sema/SemaTemplate.cpp
M clang/lib/Sema/SemaType.cpp
M clang/lib/Serialization/ASTWriter.cpp
M clang/lib/Tooling/Inclusions/Stdlib/CMakeLists.txt
M clang/test/AST/Interp/literals.cpp
A clang/test/C/C2x/n2819.c
A clang/test/CodeGenCXX/debug-info-verbose-trap.cpp
A clang/test/CodeGenOpenCL/builtins-amdgcn-raw-buffer-store.cl
M clang/test/CodeGenOpenCL/builtins-amdgcn.cl
M clang/test/Driver/cuda-arch-translation.cu
M clang/test/Driver/x86-target-features.c
M clang/test/Frontend/embed-bitcode.ll
M clang/test/Misc/target-invalid-cpu-note.c
A clang/test/Modules/no-external-identifier-id.cppm
A clang/test/Modules/no-external-type-id.cppm
M clang/test/OpenMP/Inputs/nesting_of_regions.cpp
M clang/test/OpenMP/declare_variant_clauses_messages.cpp
M clang/test/Preprocessor/x86_target_features.c
M clang/test/Sema/aarch64-sve-intrinsics/acle_sve_bfloat.cpp
M clang/test/Sema/aarch64-sve2-intrinsics/acle_sve2_bfloat.cpp
M clang/test/Sema/arm-vector-types-support.c
R clang/test/SemaCUDA/neon-attrs.cu
A clang/test/SemaCXX/verbose-trap.cpp
A clang/test/SemaOpenCL/builtins-amdgcn-raw-buffer-store-error.cl
M clang/utils/TableGen/NeonEmitter.cpp
M clang/utils/TableGen/SveEmitter.cpp
M clang/www/c_status.html
M clang/www/get_involved.html
M compiler-rt/lib/gwp_asan/guarded_pool_allocator.cpp
M compiler-rt/lib/gwp_asan/platform_specific/guarded_pool_allocator_fuchsia.cpp
M compiler-rt/lib/gwp_asan/platform_specific/guarded_pool_allocator_posix.cpp
M compiler-rt/lib/gwp_asan/utilities.h
M flang/include/flang/Lower/AbstractConverter.h
M flang/include/flang/Optimizer/Transforms/Passes.h
M flang/include/flang/Optimizer/Transforms/Passes.td
M flang/include/flang/Tools/CLOptions.inc
M flang/lib/Evaluate/intrinsics.cpp
M flang/lib/Lower/Bridge.cpp
M flang/lib/Lower/OpenMP/ClauseProcessor.cpp
M flang/lib/Optimizer/Transforms/CMakeLists.txt
A flang/lib/Optimizer/Transforms/ConstantArgumentGlobalisation.cpp
M flang/lib/Optimizer/Transforms/DebugTypeGenerator.cpp
R flang/test/Driver/print-resource-dir.F90
M flang/test/Fir/boxproc.fir
A flang/test/Integration/debug-ptr-type.f90
A flang/test/Lower/OpenMP/copyprivate2.f90
M flang/test/Lower/character-local-variables.f90
A flang/test/Transforms/constant-argument-globalisation-2.fir
A flang/test/Transforms/constant-argument-globalisation.fir
A flang/test/Transforms/debug-ptr-type.fir
M flang/unittests/Evaluate/intrinsics.cpp
M libc/cmake/modules/prepare_libc_gpu_build.cmake
M libc/config/darwin/arm/entrypoints.txt
M libc/config/linux/aarch64/entrypoints.txt
M libc/config/linux/app.h
M libc/config/linux/arm/entrypoints.txt
M libc/config/linux/riscv/entrypoints.txt
M libc/config/linux/x86_64/entrypoints.txt
M libc/docs/math/index.rst
M libc/include/llvm-libc-macros/CMakeLists.txt
A libc/include/llvm-libc-macros/link-macros.h
M libc/spec/stdc.td
M libc/src/__support/FPUtil/CMakeLists.txt
M libc/src/__support/FPUtil/double_double.h
M libc/src/__support/FPUtil/dyadic_float.h
M libc/src/__support/FPUtil/generic/CMakeLists.txt
A libc/src/__support/FPUtil/generic/div.h
M libc/src/__support/OSUtil/linux/arm/syscall.h
M libc/src/__support/macros/optimization.h
M libc/src/__support/threads/thread.h
M libc/src/math/CMakeLists.txt
A libc/src/math/f16divf.h
M libc/src/math/generic/CMakeLists.txt
A libc/src/math/generic/f16divf.cpp
A libc/src/math/generic/range_reduction_double_common.h
A libc/src/math/generic/range_reduction_double_fma.h
A libc/src/math/generic/range_reduction_double_nofma.h
A libc/src/math/generic/sin.cpp
A libc/src/math/generic/sincos_eval.h
M libc/src/math/x86_64/CMakeLists.txt
R libc/src/math/x86_64/sin.cpp
M libc/startup/linux/CMakeLists.txt
M libc/startup/linux/do_start.cpp
M libc/test/src/__support/CMakeLists.txt
M libc/test/src/math/CMakeLists.txt
A libc/test/src/math/DivTest.h
A libc/test/src/math/f16divf_test.cpp
M libc/test/src/math/sin_test.cpp
M libc/test/src/math/smoke/CMakeLists.txt
A libc/test/src/math/smoke/DivTest.h
A libc/test/src/math/smoke/f16divf_test.cpp
A libc/test/src/math/smoke/sin_test.cpp
M libc/utils/MPFRWrapper/MPFRUtils.cpp
M libc/utils/MPFRWrapper/MPFRUtils.h
M libcxx/CMakeLists.txt
M libcxx/docs/ReleaseNotes/19.rst
M libcxx/include/__atomic/atomic_init.h
M libcxx/include/__mdspan/extents.h
M libcxx/include/__memory/allocator.h
M libcxx/include/__memory/allocator_traits.h
M libcxx/include/__memory/shared_ptr.h
M libcxx/include/__numeric/gcd_lcm.h
M libcxx/include/__type_traits/conditional.h
M libcxx/include/__type_traits/is_nothrow_destructible.h
M libcxx/include/deque
M libcxx/include/forward_list
M libcxx/include/list
M libcxx/include/map
M libcxx/include/set
M libcxx/include/string
M libcxx/include/unordered_map
M libcxx/include/unordered_set
M libcxx/include/vector
M libcxx/lib/abi/powerpc-ibm-aix.libcxxabi.v1.stable.exceptions.nonew.abilist
M libcxx/lib/abi/powerpc64-ibm-aix.libcxxabi.v1.stable.exceptions.nonew.abilist
M libcxx/lib/abi/x86_64-unknown-freebsd.libcxxabi.v1.stable.exceptions.nonew.abilist
M libcxx/lib/abi/x86_64-unknown-linux-gnu.libcxxabi.v1.stable.exceptions.nonew.abilist
M libcxx/lib/abi/x86_64-unknown-linux-gnu.libcxxabi.v1.stable.noexceptions.nonew.abilist
M libcxx/modules/CMakeLists.txt.in
M libcxx/src/barrier.cpp
M libcxx/src/filesystem/operations.cpp
M libcxx/src/filesystem/path.cpp
M libcxx/src/filesystem/path_parser.h
M libcxx/src/locale.cpp
M libcxx/test/CMakeLists.txt
M libcxx/test/configs/llvm-libc++-shared-clangcl.cfg.in
M libcxx/test/configs/llvm-libc++-shared-no-vcruntime-clangcl.cfg.in
M libcxx/test/configs/llvm-libc++-static-clangcl.cfg.in
R libcxx/test/libcxx/containers/sequences/vector/const_T.compile.pass.cpp
R libcxx/test/libcxx/containers/sequences/vector/const_value_type.pass.cpp
M libcxx/test/libcxx/diagnostics/memory.nodiscard.verify.cpp
M libcxx/test/libcxx/memory/allocator_void.trivial.compile.pass.cpp
M libcxx/test/libcxx/memory/allocator_volatile.verify.cpp
M libcxx/test/std/concepts/concepts.lang/concept.default.init/default_initializable.compile.pass.cpp
A libcxx/test/std/containers/sequences/array/array.overview/nttp.equivalence.compile.pass.cpp
A libcxx/test/std/containers/sequences/array/array.overview/nttp.verify.cpp
M libcxx/test/std/containers/views/mdspan/extents/ctad.pass.cpp
M libcxx/test/std/numerics/numeric.ops/numeric.ops.lcm/lcm.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.ctor.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.dtor.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.members/allocate.constexpr.size.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.members/allocate.size.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.deprecated_in_cxx17.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.deprecated_in_cxx23.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.removed_in_cxx20.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.removed_in_cxx26.verify.cpp
M libcxx/test/std/utilities/memory/specialized.algorithms/specialized.construct/construct_at.pass.cpp
M libcxx/test/std/utilities/meta/meta.unary/meta.unary.prop/is_nothrow_destructible.pass.cpp
A libcxx/test/std/utilities/utility/pairs/pairs.pair/nttp.equivalence.compile.pass.cpp
A libcxx/test/std/utilities/utility/pairs/pairs.pair/nttp.verify.cpp
M lld/ELF/InputFiles.cpp
M lld/ELF/InputFiles.h
M lldb/include/lldb/API/SBDebugger.h
M lldb/include/lldb/Symbol/TypeSystem.h
M lldb/source/API/SBDebugger.cpp
M lldb/source/Plugins/ObjectFile/Minidump/MinidumpFileBuilder.cpp
M lldb/source/Plugins/ObjectFile/Minidump/MinidumpFileBuilder.h
M lldb/source/Plugins/SymbolFile/DWARF/DWARFASTParserClang.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARF.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARF.h
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDebugMap.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDebugMap.h
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDwo.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDwo.h
M lldb/source/Symbol/TypeSystem.cpp
M lldb/test/API/python_api/find_in_memory/TestFindInMemory.py
M lldb/test/API/python_api/find_in_memory/address_ranges_helper.py
M lldb/test/API/python_api/find_in_memory/main.cpp
M lldb/tools/lldb-dap/DAP.cpp
M lldb/tools/lldb-dap/DAP.h
M lldb/tools/lldb-dap/lldb-dap.cpp
M llvm/docs/AMDGPUUsage.rst
M llvm/include/llvm/ADT/SetOperations.h
M llvm/include/llvm/ADT/SmallPtrSet.h
M llvm/include/llvm/Analysis/CFGPrinter.h
M llvm/include/llvm/Analysis/LoopInfo.h
M llvm/include/llvm/Analysis/ReleaseModeModelRunner.h
M llvm/include/llvm/CodeGen/GlobalISel/MachineIRBuilder.h
M llvm/include/llvm/CodeGen/MachinePostDominators.h
M llvm/include/llvm/IR/Analysis.h
M llvm/include/llvm/IR/IntrinsicsAMDGPU.td
M llvm/include/llvm/IR/IntrinsicsNVVM.td
M llvm/include/llvm/IR/ProfDataUtils.h
M llvm/include/llvm/Passes/MachinePassRegistry.def
M llvm/include/llvm/Support/GenericDomTreeConstruction.h
M llvm/include/llvm/TargetParser/X86TargetParser.def
M llvm/lib/Analysis/CallGraph.cpp
M llvm/lib/Analysis/ConstraintSystem.cpp
M llvm/lib/Analysis/InstructionSimplify.cpp
M llvm/lib/Analysis/LoopAccessAnalysis.cpp
M llvm/lib/Analysis/LoopInfo.cpp
M llvm/lib/Analysis/MLInlineAdvisor.cpp
M llvm/lib/Analysis/PHITransAddr.cpp
M llvm/lib/Analysis/ScalarEvolution.cpp
M llvm/lib/Analysis/TargetTransformInfo.cpp
M llvm/lib/Bitcode/Writer/BitcodeWriter.cpp
M llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
M llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp
M llvm/lib/CodeGen/MachinePostDominators.cpp
M llvm/lib/CodeGen/MachineSSAUpdater.cpp
M llvm/lib/IR/Instructions.cpp
M llvm/lib/IR/ProfDataUtils.cpp
M llvm/lib/IR/Verifier.cpp
M llvm/lib/MC/MCDwarf.cpp
M llvm/lib/Passes/PassBuilder.cpp
M llvm/lib/Target/AArch64/AArch64ISelDAGToDAG.cpp
M llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
M llvm/lib/Target/AArch64/AArch64InstrInfo.cpp
M llvm/lib/Target/AArch64/AArch64InstrInfo.td
M llvm/lib/Target/AArch64/AArch64SMEInstrInfo.td
M llvm/lib/Target/AArch64/AArch64Subtarget.h
M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp
M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.h
M llvm/lib/Target/AMDGPU/AMDGPU.td
M llvm/lib/Target/AMDGPU/AMDGPUArgumentUsageInfo.cpp
M llvm/lib/Target/AMDGPU/AMDGPUArgumentUsageInfo.h
M llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp
M llvm/lib/Target/AMDGPU/AMDGPUAtomicOptimizer.cpp
M llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
M llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.h
M llvm/lib/Target/AMDGPU/AMDGPUInstructionSelector.cpp
M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp
M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.h
M llvm/lib/Target/AMDGPU/AMDGPUSubtarget.cpp
M llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
M llvm/lib/Target/AMDGPU/BUFInstructions.td
M llvm/lib/Target/AMDGPU/GCNSubtarget.h
M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUMCExpr.cpp
M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUMCExpr.h
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/lib/Target/AMDGPU/SIInstructions.td
M llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.cpp
M llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.h
M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
M llvm/lib/Target/AMDGPU/VOP1Instructions.td
M llvm/lib/Target/AMDGPU/VOP2Instructions.td
M llvm/lib/Target/BPF/BPFRegisterInfo.cpp
M llvm/lib/Target/LoongArch/LoongArchISelLowering.cpp
M llvm/lib/Target/LoongArch/LoongArchISelLowering.h
M llvm/lib/Target/LoongArch/LoongArchInstrInfo.td
M llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
M llvm/lib/Target/NVPTX/NVPTXLowerArgs.cpp
M llvm/lib/Target/NVPTX/NVPTXUtilities.cpp
M llvm/lib/Target/NVPTX/NVPTXUtilities.h
M llvm/lib/Target/RISCV/GISel/RISCVLegalizerInfo.cpp
M llvm/lib/Target/RISCV/GISel/RISCVRegisterBankInfo.cpp
M llvm/lib/Target/RISCV/RISCV.td
M llvm/lib/Target/RISCV/RISCVProcessors.td
A llvm/lib/Target/RISCV/RISCVSchedSyntacoreSCR3.td
M llvm/lib/Target/SPIRV/MCTargetDesc/SPIRVBaseInfo.cpp
M llvm/lib/Target/SPIRV/MCTargetDesc/SPIRVBaseInfo.h
M llvm/lib/Target/SPIRV/SPIRVBuiltins.cpp
M llvm/lib/Target/SPIRV/SPIRVBuiltins.td
M llvm/lib/Target/SPIRV/SPIRVGlobalRegistry.cpp
M llvm/lib/Target/SPIRV/SPIRVGlobalRegistry.h
M llvm/lib/Target/SPIRV/SPIRVInstructionSelector.cpp
M llvm/lib/Target/SPIRV/SPIRVUtils.cpp
M llvm/lib/Target/X86/X86.td
M llvm/lib/Target/X86/X86ISelLowering.cpp
M llvm/lib/Target/X86/X86TargetTransformInfo.cpp
M llvm/lib/Target/Xtensa/XtensaAsmPrinter.cpp
M llvm/lib/Target/Xtensa/XtensaAsmPrinter.h
M llvm/lib/Target/Xtensa/XtensaISelLowering.cpp
M llvm/lib/Target/Xtensa/XtensaISelLowering.h
M llvm/lib/Target/Xtensa/XtensaInstrInfo.td
M llvm/lib/Target/Xtensa/XtensaOperators.td
M llvm/lib/TargetParser/RISCVISAInfo.cpp
M llvm/lib/TargetParser/X86TargetParser.cpp
M llvm/lib/Transforms/IPO/SCCP.cpp
M llvm/lib/Transforms/Scalar/LoopDistribute.cpp
M llvm/lib/Transforms/Scalar/Reassociate.cpp
M llvm/lib/Transforms/Utils/LoopUnrollRuntime.cpp
M llvm/lib/Transforms/Utils/PromoteMemoryToRegister.cpp
M llvm/lib/Transforms/Vectorize/LoopVectorizationPlanner.h
M llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
M llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
M llvm/lib/Transforms/Vectorize/VPlan.h
M llvm/lib/Transforms/Vectorize/VectorCombine.cpp
M llvm/runtimes/CMakeLists.txt
M llvm/test/Analysis/CostModel/AArch64/masked_ldst.ll
M llvm/test/Analysis/CostModel/AArch64/sve-gather.ll
M llvm/test/Analysis/CostModel/AArch64/sve-ldst.ll
M llvm/test/Analysis/CostModel/AArch64/sve-scatter.ll
M llvm/test/Analysis/CostModel/X86/arith-int-codesize.ll
M llvm/test/Analysis/CostModel/X86/arith-int-latency.ll
M llvm/test/Analysis/CostModel/X86/arith-int-sizelatency.ll
M llvm/test/Analysis/CostModel/X86/arith-int.ll
M llvm/test/Analysis/CostModel/X86/rem-codesize.ll
M llvm/test/Analysis/CostModel/X86/rem-latency.ll
M llvm/test/Analysis/CostModel/X86/rem-sizelatency.ll
M llvm/test/Analysis/CostModel/X86/rem.ll
M llvm/test/Analysis/CostModel/X86/slm-arith-costs.ll
A llvm/test/Analysis/LoopAccessAnalysis/debug-loc.ll
M llvm/test/Analysis/LoopAccessAnalysis/print-order.ll
M llvm/test/Analysis/LoopAccessAnalysis/scalable-vector-regression-tests.ll
M llvm/test/Analysis/LoopAccessAnalysis/symbolic-stride.ll
M llvm/test/Analysis/UniformityAnalysis/AMDGPU/intrinsics.ll
M llvm/test/CodeGen/AArch64/aarch64-sme2-asm.ll
M llvm/test/CodeGen/AArch64/sme-intrinsics-add.ll
M llvm/test/CodeGen/AArch64/sme-intrinsics-loads.ll
M llvm/test/CodeGen/AArch64/sme-intrinsics-mopa.ll
M llvm/test/CodeGen/AArch64/sme-intrinsics-mops.ll
M llvm/test/CodeGen/AArch64/sme-intrinsics-stores.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-add-sub-za16.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-add.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-cvt.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-cvtl.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-cvtn.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-extract-mova.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-fmlas.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-fmlas16.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-fp-dots.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-frint.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-insert-mova.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-int-dots.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-ld1.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-ldnt1.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti2-lane-x2.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti2-lane-x4.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti2-lane.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti4-lane-x2.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti4-lane-x4.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-luti4-lane.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-max.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-min.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-mlall.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-mlals.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-mop.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-mopa.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-qcvt.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-qcvtn.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-qrshr.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-rshl.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-select-sme-tileslice.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-sqdmulh.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-sub.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-vdot.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-zero-zt.ll
M llvm/test/CodeGen/AArch64/sme2-intrinsics-zt0.ll
M llvm/test/CodeGen/AArch64/sme2p1-intrinsics-zero.ll
M llvm/test/CodeGen/AArch64/sve-callee-save-restore-pairs.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-contiguous-prefetches.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-conversion.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-counting-bits.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-counting-elems.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-fp-converts.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-ldN-sret-reg+imm-addr-mode.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-ldN-sret-reg+reg-addr-mode.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-logical.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-pred-creation.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-pred-operations.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-pred-testing.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-reinterpret.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-reversal.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-sel.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-sqdec.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-sqinc.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-st1-addressing-mode-reg-imm.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-st1-addressing-mode-reg-reg.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-st1.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-stN-reg-imm-addr-mode.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-stN-reg-reg-addr-mode.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-stores.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-uqdec.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-uqinc.ll
M llvm/test/CodeGen/AArch64/sve-intrinsics-while.ll
M llvm/test/CodeGen/AArch64/sve-streaming-mode-fixed-length-reshuffle.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-binary-narrowing-add-sub.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-binary-narrowing-shr.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-complex-dot.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-contiguous-conflict-detection.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-fp-converts.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-fp-int-binary-logarithm.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-fp-widening-mul-acc.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-int-mul-lane.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-non-widening-pairwise-arith.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-polynomial-arithmetic.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-psel.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-revd.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-unary-narrowing.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-uniform-complex-arith.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-while-reversed.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-while.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-widening-complex-int-arith.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-widening-dsp.ll
M llvm/test/CodeGen/AArch64/sve2-intrinsics-widening-pairwise-arith.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-cntp.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-dots.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-fclamp.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-fp-reduce.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-int-reduce.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-loads.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-predicate-as-counter.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-sclamp.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-selx2.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-selx4.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-stores.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-uclamp.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-unpk.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-uzpx2.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-uzpx4.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-while-pn.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-zipx2.ll
M llvm/test/CodeGen/AArch64/sve2p1-intrinsics-zipx4.ll
A llvm/test/CodeGen/AArch64/taildup-ssa-update-pr62712.mir
M llvm/test/CodeGen/AMDGPU/GlobalISel/atomic_optimizations_mul_one.ll
M llvm/test/CodeGen/AMDGPU/GlobalISel/inst-select-load-constant.mir
M llvm/test/CodeGen/AMDGPU/GlobalISel/smrd.ll
M llvm/test/CodeGen/AMDGPU/atomic_optimization_split_dt_update.ll
M llvm/test/CodeGen/AMDGPU/buffer-fat-pointer-atomicrmw-fadd.ll
M llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-smem.ll
A llvm/test/CodeGen/AMDGPU/convergence-laneops.ll
M llvm/test/CodeGen/AMDGPU/gfx12_scalar_subword_loads.ll
M llvm/test/CodeGen/AMDGPU/global-atomic-scan.ll
M llvm/test/CodeGen/AMDGPU/global-saddr-load.ll
M llvm/test/CodeGen/AMDGPU/global_atomic_optimizer_fp_rtn.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_iterative_scan.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_iterative_scan_fp.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_optimizer_fp_no_rtn.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readfirstlane.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readfirstlane.ptr.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readlane.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readlane.ptr.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.writelane.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.writelane.ptr.ll
M llvm/test/CodeGen/AMDGPU/llvm.prefetch.ll
M llvm/test/CodeGen/AMDGPU/smrd.ll
M llvm/test/CodeGen/LoongArch/machinelicm-address-pseudos.ll
M llvm/test/CodeGen/LoongArch/psabi-restricted-scheduling.ll
A llvm/test/CodeGen/NVPTX/lower-args-gridconstant.ll
A llvm/test/CodeGen/NVPTX/mma-no-sink-after-laneid-check.ll
A llvm/test/CodeGen/RISCV/GlobalISel/instruction-select/fp-arith-f16.mir
M llvm/test/CodeGen/RISCV/GlobalISel/instruction-select/fp-arith.mir
A llvm/test/CodeGen/RISCV/GlobalISel/legalizer/legalize-fp-arith-f16.mir
A llvm/test/CodeGen/RISCV/GlobalISel/regbankselect/fp-arith-f16.mir
A llvm/test/CodeGen/SPIRV/var-uniform-const.ll
M llvm/test/CodeGen/X86/avx2-arith.ll
M llvm/test/CodeGen/X86/midpoint-int-vec-128.ll
M llvm/test/CodeGen/X86/midpoint-int-vec-256.ll
M llvm/test/CodeGen/X86/min-legal-vector-width.ll
M llvm/test/CodeGen/X86/pmul.ll
M llvm/test/CodeGen/X86/prefer-avx256-wide-mul.ll
M llvm/test/CodeGen/X86/vector-shuffle-combining-sse41.ll
A llvm/test/CodeGen/Xtensa/blockaddress.ll
A llvm/test/CodeGen/Xtensa/brcc.ll
A llvm/test/CodeGen/Xtensa/indirectbr.ll
A llvm/test/CodeGen/Xtensa/jumpt.ll
M llvm/test/DebugInfo/COFF/fission-sections.ll
M llvm/test/DebugInfo/unrolled-loop-remainder.ll
M llvm/test/Transforms/EarlyCSE/flags.ll
M llvm/test/Transforms/FunctionSpecialization/track-return.ll
A llvm/test/Transforms/InstCombine/AArch64/sve-intrinsic-comb-no-active-lanes-loads.ll
M llvm/test/Transforms/InstCombine/AMDGPU/amdgcn-intrinsics.ll
M llvm/test/Transforms/InstCombine/phi.ll
M llvm/test/Transforms/InstCombine/shift.ll
M llvm/test/Transforms/InstSimplify/phi.ll
M llvm/test/Transforms/LoopDeletion/update-scev.ll
A llvm/test/Transforms/LoopDistribute/debug-loc.ll
A llvm/test/Transforms/LoopDistribute/debug-print.ll
M llvm/test/Transforms/LoopUnroll/ARM/multi-blocks.ll
M llvm/test/Transforms/LoopUnroll/runtime-exit-phi-scev-invalidation.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop-at-most-two-exits.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop-multiple-exits.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop5.ll
M llvm/test/Transforms/LoopUnroll/runtime-multiexit-heuristic.ll
M llvm/test/Transforms/LoopUnroll/runtime-unroll-remainder.ll
M llvm/test/Transforms/LoopUnroll/tripcount-overflow.ll
M llvm/test/Transforms/LoopUnrollAndJam/unroll-and-jam.ll
M llvm/test/Transforms/LoopVectorize/AArch64/scalable-reductions.ll
M llvm/test/Transforms/LoopVectorize/AArch64/sve-illegal-type.ll
A llvm/test/Transforms/LoopVectorize/AArch64/sve-invalid-costs.ll
M llvm/test/Transforms/LoopVectorize/ARM/mve-hoist-runtime-checks.ll
M llvm/test/Transforms/LoopVectorize/X86/float-induction-x86.ll
M llvm/test/Transforms/LoopVectorize/create-induction-resume.ll
M llvm/test/Transforms/LoopVectorize/runtime-checks-hoist.ll
M llvm/test/Transforms/LoopVectorize/uniform-blend.ll
M llvm/test/Transforms/Mem2Reg/preserve-nonnull-load-metadata.ll
M llvm/test/Transforms/PhaseOrdering/AArch64/interleavevectorization.ll
M llvm/test/Transforms/SCCP/2009-01-14-IPSCCP-Invoke.ll
M llvm/test/Transforms/SCCP/apint-ipsccp2.ll
M llvm/test/Transforms/SCCP/arg-count-mismatch.ll
M llvm/test/Transforms/SCCP/indirectbr.ll
M llvm/test/Transforms/SCCP/ip-constant-ranges.ll
M llvm/test/Transforms/SCCP/ip-ranges-phis.ll
M llvm/test/Transforms/SCCP/ip-ranges-select.ll
M llvm/test/Transforms/SCCP/ipsccp-basic.ll
M llvm/test/Transforms/SCCP/ipsccp-clear-returned.ll
M llvm/test/Transforms/SCCP/ipsccp-noundef.ll
M llvm/test/Transforms/SCCP/ipsccp-preserve-analysis.ll
M llvm/test/Transforms/SCCP/issue59602-assume-like-call-users.ll
M llvm/test/Transforms/SCCP/metadata.ll
M llvm/test/Transforms/SCCP/musttail-call.ll
M llvm/test/Transforms/SCCP/range-and-ip.ll
M llvm/test/Transforms/SCCP/range-attribute.ll
M llvm/test/Transforms/SCCP/remove-call-inst.ll
M llvm/test/Transforms/SCCP/resolvedundefsin-tracked-fn.ll
M llvm/test/Transforms/SCCP/return-constant.ll
M llvm/test/Transforms/SCCP/solve-after-each-resolving-undefs-for-function.ll
M llvm/test/Transforms/SCCP/struct-arg-resolve-undefs.ll
M llvm/test/Transforms/VectorCombine/AArch64/shuffletoidentity-concat.ll
M llvm/test/Transforms/VectorCombine/AArch64/shuffletoidentity.ll
A llvm/test/tools/llvm-mca/RISCV/SyntacoreSCR/SCR3-ALU.s
A llvm/test/tools/llvm-mca/RISCV/SyntacoreSCR/SCR3-LSU.s
M llvm/test/tools/llvm-readobj/ELF/dynamic-tags.test
M llvm/tools/llvm-readobj/ELFDumper.cpp
M llvm/unittests/ADT/SmallPtrSetTest.cpp
M llvm/unittests/Analysis/MLModelRunnerTest.cpp
M llvm/unittests/CodeGen/GlobalISel/MachineIRBuilderTest.cpp
M llvm/unittests/IR/VectorBuilderTest.cpp
M llvm/unittests/Transforms/Vectorize/VPlanTest.cpp
M mlir/docs/DialectConversion.md
M mlir/include/mlir/Dialect/Linalg/Transforms/Transforms.h
M mlir/include/mlir/Dialect/OpenMP/OpenMPClauseOperands.h
M mlir/include/mlir/Dialect/OpenMP/OpenMPEnums.td
M mlir/include/mlir/Dialect/OpenMP/OpenMPOps.td
M mlir/include/mlir/Transforms/DialectConversion.h
M mlir/lib/Dialect/Affine/Transforms/LoopFusion.cpp
M mlir/lib/Dialect/ArmSME/Transforms/VectorLegalization.cpp
M mlir/lib/Dialect/GPU/Transforms/NVVMAttachTarget.cpp
M mlir/lib/Dialect/GPU/Transforms/ROCDLAttachTarget.cpp
M mlir/lib/Dialect/GPU/Transforms/SPIRVAttachTarget.cpp
M mlir/lib/Dialect/Linalg/TransformOps/LinalgTransformOps.cpp
M mlir/lib/Dialect/Linalg/Transforms/DropUnitDims.cpp
M mlir/lib/Dialect/OpenMP/IR/OpenMPDialect.cpp
M mlir/lib/Dialect/Tensor/IR/TensorOps.cpp
M mlir/lib/Dialect/Transform/Interfaces/MatchInterfaces.cpp
M mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp
M mlir/lib/IR/AffineMap.cpp
M mlir/lib/Tools/mlir-pdll-lsp-server/PDLLServer.cpp
M mlir/lib/Transforms/Utils/DialectConversion.cpp
M mlir/test/Dialect/ArmSME/vector-legalization.mlir
A mlir/test/Dialect/Linalg/rank-reduce-contraction-ops.mlir
M mlir/test/Dialect/OpenMP/invalid.mlir
M mlir/test/Dialect/OpenMP/ops.mlir
M mlir/test/Dialect/Tensor/fold-tensor-subset-ops-into-vector-transfers.mlir
M mlir/test/Dialect/Tensor/fold-tensor-subset-ops.mlir
M mlir/test/Dialect/Vector/fold-arith-extf-into-vector-contract.mlir
M mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/multi-tile-matmul.mlir
M mlir/test/Transforms/test-legalize-type-conversion.mlir
M mlir/test/lib/Dialect/Linalg/CMakeLists.txt
A mlir/test/lib/Dialect/Linalg/TestLinalgRankReduceContractionOps.cpp
M mlir/tools/mlir-opt/mlir-opt.cpp
M openmp/runtime/src/dllexports
M openmp/runtime/src/kmp.h
M openmp/runtime/src/kmp_csupport.cpp
M openmp/runtime/src/kmp_runtime.cpp
A openmp/runtime/test/parallel/omp_parallel_num_threads_list.c
A openmp/runtime/test/parallel/omp_parallel_num_threads_strict.c
M runtimes/cmake/Modules/WarningFlags.cmake
M utils/bazel/llvm-project-overlay/lldb/BUILD.bazel
M utils/bazel/llvm-project-overlay/lldb/source/Plugins/BUILD.bazel
A utils/bazel/llvm-project-overlay/lldb/tools/lldb-dap/BUILD.bazel
M utils/bazel/llvm-project-overlay/llvm/BUILD.bazel
M utils/bazel/llvm-project-overlay/mlir/BUILD.bazel
Log Message:
-----------
rebase
Created using spr 1.3.4
Compare: https://github.com/llvm/llvm-project/compare/62e1a0a8f058...c7058d67f34b
To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications
More information about the All-commits
mailing list