[all-commits] [llvm/llvm-project] 8348d7: [clang-tidy] Fix assert in performance-unnecessary...
Fangrui Song via All-commits
all-commits at lists.llvm.org
Tue Jun 25 20:11:14 PDT 2024
Branch: refs/heads/users/MaskRay/spr/elf-rename-isrela-to-hasaddend
Home: https://github.com/llvm/llvm-project
Commit: 8348d720ef913b0ff92b468be2eb9f4ea273cb5a
https://github.com/llvm/llvm-project/commit/8348d720ef913b0ff92b468be2eb9f4ea273cb5a
Author: Clement Courbet <courbet at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang-tools-extra/clang-tidy/utils/DeclRefExprUtils.cpp
M clang-tools-extra/docs/ReleaseNotes.rst
M clang-tools-extra/test/clang-tidy/checkers/performance/unnecessary-copy-initialization.cpp
Log Message:
-----------
[clang-tidy] Fix assert in performance-unnecessary-copy-init. (#96506)
`GetDirectCallee` can be null.
Fixes #96498.
Commit: bd488c1247715f8d698b9b26611ff7f869c824cc
https://github.com/llvm/llvm-project/commit/bd488c1247715f8d698b9b26611ff7f869c824cc
Author: Craig Topper <craig.topper at sifive.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/lib/TargetParser/RISCVISAInfo.cpp
Log Message:
-----------
[RISCV] Use StringRef::starts_with(char) instead of starts_with(StringRef). NFC
Commit: 8599629d390270b21c97f8b7ad2d97a344211d05
https://github.com/llvm/llvm-project/commit/8599629d390270b21c97f8b7ad2d97a344211d05
Author: paperchalice <liujunchang97 at outlook.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/CodeGen/MachinePostDominators.h
M llvm/include/llvm/Passes/MachinePassRegistry.def
M llvm/lib/CodeGen/MachinePostDominators.cpp
M llvm/lib/Passes/PassBuilder.cpp
Log Message:
-----------
[CodeGen][NewPM] Port machine post dominator tree analysis to new pass manager (#96378)
Follows #95879.
Commit: 62d44fbd50e28a6cfe479fc07164b8c970f049c6
https://github.com/llvm/llvm-project/commit/62d44fbd50e28a6cfe479fc07164b8c970f049c6
Author: Fangrui Song <i at maskray.me>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
M llvm/lib/MC/MCDwarf.cpp
Log Message:
-----------
[MC] Remove setUseAssemblerInfoForParsing(false) workarounds
This reverts commit 245491a9f384e4c53421196533c2a2b693efaf8d ("[MC] Disable MCAssembler based constant folding for DwarfDebug")
and cb09b5f3d53e5b7b4452bb3db78dca79fc9b3f17 ("[MC] Disable MCAssembler based constant folding for compact unwind and emitJumpTableEntry").
Checking the relative order of FA and FB is now faster due to
de19f7b6d46f1c38e10e604154f0fdaaffde9ebd ("[MC] Replace fragment ilist with singly-linked lists").
Commit: f1e0657d144f5a3cfef4b625d0f875f4dacd21d1
https://github.com/llvm/llvm-project/commit/f1e0657d144f5a3cfef4b625d0f875f4dacd21d1
Author: Matthias Springer <me at m-sp.org>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M mlir/docs/DialectConversion.md
M mlir/include/mlir/Transforms/DialectConversion.h
M mlir/lib/Transforms/Utils/DialectConversion.cpp
M mlir/test/Transforms/test-legalize-type-conversion.mlir
Log Message:
-----------
[mlir][Transforms] Dialect conversion: Simplify handling of dropped arguments (#96207)
This commit simplifies the handling of dropped arguments and updates
some dialect conversion documentation that is outdated.
When converting a block signature, a `BlockTypeConversionRewrite` object
and potentially multiple `ReplaceBlockArgRewrite` are created. During
the "commit" phase, uses of the old block arguments are replaced with
the new block arguments, but the old implementation was written in an
inconsistent way: some block arguments were replaced in
`BlockTypeConversionRewrite::commit` and some were replaced in
`ReplaceBlockArgRewrite::commit`. The new
`BlockTypeConversionRewrite::commit` implementation is much simpler and
no longer modifies any IR; that is done only in `ReplaceBlockArgRewrite`
now. The `ConvertedArgInfo` data structure is no longer needed.
To that end, materializations of dropped arguments are now built in
`applySignatureConversion` instead of `materializeLiveConversions`; the
latter function no longer has to deal with dropped arguments.
Other minor improvements:
- Improve variable name: `origOutputType` -> `origArgType`. Add an
assertion to check that this field is only used for argument
materializations.
- Add more comments to `applySignatureConversion`.
Note: Error messages around failed materializations for dropped basic
block arguments changed slightly. That is because those materializations
are now built in `legalizeUnresolvedMaterialization` instead of
`legalizeConvertedArgumentTypes`.
This commit is in preparation of decoupling argument/source/target
materializations from the dialect conversion.
Commit: 41f8e6d3ea57315e619ab9f71e1663095f4ed59d
https://github.com/llvm/llvm-project/commit/41f8e6d3ea57315e619ab9f71e1663095f4ed59d
Author: Craig Topper <craig.topper at sifive.com>
Date: 2024-06-24 (Mon, 24 Jun 2024)
Changed paths:
M llvm/test/CodeGen/RISCV/GlobalISel/instruction-select/fp-arith.mir
Log Message:
-----------
[RISCV][GISel] Fix test case order in fp-arith.mir. NFC
The fadd_f64 test was in the middle of some f32 tests.
Commit: 4c91b49bab0728d4bc136aa33c4aeb4e8ea37d01
https://github.com/llvm/llvm-project/commit/4c91b49bab0728d4bc136aa33c4aeb4e8ea37d01
Author: Jie Fu <jiefu at tencent.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang-tools-extra/clangd/Format.cpp
Log Message:
-----------
[clangd] Fix the build broken (NFC)
/llvm-project/clang-tools-extra/clangd/Format.cpp:284:11:
error: no member named 'KeepEmptyLinesAtTheStartOfBlocks' in 'clang::format::FormatStyle'
Style.KeepEmptyLinesAtTheStartOfBlocks = true;
~~~~~ ^
1 error generated.
Commit: efa8463ab90147aacd4647eb7715763978235890
https://github.com/llvm/llvm-project/commit/efa8463ab90147aacd4647eb7715763978235890
Author: David Green <david.green at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Vectorize/VectorCombine.cpp
M llvm/test/Transforms/PhaseOrdering/AArch64/interleavevectorization.ll
M llvm/test/Transforms/VectorCombine/AArch64/shuffletoidentity-concat.ll
M llvm/test/Transforms/VectorCombine/AArch64/shuffletoidentity.ll
Log Message:
-----------
[VectorCombine] Add free concats to shuffleToIdentity. (#94954)
This is another relatively small adjustment to shuffleToIdentity, which
has had a few knock-on effects that required a few more changes. It
attempts to detect free concats, which will be legalized to multiple
vector operations. For example, if the lanes are
'[a[0], a[1], b[0], b[1]]' and a and b are v2f64 under AArch64.
In order to do this:
- isFreeConcat detects whether the input has piece-wise identities from
multiple inputs that can become a concat.
- A tree of concat shuffles is created to concatenate the input values
into a single vector. This is a little different to most other inputs as
they are created from multiple values that are being combined together,
and we cannot rely on the Lane0 insert location always being valid.
- The insert location is changed to the original location instead of
updating per item, which ensures it is valid due to the order in which
we visit and create items.
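
The "piece-wise identities from multiple inputs" check can be pictured with a small toy model. This is only a sketch under stated assumptions, not the code in VectorCombine.cpp: `Lane`, `isConcatOfIdentities` and the `SourceWidth` parameter are hypothetical names, and the real isFreeConcat also has to decide whether the concat is actually free on the target, which is not modelled here.

```cpp
#include <cstddef>
#include <utility>
#include <vector>

// One output lane: (index of the source vector, lane within that source).
// For the example in the text, a is source 0 and b is source 1.
using Lane = std::pair<int, int>;

// Toy check (hypothetical helper): returns true if Lanes consists of whole
// chunks of SourceWidth lanes, where each chunk reads lanes 0..SourceWidth-1
// of a single source in order, i.e. the result is a plain concatenation.
bool isConcatOfIdentities(const std::vector<Lane> &Lanes,
                          std::size_t SourceWidth) {
  if (SourceWidth == 0 || Lanes.empty() || Lanes.size() % SourceWidth != 0)
    return false;
  for (std::size_t I = 0; I < Lanes.size(); ++I) {
    const std::size_t Offset = I % SourceWidth;
    const std::size_t ChunkStart = I - Offset;
    // Every lane in a chunk must come from the same source vector...
    if (Lanes[I].first != Lanes[ChunkStart].first)
      return false;
    // ...and must sit at the same position it had in that source.
    if (Lanes[I].second != static_cast<int>(Offset))
      return false;
  }
  return true;
}
```

With two-lane sources, the lanes '[a[0], a[1], b[0], b[1]]' pass this check, while '[a[0], b[1], b[0], a[1]]' do not.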
Commit: f01958137f358f4fd3e71636b187db14a37a5853
https://github.com/llvm/llvm-project/commit/f01958137f358f4fd3e71636b187db14a37a5853
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/ADT/SmallPtrSet.h
M llvm/unittests/ADT/SmallPtrSetTest.cpp
Log Message:
-----------
[SmallPtrSet] Add remove_if() method (#96468)
Add remove_if() method, similar to the one already present on SetVector.
It is intended to replace the following pattern:
  for (Foo *Ptr : Set)
    if (Pred(Ptr))
      Set.erase(Ptr);
With:
Set.remove_if(Pred);
This pattern is commonly used for set intersection, where `Pred` is
something like `!OtherSet.contains(Ptr)`.
The implementation provided here is a bit more efficient than the naive
loop, because it does not require looking up the bucket during the
erase() operation again.
However, my actual motivation for this is to have a way to perform this
operation without relying on the current `std::set`-style guarantee that
erase() does not invalidate iterators. I'd like to stop making use of
tombstones in the small regime, which will make insertion operations a
good bit more efficient. However, this will invalidate iterators during
erase().
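
To illustrate the iterator-invalidation point without depending on LLVM headers, here is a minimal sketch written against `std::unordered_set` rather than SmallPtrSet. `removeIf` and `intersectInPlace` are hypothetical stand-ins, not the method added by this commit; the idea is only that a predicate-based removal keeps the iteration inside the container-aware helper, so callers never hold an iterator across an erase().

```cpp
#include <unordered_set>

// Hypothetical stand-in for a remove_if() style helper. It relies on
// erase(iterator) returning the next valid iterator, so the loop never
// dereferences an iterator that erase() has invalidated.
// Returns true if at least one element was removed.
template <typename SetT, typename PredT>
bool removeIf(SetT &Set, PredT Pred) {
  bool Removed = false;
  for (auto It = Set.begin(); It != Set.end();) {
    if (Pred(*It)) {
      It = Set.erase(It);
      Removed = true;
    } else {
      ++It;
    }
  }
  return Removed;
}

// Set intersection expressed as "remove everything that is not in Other",
// which is the usage pattern the commit message mentions.
template <typename SetT>
void intersectInPlace(SetT &Set, const SetT &Other) {
  removeIf(Set, [&Other](const typename SetT::value_type &Elem) {
    return Other.count(Elem) == 0;
  });
}
```

Callers written this way do not need to change if the container later stops guaranteeing that erase() keeps other iterators valid.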
Commit: fa20184a8f336e4154f2ffeeeb8a538dc9462d9a
https://github.com/llvm/llvm-project/commit/fa20184a8f336e4154f2ffeeeb8a538dc9462d9a
Author: Chuanqi Xu <yedeng.yd at linux.alibaba.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/lib/Serialization/ASTWriter.cpp
A clang/test/Modules/no-external-identifier-id.cppm
A clang/test/Modules/no-external-type-id.cppm
Log Message:
-----------
[C++20] [Modules] [Serialization] Don't reuse type ID and identifier ID from imported modules
To support the no-transitive-change model for named modules, we can't
reuse type IDs and identifier IDs from imported modules arbitrarily. The
theory behind the no-transitive-change model is that a user of a named
module can only access the indirectly imported decls via the directly
imported module, so it is possible to control what matters to users when
writing the module. It would be unsafe if users could reuse the type IDs
and identifier IDs from the indirectly imported modules without going
through the directly imported modules.
So in this patch, we don't reuse the type ID and identifier ID in the
AST writer to avoid the problematic case.
Commit: 8153773b23032177546944ec2524dce131b8a46e
https://github.com/llvm/llvm-project/commit/8153773b23032177546944ec2524dce131b8a46e
Author: Timm Bäder <tbaeder at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/lib/AST/Interp/Pointer.cpp
M clang/test/AST/Interp/literals.cpp
Log Message:
-----------
[clang][Interp] Fix returning primitive non-blockpointers
We can't deref() them, so return false here.
Commit: 174f80c6030f9bc96df6ae8daeb4d6bce3f36fbb
https://github.com/llvm/llvm-project/commit/174f80c6030f9bc96df6ae8daeb4d6bce3f36fbb
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/Support/GenericDomTreeConstruction.h
Log Message:
-----------
[DomTree] Avoid duplicate hash lookups in runDFS() (NFCI) (#96460)
runDFS() currently performs three hash table lookups. One in the main
loop, one when checking whether a successor has already been visited and
another when adding parent and reverse children to the successor.
We can avoid the two additional lookups by making the parent number part
of the stack, and then making the parent / reverse children update part
of the main loop.
The main loop already has a check for already visited nodes, so we don't
have to check this in advance -- we can simply push the node to the
worklist and skip it later.
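
A minimal sketch of the worklist shape described above, assuming a toy graph of integer node ids. This is not the code in GenericDomTreeConstruction.h: `NodeInfo`, `numberNodes` and the `Succs` adjacency list are hypothetical, and only the DFS number, parent and reverse-children bookkeeping are modelled.

```cpp
#include <cstddef>
#include <unordered_map>
#include <utility>
#include <vector>

// Per-node DFS bookkeeping (toy version).
struct NodeInfo {
  unsigned DFSNum = 0;                   // 0 means "not visited yet".
  unsigned Parent = 0;                   // DFS number of the DFS-tree parent.
  std::vector<unsigned> ReverseChildren; // DFS numbers of nodes with an edge
                                         // into this one.
};

// Succs[N] lists the successors of node N. Info is a hash table keyed by
// node id, standing in for the node-to-info map that the DFS consults.
std::unordered_map<int, NodeInfo>
numberNodes(const std::vector<std::vector<int>> &Succs, int Root) {
  std::unordered_map<int, NodeInfo> Info;
  unsigned LastNum = 0;
  // Each entry carries the DFS number of the node that pushed it, so the
  // parent does not have to be looked up again. The root uses 0 as a
  // placeholder parent number.
  std::vector<std::pair<int, unsigned>> WorkList;
  WorkList.push_back({Root, 0u});
  while (!WorkList.empty()) {
    const std::pair<int, unsigned> Entry = WorkList.back();
    WorkList.pop_back();
    const int Node = Entry.first;
    const unsigned ParentNum = Entry.second;

    NodeInfo &NI = Info[Node]; // The only hash lookup for this entry.
    NI.ReverseChildren.push_back(ParentNum);
    if (NI.DFSNum != 0)
      continue; // Already visited: the edge is recorded, nothing else to do.

    NI.Parent = ParentNum;
    NI.DFSNum = ++LastNum;
    // Push successors unconditionally; visited ones are skipped when popped.
    for (int Succ : Succs[static_cast<std::size_t>(Node)])
      WorkList.push_back({Succ, NI.DFSNum});
  }
  return Info;
}
```

The trade-off is a slightly larger worklist (already visited successors are pushed and later skipped) in exchange for a single table lookup per popped entry.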
Commit: 01fb5290eb86f056e6ba8757011273bcb7f4f2ee
https://github.com/llvm/llvm-project/commit/01fb5290eb86f056e6ba8757011273bcb7f4f2ee
Author: Oleksandr "Alex" Zinenko <git at ozinenko.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M utils/bazel/llvm-project-overlay/mlir/BUILD.bazel
Log Message:
-----------
Fix bazel after 18cf1cd92b554ba0b870c6a2223ea4d0d3c6dd21
mlir-config.h is included but not listed in dependencies
Commit: 7f1a74429dfd62a410d4b51d2e75d3677429a51a
https://github.com/llvm/llvm-project/commit/7f1a74429dfd62a410d4b51d2e75d3677429a51a
Author: Björn Pettersson <bjorn.a.pettersson at ericsson.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/CodeGen/MachineSSAUpdater.cpp
A llvm/test/CodeGen/AArch64/taildup-ssa-update-pr62712.mir
Log Message:
-----------
[TailDup][MachineSSAUpdater] Let RewriteUse insert a COPY when needed (#95553)
When running early-tailduplication we've seen problems with machine
verifier errors due to register class mismatches after doing the machine
SSA updates.
A typical scenario is that there is a PHI node and another instruction
that uses the same vreg:
  %othervreg:otherclass = PHI %vreg:origclass, %bb
  MInstr %vreg:origclass
but then after TailDuplicator::tailDuplicateAndUpdate we get
  %othervreg:otherclass = PHI %vreg:origclass, %bb, ...
  MInstr %othervreg:otherclass
Such rewrites are only valid if 'otherclass' is equal to (or a subclass
of) 'origclass'.
The solution here is based on adding a COPY instruction to make sure we
satisfy constraints given by 'MInstr' in the example. So if 'otherclass'
isn't equal to (or a subclass of) 'origclass' we insert a copy after the
PHI like this:
  %othervreg:otherclass = PHI %vreg:origclass, %bb, ...
  %newvreg:origclass = COPY %othervreg:otherclass
  MInstr %newvreg:origclass
A special case is when it is possible to constrain the register class
instead of inserting a COPY. We currently prefer to constrain the
register class instead of inserting a COPY, even if it is a bit unclear
if that always is better (considering register pressure for the
constrained class etc.).
Fixes: https://github.com/llvm/llvm-project/issues/62712
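
The three outcomes described above (use the new vreg directly, constrain its class, or insert a COPY) can be sketched as a small decision function. This is a toy model under loud assumptions, not the MachineSSAUpdater code: a register class is represented as a bitmask of allowed physical registers, "subclass" means "subset", and "can be constrained" is approximated as "the two masks overlap". `RegClassMask`, `RewriteAction` and `chooseRewrite` are hypothetical names.

```cpp
#include <cstdint>

// Toy model: a register class is the set of physical registers it allows,
// stored as a bitmask (assumption made for illustration only).
using RegClassMask = std::uint32_t;

enum class RewriteAction { UseDirectly, ConstrainClass, InsertCopy };

// Sub is equal to or a subclass of Super if it allows no register that
// Super forbids.
bool isSubClassOrEqual(RegClassMask Sub, RegClassMask Super) {
  return (Sub & ~Super) == 0;
}

// NewValueClass is the class of the value produced by the PHI ('otherclass'
// in the example); RequiredClass is what the using instruction expects
// ('origclass').
RewriteAction chooseRewrite(RegClassMask NewValueClass,
                            RegClassMask RequiredClass) {
  if (isSubClassOrEqual(NewValueClass, RequiredClass))
    return RewriteAction::UseDirectly;
  // Preferred over a COPY, mirroring the preference stated in the message.
  // Toy criterion: some register satisfies both classes.
  if ((NewValueClass & RequiredClass) != 0)
    return RewriteAction::ConstrainClass;
  return RewriteAction::InsertCopy;
}
```

For example, with `RequiredClass = 0x3`, a new value of class `0x6` could be constrained to the shared register, while a class of `0xC` shares nothing and would need the COPY.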
Commit: 11e12bd82b0358de053cfd0dc762e9b30948c8a1
https://github.com/llvm/llvm-project/commit/11e12bd82b0358de053cfd0dc762e9b30948c8a1
Author: Mitch Phillips <31459023+hctim at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M compiler-rt/lib/gwp_asan/guarded_pool_allocator.cpp
M compiler-rt/lib/gwp_asan/platform_specific/guarded_pool_allocator_fuchsia.cpp
M compiler-rt/lib/gwp_asan/platform_specific/guarded_pool_allocator_posix.cpp
M compiler-rt/lib/gwp_asan/utilities.h
Log Message:
-----------
[NFC] [GWP-ASan] Rename Check() to check() (#96605)
Change this function to be LLVM-style in name.
Commit: d78211912391b724a064ef017dbf239eff10380a
https://github.com/llvm/llvm-project/commit/d78211912391b724a064ef017dbf239eff10380a
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/IR/Analysis.h
Log Message:
-----------
[Analysis] Use SmallPtrSet::remove_if (NFC)
Commit: 2d84e0ffefda62da6193d339633dbb55654f7b5d
https://github.com/llvm/llvm-project/commit/2d84e0ffefda62da6193d339633dbb55654f7b5d
Author: Anton Sidorenko <anton.sidorenko at syntacore.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/RISCV/RISCV.td
M llvm/lib/Target/RISCV/RISCVProcessors.td
A llvm/lib/Target/RISCV/RISCVSchedSyntacoreSCR3.td
A llvm/test/tools/llvm-mca/RISCV/SyntacoreSCR/SCR3-ALU.s
A llvm/test/tools/llvm-mca/RISCV/SyntacoreSCR/SCR3-LSU.s
Log Message:
-----------
[RISCV] Add scheduling model for Syntacore SCR3 (#95427)
Syntacore SCR3 is a microcontroller-class processor core. Overview:
https://syntacore.com/products/scr3
Co-authored-by: Dmitrii Petrov <dmitrii.petrov at syntacore.com>
Commit: 8395f9cecd34af8a79c96e661e46a80d0d471fb1
https://github.com/llvm/llvm-project/commit/8395f9cecd34af8a79c96e661e46a80d0d471fb1
Author: Pavel Labath <pavel at labath.sk>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M lldb/source/Plugins/SymbolFile/DWARF/DWARFASTParserClang.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARF.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARF.h
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDebugMap.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDebugMap.h
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDwo.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDwo.h
Log Message:
-----------
[lldb/DWARF] Remove parsing recursion when searching for definition DIEs (#96484)
If ParseStructureLikeDIE (or ParseEnum) encountered a declaration DIE,
it would call FindDefinitionTypeForDIE. This returned a fully formed
type, which it achieved by recursing back into ParseStructureLikeDIE
with the definition DIE.
This obscured the control flow and caused us to repeat some work (e.g.
the UniqueDWARFASTTypeMap lookup), but it mostly worked until we tried
to delay the definition search in #90663. After that patch, the two
ParseStructureLikeDIE calls were no longer recursive, but rather the
second call happened as a part of the CompleteType() call. This opened
the door to inconsistencies, as the second ParseStructureLikeDIE call
was not aware it was called to process a definition die for an existing
type.
To make that possible, this patch removes the recursive type resolution
from this function, and leaves just the "find definition die"
functionality. After finding the definition DIE, we just go back to the
original ParseStructureLikeDIE call, and have it finish the parsing
process with the new DIE.
While this patch is motivated by the work on delaying the definition
searching, I believe it is also useful on its own.
Commit: f6aa50873463ebd9a459b7ccd4989460175a6e7f
https://github.com/llvm/llvm-project/commit/f6aa50873463ebd9a459b7ccd4989460175a6e7f
Author: Vyacheslav Levytskyy <vyacheslav.levytskyy at intel.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/SPIRV/SPIRVBuiltins.cpp
M llvm/lib/Target/SPIRV/SPIRVGlobalRegistry.cpp
M llvm/lib/Target/SPIRV/SPIRVGlobalRegistry.h
M llvm/lib/Target/SPIRV/SPIRVInstructionSelector.cpp
M llvm/lib/Target/SPIRV/SPIRVUtils.cpp
A llvm/test/CodeGen/SPIRV/var-uniform-const.ll
Log Message:
-----------
[SPIR-V]: Fix creation of constants of array types in SPIRV Backend (#96514)
This PR fixes https://github.com/llvm/llvm-project/issues/96513.
Array-type constants were created incorrectly: instead of creating
[1, 1, 1] or [1, 1, 1, 1, 1, ....] constants, the same [1] constant was
always created, replacing the original composite constants. As a result,
only one such constant could exist in the code without invalid code
being emitted; a second constant would eventually be rewritten to the
first, because the key used to look up both was a single-element array
(like [1]).
This PR fixes the issue and removes an unneeded copy/pasted clone of the
function that creates an array constant.
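
The kind of lookup-key collision described above can be reproduced in a few lines. This is a toy model, not the SPIR-V backend code: `ArrayConstant`, `CollidingCache` and `LengthAwareCache` are hypothetical, and the only point is that a key which ignores the element count makes two different composite constants share one cache entry.

```cpp
#include <cstddef>
#include <map>
#include <utility>
#include <vector>

using ArrayConstant = std::vector<int>;

// Toy cache whose key is only the element value. An array of three 1s and
// an array of five 1s map to the same entry, so the second request silently
// gets the first constant back.
class CollidingCache {
  std::map<int, ArrayConstant> Cache;

public:
  const ArrayConstant &getSplat(int Elem, std::size_t Count) {
    auto It = Cache.find(Elem);
    if (It == Cache.end())
      It = Cache.emplace(Elem, ArrayConstant(Count, Elem)).first;
    return It->second;
  }
};

// Toy cache whose key also includes the element count, so differently
// sized constants stay distinct.
class LengthAwareCache {
  std::map<std::pair<int, std::size_t>, ArrayConstant> Cache;

public:
  const ArrayConstant &getSplat(int Elem, std::size_t Count) {
    const std::pair<int, std::size_t> Key(Elem, Count);
    auto It = Cache.find(Key);
    if (It == Cache.end())
      It = Cache.emplace(Key, ArrayConstant(Count, Elem)).first;
    return It->second;
  }
};
```

Requesting [1, 1, 1] and then a five-element array of 1s from `CollidingCache` returns the three-element constant both times; `LengthAwareCache` returns two distinct constants.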
Commit: 0bd9c49a295829ed44e838c4d54cc905662a1afa
https://github.com/llvm/llvm-project/commit/0bd9c49a295829ed44e838c4d54cc905662a1afa
Author: Lukacma <Marian.Lukac at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp
A llvm/test/Transforms/InstCombine/AArch64/sve-intrinsic-comb-no-active-lanes-loads.ll
Log Message:
-----------
[AArch64][SVE] optimisation for SVE load intrinsics with no active lanes (#95269)
This patch extends #73964 and adds optimisation of load SVE intrinsics
when predicate is zero.
Commit: 919b1ecafc010379eff88368b050068223a01f99
https://github.com/llvm/llvm-project/commit/919b1ecafc010379eff88368b050068223a01f99
Author: Abid Qadeer <haqadeer at amd.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M flang/lib/Optimizer/Transforms/DebugTypeGenerator.cpp
A flang/test/Integration/debug-ptr-type.f90
A flang/test/Transforms/debug-ptr-type.fir
Log Message:
-----------
[flang][debug] Support pointer type. (#96153)
The handling of `PointerType` is similar to `HeapType`. The only
difference is that the allocated flag is generated for `HeapType` and
the associated flag for `PointerType`. The tests for pointer to
allocatable strings are disabled for now. I will enable them once #95906
is merged.
The debugging in GDB looks like this:
integer, pointer :: par2(:)
integer, target, allocatable :: ar2(:)
integer, target :: sc
integer, pointer :: psc
allocate(ar2(4))
par2 => ar2
psc => sc
19 par2 => ar2
(gdb) p par2
$3 = <not associated>
(gdb) n
20 do i=1,5
(gdb) p par2
$4 = (0, 0, 0, 0)
(gdb) ptype par2
type = integer (4)
(gdb) p sc
$5 = 3
(gdb) p psc
$6 = (PTR TO -> ( integer )) 0x7fffffffda24
(gdb) p *psc
$7 = 3
Commit: 5feb32ba929f9e517c530217cabb09d1d734a763
https://github.com/llvm/llvm-project/commit/5feb32ba929f9e517c530217cabb09d1d734a763
Author: Vikram Hegde <115221833+vikramRH at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/lib/CodeGen/CGBuiltin.cpp
M clang/test/CodeGenOpenCL/builtins-amdgcn.cl
M llvm/docs/AMDGPUUsage.rst
M llvm/include/llvm/IR/IntrinsicsAMDGPU.td
M llvm/lib/Target/AMDGPU/AMDGPUAtomicOptimizer.cpp
M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp
M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.h
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/lib/Target/AMDGPU/SIInstructions.td
M llvm/lib/Target/AMDGPU/VOP1Instructions.td
M llvm/lib/Target/AMDGPU/VOP2Instructions.td
M llvm/test/Analysis/UniformityAnalysis/AMDGPU/intrinsics.ll
M llvm/test/CodeGen/AMDGPU/GlobalISel/atomic_optimizations_mul_one.ll
M llvm/test/CodeGen/AMDGPU/atomic_optimization_split_dt_update.ll
A llvm/test/CodeGen/AMDGPU/convergence-laneops.ll
M llvm/test/CodeGen/AMDGPU/global-atomic-scan.ll
M llvm/test/CodeGen/AMDGPU/global_atomic_optimizer_fp_rtn.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_iterative_scan.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_iterative_scan_fp.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_optimizer_fp_no_rtn.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readfirstlane.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readfirstlane.ptr.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readlane.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readlane.ptr.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.writelane.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.writelane.ptr.ll
M llvm/test/Transforms/InstCombine/AMDGPU/amdgcn-intrinsics.ll
Log Message:
-----------
[AMDGPU] Extend readlane, writelane and readfirstlane intrinsic lowering for generic types (#89217)
This patch is intended to be the first of a series with the end goal of
adapting the atomic optimizer pass to support i64 and f64 operations
(along with removing all unnecessary bitcasts). This legalizes 64-bit
readlane, writelane and readfirstlane ops pre-ISel.
---------
Co-authored-by: vikramRH <vikhegde at amd.com>
Commit: f9795f34a6b358bdf02389ba01765924a2ad0b9f
https://github.com/llvm/llvm-project/commit/f9795f34a6b358bdf02389ba01765924a2ad0b9f
Author: Serge Pavlov <sepavloff at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/CodeGen/GlobalISel/MachineIRBuilder.h
M llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp
M llvm/unittests/CodeGen/GlobalISel/MachineIRBuilderTest.cpp
Log Message:
-----------
[GlobalISel] Add build methods for FP environment intrinsics (#96607)
This change adds methods like buildGetFPEnv and similar for opcodes that
manipulate the floating-point state.
Commit: 16d02cd1c5df57ae317b115333cf75b347c90d1d
https://github.com/llvm/llvm-project/commit/16d02cd1c5df57ae317b115333cf75b347c90d1d
Author: Nikolas Klauser <nikolasklauser at berlin.de>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/include/__type_traits/is_nothrow_destructible.h
M libcxx/test/std/utilities/meta/meta.unary/meta.unary.prop/is_nothrow_destructible.pass.cpp
Log Message:
-----------
[libc++] Use __is_nothrow_destructible (#95766)
This changes the behaviour in C++03 mode because we'll now use the
builtin on Clang, but I don't think that's much of a problem.
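
As a quick reminder of what the trait reports, here is a small compile-time sketch. It uses the public `std::is_nothrow_destructible` from `<type_traits>`, not the `__is_nothrow_destructible` builtin named in the commit title, and `Quiet` and `Loud` are hypothetical types introduced only for illustration.

```cpp
#include <type_traits>

// Destructors are implicitly noexcept unless declared otherwise.
struct Quiet {
  ~Quiet() = default;
};

// A destructor that is explicitly allowed to throw.
struct Loud {
  ~Loud() noexcept(false) {}
};

static_assert(std::is_nothrow_destructible<int>::value,
              "scalars are nothrow destructible");
static_assert(std::is_nothrow_destructible<Quiet>::value,
              "implicitly noexcept destructor");
static_assert(!std::is_nothrow_destructible<Loud>::value,
              "destructor declared noexcept(false)");
static_assert(std::is_nothrow_destructible<Quiet &>::value,
              "reference types count as nothrow destructible");
static_assert(!std::is_nothrow_destructible<void>::value,
              "void is not destructible at all");
```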
Commit: 29f4a0561a03f4760a876e39d443c64ea7ee9e22
https://github.com/llvm/llvm-project/commit/29f4a0561a03f4760a876e39d443c64ea7ee9e22
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/ADT/SetOperations.h
Log Message:
-----------
[SetOperations] clang-format header (NFC)
This header used three-space indentation in a number of places.
Reformat it completely.
Commit: f09b02484b865e4e6e2243ee2ff59d70b60d28e4
https://github.com/llvm/llvm-project/commit/f09b02484b865e4e6e2243ee2ff59d70b60d28e4
Author: Haojian Wu <hokein.wu at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/lib/Sema/SemaTemplate.cpp
Log Message:
-----------
[clang] Remove a stale FIXME
This FIXME has already been addressed in #89358
Commit: 9d45077df92a64ff8db5b249ac2f2d45e82ab73c
https://github.com/llvm/llvm-project/commit/9d45077df92a64ff8db5b249ac2f2d45e82ab73c
Author: Florian Hahn <flo at fhahn.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Vectorize/LoopVectorizationPlanner.h
M llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
M llvm/lib/Transforms/Vectorize/VPlan.h
Log Message:
-----------
[VPlan] Iterate over VPlans to get VFs to compute cost for (NFCI).
Instead of iterating over all VFs when computing costs, simply iterate
over the VFs available in the created VPlans.
Split off from https://github.com/llvm/llvm-project/pull/92555.
This also prepares for moving the check if any vector instructions will
be generated to be based on VPlan, to unblock recommitting
https://github.com/llvm/llvm-project/pull/92555.
Commit: eeb0884e6696ec618feb2181a432d10f66d4e840
https://github.com/llvm/llvm-project/commit/eeb0884e6696ec618feb2181a432d10f66d4e840
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Utils/LoopUnrollRuntime.cpp
M llvm/test/Transforms/LoopUnroll/ARM/multi-blocks.ll
M llvm/test/Transforms/LoopUnroll/runtime-exit-phi-scev-invalidation.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop-at-most-two-exits.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop-multiple-exits.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop5.ll
M llvm/test/Transforms/LoopUnroll/runtime-multiexit-heuristic.ll
M llvm/test/Transforms/LoopUnroll/runtime-unroll-remainder.ll
M llvm/test/Transforms/LoopUnroll/tripcount-overflow.ll
M llvm/test/Transforms/LoopUnrollAndJam/unroll-and-jam.ll
M llvm/test/Transforms/LoopVectorize/X86/float-induction-x86.ll
Log Message:
-----------
[LoopUnroll] Use poison instead of undef for preheader value
Commit: a2e915704ffb040e544b13bf565ecad3aa661d98
https://github.com/llvm/llvm-project/commit/a2e915704ffb040e544b13bf565ecad3aa661d98
Author: Florian Hahn <flo at fhahn.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/Transforms/LoopVectorize/create-induction-resume.ll
Log Message:
-----------
[LV] Make create-induction-resume.ll more robust by adding store.
Without the store, the vector loop body is empty. Add a store to avoid
that, while not impacting the induction resume values that are created.
Commit: 37c736e0359770b6a2d2139d84f9df400f4b1049
https://github.com/llvm/llvm-project/commit/37c736e0359770b6a2d2139d84f9df400f4b1049
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Utils/LoopUnrollRuntime.cpp
M llvm/test/DebugInfo/unrolled-loop-remainder.ll
M llvm/test/Transforms/LoopUnroll/runtime-exit-phi-scev-invalidation.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop-multiple-exits.ll
M llvm/test/Transforms/LoopUnroll/tripcount-overflow.ll
Log Message:
-----------
[LoopUnroll] Use poison instead of undef for another preheader value
Commit: cc8fdd60c2f791dba70c9e458173640d7da836e3
https://github.com/llvm/llvm-project/commit/cc8fdd60c2f791dba70c9e458173640d7da836e3
Author: Andrei Safronov <safronov at espressif.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/Xtensa/XtensaAsmPrinter.cpp
M llvm/lib/Target/Xtensa/XtensaAsmPrinter.h
M llvm/lib/Target/Xtensa/XtensaISelLowering.cpp
M llvm/lib/Target/Xtensa/XtensaISelLowering.h
M llvm/lib/Target/Xtensa/XtensaInstrInfo.td
M llvm/lib/Target/Xtensa/XtensaOperators.td
A llvm/test/CodeGen/Xtensa/blockaddress.ll
A llvm/test/CodeGen/Xtensa/brcc.ll
A llvm/test/CodeGen/Xtensa/indirectbr.ll
A llvm/test/CodeGen/Xtensa/jumpt.ll
Log Message:
-----------
[Xtensa] Lower GlobalAddress/BlockAddress/JumpTable (#95256)
This patch implements lowering of GlobalAddress, BlockAddress,
JumpTable and BR_JT. The patch also adds legal support for the BR_CC
operation on the i32 type.
Commit: 4acc8eed94e3f4e21bcf1dedca021cb6a086720c
https://github.com/llvm/llvm-project/commit/4acc8eed94e3f4e21bcf1dedca021cb6a086720c
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/Transforms/SCCP/2009-01-14-IPSCCP-Invoke.ll
M llvm/test/Transforms/SCCP/apint-ipsccp2.ll
M llvm/test/Transforms/SCCP/ipsccp-clear-returned.ll
M llvm/test/Transforms/SCCP/ipsccp-noundef.ll
M llvm/test/Transforms/SCCP/ipsccp-preserve-analysis.ll
M llvm/test/Transforms/SCCP/range-and-ip.ll
M llvm/test/Transforms/SCCP/solve-after-each-resolving-undefs-for-function.ll
M llvm/test/Transforms/SCCP/struct-arg-resolve-undefs.ll
Log Message:
-----------
[SCCP] Generate test checks (NFC)
Some of these are just old, while others previously did not use
UTC due to missing features that have since been implemented
(such as signature matching).
Commit: 16bb8c16aab32e2ee623a2b64d976548be247180
https://github.com/llvm/llvm-project/commit/16bb8c16aab32e2ee623a2b64d976548be247180
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/IPO/SCCP.cpp
M llvm/test/Transforms/FunctionSpecialization/track-return.ll
M llvm/test/Transforms/SCCP/2009-01-14-IPSCCP-Invoke.ll
M llvm/test/Transforms/SCCP/apint-ipsccp2.ll
M llvm/test/Transforms/SCCP/arg-count-mismatch.ll
M llvm/test/Transforms/SCCP/indirectbr.ll
M llvm/test/Transforms/SCCP/ip-constant-ranges.ll
M llvm/test/Transforms/SCCP/ip-ranges-phis.ll
M llvm/test/Transforms/SCCP/ip-ranges-select.ll
M llvm/test/Transforms/SCCP/ipsccp-basic.ll
M llvm/test/Transforms/SCCP/ipsccp-clear-returned.ll
M llvm/test/Transforms/SCCP/ipsccp-noundef.ll
M llvm/test/Transforms/SCCP/ipsccp-preserve-analysis.ll
M llvm/test/Transforms/SCCP/issue59602-assume-like-call-users.ll
M llvm/test/Transforms/SCCP/metadata.ll
M llvm/test/Transforms/SCCP/musttail-call.ll
M llvm/test/Transforms/SCCP/range-and-ip.ll
M llvm/test/Transforms/SCCP/range-attribute.ll
M llvm/test/Transforms/SCCP/remove-call-inst.ll
M llvm/test/Transforms/SCCP/resolvedundefsin-tracked-fn.ll
M llvm/test/Transforms/SCCP/return-constant.ll
M llvm/test/Transforms/SCCP/solve-after-each-resolving-undefs-for-function.ll
M llvm/test/Transforms/SCCP/struct-arg-resolve-undefs.ll
Log Message:
-----------
[SCCP] Use poison instead of undef when zapping returns
Commit: 35eef9f97f077a7cfa2beb9df857aa1713e4faa6
https://github.com/llvm/llvm-project/commit/35eef9f97f077a7cfa2beb9df857aa1713e4faa6
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Scalar/Reassociate.cpp
Log Message:
-----------
[Reassociate] Use poison instead of undef for dummy operands (NFCI)
These will be replaced later.
Commit: 65f9601fb1479507820ca8381bb472ec4e8a2eee
https://github.com/llvm/llvm-project/commit/65f9601fb1479507820ca8381bb472ec4e8a2eee
Author: Daniil Kovalev <dkovalev at accesssoftek.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M lld/ELF/InputFiles.cpp
M lld/ELF/InputFiles.h
Log Message:
-----------
[NFC][lld][ELF] Remove unused `sec` param of `ObjFile<ELFT>::getRelocTarget` (#96500)
Commit: bfad8757159a3def7c07d4f5ef23ed76fafdf441
https://github.com/llvm/llvm-project/commit/bfad8757159a3def7c07d4f5ef23ed76fafdf441
Author: hev <wangrui at loongson.cn>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/LoongArch/LoongArchISelLowering.cpp
M llvm/lib/Target/LoongArch/LoongArchISelLowering.h
M llvm/lib/Target/LoongArch/LoongArchInstrInfo.td
M llvm/test/CodeGen/LoongArch/machinelicm-address-pseudos.ll
M llvm/test/CodeGen/LoongArch/psabi-restricted-scheduling.ll
Log Message:
-----------
[LoongArch] Ensure PseudoLA* can be hoisted (#94723)
Since we mark the pseudos as mayLoad but do not provide any MMOs,
isSafeToMove conservatively returns false, stopping MachineLICM from
hoisting the instructions. PseudoLA_TLS_{LD,GD} does not actually expand
to a load, so stop marking that as mayLoad to allow it to be hoisted,
and for the others make sure to add MMOs during lowering to indicate
they're GOT loads and thus can be freely moved.
Commit: 9952e00af6c93e9807c3dfe076e99e7f5a77f675
https://github.com/llvm/llvm-project/commit/9952e00af6c93e9807c3dfe076e99e7f5a77f675
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/unittests/Transforms/Vectorize/VPlanTest.cpp
Log Message:
-----------
[VPlanTest] Use poison instead of undef for dummy values (NFC)
Commit: 68efc506f58776fdfcfc60707f8fe52584736d8a
https://github.com/llvm/llvm-project/commit/68efc506f58776fdfcfc60707f8fe52584736d8a
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/unittests/IR/VectorBuilderTest.cpp
Log Message:
-----------
[VectorBuilderTest] Use poison instead of undef for dummy values (NFC)
Commit: 9acb533c38be833ec1d8daa06e127a9de8f0a5ef
https://github.com/llvm/llvm-project/commit/9acb533c38be833ec1d8daa06e127a9de8f0a5ef
Author: Alex Voicu <alexandru.voicu at amd.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/include/clang/Basic/Cuda.h
M clang/lib/Basic/Cuda.cpp
M clang/lib/Basic/Targets/NVPTX.cpp
M clang/lib/CodeGen/CGOpenMPRuntimeGPU.cpp
M clang/lib/CodeGen/CodeGenModule.cpp
M clang/lib/Driver/Driver.cpp
M clang/lib/Driver/ToolChains/Clang.cpp
M clang/lib/Driver/ToolChains/HIPAMD.cpp
M clang/lib/Driver/ToolChains/HIPAMD.h
M clang/test/Driver/cuda-arch-translation.cu
M clang/test/Frontend/embed-bitcode.ll
M clang/test/Misc/target-invalid-cpu-note.c
M llvm/lib/Bitcode/Writer/BitcodeWriter.cpp
Log Message:
-----------
[clang][Driver] Add HIPAMD Driver support for AMDGCN flavoured SPIR-V (#95061)
This patch augments the HIPAMD driver to allow it to target AMDGCN
flavoured SPIR-V compilation. It's mostly straightforward, as we re-use
some of the existing SPIRV infra, however there are a few notable
additions:
- we introduce an `amdgcnspirv` offload arch, rather than relying on
using `generic` (this is already fairly overloaded) or simply using
`spirv` or `spirv64` (we'll want to use these to denote unflavoured
SPIRV, once we bring up that capability)
- initially it won't be possible to mix-in SPIR-V and concrete AMDGPU
targets, as it would require some relatively intrusive surgery in the
HIPAMD Toolchain and the Driver to deal with two triples
(`spirv64-amd-amdhsa` and `amdgcn-amd-amdhsa`, respectively)
- in order to retain user provided compiler flags and have them
available at JIT time, we rely on embedding the command line via
`-fembed-bitcode=marker`, which the bitcode writer had previously not
implemented for SPIRV; we only allow it conditionally for AMDGCN
flavoured SPIRV, and it is handled correctly by the Translator (it ends
up as a string literal)
Once the SPIRV BE is no longer experimental we'll switch to using that
rather than the translator. There's some additional work that'll come
via a separate PR around correctly piping through AMDGCN's
implementation of `printf`; for now we merely handle its flags
correctly.
Commit: a46a2c2b7d8daf68c793b6b96f1c5bc009aea62b
https://github.com/llvm/llvm-project/commit/a46a2c2b7d8daf68c793b6b96f1c5bc009aea62b
Author: Simon Pilgrim <llvm-dev at redking.me.uk>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/X86/X86ISelLowering.cpp
M llvm/lib/Target/X86/X86TargetTransformInfo.cpp
M llvm/test/Analysis/CostModel/X86/arith-int-codesize.ll
M llvm/test/Analysis/CostModel/X86/arith-int-latency.ll
M llvm/test/Analysis/CostModel/X86/arith-int-sizelatency.ll
M llvm/test/Analysis/CostModel/X86/arith-int.ll
M llvm/test/Analysis/CostModel/X86/rem-codesize.ll
M llvm/test/Analysis/CostModel/X86/rem-latency.ll
M llvm/test/Analysis/CostModel/X86/rem-sizelatency.ll
M llvm/test/Analysis/CostModel/X86/rem.ll
M llvm/test/Analysis/CostModel/X86/slm-arith-costs.ll
M llvm/test/CodeGen/X86/avx2-arith.ll
M llvm/test/CodeGen/X86/midpoint-int-vec-128.ll
M llvm/test/CodeGen/X86/midpoint-int-vec-256.ll
M llvm/test/CodeGen/X86/min-legal-vector-width.ll
M llvm/test/CodeGen/X86/pmul.ll
M llvm/test/CodeGen/X86/prefer-avx256-wide-mul.ll
M llvm/test/CodeGen/X86/vector-shuffle-combining-sse41.ll
Log Message:
-----------
[X86] Lower vXi8 multiplies using PMADDUBSW on SSSE3+ targets (#95690)
Extends https://github.com/llvm/llvm-project/pull/95403 to handle non-constant cases - we can avoid unpacks/extensions from vXi8 to vXi16 by using PMADDUBSW instead and truncating the vXi16 results back together.
Most targets benefit from performing this for non-constant cases - it's just Intel Core/SandyBridge era CPUs that might experience additional Port0/15 contention (but lower instruction count).
Fixes https://github.com/llvm/llvm-project/issues/90748
Commit: 5ed5d723db58f7556782427444642d0571cdf649
https://github.com/llvm/llvm-project/commit/5ed5d723db58f7556782427444642d0571cdf649
Author: Benjamin Maxwell <benjamin.maxwell at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M mlir/lib/Dialect/ArmSME/Transforms/VectorLegalization.cpp
M mlir/test/Dialect/ArmSME/vector-legalization.mlir
M mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/multi-tile-matmul.mlir
Log Message:
-----------
[mlir][ArmSME] Lower multi-tile stores to a single loop (#96187)
This adds a new pattern that can legalize a multi-tile transfer_write as
a single store loop. This is done as part of type decomposition as at
this level we know each tile write is disjoint, but that information is
lost after decomposition (without analysis to reconstruct it).
Example (pseudo-MLIR):
```
vector.transfer_write %vector, %dest[%y, %x], %mask
: vector<[16]x[8]xi16>, memref<?x?xi16>
```
Is rewritten to:
```
scf.for %slice_idx = %c0 to %c8_vscale step %c1 {
%upper_slice_mask = vector.extract %mask[%slice_idx] ─┐
: vector<[8]xi1> from vector<[16]x[8]xi1> |
%upper_slice = vector.extract %upper_tile[%slice_idx] |- Store upper tile
: vector<[8]xi16> from vector<[8]x[8]xi16> |
vector.transfer_write %upper_slice, |
%dest[%slice_idx + %y, %x], %upper_slice_mask |
: vector<[8]xi16>, memref<?x?xi16> ┘
%lower_slice_idx = %slice_idx + %c8_vscale ─┐
%lower_slice_mask = vector.extract %mask[%lower_slice_idx] |
: vector<[8]xi1> from vector<[16]x[8]xi1> |
%lower_slice = vector.extract %lower_tile[%slice_idx] |- Store lower
: vector<[8]xi16> from vector<[8]x[8]xi16> | tile
vector.transfer_write %lower_slice, |
%dest[%lower_slice_idx + %y, %x], %lower_slice_mask |
: vector<[8]xi16>, memref<?x?xi16> ┘
}
```
Commit: 6c9916d0d8a40034c5e0dfc157f146855e42520e
https://github.com/llvm/llvm-project/commit/6c9916d0d8a40034c5e0dfc157f146855e42520e
Author: Joseph Huber <huberjn at outlook.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libc/cmake/modules/prepare_libc_gpu_build.cmake
M llvm/runtimes/CMakeLists.txt
Log Message:
-----------
[libc] Configure CMAKE_REQUIRED_FLAGS so the GPU can use flag checks (#95424)
Summary:
This patch adds `CMAKE_REQUIRED_FLAGS` for the GPU build so checks like
`check_cxx_compiler_flags` work as expected. This is required because we
need to hack around the potential lack of `nvlink` and `ptxas` for NVPTX
targets and the fact that the AMDGPU target needs `-nogpulib` to avoid
errors on lack of ROCm. This makes a few of the checks pass and also
allows us to just check `-mcpu=native` for architecture detection
instead of finding the tools manually.
Commit: 0f111ba790a28056d42a0ae376371d7b694454ae
https://github.com/llvm/llvm-project/commit/0f111ba790a28056d42a0ae376371d7b694454ae
Author: Ramkumar Ramachandra <ramkumar.ramachandra at codasip.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/Analysis/LoopInfo.h
M llvm/lib/Analysis/LoopAccessAnalysis.cpp
M llvm/lib/Analysis/LoopInfo.cpp
M llvm/lib/Transforms/Scalar/LoopDistribute.cpp
M llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
A llvm/test/Analysis/LoopAccessAnalysis/debug-loc.ll
M llvm/test/Analysis/LoopAccessAnalysis/print-order.ll
M llvm/test/Analysis/LoopAccessAnalysis/scalable-vector-regression-tests.ll
A llvm/test/Transforms/LoopDistribute/debug-loc.ll
A llvm/test/Transforms/LoopDistribute/debug-print.ll
M llvm/test/Transforms/LoopVectorize/ARM/mve-hoist-runtime-checks.ll
M llvm/test/Transforms/LoopVectorize/runtime-checks-hoist.ll
Log Message:
-----------
LoopInfo: introduce Loop::getLocStr; unify debug output (#93051)
Introduce a Loop::getLocStr stolen from LoopVectorize's static function
getDebugLocString in order to have uniform debug output headers across
LoopVectorize, LoopAccessAnalysis, and LoopDistribute. The motivation
for this change is to have UpdateTestChecks recognize the headers and
automatically generate CHECK lines for debug output, with minimal
special-casing.
Commit: 8263bec5331863113c6163afdc3f93e063f20a07
https://github.com/llvm/llvm-project/commit/8263bec5331863113c6163afdc3f93e063f20a07
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
Log Message:
-----------
[SLP] Use poison instead of undef in reorderScalars() (#96619)
-1 mask elements are specified to return poison rather than undef
nowadays, so update the reorderScalars() implementation to match.
Commit: 952bdaaf79c1e5d7364160b21de0cd1295cdfbd8
https://github.com/llvm/llvm-project/commit/952bdaaf79c1e5d7364160b21de0cd1295cdfbd8
Author: Leandro Lupori <leandro.lupori at linaro.org>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M flang/include/flang/Lower/AbstractConverter.h
M flang/lib/Lower/Bridge.cpp
M flang/lib/Lower/OpenMP/ClauseProcessor.cpp
A flang/test/Lower/OpenMP/copyprivate2.f90
Log Message:
-----------
[flang][OpenMP] Fix copyprivate allocatable/pointer lowering (#95975)
The lowering of copyprivate clauses with allocatable or pointer
variables was incorrect. This happened because the values passed to
copyVar() are always wrapped in SymbolBox::Intrinsic, which
resulted in allocatable/pointer variables being handled as regular
ones.
This is fixed by providing to copyVar() the attributes of the
variables being copied, to make it possible to detect and handle
allocatable/pointer variables correctly.
Fixes #95801
Commit: c43664931378c589091fb169a6d3840474f5c324
https://github.com/llvm/llvm-project/commit/c43664931378c589091fb169a6d3840474f5c324
Author: Sander de Smalen <sander.desmalen at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AArch64/AArch64ISelDAGToDAG.cpp
M llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
M llvm/lib/Target/AArch64/AArch64InstrInfo.cpp
M llvm/lib/Target/AArch64/AArch64Subtarget.h
M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp
Log Message:
-----------
[AArch64] Remove all instances of the 'hasSVEorSME' interfaces. (#96543)
I've not added any new tests for these, because the original conditions
were wrong (they did not consider streaming mode) and we have tests for
the positive cases.
Commit: c5aa983f91f0543b43926a32d2f70bb53f9945a2
https://github.com/llvm/llvm-project/commit/c5aa983f91f0543b43926a32d2f70bb53f9945a2
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Analysis/InstructionSimplify.cpp
M llvm/test/Transforms/InstCombine/shift.ll
M llvm/test/Transforms/InstSimplify/phi.ll
M llvm/test/Transforms/LoopDeletion/update-scev.ll
Log Message:
-----------
[InstSimplify] Fold all poison phi to poison instead of undef
Commit: 4780dc3d7fd63016ebb871544287fedc83b1006a
https://github.com/llvm/llvm-project/commit/4780dc3d7fd63016ebb871544287fedc83b1006a
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/Transforms/InstCombine/phi.ll
Log Message:
-----------
[InstCombine] Add poison variant to phi test (NFC)
And rename an argument to avoid an upper/lowercase clash.
Commit: abc8c4be3bb70755328051c99e1b196663140c66
https://github.com/llvm/llvm-project/commit/abc8c4be3bb70755328051c99e1b196663140c66
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/Transforms/LoopVectorize/uniform-blend.ll
Log Message:
-----------
[LoopVectorize] Generate test checks (NFC)
Commit: edbe698eadaefca31564e3207074c8ed4b10bc12
https://github.com/llvm/llvm-project/commit/edbe698eadaefca31564e3207074c8ed4b10bc12
Author: OverMighty <its.overmighty at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libc/config/linux/aarch64/entrypoints.txt
M libc/config/linux/x86_64/entrypoints.txt
M libc/docs/math/index.rst
M libc/spec/stdc.td
M libc/src/__support/FPUtil/CMakeLists.txt
M libc/src/__support/FPUtil/dyadic_float.h
M libc/src/__support/FPUtil/generic/CMakeLists.txt
A libc/src/__support/FPUtil/generic/div.h
M libc/src/math/CMakeLists.txt
A libc/src/math/f16divf.h
M libc/src/math/generic/CMakeLists.txt
A libc/src/math/generic/f16divf.cpp
M libc/test/src/math/CMakeLists.txt
A libc/test/src/math/DivTest.h
A libc/test/src/math/f16divf_test.cpp
M libc/test/src/math/smoke/CMakeLists.txt
A libc/test/src/math/smoke/DivTest.h
A libc/test/src/math/smoke/f16divf_test.cpp
M libc/utils/MPFRWrapper/MPFRUtils.cpp
M libc/utils/MPFRWrapper/MPFRUtils.h
Log Message:
-----------
[libc][math][c23] Add f16divf C23 math function (#96131)
Part of #93566.
Commit: eabe091cdba859890fd169bd51924f34fd754bbf
https://github.com/llvm/llvm-project/commit/eabe091cdba859890fd169bd51924f34fd754bbf
Author: Benjamin Kramer <benny.kra at googlemail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M utils/bazel/llvm-project-overlay/mlir/BUILD.bazel
Log Message:
-----------
[bazel] Add missing dependency for 5ed5d723db58f7556782427444642d0571cdf649
Commit: 70cfece24d6cbb57e35dd961cc97eb2a6bf1e387
https://github.com/llvm/llvm-project/commit/70cfece24d6cbb57e35dd961cc97eb2a6bf1e387
Author: sstwcw <su3e8a96kzlver at posteo.net>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/docs/ClangFormatStyleOptions.rst
M clang/docs/ReleaseNotes.rst
M clang/include/clang/Format/Format.h
M clang/lib/Format/Format.cpp
M clang/lib/Format/UnwrappedLineFormatter.cpp
M clang/unittests/Format/ConfigParseTest.cpp
M clang/unittests/Format/FormatTest.cpp
Log Message:
-----------
Revert "[clang-format] Add option to remove leading blank lines (#91221)"
This reverts commit 9267f8f19a2e502ef5a216c0d52b352b3699d399.
I changed a formatter option. I forgot to update other components that
depend on the formatter when the option name changed.
Commit: c69ea04fb9738db283263eb350669e00b77ee4fd
https://github.com/llvm/llvm-project/commit/c69ea04fb9738db283263eb350669e00b77ee4fd
Author: Jie Fu <jiefu at tencent.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang-tools-extra/clangd/Format.cpp
Log Message:
-----------
Revert "[clangd] Fix the build broken (NFC)"
This reverts commit 4c91b49bab0728d4bc136aa33c4aeb4e8ea37d01 after 70cfece24d6cbb57e35dd961cc97eb2a6bf1e387.
Commit: fd62906ddb252298f6ed63fe85e146d477acdaed
https://github.com/llvm/llvm-project/commit/fd62906ddb252298f6ed63fe85e146d477acdaed
Author: Louis Dionne <ldionne.2 at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/include/__numeric/gcd_lcm.h
M libcxx/test/std/numerics/numeric.ops/numeric.ops.lcm/lcm.pass.cpp
Log Message:
-----------
[libc++] Fix incorrect overflow checking in std::lcm (#96310)
We should have been using __builtin_mul_overflow from the start instead
of adding a manual (and error-prone) check for overflow.
Fixes #96196
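The overflow check described in this message can be sketched as follows. This is a minimal illustration and not libc++'s actual implementation: `checked_lcm` is a hypothetical helper, it assumes non-negative inputs, and it reports overflow through `std::optional` instead of an assertion. The only piece taken from the message is the use of the GCC/Clang builtin `__builtin_mul_overflow` in place of a hand-written range check.

```cpp
#include <cassert>
#include <numeric>   // std::gcd (C++17)
#include <optional>

// Hypothetical helper for illustration only; not libc++'s internal code.
// Returns lcm(a, b) for non-negative inputs, or std::nullopt when the
// result does not fit in long long.
inline std::optional<long long> checked_lcm(long long a, long long b) {
  assert(a >= 0 && b >= 0 && "this sketch handles non-negative inputs only");
  if (a == 0 || b == 0)
    return 0LL;
  // Dividing by the gcd first is exact and cannot overflow.
  long long quotient = a / std::gcd(a, b);
  long long result = 0;
  // The builtin returns true when the exact product does not fit in
  // `result`, so no manual comparison against the type's limits is needed.
  if (__builtin_mul_overflow(quotient, b, &result))
    return std::nullopt;
  return result;
}
```

Under these assumptions, `checked_lcm(4, 6)` yields 12, while two large coprime inputs such as 4000000000 and 3999999999 yield an empty optional because their product exceeds the range of `long long`.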
Commit: 44c9a26756192228fd9dcd33c238ec8858d40329
https://github.com/llvm/llvm-project/commit/44c9a26756192228fd9dcd33c238ec8858d40329
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/Transforms/Mem2Reg/preserve-nonnull-load-metadata.ll
Log Message:
-----------
[Mem2Reg] Add tests with !noundef (NFC)
Commit: c9f083a9940d1d62f77c39f05bb0fc186cc4832c
https://github.com/llvm/llvm-project/commit/c9f083a9940d1d62f77c39f05bb0fc186cc4832c
Author: Shilei Tian <i at tianshilei.me>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/include/clang/Basic/BuiltinsAMDGPU.def
M clang/lib/CodeGen/CGBuiltin.cpp
A clang/test/CodeGenOpenCL/builtins-amdgcn-raw-buffer-store.cl
A clang/test/SemaOpenCL/builtins-amdgcn-raw-buffer-store-error.cl
Log Message:
-----------
[Clang][AMDGPU] Add builtins for intrinsic `llvm.amdgcn.raw.ptr.buffer.store` (#94576)
Depends on https://github.com/llvm/llvm-project/pull/96313.
Commit: 1462605ab0a121fc2f652f178c7c13dc0179f10e
https://github.com/llvm/llvm-project/commit/1462605ab0a121fc2f652f178c7c13dc0179f10e
Author: Kazu Hirata <kazu at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/Analysis/CFGPrinter.h
M llvm/lib/Analysis/CallGraph.cpp
M llvm/lib/Analysis/ConstraintSystem.cpp
M llvm/lib/Analysis/LoopAccessAnalysis.cpp
M llvm/lib/Analysis/PHITransAddr.cpp
M llvm/lib/Analysis/ScalarEvolution.cpp
M llvm/lib/Analysis/TargetTransformInfo.cpp
Log Message:
-----------
[Analysis] Use range-based for loops (NFC) (#96587)
Commit: 2dd4167a09cfe1a1e09852e99b4984ddb3e89cf7
https://github.com/llvm/llvm-project/commit/2dd4167a09cfe1a1e09852e99b4984ddb3e89cf7
Author: David Sherwood <david.sherwood at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp
M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.h
M llvm/test/Analysis/CostModel/AArch64/masked_ldst.ll
M llvm/test/Analysis/CostModel/AArch64/sve-gather.ll
M llvm/test/Analysis/CostModel/AArch64/sve-ldst.ll
M llvm/test/Analysis/CostModel/AArch64/sve-scatter.ll
M llvm/test/Transforms/LoopVectorize/AArch64/scalable-reductions.ll
M llvm/test/Transforms/LoopVectorize/AArch64/sve-illegal-type.ll
A llvm/test/Transforms/LoopVectorize/AArch64/sve-invalid-costs.ll
Log Message:
-----------
[LoopVectorize][AArch64] Add limited support for scalable vectorisation of i1 types (#95920)
Previously isElementTypeLegalForScalableVector returned false for i1
types, which also prevented vectorisation of loops with i1 reductions.
This is overkill - we only need to disable vectorisation for loads
and/or stores of i1 types. I've added i1 as a legal type, but changed
the cost model to return an invalid cost for loads and stores.
Commit: 54b61adc0cbefb7f923ef43c407704ba9f9d8b69
https://github.com/llvm/llvm-project/commit/54b61adc0cbefb7f923ef43c407704ba9f9d8b69
Author: Mike Rice <michael.p.rice at intel.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/lib/Lex/LiteralSupport.cpp
Log Message:
-----------
[NFC][clang] Replace unreachable code in literal processing with assert (#96579)
Address static verifier concerns about dead code in DoubleUnderscore
check. Replace it with an assert.
Commit: eb12d9b5cb6d15ce151a63ad32b4e0e5823ddb87
https://github.com/llvm/llvm-project/commit/eb12d9b5cb6d15ce151a63ad32b4e0e5823ddb87
Author: Louis Dionne <ldionne.2 at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/docs/ReleaseNotes/19.rst
M libcxx/include/__memory/allocator.h
M libcxx/include/__memory/shared_ptr.h
R libcxx/test/libcxx/containers/sequences/vector/const_T.compile.pass.cpp
R libcxx/test/libcxx/containers/sequences/vector/const_value_type.pass.cpp
M libcxx/test/libcxx/diagnostics/memory.nodiscard.verify.cpp
M libcxx/test/libcxx/memory/allocator_void.trivial.compile.pass.cpp
M libcxx/test/libcxx/memory/allocator_volatile.verify.cpp
M libcxx/test/std/concepts/concepts.lang/concept.default.init/default_initializable.compile.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.ctor.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.dtor.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.members/allocate.constexpr.size.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.members/allocate.size.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.deprecated_in_cxx17.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.deprecated_in_cxx23.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.removed_in_cxx20.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.removed_in_cxx26.verify.cpp
M libcxx/test/std/utilities/memory/specialized.algorithms/specialized.construct/construct_at.pass.cpp
Log Message:
-----------
[libc++] Remove the allocator<const T> extension (#96319)
This effort has quite a history:
- This was first attempted in 2022 via bed3240bf7d1, which broke
std::shared_ptr<T const> and caused the change to be reverted in
9138666f5464.
- We then re-attempted landing the change in 276ca87382b8 after fixing
std::shared_ptr, but reports were made that this broke code en masse
within Google. This led to the patch being reverted again in
a54d028895c9 with the goal to land this again with a migration path for
vendors.
This patch re-lands the removal while providing a migration path for
vendors by providing the `_LIBCPP_ENABLE_REMOVED_ALLOCATOR_CONST` macro.
This macro will be honored for the LLVM 19 release and will be removed
after that, at which point allocator<const T> will be removed
unconditionally.
Fixes #73665
Commit: d6c74102626ead8c4e2f7e638fdfea1fcd3d0bc7
https://github.com/llvm/llvm-project/commit/d6c74102626ead8c4e2f7e638fdfea1fcd3d0bc7
Author: Nicolai Hähnle <nicolai.haehnle at amd.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.cpp
Log Message:
-----------
AMDGPU: Remove an outdated TODO (#96446)
We have a fixed calling convention for stack pointer and frame pointer,
we shouldn't try to shift anything around.
Commit: 7e9b49f6b86c8616e6211ec02dbccc3ebb615e79
https://github.com/llvm/llvm-project/commit/7e9b49f6b86c8616e6211ec02dbccc3ebb615e79
Author: Nicolai Hähnle <nicolai.haehnle at amd.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPUArgumentUsageInfo.cpp
M llvm/lib/Target/AMDGPU/AMDGPUArgumentUsageInfo.h
M llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp
M llvm/lib/Target/AMDGPU/AMDGPUSubtarget.cpp
M llvm/lib/Target/AMDGPU/GCNSubtarget.h
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.cpp
M llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.h
Log Message:
-----------
AMDGPU: Add plumbing for private segment size argument (#96445)
The actual size of scratch/private is determined at dispatch time, so
add more plumbing to request it. Will be used in subsequent change.
Commit: 13ed349c44ca44f470e2e6181888f14b6b22a44a
https://github.com/llvm/llvm-project/commit/13ed349c44ca44f470e2e6181888f14b6b22a44a
Author: Ivan Kosarev <ivan.kosarev at amd.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp
M llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUMCExpr.cpp
M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUMCExpr.h
Log Message:
-----------
[AMDGPU][NFC] Rename AMDGPUVariadicMCExpr to AMDGPUMCExpr. (#96618)
Some of our custom expressions are not variadic and there seems to be
little benefit in mentioning the variadic nature of expression nodes in
the name anyway.
Commit: fb463e154e7fe1200b9d91f82fafa532331ce764
https://github.com/llvm/llvm-project/commit/fb463e154e7fe1200b9d91f82fafa532331ce764
Author: Aaron Ballman <aaron at aaronballman.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/www/c_status.html
Log Message:
-----------
[C23] Remove WG14 N2379 from the C status page
This paper was adopted at the Oct 2019 meeting in Ithaca, but the
changes from the paper were later removed (editorially) by the C
Floating Point study group due to being incorrect and unnecessary.
Commit: c6973ade4c46e28b1be9e1f5d873ca9a59197418
https://github.com/llvm/llvm-project/commit/c6973ade4c46e28b1be9e1f5d873ca9a59197418
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Utils/PromoteMemoryToRegister.cpp
M llvm/test/Transforms/Mem2Reg/preserve-nonnull-load-metadata.ll
Log Message:
-----------
[Mem2Reg] Generate non-terminator unreachable for !noundef undef (#96639)
When performing a load from uninitialized memory using !noundef, insert
a non-terminator unreachable instruction, which will be converted to a
proper unreachable by SimplifyCFG. This way we retain the fact that UB
occurred on this code path.
Commit: de528ffb17ebce96e0bc4dde1749146c41ca1d0d
https://github.com/llvm/llvm-project/commit/de528ffb17ebce96e0bc4dde1749146c41ca1d0d
Author: Mats Petersson <mats.petersson at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M flang/include/flang/Optimizer/Transforms/Passes.h
M flang/include/flang/Optimizer/Transforms/Passes.td
M flang/include/flang/Tools/CLOptions.inc
M flang/lib/Optimizer/Transforms/CMakeLists.txt
A flang/lib/Optimizer/Transforms/ConstantArgumentGlobalisation.cpp
M flang/test/Fir/boxproc.fir
M flang/test/Lower/character-local-variables.f90
A flang/test/Transforms/constant-argument-globalisation-2.fir
A flang/test/Transforms/constant-argument-globalisation.fir
Log Message:
-----------
[Flang] Extracting internal constants from scalar literals (#73829)
Constant actual arguments in function/subroutine calls are currently
lowered as allocas + store. This can sometimes inhibit LTO, so the
constant is not propagated to the called function, particularly in
cases where the function/subroutine call happens inside a condition.
This patch changes the lowering of these constant actual arguments to a
global constant + fir.address_of_op. This lowering makes it easier for
LTO to propagate the constant.
The optimization must be enabled explicitly to run. Use -mmlir
--enable-constant-argument-globalisation to enable.
---------
Co-authored-by: Dmitriy Smirnov <dmitriy.smirnov at arm.com>
Commit: 2cf197506f59b6151c542faec594a7e1e97d9c64
https://github.com/llvm/llvm-project/commit/2cf197506f59b6151c542faec594a7e1e97d9c64
Author: harishch4 <harishcse44 at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M mlir/include/mlir/Dialect/OpenMP/OpenMPClauseOperands.h
M mlir/include/mlir/Dialect/OpenMP/OpenMPEnums.td
M mlir/include/mlir/Dialect/OpenMP/OpenMPOps.td
M mlir/lib/Dialect/OpenMP/IR/OpenMPDialect.cpp
M mlir/test/Dialect/OpenMP/invalid.mlir
M mlir/test/Dialect/OpenMP/ops.mlir
Log Message:
-----------
[MLIR][OpenMP] Add order-modifier support to Order clause (#93805)
This adds order-modifier (reproducible|unconstrained) support to Order
clause.
Commit: a4045299d3024a69746c4fd9517e27e5cef4785e
https://github.com/llvm/llvm-project/commit/a4045299d3024a69746c4fd9517e27e5cef4785e
Author: bwlodarcz <bertrand.wlodarczyk at intel.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/SPIRV/MCTargetDesc/SPIRVBaseInfo.cpp
M llvm/lib/Target/SPIRV/MCTargetDesc/SPIRVBaseInfo.h
M llvm/lib/Target/SPIRV/SPIRVBuiltins.td
Log Message:
-----------
[SPIRV] Add definitions for NonSemantic debug info (#95530)
This commit adds basic types and definitions for
NonSemantic.Shader.DebugInfo.100 standard for SPIRV.
Full implementation of the standard will allow SPIRV backend to emit
files with debug info included.
Link to standard:
https://github.com/KhronosGroup/SPIRV-Registry/blob/main/nonsemantic/NonSemantic.Shader.DebugInfo.100.html
Commit: ec9ce89a0823165e245be8fc095da8a88fb4cf79
https://github.com/llvm/llvm-project/commit/ec9ce89a0823165e245be8fc095da8a88fb4cf79
Author: David Sherwood <david.sherwood at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/Transforms/LoopVectorize/AArch64/sve-invalid-costs.ll
Log Message:
-----------
[LoopVectorize] Fix build issue caused by #95920 (#96647)
Commit: 2274c66e6faaaf29ad693b1ae3e5a7b0228a1950
https://github.com/llvm/llvm-project/commit/2274c66e6faaaf29ad693b1ae3e5a7b0228a1950
Author: Nikolas Klauser <nikolasklauser at berlin.de>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/include/__type_traits/conditional.h
Log Message:
-----------
[libc++] Use _If for conditional_t (#96193)
This avoids different instantiations when the if and else types are
different, resulting in reduced memory use by the compiler.
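The idea behind this change can be illustrated with a small sketch. The names `if_impl`, `select`, and `if_t` are hypothetical stand-ins chosen for this example, not libc++'s identifiers. The selection is done by a member alias template of a class template that is parameterised only on the condition, so at most two class specialisations are ever instantiated. A `std::conditional<B, T, F>`-style class template, by contrast, is a distinct specialisation for every combination of condition and types.

```cpp
#include <type_traits>

namespace sketch {

// Only the boolean is a class template parameter, so the compiler
// instantiates at most if_impl<true> and if_impl<false>.
template <bool>
struct if_impl;

template <>
struct if_impl<true> {
  template <class Then, class Else>
  using select = Then;
};

template <>
struct if_impl<false> {
  template <class Then, class Else>
  using select = Else;
};

// Alias templates do not create new class specialisations per type pair.
template <bool Cond, class Then, class Else>
using if_t = typename if_impl<Cond>::template select<Then, Else>;

} // namespace sketch

// Behaves like std::conditional_t for the cases checked here.
static_assert(std::is_same_v<sketch::if_t<true, int, long>, int>);
static_assert(std::is_same_v<sketch::if_t<false, int, long>, long>);
```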
Commit: ed1273d4ddee160f847cbee28714c79353022a3c
https://github.com/llvm/llvm-project/commit/ed1273d4ddee160f847cbee28714c79353022a3c
Author: RichardLuo <CommAdama at outlook.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/modules/CMakeLists.txt.in
Log Message:
-----------
[libc++] change the visibility of libc++ header to public in libcxx module (#91240)
This PR addresses a problem that headers may not be able to be found if
`#include` is used with std modules.
Consider the following file:
#include <boost/json.hpp>
import std;
int main(int, const char **) { }
Boost will include something from libc++, but we are using -nostdinc++
at [1] so the compiler can not find any default std header. Therefore
the locally built header needs to be public.
[1]: https://github.com/RichardLuo0/llvm-project/blob/15fdd47c4b110b64dc61f636e42e0484bf8bdbe0/libcxx/modules/CMakeLists.txt.in#L52
Commit: bb5ab1ffe719f5e801ef08ac08be975546aa3266
https://github.com/llvm/llvm-project/commit/bb5ab1ffe719f5e801ef08ac08be975546aa3266
Author: shawbyoung <shawbyoung at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M bolt/docs/CommandLineArgumentReference.md
M bolt/lib/Profile/YAMLProfileReader.cpp
M bolt/lib/Rewrite/RewriteInstance.cpp
M bolt/lib/Utils/CommandLineOpts.cpp
A bolt/test/X86/hashing-based-function-matching.test
Log Message:
-----------
[𝘀𝗽𝗿] initial version
Created using spr 1.3.4
Commit: 020f69cd10a2ff1233cc28088989319e5a58b116
https://github.com/llvm/llvm-project/commit/020f69cd10a2ff1233cc28088989319e5a58b116
Author: shawbyoung <shawbyoung at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M bolt/lib/Rewrite/RewriteInstance.cpp
Log Message:
-----------
Added opts::Lite to RewriteInstance
Commit: c097e643efc2a2ee5170fbcdb5e8c644a7207452
https://github.com/llvm/llvm-project/commit/c097e643efc2a2ee5170fbcdb5e8c644a7207452
Author: shawbyoung <shawbyoung at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M bolt/lib/Rewrite/RewriteInstance.cpp
Log Message:
-----------
Revert "Added opts::Lite to RewriteInstance"
This reverts commit 020f69cd10a2ff1233cc28088989319e5a58b116.
Commit: 0e11a7e7179a2eb066e5f63f25b9277aa63f38eb
https://github.com/llvm/llvm-project/commit/0e11a7e7179a2eb066e5f63f25b9277aa63f38eb
Author: Nikita Popov <npopov at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/Transforms/EarlyCSE/flags.ll
Log Message:
-----------
[EarlyCSE] Add test with noundef load of undef (NFC)
Commit: 79e8a5952366eacd92201a8d6472726fc14e00fd
https://github.com/llvm/llvm-project/commit/79e8a5952366eacd92201a8d6472726fc14e00fd
Author: Hui <hui.xie1990 at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/include/__memory/allocator_traits.h
M libcxx/include/deque
M libcxx/include/forward_list
M libcxx/include/list
M libcxx/include/map
M libcxx/include/set
M libcxx/include/string
M libcxx/include/unordered_map
M libcxx/include/unordered_set
M libcxx/include/vector
Log Message:
-----------
[libc++] Move allocator assertion into allocator_traits (#94750)
There is code duplication in all containers that static_assert the
allocator matches the allocator requirements in the spec. This check can
be moved into a more centralised place.
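A centralised check of this kind might look like the sketch below. Everything here is an assumption made for illustration: `check_valid_allocator` and `tiny_container` are hypothetical names, and the specific requirement being checked (rebinding an allocator to its own `value_type` gives back the same allocator type) is one plausible allocator requirement, not necessarily the exact assertion the patch moves.

```cpp
#include <memory>
#include <type_traits>

namespace sketch {

// Hypothetical shared check: written once, next to the traits, instead of
// being repeated in every container.
template <class Alloc>
struct check_valid_allocator : std::true_type {
  using traits = std::allocator_traits<Alloc>;
  static_assert(
      std::is_same_v<Alloc, typename traits::template rebind_alloc<
                                typename traits::value_type>>,
      "rebinding an allocator to its own value_type should give back the "
      "same allocator type");
};

// A toy container that relies on the shared check rather than spelling out
// its own static_assert on the allocator.
template <class T, class Alloc = std::allocator<T>>
class tiny_container {
  static_assert(check_valid_allocator<Alloc>::value);

public:
  using value_type = T;
  using allocator_type = Alloc;
};

} // namespace sketch
```

With this layout, each container only needs the single `check_valid_allocator<Alloc>::value` line, and any change to the diagnostic is made in one place.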
Commit: 7f10ed637e53c68ce62b756a3be8546a3dccf751
https://github.com/llvm/llvm-project/commit/7f10ed637e53c68ce62b756a3be8546a3dccf751
Author: Vitaly Buka <vitalybuka at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M compiler-rt/lib/tsan/rtl/tsan_interceptors_posix.cpp
Log Message:
-----------
[tsan] Fix deadlock when starting StackDepot thread (#96456)
Sometimes tsan runtime calls, like
`__tsan_mutex_create ()`, need to store a stack
in the StackDepot, and the Depot may need to start
a maintenance thread.
Example:
```
__sanitizer::FutexWait ()
__sanitizer::Semaphore::Wait ()
__sanitizer::Mutex::Lock ()
__tsan::SlotLock ()
__tsan::SlotLocker::SlotLocker ()
__tsan::Acquire ()
__tsan::CallUserSignalHandler ()
__tsan::ProcessPendingSignalsImpl ()
__tsan::ProcessPendingSignals ()
__tsan::ScopedInterceptor::~ScopedInterceptor ()
___interceptor_mmap ()
pthread_create ()
__sanitizer::internal_start_thread ()
__sanitizer::(anonymous namespace)::CompressThread::NewWorkNotify ()
__sanitizer::StackDepotNode::store ()
__sanitizer::StackDepotBase<__sanitizer::StackDepotNode, 1, 20>::Put ()
__tsan::CurrentStackId ()
__tsan::MutexCreate ()
__tsan_mutex_create ()
```
The pthread_create() implementation may hit other
interceptors recursively, which may invoke
ProcessPendingSignals, which deadlocks.
An alternative solution could be to block interceptors
closer to the TSAN runtime API function, like
`__tsan_mutex_create`, or just before
`StackDepotPut`, but it's not needed for most
calls, only when a new thread is created using
`real_pthread_create`.
I don't see a reasonable way to create a
regression test.
Commit: dddef9d1c9b7b4e8a96bd68d44cd3ed6dc85f758
https://github.com/llvm/llvm-project/commit/dddef9d1c9b7b4e8a96bd68d44cd3ed6dc85f758
Author: Craig Topper <craig.topper at sifive.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/RISCV/GISel/RISCVLegalizerInfo.cpp
M llvm/lib/Target/RISCV/GISel/RISCVRegisterBankInfo.cpp
A llvm/test/CodeGen/RISCV/GlobalISel/instruction-select/fp-arith-f16.mir
A llvm/test/CodeGen/RISCV/GlobalISel/legalizer/legalize-fp-arith-f16.mir
A llvm/test/CodeGen/RISCV/GlobalISel/regbankselect/fp-arith-f16.mir
Log Message:
-----------
[RISCV] Add FPR16 regbank and start legalizing f16 operations for Zfh. (#96582)
Commit: 8a46bbbc22a51db57f05beb0026772b899a785b9
https://github.com/llvm/llvm-project/commit/8a46bbbc22a51db57f05beb0026772b899a785b9
Author: Lukacma <Marian.Lukac at arm.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/include/clang/Basic/DiagnosticSemaKinds.td
M clang/lib/Sema/SemaType.cpp
M clang/test/Sema/arm-vector-types-support.c
R clang/test/SemaCUDA/neon-attrs.cu
M clang/utils/TableGen/NeonEmitter.cpp
Log Message:
-----------
[Clang] Remove preprocessor guards and global feature checks for NEON (#95224)
To enable function multi-versioning (FMV), current checks which rely on
cmd line options or global macros to see if target feature is present
need to be removed. This patch removes those for NEON and also
implements changes to NEON header file as proposed in
[ACLE](https://github.com/ARM-software/acle/pull/321).
Commit: 8c11d3788c26cd85f102959068109d3e498e8498
https://github.com/llvm/llvm-project/commit/8c11d3788c26cd85f102959068109d3e498e8498
Author: Xiaoyang Liu <siujoeng.lau at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/include/__mdspan/extents.h
M libcxx/test/std/containers/views/mdspan/extents/ctad.pass.cpp
Log Message:
-----------
[libc++] P3029R1: Better `mdspan`'s CTAD - `std::extents` (#89015)
This patch implements an improvement introduced in P3029R1 that was
missed in #87873. It adds a deduction of static extents if
integral_constant-like constants are passed to `std::extents`.
Commit: 902952ae04afc2dfe28805b949a1e2218affe65e
https://github.com/llvm/llvm-project/commit/902952ae04afc2dfe28805b949a1e2218affe65e
Author: shawbyoung <shawbyoung at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M bolt/docs/CommandLineArgumentReference.md
M bolt/lib/Profile/YAMLProfileReader.cpp
M bolt/lib/Rewrite/RewriteInstance.cpp
M bolt/lib/Utils/CommandLineOpts.cpp
R bolt/test/X86/hashing-based-function-matching.test
Log Message:
-----------
Revert "[𝘀𝗽𝗿] initial version"
This reverts commit bb5ab1ffe719f5e801ef08ac08be975546aa3266.
Commit: 731db06a878f5c8cb29b36d526a54493677ea89f
https://github.com/llvm/llvm-project/commit/731db06a878f5c8cb29b36d526a54493677ea89f
Author: Nikolas Klauser <nikolasklauser at berlin.de>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libcxx/include/__atomic/atomic_init.h
M libcxx/src/barrier.cpp
M libcxx/src/filesystem/operations.cpp
M libcxx/src/filesystem/path.cpp
M libcxx/src/filesystem/path_parser.h
M libcxx/src/locale.cpp
M runtimes/cmake/Modules/WarningFlags.cmake
Log Message:
-----------
[libc++] Get the GCC build mostly clean of warnings (#96604)
The GCC build has gotten to the point where it's often hard to find the
actual error in the build log. We should look into enabling these
warnings again in the future, but it looks like a lot of them are
bogus.
Commit: 2604830aacdd563715da030d0396b565e912436f
https://github.com/llvm/llvm-project/commit/2604830aacdd563715da030d0396b565e912436f
Author: Akira Hatanaka <ahatanak at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/docs/LanguageExtensions.rst
M clang/include/clang/AST/Expr.h
M clang/include/clang/Basic/Builtins.td
M clang/include/clang/Basic/DiagnosticSemaKinds.td
M clang/include/clang/CodeGen/ModuleBuilder.h
M clang/lib/AST/ExprConstant.cpp
M clang/lib/CodeGen/CGBuiltin.cpp
M clang/lib/CodeGen/CGDebugInfo.cpp
M clang/lib/CodeGen/CGDebugInfo.h
M clang/lib/Sema/SemaChecking.cpp
A clang/test/CodeGenCXX/debug-info-verbose-trap.cpp
A clang/test/SemaCXX/verbose-trap.cpp
Log Message:
-----------
Add support for __builtin_verbose_trap (#79230)
The builtin causes the program to stop its execution abnormally and
shows a human-readable description of the reason for the termination
when a debugger is attached or in a symbolicated crash log.
The motivation for the builtin is explained in the following RFC:
https://discourse.llvm.org/t/rfc-adding-builtin-verbose-trap-string-literal/75845
clang's CodeGen lowers the builtin to `llvm.trap` and emits debugging
information that represents an artificial inline frame whose name
encodes the category and reason strings passed to the builtin.
Commit: bb075eeb892374a209953ad20e02c1324e272679
https://github.com/llvm/llvm-project/commit/bb075eeb892374a209953ad20e02c1324e272679
Author: Jakub Mazurkiewicz <mazkuba3 at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
A libcxx/test/std/containers/sequences/array/array.overview/nttp.equivalence.compile.pass.cpp
A libcxx/test/std/containers/sequences/array/array.overview/nttp.verify.cpp
A libcxx/test/std/utilities/utility/pairs/pairs.pair/nttp.equivalence.compile.pass.cpp
A libcxx/test/std/utilities/utility/pairs/pairs.pair/nttp.verify.cpp
Log Message:
-----------
[libc++] LWG3382: NTTP for `pair` and `array` (#85811)
Mark LWG3382 as "Nothing To Do" and add tests.
Commit: 889f3c5741e78ce90b4dcc643bb15d992225a67c
https://github.com/llvm/llvm-project/commit/889f3c5741e78ce90b4dcc643bb15d992225a67c
Author: Matt Arsenault <Matthew.Arsenault at amd.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPU.td
M llvm/lib/Target/AMDGPU/BUFInstructions.td
M llvm/lib/Target/AMDGPU/GCNSubtarget.h
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/test/CodeGen/AMDGPU/buffer-fat-pointer-atomicrmw-fadd.ll
Log Message:
-----------
AMDGPU: Handle legal v2bf16 atomicrmw fadd for gfx12 (#95930)
Annoyingly gfx90a/940 support this for global/flat but not buffer.
Commit: b7768c5485844306cd3c8afeef609ddf9d79696a
https://github.com/llvm/llvm-project/commit/b7768c5485844306cd3c8afeef609ddf9d79696a
Author: Timm Bäder <tbaeder at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/lib/AST/Interp/ByteCodeExprGen.cpp
Log Message:
-----------
[clang][Interp][NFC] Use delegate() to delegate to only initlist item
Commit: e951bd0f51f8b077296f09d9c60ddf150048042f
https://github.com/llvm/llvm-project/commit/e951bd0f51f8b077296f09d9c60ddf150048042f
Author: Vy Nguyen <vyng at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M lldb/include/lldb/API/SBDebugger.h
M lldb/include/lldb/Symbol/TypeSystem.h
M lldb/source/API/SBDebugger.cpp
M lldb/source/Symbol/TypeSystem.cpp
M lldb/tools/lldb-dap/DAP.cpp
M lldb/tools/lldb-dap/DAP.h
M lldb/tools/lldb-dap/lldb-dap.cpp
Log Message:
-----------
Reapply PR/87550 (again) (#95571)
New fixes:
- properly init the `std::optional<std::vector>` to an empty vector as
opposed to `{}` (which was effectively `std::nullopt`).
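The difference between the two initializations can be sketched in isolation. The function names below are hypothetical and the element type `int` is chosen only for illustration.

```cpp
#include <optional>
#include <vector>

// `{}` value-initializes the optional itself, so it holds no vector at all
// (equivalent to std::nullopt).
inline std::optional<std::vector<int>> make_disengaged() { return {}; }

// Constructing from an explicit empty vector yields an engaged optional
// whose contained vector happens to be empty.
inline std::optional<std::vector<int>> make_engaged_empty() {
  return std::vector<int>{};
}
```

Code that tests `has_value()` sees these two results differently even though neither contains any elements.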
---------
Co-authored-by: Vy Nguyen <oontvoo at users.noreply.github.com>
Commit: dca49d739de07b1755ad65aa26dacd2e2c22af20
https://github.com/llvm/llvm-project/commit/dca49d739de07b1755ad65aa26dacd2e2c22af20
Author: Nick Desaulniers (paternity leave) <nickdesaulniers at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libc/config/linux/app.h
M libc/src/__support/threads/thread.h
M libc/startup/linux/do_start.cpp
Log Message:
-----------
[libc][arm32] define argc type and stack alignment (#96367)
https://github.com/ARM-software/abi-aa/blob/main/aapcs32/aapcs32.rst#6212stack-constraints-at-a-public-interface
mentions that the stack on ARM32 is double word aligned.
Remove confused comments around ArgcType. argc is always an int, passed on the
stack, so we need to store a pointer to it (regardless of ILP32 or LP64).
Commit: fb07afedbebb0c9f2647d02c6f254245cdb84085
https://github.com/llvm/llvm-project/commit/fb07afedbebb0c9f2647d02c6f254245cdb84085
Author: yonghong-song <yhs at fb.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/BPF/BPFRegisterInfo.cpp
Log Message:
-----------
[BPF] Avoid potential long compilation time without -g (#96575)
Alastair Robertson reported a huge compilation time increase without -g
for bpf target when comparing to x86 ([1]). In my setup, with '-O0', for
x86, a large basic block compilation takes 0.19s while bpf target takes
2.46s. The top function which contributes to the compile time is
eliminateFrameIndex().
Such a long compilation time without -g is caused by commit
05de2e481811 ("[bpf] error when BPF stack size exceeds 512 bytes")
The compiler tries to get a debug loc by iterating over all insns in the
basic block; the loc is used when the compiler warns about a
larger-than-512 stack size. This iteration happens even without -g,
which causes an unnecessary compile time increase.
To fix the issue, move the related code to the point where the compiler
is about to warn about a stack limit violation. This fixes the compile
time regression, and on my system, the compile time is reduced from
2.46s to 0.35s.
[1] https://github.com/bpftrace/bpftrace/issues/3257
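The shape of the fix, deferring the scan until a warning is actually needed, can be sketched as follows. This is not the BPF backend code: `Insn`, `ScanStats`, `find_debug_loc` and `check_stack` are hypothetical stand-ins, and the counter exists only to make the cost visible.

```cpp
#include <optional>
#include <string>
#include <vector>

// Hypothetical instruction: it may or may not carry a debug location.
struct Insn { std::optional<std::string> debug_loc; };

// Counts how often a basic block is scanned.
struct ScanStats { int scans = 0; };

constexpr int kStackLimit = 512;  // limit named in the commit message

// Linear scan over the block for the first available debug location.
inline std::optional<std::string> find_debug_loc(const std::vector<Insn>& block,
                                                 ScanStats& stats) {
  ++stats.scans;
  for (const Insn& insn : block)
    if (insn.debug_loc) return insn.debug_loc;
  return std::nullopt;
}

// Returns a warning only when the limit is exceeded; the scan happens
// only on that path, so the common case does no extra work.
inline std::optional<std::string> check_stack(const std::vector<Insn>& block,
                                              int stack_size, ScanStats& stats) {
  if (stack_size <= kStackLimit) return std::nullopt;  // fast path: no scan
  std::optional<std::string> loc = find_debug_loc(block, stats);
  return loc.value_or("<unknown>") + ": stack size exceeds 512 bytes";
}
```

Calling `check_stack` with a small stack size leaves `stats.scans` at zero, which is the behaviour the fix restores for the common case.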
Co-authored-by: Yonghong Song <yonghong.song at linux.dev>
Commit: ac1e22f3053f761e4e2ef832b92de15876e68335
https://github.com/llvm/llvm-project/commit/ac1e22f3053f761e4e2ef832b92de15876e68335
Author: Stanley Winata <68087699+raikonenfnu at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp
M mlir/test/Dialect/Vector/fold-arith-extf-into-vector-contract.mlir
Log Message:
-----------
[mlir][vector] Generalize folding of ext-contractionOp to other types. (#96593)
Many state-of-the-art models and quantization operations are now
directly working on vector.contract on integers.
This commit generalizes ext-contraction folding so that we can emit
more performant vector.contracts on codegen pipelines.
Signed-off-by: Stanley Winata <stanley.winata at amd.com>
Commit: 580343d96f18e3c2fa8d7a8e25a175aaa2e20522
https://github.com/llvm/llvm-project/commit/580343d96f18e3c2fa8d7a8e25a175aaa2e20522
Author: Timm Bäder <tbaeder at redhat.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/lib/AST/Interp/Descriptor.cpp
Log Message:
-----------
[clang][Interp][NFC] Destroy InitMap when moving contents to DeadBlock
Commit: c9529f76017f0b517dca1e89eecdf6bbd97c3e84
https://github.com/llvm/llvm-project/commit/c9529f76017f0b517dca1e89eecdf6bbd97c3e84
Author: Max191 <44243577+Max191 at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M mlir/lib/Dialect/Tensor/IR/TensorOps.cpp
M mlir/test/Dialect/Tensor/fold-tensor-subset-ops-into-vector-transfers.mlir
M mlir/test/Dialect/Tensor/fold-tensor-subset-ops.mlir
Log Message:
-----------
[mlir] Drop outermost dims in slice rank reduction inference (#95020)
The `getDroppedDims` utility function does not follow the convention of
dropping outermost unit dimensions first when inferring a rank reduction
mask for a slice. This PR updates the implementation to match this
convention.
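One way to picture the convention is a small standalone sketch. This is not the MLIR `getDroppedDims` implementation; `inferDroppedDims` is a hypothetical helper over static shapes only. Matching from the innermost dimension outward lets inner unit dims be kept for as long as possible, so the unit dims that end up dropped are the outermost ones.

```cpp
#include <cstdint>
#include <optional>
#include <vector>

// Hypothetical helper: returns a mask of source dims to drop so that the
// remaining dims equal `reducedShape`, or nullopt if no such mask exists.
inline std::optional<std::vector<bool>> inferDroppedDims(
    const std::vector<std::int64_t>& sourceShape,
    const std::vector<std::int64_t>& reducedShape) {
  std::vector<bool> dropped(sourceShape.size(), false);
  // Position in the reduced shape, walking from innermost to outermost.
  std::int64_t pos = static_cast<std::int64_t>(reducedShape.size()) - 1;
  for (std::int64_t i = static_cast<std::int64_t>(sourceShape.size()) - 1;
       i >= 0; --i) {
    if (pos >= 0 && sourceShape[i] == reducedShape[pos]) {
      --pos;  // this source dim is kept
      continue;
    }
    if (sourceShape[i] != 1) return std::nullopt;  // only unit dims may drop
    dropped[i] = true;
  }
  if (pos >= 0) return std::nullopt;  // some reduced dims were never matched
  return dropped;
}
```

For a source shape `[1, 1, 4]` and a reduced shape `[1, 4]`, this marks dimension 0 as dropped rather than dimension 1.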
Commit: 05ca20744159a33349271a96b90959a0a3807133
https://github.com/llvm/llvm-project/commit/05ca20744159a33349271a96b90959a0a3807133
Author: Aaron Ballman <aaron at aaronballman.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/www/c_status.html
Log Message:
-----------
[C23] Update status page regarding FLT_MAX_EXP
N2843 was subsumed by N2882; we could probably consider removing
subsumed entries, but I've been leaving them to help folks looking at
the editor's report from various working drafts and wondering about the
changes.
Commit: d7dd778cde84110e38521a6b55dfeb4e1c649ec2
https://github.com/llvm/llvm-project/commit/d7dd778cde84110e38521a6b55dfeb4e1c649ec2
Author: PeterChou1 <peter.chou at mail.utoronto.ca>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang-tools-extra/clang-doc/tool/CMakeLists.txt
M clang-tools-extra/clang-doc/tool/ClangDocMain.cpp
Log Message:
-----------
[clang-doc] update install path to share/clang-doc instead of share/clang (#96555)
Updates the install path for clang-doc to share/clang-doc instead of
share/clang to avoid confusion.
Commit: dbd5c7805bae510c79b51c2c7700a590f2df446d
https://github.com/llvm/llvm-project/commit/dbd5c7805bae510c79b51c2c7700a590f2df446d
Author: PeterChou1 <peter.chou at mail.utoronto.ca>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang-tools-extra/test/clang-doc/Inputs/basic-project/src/Calculator.cpp
M clang-tools-extra/test/clang-doc/basic-project.test
Log Message:
-----------
[clang-doc] Remove stdexecpt from clang-doc test (#96552)
Removes stdexcept from the clang-doc test introduced in
https://github.com/llvm/llvm-project/pull/93928
since it violates the rule that tests must be freestanding.
Commit: f0f774ebf09b1f1ae8129074801342eeadf5495b
https://github.com/llvm/llvm-project/commit/f0f774ebf09b1f1ae8129074801342eeadf5495b
Author: Vitaly Buka <vitalybuka at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M compiler-rt/lib/asan/asan_interceptors.cpp
M compiler-rt/lib/hwasan/hwasan_interceptors.cpp
M compiler-rt/lib/lsan/lsan_interceptors.cpp
M compiler-rt/lib/memprof/memprof_interceptors.cpp
M compiler-rt/lib/msan/msan_interceptors.cpp
M compiler-rt/lib/sanitizer_common/sanitizer_common_libcdep.cpp
M compiler-rt/lib/sanitizer_common/sanitizer_linux.cpp
M compiler-rt/lib/sanitizer_common/sanitizer_posix.h
M compiler-rt/lib/tsan/rtl/tsan_interceptors_posix.cpp
Log Message:
-----------
[sanitizer] Rename DEFINE_REAL_PTHREAD_FUNCTIONS (#96527)
We use REAL() calls in interceptors, but
DEFINE_REAL_PTHREAD_FUNCTIONS has nothing to do
with them and is only used for internal maintenance
threads.
This is done to avoid confusion like in #96456.
Commit: de7c1396f29b9bf7011912e7cfea9edad1efb492
https://github.com/llvm/llvm-project/commit/de7c1396f29b9bf7011912e7cfea9edad1efb492
Author: Han-Kuan Chen <hankuan.chen at sifive.com>
Date: 2024-06-26 (Wed, 26 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
Log Message:
-----------
[SLP] NFC. Refactor and add getAltInstrMask help function. (#94709)
Co-authored-by: Alexey Bataev <a.bataev at gmx.com>
Commit: aaf50bf34f3a2007221c644384d238666cfc2bc3
https://github.com/llvm/llvm-project/commit/aaf50bf34f3a2007221c644384d238666cfc2bc3
Author: Jay Foad <jay.foad at amd.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-smem.ll
Log Message:
-----------
[AMDGPU] Disallow negative s_load offsets in isLegalAddressingMode (#91327)
Commit: 5e2beed9a1643cd6358fb8b43feb893543d90bf3
https://github.com/llvm/llvm-project/commit/5e2beed9a1643cd6358fb8b43feb893543d90bf3
Author: Aaron Ballman <aaron at aaronballman.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/www/c_status.html
Log Message:
-----------
[C23] Move WG14 N2931 to the TS18661 section
This paper only matters for TS18661-3 integration.
Commit: 0b049ce64653a8cbdeedaa2bfe6cc79fc6af5d40
https://github.com/llvm/llvm-project/commit/0b049ce64653a8cbdeedaa2bfe6cc79fc6af5d40
Author: Vitaly Buka <vitalybuka at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M compiler-rt/lib/tsan/rtl/tsan_rtl.cpp
A compiler-rt/test/tsan/Darwin/fork_deadlock.cpp
R compiler-rt/test/tsan/Linux/fork_deadlock.cpp
Log Message:
-----------
[tsan] Test `__tsan_test_only_on_fork` only on Mac (#96597)
According to https://reviews.llvm.org/D114250
this was to handle a Mac-specific issue; however,
the test is Linux only.
The test effectively prevents locking the main allocator
on fork, but we have done that on Linux for other
sanitizers for years, and need to do the same
for TSAN to avoid deadlocks.
Commit: 4c87212d63c3b45e9371434239553ef007216106
https://github.com/llvm/llvm-project/commit/4c87212d63c3b45e9371434239553ef007216106
Author: Nick Desaulniers (paternity leave) <nickdesaulniers at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libc/src/__support/OSUtil/linux/arm/syscall.h
Log Message:
-----------
[libc][thumb] support syscalls from thumb mode (#96558)
r7 is reserved in thumb2 (typically for the frame pointer, as opposed to r11 in
ARM mode), so assigning to a variable with explicit register storage in r7 will
produce an error.
But r7 is where the Linux kernel expects the syscall number to be placed. We
can use a temporary variable so that the register allocator picks a temporary
register, in which we save and restore the previous value of r7.
Fixes: #93738
Commit: cd2bac81a9ae961f8299198750a8910dc389491e
https://github.com/llvm/llvm-project/commit/cd2bac81a9ae961f8299198750a8910dc389491e
Author: Vitaly Buka <vitalybuka at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M compiler-rt/lib/tsan/rtl/tsan_mman.cpp
M compiler-rt/lib/tsan/rtl/tsan_mman.h
M compiler-rt/lib/tsan/rtl/tsan_rtl.cpp
Log Message:
-----------
[nfc][tsan] Better name for locking functions (#96598)
These functions are used only for `fork`.
Unused parameter `child` will be used in followup patches.
Commit: 0258a60cd97c5d44b150a6c954f992b39a6c4f2f
https://github.com/llvm/llvm-project/commit/0258a60cd97c5d44b150a6c954f992b39a6c4f2f
Author: Vitaly Buka <vitalybuka at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M compiler-rt/lib/tsan/rtl/tsan_mman.cpp
Log Message:
-----------
[nfc][tsan] Clang format includes (#96599)
Commit: 5c9513ac752fe33976a0a16a9e6d188b8200d3ae
https://github.com/llvm/llvm-project/commit/5c9513ac752fe33976a0a16a9e6d188b8200d3ae
Author: Alex MacLean <amaclean at nvidia.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp
A llvm/test/CodeGen/NVPTX/max-align.ll
Log Message:
-----------
[NVPTX] cap param alignment at 128 (max supported by ptx) (#96117)
Cap the alignment to 128 bytes as that is the maximum alignment
supported by PTX. The restriction is mentioned in the parameter passing
section (Note D) of the [PTX Writer's Guide to Interoperability]
(https://docs.nvidia.com/cuda/ptx-writers-guide-to-interoperability/index.html#parameter-passing)
> D. The alignment must be 1, 2, 4, 8, 16, 32, 64, or 128 bytes.
Commit: 70fb1e379b52bfab061fd608acb8c7d070d31ecd
https://github.com/llvm/llvm-project/commit/70fb1e379b52bfab061fd608acb8c7d070d31ecd
Author: Fabian Mora <fmora.dev at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M mlir/include/mlir/Target/LLVM/ROCDL/Utils.h
M mlir/lib/Target/LLVM/CMakeLists.txt
M mlir/lib/Target/LLVM/ROCDL/Target.cpp
Log Message:
-----------
Reland [mlir][Target] Improve ROCDL gpu serialization API (#96198)
Reland: https://github.com/llvm/llvm-project/pull/95456
This patch improves the ROCDL gpu serialization API by:
- Introducing the enum `AMDGCNLibraries` for specifying the AMD GCN
device code libraries to use during linking.
- Removing `getCommonBitcodeLibs` in favor of `AMDGCNLibraries`.
Previously `getCommonBitcodeLibs` would try to load all AMD GCN bitcode
libraries; now it will only load the requested libraries.
- Exposing the `compileToBinary` method and making it virtual, allowing
downstream users to re-use this method.
- Exposing `moduleToObjectImpl`, this method provides a prototype flow
for compiling to binary, allowing downstream users to re-use this
method.
- It also avoids constructing the control variables if no device
libraries are being used.
- Changes the style of the error messages to be composable, ie no full
stops.
- Adds an error message for when the ROCm toolkit can't be found but it
was required.
Commit: c0dc134de56f0fa25b942c9db1b1852b1da9ee51
https://github.com/llvm/llvm-project/commit/c0dc134de56f0fa25b942c9db1b1852b1da9ee51
Author: Vitaly Buka <vitalybuka at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M compiler-rt/lib/tsan/rtl/tsan_mman.cpp
M compiler-rt/test/sanitizer_common/TestCases/Posix/fork_threaded.c
Log Message:
-----------
[tsan] Lock/Unlock allocator and stacks on fork (#96600)
We do that for other Sanitizers, and we
should do the same for TSAN.
There are known deadlock reports here.
Commit: 39a0aa5876c20ef245fc4065b9f512008fac91a6
https://github.com/llvm/llvm-project/commit/39a0aa5876c20ef245fc4065b9f512008fac91a6
Author: Eli Friedman <efriedma at quicinc.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
M llvm/lib/Target/ARM/ARMISelLowering.cpp
M llvm/lib/Target/X86/X86ISelLowering.cpp
M llvm/test/CodeGen/AArch64/ldexp.ll
A llvm/test/CodeGen/ARM/ldexp.ll
M llvm/test/CodeGen/X86/ldexp.ll
M llvm/test/CodeGen/X86/llvm.frexp.ll
Log Message:
-----------
[SelectionDAG] Lower llvm.ldexp.f32 to ldexp() on Windows. (#95301)
This reduces codesize. As discussed in #92707.
Commit: 928b7808406b893b24edb8d8462491dc82f9ce43
https://github.com/llvm/llvm-project/commit/928b7808406b893b24edb8d8462491dc82f9ce43
Author: Brendan Dahl <brendan.dahl at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/WebAssembly/WebAssemblyInstrSIMD.td
M llvm/test/CodeGen/WebAssembly/half-precision.ll
M llvm/test/MC/WebAssembly/simd-encodings.s
Log Message:
-----------
[WebAssembly] Implement trunc_sat and convert instructions for f16x8. (#95180)
These instructions can be generated using regular LL intrinsics.
Specified at:
https://github.com/WebAssembly/half-precision/blob/29a9b9462c9285d4ccc1a5dc39214ddfd1892658/proposals/half-precision/Overview.md
Commit: a790279bf2a8be2f9c42bf80f55a63933e398d0e
https://github.com/llvm/llvm-project/commit/a790279bf2a8be2f9c42bf80f55a63933e398d0e
Author: Alexis Perry-Holby <AlexisPerry at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/include/clang/Driver/Options.td
M clang/lib/Driver/ToolChains/Flang.cpp
M flang/include/flang/Frontend/TargetOptions.h
M flang/include/flang/Lower/Bridge.h
M flang/include/flang/Optimizer/CodeGen/CGPasses.td
M flang/include/flang/Optimizer/CodeGen/Target.h
M flang/include/flang/Optimizer/Dialect/Support/FIRContext.h
M flang/include/flang/Optimizer/Transforms/Passes.td
M flang/lib/Frontend/CompilerInvocation.cpp
M flang/lib/Frontend/FrontendActions.cpp
M flang/lib/Lower/Bridge.cpp
M flang/lib/Optimizer/CodeGen/CodeGen.cpp
M flang/lib/Optimizer/CodeGen/Target.cpp
M flang/lib/Optimizer/CodeGen/TargetRewrite.cpp
M flang/lib/Optimizer/CodeGen/TypeConverter.cpp
M flang/lib/Optimizer/Dialect/Support/FIRContext.cpp
A flang/test/Driver/tune-cpu-fir.f90
A flang/test/Lower/tune-cpu-llvm.f90
M flang/tools/bbc/bbc.cpp
M flang/tools/tco/tco.cpp
M flang/unittests/Optimizer/FIRContextTest.cpp
M mlir/include/mlir/Dialect/LLVMIR/LLVMOps.td
M mlir/lib/Target/LLVMIR/ModuleImport.cpp
M mlir/lib/Target/LLVMIR/ModuleTranslation.cpp
A mlir/test/Target/LLVMIR/Import/tune-cpu.ll
A mlir/test/Target/LLVMIR/tune-cpu.mlir
Log Message:
-----------
[flang] Add basic -mtune support (#95043)
This PR adds -mtune as a valid flang flag and passes the information
through to LLVM IR as an attribute on all functions. No specific
architecture optimizations are added at this time.
Commit: a0869331ec0c9ae42f696e87d72416ea5ecb926f
https://github.com/llvm/llvm-project/commit/a0869331ec0c9ae42f696e87d72416ea5ecb926f
Author: Aaron Ballman <aaron at aaronballman.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/www/c_status.html
Log Message:
-----------
[C11] Remove WG14 N1537 from the status page
This paper was a rewording of WG14 N1485, correcting terminology and
bringing the C11 feature slightly closer in line with the C++11
feature. There is nothing additional to be done or test to conform to
what was specified by WG14 N1537, so we'll remove the entry and lean on
N1485 to track status for atomics.
Commit: 21ab32e1c144b42458b7b3181e84bfb45aadcc54
https://github.com/llvm/llvm-project/commit/21ab32e1c144b42458b7b3181e84bfb45aadcc54
Author: Michael Buch <michaelbuch12 at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M lldb/source/Plugins/Language/CPlusPlus/LibCxx.cpp
Log Message:
-----------
[lldb][LibCxx] Move incorrect nullptr check (#96635)
Found while skimming this code. Don't have a reproducible test case for
this but the nullptr check should clearly occur before we try to
dereference `location_sp`.
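The ordering issue can be sketched in isolation; `Location` and `readAddress` are hypothetical stand-ins, not LLDB types or functions.

```cpp
#include <memory>
#include <optional>

// Hypothetical stand-in for the object behind `location_sp`.
struct Location { int address; };

inline std::optional<int> readAddress(
    const std::shared_ptr<Location>& location_sp) {
  // The null check comes first...
  if (!location_sp) return std::nullopt;
  // ...so the dereference below is only reached with a valid pointer.
  return location_sp->address;
}
```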
Commit: b9353f7f3ec88ecc5c4cdfc109665c89599b7f2f
https://github.com/llvm/llvm-project/commit/b9353f7f3ec88ecc5c4cdfc109665c89599b7f2f
Author: Joseph Huber <huberjn at outlook.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/tools/clang-linker-wrapper/ClangLinkerWrapper.cpp
Log Message:
-----------
[LinkerWrapper][NFC] Simplify StringErrors (#96650)
Summary:
The StringError class has a specialized method that creates the
inconvertible error code for you. It's much easier to read this way.
Commit: fef144cebb378f550ef098d370316554d647f625
https://github.com/llvm/llvm-project/commit/fef144cebb378f550ef098d370316554d647f625
Author: Kazu Hirata <kazu at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/CodeGen/AssignmentTrackingAnalysis.cpp
M llvm/lib/CodeGen/MLRegAllocEvictAdvisor.cpp
M llvm/lib/DWARFLinker/Parallel/ArrayList.h
M llvm/lib/ExecutionEngine/Orc/Core.cpp
M llvm/lib/ExecutionEngine/Orc/Debugging/DebugInfoSupport.cpp
M llvm/lib/ProfileData/InstrProfReader.cpp
M llvm/lib/Target/AMDGPU/AMDGPUIGroupLP.cpp
M llvm/lib/Target/ARM/ARMBaseInstrInfo.cpp
M llvm/lib/Target/NVPTX/NVVMReflect.cpp
M llvm/lib/Target/PowerPC/PPCMergeStringPool.cpp
M llvm/lib/Transforms/IPO/MemProfContextDisambiguation.cpp
M llvm/lib/Transforms/Utils/CodeLayout.cpp
M llvm/tools/llvm-jitlink/llvm-jitlink.cpp
M llvm/utils/TableGen/ARMTargetDefEmitter.cpp
M llvm/utils/TableGen/ExegesisEmitter.cpp
Log Message:
-----------
Revert "[llvm] Use llvm::sort (NFC) (#96434)"
This reverts commit 05d167fc201b4f2e96108be0d682f6800a70c23d.
Reverting the patch fixes the following under EXPENSIVE_CHECKS:
LLVM :: CodeGen/AMDGPU/sched-group-barrier-pipeline-solver.mir
LLVM :: CodeGen/AMDGPU/sched-group-barrier-pre-RA.mir
LLVM :: CodeGen/PowerPC/aix-xcoff-used-with-stringpool.ll
LLVM :: CodeGen/PowerPC/merge-string-used-by-metadata.mir
LLVM :: CodeGen/PowerPC/mergeable-string-pool-large.ll
LLVM :: CodeGen/PowerPC/mergeable-string-pool-pass-only.mir
LLVM :: CodeGen/PowerPC/mergeable-string-pool.ll
Commit: aec735cf476c3975b026aa79fa40dda06a27fac3
https://github.com/llvm/llvm-project/commit/aec735cf476c3975b026aa79fa40dda06a27fac3
Author: agozillon <Andrew.Gozillon at amd.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M flang/lib/Lower/OpenMP/OpenMP.cpp
M flang/test/Fir/convert-to-llvm-openmp-and-fir.fir
M flang/test/Integration/OpenMP/map-types-and-sizes.f90
A flang/test/Lower/OpenMP/common-block-map.f90
M llvm/lib/Frontend/OpenMP/OMPIRBuilder.cpp
M mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
A mlir/test/Target/LLVMIR/omptarget-fortran-common-block-host.mlir
A offload/test/offloading/fortran/target-map-all-common-block-members.f90
A offload/test/offloading/fortran/target-map-common-block.f90
A offload/test/offloading/fortran/target-map-declare-target-link-common-block.f90
A offload/test/offloading/fortran/target-map-first-common-block-member.f90
A offload/test/offloading/fortran/target-map-mix-imp-exp-common-block-members.f90
A offload/test/offloading/fortran/target-map-second-common-block-member.f90
Log Message:
-----------
[Flang][OpenMP][MLIR] Fix common block mapping for regular and declare target link (#91829)
This PR attempts to fix common block mapping for regular mapping of
these types as well as when they have been marked as "declare target
link". This PR should allow correct mapping of both the members of a
common block and the full common block via its block symbol.
The main changes were some adjustments to the Fortran OpenMP lowering to
HLFIR/FIR, the lowering of the LLVM+OpenMP dialect to LLVM-IR and
adjustments to the way we handle target kernel map argument
rebinding inside of the OMPIRBuilder.
For the Fortran OpenMP lowering there were two changes: one prevents the
implicit capture of common block members when the common block symbol
itself has been marked, and the other creates intermediate member accesses
inside of the target region to be used in place of those external to the
target region, which prevents external usages breaking the
IsolatedFromAbove pact.
In the latter case, there was an adjustment to the size calculation for
types to better handle cases where we pass an array as the type of a map
(as opposed to the bounds and the type of the element), which occurs in
the case of common blocks. There is also some adjustment to how
handleDeclareTargetMapVar handles renaming of declare target symbols in
the module to the reference pointer, now it will only apply to those
within the kernel that is currently being generated and we also perform
a modification to replace constants with instructions as necessary as we
cannot replace these with our reference pointer (non-constant and
constants do not mix nicely).
In the case of the OpenMPIRBuilder some changes were made to defer
global symbol rebinding to kernel arguments until all other arguments
have been rebound. This makes sure we do not replace uses that may refer
to the global (e.g. a GEP) but are themselves actually a separate
argument that needs to be bound.
Currently "declare target to" still needs some work, but this may be the
case for all types in conjunction with "declare target to" at the
moment.
Commit: b003c60904a78eb62702c613f9e61155bad56798
https://github.com/llvm/llvm-project/commit/b003c60904a78eb62702c613f9e61155bad56798
Author: Felix Schneider <fx.schn at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M mlir/include/mlir/Dialect/Arith/IR/ArithOps.td
M mlir/lib/Dialect/Arith/IR/ArithOps.cpp
M mlir/test/Dialect/Arith/canonicalize.mlir
Log Message:
-----------
[mlir][arith] Match folding of `arith.remf` to `llvm.frem` semantics (#96537)
There are multiple ways to define a remainder operation. Depending on
the definition, the result could be either always positive or have the
sign of the dividend.
The pattern lowering `arith.remf` to LLVM assumes that the semantics
match `llvm.frem`, which seems to be reasonable. The folder, however, is
implemented via `APFloat::remainder()` which has different semantics.
This patch matches the folding behaviour to lowering behavior by using
`APFloat::mod()`, which matches the behavior of `llvm.frem` and libm's
`fmod()`. It also updates the documentation of `arith.remf` to explain
this behavior: The sign of the result of the remainder operation always
matches the sign of the dividend (LHS operand).
frem documentation: https://llvm.org/docs/LangRef.html#frem-instruction
Fix https://github.com/llvm/llvm-project/issues/94431
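The two remainder definitions can be compared with the C++ standard library. This is only an illustration: `std::fmod` is the libm function named above, while `std::remainder` (the IEEE 754 remainder) is used here on the assumption that it has the same semantics as `APFloat::remainder()`. The wrapper names are hypothetical.

```cpp
#include <cmath>

// Truncating remainder: the result takes the sign of the dividend,
// as with `llvm.frem` and libm's `fmod()`.
inline double remTruncated(double lhs, double rhs) {
  return std::fmod(lhs, rhs);
}

// IEEE 754 remainder: the quotient is rounded to the nearest integer,
// so the result can have the opposite sign of the dividend.
inline double remIeee(double lhs, double rhs) {
  return std::remainder(lhs, rhs);
}
```

For a dividend of 5.0 and a divisor of 3.0, the truncating form gives 2.0 and the IEEE form gives -1.0; for -5.0 and 3.0 they give -2.0 and 1.0 respectively.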
---------
Co-authored-by: Jakub Kuderski <kubakuderski at gmail.com>
Commit: a255ece56f73f3ef4bd5add83f9120a0b17914a1
https://github.com/llvm/llvm-project/commit/a255ece56f73f3ef4bd5add83f9120a0b17914a1
Author: Nivetha Kuruparan <167944429+nivethakuruparan at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/DebugInfo/attr-btf_type_tag.ll
Log Message:
-----------
XFAIL llvm/test/DebugInfo/attr-btf_type_tag.ll on AIX (#96677)
This PR XFAILs `llvm/test/DebugInfo/attr-btf_type_tag.ll` on AIX since
we don't have a `.debug_addr` section.
Co-authored-by: Nivetha Kuruparan <nivetha at comp810.rtp.raleigh.ibm.com>
Commit: ec2fb59e6c12bf89aae823c4c47b25488d00daa0
https://github.com/llvm/llvm-project/commit/ec2fb59e6c12bf89aae823c4c47b25488d00daa0
Author: Maksim Panchenko <maks at fb.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
A bolt/docs/OptimizingLinux.md
Log Message:
-----------
[BOLT][docs] Add Linux kernel optimization guide (#96669)
Describe steps for optimizing the Linux kernel with BOLT.
Commit: 8dd9494056d6797144dfabbbfb6d478c95375019
https://github.com/llvm/llvm-project/commit/8dd9494056d6797144dfabbbfb6d478c95375019
Author: Tarun Prabhu <tarun at lanl.gov>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/include/clang/Driver/Options.td
M clang/lib/Driver/ToolChains/Flang.cpp
M flang/include/flang/Frontend/TargetOptions.h
M flang/include/flang/Lower/Bridge.h
M flang/include/flang/Optimizer/CodeGen/CGPasses.td
M flang/include/flang/Optimizer/CodeGen/Target.h
M flang/include/flang/Optimizer/Dialect/Support/FIRContext.h
M flang/include/flang/Optimizer/Transforms/Passes.td
M flang/lib/Frontend/CompilerInvocation.cpp
M flang/lib/Frontend/FrontendActions.cpp
M flang/lib/Lower/Bridge.cpp
M flang/lib/Optimizer/CodeGen/CodeGen.cpp
M flang/lib/Optimizer/CodeGen/Target.cpp
M flang/lib/Optimizer/CodeGen/TargetRewrite.cpp
M flang/lib/Optimizer/CodeGen/TypeConverter.cpp
M flang/lib/Optimizer/Dialect/Support/FIRContext.cpp
R flang/test/Driver/tune-cpu-fir.f90
R flang/test/Lower/tune-cpu-llvm.f90
M flang/tools/bbc/bbc.cpp
M flang/tools/tco/tco.cpp
M flang/unittests/Optimizer/FIRContextTest.cpp
M mlir/include/mlir/Dialect/LLVMIR/LLVMOps.td
M mlir/lib/Target/LLVMIR/ModuleImport.cpp
M mlir/lib/Target/LLVMIR/ModuleTranslation.cpp
R mlir/test/Target/LLVMIR/Import/tune-cpu.ll
R mlir/test/Target/LLVMIR/tune-cpu.mlir
Log Message:
-----------
Revert "[flang] Add basic -mtune support" (#96678)
Reverts llvm/llvm-project#95043
Commit: 7e77353db6984b2bf07f54fd0ef5f56bf1aa8244
https://github.com/llvm/llvm-project/commit/7e77353db6984b2bf07f54fd0ef5f56bf1aa8244
Author: Karl-Johan Karlsson <karl-johan.karlsson at ericsson.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M clang/test/Preprocessor/embed_weird.cpp
Log Message:
-----------
[clang][test] Avoid writing to a potentially write-protected dir (#96457)
The test clang/test/Preprocessor/embed_weird.cpp creates a file directly
in the Inputs dir in the llvm-project repo instead of the temporary
directory. The llvm-project repo may be write protected e.g. in a
sandboxed environment.
This patch creates a separate temporary directory where the
null_byte.bin file is created, along with copies of the other
embed files needed for the test case.
Commit: a55dc1d3ca2ed948bdae887ea1f4ab8041ed1b6d
https://github.com/llvm/llvm-project/commit/a55dc1d3ca2ed948bdae887ea1f4ab8041ed1b6d
Author: Alexey Bataev <a.bataev at outlook.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/Transforms/SLPVectorizer/RISCV/shuffled-gather-casted.ll
Log Message:
-----------
[SLP][NFC]Add a test with the incorrect casting of the sext/zext alternate node, NFC.
Commit: 600ff287722a15106e9a02c470b9865dda05980e
https://github.com/llvm/llvm-project/commit/600ff287722a15106e9a02c470b9865dda05980e
Author: Mircea Trofin <mtrofin at google.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/Analysis/InlineModelFeatureMaps.h
M llvm/lib/Analysis/MLInlineAdvisor.cpp
M llvm/lib/Analysis/models/gen-inline-oz-test-model.py
A llvm/test/Transforms/Inline/ML/avail-external.ll
Log Message:
-----------
[mlgo] add 2 new features whether caller/callee is `available_externally` (#96585)
AvailableExternally linkage is interesting because, in ThinLTO cases, it
means the function may get elided if it survives inlining; see the
`elim-avail-extern` pass.
Commit: bd5b77529f07d21c27cf14009845e319e454867d
https://github.com/llvm/llvm-project/commit/bd5b77529f07d21c27cf14009845e319e454867d
Author: Fangrui Song <i at maskray.me>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M compiler-rt/lib/asan/asan_globals.cpp
Log Message:
-----------
__asan_register_elf_globals: properly check the "no instrumented global variable" case
On ELF platforms, the instrumentation registers global variables using
`__asan_register_elf_globals` for the default `UseGlobalsGC` case. If
all instrumented global variables in a module are discarded by linker
GC, we will have `start == stop`.
Normally `start == 0`, but `start != 0` is possible with a linker script
retaining `asan_globals`. The called `__asan_register_globals` would
access out-of-bounds `globals[n-1]`, though there is likely no runtime
failure.
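The empty-range case described above can be illustrated with a small standalone sketch. This is not the compiler-rt code: `GlobalDesc` and `registerRange` are hypothetical names, and the only point shown is that testing `start == stop` (rather than `start == 0`) keeps the `[n-1]` access from running on an empty range.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-in for a global-variable descriptor.
struct GlobalDesc {
  const void *addr;
  std::size_t size;
};

// Returns the number of descriptors in [start, stop). An empty range is
// rejected up front, whether or not `start` is null.
std::size_t registerRange(const GlobalDesc *start, const GlobalDesc *stop) {
  if (start == stop)
    return 0; // every instrumented global was discarded by linker GC
  assert(start < stop);
  std::size_t n = static_cast<std::size_t>(stop - start);
  const GlobalDesc &last = start[n - 1]; // in bounds only because n > 0
  (void)last;
  return n;
}
```

With a check on `start == 0` alone, a non-null but empty range would reach the `start[n - 1]` access with `n == 0`.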
Pull Request: https://github.com/llvm/llvm-project/pull/96529
Commit: 80114a7b325becbfe4ff4298c4ab5e17631e24bb
https://github.com/llvm/llvm-project/commit/80114a7b325becbfe4ff4298c4ab5e17631e24bb
Author: Björn Pettersson <bjorn.a.pettersson at ericsson.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/CodeGen/ScheduleDAGInstrs.h
Log Message:
-----------
[ScheduleDAG] Remove obsolete VReg2SUnitMap. NFC (#96559)
The VReg2SUnitMap type is not used anywhere (it has probably been
obsolete since commit 97d0ffbe0647c4ec49ed4 back in 2015). This commit
simply removes the type definition.
Commit: 699cd9ac1dc305b2f0ee6e70939a3dddccc2add4
https://github.com/llvm/llvm-project/commit/699cd9ac1dc305b2f0ee6e70939a3dddccc2add4
Author: Morten larsen <mortenlarsens at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/DWP/DWP.cpp
M llvm/test/tools/llvm-dwp/X86/merge.test
A llvm/test/tools/llvm-dwp/X86/merge_v5.test
Log Message:
-----------
[llvm-dwp] Fix merging of debug_str_offsets with multiple contributions (#90461)
This pull request changes the merging of ``debug_str_offsets`` to
merge per contribution and correctly copy over each contribution header
to the merged section. I have added some test data in DWARF5 format,
as this is where the section contribution header was introduced,
as far as I can tell.
Commit: 228c2e147390ca996a22e460e0bb804431469d25
https://github.com/llvm/llvm-project/commit/228c2e147390ca996a22e460e0bb804431469d25
Author: Alexey Bataev <a.bataev at outlook.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
M llvm/test/Transforms/SLPVectorizer/RISCV/shuffled-gather-casted.ll
Log Message:
-----------
[SLP]Fix incorrect promotion of nodes before shuffling.
If the base node is signed but some values are unsigned, the whole
node should still be considered signed. Also, an extra bitwidth analysis
should be performed when estimating the minimal bitwidth.
Commit: 3c24eb39fb7e9c8b61992c86cb44d8cacc5853d2
https://github.com/llvm/llvm-project/commit/3c24eb39fb7e9c8b61992c86cb44d8cacc5853d2
Author: Stephen Chou <stephenchouca at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/CodeGen/MIRParser/MILexer.cpp
M llvm/lib/CodeGen/MIRParser/MILexer.h
M llvm/lib/CodeGen/MIRParser/MIParser.cpp
A llvm/test/CodeGen/MIR/Generic/bfloat-immediates.mir
Log Message:
-----------
[LLVM][MIR] Support parsing bfloat immediates in MIR parser (#96010)
Adds support in MIR parser for parsing bfloat immediates, and adds a
test for this.
Commit: ef05b0322307318eb806c0bd4cdcdae9e31d477b
https://github.com/llvm/llvm-project/commit/ef05b0322307318eb806c0bd4cdcdae9e31d477b
Author: OverMighty <its.overmighty at gmail.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libc/test/src/math/exhaustive/CMakeLists.txt
M libc/test/src/math/exhaustive/exhaustive_test.h
A libc/test/src/math/exhaustive/fmodf16_test.cpp
M libc/utils/MPFRWrapper/MPFRUtils.cpp
Log Message:
-----------
[libc][math][c23] Add MPFR exhaustive test for fmodf16 (#94656)
Commit: 88f80aeb0c626708017ea2d845e541e0e20f4d58
https://github.com/llvm/llvm-project/commit/88f80aeb0c626708017ea2d845e541e0e20f4d58
Author: lntue <35648136+lntue at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M libc/config/darwin/arm/entrypoints.txt
M libc/config/linux/aarch64/entrypoints.txt
M libc/config/linux/arm/entrypoints.txt
M libc/config/linux/riscv/entrypoints.txt
M libc/docs/math/index.rst
M libc/src/math/generic/CMakeLists.txt
A libc/src/math/generic/cos.cpp
M libc/src/math/generic/range_reduction_double_common.h
M libc/src/math/generic/sin.cpp
M libc/src/math/generic/sincos_eval.h
M libc/src/math/x86_64/CMakeLists.txt
R libc/src/math/x86_64/cos.cpp
M libc/test/src/math/cos_test.cpp
M libc/test/src/math/smoke/CMakeLists.txt
A libc/test/src/math/smoke/cos_test.cpp
Log Message:
-----------
[libc][math] Implement double precision cos correctly rounded to all rounding modes. (#96591)
Sharing the same algorithm as double precision sin:
https://github.com/llvm/llvm-project/pull/95736
Commit: 04c27852e47093f7efa18609dbf57b3ce58a3ffa
https://github.com/llvm/llvm-project/commit/04c27852e47093f7efa18609dbf57b3ce58a3ffa
Author: Fangrui Song <i at maskray.me>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/include/llvm/MC/MCContext.h
M llvm/include/llvm/MC/MCWinCOFFStreamer.h
M llvm/lib/MC/MCContext.cpp
M llvm/lib/MC/MCObjectFileInfo.cpp
M llvm/lib/MC/MCWinCOFFStreamer.cpp
M llvm/lib/MC/WinCOFFObjectWriter.cpp
M llvm/test/DebugInfo/X86/InlinedFnLocalVar.ll
M llvm/test/DebugInfo/X86/ref_addr_relocation.ll
M llvm/test/ExecutionEngine/RuntimeDyld/X86/COFF_x86_64.s
M llvm/test/MC/COFF/section-comdat-conflict.s
M llvm/test/MC/COFF/section-comdat.s
A llvm/test/MC/COFF/section-sym-err.s
Log Message:
-----------
[MC,COFF] Change how we handle section symbols
13a79bbfe583e1d8cc85d241b580907260065eb8 (2017) unified `BeginSymbol` and
section symbol for ELF. This patch does the same for COFF.
* In getCOFFSection, all sections now have a `BeginSymbol` (section
symbol). We do not need a dummy symbol name when `getBeginSymbol` is
needed (used by AsmParser::Run and DWARF generation).
* Section symbols are in the global symbol table. `call .text` will
reference the section symbol instead of an undefined symbol. This
matches GNU assembler. Unlike GNU, redefining the section symbol will
cause a "symbol 'foo0' is already defined" error (see
`section-sym-err.s`).
Pull Request: https://github.com/llvm/llvm-project/pull/96459
Commit: d26451411f9cab2200a749f5b7a2f4f93c154445
https://github.com/llvm/llvm-project/commit/d26451411f9cab2200a749f5b7a2f4f93c154445
Author: Akshay Deodhar <adeodhar at nvidia.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/test/CodeGen/NVPTX/f16-instructions.ll
Log Message:
-----------
Enforce parameter order in f16 call, flipped call unit tests (#96258)
Commit: 0280f97b36c83a7129e5dbce53c603b7ec5d82fe
https://github.com/llvm/llvm-project/commit/0280f97b36c83a7129e5dbce53c603b7ec5d82fe
Author: Alexey Bataev <a.bataev at outlook.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
A llvm/test/Transforms/SLPVectorizer/X86/insertelement-uses-vectorized-index.ll
Log Message:
-----------
[SLP]Fix PR95925: extract vectorized index of the potential buildvector sequence.
If the vectorized scalar is not the insert value in the buildvector
sequence but the index, it should always be extracted.
Commit: dff6871cdbce8569a141d541023061294860fb95
https://github.com/llvm/llvm-project/commit/dff6871cdbce8569a141d541023061294860fb95
Author: Craig Topper <craig.topper at sifive.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Target/RISCV/GISel/RISCVInstructionSelector.cpp
M llvm/lib/Target/RISCV/GISel/RISCVLegalizerInfo.cpp
A llvm/test/CodeGen/RISCV/GlobalISel/instruction-select/fp-constant-f16.mir
Log Message:
-----------
[RISCV][GISel] Support G_FCONSTANT for Zfh.
Commit: 7e59b20034aa77d69e5218ff44e3cba8a500f76a
https://github.com/llvm/llvm-project/commit/7e59b20034aa77d69e5218ff44e3cba8a500f76a
Author: vaibhav <73255802+mrdaybird at users.noreply.github.com>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
M llvm/lib/Analysis/ScalarEvolution.cpp
A llvm/test/Analysis/ScalarEvolution/pr92560.ll
Log Message:
-----------
[SCEV] Support addrec in right hand side in howManyLessThans (#92560)
Fixes #92554 (std::reverse will auto-vectorize now)
When calculating the number of times an exit condition containing a
comparison is executed, we mostly assume that the RHS of the comparison
is loop-invariant, but it may be another add-recurrence.
~In that case, we can try the computation with `LHS = LHS - RHS` and
`RHS = 0`.~ (It is not valid unless proven that it doesn't wrap)
**Edit:**
We can calculate back edge count for loop structure like:
```cpp
auto left = left_start;
auto right = right_start;
while (left < right) {
  // ...do something...
  left += s1;   // the stride of left is s1 (> 0)
  right -= s2;  // the stride of right is -s2 (s2 > 0)
}
// left and right converge somewhere in the middle of their start values
```
We can calculate the backedge-count as ceil((End - left_start) /u (s1 -
(-s2))), where End = max(left_start, right_start).
**Alive2**: https://alive2.llvm.org/ce/z/ggxx58
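The closed form above can be checked against a direct simulation of the loop. The sketch below is not the ScalarEvolution implementation; the function names are hypothetical, it assumes `s1 > 0`, `s2 > 0` and no integer wrapping, and it counts executions of the loop body for the loop as written (which matches the backedge-taken count when the exit test sits in the loop header).

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Closed form from the message: ceil((End - left_start) / (s1 - (-s2))),
// with End = max(left_start, right_start). Assumes no wrapping.
std::uint64_t backedgeCountFormula(std::uint64_t leftStart,
                                   std::uint64_t rightStart,
                                   std::uint64_t s1, std::uint64_t s2) {
  assert(s1 > 0 && s2 > 0);
  std::uint64_t end = std::max(leftStart, rightStart);
  std::uint64_t dist = end - leftStart;
  std::uint64_t stride = s1 + s2; // s1 - (-s2)
  return dist / stride + (dist % stride != 0); // unsigned ceiling division
}

// Reference: run the loop and count how often the body executes.
std::uint64_t backedgeCountBySimulation(std::int64_t left, std::int64_t right,
                                        std::int64_t s1, std::int64_t s2) {
  std::uint64_t count = 0;
  while (left < right) {
    left += s1;
    right -= s2;
    ++count;
  }
  return count;
}
```

Taking `End` as the maximum of the two start values makes the count zero when `left_start >= right_start`, so the formula also covers loops that never execute.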
Commit: 223fd48362c4387a4145d8dd6532b1fd3f96864a
https://github.com/llvm/llvm-project/commit/223fd48362c4387a4145d8dd6532b1fd3f96864a
Author: Fangrui Song <i at maskray.me>
Date: 2024-06-25 (Tue, 25 Jun 2024)
Changed paths:
A bolt/docs/OptimizingLinux.md
M clang-tools-extra/clang-doc/tool/CMakeLists.txt
M clang-tools-extra/clang-doc/tool/ClangDocMain.cpp
M clang-tools-extra/clang-tidy/utils/DeclRefExprUtils.cpp
M clang-tools-extra/docs/ReleaseNotes.rst
M clang-tools-extra/test/clang-doc/Inputs/basic-project/src/Calculator.cpp
M clang-tools-extra/test/clang-doc/basic-project.test
M clang-tools-extra/test/clang-tidy/checkers/performance/unnecessary-copy-initialization.cpp
M clang/docs/ClangFormatStyleOptions.rst
M clang/docs/LanguageExtensions.rst
M clang/docs/ReleaseNotes.rst
M clang/include/clang/AST/Expr.h
M clang/include/clang/Basic/Builtins.td
M clang/include/clang/Basic/BuiltinsAMDGPU.def
M clang/include/clang/Basic/Cuda.h
M clang/include/clang/Basic/DiagnosticSemaKinds.td
M clang/include/clang/CodeGen/ModuleBuilder.h
M clang/include/clang/Format/Format.h
M clang/lib/AST/ExprConstant.cpp
M clang/lib/AST/Interp/ByteCodeExprGen.cpp
M clang/lib/AST/Interp/Descriptor.cpp
M clang/lib/AST/Interp/Pointer.cpp
M clang/lib/Basic/Cuda.cpp
M clang/lib/Basic/Targets/NVPTX.cpp
M clang/lib/CodeGen/CGBuiltin.cpp
M clang/lib/CodeGen/CGDebugInfo.cpp
M clang/lib/CodeGen/CGDebugInfo.h
M clang/lib/CodeGen/CGOpenMPRuntimeGPU.cpp
M clang/lib/CodeGen/CodeGenModule.cpp
M clang/lib/Driver/Driver.cpp
M clang/lib/Driver/ToolChains/Clang.cpp
M clang/lib/Driver/ToolChains/HIPAMD.cpp
M clang/lib/Driver/ToolChains/HIPAMD.h
M clang/lib/Format/Format.cpp
M clang/lib/Format/UnwrappedLineFormatter.cpp
M clang/lib/Lex/LiteralSupport.cpp
M clang/lib/Sema/SemaChecking.cpp
M clang/lib/Sema/SemaTemplate.cpp
M clang/lib/Sema/SemaType.cpp
M clang/lib/Serialization/ASTWriter.cpp
M clang/test/AST/Interp/literals.cpp
A clang/test/CodeGenCXX/debug-info-verbose-trap.cpp
A clang/test/CodeGenOpenCL/builtins-amdgcn-raw-buffer-store.cl
M clang/test/CodeGenOpenCL/builtins-amdgcn.cl
M clang/test/Driver/cuda-arch-translation.cu
M clang/test/Frontend/embed-bitcode.ll
M clang/test/Misc/target-invalid-cpu-note.c
A clang/test/Modules/no-external-identifier-id.cppm
A clang/test/Modules/no-external-type-id.cppm
M clang/test/Preprocessor/embed_weird.cpp
M clang/test/Sema/arm-vector-types-support.c
R clang/test/SemaCUDA/neon-attrs.cu
A clang/test/SemaCXX/verbose-trap.cpp
A clang/test/SemaOpenCL/builtins-amdgcn-raw-buffer-store-error.cl
M clang/tools/clang-linker-wrapper/ClangLinkerWrapper.cpp
M clang/unittests/Format/ConfigParseTest.cpp
M clang/unittests/Format/FormatTest.cpp
M clang/utils/TableGen/NeonEmitter.cpp
M clang/www/c_status.html
M compiler-rt/lib/asan/asan_globals.cpp
M compiler-rt/lib/asan/asan_interceptors.cpp
M compiler-rt/lib/gwp_asan/guarded_pool_allocator.cpp
M compiler-rt/lib/gwp_asan/platform_specific/guarded_pool_allocator_fuchsia.cpp
M compiler-rt/lib/gwp_asan/platform_specific/guarded_pool_allocator_posix.cpp
M compiler-rt/lib/gwp_asan/utilities.h
M compiler-rt/lib/hwasan/hwasan_interceptors.cpp
M compiler-rt/lib/lsan/lsan_interceptors.cpp
M compiler-rt/lib/memprof/memprof_interceptors.cpp
M compiler-rt/lib/msan/msan_interceptors.cpp
M compiler-rt/lib/sanitizer_common/sanitizer_common_libcdep.cpp
M compiler-rt/lib/sanitizer_common/sanitizer_linux.cpp
M compiler-rt/lib/sanitizer_common/sanitizer_posix.h
M compiler-rt/lib/tsan/rtl/tsan_interceptors_posix.cpp
M compiler-rt/lib/tsan/rtl/tsan_mman.cpp
M compiler-rt/lib/tsan/rtl/tsan_mman.h
M compiler-rt/lib/tsan/rtl/tsan_rtl.cpp
M compiler-rt/test/sanitizer_common/TestCases/Posix/fork_threaded.c
A compiler-rt/test/tsan/Darwin/fork_deadlock.cpp
R compiler-rt/test/tsan/Linux/fork_deadlock.cpp
M flang/include/flang/Lower/AbstractConverter.h
M flang/include/flang/Optimizer/Transforms/Passes.h
M flang/include/flang/Optimizer/Transforms/Passes.td
M flang/include/flang/Tools/CLOptions.inc
M flang/lib/Lower/Bridge.cpp
M flang/lib/Lower/OpenMP/ClauseProcessor.cpp
M flang/lib/Lower/OpenMP/OpenMP.cpp
M flang/lib/Optimizer/Transforms/CMakeLists.txt
A flang/lib/Optimizer/Transforms/ConstantArgumentGlobalisation.cpp
M flang/lib/Optimizer/Transforms/DebugTypeGenerator.cpp
M flang/test/Fir/boxproc.fir
M flang/test/Fir/convert-to-llvm-openmp-and-fir.fir
M flang/test/Integration/OpenMP/map-types-and-sizes.f90
A flang/test/Integration/debug-ptr-type.f90
A flang/test/Lower/OpenMP/common-block-map.f90
A flang/test/Lower/OpenMP/copyprivate2.f90
M flang/test/Lower/character-local-variables.f90
A flang/test/Transforms/constant-argument-globalisation-2.fir
A flang/test/Transforms/constant-argument-globalisation.fir
A flang/test/Transforms/debug-ptr-type.fir
M libc/cmake/modules/prepare_libc_gpu_build.cmake
M libc/config/darwin/arm/entrypoints.txt
M libc/config/linux/aarch64/entrypoints.txt
M libc/config/linux/app.h
M libc/config/linux/arm/entrypoints.txt
M libc/config/linux/riscv/entrypoints.txt
M libc/config/linux/x86_64/entrypoints.txt
M libc/docs/math/index.rst
M libc/spec/stdc.td
M libc/src/__support/FPUtil/CMakeLists.txt
M libc/src/__support/FPUtil/dyadic_float.h
M libc/src/__support/FPUtil/generic/CMakeLists.txt
A libc/src/__support/FPUtil/generic/div.h
M libc/src/__support/OSUtil/linux/arm/syscall.h
M libc/src/__support/threads/thread.h
M libc/src/math/CMakeLists.txt
A libc/src/math/f16divf.h
M libc/src/math/generic/CMakeLists.txt
A libc/src/math/generic/cos.cpp
A libc/src/math/generic/f16divf.cpp
M libc/src/math/generic/range_reduction_double_common.h
M libc/src/math/generic/sin.cpp
M libc/src/math/generic/sincos_eval.h
M libc/src/math/x86_64/CMakeLists.txt
R libc/src/math/x86_64/cos.cpp
M libc/startup/linux/do_start.cpp
M libc/test/src/math/CMakeLists.txt
A libc/test/src/math/DivTest.h
M libc/test/src/math/cos_test.cpp
M libc/test/src/math/exhaustive/CMakeLists.txt
M libc/test/src/math/exhaustive/exhaustive_test.h
A libc/test/src/math/exhaustive/fmodf16_test.cpp
A libc/test/src/math/f16divf_test.cpp
M libc/test/src/math/smoke/CMakeLists.txt
A libc/test/src/math/smoke/DivTest.h
A libc/test/src/math/smoke/cos_test.cpp
A libc/test/src/math/smoke/f16divf_test.cpp
M libc/utils/MPFRWrapper/MPFRUtils.cpp
M libc/utils/MPFRWrapper/MPFRUtils.h
M libcxx/docs/ReleaseNotes/19.rst
M libcxx/include/__atomic/atomic_init.h
M libcxx/include/__mdspan/extents.h
M libcxx/include/__memory/allocator.h
M libcxx/include/__memory/allocator_traits.h
M libcxx/include/__memory/shared_ptr.h
M libcxx/include/__numeric/gcd_lcm.h
M libcxx/include/__type_traits/conditional.h
M libcxx/include/__type_traits/is_nothrow_destructible.h
M libcxx/include/deque
M libcxx/include/forward_list
M libcxx/include/list
M libcxx/include/map
M libcxx/include/set
M libcxx/include/string
M libcxx/include/unordered_map
M libcxx/include/unordered_set
M libcxx/include/vector
M libcxx/modules/CMakeLists.txt.in
M libcxx/src/barrier.cpp
M libcxx/src/filesystem/operations.cpp
M libcxx/src/filesystem/path.cpp
M libcxx/src/filesystem/path_parser.h
M libcxx/src/locale.cpp
R libcxx/test/libcxx/containers/sequences/vector/const_T.compile.pass.cpp
R libcxx/test/libcxx/containers/sequences/vector/const_value_type.pass.cpp
M libcxx/test/libcxx/diagnostics/memory.nodiscard.verify.cpp
M libcxx/test/libcxx/memory/allocator_void.trivial.compile.pass.cpp
M libcxx/test/libcxx/memory/allocator_volatile.verify.cpp
M libcxx/test/std/concepts/concepts.lang/concept.default.init/default_initializable.compile.pass.cpp
A libcxx/test/std/containers/sequences/array/array.overview/nttp.equivalence.compile.pass.cpp
A libcxx/test/std/containers/sequences/array/array.overview/nttp.verify.cpp
M libcxx/test/std/containers/views/mdspan/extents/ctad.pass.cpp
M libcxx/test/std/numerics/numeric.ops/numeric.ops.lcm/lcm.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.ctor.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.dtor.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.members/allocate.constexpr.size.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator.members/allocate.size.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.deprecated_in_cxx17.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.deprecated_in_cxx23.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.pass.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.removed_in_cxx20.verify.cpp
M libcxx/test/std/utilities/memory/default.allocator/allocator_types.removed_in_cxx26.verify.cpp
M libcxx/test/std/utilities/memory/specialized.algorithms/specialized.construct/construct_at.pass.cpp
M libcxx/test/std/utilities/meta/meta.unary/meta.unary.prop/is_nothrow_destructible.pass.cpp
A libcxx/test/std/utilities/utility/pairs/pairs.pair/nttp.equivalence.compile.pass.cpp
A libcxx/test/std/utilities/utility/pairs/pairs.pair/nttp.verify.cpp
M lld/ELF/InputFiles.cpp
M lld/ELF/InputFiles.h
M lld/ELF/InputSection.cpp
M lld/ELF/Relocations.cpp
M lldb/include/lldb/API/SBDebugger.h
M lldb/include/lldb/Symbol/TypeSystem.h
M lldb/source/API/SBDebugger.cpp
M lldb/source/Plugins/Language/CPlusPlus/LibCxx.cpp
M lldb/source/Plugins/SymbolFile/DWARF/DWARFASTParserClang.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARF.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARF.h
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDebugMap.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDebugMap.h
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDwo.cpp
M lldb/source/Plugins/SymbolFile/DWARF/SymbolFileDWARFDwo.h
M lldb/source/Symbol/TypeSystem.cpp
M lldb/tools/lldb-dap/DAP.cpp
M lldb/tools/lldb-dap/DAP.h
M lldb/tools/lldb-dap/lldb-dap.cpp
M llvm/docs/AMDGPUUsage.rst
M llvm/include/llvm/ADT/SetOperations.h
M llvm/include/llvm/ADT/SmallPtrSet.h
M llvm/include/llvm/Analysis/CFGPrinter.h
M llvm/include/llvm/Analysis/InlineModelFeatureMaps.h
M llvm/include/llvm/Analysis/LoopInfo.h
M llvm/include/llvm/CodeGen/GlobalISel/MachineIRBuilder.h
M llvm/include/llvm/CodeGen/MachinePostDominators.h
M llvm/include/llvm/CodeGen/ScheduleDAGInstrs.h
M llvm/include/llvm/IR/Analysis.h
M llvm/include/llvm/IR/IntrinsicsAMDGPU.td
M llvm/include/llvm/MC/MCContext.h
M llvm/include/llvm/MC/MCWinCOFFStreamer.h
M llvm/include/llvm/Passes/MachinePassRegistry.def
M llvm/include/llvm/Support/GenericDomTreeConstruction.h
M llvm/lib/Analysis/CallGraph.cpp
M llvm/lib/Analysis/ConstraintSystem.cpp
M llvm/lib/Analysis/InstructionSimplify.cpp
M llvm/lib/Analysis/LoopAccessAnalysis.cpp
M llvm/lib/Analysis/LoopInfo.cpp
M llvm/lib/Analysis/MLInlineAdvisor.cpp
M llvm/lib/Analysis/PHITransAddr.cpp
M llvm/lib/Analysis/ScalarEvolution.cpp
M llvm/lib/Analysis/TargetTransformInfo.cpp
M llvm/lib/Analysis/models/gen-inline-oz-test-model.py
M llvm/lib/Bitcode/Writer/BitcodeWriter.cpp
M llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp
M llvm/lib/CodeGen/AssignmentTrackingAnalysis.cpp
M llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp
M llvm/lib/CodeGen/MIRParser/MILexer.cpp
M llvm/lib/CodeGen/MIRParser/MILexer.h
M llvm/lib/CodeGen/MIRParser/MIParser.cpp
M llvm/lib/CodeGen/MLRegAllocEvictAdvisor.cpp
M llvm/lib/CodeGen/MachinePostDominators.cpp
M llvm/lib/CodeGen/MachineSSAUpdater.cpp
M llvm/lib/DWARFLinker/Parallel/ArrayList.h
M llvm/lib/DWP/DWP.cpp
M llvm/lib/ExecutionEngine/Orc/Core.cpp
M llvm/lib/ExecutionEngine/Orc/Debugging/DebugInfoSupport.cpp
M llvm/lib/Frontend/OpenMP/OMPIRBuilder.cpp
M llvm/lib/MC/MCContext.cpp
M llvm/lib/MC/MCDwarf.cpp
M llvm/lib/MC/MCObjectFileInfo.cpp
M llvm/lib/MC/MCWinCOFFStreamer.cpp
M llvm/lib/MC/WinCOFFObjectWriter.cpp
M llvm/lib/Passes/PassBuilder.cpp
M llvm/lib/ProfileData/InstrProfReader.cpp
M llvm/lib/Target/AArch64/AArch64ISelDAGToDAG.cpp
M llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
M llvm/lib/Target/AArch64/AArch64InstrInfo.cpp
M llvm/lib/Target/AArch64/AArch64Subtarget.h
M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp
M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.h
M llvm/lib/Target/AMDGPU/AMDGPU.td
M llvm/lib/Target/AMDGPU/AMDGPUArgumentUsageInfo.cpp
M llvm/lib/Target/AMDGPU/AMDGPUArgumentUsageInfo.h
M llvm/lib/Target/AMDGPU/AMDGPUAsmPrinter.cpp
M llvm/lib/Target/AMDGPU/AMDGPUAtomicOptimizer.cpp
M llvm/lib/Target/AMDGPU/AMDGPUIGroupLP.cpp
M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp
M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.h
M llvm/lib/Target/AMDGPU/AMDGPUSubtarget.cpp
M llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
M llvm/lib/Target/AMDGPU/BUFInstructions.td
M llvm/lib/Target/AMDGPU/GCNSubtarget.h
M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUMCExpr.cpp
M llvm/lib/Target/AMDGPU/MCTargetDesc/AMDGPUMCExpr.h
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/lib/Target/AMDGPU/SIInstructions.td
M llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.cpp
M llvm/lib/Target/AMDGPU/SIMachineFunctionInfo.h
M llvm/lib/Target/AMDGPU/VOP1Instructions.td
M llvm/lib/Target/AMDGPU/VOP2Instructions.td
M llvm/lib/Target/ARM/ARMBaseInstrInfo.cpp
M llvm/lib/Target/ARM/ARMISelLowering.cpp
M llvm/lib/Target/BPF/BPFRegisterInfo.cpp
M llvm/lib/Target/LoongArch/LoongArchISelLowering.cpp
M llvm/lib/Target/LoongArch/LoongArchISelLowering.h
M llvm/lib/Target/LoongArch/LoongArchInstrInfo.td
M llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp
M llvm/lib/Target/NVPTX/NVVMReflect.cpp
M llvm/lib/Target/PowerPC/PPCMergeStringPool.cpp
M llvm/lib/Target/RISCV/GISel/RISCVInstructionSelector.cpp
M llvm/lib/Target/RISCV/GISel/RISCVLegalizerInfo.cpp
M llvm/lib/Target/RISCV/GISel/RISCVRegisterBankInfo.cpp
M llvm/lib/Target/RISCV/RISCV.td
M llvm/lib/Target/RISCV/RISCVProcessors.td
A llvm/lib/Target/RISCV/RISCVSchedSyntacoreSCR3.td
M llvm/lib/Target/SPIRV/MCTargetDesc/SPIRVBaseInfo.cpp
M llvm/lib/Target/SPIRV/MCTargetDesc/SPIRVBaseInfo.h
M llvm/lib/Target/SPIRV/SPIRVBuiltins.cpp
M llvm/lib/Target/SPIRV/SPIRVBuiltins.td
M llvm/lib/Target/SPIRV/SPIRVGlobalRegistry.cpp
M llvm/lib/Target/SPIRV/SPIRVGlobalRegistry.h
M llvm/lib/Target/SPIRV/SPIRVInstructionSelector.cpp
M llvm/lib/Target/SPIRV/SPIRVUtils.cpp
M llvm/lib/Target/WebAssembly/WebAssemblyInstrSIMD.td
M llvm/lib/Target/X86/X86ISelLowering.cpp
M llvm/lib/Target/X86/X86TargetTransformInfo.cpp
M llvm/lib/Target/Xtensa/XtensaAsmPrinter.cpp
M llvm/lib/Target/Xtensa/XtensaAsmPrinter.h
M llvm/lib/Target/Xtensa/XtensaISelLowering.cpp
M llvm/lib/Target/Xtensa/XtensaISelLowering.h
M llvm/lib/Target/Xtensa/XtensaInstrInfo.td
M llvm/lib/Target/Xtensa/XtensaOperators.td
M llvm/lib/TargetParser/RISCVISAInfo.cpp
M llvm/lib/Transforms/IPO/MemProfContextDisambiguation.cpp
M llvm/lib/Transforms/IPO/SCCP.cpp
M llvm/lib/Transforms/Scalar/LoopDistribute.cpp
M llvm/lib/Transforms/Scalar/Reassociate.cpp
M llvm/lib/Transforms/Utils/CodeLayout.cpp
M llvm/lib/Transforms/Utils/LoopUnrollRuntime.cpp
M llvm/lib/Transforms/Utils/PromoteMemoryToRegister.cpp
M llvm/lib/Transforms/Vectorize/LoopVectorizationPlanner.h
M llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
M llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
M llvm/lib/Transforms/Vectorize/VPlan.h
M llvm/lib/Transforms/Vectorize/VectorCombine.cpp
M llvm/runtimes/CMakeLists.txt
M llvm/test/Analysis/CostModel/AArch64/masked_ldst.ll
M llvm/test/Analysis/CostModel/AArch64/sve-gather.ll
M llvm/test/Analysis/CostModel/AArch64/sve-ldst.ll
M llvm/test/Analysis/CostModel/AArch64/sve-scatter.ll
M llvm/test/Analysis/CostModel/X86/arith-int-codesize.ll
M llvm/test/Analysis/CostModel/X86/arith-int-latency.ll
M llvm/test/Analysis/CostModel/X86/arith-int-sizelatency.ll
M llvm/test/Analysis/CostModel/X86/arith-int.ll
M llvm/test/Analysis/CostModel/X86/rem-codesize.ll
M llvm/test/Analysis/CostModel/X86/rem-latency.ll
M llvm/test/Analysis/CostModel/X86/rem-sizelatency.ll
M llvm/test/Analysis/CostModel/X86/rem.ll
M llvm/test/Analysis/CostModel/X86/slm-arith-costs.ll
A llvm/test/Analysis/LoopAccessAnalysis/debug-loc.ll
M llvm/test/Analysis/LoopAccessAnalysis/print-order.ll
M llvm/test/Analysis/LoopAccessAnalysis/scalable-vector-regression-tests.ll
A llvm/test/Analysis/ScalarEvolution/pr92560.ll
M llvm/test/Analysis/UniformityAnalysis/AMDGPU/intrinsics.ll
M llvm/test/CodeGen/AArch64/ldexp.ll
A llvm/test/CodeGen/AArch64/taildup-ssa-update-pr62712.mir
M llvm/test/CodeGen/AMDGPU/GlobalISel/atomic_optimizations_mul_one.ll
M llvm/test/CodeGen/AMDGPU/atomic_optimization_split_dt_update.ll
M llvm/test/CodeGen/AMDGPU/buffer-fat-pointer-atomicrmw-fadd.ll
M llvm/test/CodeGen/AMDGPU/cgp-addressing-modes-smem.ll
A llvm/test/CodeGen/AMDGPU/convergence-laneops.ll
M llvm/test/CodeGen/AMDGPU/global-atomic-scan.ll
M llvm/test/CodeGen/AMDGPU/global_atomic_optimizer_fp_rtn.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_iterative_scan.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_iterative_scan_fp.ll
M llvm/test/CodeGen/AMDGPU/global_atomics_optimizer_fp_no_rtn.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readfirstlane.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readfirstlane.ptr.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readlane.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.readlane.ptr.ll
M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.writelane.ll
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.writelane.ptr.ll
A llvm/test/CodeGen/ARM/ldexp.ll
M llvm/test/CodeGen/LoongArch/machinelicm-address-pseudos.ll
M llvm/test/CodeGen/LoongArch/psabi-restricted-scheduling.ll
A llvm/test/CodeGen/MIR/Generic/bfloat-immediates.mir
M llvm/test/CodeGen/NVPTX/f16-instructions.ll
A llvm/test/CodeGen/NVPTX/max-align.ll
A llvm/test/CodeGen/RISCV/GlobalISel/instruction-select/fp-arith-f16.mir
M llvm/test/CodeGen/RISCV/GlobalISel/instruction-select/fp-arith.mir
A llvm/test/CodeGen/RISCV/GlobalISel/instruction-select/fp-constant-f16.mir
A llvm/test/CodeGen/RISCV/GlobalISel/legalizer/legalize-fp-arith-f16.mir
A llvm/test/CodeGen/RISCV/GlobalISel/regbankselect/fp-arith-f16.mir
A llvm/test/CodeGen/SPIRV/var-uniform-const.ll
M llvm/test/CodeGen/WebAssembly/half-precision.ll
M llvm/test/CodeGen/X86/avx2-arith.ll
M llvm/test/CodeGen/X86/ldexp.ll
M llvm/test/CodeGen/X86/llvm.frexp.ll
M llvm/test/CodeGen/X86/midpoint-int-vec-128.ll
M llvm/test/CodeGen/X86/midpoint-int-vec-256.ll
M llvm/test/CodeGen/X86/min-legal-vector-width.ll
M llvm/test/CodeGen/X86/pmul.ll
M llvm/test/CodeGen/X86/prefer-avx256-wide-mul.ll
M llvm/test/CodeGen/X86/vector-shuffle-combining-sse41.ll
A llvm/test/CodeGen/Xtensa/blockaddress.ll
A llvm/test/CodeGen/Xtensa/brcc.ll
A llvm/test/CodeGen/Xtensa/indirectbr.ll
A llvm/test/CodeGen/Xtensa/jumpt.ll
M llvm/test/DebugInfo/X86/InlinedFnLocalVar.ll
M llvm/test/DebugInfo/X86/ref_addr_relocation.ll
M llvm/test/DebugInfo/attr-btf_type_tag.ll
M llvm/test/DebugInfo/unrolled-loop-remainder.ll
M llvm/test/ExecutionEngine/RuntimeDyld/X86/COFF_x86_64.s
M llvm/test/MC/COFF/section-comdat-conflict.s
M llvm/test/MC/COFF/section-comdat.s
A llvm/test/MC/COFF/section-sym-err.s
M llvm/test/MC/WebAssembly/simd-encodings.s
M llvm/test/Transforms/EarlyCSE/flags.ll
M llvm/test/Transforms/FunctionSpecialization/track-return.ll
A llvm/test/Transforms/Inline/ML/avail-external.ll
A llvm/test/Transforms/InstCombine/AArch64/sve-intrinsic-comb-no-active-lanes-loads.ll
M llvm/test/Transforms/InstCombine/AMDGPU/amdgcn-intrinsics.ll
M llvm/test/Transforms/InstCombine/phi.ll
M llvm/test/Transforms/InstCombine/shift.ll
M llvm/test/Transforms/InstSimplify/phi.ll
M llvm/test/Transforms/LoopDeletion/update-scev.ll
A llvm/test/Transforms/LoopDistribute/debug-loc.ll
A llvm/test/Transforms/LoopDistribute/debug-print.ll
M llvm/test/Transforms/LoopUnroll/ARM/multi-blocks.ll
M llvm/test/Transforms/LoopUnroll/runtime-exit-phi-scev-invalidation.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop-at-most-two-exits.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop-multiple-exits.ll
M llvm/test/Transforms/LoopUnroll/runtime-loop5.ll
M llvm/test/Transforms/LoopUnroll/runtime-multiexit-heuristic.ll
M llvm/test/Transforms/LoopUnroll/runtime-unroll-remainder.ll
M llvm/test/Transforms/LoopUnroll/tripcount-overflow.ll
M llvm/test/Transforms/LoopUnrollAndJam/unroll-and-jam.ll
M llvm/test/Transforms/LoopVectorize/AArch64/scalable-reductions.ll
M llvm/test/Transforms/LoopVectorize/AArch64/sve-illegal-type.ll
A llvm/test/Transforms/LoopVectorize/AArch64/sve-invalid-costs.ll
M llvm/test/Transforms/LoopVectorize/ARM/mve-hoist-runtime-checks.ll
M llvm/test/Transforms/LoopVectorize/X86/float-induction-x86.ll
M llvm/test/Transforms/LoopVectorize/create-induction-resume.ll
M llvm/test/Transforms/LoopVectorize/runtime-checks-hoist.ll
M llvm/test/Transforms/LoopVectorize/uniform-blend.ll
M llvm/test/Transforms/Mem2Reg/preserve-nonnull-load-metadata.ll
M llvm/test/Transforms/PhaseOrdering/AArch64/interleavevectorization.ll
M llvm/test/Transforms/SCCP/2009-01-14-IPSCCP-Invoke.ll
M llvm/test/Transforms/SCCP/apint-ipsccp2.ll
M llvm/test/Transforms/SCCP/arg-count-mismatch.ll
M llvm/test/Transforms/SCCP/indirectbr.ll
M llvm/test/Transforms/SCCP/ip-constant-ranges.ll
M llvm/test/Transforms/SCCP/ip-ranges-phis.ll
M llvm/test/Transforms/SCCP/ip-ranges-select.ll
M llvm/test/Transforms/SCCP/ipsccp-basic.ll
M llvm/test/Transforms/SCCP/ipsccp-clear-returned.ll
M llvm/test/Transforms/SCCP/ipsccp-noundef.ll
M llvm/test/Transforms/SCCP/ipsccp-preserve-analysis.ll
M llvm/test/Transforms/SCCP/issue59602-assume-like-call-users.ll
M llvm/test/Transforms/SCCP/metadata.ll
M llvm/test/Transforms/SCCP/musttail-call.ll
M llvm/test/Transforms/SCCP/range-and-ip.ll
M llvm/test/Transforms/SCCP/range-attribute.ll
M llvm/test/Transforms/SCCP/remove-call-inst.ll
M llvm/test/Transforms/SCCP/resolvedundefsin-tracked-fn.ll
M llvm/test/Transforms/SCCP/return-constant.ll
M llvm/test/Transforms/SCCP/solve-after-each-resolving-undefs-for-function.ll
M llvm/test/Transforms/SCCP/struct-arg-resolve-undefs.ll
M llvm/test/Transforms/SLPVectorizer/RISCV/shuffled-gather-casted.ll
A llvm/test/Transforms/SLPVectorizer/X86/insertelement-uses-vectorized-index.ll
M llvm/test/Transforms/VectorCombine/AArch64/shuffletoidentity-concat.ll
M llvm/test/Transforms/VectorCombine/AArch64/shuffletoidentity.ll
M llvm/test/tools/llvm-dwp/X86/merge.test
A llvm/test/tools/llvm-dwp/X86/merge_v5.test
A llvm/test/tools/llvm-mca/RISCV/SyntacoreSCR/SCR3-ALU.s
A llvm/test/tools/llvm-mca/RISCV/SyntacoreSCR/SCR3-LSU.s
M llvm/tools/llvm-jitlink/llvm-jitlink.cpp
M llvm/unittests/ADT/SmallPtrSetTest.cpp
M llvm/unittests/CodeGen/GlobalISel/MachineIRBuilderTest.cpp
M llvm/unittests/IR/VectorBuilderTest.cpp
M llvm/unittests/Transforms/Vectorize/VPlanTest.cpp
M llvm/utils/TableGen/ARMTargetDefEmitter.cpp
M llvm/utils/TableGen/ExegesisEmitter.cpp
M mlir/docs/DialectConversion.md
M mlir/include/mlir/Dialect/Arith/IR/ArithOps.td
M mlir/include/mlir/Dialect/OpenMP/OpenMPClauseOperands.h
M mlir/include/mlir/Dialect/OpenMP/OpenMPEnums.td
M mlir/include/mlir/Dialect/OpenMP/OpenMPOps.td
M mlir/include/mlir/Target/LLVM/ROCDL/Utils.h
M mlir/include/mlir/Transforms/DialectConversion.h
M mlir/lib/Dialect/Arith/IR/ArithOps.cpp
M mlir/lib/Dialect/ArmSME/Transforms/VectorLegalization.cpp
M mlir/lib/Dialect/OpenMP/IR/OpenMPDialect.cpp
M mlir/lib/Dialect/Tensor/IR/TensorOps.cpp
M mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp
M mlir/lib/Target/LLVM/CMakeLists.txt
M mlir/lib/Target/LLVM/ROCDL/Target.cpp
M mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
M mlir/lib/Transforms/Utils/DialectConversion.cpp
M mlir/test/Dialect/Arith/canonicalize.mlir
M mlir/test/Dialect/ArmSME/vector-legalization.mlir
M mlir/test/Dialect/OpenMP/invalid.mlir
M mlir/test/Dialect/OpenMP/ops.mlir
M mlir/test/Dialect/Tensor/fold-tensor-subset-ops-into-vector-transfers.mlir
M mlir/test/Dialect/Tensor/fold-tensor-subset-ops.mlir
M mlir/test/Dialect/Vector/fold-arith-extf-into-vector-contract.mlir
M mlir/test/Integration/Dialect/Linalg/CPU/ArmSME/multi-tile-matmul.mlir
A mlir/test/Target/LLVMIR/omptarget-fortran-common-block-host.mlir
M mlir/test/Transforms/test-legalize-type-conversion.mlir
A offload/test/offloading/fortran/target-map-all-common-block-members.f90
A offload/test/offloading/fortran/target-map-common-block.f90
A offload/test/offloading/fortran/target-map-declare-target-link-common-block.f90
A offload/test/offloading/fortran/target-map-first-common-block-member.f90
A offload/test/offloading/fortran/target-map-mix-imp-exp-common-block-members.f90
A offload/test/offloading/fortran/target-map-second-common-block-member.f90
M runtimes/cmake/Modules/WarningFlags.cmake
M utils/bazel/llvm-project-overlay/mlir/BUILD.bazel
Log Message:
-----------
xxx
Created using spr 1.3.5-bogner
Compare: https://github.com/llvm/llvm-project/compare/bc7cadd9ab2c...223fd48362c4