[llvm] [AMDGPU] InstCombine llvm.amdgcn.ds.bpermute with uniform arguments (PR #129895)
LLVM Continuous Integration via llvm-commits
llvm-commits at lists.llvm.org
Thu Mar 6 06:36:33 PST 2025
llvm-ci wrote:
LLVM Buildbot has detected a new failure on builder `amdgpu-offload-ubuntu-22-cmake-build-only` running on `rocm-docker-ubu-22` while building `llvm` at step 4 "annotate".
Full details are available at: https://lab.llvm.org/buildbot/#/builders/203/builds/3742
<details>
<summary>Here is the relevant piece of the build log for the reference</summary>
```
Step 4 (annotate) failure: '../llvm-zorg/zorg/buildbot/builders/annotated/amdgpu-offload-cmake.py --jobs=32' (failure)
...
[46/60] Building LLVM bitcode Parallelism.cpp-nvptx.bc
[47/60] Building LLVM bitcode Parallelism.cpp-amdgpu.bc
[48/60] Building LLVM bitcode Workshare.cpp-amdgpu.bc
[49/60] Building LLVM bitcode Workshare.cpp-nvptx.bc
[50/60] Linking LLVM bitcode libomptarget-amdgpu.bc
[51/60] Linking LLVM bitcode libomptarget-nvptx.bc
[52/60] Internalizing LLVM bitcode libomptarget-amdgpu.bc
[53/60] Internalizing LLVM bitcode libomptarget-nvptx.bc
/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/opt: WARNING: failed to create target machine for 'nvptx64-nvidia-cuda': unable to get target for 'nvptx64-nvidia-cuda', see --version and --triple.
[54/60] Optimizing LLVM bitcode libomptarget-amdgpu.bc
FAILED: offload/DeviceRTL/libomptarget-amdgpu.bc /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL/libomptarget-amdgpu.bc
cd /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL && /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/opt -O3 -openmp-opt-disable -attributor-enable=module -vectorize-slp=false /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL/internalized_libomptarget-amdgpu.bc -o /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL/libomptarget-amdgpu.bc
opt: /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/llvm-project/llvm/lib/IR/Intrinsics.cpp:157: std::string getIntrinsicNameImpl(llvm::Intrinsic::ID, llvm::ArrayRef<llvm::Type*>, llvm::Module*, llvm::FunctionType*, bool): Assertion `(Tys.empty() || Intrinsic::isOverloaded(Id)) && "This version of getName is for overloaded intrinsics only"' failed.
PLEASE submit a bug report to https://github.com/llvm/llvm-project/issues/ and include the crash backtrace.
Stack dump:
0. Program arguments: /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/opt -O3 -openmp-opt-disable -attributor-enable=module -vectorize-slp=false /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL/internalized_libomptarget-amdgpu.bc -o /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL/libomptarget-amdgpu.bc
1. Running pass "require<globals-aa>,function(invalidate<aa>),require<profile-summary>,cgscc(devirt<4>(inline,function-attrs<skip-non-recursive-function-attrs>,argpromotion,openmp-opt-cgscc,function(amdgpu-promote-kernel-arguments,infer-address-spaces,amdgpu-lower-kernel-attributes,amdgpu-promote-alloca-to-vector),function<eager-inv;no-rerun>(sroa<modify-cfg>,early-cse<memssa>,speculative-execution<only-if-divergent-target>,jump-threading,correlated-propagation,simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;no-sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,instcombine<max-iterations=1;no-verify-fixpoint>,aggressive-instcombine,libcalls-shrinkwrap,amdgpu-usenative,amdgpu-simplifylib,tailcallelim,simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;no-sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,reassociate,constraint-elimination,loop-mssa(loop-instsimplify,loop-simplifycfg,licm<no-allowspeculation>,loop-rotate<header-duplication;no-prepare-for-lto>,licm<allowspeculation>,simple-loop-unswitch<nontrivial;trivial>),simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;no-sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,instcombine<max-iterations=1;no-verify-fixpoint>,loop(loop-idiom,indvars,extra-simple-loop-unswitch-passes,loop-deletion,loop-unroll-full),sroa<modify-cfg>,vector-combine,mldst-motion<no-split-footer-bb>,gvn<>,sccp,bdce,instcombine<max-iterations=1;no-verify-fixpoint>,amdgpu-usenative,amdgpu-simplifylib,jump-threading,correlated-propagation,adce,memcpyopt,dse,move-auto-init,loop-mssa(licm<allowspeculation>),coro-elide,simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,instcombine<max-iterations=1;no-verify-fixpoint>,amdgpu-usenative,amdgpu-simplifylib),function-attrs,function(require<should-not-run-function-passes>),coro-split,coro-annotation-elide)),function(invalidate<should-not-run-function-passes>),cgscc(devirt<4>())" on module "/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL/internalized_libomptarget-amdgpu.bc"
2. Running pass "cgscc(devirt<4>(inline,function-attrs<skip-non-recursive-function-attrs>,argpromotion,openmp-opt-cgscc,function(amdgpu-promote-kernel-arguments,infer-address-spaces,amdgpu-lower-kernel-attributes,amdgpu-promote-alloca-to-vector),function<eager-inv;no-rerun>(sroa<modify-cfg>,early-cse<memssa>,speculative-execution<only-if-divergent-target>,jump-threading,correlated-propagation,simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;no-sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,instcombine<max-iterations=1;no-verify-fixpoint>,aggressive-instcombine,libcalls-shrinkwrap,amdgpu-usenative,amdgpu-simplifylib,tailcallelim,simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;no-sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,reassociate,constraint-elimination,loop-mssa(loop-instsimplify,loop-simplifycfg,licm<no-allowspeculation>,loop-rotate<header-duplication;no-prepare-for-lto>,licm<allowspeculation>,simple-loop-unswitch<nontrivial;trivial>),simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;no-sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,instcombine<max-iterations=1;no-verify-fixpoint>,loop(loop-idiom,indvars,extra-simple-loop-unswitch-passes,loop-deletion,loop-unroll-full),sroa<modify-cfg>,vector-combine,mldst-motion<no-split-footer-bb>,gvn<>,sccp,bdce,instcombine<max-iterations=1;no-verify-fixpoint>,amdgpu-usenative,amdgpu-simplifylib,jump-threading,correlated-propagation,adce,memcpyopt,dse,move-auto-init,loop-mssa(licm<allowspeculation>),coro-elide,simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,instcombine<max-iterations=1;no-verify-fixpoint>,amdgpu-usenative,amdgpu-simplifylib),function-attrs,function(require<should-not-run-function-passes>),coro-split,coro-annotation-elide))" on module "/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL/internalized_libomptarget-amdgpu.bc"
3. Running pass "instcombine<max-iterations=1;no-verify-fixpoint>" on function "ompx_shfl_down_sync_f"
#0 0x000076d2859e2910 llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/libLLVMSupport.so.21.0git+0x1e2910)
#1 0x000076d2859dfd0f llvm::sys::RunSignalHandlers() (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/libLLVMSupport.so.21.0git+0x1dfd0f)
#2 0x000076d2859dfe5a SignalHandler(int, siginfo_t*, void*) Signals.cpp:0:0
#3 0x000076d285619520 (/lib/x86_64-linux-gnu/libc.so.6+0x42520)
#4 0x000076d28566d9fc pthread_kill (/lib/x86_64-linux-gnu/libc.so.6+0x969fc)
#5 0x000076d285619476 gsignal (/lib/x86_64-linux-gnu/libc.so.6+0x42476)
#6 0x000076d2855ff7f3 abort (/lib/x86_64-linux-gnu/libc.so.6+0x287f3)
#7 0x000076d2855ff71b (/lib/x86_64-linux-gnu/libc.so.6+0x2871b)
#8 0x000076d285610e96 (/lib/x86_64-linux-gnu/libc.so.6+0x39e96)
#9 0x000076d27f83e402 getIntrinsicNameImpl(unsigned int, llvm::ArrayRef<llvm::Type*>, llvm::Module*, llvm::FunctionType*, bool) Intrinsics.cpp:0:0
#10 0x000076d27f83e6a1 llvm::Intrinsic::getOrInsertDeclaration(llvm::Module*, unsigned int, llvm::ArrayRef<llvm::Type*>) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMCore.so.21.0git+0x23e6a1)
#11 0x000076d2843a7aaa llvm::GCNTTIImpl::instCombineIntrinsic(llvm::InstCombiner&, llvm::IntrinsicInst&) const (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMAMDGPUCodeGen.so.21.0git+0x1a7aaa)
#12 0x000076d2826d8c0b llvm::InstCombinerImpl::visitCallInst(llvm::CallInst&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMInstCombine.so.21.0git+0xb4c0b)
#13 0x000076d28267bc57 llvm::InstCombinerImpl::run() (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMInstCombine.so.21.0git+0x57c57)
#14 0x000076d28267d691 combineInstructionsOverFunction(llvm::Function&, llvm::InstructionWorklist&, llvm::AAResults*, llvm::AssumptionCache&, llvm::TargetLibraryInfo&, llvm::TargetTransformInfo&, llvm::DominatorTree&, llvm::OptimizationRemarkEmitter&, llvm::BlockFrequencyInfo*, llvm::BranchProbabilityInfo*, llvm::ProfileSummaryInfo*, llvm::InstCombineOptions const&) InstructionCombining.cpp:0:0
#15 0x000076d28267ead8 llvm::InstCombinePass::run(llvm::Function&, llvm::AnalysisManager<llvm::Function>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMInstCombine.so.21.0git+0x5aad8)
#16 0x000076d282487aa6 llvm::detail::PassModel<llvm::Function, llvm::InstCombinePass, llvm::AnalysisManager<llvm::Function>>::run(llvm::Function&, llvm::AnalysisManager<llvm::Function>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMPasses.so.21.0git+0x87aa6)
#17 0x000076d27f8f998f llvm::PassManager<llvm::Function, llvm::AnalysisManager<llvm::Function>>::run(llvm::Function&, llvm::AnalysisManager<llvm::Function>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMCore.so.21.0git+0x2f998f)
#18 0x000076d2845bbd26 llvm::detail::PassModel<llvm::Function, llvm::PassManager<llvm::Function, llvm::AnalysisManager<llvm::Function>>, llvm::AnalysisManager<llvm::Function>>::run(llvm::Function&, llvm::AnalysisManager<llvm::Function>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMAMDGPUCodeGen.so.21.0git+0x3bbd26)
#19 0x000076d27fd62944 llvm::CGSCCToFunctionPassAdaptor::run(llvm::LazyCallGraph::SCC&, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>&, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMAnalysis.so.21.0git+0x162944)
#20 0x000076d2845bc876 llvm::detail::PassModel<llvm::LazyCallGraph::SCC, llvm::CGSCCToFunctionPassAdaptor, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&>::run(llvm::LazyCallGraph::SCC&, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>&, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMAMDGPUCodeGen.so.21.0git+0x3bc876)
#21 0x000076d27fd5b317 llvm::PassManager<llvm::LazyCallGraph::SCC, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&>::run(llvm::LazyCallGraph::SCC&, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>&, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMAnalysis.so.21.0git+0x15b317)
#22 0x000076d282486546 llvm::detail::PassModel<llvm::LazyCallGraph::SCC, llvm::PassManager<llvm::LazyCallGraph::SCC, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&>, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&>::run(llvm::LazyCallGraph::SCC&, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>&, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMPasses.so.21.0git+0x86546)
#23 0x000076d27fd63bfd llvm::DevirtSCCRepeatedPass::run(llvm::LazyCallGraph::SCC&, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>&, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMAnalysis.so.21.0git+0x163bfd)
#24 0x000076d282486596 llvm::detail::PassModel<llvm::LazyCallGraph::SCC, llvm::DevirtSCCRepeatedPass, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&>::run(llvm::LazyCallGraph::SCC&, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>&, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMPasses.so.21.0git+0x86596)
#25 0x000076d27fd5e3ce llvm::ModuleToPostOrderCGSCCPassAdaptor::run(llvm::Module&, llvm::AnalysisManager<llvm::Module>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMAnalysis.so.21.0git+0x15e3ce)
#26 0x000076d2824864f6 llvm::detail::PassModel<llvm::Module, llvm::ModuleToPostOrderCGSCCPassAdaptor, llvm::AnalysisManager<llvm::Module>>::run(llvm::Module&, llvm::AnalysisManager<llvm::Module>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMPasses.so.21.0git+0x864f6)
#27 0x000076d27f8f88ed llvm::PassManager<llvm::Module, llvm::AnalysisManager<llvm::Module>>::run(llvm::Module&, llvm::AnalysisManager<llvm::Module>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMCore.so.21.0git+0x2f88ed)
#28 0x000076d28201505a llvm::ModuleInlinerWrapperPass::run(llvm::Module&, llvm::AnalysisManager<llvm::Module>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMipo.so.21.0git+0x21505a)
#29 0x000076d282485906 llvm::detail::PassModel<llvm::Module, llvm::ModuleInlinerWrapperPass, llvm::AnalysisManager<llvm::Module>>::run(llvm::Module&, llvm::AnalysisManager<llvm::Module>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMPasses.so.21.0git+0x85906)
Step 7 (build cmake config) failure: build cmake config (failure)
...
[46/60] Building LLVM bitcode Parallelism.cpp-nvptx.bc
[47/60] Building LLVM bitcode Parallelism.cpp-amdgpu.bc
[48/60] Building LLVM bitcode Workshare.cpp-amdgpu.bc
[49/60] Building LLVM bitcode Workshare.cpp-nvptx.bc
[50/60] Linking LLVM bitcode libomptarget-amdgpu.bc
[51/60] Linking LLVM bitcode libomptarget-nvptx.bc
[52/60] Internalizing LLVM bitcode libomptarget-amdgpu.bc
[53/60] Internalizing LLVM bitcode libomptarget-nvptx.bc
/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/opt: WARNING: failed to create target machine for 'nvptx64-nvidia-cuda': unable to get target for 'nvptx64-nvidia-cuda', see --version and --triple.
[54/60] Optimizing LLVM bitcode libomptarget-amdgpu.bc
FAILED: offload/DeviceRTL/libomptarget-amdgpu.bc /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL/libomptarget-amdgpu.bc
cd /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL && /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/opt -O3 -openmp-opt-disable -attributor-enable=module -vectorize-slp=false /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL/internalized_libomptarget-amdgpu.bc -o /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL/libomptarget-amdgpu.bc
opt: /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/llvm-project/llvm/lib/IR/Intrinsics.cpp:157: std::string getIntrinsicNameImpl(llvm::Intrinsic::ID, llvm::ArrayRef<llvm::Type*>, llvm::Module*, llvm::FunctionType*, bool): Assertion `(Tys.empty() || Intrinsic::isOverloaded(Id)) && "This version of getName is for overloaded intrinsics only"' failed.
PLEASE submit a bug report to https://github.com/llvm/llvm-project/issues/ and include the crash backtrace.
Stack dump:
0. Program arguments: /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/opt -O3 -openmp-opt-disable -attributor-enable=module -vectorize-slp=false /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL/internalized_libomptarget-amdgpu.bc -o /home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL/libomptarget-amdgpu.bc
1. Running pass "require<globals-aa>,function(invalidate<aa>),require<profile-summary>,cgscc(devirt<4>(inline,function-attrs<skip-non-recursive-function-attrs>,argpromotion,openmp-opt-cgscc,function(amdgpu-promote-kernel-arguments,infer-address-spaces,amdgpu-lower-kernel-attributes,amdgpu-promote-alloca-to-vector),function<eager-inv;no-rerun>(sroa<modify-cfg>,early-cse<memssa>,speculative-execution<only-if-divergent-target>,jump-threading,correlated-propagation,simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;no-sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,instcombine<max-iterations=1;no-verify-fixpoint>,aggressive-instcombine,libcalls-shrinkwrap,amdgpu-usenative,amdgpu-simplifylib,tailcallelim,simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;no-sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,reassociate,constraint-elimination,loop-mssa(loop-instsimplify,loop-simplifycfg,licm<no-allowspeculation>,loop-rotate<header-duplication;no-prepare-for-lto>,licm<allowspeculation>,simple-loop-unswitch<nontrivial;trivial>),simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;no-sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,instcombine<max-iterations=1;no-verify-fixpoint>,loop(loop-idiom,indvars,extra-simple-loop-unswitch-passes,loop-deletion,loop-unroll-full),sroa<modify-cfg>,vector-combine,mldst-motion<no-split-footer-bb>,gvn<>,sccp,bdce,instcombine<max-iterations=1;no-verify-fixpoint>,amdgpu-usenative,amdgpu-simplifylib,jump-threading,correlated-propagation,adce,memcpyopt,dse,move-auto-init,loop-mssa(licm<allowspeculation>),coro-elide,simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,instcombine<max-iterations=1;no-verify-fixpoint>,amdgpu-usenative,amdgpu-simplifylib),function-attrs,function(require<should-not-run-function-passes>),coro-split,coro-annotation-elide)),function(invalidate<should-not-run-function-passes>),cgscc(devirt<4>())" on module "/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL/internalized_libomptarget-amdgpu.bc"
2. Running pass "cgscc(devirt<4>(inline,function-attrs<skip-non-recursive-function-attrs>,argpromotion,openmp-opt-cgscc,function(amdgpu-promote-kernel-arguments,infer-address-spaces,amdgpu-lower-kernel-attributes,amdgpu-promote-alloca-to-vector),function<eager-inv;no-rerun>(sroa<modify-cfg>,early-cse<memssa>,speculative-execution<only-if-divergent-target>,jump-threading,correlated-propagation,simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;no-sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,instcombine<max-iterations=1;no-verify-fixpoint>,aggressive-instcombine,libcalls-shrinkwrap,amdgpu-usenative,amdgpu-simplifylib,tailcallelim,simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;no-sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,reassociate,constraint-elimination,loop-mssa(loop-instsimplify,loop-simplifycfg,licm<no-allowspeculation>,loop-rotate<header-duplication;no-prepare-for-lto>,licm<allowspeculation>,simple-loop-unswitch<nontrivial;trivial>),simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;no-sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,instcombine<max-iterations=1;no-verify-fixpoint>,loop(loop-idiom,indvars,extra-simple-loop-unswitch-passes,loop-deletion,loop-unroll-full),sroa<modify-cfg>,vector-combine,mldst-motion<no-split-footer-bb>,gvn<>,sccp,bdce,instcombine<max-iterations=1;no-verify-fixpoint>,amdgpu-usenative,amdgpu-simplifylib,jump-threading,correlated-propagation,adce,memcpyopt,dse,move-auto-init,loop-mssa(licm<allowspeculation>),coro-elide,simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;switch-range-to-icmp;no-switch-to-lookup;keep-loops;hoist-common-insts;no-hoist-loads-stores-with-cond-faulting;sink-common-insts;speculate-blocks;simplify-cond-branch;no-speculate-unpredictables>,instcombine<max-iterations=1;no-verify-fixpoint>,amdgpu-usenative,amdgpu-simplifylib),function-attrs,function(require<should-not-run-function-passes>),coro-split,coro-annotation-elide))" on module "/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/runtimes/runtimes-bins/offload/DeviceRTL/internalized_libomptarget-amdgpu.bc"
3. Running pass "instcombine<max-iterations=1;no-verify-fixpoint>" on function "ompx_shfl_down_sync_f"
#0 0x000076d2859e2910 llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/libLLVMSupport.so.21.0git+0x1e2910)
#1 0x000076d2859dfd0f llvm::sys::RunSignalHandlers() (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/libLLVMSupport.so.21.0git+0x1dfd0f)
#2 0x000076d2859dfe5a SignalHandler(int, siginfo_t*, void*) Signals.cpp:0:0
#3 0x000076d285619520 (/lib/x86_64-linux-gnu/libc.so.6+0x42520)
#4 0x000076d28566d9fc pthread_kill (/lib/x86_64-linux-gnu/libc.so.6+0x969fc)
#5 0x000076d285619476 gsignal (/lib/x86_64-linux-gnu/libc.so.6+0x42476)
#6 0x000076d2855ff7f3 abort (/lib/x86_64-linux-gnu/libc.so.6+0x287f3)
#7 0x000076d2855ff71b (/lib/x86_64-linux-gnu/libc.so.6+0x2871b)
#8 0x000076d285610e96 (/lib/x86_64-linux-gnu/libc.so.6+0x39e96)
#9 0x000076d27f83e402 getIntrinsicNameImpl(unsigned int, llvm::ArrayRef<llvm::Type*>, llvm::Module*, llvm::FunctionType*, bool) Intrinsics.cpp:0:0
#10 0x000076d27f83e6a1 llvm::Intrinsic::getOrInsertDeclaration(llvm::Module*, unsigned int, llvm::ArrayRef<llvm::Type*>) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMCore.so.21.0git+0x23e6a1)
#11 0x000076d2843a7aaa llvm::GCNTTIImpl::instCombineIntrinsic(llvm::InstCombiner&, llvm::IntrinsicInst&) const (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMAMDGPUCodeGen.so.21.0git+0x1a7aaa)
#12 0x000076d2826d8c0b llvm::InstCombinerImpl::visitCallInst(llvm::CallInst&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMInstCombine.so.21.0git+0xb4c0b)
#13 0x000076d28267bc57 llvm::InstCombinerImpl::run() (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMInstCombine.so.21.0git+0x57c57)
#14 0x000076d28267d691 combineInstructionsOverFunction(llvm::Function&, llvm::InstructionWorklist&, llvm::AAResults*, llvm::AssumptionCache&, llvm::TargetLibraryInfo&, llvm::TargetTransformInfo&, llvm::DominatorTree&, llvm::OptimizationRemarkEmitter&, llvm::BlockFrequencyInfo*, llvm::BranchProbabilityInfo*, llvm::ProfileSummaryInfo*, llvm::InstCombineOptions const&) InstructionCombining.cpp:0:0
#15 0x000076d28267ead8 llvm::InstCombinePass::run(llvm::Function&, llvm::AnalysisManager<llvm::Function>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMInstCombine.so.21.0git+0x5aad8)
#16 0x000076d282487aa6 llvm::detail::PassModel<llvm::Function, llvm::InstCombinePass, llvm::AnalysisManager<llvm::Function>>::run(llvm::Function&, llvm::AnalysisManager<llvm::Function>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMPasses.so.21.0git+0x87aa6)
#17 0x000076d27f8f998f llvm::PassManager<llvm::Function, llvm::AnalysisManager<llvm::Function>>::run(llvm::Function&, llvm::AnalysisManager<llvm::Function>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMCore.so.21.0git+0x2f998f)
#18 0x000076d2845bbd26 llvm::detail::PassModel<llvm::Function, llvm::PassManager<llvm::Function, llvm::AnalysisManager<llvm::Function>>, llvm::AnalysisManager<llvm::Function>>::run(llvm::Function&, llvm::AnalysisManager<llvm::Function>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMAMDGPUCodeGen.so.21.0git+0x3bbd26)
#19 0x000076d27fd62944 llvm::CGSCCToFunctionPassAdaptor::run(llvm::LazyCallGraph::SCC&, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>&, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMAnalysis.so.21.0git+0x162944)
#20 0x000076d2845bc876 llvm::detail::PassModel<llvm::LazyCallGraph::SCC, llvm::CGSCCToFunctionPassAdaptor, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&>::run(llvm::LazyCallGraph::SCC&, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>&, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMAMDGPUCodeGen.so.21.0git+0x3bc876)
#21 0x000076d27fd5b317 llvm::PassManager<llvm::LazyCallGraph::SCC, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&>::run(llvm::LazyCallGraph::SCC&, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>&, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMAnalysis.so.21.0git+0x15b317)
#22 0x000076d282486546 llvm::detail::PassModel<llvm::LazyCallGraph::SCC, llvm::PassManager<llvm::LazyCallGraph::SCC, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&>, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&>::run(llvm::LazyCallGraph::SCC&, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>&, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMPasses.so.21.0git+0x86546)
#23 0x000076d27fd63bfd llvm::DevirtSCCRepeatedPass::run(llvm::LazyCallGraph::SCC&, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>&, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMAnalysis.so.21.0git+0x163bfd)
#24 0x000076d282486596 llvm::detail::PassModel<llvm::LazyCallGraph::SCC, llvm::DevirtSCCRepeatedPass, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&>::run(llvm::LazyCallGraph::SCC&, llvm::AnalysisManager<llvm::LazyCallGraph::SCC, llvm::LazyCallGraph&>&, llvm::LazyCallGraph&, llvm::CGSCCUpdateResult&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMPasses.so.21.0git+0x86596)
#25 0x000076d27fd5e3ce llvm::ModuleToPostOrderCGSCCPassAdaptor::run(llvm::Module&, llvm::AnalysisManager<llvm::Module>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMAnalysis.so.21.0git+0x15e3ce)
#26 0x000076d2824864f6 llvm::detail::PassModel<llvm::Module, llvm::ModuleToPostOrderCGSCCPassAdaptor, llvm::AnalysisManager<llvm::Module>>::run(llvm::Module&, llvm::AnalysisManager<llvm::Module>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMPasses.so.21.0git+0x864f6)
#27 0x000076d27f8f88ed llvm::PassManager<llvm::Module, llvm::AnalysisManager<llvm::Module>>::run(llvm::Module&, llvm::AnalysisManager<llvm::Module>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMCore.so.21.0git+0x2f88ed)
#28 0x000076d28201505a llvm::ModuleInlinerWrapperPass::run(llvm::Module&, llvm::AnalysisManager<llvm::Module>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMipo.so.21.0git+0x21505a)
#29 0x000076d282485906 llvm::detail::PassModel<llvm::Module, llvm::ModuleInlinerWrapperPass, llvm::AnalysisManager<llvm::Module>>::run(llvm::Module&, llvm::AnalysisManager<llvm::Module>&) (/home/botworker/bbot/amdgpu-offload-ubuntu-22-cmake-build-only/build/bin/../lib/../lib/libLLVMPasses.so.21.0git+0x85906)
```
</details>
https://github.com/llvm/llvm-project/pull/129895
More information about the llvm-commits
mailing list