[llvm] [offload] Remove bad assert in StaticLoopChunker::Distribute (PR #132705)
via llvm-commits
llvm-commits at lists.llvm.org
Mon Mar 24 03:17:05 PDT 2025
https://github.com/macurtis-amd created https://github.com/llvm/llvm-project/pull/132705
When building with asserts enabled, this can actually cause strange miscompilations because an incorrect llvm.assume is generated at the point of the assertion.
>From 269ee210218631fcae1ad1b56b70c648c3cdfa74 Mon Sep 17 00:00:00 2001
From: Matthew Curtis <macurtis at amd.com>
Date: Fri, 21 Mar 2025 09:04:52 -0500
Subject: [PATCH] [offload] Remove bad assert in StaticLoopChunker::Distribute
When building with asserts enabled, this can actually cause strange
miscompilations because an incorrect llvm.assume is generated at the point of
the assertion.
---
offload/DeviceRTL/src/Workshare.cpp | 1 -
.../fortran/target-teams-dist-nest-par.f90 | 26 +++++++++++++++++++
2 files changed, 26 insertions(+), 1 deletion(-)
create mode 100644 offload/test/offloading/fortran/target-teams-dist-nest-par.f90
diff --git a/offload/DeviceRTL/src/Workshare.cpp b/offload/DeviceRTL/src/Workshare.cpp
index de4ed2e2102a6..861b9ca371ccd 100644
--- a/offload/DeviceRTL/src/Workshare.cpp
+++ b/offload/DeviceRTL/src/Workshare.cpp
@@ -820,7 +820,6 @@ template <typename Ty> class StaticLoopChunker {
Ty ThreadChunk = 0;
Ty NumThreads = 1;
Ty TId = 0;
- ASSERT(TId == mapping::getThreadIdInBlock(), "Bad thread id");
// All teams need to participate.
Ty NumBlocks = mapping::getNumberOfBlocksInKernel();
diff --git a/offload/test/offloading/fortran/target-teams-dist-nest-par.f90 b/offload/test/offloading/fortran/target-teams-dist-nest-par.f90
new file mode 100644
index 0000000000000..dfde1b98f3c86
--- /dev/null
+++ b/offload/test/offloading/fortran/target-teams-dist-nest-par.f90
@@ -0,0 +1,26 @@
+! REQUIRES: flang, amdgpu
+
+! RUN: %libomptarget-compile-fortran-generic
+! RUN: %libomptarget-run-generic 2>&1 | %fcheck-generic
+program main
+ integer :: array(10) = 0
+ integer :: x, y, z
+ !$omp target
+ !$omp teams distribute private(x, y)
+ OuterLoopOne: do x=1,1
+ array(2) = 42
+ OuterLoopTwo: do y=1,1
+ !$omp parallel do private(z)
+ InnerLoopOne: do z=1,10
+ array(z) = 20
+ enddo InnerLoopOne
+ !$omp end parallel do
+ enddo OuterLoopTwo
+ enddo OuterLoopOne
+ !$omp end teams distribute
+ !$omp end target
+ ! Expected to print all 20's
+ print *, array
+end program main
+
+! CHECK: 20 20 20 20 20 20 20 20 20 20
More information about the llvm-commits
mailing list