[PATCH] D133463: [OpenMP] Device Time Profile
Johannes Doerfert via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Dec 7 15:04:01 PST 2022
jdoerfert added a comment.
I left some initial comments, I think this is not in a "working" state, it looks like parts are missing, beyond the LLVM changes.
================
Comment at: openmp/libomptarget/DeviceRTL/src/Kernel.cpp:25
+// CLOCKINIT();
+extern DeviceTimeProfileTy omptarget_device_time_profile;
----------------
Leftover
================
Comment at: openmp/libomptarget/DeviceRTL/src/Kernel.cpp:45
+ return OMP_ARRAY_SIZE;
+ }
+}
----------------
Turn this around.
```
if (Idx >= OMP_ARRAY_SIZE)
return OMP_ARRAY_SIZE
...
```
Where does OMP_ARRAY_SIZE and OMP_THREADS come from? Shouldn't those be read from the struct the runtime initializes?
================
Comment at: openmp/libomptarget/DeviceRTL/src/Kernel.cpp:50
+ if (Idx < OMP_ARRAY_SIZE) {
+#pragma omp parallel
+ {
----------------
Don't use parallel here. This is the device runtime.
Depending on where this is called, all threads will execute it anyway.
Just use `mapping::getThreadIdInBlock()` to get the thread id and `mapping::getBlockId()` to get the block id.
Same above.
================
Comment at: openmp/libomptarget/DeviceRTL/src/Kernel.cpp:120
+ CLOCKIN();
if (IsSPMD) {
----------------
What is this?
================
Comment at: openmp/libomptarget/DeviceRTL/src/Kernel.cpp:170
FunctionTracingRAII();
+ CLOCKOUT();
const bool IsSPMD = Mode & OMP_TGT_EXEC_MODE_SPMD;
----------------
And this?
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D133463/new/
https://reviews.llvm.org/D133463
More information about the llvm-commits
mailing list