[PATCH] D52434: [OpenMP] Make default distribute schedule for NVPTX target regions in SPMD mode achieve coalescing
Jonas Hahnfeld via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Fri Sep 28 08:33:50 PDT 2018
Hahnfeld added a comment.
In https://reviews.llvm.org/D52434#1249102, @gtbercea wrote:
> You report a slow down which I am not able to reproduce actually. Do you use any additional clauses not present in your previous post?
No, only `dist_schedule(static)` which is faster. Tested on a `Tesla P100` with today's trunk version:
| `#pragma omp target teams distribute parallel for` (new defaults) | 190 - 250 GB/s |
| adding clauses for old defaults: `schedule(static) dist_schedule(static)` | 30 - 50 GB/s |
| same directive with only `dist_schedule(static)` added (fewer registers) | 320 - 400 GB/s |
Repository:
rC Clang
https://reviews.llvm.org/D52434
More information about the cfe-commits
mailing list