[PATCH] D52434: [OpenMP] Make default distribute schedule for NVPTX target regions in SPMD mode achieve coalescing
Jonas Hahnfeld via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Fri Sep 28 05:34:04 PDT 2018
Hahnfeld added a comment.
In https://reviews.llvm.org/D52434#1248974, @gtbercea wrote:
> One big problem your code has is that the trip count is incredibly small, especially for STREAM and especially on GPUs. You need a much larger loop size otherwise the timings will be dominated by OpenMP setups costs.
Sure, I'm not that dump. The real code has larger loops, this was just for demonstration purposes. I don't expect the register count to change based on loop size - is that too optimistic?
Repository:
rC Clang
https://reviews.llvm.org/D52434
More information about the cfe-commits
mailing list