[PATCH] D52434: [OpenMP] Make default distribute schedule for NVPTX target regions in SPMD mode achieve coalescing

Jonas Hahnfeld via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Fri Sep 28 05:34:04 PDT 2018


Hahnfeld added a comment.

In https://reviews.llvm.org/D52434#1248974, @gtbercea wrote:

> One big problem your code has is that the trip count is incredibly small, especially for STREAM and especially on GPUs. You need a much larger loop size otherwise the timings will be dominated by OpenMP setups costs.


Sure, I'm not that dump. The real code has larger loops, this was just for demonstration purposes. I don't expect the register count to change based on loop size - is that too optimistic?


Repository:
  rC Clang

https://reviews.llvm.org/D52434





More information about the cfe-commits mailing list