[PATCH] D52434: [OpenMP] Make default distribute schedule for NVPTX target regions in SPMD mode achieve coalescing

Gheorghe-Teodor Bercea via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Fri Sep 28 05:40:30 PDT 2018


gtbercea added a comment.

In https://reviews.llvm.org/D52434#1248975, @Hahnfeld wrote:

> In https://reviews.llvm.org/D52434#1248974, @gtbercea wrote:
>
> > One big problem your code has is that the trip count is incredibly small, especially for STREAM and especially on GPUs. You need a much larger loop size, otherwise the timings will be dominated by OpenMP setup costs.
>
>
> Sure, I'm not that dumb. The real code has larger loops; this was just for demonstration purposes. I don't expect the register count to change based on loop size - is that too optimistic?


The register count will of course not change with loop size.


Repository:
  rC Clang

https://reviews.llvm.org/D52434