[PATCH] D84767: [OPENMP]Fix PR46824: Global declare target pointer cannot be accessed in target region.
Alexey Bataev via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Wed Jul 29 05:41:22 PDT 2020
ABataev added a comment.
In D84767#2180433 <https://reviews.llvm.org/D84767#2180433>, @ye-luo wrote:
> In D84767#2180280 <https://reviews.llvm.org/D84767#2180280>, @ye-luo wrote:
>
>> This patch
>> GPU activities: 96.99% 350.05ms 10 35.005ms 1.5680us 350.00ms [CUDA memcpy HtoD]
>> before the July21 change
>> GPU activities: 95.33% 20.317ms 4 5.0793ms 1.6000us 20.305ms [CUDA memcpy HtoD]
>> Still more transfer than it should.
>
> @ABataev could you have a look? My July 21 compiler was built before "[OPENMP]Fix PR46012: declare target pointer cannot be accessed in target region." gets in.
Are you talking about the `number of calls` value? The total number of calls will increase after the patch anyway, PTR_AND_OBJ adds 1 extra mem transfer for transferring translated pointer address.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D84767/new/
https://reviews.llvm.org/D84767
More information about the cfe-commits
mailing list