[PATCH] D16941: [NVPTX] Mark nvvm synchronizing intrinsics as convergent.

Justin Lebar via llvm-commits llvm-commits at lists.llvm.org
Fri Feb 5 18:22:46 PST 2016


jlebar added a comment.

Thank you for the quick review, jingyue.  I don't have an example of code that is miscompiled without this; I just went looking for optimizations that check convergent and not noduplicate.  I found LoopUnswitch, Sink, and InstructionCombining.  I think InstructionCombining is not relevant, because it appears that the only calls it's interested in are free()s.  LoopUnswitch seems OK because it checks Metrics.notDuplicatable, which checks for NoDuplicate calls.  But I don't immediately see how Sink is safe.  (This was the example majnemer	came up with off the top of his head when we talked.)  I also see something similar in MachineSink, although I'm not as sure that's relevant.

In http://reviews.llvm.org/D16941#345516, @jingyue wrote:

> According to PTX ISA, `bar.sync` should only be executed uniformly, so inlining won't introduce new divergence.


I don't know what this means; can you elaborate?


http://reviews.llvm.org/D16941





More information about the llvm-commits mailing list