[PATCH] D49761: [AMDGPU] Use AssumptionCacheTracker in the divrem32 expansion

Stanislav Mekhanoshin via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Jul 24 15:00:48 PDT 2018


rampitec created this revision.
rampitec added reviewers: artem.tamazov, arsenm.
Herald added subscribers: t-tye, tpr, dstuttard, yaxunl, nhaehnle, wdng, kzhuravl.

This allows to use a short div/rem expansion given a kernel with __builtin_assume() call:

  kernel void divrem24_uint(global int *a, uint x) {
    uint id = get_local_id(0);
    __builtin_assume(x < 42);
    a[id / x] = 0;
  }


https://reviews.llvm.org/D49761

Files:
  lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
  test/CodeGen/AMDGPU/divrem24-assume.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D49761.157137.patch
Type: text/x-patch
Size: 7485 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20180724/34800bd6/attachment.bin>


More information about the llvm-commits mailing list