[PATCH] D20340: AMDGPU: Other sizes of popcnt are fast

Matt Arsenault via llvm-commits llvm-commits at lists.llvm.org
Tue May 17 15:10:53 PDT 2016


arsenm created this revision.
arsenm added a reviewer: tstellarAMD.
arsenm added a subscriber: llvm-commits.
Herald added subscribers: kzhuravl, arsenm.

We can chain bcnt instructions together, so any width popcnt is pretty fast.

http://reviews.llvm.org/D20340

Files:
  lib/Target/AMDGPU/AMDGPUTargetTransformInfo.h
  test/CodeGen/AMDGPU/ctpop64.ll
  test/Transforms/LoopIdiom/AMDGPU/popcnt.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D20340.57529.patch
Type: text/x-patch
Size: 4577 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20160517/d892396e/attachment.bin>


More information about the llvm-commits mailing list