[PATCH] D45173: [InstCombine] Recognize idioms for ctpop and ctlz

Craig Topper via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Apr 3 14:25:11 PDT 2018


craig.topper added a comment.

I think we need to evaluate what popcount sequences we want to handle. The code you're handling isn't the most optimal version

For example compiler-rt uses this

  su_int x = (su_int)a;
  x = x - ((x >> 1) & 0x55555555);
  /* Every 2 bits holds the sum of every pair of bits */
  x = ((x >> 2) & 0x33333333) + (x & 0x33333333);
  /* Every 4 bits holds the sum of every 4-set of bits (3 significant bits) */
  x = (x + (x >> 4)) & 0x0F0F0F0F;
  /* Every 8 bits holds the sum of every 8-set of bits (4 significant bits) */
  x = (x + (x >> 16));
  /* The lower 16 bits hold two 8 bit sums (5 significant bits).*/
  /*    Upper 16 bits are garbage */
  return (x + (x >> 8)) & 0x0000003F;  /* (6 significant bits) */

Then there is another form here that uses a multiply in the last step.

https://graphics.stanford.edu/~seander/bithacks.html#CountBitsSetParallel


Repository:
  rL LLVM

https://reviews.llvm.org/D45173





More information about the llvm-commits mailing list