[PATCH] D45173: [InstCombine] Recognize idioms for ctpop and ctlz
Craig Topper via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Apr 3 14:25:11 PDT 2018
craig.topper added a comment.
I think we need to evaluate what popcount sequences we want to handle. The code you're handling isn't the most optimal version
For example compiler-rt uses this
su_int x = (su_int)a;
x = x - ((x >> 1) & 0x55555555);
/* Every 2 bits holds the sum of every pair of bits */
x = ((x >> 2) & 0x33333333) + (x & 0x33333333);
/* Every 4 bits holds the sum of every 4-set of bits (3 significant bits) */
x = (x + (x >> 4)) & 0x0F0F0F0F;
/* Every 8 bits holds the sum of every 8-set of bits (4 significant bits) */
x = (x + (x >> 16));
/* The lower 16 bits hold two 8 bit sums (5 significant bits).*/
/* Upper 16 bits are garbage */
return (x + (x >> 8)) & 0x0000003F; /* (6 significant bits) */
Then there is another form here that uses a multiply in the last step.
https://graphics.stanford.edu/~seander/bithacks.html#CountBitsSetParallel
Repository:
rL LLVM
https://reviews.llvm.org/D45173
More information about the llvm-commits
mailing list