[llvm-commits] PATCH: Enable direct selection of bsf and bsr instructions for cttz and ctlz with zero-undef behavior
Erik Olofsson
Erik.Olofsson at hansoft.se
Tue Dec 13 09:56:49 PST 2011
Just a note, BSR/BSF might have some performance implications depending on which CPU is running the code. See http://www.realworldtech.com/beta/forums/index.cfm?action=detail&id=82507&threadid=82344&roomid=2
It would probably be safest to just use lzcnt/tzcnt when available unless you know which CPU you are optimizing for.
Regards,
Erik
On 2011-12-13, at 13:31 , Chandler Carruth wrote:
This just enables the necessary patterns for x86. I really know nothing about the tablegen-backed patterns, so please review and be gentle. =]
There is still quite a bit of work to be done on the zero-undef front. We need to teach the entire middle end about this behavior and have it transform both to and from the zero-undef variants based on conditions and known bits of the operand. I've updated the test to demonstrate that no more cmov instructions are generated for the common case, but we still generate them when required by the instruction semantics unless there are trivially bits set in the operand.
This also doesn't teach the x86 backend to prefer bsf and bsr when they are sufficient even though lzcnt and tzcnt are available. That behavior would be beneficial as bsf and bsr encode more compactly. If someone can shed light on how to do this, that would really help. I'm completely out of my depth in the actual target pattern magic of TableGen. =]
_______________________________________________
llvm-commits mailing list
llvm-commits at cs.uiuc.edu<mailto:llvm-commits at cs.uiuc.edu>
http://lists.cs.uiuc.edu/mailman/listinfo/llvm-commits
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20111213/656747ce/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: fix-cttz-x86-patterns.patch
Type: text/x-patch
Size: 4336 bytes
Desc: fix-cttz-x86-patterns.patch
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20111213/656747ce/attachment.bin>
More information about the llvm-commits
mailing list