[llvm-commits] [llvm] r160117 - in /llvm/trunk: include/llvm/IntrinsicsX86.td lib/Target/X86/X86ISelLowering.cpp lib/Target/X86/X86ISelLowering.h lib/Target/X86/X86InstrInfo.td test/CodeGen/X86/rdrand.ll
Benjamin Kramer
benny.kra at gmail.com
Thu Jul 12 11:17:27 PDT 2012
On 12.07.2012, at 19:35, Jakob Stoklund Olesen <stoklund at 2pi.dk> wrote:
>
> On Jul 12, 2012, at 10:26 AM, Benjamin Kramer <benny.kra at gmail.com> wrote:
>
>>
>> On 12.07.2012, at 18:53, Jakob Stoklund Olesen <stoklund at 2pi.dk> wrote:
>>
>>>
>>> On Jul 12, 2012, at 2:31 AM, Benjamin Kramer <benny.kra at googlemail.com> wrote:
>>>
>>>> Author: d0k
>>>> Date: Thu Jul 12 04:31:43 2012
>>>> New Revision: 160117
>>>>
>>>> URL: http://llvm.org/viewvc/llvm-project?rev=160117&view=rev
>>>> Log:
>>>> Add intrinsics for Ivy Bridge's rdrand instruction.
>>>>
>>>> The rdrand/cmov sequence is the same that is emitted by both
>>>> GCC and ICC.
>>>
>>> Hi Ben,
>>>
>>> You should make sure we don't CSE these instructions:
>>>
>>> define i32 @f() nounwind {
>>> entry:
>>> %0 = tail call { i32, i32 } @llvm.x86.rdrand.32() nounwind
>>> %1 = extractvalue { i32, i32 } %0, 0
>>> %2 = tail call { i32, i32 } @llvm.x86.rdrand.32() nounwind
>>> %3 = extractvalue { i32, i32 } %2, 0
>>> %add = add i32 %3, %1
>>> ret i32 %add
>>> }
>>>
>>> $ llc -mattr=+rdrand < rdrand.ll
>>> f: # @f
>>> # BB#0: # %entry
>>> rdrandl %eax
>>> addl %eax, %eax
>>> ret
>>
>> Hmm, I thought I hit all knobs to make sure all CSE passes know to avoid rdrand. What is the right way to teach MachineCSE the same thing? SDNPSideEffect? Should the RDRAND node have a chain?
>
> The MachineLICM and MachineCSE passes will leave instructions alone if MCID::UnmodeledSideEffects is set. Set hasSideEffects = 1 in the instruction description.
>
> But I think you should give the RDRAND node a chain as well. We really don't want to be clever about reordering these intrinsics since their primary purpose is security software.
The instruction now has side effects and a chain with r160137.
The amusing part is that gcc 4.7 has the same bug so I carefully tested it, but apparently didn't have the right test case to trigger MachineLICM.
- Ben
More information about the llvm-commits
mailing list