[LLVMdev] [cfe-dev] [Proposal] function attribute to reduce emission of vzeroupper instructions
Aaron Ballman
aaron at aaronballman.com
Tue Dec 24 07:02:13 PST 2013
On Tue, Dec 24, 2013 at 7:50 AM, Rafael EspĂndola
<rafael.espindola at gmail.com> wrote:
>> In general, I'm not too keen on adding more calling conventions unless
>> there's a really powerful need for one from an ABI perspective. This
>> sounds more like an optimization than an ABI need.
>
> I think that is the case.
>
>> What's more, I
>> worry (a little bit) about confusion that could be caused with the
>> __vectorcall calling convention (which we do not currently support,
>> but will need to at some point for MSVC compatibility).
>
> What does the __vectorcall does?
http://msdn.microsoft.com/en-us/library/dn375768.aspx
It's different than the proposed attribute, but still relates to SIMD
instruction optimizations.
>
>> What should happen with this code?
>>
>> int foo() __attribute__((avx));
>>
>> void bar(int (*fp)()) {
>> int i = fp();
>> }
>>
>> void baz(void) {
>> bar(foo);
>> }
>>
>> Based on your description, this code is valid, but not as performant
>> as it could be. The vzeroupper would be inserted before fp() is
>> called, but there's no incompatibility happening. So I guess this
>> feels more like a regular function attribute than a calling
>> convention.
>
> It is not a calling convention. The issue is more if it is a type or a
> decl attribute. Given that putting the attributes on the function
> decls is the simplest and should cover most of the cases, I think we
> can probably start with that and revisit if we still see too many
> vzeroupper being inserted. What do you think?
That seems reasonable to me.
>
>>>
>>> Function definitions in a translation unit compiled with -mavx architecture
>>> will
>>>
>>> implicitly have this attribute.
>>
>> Can you safely do that? What about code that does uses inline assembly
>> to use legacy SSE instructions in a TU compiled with -mavx, for
>> instance?
>
> I think it would take a performance penalty, but I don't expect that
> to be common.
Hmm, I was worried about the situation where:
extern int foo(); // compiled without -mavx
void bar() { // compiled in a TU with -mavx
...
// no vzeroupper is inserted before the call instruction because it is
// implicit due to -mavx
foo();
...
}
I'm not certain whether this sort of pattern could cause problems or
not. If there's no way for it to be problematic, then implicitly
attaching the attribute is reasonable enough. It does mean we're
straying farther from the as-written attributes for the function, but
that's just an unfortunate situation we're already in today and
wouldn't block this feature.
~Aaron
More information about the llvm-dev
mailing list