erikjv added a comment. GCC doesn't define this, but icc does. Just like e.g. __AVX512CD__, it can be used to conditionally enable code that uses the instruction as a fast implementation for an algorithm. http://reviews.llvm.org/D11752