[PATCH] D20965: [X86][SSE] Add general lowering of nontemporal vector loads
Simon Pilgrim via llvm-commits
llvm-commits at lists.llvm.org
Fri Jun 3 08:29:39 PDT 2016
RKSimon added a comment.
In http://reviews.llvm.org/D20965#448279, @craig.topper wrote:
> let me rephrase that question. If we replace the intrinsic with generic IR, would we end up combining those loads into instructions during isel?
At present yes, if we replace the intrinsics then cases that currently don't fold would then start folding - that's why I added those tests to nontemporal-loads.ll to make sure we fix that before moving to generic ir.
We have very few cases of folding stores but the ones I've tested (_mm_extract_epi* and __mm256_cvtps_ph) don't seem to fold non-temporal stores so its certainly possible (whether its intended or not is another question.......).
Repository:
rL LLVM
http://reviews.llvm.org/D20965
More information about the llvm-commits
mailing list