<html>
<head>
<base href="https://bugs.llvm.org/">
</head>
<body><span class="vcard"><a class="email" href="mailto:llvm-dev@redking.me.uk" title="Simon Pilgrim <llvm-dev@redking.me.uk>"> <span class="fn">Simon Pilgrim</span></a>
</span> changed
<a class="bz_bug_link
bz_status_RESOLVED bz_closed"
title="RESOLVED FIXED - [AVX512F] Suboptimal code generated for a _mm_mask_store_sd expansion"
href="https://bugs.llvm.org/show_bug.cgi?id=30563">bug 30563</a>
<br>
<table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>What</th>
<th>Removed</th>
<th>Added</th>
</tr>
<tr>
<td style="text-align:right;">Resolution</td>
<td>---
</td>
<td>FIXED
</td>
</tr>
<tr>
<td style="text-align:right;">Status</td>
<td>NEW
</td>
<td>RESOLVED
</td>
</tr>
<tr>
<td style="text-align:right;">Fixed By Commit(s)</td>
<td>
</td>
<td>286229, 331958
</td>
</tr></table>
<p>
<div>
<b><a class="bz_bug_link
bz_status_RESOLVED bz_closed"
title="RESOLVED FIXED - [AVX512F] Suboptimal code generated for a _mm_mask_store_sd expansion"
href="https://bugs.llvm.org/show_bug.cgi?id=30563#c2">Comment # 2</a>
on <a class="bz_bug_link
bz_status_RESOLVED bz_closed"
title="RESOLVED FIXED - [AVX512F] Suboptimal code generated for a _mm_mask_store_sd expansion"
href="https://bugs.llvm.org/show_bug.cgi?id=30563">bug 30563</a>
from <span class="vcard"><a class="email" href="mailto:llvm-dev@redking.me.uk" title="Simon Pilgrim <llvm-dev@redking.me.uk>"> <span class="fn">Simon Pilgrim</span></a>
</span></b>
<pre>Resolving, actual support for these intrinsics was added to the headers at
rL286229, and tweaked at rL331958 to give the optimal codegen requested in the
bug.
The IR we use is:
define @test_mm_mask_store_sd(double*, i8 zeroext, <2 x double>) {
%4 = bitcast double* %0 to <2 x double>*
%5 = and i8 %1, 1
%6 = bitcast i8 %5 to <8 x i1>
%7 = shufflevector <8 x i1> %6, <8 x i1> undef, <2 x i32> <i32 0, i32 1>
tail call void @llvm.masked.store.v2f64.p0v2f64(<2 x double> %2, <2 x
double>* %4, i32 1, <2 x i1> %7)
ret void
}
declare void @llvm.masked.store.v2f64.p0v2f64(<2 x double>, <2 x double>*, i32,
<2 x i1>)
@Zvi - please reopen this if you think we have a need to optimise your
alternative IR any further than [<a href="show_bug.cgi?id=30563#c1">Comment #1</a>].</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are on the CC list for the bug.</li>
</ul>
</body>
</html>