<br><br>On Friday, February 1, 2019, Simon Moll <<a href="mailto:moll@cs.uni-saarland.de">moll@cs.uni-saarland.de</a>> wrote:<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div text="#000000" bgcolor="#FFFFFF"><blockquote type="cite"><div dir="auto">

      </div>

    </blockquote>

    <p>We could untie the mask length from the data length:</p>

    <p>  %result = call <scalable 4 x float>

      @llvm.evl.fsub.v4f32(<scalable 4 x float> %x, <scalable 4

      x float> %y, <scalable 1 x i1> %M, i32 %L)</p>

    <p>would then indicate the mask %M applies to groups of "4 / 1"

      float elements.</p></div></blockquote><div><br></div><div>That would provide the greatest flexibility, as a 1:1 ratio could mean 1 bit per element, covering the normal case.</div><div><br></div><div>Question: are there any circumstances under which it is desirable to underspecify or overspecify the number of bits in the predicate?</div><div><br></div><div>ie to deliberately have a FP vector of length 11 and a mask of length 9 or 13?</div><div><br></div><div>Or, is that just a runtime error.</div><div><br></div><div>L.</div><br><br>-- <br>---<br>crowd-funded eco-conscious hardware: <a href="https://www.crowdsupply.com/eoma68" target="_blank">https://www.crowdsupply.com/eoma68</a><br><br>