<html><head><meta http-equiv="Content-Type" content="text/html charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><br class=""><div><blockquote type="cite" class=""><div class="">On Jul 31, 2015, at 10:49 AM, Steve King <<a href="mailto:steve@metrokings.com" class="">steve@metrokings.com</a>> wrote:</div><br class="Apple-interchange-newline"><div class="">Hello LLVM,<br class="">If there are any, can someone please explain rules of thumb for when<br class="">to do a PerformDAGCombine operation in ISelLowering vs. when to do a<br class="">DAG to DAG transformation?<br class=""><br class="">I'm specifically thinking of an AND + SRL merge into a bit field<br class="">extract type instruction. I see that the ARM target does this in<br class="">DAG-to-DAG, but this is literally a combine of two instructions,so why<br class="">not DAGCombine?<br class=""><br class="">Thanks,<br class="">-steve<br class="">_______________________________________________<br class="">LLVM Developers mailing list<br class=""><a href="mailto:LLVMdev@cs.uiuc.edu" class="">LLVMdev@cs.uiuc.edu</a> <a href="http://llvm.cs.uiuc.edu" class="">http://llvm.cs.uiuc.edu</a><br class=""><a href="http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev" class="">http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev</a><br class=""></div></blockquote><br class=""></div><div><br class=""></div><div class="">It’s better to do something as a DAG combine if doing so is more likely to expose other DAG combine opportunities. For example, in AMDGPU we have to do a unit conversion to the hardware sin/cos instructions. We could implement this as an instruction pattern for the sin node with the fmul by a constant conversions there. However if the unit conversion is done during custom lowering and you have another fmul input to the sin node, with fast math the 2 fmuls by a constant will be folded into one.</div><div class=""><br class=""></div><div class="">In the case of bitextract instructions it’s kind of harder to analyze them after they are created, so might seem less worth it to have those done in the DAG. However, we currently have at least 3 in-tree targets all trying to match some variant of pretty much the same instruction, so I would like it if we had a generic bit extract node and moved all of the pattern matching to DAG combiner.</div><div class=""><br class=""></div><div class="">-Matt</div></body></html>