<html><head><meta http-equiv="Content-Type" content="text/html charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><br class=""><div><blockquote type="cite" class=""><div class="">On Dec 23, 2014, at 3:52 PM, Chandler Carruth <<a href="mailto:chandlerc@google.com" class="">chandlerc@google.com</a>> wrote:</div><br class="Apple-interchange-newline"><div class=""><div class="gmail_extra" style="font-family: Helvetica; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px;">So, what I was trying to point out is that it isn't clear we need to have the ability to teach the middle end optimizer about the second set above. For example, the only place where I can find us dealing with intrinsics from Hexagon, r600/AMDGPU, or NVPTX in the middle end is for AMDGPU_rcp which has an instcombine. While r600/AMDGPU doesn't really have a lot of intrinsics anyways, NVPTX seems to have many of the kinds of intrinsics that would be directly relevant to GPU shaders... But maybe there is something about how people are using NVPTX that makes this a bad example?</div><div class="gmail_extra" style="font-family: Helvetica; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px;"><br class=""></div><div class="gmail_extra" style="font-family: Helvetica; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px;">The largest contribution of target-specific intrinsics in the middle end optimizer is actually x86, and I'm pretty confident that we can and probably should remove most of that code. The operations we optimize there don't actually seem special at all, I suspect this is more a consequence of the historical needs than anything else. (I mean, I'm pretty sure I added some of those combines for x86!)</div></div></blockquote><br class=""></div><div>I’m not sure I agree that the ability to optimize these intrinsics is unnecessary, but I definitely believe there’s a lot of room to improve the separation of concerns and a lot of code that knows about target-specific intrinsics into the targets themselves. That said, this is still a lot of work and may well have compile time performance implications/</div><div><br class=""></div><div>—Owen</div></body></html>