[PATCH] D30209: AMDGPU: Fold omod into instructions

Marek Olšák via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Feb 24 01:42:13 PST 2017


mareko added inline comments.


================
Comment at: lib/Target/AMDGPU/AMDGPUCallingConv.td:38
   // 32*4 + 4 is the minimum for a fetch shader consumer with 32 inputs.
-  CCIfNotInReg<CCIfType<[f32, i32] , CCAssignToReg<[
+  CCIfNotInReg<CCIfType<[f32, i32, f16] , CCAssignToReg<[
     VGPR0, VGPR1, VGPR2, VGPR3, VGPR4, VGPR5, VGPR6, VGPR7,
----------------
arsenm wrote:
> mareko wrote:
> > Why are the calling conventions being changed?
> I needed a way to get an f16 input into a graphics shader. This would just assert on unhandled value type before. I can commit this separately
OK. I guess you can keep it here.


https://reviews.llvm.org/D30209





More information about the llvm-commits mailing list