[PATCH] D47782: AMDGPU: Custom lower v2f16 fneg/fabs with illegal f16
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Jun 5 10:27:58 PDT 2018
arsenm added inline comments.
================
Comment at: test/CodeGen/AMDGPU/fabs.f16.ll:107
; CI: v_cvt_f32_f16_e32
-; CI: v_mul_f32_e64 v{{[0-9]+}}, |v{{[0-9]+}}|, v{{[0-9]+}}
+; CI: v_mul_f32_e32 v{{[0-9]+}}, v{{[0-9]+}}, v{{[0-9]+}}
; CI: v_cvt_f16_f32
----------------
rampitec wrote:
> Are there a lot of regressions like this?
> Any plans to address these?
I don't think so. I only really noticed this one because of the kind of perverse case of using the original and negated value resulting in doubling the number of conversions. We probably have to repeat the performFNeg/Fabs combines for the bit patterns used for lowered f16
https://reviews.llvm.org/D47782
More information about the llvm-commits
mailing list