[all-commits] [llvm/llvm-project] ab01f4: AMDGPU/GlobalISel: Do not fcanonicalize const spla...
petar-avramovic via All-commits
all-commits at lists.llvm.org
Fri Dec 3 04:02:52 PST 2021
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: ab01f4d26400f13f71a57b15f3f6c336c24606e8
https://github.com/llvm/llvm-project/commit/ab01f4d26400f13f71a57b15f3f6c336c24606e8
Author: Petar Avramovic <Petar.Avramovic at amd.com>
Date: 2021-12-03 (Fri, 03 Dec 2021)
Changed paths:
M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
M llvm/test/CodeGen/AMDGPU/GlobalISel/combine-fcanonicalize.mir
Log Message:
-----------
AMDGPU/GlobalISel: Do not fcanonicalize const splat padded with undef
Recognize a constant splat padded with undef in isCanonicalized.
The fcanonicalize will then be removed by RemoveFcanonicalize in the
post-legalizer combiner. The clamp combine that runs after regbankselect
will treat undef as a value that results in a splat.
Differential Revision: https://reviews.llvm.org/D104408
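As a rough illustration of what "constant splat padded with undef" means, the sketch below models a vector whose lanes are either a constant or undef and checks whether all defined lanes hold the same value. This is not the code from SIISelLowering.cpp: the Element type and the isSplatPaddedWithUndef helper are hypothetical names, and the requirement that at least one lane be defined is an assumption of this sketch.

```cpp
#include <cassert>
#include <vector>

// Hypothetical model of one vector lane: either undef or a float constant.
struct Element {
  bool IsUndef;
  float Value; // ignored when IsUndef is true
};

// Returns true if every defined lane holds the same constant and at least
// one lane is defined (an assumption of this sketch). On success, Splat
// receives the common constant.
bool isSplatPaddedWithUndef(const std::vector<Element> &Elts, float &Splat) {
  bool Found = false;
  for (const Element &E : Elts) {
    if (E.IsUndef)
      continue; // undef lanes are treated as padding
    if (!Found) {
      Splat = E.Value;
      Found = true;
    } else if (E.Value != Splat) {
      return false; // two different constants: not a splat
    }
  }
  return Found;
}
```

Under this model, a vector such as <1.0, undef> counts as a splat of 1.0, while <1.0, 2.0> does not.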
Commit: ec54867d750bd6c095697d089d9ad5a7440974a8
https://github.com/llvm/llvm-project/commit/ec54867d750bd6c095697d089d9ad5a7440974a8
Author: Petar Avramovic <Petar.Avramovic at amd.com>
Date: 2021-12-03 (Fri, 03 Dec 2021)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPUCombine.td
M llvm/lib/Target/AMDGPU/AMDGPUGISel.td
M llvm/lib/Target/AMDGPU/AMDGPURegBankCombiner.cpp
M llvm/lib/Target/AMDGPU/SIInstructions.td
A llvm/test/CodeGen/AMDGPU/GlobalISel/fmed3-min-max-const-combine.ll
A llvm/test/CodeGen/AMDGPU/GlobalISel/regbankcombiner-fmed3-minmax-const.mir
Log Message:
-----------
AMDGPU/GlobalISel: Add floating point med3 combine
Add a floating-point version of the med3 combine.
The source pattern is fminnum(fmaxnum(Val, K0), K1) or
fmaxnum(fminnum(Val, K1), K0), where K0 and K1 are constants and K0 <= K1.
Differential Revision: https://reviews.llvm.org/D90051
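The arithmetic behind the combine can be checked with a small scalar sketch: when K0 <= K1, both min/max forms compute the median of Val, K0 and K1. The helper names below (med3, minMaxForm, maxMinForm, checkMed3Equivalence) are illustrative only and are not taken from the patch; NaN inputs and the actual MIR matching are not modeled.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>

// Median of three floats (illustrative helper; NaN inputs are not modeled).
float med3(float A, float B, float C) {
  return std::max(std::min(A, B), std::min(std::max(A, B), C));
}

// The two source patterns named in the log message, on plain scalars.
float minMaxForm(float Val, float K0, float K1) {
  return std::fmin(std::fmax(Val, K0), K1);
}
float maxMinForm(float Val, float K0, float K1) {
  return std::fmax(std::fmin(Val, K1), K0);
}

// With K0 <= K1, both patterns agree with the median for sample inputs
// below, inside, and above the [K0, K1] range.
void checkMed3Equivalence() {
  const float K0 = 1.0f, K1 = 4.0f;
  const float Samples[] = {-5.0f, 1.0f, 2.0f, 4.0f, 7.0f};
  for (float Val : Samples) {
    assert(minMaxForm(Val, K0, K1) == med3(Val, K0, K1));
    assert(maxMinForm(Val, K0, K1) == med3(Val, K0, K1));
  }
}
```

The K0 <= K1 condition matters: with K0 = 4.0 and K1 = 1.0, an input of 2.0 gives 1.0 from the first form and 4.0 from the second, while the median of the three values is 2.0.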
Commit: 0b34ffe4a61eab708b594f23a7781c90ec22423d
https://github.com/llvm/llvm-project/commit/0b34ffe4a61eab708b594f23a7781c90ec22423d
Author: Petar Avramovic <Petar.Avramovic at amd.com>
Date: 2021-12-03 (Fri, 03 Dec 2021)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPUCombine.td
M llvm/lib/Target/AMDGPU/AMDGPUGISel.td
M llvm/lib/Target/AMDGPU/AMDGPURegBankCombiner.cpp
M llvm/lib/Target/AMDGPU/SIInstructions.td
A llvm/test/CodeGen/AMDGPU/GlobalISel/clamp-fmed3-const-combine.ll
A llvm/test/CodeGen/AMDGPU/GlobalISel/clamp-minmax-const-combine.ll
A llvm/test/CodeGen/AMDGPU/GlobalISel/regbankcombiner-clamp-fmed3-const.mir
A llvm/test/CodeGen/AMDGPU/GlobalISel/regbankcombiner-clamp-minmax-const.mir
Log Message:
-----------
AMDGPU/GlobalISel: Add clamp combine
Add the clamp combine. The source pattern is
fminnum(fmaxnum(Val, 0.0), 1.0) or fmaxnum(fminnum(Val, 1.0), 0.0), or
an fmed3 intrinsic with 0.0 and 1.0 as two of its three operands.
Differential Revision: https://reviews.llvm.org/D90052
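A scalar sketch of why these patterns are interchangeable: each one clamps Val to the range [0.0, 1.0], and because the median is symmetric, the position of Val among the three fmed3 operands does not matter. The names median3, clamp01 and checkClampForms are hypothetical, and NaN handling is deliberately left out of this model.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>

// Median of three floats (illustrative helper; NaN inputs are not modeled).
float median3(float A, float B, float C) {
  return std::max(std::min(A, B), std::min(std::max(A, B), C));
}

// Reference clamp to [0.0, 1.0], written as the first source pattern.
float clamp01(float Val) {
  return std::fmin(std::fmax(Val, 0.0f), 1.0f);
}

// Every pattern from the log message yields the same clamped value for
// sample inputs below, inside, and above the [0.0, 1.0] range.
void checkClampForms() {
  const float Samples[] = {-2.0f, 0.0f, 0.25f, 1.0f, 3.5f};
  for (float Val : Samples) {
    const float Expected = clamp01(Val);
    assert(std::fmax(std::fmin(Val, 1.0f), 0.0f) == Expected);
    assert(median3(Val, 0.0f, 1.0f) == Expected);
    assert(median3(0.0f, Val, 1.0f) == Expected);
    assert(median3(1.0f, 0.0f, Val) == Expected);
  }
}
```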
Compare: https://github.com/llvm/llvm-project/compare/9dd1f8dfdd1a...0b34ffe4a61e