[all-commits] [llvm/llvm-project] fced87: [AMDGPU] Fix regression with vectorization limiting
Stanislav Mekhanoshin via All-commits
all-commits at lists.llvm.org
Fri Apr 8 17:47:17 PDT 2022
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: fced87d457d39d94cf7b396d8200dcb4e171003d
https://github.com/llvm/llvm-project/commit/fced87d457d39d94cf7b396d8200dcb4e171003d
Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin at amd.com>
Date: 2022-04-08 (Fri, 08 Apr 2022)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPUTargetTransformInfo.cpp
M llvm/lib/Target/AMDGPU/AMDGPUTargetTransformInfo.h
A llvm/test/Transforms/LoopVectorize/AMDGPU/packed-fp32.ll
M llvm/test/Transforms/LoopVectorize/AMDGPU/packed-math.ll
Log Message:
-----------
[AMDGPU] Fix regression with vectorization limiting
D67148 has removed TTI::getNumberOfRegisters(bool Vector) and
started to call TTI::getNumberOfRegisters(unsigned ClassID) from
the LoopVectorize. This has resulted in an unrestricted vectorization
on AMDGPU blowing up register pressure.
Differential Revision: https://reviews.llvm.org/D122850
More information about the All-commits
mailing list