[PATCH] D30442: [AMDGPU] Add second pass of the scheduler
Konstantin Zhuravlyov via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Feb 28 10:18:19 PST 2017
kzhuravl added inline comments.
================
Comment at: lib/Target/AMDGPU/GCNSchedStrategy.cpp:62
->getNumAllocatableRegs(&AMDGPU::VGPR_32RegClass) - ErrorMargin;
- SGPRCriticalLimit = SRI->getRegPressureSetLimit(DAG->MF,
- SRI->getSGPRPressureSet()) - ErrorMargin;
- VGPRCriticalLimit = SRI->getRegPressureSetLimit(DAG->MF,
- SRI->getVGPRPressureSet()) - ErrorMargin;
+ if (TargetOccupancy) {
+ SGPRCriticalLimit = ST.getMaxNumSGPRs(TargetOccupancy, true);
----------------
I think this should also respect the "amdgpu-waves-per-eu" attribute (https://clang.llvm.org/docs/AttributeReference.html#amdgpu-waves-per-eu)?
Repository:
rL LLVM
https://reviews.llvm.org/D30442
More information about the llvm-commits
mailing list