[llvm] [TableGen] Speed up inferMatchingSuperRegClass. NFC. (PR #133060)
via llvm-commits
llvm-commits at lists.llvm.org
Wed Mar 26 02:48:04 PDT 2025
llvmbot wrote:
<!--LLVM PR SUMMARY COMMENT-->
@llvm/pr-subscribers-tablegen
Author: Jay Foad (jayfoad)
<details>
<summary>Changes</summary>
SubToSuperRegs was a DenseMap of std::vectors, where the vectors
typically had size 1. Switching to a vector of pairs avoids the overhead
of allocating tiny vectors.
I measured a 1.14x speed-up building AMDGPUGenRegisterInfo.inc with this
patch.
---
Full diff: https://github.com/llvm/llvm-project/pull/133060.diff
1 Files Affected:
- (modified) llvm/utils/TableGen/Common/CodeGenRegisters.cpp (+15-10)
``````````diff
diff --git a/llvm/utils/TableGen/Common/CodeGenRegisters.cpp b/llvm/utils/TableGen/Common/CodeGenRegisters.cpp
index 43b9c04eab20a..530f7cae86e5d 100644
--- a/llvm/utils/TableGen/Common/CodeGenRegisters.cpp
+++ b/llvm/utils/TableGen/Common/CodeGenRegisters.cpp
@@ -2341,7 +2341,7 @@ void CodeGenRegBank::inferSubClassWithSubReg(CodeGenRegisterClass *RC) {
void CodeGenRegBank::inferMatchingSuperRegClass(
CodeGenRegisterClass *RC,
std::list<CodeGenRegisterClass>::iterator FirstSubRegRC) {
- DenseMap<const CodeGenRegister *, std::vector<const CodeGenRegister *>>
+ std::vector<std::pair<const CodeGenRegister *, const CodeGenRegister *>>
SubToSuperRegs;
BitVector TopoSigs(getNumTopoSigs());
@@ -2353,15 +2353,17 @@ void CodeGenRegBank::inferMatchingSuperRegClass(
if (RC->getSubClassWithSubReg(&SubIdx) != RC)
continue;
- // Build list of (Super, Sub) pairs for this SubIdx.
+ // Build list of (Sub, Super) pairs for this SubIdx, sorted by Sub. Note
+ // that the list may contain entries with the same Sub but different Supers.
SubToSuperRegs.clear();
TopoSigs.reset();
for (const auto Super : RC->getMembers()) {
const CodeGenRegister *Sub = Super->getSubRegs().find(&SubIdx)->second;
assert(Sub && "Missing sub-register");
- SubToSuperRegs[Sub].push_back(Super);
+ SubToSuperRegs.emplace_back(Sub, Super);
TopoSigs.set(Sub->getTopoSig());
}
+ sort(SubToSuperRegs, on_first<deref<std::less<>>>());
// Iterate over sub-register class candidates. Ignore classes created by
// this loop. They will never be useful.
@@ -2376,14 +2378,17 @@ void CodeGenRegBank::inferMatchingSuperRegClass(
// Topological shortcut: SubRC members have the wrong shape.
if (!TopoSigs.anyCommon(SubRC.getTopoSigs()))
continue;
- // Compute the subset of RC that maps into SubRC.
+ // Compute the subset of RC that maps into SubRC with a single linear scan
+ // through SubToSuperRegs and the members of SubRC.
CodeGenRegister::Vec SubSetVec;
- for (const CodeGenRegister *R : SubRC.getMembers()) {
- auto It = SubToSuperRegs.find(R);
- if (It != SubToSuperRegs.end()) {
- const std::vector<const CodeGenRegister *> &SuperRegs = It->second;
- SubSetVec.insert(SubSetVec.end(), SuperRegs.begin(), SuperRegs.end());
- }
+ auto SubI = SubRC.getMembers().begin(), SubE = SubRC.getMembers().end();
+ for (auto &[Sub, Super] : SubToSuperRegs) {
+ while (SubI != SubE && **SubI < *Sub)
+ ++SubI;
+ if (SubI == SubE)
+ break;
+ if (**SubI == *Sub)
+ SubSetVec.push_back(Super);
}
if (SubSetVec.empty())
``````````
</details>
https://github.com/llvm/llvm-project/pull/133060
More information about the llvm-commits
mailing list