[llvm] [CodeGen][StaticDataPartitioning]Place local-linkage global variables in hot or unlikely prefixed sections based on profile information (PR #125756)
Mingming Liu via llvm-commits
llvm-commits at lists.llvm.org
Fri Mar 28 13:23:04 PDT 2025
================
@@ -0,0 +1,103 @@
+//===- StaticDataAnnotator - Annotate static data's section prefix --------===//
+//
+// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
+// See https://llvm.org/LICENSE.txt for license information.
+// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
+//
+//===----------------------------------------------------------------------===//
+//
+// To reason about module-wide data hotness in a module granularity, this file
+// implements a module pass StaticDataAnnotator to work coordinately with the
+// StaticDataSplitter pass.
+//
+// The StaticDataSplitter pass is a machine function pass. It analyzes data
+// hotness based on code and adds counters in the StaticDataProfileInfo.
+// The StaticDataAnnotator pass is a module pass. It iterates global variables
+// in the module, looks up counters from StaticDataProfileInfo and sets the
+// section prefix based on profiles.
+//
+// The three-pass structure is implemented for practical reasons, to work around
+// the limitation that a module pass based on legacy pass manager cannot make
+// use of MachineBlockFrequencyInfo analysis. In the future, we can consider
+// porting the StaticDataSplitter pass to a module-pass using the new pass
+// manager framework. That way, analysis are lazily computed as opposed to
+// eagerly scheduled, and a module pass can use MachineBlockFrequencyInfo.
+//===----------------------------------------------------------------------===//
+
+#include "llvm/Analysis/ProfileSummaryInfo.h"
+#include "llvm/Analysis/StaticDataProfileInfo.h"
+#include "llvm/CodeGen/Passes.h"
+#include "llvm/IR/Analysis.h"
+#include "llvm/IR/Module.h"
+#include "llvm/IR/PassManager.h"
+#include "llvm/InitializePasses.h"
+#include "llvm/Pass.h"
+#include "llvm/Support/raw_ostream.h"
+
+#define DEBUG_TYPE "static-data-annotator"
+
+using namespace llvm;
+
+class StaticDataAnnotator : public ModulePass {
+public:
+ static char ID;
+
+ StaticDataProfileInfo *SDPI = nullptr;
+ const ProfileSummaryInfo *PSI = nullptr;
+
+ StaticDataAnnotator() : ModulePass(ID) {
+ initializeStaticDataAnnotatorPass(*PassRegistry::getPassRegistry());
+ }
+
+ void getAnalysisUsage(AnalysisUsage &AU) const override {
+ AU.addRequired<StaticDataProfileInfoWrapperPass>();
+ AU.addRequired<ProfileSummaryInfoWrapperPass>();
+ AU.setPreservesAll();
+ ModulePass::getAnalysisUsage(AU);
+ }
+
+ StringRef getPassName() const override { return "Static Data Annotator"; }
+
+ bool runOnModule(Module &M) override;
+};
+
+// Returns true if the global variable already has a section prefix that is the
+// same as `Prefix`.
+static bool alreadyHasSectionPrefix(const GlobalVariable &GV,
+ StringRef Prefix) {
+ std::optional<StringRef> SectionPrefix = GV.getSectionPrefix();
+ return SectionPrefix && (*SectionPrefix == Prefix);
+}
+
+bool StaticDataAnnotator::runOnModule(Module &M) {
+ SDPI = &getAnalysis<StaticDataProfileInfoWrapperPass>()
+ .getStaticDataProfileInfo();
+ PSI = &getAnalysis<ProfileSummaryInfoWrapperPass>().getPSI();
+
+ if (!PSI->hasProfileSummary())
+ return false;
+
+ bool Changed = false;
+ for (auto &GV : M.globals()) {
+ if (GV.isDeclarationForLinker())
+ continue;
+
+ StringRef SectionPrefix = SDPI->getConstantSectionPrefix(&GV, PSI);
+ if (SectionPrefix.empty() || alreadyHasSectionPrefix(GV, SectionPrefix))
----------------
mingmingl-llvm wrote:
> In what situation can the constant already have a section prefix?
Good question. A search of `GlobalValue::setSectionPrefix` in the LLVM repo shows that it's only used for functions but not global variables, so the `alreadyHasSectionPrefix` should return always false currently. To be a little bit future proof, updated the patch to report error if a global variable already has a section prefix, and commented that the subsequent code doesn't handle 'update' properly.
> clang provides a section attribute which can be attached to globals. This is commonly used in embedded applications as far as I know.
Thanks for pointing this out.
For test coverage purposes, the test case was updated with `@cold_data_custom_foo_section` and `@hot_data_custom_bar_section`; the former uses [clang's section attribute](https://clang.llvm.org/docs/AttributeReference.html#section-declspec-allocate), and the latter uses [#pragma clang section ](https://clang.llvm.org/docs/LanguageExtensions.html#specifying-section-names-for-global-objects-pragma-clang-section).
Taking a look at [1], such global variables [1] will have explicit sections, and this work (hot-cold partitioning) won't kick in. Per offline discussion, it's desired for compiler to keep source code annotations without partitioning.
[1] https://github.com/llvm/llvm-project/blob/91d2ecf0d563b03d75380375e8ac26a291bed9d7/llvm/lib/Target/TargetLoweringObjectFile.cpp#L316-L331
https://github.com/llvm/llvm-project/pull/125756
More information about the llvm-commits
mailing list