[llvm] [AMDGPU] Add option to prevent insns straddling half cache-line boundaries (PR #150239)
Matt Arsenault via llvm-commits
llvm-commits at lists.llvm.org
Wed Jul 23 08:49:09 PDT 2025
================
@@ -525,6 +525,12 @@ class SIMachineFunctionInfo final : public AMDGPUMachineFunction,
void MRI_NoteCloneVirtualRegister(Register NewReg, Register SrcReg) override;
public:
+ // Current known instruction alignment and offset in bytes.
+ // Used to prevent instructions from straddling half cache-line boundaries
+ // for performance.
+ unsigned Alignment = 1;
+ unsigned Offset = 0;
----------------
arsenm wrote:
This doesn't belong in MachineFunctionInfo. No per-pass state should ever be here
https://github.com/llvm/llvm-project/pull/150239
More information about the llvm-commits
mailing list