[llvm] [AMDGPU] Add option to prevent insns straddling half cache-line boundaries (PR #150239)

Matt Arsenault via llvm-commits llvm-commits at lists.llvm.org
Wed Jul 23 08:49:09 PDT 2025


================
@@ -525,6 +525,12 @@ class SIMachineFunctionInfo final : public AMDGPUMachineFunction,
   void MRI_NoteCloneVirtualRegister(Register NewReg, Register SrcReg) override;
 
 public:
+  // Current known instruction alignment and offset in bytes.
+  // Used to prevent instructions from straddling half cache-line boundaries
+  // for performance.
+  unsigned Alignment = 1;
+  unsigned Offset = 0;
----------------
arsenm wrote:

This doesn't belong in MachineFunctionInfo. No per-pass state should ever be here

https://github.com/llvm/llvm-project/pull/150239


More information about the llvm-commits mailing list