[llvm] [AMDGPU] Work around s_getpc_b64 zero extending on GFX12 (PR #78186)

Matt Arsenault via llvm-commits llvm-commits at lists.llvm.org
Mon Jan 15 09:04:36 PST 2024


================
@@ -1810,6 +1810,8 @@ def int_amdgcn_s_setreg :
 // not cross a 4Gb address boundary. Use for any other purpose may not
 // produce the desired results as optimizations may cause code movement,
 // especially as we explicitly use IntrNoMem to allow optimizations.
+// This intrinsic always returns PC sign-extended from 48 bits even if the
+// s_getpc_b64 instruction returns a zero-extended value.
----------------
arsenm wrote:

Maybe add this to AMDGPUUsage's intrinsic documentation?
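
For reference, the behaviour the new comment describes (taking a value that was zero-extended above bit 47 and producing the sign-extended form) can be sketched as below. This is only an illustration of the arithmetic, not code from the patch; `signExtend48` is a hypothetical helper name, and the actual lowering in the backend may look quite different.

```cpp
#include <cstdint>

// Hypothetical helper (not from the patch): sign-extend the low 48 bits of a
// 64-bit value, ignoring whatever is in bits 63:48 of the input.
constexpr uint64_t signExtend48(uint64_t PC) {
  constexpr uint64_t Mask48 = (uint64_t{1} << 48) - 1; // keep bits 47:0
  constexpr uint64_t SignBit = uint64_t{1} << 47;      // bit 47 is the sign
  // Flipping the sign bit and then subtracting it propagates bit 47 into
  // bits 63:48 using only well-defined unsigned arithmetic.
  return ((PC & Mask48) ^ SignBit) - SignBit;
}

// Bit 47 set: the upper 16 bits become all ones.
static_assert(signExtend48(0x0000800000000000ULL) == 0xFFFF800000000000ULL);
// Bit 47 clear: the value is unchanged.
static_assert(signExtend48(0x00007FFFFFFFFFFFULL) == 0x00007FFFFFFFFFFFULL);
```

Something along these lines might also make a useful worked example in the documentation entry, if one is added.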

https://github.com/llvm/llvm-project/pull/78186

