[clang] [llvm] AMDGPU: Move enqueued block handling into clang (PR #128519)

Matt Arsenault via cfe-commits cfe-commits at lists.llvm.org
Thu Mar 6 16:29:56 PST 2025


================
@@ -614,6 +614,20 @@ void AMDGPUTargetCodeGenInfo::setCUDAKernelCallingConvention(
       FT, FT->getExtInfo().withCallingConv(CC_OpenCLKernel));
 }
 
+/// Return IR struct type for rtinfo struct in rocm-device-libs used for device
+/// enqueue.
+///
+/// ptr addrspace(1) kernel_object, i32 private_segment_size,
----------------
arsenm wrote:

No, this isn't in the implicit arguments. It could be constant, but I think I had difficulty getting externally_initialized + constant to work as expected.

I also think in principle the special pass isn't necessary anymore. I think something about visibility was how it ended up this way, and might require a runtime change to fully delete it 

https://github.com/llvm/llvm-project/pull/128519


More information about the cfe-commits mailing list