[clang] Pass -offload-lto instead of -lto for cuda/hip kernels (PR #125243)

Omar Ahmed via cfe-commits cfe-commits at lists.llvm.org
Tue Feb 4 07:57:08 PST 2025


================
@@ -498,12 +498,16 @@ Expected<StringRef> clang(ArrayRef<StringRef> InputFiles, const ArgList &Args) {
   };
 
   // Forward all of the `--offload-opt` and similar options to the device.
-  CmdArgs.push_back("-flto");
   for (auto &Arg : Args.filtered(OPT_offload_opt_eq_minus, OPT_mllvm))
     CmdArgs.append(
         {"-Xlinker",
          Args.MakeArgString("--plugin-opt=" + StringRef(Arg->getValue()))});
 
+  if (Triple.isNVPTX() || Triple.isAMDGPU())
+    CmdArgs.push_back("-foffload-lto");
+  else
+    CmdArgs.push_back("-flto");
----------------
omarahmed1111 wrote:

I managed to get similar behaviour from this small example: https://godbolt.org/z/xaTfGrajd

https://github.com/llvm/llvm-project/pull/125243


More information about the cfe-commits mailing list