<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<style type="text/css" style="display:none;"> P {margin-top:0;margin-bottom:0;} </style>
</head>
<body dir="ltr">
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
The conversions are in the following locations. Only 8 conversions of functions are there now.<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
-> mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp<br>
patterns.insert<OpToFuncCallLowering<CosOp>>(converter, "__nv_cosf", "__nv_cos");</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
-> mlir/lib/Conversion/GPUToROCDL/LowerGpuOpsToROCDLOps.cpp<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
patterns.insert<OpToFuncCallLowering<CosOp>>(converter, "__ocml_cos_f32", "__ocml_cos_f64");<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
--Kiran</div>
<div id="appendonsend"></div>
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" color="#000000" style="font-size:11pt"><b>From:</b> Doerfert, Johannes <jdoerfert@anl.gov><br>
<b>Sent:</b> 30 July 2020 15:55<br>
<b>To:</b> Kiran Chandramohan <Kiran.Chandramohan@arm.com>; Jean Perier <jperier@nvidia.com>; Romero, Nichols A. <naromero@anl.gov>; flang-dev@lists.llvm.org <flang-dev@lists.llvm.org><br>
<b>Cc:</b> Clement, Valentin <clementv@ornl.gov><br>
<b>Subject:</b> Re: [flang-dev] OpenMP target regions and intrinsic Fortran math functions</font>
<div> </div>
</div>
<div class="BodyFragment"><font size="2"><span style="font-size:11pt">
<div class="PlainText">On 7/30/20 2:59 AM, Kiran Chandramohan wrote:<br>
> Hi,<br>
><br>
> Another possibility (like with OpenACC dialect) is to use the GPU dialect in mlir to model target regions.<br>
><br>
> The gpu dialect can be converted to other vendor dialects like nvvm, rocdl. During these conversions, calls to math library functions are converted to calls to device library functions<br>
><br>
> For e.g: the following call to the cos math function is converted to either of the calls to __nv_exp or __ocml_cos_f64 below depending on the conversion chosen.<br>
> %result64 = std.cos %arg_f64 : f64<br>
><br>
> %1 = llvm.call @__nv_cos(%arg1) : (!llvm.double) -> !llvm.double<br>
> %1 = llvm.call @__ocml_cos_f64(%arg1) : (!llvm.double) -> !llvm.double<br>
><br>
> <a href="https://github.com/llvm/llvm-project/blob/647e9a54c758a6fdd85a569f019f00a653b2bc40/mlir/test/Conversion/GPUToNVVM/gpu-to-nvvm.mlir#L183">
https://github.com/llvm/llvm-project/blob/647e9a54c758a6fdd85a569f019f00a653b2bc40/mlir/test/Conversion/GPUToNVVM/gpu-to-nvvm.mlir#L183</a><br>
> <a href="https://github.com/llvm/llvm-project/blob/73c12bd8ff1a9cd8375a357ea06f171e127ec1b8/mlir/test/Conversion/GPUToROCDL/gpu-to-rocdl.mlir#L125">
https://github.com/llvm/llvm-project/blob/73c12bd8ff1a9cd8375a357ea06f171e127ec1b8/mlir/test/Conversion/GPUToROCDL/gpu-to-rocdl.mlir#L125</a><br>
<br>
Cool. Where are those math functions and their conversion defined? I <br>
grepped the mlir code but didn't (immediately) find them.<br>
<br>
<br>
> --Kiran<br>
><br>
> ________________________________<br>
> From: flang-dev <flang-dev-bounces@lists.llvm.org> on behalf of Kiran Chandramohan via flang-dev <flang-dev@lists.llvm.org><br>
> Sent: 30 July 2020 00:20<br>
> To: Jean Perier <jperier@nvidia.com>; Romero, Nichols A. <naromero@anl.gov>; flang-dev@lists.llvm.org <flang-dev@lists.llvm.org><br>
> Cc: Doerfert, Johannes <jdoerfert@anl.gov><br>
> Subject: Re: [flang-dev] OpenMP target regions and intrinsic Fortran math functions<br>
><br>
> Hi Nick, Jean,<br>
><br>
> Thanks for bringing this topic up. I must confess that I am not an expert in target and device handling in OpenMP and we have not yet finalized the approach for handling target regions.<br>
><br>
> But here is what I can share, the GPU folks from Nvidia/AMD and Johannes (who implemented this in Clang) can correct me here.<br>
><br>
> Vendors provide device libraries (<a href="https://docs.nvidia.com/cuda/libdevice-users-guide/__nv_sin.html">https://docs.nvidia.com/cuda/libdevice-users-guide/__nv_sin.html</a>) with math function support. The compiler can/should convert calls to the math
library functions in a target region with calls to the device library functions. OpenMP provides the declare variant directive with which specialized variants of functions and the context in which these functions should be used can be specified. This mechanism
can be used in a header file and each vendor can declare variants (with calls to their device library) for each math function. If the frontend supports OpenMP declare variant handling then the calls to math library functions are automatically converted to
calls to device library functions.<br>
><br>
> For e.g: Clang has the following,<br>
><br>
> 1) clang/lib/Headers/openmp_wrappers/math.h<br>
> #pragma omp begin declare variant match( \<br>
> device = {arch(nvptx, nvptx64)}, implementation = {extension(match_any)})<br>
><br>
> #define __CUDA__<br>
> #define __OPENMP_NVPTX__<br>
> #include <__clang_cuda_math.h><br>
> #undef __OPENMP_NVPTX__<br>
> #undef __CUDA__<br>
><br>
> #pragma omp end declare variant<br>
><br>
> 2) clang/lib/Headers/__clang_cuda_math.h<br>
> __DEVICE__ double sin(double __a) { return __nv_sin(__a); }<br>
><br>
> Thanks,<br>
> --Kiran<br>
> ________________________________<br>
> From: Jean Perier <jperier@nvidia.com><br>
> Sent: 27 July 2020 10:59<br>
> To: Romero, Nichols A. <naromero@anl.gov>; flang-dev@lists.llvm.org <flang-dev@lists.llvm.org><br>
> Cc: Kiran Chandramohan <Kiran.Chandramohan@arm.com><br>
> Subject: RE: [flang-dev] OpenMP target regions and intrinsic Fortran math functions<br>
><br>
><br>
> Hi,<br>
><br>
><br>
><br>
> Kiran Chandramohan is working on OpenMP lowering and might have a plan here. Math intrinsics are currently lowered in mlir to a mix of inlined code, calls to llvm intrinsics, and calls to runtime when the first two options are not possible. Attributes are
added to these runtime calls so that they can easily be identified as intrinsic calls and later rewrote if needed. These attributes could be adapted based on what the team working on OpenMP lowering needs here.<br>
><br>
><br>
><br>
> Jean<br>
><br>
><br>
><br>
> From: flang-dev <flang-dev-bounces@lists.llvm.org> On Behalf Of Romero, Nichols A. via flang-dev<br>
> Sent: Wednesday, July 22, 2020 8:26 PM<br>
> To: flang-dev@lists.llvm.org<br>
> Subject: [flang-dev] OpenMP target regions and intrinsic Fortran math functions<br>
><br>
><br>
><br>
> External email: Use caution opening links or attachments<br>
><br>
><br>
><br>
> Hi,<br>
><br>
><br>
><br>
> I am bringing this up because it seems that OpenMP development is really ramping up now and I want to bring up a common use case that does not seem to be supported with other vendor compilers.<br>
><br>
><br>
><br>
> It is a common use case to call the Fortran intrinsic Math functions in an openmp target region. I am not sure how this in implemented in the Fortran + OpenMP compilers for the vendors who do support it. I suspect this is done by inlining, but it appears
that other vendors have their Fortran math functions in a backend runtime library which somehow prevents these functions from being called in an OpenMP target region.<br>
><br>
><br>
><br>
><br>
><br>
><br>
><br>
> --<br>
><br>
> Nichols A. Romero, Ph.D.<br>
><br>
> Computational Science Division<br>
><br>
> Argonne Leadership Computing Facility<br>
> Argonne National Laboratory<br>
> Building 240 Room 2-127<br>
> 9700 South Cass Avenue<br>
> Lemont, IL 60439<br>
> (630) 252-3441<br>
><br>
><br>
><br>
<br>
</div>
</span></font></div>
</body>
</html>