<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=Windows-1252">
<style type="text/css" style="display:none;"> P {margin-top:0;margin-bottom:0;} </style>
</head>
<body dir="ltr">
<div style="font-family: Meiryo, メイリオ, "Hiragino Sans", sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Hello,</div>
<div style="font-family: Meiryo, メイリオ, "Hiragino Sans", sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="font-family: Meiryo, メイリオ, "Hiragino Sans", sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Git hub address is <a href="https://github.com/sx-aurora-dev/llvm" id="LPNoLP577781">https://github.com/sx-aurora-dev/llvm</a> ?</div>
<br>
<div style="font-family: Meiryo, メイリオ, "Hiragino Sans", sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div id="signature">---<br>
Hideyuki Kamei<br>
<div>
<div id="appendonsend"></div>
<div style="font-family:Meiryo,メイリオ,"Hiragino Sans",sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" color="#000000" style="font-size:11pt"><b>From:</b> llvm-dev <llvm-dev-bounces@lists.llvm.org> on behalf of Erich Focht via llvm-dev <llvm-dev@lists.llvm.org><br>
<b>Sent:</b> Thursday, April 4, 2019 15:02<br>
<b>To:</b> llvm-dev@lists.llvm.org<br>
<b>Subject:</b> [llvm-dev] [RFC] NEC SX-Aurora VE backend</font>
<div> </div>
</div>
<div class="BodyFragment"><font size="2"><span style="font-size:11pt">
<div class="PlainText">Hello,<br>
<br>
<br>
we’d like to propose the integration of a new backend into LLVM: NEC<br>
SX-Aurora TSUBASA Vector Engine (VE). We hope to get some feedback here<br>
and at EuroLLVM about the path and proper procedure of merging.<br>
<br>
<br>
The SX-Aurora VE is a Vector CPU on an PCI-E Accelerator card. It has<br>
48GB memory from six HBM2 stacks, accessible with 1.2TB/s bandwidth, 8<br>
cores with vector and scalar units, each. The cores share a last level<br>
cache (LLC) and have 64 scalar registers and a normal scalar unit with<br>
two levels of caches as well as 64 long vector registers (256 x 64<br>
bits), 16 vector mask registers and a vector length register. The VE can<br>
run HPC and AI loads with high efficiency provided that the code is well<br>
vectorized.<br>
<br>
<br>
The VE was released officially in March 2018 and is currently available<br>
integrated into appropriate servers containing 1, 2, 4 or 8 VEs<br>
<a href="https://www.nec.com/en/global/solutions/hpc/sx/">https://www.nec.com/en/global/solutions/hpc/sx/</a>. An SDK is provided,<br>
containing optimized mathematical libraries, proprietary C, C++ and<br>
Fortran compilers as well as a proprietary MPI implementation capable of<br>
communicating over PCIe and in PeerDirect manner over Infiniband. The<br>
main programming model involves running code natively on the VEs, OpenMP<br>
or MPI parallel, with systemcall execution offloaded to the host. Hybrid<br>
programming is possible through offloading kernels from the main program<br>
running on the host CPU to the VEs (accelerator model), or offloading<br>
execution from the VE programs to the host (reverse offloading).<br>
<br>
<br>
In addition to the proprietary compilers an effort has been started to<br>
create a VE backend on LLVM <a href="https://github.com/as-aurora-dev/llvm">https://github.com/as-aurora-dev/llvm</a>.<br>
Initially aimed at compiler and vectorization research, it has reached a<br>
level which we would like to merge into the upstream LLVM repositories.<br>
The state of this development and a sketch of the development plan is<br>
summarized below:<br>
<br>
<br>
1. Assembler, VE CodeGen infrastructure (registers, instruction<br>
encodings): We plan to use the assembler and linker from the<br>
freely-available binutils-ve package. With them LLVM becomes the first<br>
freely-available compiler for VE, and it enables users to use VEs<br>
without any proprietary software. As next step, we will implement<br>
assembler and linker (lld) in llvm like in other backends.<br>
<br>
2. Scalar code backend that produces correct code (capable of handling<br>
vector intrinsics): A complete scalar code backend for C/C++ is already<br>
implemented. We passed all regression tests, check-llvm and check-clang,<br>
and almost all of the test-suite. We are now working to pass the full<br>
test-suite. In this backend we are providing vector intrinsic functions<br>
that cover almost fully the SX-Aurora vector ISA including masked vector<br>
instructions. It helps experienced programmers to write highly optimized<br>
code.<br>
<br>
3. Merging LLVM-VP developments (<a href="https://reviews.llvm.org/D57504">https://reviews.llvm.org/D57504</a>) for<br>
proper vectorization: We plan to use LLVM-VP as the core infrastructure<br>
for vectorization. (Saarland University is working on the LLVM-VP<br>
backend for VE).<br>
<br>
4. Vectorization improvements: We welcome everybody who is interested in<br>
explicit vector length ISA and would like to develop and work on an off<br>
the shelf machine. Detailed information about ISA and Architecture is<br>
available at <a href="https://www.hpc.nec/documents/">https://www.hpc.nec/documents/</a>. We would like to support<br>
researchers who will develop new vectorization technology for true<br>
vector processor.<br>
<br>
Finally: In order to facilitate the integration into the LLVM tree we<br>
intend to provide access to the LLVM community to an Aurora build server<br>
for nightly builds and regression testing.<br>
<br>
<br>
<br>
Erich Focht (NEC), Kazuhisa Ishizaka (NEC), Kazushi Marukawa (NEC),<br>
Simon Moll (CDL Saarland University)<br>
<br>
_______________________________________________<br>
LLVM Developers mailing list<br>
llvm-dev@lists.llvm.org<br>
<a href="https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev">https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev</a><br>
</div>
</span></font></div>
</div>
</div>
</body>
</html>