[LLVMdev] LoopVectorizer in OpenCL C work group autovectorization

Renato Golin renato.golin at linaro.org
Fri Jan 25 00:29:55 PST 2013


On 25 January 2013 07:56, Nadav Rotem <nrotem at apple.com> wrote:

> You need to implement something like Whole Function Vectorization (
> http://dl.acm.org/citation.cfm?id=2190061). The loop vectorizer can't
> help you here. Ralf Karrenberg open sourced his implementation on github.
> You should take a look.
>

It'd be great to have this in LLVM, though some care must be taken to
continue relevant (unlike the C back-end, for example). There are lots of
secrets around GPUs and OpenCL concrete implementation, which could make
very hard to predict or model costs for each different GPU.

cheers,
--renato
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20130125/b5086326/attachment.html>


More information about the llvm-dev mailing list