[LLVMdev] LoopVectorizer in OpenCL C work group autovectorization
renato.golin at linaro.org
Fri Jan 25 00:29:55 PST 2013
On 25 January 2013 07:56, Nadav Rotem <nrotem at apple.com> wrote:
> You need to implement something like Whole Function Vectorization (
> http://dl.acm.org/citation.cfm?id=2190061). The loop vectorizer can't
> help you here. Ralf Karrenberg open sourced his implementation on github.
> You should take a look.
It'd be great to have this in LLVM, though some care must be taken to
continue relevant (unlike the C back-end, for example). There are lots of
secrets around GPUs and OpenCL concrete implementation, which could make
very hard to predict or model costs for each different GPU.
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the llvm-dev