[LLVMdev] avoid live range overlap of "vector" registers
Morten Ofstad
morten at hue.no
Wed May 11 00:40:32 PDT 2005
Chris Lattner wrote:
> None, that documentation is out of date and doesn't make a ton of sense
> for your application. I would suggest that you implement it in the
> context of the SelectionDAG framework that all of the code generators
> either currently use or are moving to. I updated the documentation
> here: http://llvm.cs.uiuc.edu/ChrisLLVM/docs/ExtendingLLVM.html#intrinsic
>
> This will allow you to do something like this:
>
> %i32v4 = type <4 x uint>
>
> %f32v4 = type <4 x float>
>
> declare %f32v4 %swizzle(%f32v4 %In, %i32v4 %Form)
>
> %G = external global %f32v4
>
> void %test() {
> %A = load %f32v4* %G
> %B = call %f32v4 %swizzle(%f32v4 %A, %i32v4 <uint 1, uint 1,
> uint 1, uint 1>) ;; splat XYZW -> YYYY
> store %f32v4 %B, %f32v4* %G
> ret void
> }
>
> ... Except using llvm.swizzle instead of 'swizzle'.
I much prefer the name chosen in the SSE instruction set: 'shuffle'
> Unfortunately the code generator currently does not support packed
> types, so this will require some work. However, this certainly is the
> closest match for your model.
This work needs to be done for SSE code generation, which I think would be of interest to several people (including me)
-- Our front-end generates code that uses packed datatypes a lot and I'm not entirely happy with the current situation
using the LowerPacked pass... If SSE code generation was working, we would use LLVM for a lot more, at the moment we
have a small runtime library with SSE optimized functions for things like trilinear interpolation, but the LLVM
optimizer can't do very much with these functions since they are just external calls.
m.
More information about the llvm-dev
mailing list