RKSimon added a comment.

Code using the llvm.x86.sse*.pmin* / llvm.x86.sse*.pmax* intrinsics has been the main source of them so far, but I wanted to create a general implementation if I could.

Repository:
  rL LLVM

http://reviews.llvm.org/D12118