stellaraccident wrote: I wouldn't object at some point to someone working out how to split the implementation C++ file as we add more variants. These big op libraries bring a fair amount of single threaded compile time. https://github.com/llvm/llvm-project/pull/90236