Implement a vectorized algorithm for <16 x i8> << <16 x i8>
This is about 4x faster and smaller than the existing scalarization. llvm-svn: 109566
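
For illustration only, here is a portable C model of one way a per-lane variable byte shift can be done without scalarizing. The commit message does not describe the algorithm, so this shift-and-select decomposition is an assumption, not the actual lowering. The names `shl_step` and `shl_v16i8` are hypothetical. Each loop over the 16 lanes stands in for a single vector instruction (a fixed shift followed by a byte blend), and the three steps test bits 2, 1, and 0 of each lane's shift amount, shifting by 4, 2, and 1 respectively.

```c
#include <stddef.h>
#include <stdint.h>

#define LANES 16

/* One select step, modelling a fixed vector shift plus a byte blend:
 * in every lane where bit `bit` of the shift amount is set, replace the
 * lane with itself shifted left by `step`; otherwise keep it unchanged. */
static void shl_step(uint8_t r[LANES], const uint8_t amt[LANES],
                     unsigned bit, unsigned step)
{
    for (size_t i = 0; i < LANES; ++i) {
        /* 0xFF if the tested bit is set, 0x00 otherwise. */
        uint8_t mask = (uint8_t)-((amt[i] >> bit) & 1u);
        /* The cast truncates to 8 bits, so nothing leaks into a neighbour lane. */
        uint8_t shifted = (uint8_t)(r[i] << step);
        r[i] = (uint8_t)((shifted & mask) | (r[i] & (uint8_t)~mask));
    }
}

/* Hypothetical model of <16 x i8> << <16 x i8>.
 * Only the low 3 bits of each amount are used; amounts of 8 or more
 * are assumed not to occur (LLVM IR leaves them undefined). */
void shl_v16i8(uint8_t out[LANES], const uint8_t x[LANES],
               const uint8_t amt[LANES])
{
    for (size_t i = 0; i < LANES; ++i)
        out[i] = x[i];
    shl_step(out, amt, 2, 4); /* amount bit 2: shift by 4 */
    shl_step(out, amt, 1, 2); /* amount bit 1: shift by 2 */
    shl_step(out, amt, 0, 1); /* amount bit 0: shift by 1 */
}
```

The point of the decomposition is that every step uses the same shift count in all lanes, so three fixed-count steps cover any amount from 0 to 7 with no per-element extract and insert.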