Use __builtin_clz to find leading 1 in generic sqrt (where possible)
__builtin_clz requires just a single instruction on x86 and ARM, so this is a performance improvement.

Reviewed By: lntue

Differential Revision: https://reviews.llvm.org/D120579
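For illustration only, here is a minimal sketch of the idea, not the code from this patch. The helper names `leading_one_loop`, `leading_one_clz`, and `agree` are hypothetical, and the sketch assumes a GCC- or Clang-compatible compiler that provides `__builtin_clz`. It finds the bit index of the leading 1 first with a portable shift loop and then with the builtin.

```cpp
#include <cassert>
#include <climits>
#include <cstdint>

// Hypothetical portable fallback: shift right until the value is zero,
// counting how many shifts were needed. Returns -1 for x == 0.
inline int leading_one_loop(uint32_t x) {
  int pos = -1;
  while (x != 0) {
    x >>= 1;
    ++pos;
  }
  return pos;
}

// Hypothetical builtin-based version. __builtin_clz takes an unsigned int
// and its result is undefined for 0, so that case is handled separately.
inline int leading_one_clz(uint32_t x) {
  if (x == 0)
    return -1;
  constexpr int kBits = static_cast<int>(sizeof(unsigned int) * CHAR_BIT);
  return kBits - 1 - __builtin_clz(static_cast<unsigned int>(x));
}

// Small self-check: both versions should report the same bit index.
inline bool agree(uint32_t x) {
  return leading_one_loop(x) == leading_one_clz(x);
}
```

For example, `leading_one_clz(6u)` yields 2, because 6 is `110` in binary and its highest set bit is bit 2. The explicit zero check matters because the builtin's result is undefined for a zero argument.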