On Sandybridge loading unaligned 256bits using two XMM loads (vmovups and...
On Sandybridge loading unaligned 256bits using two XMM loads (vmovups and vinsertf128) is faster than using a single vmovups instruction. llvm-svn: 172868
Loading
Please register or sign in to comment