- Jun 24, 2012
-
-
Craig Topper authored
llvm-svn: 159105
-
Craig Topper authored
llvm-svn: 159090
-
- Jun 23, 2012
-
-
Craig Topper authored
Make CVTDQ2PS instruction use SSE2 predicate instead of SSE1. No functional change because there are no patterns in the instructions. Also fix a typo in a comment. llvm-svn: 159087
-
Craig Topper authored
Move CVTPD2DQ to use SSE2 predicate instead of SSE3. Move DQ2PD and PD2DQ to the SSE2 section of the file. llvm-svn: 159086
-
Craig Topper authored
llvm-svn: 159075
-
Craig Topper authored
Remove intrinsic specific instructions for 128-bit (V)CVTDQ2PD. Replace with intrinsic patterns. Mem forms omitted because the load size is only 64-bits. llvm-svn: 159070
-
- Jun 20, 2012
-
-
Craig Topper authored
llvm-svn: 158797
-
Craig Topper authored
llvm-svn: 158795
-
- Jun 16, 2012
-
-
Kay Tiong Khoo authored
llvm-svn: 158603
-
- Jun 06, 2012
-
-
Craig Topper authored
llvm-svn: 158049
-
- May 31, 2012
-
-
Benjamin Kramer authored
It was renamed in gcc/gas a while ago and causes all kinds of confusion because it was named differently in llvm and clang. llvm-svn: 157745
-
Craig Topper authored
llvm-svn: 157731
-
- May 29, 2012
-
-
Benjamin Kramer authored
Add intrinsics, code gen, assembler and disassembler support for the SSE4a extrq and insertq instructions. This required light surgery on the assembler and disassembler because the instructions use an uncommon encoding. They are the only two instructions in x86 that use register operands and two immediates. llvm-svn: 157634
-
- May 08, 2012
-
-
Craig Topper authored
llvm-svn: 156375
-
- May 07, 2012
-
-
Craig Topper authored
llvm-svn: 156287
-
Craig Topper authored
llvm-svn: 156281
-
- Apr 24, 2012
-
-
Nadav Rotem authored
Remove the v2f64 patterns because it does not match any vbroadcast instruction. llvm-svn: 155461
-
Nadav Rotem authored
using the pattern (vbroadcast (i32load src)). In some cases, after we generate this pattern new users are added to the load node, which prevent the selection of the blend pattern. This commit provides fallback patterns which perform in-vector broadcast (using in-vector vbroadcast in AVX2 and pshufd on AVX1). llvm-svn: 155437
-
- Apr 22, 2012
-
-
Elena Demikhovsky authored
llvm-svn: 155309
-
- Apr 16, 2012
-
-
Craig Topper authored
llvm-svn: 154801
-
Craig Topper authored
Flip the arguments when converting vpermd/vpermps intrinsics into instructions. The intrinsic has the mask as the last operand, but the instruction has it as the second. llvm-svn: 154797
-
Craig Topper authored
llvm-svn: 154782
-
Craig Topper authored
llvm-svn: 154778
-
- Apr 15, 2012
-
-
Nadav Rotem authored
Use non-vex instructions for SSE4. llvm-svn: 154770
-
Elena Demikhovsky authored
llvm-svn: 154761
-
- Apr 12, 2012
-
-
Craig Topper authored
llvm-svn: 154580
-
- Apr 11, 2012
-
-
Nadav Rotem authored
Original message: Modify the code that lowers shuffles to blends from using blendvXX to vblendXX. blendV uses a register for the selection while Vblend uses an immediate. On sandybridge they still have the same latency and execute on the same execution ports. llvm-svn: 154483
-
- Apr 10, 2012
-
-
Eric Christopher authored
llvm-svn: 154425
-
Nadav Rotem authored
blendv uses a register for the selection while vblend uses an immediate. On sandybridge they still have the same latency and execute on the same execution ports. llvm-svn: 154396
-
- Apr 08, 2012
-
-
Craig Topper authored
Turn avx2 vinserti128 intrinsic calls into INSERT_SUBVECTOR DAG nodes and remove patterns for selecting the intrinsic. Similar was already done for avx1. llvm-svn: 154272
-
- Apr 07, 2012
-
-
Craig Topper authored
Move vinsertf128 patterns near the instruction definitions. Add AddedComplexity to AVX2 vextracti128 patterns to give them priority over the integer versions of vextractf128 patterns. llvm-svn: 154268
-
- Apr 03, 2012
-
-
Craig Topper authored
llvm-svn: 153935
-
- Mar 20, 2012
-
-
Chad Rosier authored
vextractf128 with 128-bit mem dest. Combines vextractf128 $0, %ymm0, %xmm0 vmovaps %xmm0, (%rdi) to vextractf128 $0, %ymm0, (%rdi) rdar://11082570 llvm-svn: 153139
-
Chad Rosier authored
precedence over the VINSERTF128 avx1 patterns. llvm-svn: 153114
-
Chad Rosier authored
llvm-svn: 153105
-
Chad Rosier authored
whitespace from test case. No functional change intended. llvm-svn: 153103
-
Chad Rosier authored
This results in things such as vmovups 16(%rdi), %xmm0 vinsertf128 $1, %xmm0, %ymm0, %ymm0 to be combined to vinsertf128 $1, 16(%rdi), %ymm0, %ymm0 rdar://11076953 llvm-svn: 153092
-
- Mar 15, 2012
-
-
Chad Rosier authored
This results in things such as vmovaps -96(%rbx), %xmm1 vinsertf128 $1, %xmm1, %ymm0, %ymm0 to be combined to vinsertf128 $1, -96(%rbx), %ymm0, %ymm0 rdar://10643481 llvm-svn: 152762
-
- Mar 10, 2012
-
-
Kay Tiong Khoo authored
llvm-svn: 152507
-
- Mar 09, 2012
-
-
Chad Rosier authored
Original commit message from r147481: DAGCombine for transforming 128->256 casts into a vmovaps, rather then a vxorps + vinsertf128 pair if the original vector came from a load. Fix: Unaligned loads need to generate a vmovups. rdar://10974078 llvm-svn: 152366
-