Commits · a54893c6620a006b744435d2acfbb601a4d8fd4a · Roger Ferrer / llvm-epi-0.8

Jun 09, 2012
- Use XOP vpcom intrinsics in patterns instead of a target specific SDNode type.... · a54893c6
  Craig Topper authored Jun 09, 2012
```
Use XOP vpcom intrinsics in patterns instead of a target specific SDNode type. Remove the custom lowering code that selected the SDNode type.

llvm-svn: 158279
```
  a54893c6
- Replace XOP vpcom intrinsics with fewer intrinsics that take the immediate as an argument. · 3352ba55
  Craig Topper authored Jun 09, 2012
```
llvm-svn: 158278
```
  3352ba55
Jun 08, 2012

Enable optimization for integer ABS on X86 if Subtarget has CMOV. · 6bc2d270
Manman Ren authored Jun 08, 2012
```
llvm-svn: 158220
```
6bc2d270

X86: optimize generated code for integer ABS · 2cdc8afc

Manman Ren authored Jun 07, 2012

This patch will generate the following for integer ABS:
      movl    %edi, %eax
      negl    %eax
      cmovll  %edi, %eax
INSTEAD OF
      movl    %edi, %ecx
      sarl    $31, %ecx
      leal    (%rdi,%rcx), %eax
      xorl    %ecx, %eax

There exists a target-independent DAG combine for integer ABS, which converts
integer ABS to sar+add+xor. For X86, we match this pattern back to neg+cmov. 
This is implemented in PerformXorCombine.

rdar://10695237

llvm-svn: 158175

2cdc8afc

Jun 07, 2012

Do not optimize the used bits of the x86 vselect condition operand, when the... · bbd40f67

Nadav Rotem authored Jun 07, 2012

Do not optimize the used bits of the x86 vselect condition operand, when the condition operand is a vector of 1-bit predicates.
This may happen on MIC devices.

llvm-svn: 158168

bbd40f67

PR13046: we can't replace usage of SUB with CMP in the lowering phase. · 746e4859
Manman Ren authored Jun 07, 2012
```
It will cause assertion failure later on.

llvm-svn: 158160
```
746e4859
Use a base register instead of an index register with the local dynamic model. · 55d1145b
Rafael Espindola authored Jun 07, 2012
```
Fixes pr13048.

llvm-svn: 158158
```
55d1145b

X86: replace SUB with CMP if possible · ae02c5a9

Manman Ren authored Jun 07, 2012

This patch will optimize the following
    movq    %rdi, %rax
    subq    %rsi, %rax
    cmovsq  %rsi, %rdi
    movq    %rdi, %rax
to
    cmpq    %rsi, %rdi
    cmovsq  %rsi, %rdi
    movq    %rdi, %rax

Perform this optimization if the actual result of SUB is not used.

rdar: 11540023
llvm-svn: 158126

ae02c5a9

Revert r157755. · 9c964181

Manman Ren authored Jun 06, 2012

The commit is intended to fix rdar://11540023.
It is implemented as part of peephole optimization. We can actually implement
this in the SelectionDAG lowering phase.

llvm-svn: 158122

9c964181

Jun 06, 2012
- Remove unused private fields found by clang's new -Wunused-private-field. · 628a39fa
  Benjamin Kramer authored Jun 06, 2012
```
There are some that I didn't remove this round because they looked like
obvious stubs. There are dead variables in gtest too, they should be
fixed upstream.

llvm-svn: 158090
```
  628a39fa
- Add support for dynamic stack realignment in the presence of dynamic allocas on · 5d6f01ad
  Chad Rosier authored Jun 06, 2012
```
X86.
rdar://11496434

llvm-svn: 158087
```
  5d6f01ad
- Mark several instructions SSE2 instead of SSE3 as they should be. · bf2409e8
  Craig Topper authored Jun 06, 2012
```
llvm-svn: 158049
```
  bf2409e8
Jun 05, 2012
- X86 itinerary properties. · 39a99140
  Andrew Trick authored Jun 05, 2012
```
llvm-svn: 157981
```
  39a99140
- whitespace · 515f1317
  Andrew Trick authored Jun 05, 2012
```
llvm-svn: 157976
```
  515f1317
Jun 04, 2012
- Better comments for TLS-related X86 MachineOperand flags. · 09610f3e
  Hans Wennborg authored Jun 04, 2012
```
llvm-svn: 157920
```
  09610f3e
- Add intrinsic forms for FMA instructions to opcode folding tables. · c6ac4cef
  Craig Topper authored Jun 04, 2012
```
llvm-svn: 157917
```
  c6ac4cef
- Add VFMADDSUB and VFMSUBADD FMA instructions to folding tables. Also add 213... · 3cb14301
  Craig Topper authored Jun 04, 2012
```
Add VFMADDSUB and VFMSUBADD FMA instructions to folding tables. Also add 213 forms of scalar FMA instructions.

llvm-svn: 157914
```
  3cb14301
Jun 03, 2012
- Rename FMA3 feature flag to just FMA to match gcc so it can be added to clang. · 79dbb0c6
  Craig Topper authored Jun 03, 2012
```
llvm-svn: 157903
```
  79dbb0c6
- Rename fma4 intrinsics to just fma since they are now used for both FMA4 and... · fd53b802
  Craig Topper authored Jun 03, 2012
```
Rename fma4 intrinsics to just fma since they are now used for both FMA4 and FMA3. Autoupgrade support coming in a separate commit.

llvm-svn: 157898
```
  fd53b802
- Revert r157831 · 5097e4f3
  Manman Ren authored Jun 03, 2012
```
llvm-svn: 157896
```
  5097e4f3
- Use sse_load_f32/64 for scalar FMA3 intrinsic patterns instead of 128-bit... · 29eafea2
  Craig Topper authored Jun 03, 2012
```
Use sse_load_f32/64 for scalar FMA3 intrinsic patterns instead of 128-bit loads to match instruction behavior.

llvm-svn: 157895
```
  29eafea2
- Add neverHasSideEffects and mayLoad to FMA3 instructions. · badd755a
  Craig Topper authored Jun 03, 2012
```
llvm-svn: 157894
```
  badd755a
Jun 02, 2012

Fix typos found by http://github.com/lyda/misspell-check · bde91766
Benjamin Kramer authored Jun 02, 2012
```
llvm-svn: 157885
```
bde91766

Switch all register list clients to the new MC*Iterator interface. · 54038d79

Jakob Stoklund Olesen authored Jun 01, 2012

No functional change intended.

Sorry for the churn. The iterator classes are supposed to help avoid
giant commits like this one in the future. The TableGen-produced
register lists are getting quite large, and it may be necessary to
change the table representation.

This makes it possible to do so without changing all clients (again).

llvm-svn: 157854

54038d79

Jun 01, 2012
- X86: peephole optimization to remove cmp instruction · 879ca9d4
  Manman Ren authored Jun 01, 2012
```
This patch will optimize the following:
  sub r1, r3
  cmp r3, r1 or cmp r1, r3
  bge L1
TO
  sub r1, r3
  bge L1 or ble L1

If the branch instruction can use flag from "sub", then we can eliminate
the "cmp" instruction.

llvm-svn: 157831
```
  879ca9d4
- Implement the local-dynamic TLS model for x86 (PR3985) · 789acfb6
  Hans Wennborg authored Jun 01, 2012
```
This implements codegen support for accesses to thread-local variables
using the local-dynamic model, and adds a clean-up pass so that the base
address for the TLS block can be re-used between local-dynamic access on
an execution path.

llvm-svn: 157818
```
  789acfb6
- Enable automatic detection of FMA3 support to allow intrinsics to be used. · 1d4d62d7
  Craig Topper authored Jun 01, 2012
```
llvm-svn: 157805
```
  1d4d62d7
- Remove fadd(fmul) patterns for FMA3. This needs to be implemented by paying... · 00649d51
  Craig Topper authored Jun 01, 2012
```
Remove fadd(fmul) patterns for FMA3. This needs to be implemented by paying attention to FP_CONTRACT and matching @llvm.fma which is not available yet. This will allow us to enablle intrinsic use at least though.

llvm-svn: 157804
```
  00649d51
- Add VFNSUB* instructions to folding table. · 2e127b52
  Craig Topper authored Jun 01, 2012
```
llvm-svn: 157802
```
  2e127b52
- Remove a trailing space and fix a comment. · 9eadcfdf
  Craig Topper authored Jun 01, 2012
```
llvm-svn: 157801
```
  9eadcfdf
- Tidy up. Remove trailing spaces and fix the worst of the 80 column violations. · df09da83
  Craig Topper authored Jun 01, 2012
```
llvm-svn: 157799
```
  df09da83
- Put the shiny new MCSubRegIterator to work. · 526772de
  Chad Rosier authored Jun 01, 2012
```
llvm-svn: 157783
```
  526772de
May 31, 2012

Add support for return value promotion in X86 calling conventions. · 4f203ea3
Jakob Stoklund Olesen authored May 31, 2012
```
Patch by Yiannis Tsiouris!

llvm-svn: 157757
```
4f203ea3

X86: replace SUB with CMP if possible · 9bccb64e

Manman Ren authored May 31, 2012

This patch will optimize the following
        movq    %rdi, %rax
        subq    %rsi, %rax
        cmovsq  %rsi, %rdi
        movq    %rdi, %rax
to
        cmpq    %rsi, %rdi
        cmovsq  %rsi, %rdi
        movq    %rdi, %rax

Perform this optimization if the actual result of SUB is not used.

rdar: 11540023
llvm-svn: 157755

9bccb64e

X86: Rename the CLMUL target feature to PCLMUL. · a0396e45

Benjamin Kramer authored May 31, 2012

It was renamed in gcc/gas a while ago and causes all kinds of
confusion because it was named differently in llvm and clang.

llvm-svn: 157745

a0396e45

Added FMA3 Intel instructions. · 602f3a26

Elena Demikhovsky authored May 31, 2012

I disabled FMA3 autodetection, since the result may differ from expected for some benchmarks.
I added tests for GodeGen and intrinsics.
I did not change llvm.fma.f32/64 - it may be done later.

llvm-svn: 157737

602f3a26

Add intrinsic for pclmulqdq instruction. · c1ac05da
Craig Topper authored May 31, 2012
```
llvm-svn: 157731
```
c1ac05da

May 30, 2012

it's pointed out that R11 can be used for magic things, and doing things just... · 1622a99e

Chris Lattner authored May 30, 2012

it's pointed out that R11 can be used for magic things, and doing things just for 64-bit registers is silly.  Just optimize 3 more.

llvm-svn: 157699

1622a99e

Extend the (abi-irrelevant) return convention to be able to return more than two values in · 04d722a6

Chris Lattner authored May 30, 2012

integer registers.  This is already supported by the fastcc convention, but it doesn't
hurt to support it in the standard conventions as well.

In cases where we can cheat at the calling convention, this allows us to avoid returning
things through memory in more cases.

llvm-svn: 157698

04d722a6

Port support for SSE4a extrq/insertq to the old jit code emitter. · f1e0b6cd
Benjamin Kramer authored May 30, 2012
```
llvm-svn: 157685
```
f1e0b6cd