Commits · d5e3fd6dc84d0c703875efd541a61a6636985c4a · Roger Ferrer / llvm-epi-0.8

Mar 06, 2010

Lower dynamic stack allocation on mingw32 to separate instruction. · d5e3fd6d

Anton Korobeynikov authored Mar 06, 2010

We cannot use a normal call here since it has extra unmodelled side
effects (it changes stack pointer). This should fix PR5292.

llvm-svn: 97884

d5e3fd6d

Mar 05, 2010

Fix typo. · 27494232
Evan Cheng authored Mar 05, 2010
```
llvm-svn: 97818
```
27494232

Fix an oops in x86 sibcall optimization. If the ByVal callee argument is... · 654ec2a6

Evan Cheng authored Mar 05, 2010

Fix an oops in x86 sibcall optimization. If the ByVal callee argument is itself passed as a pointer, then it's obviously not safe to do a tail call.

llvm-svn: 97797

654ec2a6

Rever 96389 and 96990. They are causing some miscompilation that I do not fully understand. · cf67ffa5
Evan Cheng authored Mar 05, 2010
```
llvm-svn: 97782
```
cf67ffa5
Revert r97766. It's deleting a tag. · 543ce1f6
Bill Wendling authored Mar 05, 2010
```
llvm-svn: 97768
```
543ce1f6

Micro-optimization: · 6517f88f

Bill Wendling authored Mar 05, 2010

This code:

float floatingPointComparison(float x, float y) {
    double product = (double)x * y;
    if (product == 0.0)
        return product;
    return product - 1.0;
}

produces this:

_floatingPointComparison:
0000000000000000        cvtss2sd        %xmm1,%xmm1
0000000000000004        cvtss2sd        %xmm0,%xmm0
0000000000000008        mulsd           %xmm1,%xmm0
000000000000000c        pxor            %xmm1,%xmm1
0000000000000010        ucomisd         %xmm1,%xmm0
0000000000000014        jne             0x00000004
0000000000000016        jp              0x00000002
0000000000000018        jmp             0x00000008
000000000000001a        addsd           0x00000006(%rip),%xmm0
0000000000000022        cvtsd2ss        %xmm0,%xmm0
0000000000000026        ret

The "jne/jp/jmp" sequence can be reduced to this instead:

_floatingPointComparison:
0000000000000000        cvtss2sd        %xmm1,%xmm1
0000000000000004        cvtss2sd        %xmm0,%xmm0
0000000000000008        mulsd           %xmm1,%xmm0
000000000000000c        pxor            %xmm1,%xmm1
0000000000000010        ucomisd         %xmm1,%xmm0
0000000000000014        jp              0x00000002
0000000000000016        je              0x00000008
0000000000000018        addsd           0x00000006(%rip),%xmm0
0000000000000020        cvtsd2ss        %xmm0,%xmm0
0000000000000024        ret

for a savings of 2 bytes.

This xform can happen when we recognize that jne and jp jump to the same "true"
MBB, the unconditional jump would jump to the "false" MBB, and the "true" branch
is the fall-through MBB.

llvm-svn: 97766

6517f88f

Mar 04, 2010
- Fix the remaining MUL8 and DIV8 to define AX instead of AL,AH. · af6ca232
  Jakob Stoklund Olesen authored Mar 04, 2010
```
These instructions technically define AL,AH, but a trick in X86ISelDAGToDAG
reads AX in order to avoid reading AH with a REX instruction.

Fix PR6489.

llvm-svn: 97742
```
  af6ca232
- Fix recognition of 16-bit bswap for C front-ends which emit the · b8ebd408
  Dan Gohman authored Mar 04, 2010
```
clobber registers in a different order.

llvm-svn: 97741
```
  b8ebd408
- not committing what you test = bad. · 795667b4
  Chris Lattner authored Mar 04, 2010
```
llvm-svn: 97740
```
  795667b4
- make gep matching in fastisel match the base of the gep as a · 6ce8e24b
  Chris Lattner authored Mar 04, 2010
```
register if it isn't possible to match the indexes *and* the base.
This fixes some fast isel rejects of load instructions on oggenc.

llvm-svn: 97739
```
  6ce8e24b
- add a comment. · 82cc5338
  Chris Lattner authored Mar 04, 2010
```
llvm-svn: 97709
```
  82cc5338
Mar 03, 2010
- remove nvload and two patterns that use it which are · db42f3ef
  Chris Lattner authored Mar 03, 2010
```
better done by dag combine.

llvm-svn: 97633
```
  db42f3ef
- factor the 'in the default address space' check out to a single · 46897d35
  Chris Lattner authored Mar 03, 2010
```
'dsload' pattern.  tblgen doesn't check patterns to see if they're
textually identical.  This allows better factoring.

llvm-svn: 97630
```
  46897d35
- factor the 'sign extended from 8 bit' patterns better so · 3fcbbd86
  Chris Lattner authored Mar 03, 2010
```
that they are not destination type specific.  This allows
tblgen to factor them and the type check is redundant with
what the isel does anyway.

llvm-svn: 97629
```
  3fcbbd86
- merge two loops over all nodes in the graph into one. · 8d637040
  Chris Lattner authored Mar 02, 2010
```
llvm-svn: 97606
```
  8d637040
Mar 02, 2010

eliminate PreprocessForRMW now that isel handles it. · 1eb6eb05
Chris Lattner authored Mar 02, 2010
```
We still preprocess calls and fp return stuff.

llvm-svn: 97598
```
1eb6eb05

Fix some issues in WalkChainUsers dealing with · dd030701

Chris Lattner authored Mar 02, 2010

CopyToReg/CopyFromReg/INLINEASM.  These are annoying because
they have the same opcode before an after isel.  Fix this by
setting their NodeID to -1 to indicate that they are selected,
just like what automatically happens when selecting things that
end up being machine nodes.

With that done, give IsLegalToFold a new flag that causes it to
ignore chains.  This lets the HandleMergeInputChains routine be
the one place that validates chains after a match is successful,
enabling the new hotness in chain processing.  This smarter
chain processing eliminates the need for "PreprocessRMW" in the
X86 and MSP430 backends and enables MSP to start matching it's
multiple mem operand instructions more aggressively.

I currently #if out the dead code in the X86 backend and MSP 
backend, I'll remove it for real in a follow-on patch.

The testcase changes are:
  test/CodeGen/X86/sse3.ll: we generate better code
  test/CodeGen/X86/store_op_load_fold2.ll: PreprocessRMW was 
      miscompiling this before, we now generate correct code
      Convert it to filecheck while I'm at it.
  test/CodeGen/MSP430/Inst16mm.ll: Add a testcase for mem/mem
      folding to make anton happy. :)

llvm-svn: 97596

dd030701

Sink InstructionSelect() out of each target into SDISel, and rename it · f98f124a

Chris Lattner authored Mar 02, 2010

DoInstructionSelection.  Inline "SelectRoot" into it from DAGISelHeader.
Sink some other stuff out of DAGISelHeader into SDISel.

Eliminate the various 'Indent' stuff from various targets, which dates
to when isel was recursive.

 17 files changed, 114 insertions(+), 430 deletions(-)

llvm-svn: 97555

f98f124a

Remove dead parameter passing. · 78c5b7a7
Bill Wendling authored Mar 02, 2010
```
llvm-svn: 97536
```
78c5b7a7
Floating-point add, sub, and mul are now spelled fadd, fsub, and fmul, · 6f34abd0
Dan Gohman authored Mar 02, 2010
```
respectively.

llvm-svn: 97531
```
6f34abd0

Mar 01, 2010
- remove a little hack I did for the old isel, not needed · bd6e193f
  Chris Lattner authored Mar 01, 2010
```
now that it is gone.

llvm-svn: 97516
```
  bd6e193f
- Remove the optimize for code size limitation on r67917. Optimize 64-bit imul... · 87d50aa1
  Evan Cheng authored Mar 01, 2010
```
Remove the optimize for code size limitation on r67917. Optimize 64-bit imul by constants into leas + shl regardless if optimizing for code size. The size saving from using imulq isn't worth it. Also, the lea and shl instructions may expose further optimization.

llvm-svn: 97507
```
  87d50aa1
- remove a terrible hack that disabled assertions from this file because of build time · 55ef1ebe
  Chris Lattner authored Mar 01, 2010
```
problems.  rdar://7697850.

llvm-svn: 97500
```
  55ef1ebe
- This is now done. · 312d604e
  Dan Gohman authored Mar 01, 2010
```
llvm-svn: 97450
```
  312d604e
Feb 28, 2010

80-col violations/trailing whitespace. · abd56bde
Mikhail Glushenkov authored Feb 28, 2010
```
llvm-svn: 97427
```
abd56bde

Implement XMM subregs. · bdd6405f

Dan Gohman authored Feb 28, 2010

Extracting the low element of a vector is now done with EXTRACT_SUBREG,
and the zero-extension performed by load movss is now modeled with
SUBREG_TO_REG, and so on.

Register-to-register movss and movsd are no longer considered copies;
they are two-address instructions which insert a scalar into a vector.

llvm-svn: 97354

bdd6405f

The mayHaveSideEffects flag is no longer used. · 8c5d683a
Dan Gohman authored Feb 27, 2010
```
llvm-svn: 97348
```
8c5d683a

Feb 27, 2010
- fix an incorrect (overly conservative) predicate. · a2075d44
  Chris Lattner authored Feb 27, 2010
```
llvm-svn: 97316
```
  a2075d44
- Re-apply 97040 with fix. This survives a ppc self-host llvm-gcc bootstrap. · 228c31f0
  Evan Cheng authored Feb 27, 2010
```
llvm-svn: 97310
```
  228c31f0
Feb 26, 2010
- Move dbg_value generation to target-independent FastISel, · dd331042
  Dale Johannesen authored Feb 26, 2010
```
as X86 is currently the only FastISel target.  Per review.

llvm-svn: 97255
```
  dd331042
- movl is a cheaper way to materialize 0 without clobbering EFLAGS than movabsq. · 952f6f98
  Dan Gohman authored Feb 26, 2010
```
llvm-svn: 97227
```
  952f6f98
- Delete a bunch of redundant predicates. · 9300486d
  Dan Gohman authored Feb 26, 2010
```
llvm-svn: 97201
```
  9300486d
Feb 25, 2010
- Fix TextAlignFillValue in a few places · 68e22cb5
  Daniel Dunbar authored Feb 25, 2010
```
llvm-svn: 97151
```
  68e22cb5
- Truncate from i64 to i32 is "free" on x86-32, because it involves · ec4e1b67
  Dan Gohman authored Feb 25, 2010
```
just discarding one of the registers.

llvm-svn: 97100
```
  ec4e1b67
Feb 24, 2010
- Speculatively revert r97011, "Re-apply 96540 and 96556 with fixes.", again in · 4811d004
  Daniel Dunbar authored Feb 24, 2010
```
the hopes of fixing PPC bootstrap.

llvm-svn: 97040
```
  4811d004
- When forming SSE min and max nodes for UGE and ULE comparisons, it's · 38605214
  Dan Gohman authored Feb 24, 2010
```
necessary to swap the operands to handle NaN and negative zero properly.

Also, reintroduce logic for checking for NaN conditions when forming
SSE min and max instructions, fixed to take into consideration NaNs and
negative zeros. This allows forming min and max instructions in more
cases.

llvm-svn: 97025
```
  38605214
- Re-apply 96540 and 96556 with fixes. · 328a6074
  Evan Cheng authored Feb 24, 2010
```
llvm-svn: 97011
```
  328a6074
- DIV8r must define %AX since X86DAGToDAGISel::Select() sometimes uses it · a2d8c97b
  Jakob Stoklund Olesen authored Feb 24, 2010
```
instead of %AL/%AH.

llvm-svn: 97006
```
  a2d8c97b
Feb 23, 2010
- Fix rev 96389 by restricting the xform to mask that's either signbit or max signed value. · da52f449
  Evan Cheng authored Feb 23, 2010
```
llvm-svn: 96990
```
  da52f449
- no need to override IsLegalToFold, the base implementation · 8d7b4393
  Chris Lattner authored Feb 23, 2010
```
disables load folding at -O0.

llvm-svn: 96973
```
  8d7b4393