- Feb 23, 2005

  Chris Lattner authored
  llvm-svn: 20284

- Jan 25, 2005

  Chris Lattner authored
  llvm-svn: 19835

  Chris Lattner authored
  of FP ops.
  llvm-svn: 19834

- Jan 24, 2005

  Chris Lattner authored
  llvm-svn: 19798

  Chris Lattner authored
  all. This should speed up the X86 backend fairly significantly on integer
  codes. Now if only we didn't have to compute livevar still... ;-)
  llvm-svn: 19796

- Jan 23, 2005

  Reid Spencer authored
  doesn't support certain directives, and symbols on cygwin are prefixed with
  an underscore. This patch makes the necessary adjustments to the output.
  llvm-svn: 19775

- Jan 21, 2005

  Chris Lattner authored
  llvm-svn: 19733

  Chris Lattner authored
  llvm-svn: 19731

  Chris Lattner authored
  llvm-svn: 19728

- Jan 20, 2005

  Chris Lattner authored
  fixes most of the remaining llc-beta failures.
  llvm-svn: 19716

  Chris Lattner authored
  llvm-svn: 19711

- Jan 19, 2005

  Chris Lattner authored
  pressure, not decreases register pressure. Fix problem where we accidentally
  swapped the operands of SHLD, which caused fourinarow to fail. This fixes
  fourinarow.
  llvm-svn: 19697

  Chris Lattner authored
  llvm-svn: 19694

  Chris Lattner authored
  typically cost 1 cycle) instead of shld/shrd instructions (which typically
  take 6 or more cycles). This also saves code space. For example, instead of
  emitting:

      rotr:
          mov %EAX, DWORD PTR [%ESP + 4]
          mov %CL, BYTE PTR [%ESP + 8]
          shrd %EAX, %EAX, %CL
          ret
      rotli:
          mov %EAX, DWORD PTR [%ESP + 4]
          shrd %EAX, %EAX, 27
          ret

  emit:

      rotr32:
          mov %CL, BYTE PTR [%ESP + 8]
          mov %EAX, DWORD PTR [%ESP + 4]
          ror %EAX, %CL
          ret
      rotli32:
          mov %EAX, DWORD PTR [%ESP + 4]
          ror %EAX, 27
          ret

  We also emit byte rotate instructions, which do not have a sh[lr]d
  counterpart at all.
  llvm-svn: 19692

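As an aside (my sketch, not code from this commit): in C, the portable shift/or idiom below is the classic source pattern for the rotate being discussed; the function name and the fixed 32-bit width are my assumptions for illustration.

```c
#include <stdint.h>

/* Hypothetical illustration: for n in 1..31 (n == 0 would make the second
 * shift count 32, which is undefined behavior in C), this shift/or pair is
 * the idiom compilers recognize and lower to a single ror instruction. */
static uint32_t rotr32(uint32_t x, unsigned n) {
    return (x >> n) | (x << (32u - n));
}
```

For example, rotating 0x80000001 right by 1 wraps the low bit around to bit 31, giving 0xC0000000.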
  Chris Lattner authored
  llvm-svn: 19690

  Chris Lattner authored
  llvm-svn: 19689

  Chris Lattner authored
  llvm-svn: 19687

  Chris Lattner authored
  This allows us to generate this:

      foo:
          mov %EAX, DWORD PTR [%ESP + 4]
          mov %EDX, DWORD PTR [%ESP + 8]
          shld %EDX, %EDX, 2
          shl %EAX, 2
          ret

  instead of this:

      foo:
          mov %EAX, DWORD PTR [%ESP + 4]
          mov %ECX, DWORD PTR [%ESP + 8]
          mov %EDX, %EAX
          shrd %EDX, %ECX, 30
          shl %EAX, 2
          ret

  Note the magically transmogrifying immediate.
  llvm-svn: 19686

  Chris Lattner authored

      foo:
          mov %EAX, DWORD PTR [%ESP + 4]
          mov %EDX, DWORD PTR [%ESP + 8]
          shrd %EAX, %EDX, 2
          sar %EDX, 2
          ret

  instead of this:

      test1:
          mov %ECX, DWORD PTR [%ESP + 4]
          shr %ECX, 2
          mov %EDX, DWORD PTR [%ESP + 8]
          mov %EAX, %EDX
          shl %EAX, 30
          or %EAX, %ECX
          sar %EDX, 2
          ret

  and long << 2 to this:

      foo:
          mov %EAX, DWORD PTR [%ESP + 4]
          mov %ECX, DWORD PTR [%ESP + 8]
      *** mov %EDX, %EAX
          shrd %EDX, %ECX, 30
          shl %EAX, 2
          ret

  instead of this:

      foo:
          mov %EAX, DWORD PTR [%ESP + 4]
          mov %ECX, %EAX
          shr %ECX, 30
          mov %EDX, DWORD PTR [%ESP + 8]
          shl %EDX, 2
          or %EDX, %ECX
          shl %EAX, 2
          ret

  The extra copy (marked ***) can be eliminated when I teach the code
  generator that shrd32rri8 is really commutative.
  llvm-svn: 19681

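A minimal C sketch (my illustration, not the commit's code) of the double-word shift those shld/shrd sequences implement: a 64-bit left shift by k carried out on two 32-bit halves.

```c
#include <stdint.h>

/* Illustrative only: 64-bit left shift by k (0 < k < 32) on a 32-bit
 * machine. The high half picks up the bits shifted out of the low half,
 * which is exactly what one shld instruction computes; without it the
 * backend falls back to the shr/shl/or sequence shown above. */
static void shl64(uint32_t *hi, uint32_t *lo, unsigned k) {
    *hi = (*hi << k) | (*lo >> (32u - k)); /* shld %hi, %lo, k */
    *lo <<= k;                             /* shl  %lo, k      */
}
```

Shifting the pair (hi=0x00000001, lo=0x80000000), i.e. 0x0000000180000000, left by 2 yields 0x0000000600000000: the top two bits of the low half land in the high half.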
  Chris Lattner authored
  llvm-svn: 19678

- Jan 18, 2005

  Chris Lattner authored
  FP_EXTEND from!
  llvm-svn: 19674

  Chris Lattner authored
  llvm-svn: 19673

  Chris Lattner authored
  don't need to even think about F32 in the X86 code anymore.
  llvm-svn: 19672

  Chris Lattner authored
  llvm-svn: 19667

  Chris Lattner authored
  llvm-svn: 19661

  Chris Lattner authored
  llvm-svn: 19659

  Chris Lattner authored
  * Insert some really pedantic assertions that will notice when we emit the
    same loads more than one time, exposing bugs. This turns a miscompilation
    in bzip2 into a compile-fail. yaay.
  llvm-svn: 19658

  Chris Lattner authored
  match (X+Y)+(Z << 1), because we match the X+Y first, consuming the index
  register, then there is no place to put the Z.
  llvm-svn: 19652

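For context (my hypothetical example, not the commit's test case), the expression shape in question looks like this in C; an x86 address can encode only one base plus one scaled index, so once X+Y claims both register slots there is nowhere to fold Z << 1.

```c
/* Hypothetical example of the shape discussed: (X + Y) + (Z << 1).
 * After X and Y occupy the base and index slots of the addressing mode,
 * the scaled Z term cannot be folded into the same lea. */
static int addr_shape(int x, int y, int z) {
    return (x + y) + (z << 1);
}
```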
  Chris Lattner authored
  emitted too early. In particular, this fixes
  Regression/CodeGen/X86/regpressure.ll:regpressure3. This also improves the
  2nd basic block in 164.gzip:flush_block, which went from:

      .LBBflush_block_1:                      # loopentry.1.i
          movzx %EAX, WORD PTR [dyn_ltree + 20]
          movzx %ECX, WORD PTR [dyn_ltree + 16]
          mov DWORD PTR [%ESP + 32], %ECX
          movzx %ECX, WORD PTR [dyn_ltree + 12]
          movzx %EDX, WORD PTR [dyn_ltree + 8]
          movzx %EBX, WORD PTR [dyn_ltree + 4]
          mov DWORD PTR [%ESP + 36], %EBX
          movzx %EBX, WORD PTR [dyn_ltree]
          add DWORD PTR [%ESP + 36], %EBX
          add %EDX, DWORD PTR [%ESP + 36]
          add %ECX, %EDX
          add DWORD PTR [%ESP + 32], %ECX
          add %EAX, DWORD PTR [%ESP + 32]
          movzx %ECX, WORD PTR [dyn_ltree + 24]
          add %EAX, %ECX
          mov %ECX, 0
          mov %EDX, %ECX

  to:

      .LBBflush_block_1:                      # loopentry.1.i
          movzx %EAX, WORD PTR [dyn_ltree]
          movzx %ECX, WORD PTR [dyn_ltree + 4]
          add %ECX, %EAX
          movzx %EAX, WORD PTR [dyn_ltree + 8]
          add %EAX, %ECX
          movzx %ECX, WORD PTR [dyn_ltree + 12]
          add %ECX, %EAX
          movzx %EAX, WORD PTR [dyn_ltree + 16]
          add %EAX, %ECX
          movzx %ECX, WORD PTR [dyn_ltree + 20]
          add %ECX, %EAX
          movzx %EAX, WORD PTR [dyn_ltree + 24]
          add %ECX, %EAX
          mov %EAX, 0
          mov %EDX, %EAX

  ... which results in less spilling in the function. This change alone speeds
  up 164.gzip from 37.23s to 36.24s on apoc. The default isel takes 37.31s.
  llvm-svn: 19650

  Chris Lattner authored
  llvm-svn: 19649

  Chris Lattner authored
  llvm-svn: 19647

- Jan 17, 2005

  Chris Lattner authored
  llvm-svn: 19645

  Chris Lattner authored
  1. Fold [mem] += (1|-1) into inc [mem]/dec [mem] to save some icache space.
  2. Do not let token factor nodes prevent forming '[mem] op= val' folds.
  llvm-svn: 19643

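In C terms (a hypothetical example of mine, not from the patch), the first fold targets a plain read-modify-write on memory:

```c
/* With fold 1 above, a memory increment like this can be emitted as a
 * single `inc DWORD PTR [mem]` instead of a load/add/store triple. */
static void bump(int *counter) {
    *counter += 1;
}
```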
  Chris Lattner authored
  llvm-svn: 19641

  Chris Lattner authored
  operations. The body of the if is less indented but unmodified in this
  patch.
  llvm-svn: 19638

  Chris Lattner authored

      int %foo(int %X) {
          %T = add int %X, 13
          %S = mul int %T, 3
          ret int %S
      }

  as this:

      mov %ECX, DWORD PTR [%ESP + 4]
      lea %EAX, DWORD PTR [%ECX + 2*%ECX + 39]
      ret

  instead of this:

      mov %ECX, DWORD PTR [%ESP + 4]
      mov %EAX, %ECX
      add %EAX, 13
      imul %EAX, %EAX, 3
      ret

  llvm-svn: 19633

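The fold above rests on lea's base + scale*index + displacement addressing. As a sketch (the function names are mine, for illustration), both forms below compute the same value, since (X + 13) * 3 = X + 2*X + 39:

```c
/* The source expression as written... */
static int foo(int x) { return (x + 13) * 3; }

/* ...and the arithmetic the single lea performs: base + 2*base + 39. */
static int foo_as_lea(int x) { return x + 2 * x + 39; }
```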
  Chris Lattner authored
  Do not fold a load into an operation if it will induce a cycle in the DAG.
  Repeat after me: dAg.
  llvm-svn: 19631

  Chris Lattner authored
  The comparison will probably be folded, so this is not ok to do. This fixed
  197.parser.
  llvm-svn: 19624

  Chris Lattner authored
  of the bytereg. This fixes yacr2, 300.twolf and probably others.
  llvm-svn: 19622

  Chris Lattner authored
  If we emit a load because we followed a token chain to get to it, try to
  fold it into its single user if possible.
  llvm-svn: 19620