- Apr 02, 2005
Chris Lattner authored
llvm-svn: 21005
Chris Lattner authored
llvm-svn: 20990
- Mar 30, 2005
Chris Lattner authored
llvm-svn: 20936
- Mar 26, 2005
Nate Begeman authored
llvm-svn: 20842
- Mar 24, 2005
Chris Lattner authored
llvm-svn: 20812
Nate Begeman authored
request. llvm-svn: 20804
- Mar 17, 2005
Chris Lattner authored
llvm-svn: 20651
Chris Lattner authored
Bill Wendling!! llvm-svn: 20649
- Mar 15, 2005
Chris Lattner authored
using Function::arg_{iterator|begin|end}. Likewise Module::g* -> Module::global_*. This patch is contributed by Gabor Greif, thanks! llvm-svn: 20597
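For context, a minimal sketch of what client code looks like after this rename. The header paths shown are the 2005-era ones; later LLVM releases moved them under llvm/IR/.

    #include "llvm/Function.h"
    #include "llvm/Module.h"

    using namespace llvm;

    // Count a function's formal arguments and a module's global variables
    // using the renamed iterator accessors.
    static unsigned countArgsAndGlobals(Module &M, Function &F) {
      unsigned N = 0;
      for (Function::arg_iterator AI = F.arg_begin(), AE = F.arg_end(); AI != AE; ++AI)
        ++N;
      for (Module::global_iterator GI = M.global_begin(), GE = M.global_end(); GI != GE; ++GI)
        ++N;
      return N;
    }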
- Mar 08, 2005
Reid Spencer authored
to cygwin) llvm-svn: 20520
- Feb 27, 2005
Chris Lattner authored
llvm-svn: 20343
- Feb 23, 2005
Chris Lattner authored
llvm-svn: 20284
- Jan 25, 2005
Chris Lattner authored
llvm-svn: 19835
Chris Lattner authored
of FP ops. llvm-svn: 19834
- Jan 24, 2005
Chris Lattner authored
llvm-svn: 19798
Chris Lattner authored
all. This should speed up the X86 backend fairly significantly on integer codes. Now if only we didn't have to compute livevar still... ;-) llvm-svn: 19796
- Jan 23, 2005
Reid Spencer authored
doesn't support certain directives and symbols on cygwin are prefixed with an underscore. This patch makes the necessary adjustments to the output. llvm-svn: 19775
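A hypothetical sketch of the kind of adjustment described here (illustrative only, not the actual X86 AsmPrinter code): on cygwin, C-level symbols carry a leading underscore in the emitted assembly.

    #include <string>

    // e.g. the IR symbol "main" must be printed as "_main" on cygwin.
    std::string printedSymbolName(const std::string &Name, bool IsCygwin) {
      return IsCygwin ? "_" + Name : Name;
    }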
- Jan 21, 2005
Chris Lattner authored
llvm-svn: 19733
Chris Lattner authored
llvm-svn: 19731
Chris Lattner authored
llvm-svn: 19728
- Jan 20, 2005
Chris Lattner authored
fixes most of the remaining llc-beta failures. llvm-svn: 19716
Chris Lattner authored
llvm-svn: 19711
- Jan 19, 2005
Chris Lattner authored
pressure, not decreases register pressure. Fix problem where we accidentally swapped the operands of SHLD, which caused fourinarow to fail. This fixes fourinarow. llvm-svn: 19697
Chris Lattner authored
llvm-svn: 19694
Chris Lattner authored
typically cost 1 cycle) instead of shld/shrd instruction (which are typically 6 or more cycles). This also saves code space. For example, instead of emitting:
rotr:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %CL, BYTE PTR [%ESP + 8]
        shrd %EAX, %EAX, %CL
        ret
rotli:
        mov %EAX, DWORD PTR [%ESP + 4]
        shrd %EAX, %EAX, 27
        ret
Emit:
rotr32:
        mov %CL, BYTE PTR [%ESP + 8]
        mov %EAX, DWORD PTR [%ESP + 4]
        ror %EAX, %CL
        ret
rotli32:
        mov %EAX, DWORD PTR [%ESP + 4]
        ror %EAX, 27
        ret
We also emit byte rotate instructions which do not have a sh[lr]d counterpart at all.
llvm-svn: 19692
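At the source level this is the familiar rotate idiom; a sketch of the 32-bit case the commit's examples use (whether a given compiler version matches exactly this form is not guaranteed):

    #include <cstdint>

    // Rotate right / left by a variable amount; masking the count keeps the
    // shifts well defined when cnt is 0 or >= 32.
    uint32_t rotr32(uint32_t x, uint32_t cnt) {
      cnt &= 31;
      return (x >> cnt) | (x << ((32 - cnt) & 31));
    }

    uint32_t rotl32(uint32_t x, uint32_t cnt) {
      cnt &= 31;
      return (x << cnt) | (x >> ((32 - cnt) & 31));
    }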
Chris Lattner authored
llvm-svn: 19690
Chris Lattner authored
llvm-svn: 19689
Chris Lattner authored
llvm-svn: 19687
Chris Lattner authored
This allows us to generate this:
foo:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        shld %EDX, %EDX, 2
        shl %EAX, 2
        ret
instead of this:
foo:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %ECX, DWORD PTR [%ESP + 8]
        mov %EDX, %EAX
        shrd %EDX, %ECX, 30
        shl %EAX, 2
        ret
Note the magically transmogrifying immediate.
llvm-svn: 19686
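To make the trick explicit, here is a sketch (names are illustrative, not LLVM APIs) of how a 64-bit left shift by a small constant splits across the two 32-bit halves: the high word is exactly what one SHLD computes, and the low word is a plain SHL.

    #include <cstdint>

    struct U64Halves { uint32_t lo, hi; };

    // Left shift of a 64-bit value by amt (1..31) on a 32-bit target:
    //   hi' = (hi << amt) | (lo >> (32 - amt))   -- one shld
    //   lo' =  lo << amt                         -- one shl
    U64Halves shl64ByConst(U64Halves v, unsigned amt) {
      U64Halves r;
      r.hi = (v.hi << amt) | (v.lo >> (32 - amt));
      r.lo = v.lo << amt;
      return r;
    }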
Chris Lattner authored
foo:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        shrd %EAX, %EDX, 2
        sar %EDX, 2
        ret
instead of this:
test1:
        mov %ECX, DWORD PTR [%ESP + 4]
        shr %ECX, 2
        mov %EDX, DWORD PTR [%ESP + 8]
        mov %EAX, %EDX
        shl %EAX, 30
        or %EAX, %ECX
        sar %EDX, 2
        ret
and long << 2 to this:
foo:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %ECX, DWORD PTR [%ESP + 8]
***     mov %EDX, %EAX
        shrd %EDX, %ECX, 30
        shl %EAX, 2
        ret
instead of this:
foo:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %ECX, %EAX
        shr %ECX, 30
        mov %EDX, DWORD PTR [%ESP + 8]
        shl %EDX, 2
        or %EDX, %ECX
        shl %EAX, 2
        ret
The extra copy (marked ***) can be eliminated when I teach the code generator that shrd32rri8 is really commutative.
llvm-svn: 19681
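The companion case, a signed 64-bit right shift by a small constant, pairs one SHRD for the low word with an arithmetic SAR for the high word (again only an illustrative sketch):

    #include <cstdint>

    struct S64Halves { uint32_t lo; int32_t hi; };

    // Arithmetic right shift of a 64-bit value by amt (1..31):
    //   lo' = (lo >> amt) | (hi << (32 - amt))   -- one shrd
    //   hi' =  hi >> amt, sign-propagating        -- one sar
    S64Halves sar64ByConst(S64Halves v, unsigned amt) {
      S64Halves r;
      r.lo = (v.lo >> amt) | (static_cast<uint32_t>(v.hi) << (32 - amt));
      r.hi = v.hi >> amt;
      return r;
    }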
Chris Lattner authored
llvm-svn: 19678
- Jan 18, 2005
Chris Lattner authored
FP_EXTEND from! llvm-svn: 19674
Chris Lattner authored
llvm-svn: 19673
Chris Lattner authored
don't need to even think about F32 in the X86 code anymore. llvm-svn: 19672
Chris Lattner authored
llvm-svn: 19667
Chris Lattner authored
llvm-svn: 19661
Chris Lattner authored
llvm-svn: 19659
Chris Lattner authored
* Insert some really pedantic assertions that will notice when we emit the same loads more than one time, exposing bugs. This turns a miscompilation in bzip2 into a compile-fail. yaay. llvm-svn: 19658
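A hypothetical sketch of what such a pedantic assertion can look like (types and names here are made up, not the actual instruction selector's): remember every load already emitted and abort loudly if one is emitted again.

    #include <cassert>
    #include <set>

    struct LoadNode;  // stand-in for the selector's node type

    class LoadEmissionChecker {
      std::set<const LoadNode *> Emitted;
    public:
      void noteEmitted(const LoadNode *N) {
        bool Inserted = Emitted.insert(N).second;
        assert(Inserted && "same load emitted twice; this duplicates a memory access");
        (void)Inserted;  // silence unused-variable warnings in NDEBUG builds
      }
    };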
Chris Lattner authored
match (X+Y)+(Z << 1), because we match the X+Y first, consuming the index register, then there is no place to put the Z. llvm-svn: 19652
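Illustratively, the address shape in question looks like the following; an x86 address can fold one base register plus one scaled index, so greedily folding X+Y into base+index leaves no slot for Z << 1 (the example is hypothetical, not taken from the test suite):

    #include <cstdint>
    #include <cstring>

    // Loads a 16-bit value from address (X + Y) + (Z << 1).
    uint16_t loadAt(const uint8_t *X, uintptr_t Y, uintptr_t Z) {
      uint16_t V;
      std::memcpy(&V, X + Y + (Z << 1), sizeof V);
      return V;
    }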
Chris Lattner authored
emitted too early. In particular, this fixes Regression/CodeGen/X86/regpressure.ll:regpressure3. This also improves the 2nd basic block in 164.gzip:flush_block, which went from
.LBBflush_block_1:      # loopentry.1.i
        movzx %EAX, WORD PTR [dyn_ltree + 20]
        movzx %ECX, WORD PTR [dyn_ltree + 16]
        mov DWORD PTR [%ESP + 32], %ECX
        movzx %ECX, WORD PTR [dyn_ltree + 12]
        movzx %EDX, WORD PTR [dyn_ltree + 8]
        movzx %EBX, WORD PTR [dyn_ltree + 4]
        mov DWORD PTR [%ESP + 36], %EBX
        movzx %EBX, WORD PTR [dyn_ltree]
        add DWORD PTR [%ESP + 36], %EBX
        add %EDX, DWORD PTR [%ESP + 36]
        add %ECX, %EDX
        add DWORD PTR [%ESP + 32], %ECX
        add %EAX, DWORD PTR [%ESP + 32]
        movzx %ECX, WORD PTR [dyn_ltree + 24]
        add %EAX, %ECX
        mov %ECX, 0
        mov %EDX, %ECX
to
.LBBflush_block_1:      # loopentry.1.i
        movzx %EAX, WORD PTR [dyn_ltree]
        movzx %ECX, WORD PTR [dyn_ltree + 4]
        add %ECX, %EAX
        movzx %EAX, WORD PTR [dyn_ltree + 8]
        add %EAX, %ECX
        movzx %ECX, WORD PTR [dyn_ltree + 12]
        add %ECX, %EAX
        movzx %EAX, WORD PTR [dyn_ltree + 16]
        add %EAX, %ECX
        movzx %ECX, WORD PTR [dyn_ltree + 20]
        add %ECX, %EAX
        movzx %EAX, WORD PTR [dyn_ltree + 24]
        add %ECX, %EAX
        mov %EAX, 0
        mov %EDX, %EAX
... which results in less spilling in the function. This change alone speeds up 164.gzip from 37.23s to 36.24s on apoc. The default isel takes 37.31s.
llvm-svn: 19650
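The source pattern behind that block is roughly a running sum of small counters; a hypothetical sketch follows (not the actual 164.gzip source, and the struct layout is only assumed) of why per-use scheduling helps: each iteration keeps one partial sum and one loaded value live, whereas hoisting all the loads first keeps every loaded value live at once and forces spills on x86's small register file.

    #include <cstdint>

    struct TreeEntry { uint16_t freq, code; };  // assumed 4-byte layout
    extern TreeEntry dyn_ltree[];               // assumed external table

    uint32_t sumFirstSevenFreqs() {
      uint32_t Sum = 0;
      for (int i = 0; i < 7; ++i)
        Sum += dyn_ltree[i].freq;  // one zero-extending load feeding one add
      return Sum;
    }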