- Jan 19, 2005
- Chris Lattner authored
  llvm-svn: 19690
- Chris Lattner authored
  llvm-svn: 19689
- Chris Lattner authored
  llvm-svn: 19687
- Chris Lattner authored
  This allows us to generate this:

    foo:
            mov %EAX, DWORD PTR [%ESP + 4]
            mov %EDX, DWORD PTR [%ESP + 8]
            shld %EDX, %EDX, 2
            shl %EAX, 2
            ret

  instead of this:

    foo:
            mov %EAX, DWORD PTR [%ESP + 4]
            mov %ECX, DWORD PTR [%ESP + 8]
            mov %EDX, %EAX
            shrd %EDX, %ECX, 30
            shl %EAX, 2
            ret

  Note the magically transmogrifying immediate.
  llvm-svn: 19686
- Chris Lattner authored
  Codegen long >> 2 to this:

    foo:
            mov %EAX, DWORD PTR [%ESP + 4]
            mov %EDX, DWORD PTR [%ESP + 8]
            shrd %EAX, %EDX, 2
            sar %EDX, 2
            ret

  instead of this:

    test1:
            mov %ECX, DWORD PTR [%ESP + 4]
            shr %ECX, 2
            mov %EDX, DWORD PTR [%ESP + 8]
            mov %EAX, %EDX
            shl %EAX, 30
            or %EAX, %ECX
            sar %EDX, 2
            ret

  and long << 2 to this:

    foo:
            mov %EAX, DWORD PTR [%ESP + 4]
            mov %ECX, DWORD PTR [%ESP + 8]
        *** mov %EDX, %EAX
            shrd %EDX, %ECX, 30
            shl %EAX, 2
            ret

  instead of this:

    foo:
            mov %EAX, DWORD PTR [%ESP + 4]
            mov %ECX, %EAX
            shr %ECX, 30
            mov %EDX, DWORD PTR [%ESP + 8]
            shl %EDX, 2
            or %EDX, %ECX
            shl %EAX, 2
            ret

  The extra copy (marked ***) can be eliminated when I teach the code
  generator that shrd32rri8 is really commutative.
  llvm-svn: 19681
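  For context, a minimal sketch (hypothetical code, not from the commits) of
  the source-level operations these two changes lower: on i386 a 64-bit value
  spans two 32-bit registers, so one half takes a plain shift and the other a
  shld/shrd that pulls bits across from its sibling.

      /* Sketch only: the C-level shape of the "long >> 2" and "long << 2"
       * cases above. The low and high 32-bit halves live in separate
       * registers; the double-shift instruction moves bits between them. */
      long long shift_left_2(long long x)  { return x << 2; }  /* shl + shld */
      long long shift_right_2(long long x) { return x >> 2; }  /* sar + shrd */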
- Chris Lattner authored
  llvm-svn: 19678
- Jan 18, 2005
- Chris Lattner authored
  FP_EXTEND from!
  llvm-svn: 19674
- Chris Lattner authored
  llvm-svn: 19673
- Chris Lattner authored
  don't need to even think about F32 in the X86 code anymore.
  llvm-svn: 19672
- Chris Lattner authored
  llvm-svn: 19667
- Chris Lattner authored
  llvm-svn: 19661
- Chris Lattner authored
  llvm-svn: 19659
- Chris Lattner authored
  * Insert some really pedantic assertions that will notice when we emit the
    same loads more than one time, exposing bugs. This turns a miscompilation
    in bzip2 into a compile-fail. yaay.
  llvm-svn: 19658
- Chris Lattner authored
  match (X+Y)+(Z << 1), because we match the X+Y first, consuming the index
  register, then there is no place to put the Z.
  llvm-svn: 19652
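  A hypothetical illustration of the matching-order problem (not from the
  commit): an x86 memory operand holds at most base + scale*index + disp, so
  once X+Y claims both register slots the scaled Z term cannot be folded.

      /* The address (x + y) + (z << 1) has three register terms, but one
       * x86 memory operand fits only base + scale*index + disp. If X+Y is
       * matched first and takes both slots, the scaled Z has no place. */
      char load_byte(char *x, long y, long z) {
          return *(x + y + (z << 1));
      }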
- Chris Lattner authored
  emitted too early. In particular, this fixes
  Regression/CodeGen/X86/regpressure.ll:regpressure3.

  This also improves the 2nd basic block in 164.gzip:flush_block, which went
  from

    .LBBflush_block_1:                      # loopentry.1.i
            movzx %EAX, WORD PTR [dyn_ltree + 20]
            movzx %ECX, WORD PTR [dyn_ltree + 16]
            mov DWORD PTR [%ESP + 32], %ECX
            movzx %ECX, WORD PTR [dyn_ltree + 12]
            movzx %EDX, WORD PTR [dyn_ltree + 8]
            movzx %EBX, WORD PTR [dyn_ltree + 4]
            mov DWORD PTR [%ESP + 36], %EBX
            movzx %EBX, WORD PTR [dyn_ltree]
            add DWORD PTR [%ESP + 36], %EBX
            add %EDX, DWORD PTR [%ESP + 36]
            add %ECX, %EDX
            add DWORD PTR [%ESP + 32], %ECX
            add %EAX, DWORD PTR [%ESP + 32]
            movzx %ECX, WORD PTR [dyn_ltree + 24]
            add %EAX, %ECX
            mov %ECX, 0
            mov %EDX, %ECX

  to

    .LBBflush_block_1:                      # loopentry.1.i
            movzx %EAX, WORD PTR [dyn_ltree]
            movzx %ECX, WORD PTR [dyn_ltree + 4]
            add %ECX, %EAX
            movzx %EAX, WORD PTR [dyn_ltree + 8]
            add %EAX, %ECX
            movzx %ECX, WORD PTR [dyn_ltree + 12]
            add %ECX, %EAX
            movzx %EAX, WORD PTR [dyn_ltree + 16]
            add %EAX, %ECX
            movzx %ECX, WORD PTR [dyn_ltree + 20]
            add %ECX, %EAX
            movzx %EAX, WORD PTR [dyn_ltree + 24]
            add %ECX, %EAX
            mov %EAX, 0
            mov %EDX, %EAX

  ... which results in less spilling in the function. This change alone speeds
  up 164.gzip from 37.23s to 36.24s on apoc. The default isel takes 37.31s.
  llvm-svn: 19650
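  For reference, a sketch of the expression shape in that block (hypothetical
  declarations; gzip's real ct_data layout may differ): a straight-line sum of
  16-bit frequency fields, where emitting each movzx right before its add
  keeps only two values live instead of spilling partial sums to the stack.

      /* Seven 4-byte tree entries whose 16-bit first field is loaded with
       * movzx and accumulated, matching the offsets 0..24 above. */
      struct ct_data { unsigned short freq; unsigned short dad; };

      unsigned sum_freqs(const struct ct_data t[7]) {
          return t[0].freq + t[1].freq + t[2].freq + t[3].freq
               + t[4].freq + t[5].freq + t[6].freq;
      }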
- Chris Lattner authored
  llvm-svn: 19649
- Chris Lattner authored
  llvm-svn: 19647
- Jan 17, 2005
- Chris Lattner authored
  llvm-svn: 19645
- Chris Lattner authored
  1. Fold [mem] += (1|-1) into inc [mem] / dec [mem] to save some icache space.
  2. Do not let token factor nodes prevent forming '[mem] op= val' folds.
  llvm-svn: 19643
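  A minimal sketch (hypothetical function names) of the read-modify-write
  patterns item 1 folds:

      /* A +/- 1 update of memory can become a single inc/dec instruction
       * with a memory operand, which encodes shorter than add [mem], 1. */
      void bump(int *counter) { *counter += 1; }   /* -> inc DWORD PTR [mem] */
      void drop(int *counter) { *counter -= 1; }   /* -> dec DWORD PTR [mem] */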
- Chris Lattner authored
  llvm-svn: 19641
- Chris Lattner authored
  operations. The body of the if is less indented but unmodified in this patch.
  llvm-svn: 19638
- Chris Lattner authored
  Codegen this:

    int %foo(int %X) {
            %T = add int %X, 13
            %S = mul int %T, 3
            ret int %S
    }

  as this:

            mov %ECX, DWORD PTR [%ESP + 4]
            lea %EAX, DWORD PTR [%ECX + 2*%ECX + 39]
            ret

  instead of this:

            mov %ECX, DWORD PTR [%ESP + 4]
            mov %EAX, %ECX
            add %EAX, 13
            imul %EAX, %EAX, 3
            ret

  llvm-svn: 19633
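  The single lea works because the multiply distributes over the add; a small
  check of the identity (illustrative C, not from the commit), whose
  right-hand side is exactly the base + scale*index + disp shape of an x86
  address:

      #include <assert.h>

      /* 3*(x+13) = 3x + 39 = x + 2*x + 39, which matches the addressing
       * mode computed by: lea %EAX, [%ECX + 2*%ECX + 39]. */
      int main(void) {
          for (int x = -100000; x <= 100000; ++x)
              assert(3 * (x + 13) == x + 2 * x + 39);
          return 0;
      }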
- Chris Lattner authored
  Do not fold a load into an operation if it will induce a cycle in the DAG.
  Repeat after me: dAg.
  llvm-svn: 19631
- Chris Lattner authored
  The comparison will probably be folded, so this is not ok to do. This fixed
  197.parser.
  llvm-svn: 19624
- Chris Lattner authored
  of the bytereg. This fixes yacr2, 300.twolf and probably others.
  llvm-svn: 19622
- Chris Lattner authored
  If we emit a load because we followed a token chain to get to it, try to
  fold it into its single user if possible.
  llvm-svn: 19620
- Jan 16, 2005
- Chris Lattner authored
  * Remove custom promotion for bool and byte select ops. Legalize now
    promotes them for us.
  * Allow folding ConstantPoolIndexes into EXTLOAD's, useful for float
    immediates.
  * Better declare which operations are not supported.
  * Add some hacky code for TRUNCSTORE to pretend that we have truncstore
    for i16 types. This is useful for testing promotion code because I can
    just remove 16-bit registers altogether and verify that programs work.
  llvm-svn: 19614
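  A brief sketch (hypothetical, illustrative only) of what a truncating store
  is at the source level, the operation being faked for i16 here:

      /* A truncating store: a 32-bit value narrowed to 16 bits at the
       * store. Pretending an i16 truncstore exists lets promotion-testing
       * builds drop 16-bit registers while programs still write short-
       * sized memory. */
      void store_short(short *p, int v) {
          *p = (short)v;   /* i32 value, i16 memory: truncstore */
      }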
- Jan 15, 2005
- Chris Lattner authored
  llvm-svn: 19566
- Jan 14, 2005
- Chris Lattner authored
  llvm-svn: 19561
- Jan 13, 2005
- Chris Lattner authored
  llvm-svn: 19535
- Chris Lattner authored
  llvm-svn: 19532
- Chris Lattner authored
  llvm-svn: 19529
- Chris Lattner authored
  of cases where we accidentally emitted a load folded once and unfolded
  elsewhere.
  llvm-svn: 19522
- Jan 12, 2005
- Chris Lattner authored
  Checking to see if the load has two uses is not equivalent, as the chain
  value may have zero uses.
  llvm-svn: 19518
- Chris Lattner authored
  We now generate this:

            imul %EAX, %EAX, 400
            add %ECX, %EAX
            add %ESI, DWORD PTR [%ECX + 4*%EDX]
            inc %EDX
            cmp %EDX, 100

  instead of this:

            imul %EAX, %EAX, 400
            add %ECX, %EAX
            mov %EAX, %EDX
            shl %EAX, 2
            add %ECX, %EAX
            add %ESI, DWORD PTR [%ECX]
            inc %EDX
            cmp %EDX, 100

  llvm-svn: 19513
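  A sketch of the loop shape this implies (hypothetical source, inferred from
  the imul by 400 and the compare against 100): the shifted induction variable
  now folds into the scaled-index addressing mode.

      /* 400 = 100 * sizeof(int), so row selection costs one imul, and the
       * column access becomes a single load, DWORD PTR [%ECX + 4*%EDX],
       * instead of materializing i << 2 in a separate register. */
      int sum_row(int a[][100], int row) {
          int s = 0;
          for (int i = 0; i < 100; ++i)
              s += a[row][i];
          return s;
      }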
- Chris Lattner authored
  llvm-svn: 19511
- Chris Lattner authored
  This fails for shifts because the constant is always 8 bits.
  llvm-svn: 19508
- Chris Lattner authored
  This fixes FreeBench/pcompress.
  llvm-svn: 19507
- Jeff Cohen authored
  llvm-svn: 19504
- Chris Lattner authored
  to be dynamically initialized. :(
  llvm-svn: 19503