- Jan 19, 2005
- Chris Lattner authored
  llvm-svn: 19690
- Chris Lattner authored
  llvm-svn: 19689
- Chris Lattner authored
  llvm-svn: 19687
- Chris Lattner authored
  This allows us to generate this:

      foo:
          mov %EAX, DWORD PTR [%ESP + 4]
          mov %EDX, DWORD PTR [%ESP + 8]
          shld %EDX, %EDX, 2
          shl %EAX, 2
          ret

  instead of this:

      foo:
          mov %EAX, DWORD PTR [%ESP + 4]
          mov %ECX, DWORD PTR [%ESP + 8]
          mov %EDX, %EAX
          shrd %EDX, %ECX, 30
          shl %EAX, 2
          ret

  Note the magically transmogrifying immediate.
  llvm-svn: 19686
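
  For context, the sequence above is a 64-bit shift on 32-bit x86, where the value
  lives in the %EDX:%EAX register pair: shld shifts the high word while pulling in
  bits from its second operand, shl handles the low word, and the new code uses the
  shift amount (2) directly instead of the 32-2=30 of the old shrd form (the
  "transmogrifying immediate"). A hypothetical C source that produces this pattern,
  purely for illustration:

      /* Illustrative only: a constant left shift of a 64-bit value on
       * 32-bit x86 splits into a shld on the high half plus a shl on the
       * low half, as in the commit message above. */
      long long foo(long long x) {
          return x << 2;
      }
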
- Chris Lattner authored
  Add default impl of commuteInstruction. Add notes about ugly V9 code.
  llvm-svn: 19684
- Chris Lattner authored
      foo:
          mov %EAX, DWORD PTR [%ESP + 4]
          mov %EDX, DWORD PTR [%ESP + 8]
          shrd %EAX, %EDX, 2
          sar %EDX, 2
          ret

  instead of this:

      test1:
          mov %ECX, DWORD PTR [%ESP + 4]
          shr %ECX, 2
          mov %EDX, DWORD PTR [%ESP + 8]
          mov %EAX, %EDX
          shl %EAX, 30
          or %EAX, %ECX
          sar %EDX, 2
          ret

  and long << 2 to this:

      foo:
          mov %EAX, DWORD PTR [%ESP + 4]
          mov %ECX, DWORD PTR [%ESP + 8]
      *** mov %EDX, %EAX
          shrd %EDX, %ECX, 30
          shl %EAX, 2
          ret

  instead of this:

      foo:
          mov %EAX, DWORD PTR [%ESP + 4]
          mov %ECX, %EAX
          shr %ECX, 30
          mov %EDX, DWORD PTR [%ESP + 8]
          shl %EDX, 2
          or %EDX, %ECX
          shl %EAX, 2
          ret

  The extra copy (marked ***) can be eliminated when I teach the code generator
  that shrd32rri8 is really commutative.
  llvm-svn: 19681
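
  The right-shift case works the same way in mirror image: shrd shifts the low
  word right, pulling in bits from the high word, and sar then shifts the high
  word arithmetically so the sign is preserved. A hypothetical source function
  for this case (illustrative, not taken from the commit):

      /* Illustrative only: a constant arithmetic right shift of a 64-bit
       * value on 32-bit x86 becomes shrd on the low half plus sar on the
       * (signed) high half. */
      long long foo(long long x) {
          return x >> 2;
      }
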
- Chris Lattner authored
  llvm-svn: 19678
- Chris Lattner authored
  range. Either they are undefined (the default), they mask the shift amount
  to the size of the register (X86, Alpha, etc), or they extend the shift
  (PPC). This defaults to undefined, which is conservatively correct.
  llvm-svn: 19677
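
  Shifting by at least the register width is exactly the case this hook describes,
  and it is already undefined behavior at the C level; the hook records what the
  hardware does anyway. A small illustrative function (the name and values are
  made up, not from the commit):

      /* Illustrative only: for n >= 32 this is undefined in C. On x86 the
       * hardware masks the count to 5 bits (n = 33 acts like n = 1); on
       * PowerPC a 32-bit shift by 33 shifts everything out and yields 0.
       * The target hook above lets instruction selection know which of
       * these behaviors it may rely on. */
      unsigned shift_by(unsigned x, unsigned n) {
          return x << n;
      }
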
- Jan 18, 2005
- Chris Lattner authored
  FP_EXTEND from!
  llvm-svn: 19674
- Chris Lattner authored
  llvm-svn: 19673
- Chris Lattner authored
  don't need to even think about F32 in the X86 code anymore.
  llvm-svn: 19672
- Chris Lattner authored
  llvm-svn: 19667
- Chris Lattner authored
  llvm-svn: 19661
- Tanya Lattner authored
  llvm-svn: 19660
- Chris Lattner authored
  llvm-svn: 19659
- Chris Lattner authored
  * Insert some really pedantic assertions that will notice when we emit the
    same loads more than one time, exposing bugs. This turns a miscompilation
    in bzip2 into a compile-fail. yaay.
  llvm-svn: 19658
- Chris Lattner authored
  match (X+Y)+(Z << 1), because we match the X+Y first, consuming the index
  register, then there is no place to put the Z.
  llvm-svn: 19652
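
  The fold in question is about x86 addressing arithmetic, where lea can encode
  base + index*scale + displacement in one instruction, so Z << 1 wants the
  scaled-index slot; if X + Y is matched first into base and index, that slot is
  gone. A hypothetical expression with this shape (illustrative only):

      /* Illustrative only: x + y + (z << 1) can be computed as
       *     lea eax, [x + 2*z]
       *     add eax, y
       * but if the matcher commits x + y to the base and index registers
       * first, the scaled z term no longer fits the addressing mode. */
      int addr_shape(int x, int y, int z) {
          return x + y + (z << 1);
      }
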
- Chris Lattner authored
  emitted too early. In particular, this fixes
  Regression/CodeGen/X86/regpressure.ll:regpressure3. This also improves the
  2nd basic block in 164.gzip:flush_block, which went from

      .LBBflush_block_1:  # loopentry.1.i
          movzx %EAX, WORD PTR [dyn_ltree + 20]
          movzx %ECX, WORD PTR [dyn_ltree + 16]
          mov DWORD PTR [%ESP + 32], %ECX
          movzx %ECX, WORD PTR [dyn_ltree + 12]
          movzx %EDX, WORD PTR [dyn_ltree + 8]
          movzx %EBX, WORD PTR [dyn_ltree + 4]
          mov DWORD PTR [%ESP + 36], %EBX
          movzx %EBX, WORD PTR [dyn_ltree]
          add DWORD PTR [%ESP + 36], %EBX
          add %EDX, DWORD PTR [%ESP + 36]
          add %ECX, %EDX
          add DWORD PTR [%ESP + 32], %ECX
          add %EAX, DWORD PTR [%ESP + 32]
          movzx %ECX, WORD PTR [dyn_ltree + 24]
          add %EAX, %ECX
          mov %ECX, 0
          mov %EDX, %ECX

  to

      .LBBflush_block_1:  # loopentry.1.i
          movzx %EAX, WORD PTR [dyn_ltree]
          movzx %ECX, WORD PTR [dyn_ltree + 4]
          add %ECX, %EAX
          movzx %EAX, WORD PTR [dyn_ltree + 8]
          add %EAX, %ECX
          movzx %ECX, WORD PTR [dyn_ltree + 12]
          add %ECX, %EAX
          movzx %EAX, WORD PTR [dyn_ltree + 16]
          add %EAX, %ECX
          movzx %ECX, WORD PTR [dyn_ltree + 20]
          add %ECX, %EAX
          movzx %EAX, WORD PTR [dyn_ltree + 24]
          add %ECX, %EAX
          mov %EAX, 0
          mov %EDX, %EAX

  ... which results in less spilling in the function. This change alone speeds
  up 164.gzip from 37.23s to 36.24s on apoc. The default isel takes 37.31s.
  llvm-svn: 19650
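
  The underlying computation is a running sum of small fields loaded from memory;
  the win comes from emitting each load right before the add that consumes it, so
  only the partial sum stays live instead of every loaded value. A rough stand-in
  for the kind of source involved (not the actual 164.gzip code):

      /* Illustrative only: a straight-line sum of small values. Interleaving
       * load/add pairs, as in the "after" assembly above, needs far fewer
       * simultaneously live registers than issuing all the loads first and
       * all the adds afterwards. */
      unsigned sum_fields(const unsigned short *t) {
          unsigned s = t[0];
          s += t[1];
          s += t[2];
          s += t[3];
          s += t[4];
          s += t[5];
          s += t[6];
          return s;
      }
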
- Chris Lattner authored
  llvm-svn: 19649
- Chris Lattner authored
  llvm-svn: 19647
- Jan 17, 2005
- Chris Lattner authored
  llvm-svn: 19645
- Chris Lattner authored
  1. Fold [mem] += (1|-1) into inc [mem]/dec [mem] to save some icache space.
  2. Do not let token factor nodes prevent forming '[mem] op= val' folds.
  llvm-svn: 19643
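
  Point 1 is a read-modify-write of a memory location by one: a hypothetical
  source pattern that can be selected as a single inc (or dec) on the memory
  operand, which has a shorter encoding than add [mem], 1:

      /* Illustrative only: with the fold above, the += 1 on memory can be
       * selected as "inc DWORD PTR [mem]" rather than a load/add/store
       * sequence or "add DWORD PTR [mem], 1". */
      void bump(int *counter) {
          *counter += 1;
      }
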
- Chris Lattner authored
  llvm-svn: 19641
- Chris Lattner authored
  operations. The body of the if is less indented but unmodified in this patch.
  llvm-svn: 19638
- Chris Lattner authored
      int %foo(int %X) {
          %T = add int %X, 13
          %S = mul int %T, 3
          ret int %S
      }

  as this:

          mov %ECX, DWORD PTR [%ESP + 4]
          lea %EAX, DWORD PTR [%ECX + 2*%ECX + 39]
          ret

  instead of this:

          mov %ECX, DWORD PTR [%ESP + 4]
          mov %EAX, %ECX
          add %EAX, 13
          imul %EAX, %EAX, 3
          ret

  llvm-svn: 19633
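
  The lea works because of a simple identity: (x + 13) * 3 = 3x + 39 =
  x + 2*x + 39, which fits x86's base + index*scale + displacement form in a
  single instruction. The IR above corresponds to C source along these lines:

      /* C equivalent of the LLVM IR in the commit: the multiply-by-3 of an
       * add-with-constant folds into one lea, [%ECX + 2*%ECX + 39], instead
       * of a separate add and imul. */
      int foo(int x) {
          return (x + 13) * 3;
      }
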
- Tanya Lattner authored
  llvm-svn: 19632
- Chris Lattner authored
  Do not fold a load into an operation if it will induce a cycle in the DAG.
  Repeat after me: dAg.
  llvm-svn: 19631
- Chris Lattner authored
  The comparison will probably be folded, so this is not ok to do. This fixed
  197.parser.
  llvm-svn: 19624
- Chris Lattner authored
  of the bytereg. This fixes yacr2, 300.twolf and probably others.
  llvm-svn: 19622
- Chris Lattner authored
  If we emit a load because we followed a token chain to get to it, try to
  fold it into its single user if possible.
  llvm-svn: 19620
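
  The general shape of the fold (leaving aside the token-chain detail, which is
  internal to the selector) is a load whose only consumer is one arithmetic op,
  so the op can take a memory operand directly. A hypothetical example:

      /* Illustrative only: when the load of *p has a single user, the add
       * can read memory directly, e.g. "add %EAX, DWORD PTR [...]", instead
       * of loading into a register first. */
      int add_from_mem(const int *p, int x) {
          return x + *p;
      }
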
- Chris Lattner authored
  llvm-svn: 19619
- Jan 16, 2005
- Tanya Lattner authored
  llvm-svn: 19615
- Chris Lattner authored
  * Remove custom promotion for bool and byte select ops. Legalize now
    promotes them for us.
  * Allow folding ConstantPoolIndexes into EXTLOAD's, useful for float
    immediates.
  * Declare which operations are not supported better.
  * Add some hacky code for TRUNCSTORE to pretend that we have truncstore
    for i16 types. This is useful for testing promotion code because I can
    just remove 16-bit registers all together and verify that programs work.
  llvm-svn: 19614
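
  A truncating store, as in the last bullet, is simply a wider value stored
  through a narrower pointer; pretending i16 truncstores exist lets 16-bit
  registers be removed while such code still compiles. A minimal illustration
  (not from the commit):

      /* Illustrative only: storing an i32 value through an i16 pointer is a
       * truncating store; the TRUNCSTORE hack above keeps this legal even
       * when 16-bit registers are disabled for testing promotion. */
      void store16(short *p, int v) {
          *p = (short)v;
      }
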
- Chris Lattner authored
  llvm-svn: 19610
- Chris Lattner authored
  llvm-svn: 19604
- Reid Spencer authored
  llvm-svn: 19592
- Tanya Lattner authored
  llvm-svn: 19587
- Chris Lattner authored
  llvm-svn: 19586
- Chris Lattner authored
  llvm-svn: 19584
- Chris Lattner authored
  llvm-svn: 19581