Jan 17, 2005
      Non-volatile loads can be freely reordered against each other. This fixes · 4d9651c7
      Chris Lattner authored
      X86/reg-pressure.ll again, and allows us to do nice things in other cases.
      For example, we now codegen this sort of thing:
      
      int %loadload(int *%X, int* %Y) {
        %Z = load int* %Y
        %V = load int* %X      ;; load between %Z and store
        %Q = add int %Z, 1
        store int %Q, int* %Y
        ret int %V
      }
      
      Into this:
      
      loadload:
              mov %EAX, DWORD PTR [%ESP + 4]
              mov %EAX, DWORD PTR [%EAX]
              mov %ECX, DWORD PTR [%ESP + 8]
              inc DWORD PTR [%ECX]
              ret
      
      where we weren't able to form the 'inc [mem]' before.  This also lets the
      instruction selector emit loads in any order it wants to, which can be good
      for register pressure as well.
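
      As a rough illustration of the rule being relied on here (a minimal C++ sketch,
      not the actual SelectionDAG code; MemOp and canReorder are made-up names for
      this example):

      #include <cassert>

      // Toy record for a memory operation; illustrative only.
      struct MemOp {
        bool IsLoad;      // true for a load, false for a store
        bool IsVolatile;  // volatile operations must keep program order
      };

      // Two memory operations may be emitted in either order when both are
      // non-volatile loads: neither one writes memory, so neither can observe
      // a different value after the swap.
      static bool canReorder(const MemOp &A, const MemOp &B) {
        return A.IsLoad && B.IsLoad && !A.IsVolatile && !B.IsVolatile;
      }

      int main() {
        MemOp LoadFromY{true, false}, LoadFromX{true, false}, StoreToY{false, false};
        assert(canReorder(LoadFromY, LoadFromX));   // the two loads in %loadload
        assert(!canReorder(LoadFromX, StoreToY));   // a load still stays put relative to a store
        return 0;
      }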
      
      llvm-svn: 19644
      Implement a target-independent optimization to codegen arguments only into · e3c2cf48
      Chris Lattner authored
      the basic block that uses them if possible.  This is a big win on X86, as it
      lets us fold the argument loads into instructions and reduce register pressure
      (by not loading all of the arguments in the entry block).
      
      For this (contrived to show the optimization) testcase:
      
      int %argtest(int %A, int %B) {
              %X = sub int 12345, %A
              br label %L
      L:
              %Y = add int %X, %B
              ret int %Y
      }
      
      we used to produce:
      
      argtest:
              mov %ECX, DWORD PTR [%ESP + 4]
              mov %EAX, 12345
              sub %EAX, %ECX
              mov %EDX, DWORD PTR [%ESP + 8]
      .LBBargtest_1:  # L
              add %EAX, %EDX
              ret
      
      
      now we produce:
      
      argtest:
              mov %EAX, 12345
              sub %EAX, DWORD PTR [%ESP + 4]
      .LBBargtest_1:  # L
              add %EAX, DWORD PTR [%ESP + 8]
              ret
      
      This also fixes the FIXME in the code.
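
      As a rough sketch of the placement decision (hypothetical C++ with made-up
      names, not the actual instruction-selection code):

      #include <cassert>
      #include <vector>

      struct BasicBlock {};  // stand-in for a machine basic block

      // Toy view of an incoming argument: the distinct blocks containing its uses.
      struct Argument {
        std::vector<BasicBlock*> UserBlocks;
      };

      // Pick the block in which to materialize (copy/load) the argument.  When all
      // uses sit in a single block, emit the copy there so it can be folded into
      // the using instruction; otherwise fall back to the entry block, which is the
      // old behaviour.  (A real implementation needs extra legality checks, hence
      // the "if possible" above.)
      static BasicBlock *chooseCopyBlock(const Argument &Arg, BasicBlock *Entry) {
        if (Arg.UserBlocks.size() == 1)
          return Arg.UserBlocks.front();
        return Entry;
      }

      int main() {
        BasicBlock Entry, L;
        Argument B{{&L}};                 // like %B above: only used in block L
        Argument Multi{{&Entry, &L}};     // used in more than one block
        assert(chooseCopyBlock(B, &Entry) == &L);        // sink the copy into L
        assert(chooseCopyBlock(Multi, &Entry) == &Entry); // keep it in the entry block
        return 0;
      }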
      
      BTW, this occurs in real code.  The .s file for 164.gzip shrinks from 8623 to
      8608 lines.  The stack frame of huft_build shrinks from 1644 to 1628 bytes,
      inflate_codes from 116 to 108, and inflate_block from 2620 to 2612, due to
      fewer spills.
      
      Take that, alkis. :-)
      
      llvm-svn: 19639
      Refactor code into a new method. · 16f64df9
      Chris Lattner authored
      llvm-svn: 19635