Commits · 6ad040a6bc8a674e4b290c8479000395372864eb · Roger Ferrer / llvm-epi-0.8

May 11, 2006

If the live interval legnth is essentially zero, i.e. in every live range · 6ad040a6

Evan Cheng authored May 10, 2006

the use follows def immediately, it doesn't make sense to spill it and
hope it will be easier to allocate for this LI.

llvm-svn: 28217

6ad040a6

May 10, 2006
- Templatify RegReductionPriorityQueue · 9665ba05
  Evan Cheng authored May 10, 2006
```
llvm-svn: 28212
```
  9665ba05
May 09, 2006
- Fix PR773 · 1a225d23
  Nate Begeman authored May 09, 2006
```
llvm-svn: 28207
```
  1a225d23
- Fix a regression in my patch from last night that broke the llvmgcc4 build on · f801792e
  Chris Lattner authored May 09, 2006
```
ppc

llvm-svn: 28205
```
  f801792e
- Add pseudo dependency to force a def&use operand to be scheduled last (unless · 7d693898
  Evan Cheng authored May 09, 2006
```
the distance between the def and another use is much longer). This is under
option control for now "-sched-lower-defnuse".

llvm-svn: 28201
```
  7d693898
- Debugging info · 2c74848a
  Evan Cheng authored May 09, 2006
```
llvm-svn: 28200
```
  2c74848a
- PR 770 - permit coallescing of registers in subset register classes. · ae450207
  Evan Cheng authored May 09, 2006
```
llvm-svn: 28197
```
  ae450207
- Implement MASM sections correctly, without a "has masm sections flag" and a... · 4ebc6a23
  Chris Lattner authored May 09, 2006
```
Implement MASM sections correctly, without a "has masm sections flag" and a bunch of special case code.

llvm-svn: 28194
```
  4ebc6a23
- Oh yeah, there are two of these now, unify both. · 8c2bfc06
  Chris Lattner authored May 09, 2006
```
llvm-svn: 28192
```
  8c2bfc06
- Setting SwitchToSectionDirective properly in the MASM backend permits a bunch · 6341df80
  Chris Lattner authored May 09, 2006
```
of code to be unified.

llvm-svn: 28191
```
  6341df80
- · d36cc2b6
  Chris Lattner authored May 09, 2006
```
Don't prefix section directives with a tab.  Doing so causes blank lines to
be emitted to the .s file.

llvm-svn: 28189
```
  d36cc2b6
- Make the masm codepath work like the normal code path. · e64f764d
  Chris Lattner authored May 09, 2006
```
llvm-svn: 28188
```
  e64f764d
- The MASM asmprinter has been fixed, these hacks are no longer needed. · c0f0dfa5
  Chris Lattner authored May 09, 2006
```
llvm-svn: 28186
```
  c0f0dfa5
- Split SwitchSection into SwitchTo{Text|Data}Section methods. · 8488ba2e
  Chris Lattner authored May 09, 2006
```
llvm-svn: 28184
```
  8488ba2e
May 08, 2006

Make the case I just checked in stronger. Now we compile this: · 446e1ef2

Chris Lattner authored May 08, 2006

short test2(short X, short x) {
  int Y = (short)(X+x);
  return Y >> 1;
}

to:

_test2:
        add r2, r3, r4
        extsh r2, r2
        srawi r3, r2, 1
        blr

instead of:

_test2:
        add r2, r3, r4
        extsh r2, r2
        srwi r2, r2, 1
        extsh r3, r2
        blr

llvm-svn: 28175

446e1ef2

Implement and_sext.ll:test3, generating: · 29062da0

Chris Lattner authored May 08, 2006

_test4:
        srawi r3, r3, 16
        blr

instead of:

_test4:
        srwi r2, r3, 16
        extsh r3, r2
        blr

for:

short test4(unsigned X) {
  return (X >> 16);
}

llvm-svn: 28174

29062da0

Compile this: · 2935d819

Chris Lattner authored May 08, 2006

short test4(unsigned X) {
  return (X >> 16);
}

to:

_test4:
        movl 4(%esp), %eax
        sarl $16, %eax
        ret

instead of:

_test4:
        movl $-65536, %eax
        andl 4(%esp), %eax
        sarl $16, %eax
        ret

llvm-svn: 28171

2935d819

Fold shifts with undef operands. · 78da6792
Chris Lattner authored May 08, 2006
```
llvm-svn: 28167
```
78da6792
Make emission of jump tables a bit less conservative; they are now required · d7a19102
Nate Begeman authored May 08, 2006
```
to be only 31.25% dense, rather than 75% dense.

llvm-svn: 28165
```
d7a19102
Fix PR772 · e5ce5bb6
Nate Begeman authored May 08, 2006
```
llvm-svn: 28161
```
e5ce5bb6

May 07, 2006
- Simplify some code, add a couple minor missed folds · 7e7bcf3a
  Chris Lattner authored May 06, 2006
```
llvm-svn: 28152
```
  7e7bcf3a
- constant fold sign_extend_inreg · 751817c5
  Chris Lattner authored May 06, 2006
```
llvm-svn: 28151
```
  751817c5
- remove cases handled elsewhere · 2a4d7b84
  Chris Lattner authored May 06, 2006
```
llvm-svn: 28150
```
  2a4d7b84
May 06, 2006

Fix some loose ends in MASM support. · ce9b9fe6
Jeff Cohen authored May 06, 2006
```
llvm-svn: 28148
```
ce9b9fe6

Use the new TargetLowering::ComputeNumSignBits method to eliminate · 1ecb2a2d

Chris Lattner authored May 06, 2006

sign_extend_inreg operations.  Though ComputeNumSignBits is still rudimentary,
this is enough to compile this:

short test(short X, short x) {
  int Y = X+x;
  return (Y >> 1);
}
short test2(short X, short x) {
  int Y = (short)(X+x);
  return Y >> 1;
}

into:

_test:
        add r2, r3, r4
        srawi r3, r2, 1
        blr
_test2:
        add r2, r3, r4
        extsh r2, r2
        srawi r3, r2, 1
        blr

instead of:

_test:
        add r2, r3, r4
        srawi r2, r2, 1
        extsh r3, r2
        blr
_test2:
        add r2, r3, r4
        extsh r2, r2
        srawi r2, r2, 1
        extsh r3, r2
        blr

llvm-svn: 28146

1ecb2a2d

When inserting casts, be careful of where we put them. We cannot insert · 21cd9902

Chris Lattner authored May 06, 2006

a cast immediately before a PHI node.

This fixes Regression/CodeGen/Generic/2006-05-06-GEP-Cast-Sink-Crash.ll

llvm-svn: 28143

21cd9902

Fold trunc(any_ext). This gives stuff like: · 907e392d

Chris Lattner authored May 05, 2006

27,28c27
<       movzwl %di, %edi
<       movl %edi, %ebx
---
>       movw %di, %bx

llvm-svn: 28137

907e392d

Shrink shifts when possible. · 57f8c5a3
Chris Lattner authored May 05, 2006
```
llvm-svn: 28136
```
57f8c5a3

May 05, 2006

Indent multiline asm strings more nicely · a633c313
Chris Lattner authored May 05, 2006
```
llvm-svn: 28132
```
a633c313
Fold (fpext (load x)) -> (extload x) · 3d265773
Chris Lattner authored May 05, 2006
```
llvm-svn: 28130
```
3d265773

More aggressively sink GEP offsets into loops. For example, before we · 3e3f2c63

Chris Lattner authored May 05, 2006

generated:

        movl 8(%esp), %eax
        movl %eax, %edx
        addl $4316, %edx
        cmpb $1, %cl
        ja LBB1_2       #cond_false
LBB1_1: #cond_true
        movl L_QuantizationTables720$non_lazy_ptr, %ecx
        movl %ecx, (%edx)
        movl L_QNOtoQuantTableShift720$non_lazy_ptr, %edx
        movl %edx, 4460(%eax)
        ret
...

Now we generate:

        movl 8(%esp), %eax
        cmpb $1, %cl
        ja LBB1_2       #cond_false
LBB1_1: #cond_true
        movl L_QuantizationTables720$non_lazy_ptr, %ecx
        movl %ecx, 4316(%eax)
        movl L_QNOtoQuantTableShift720$non_lazy_ptr, %ecx
        movl %ecx, 4460(%eax)
        ret

... which uses one fewer register.

llvm-svn: 28129

3e3f2c63

Fold some common code. · 25a5283a
Chris Lattner authored May 05, 2006
```
llvm-svn: 28124
```
25a5283a

Implement: · 002ee914

Chris Lattner authored May 05, 2006

  // fold (and (sext x), (sext y)) -> (sext (and x, y))
  // fold (or  (sext x), (sext y)) -> (sext (or  x, y))
  // fold (xor (sext x), (sext y)) -> (sext (xor x, y))
  // fold (and (aext x), (aext y)) -> (aext (and x, y))
  // fold (or  (aext x), (aext y)) -> (aext (or  x, y))
  // fold (xor (aext x), (aext y)) -> (aext (xor x, y))

llvm-svn: 28123

002ee914

Pull and through and/or/xor. This compiles some bitfield code to: · 5ac42936

Chris Lattner authored May 05, 2006

        mov EAX, DWORD PTR [ESP + 4]
        mov ECX, DWORD PTR [EAX]
        mov EDX, ECX
        add EDX, EDX
        or EDX, ECX
        and EDX, -2147483648
        and ECX, 2147483647
        or EDX, ECX
        mov DWORD PTR [EAX], EDX
        ret

instead of:

        sub ESP, 4
        mov DWORD PTR [ESP], ESI
        mov EAX, DWORD PTR [ESP + 8]
        mov ECX, DWORD PTR [EAX]
        mov EDX, ECX
        add EDX, EDX
        mov ESI, ECX
        and ESI, -2147483648
        and EDX, -2147483648
        or EDX, ESI
        and ECX, 2147483647
        or EDX, ECX
        mov DWORD PTR [EAX], EDX
        mov ESI, DWORD PTR [ESP]
        add ESP, 4
        ret

llvm-svn: 28122

5ac42936

Implement a variety of simplifications for ANY_EXTEND. · 812646aa
Chris Lattner authored May 05, 2006
```
llvm-svn: 28121
```
812646aa

Factor some code, add these transformations: · 8d6fc201

Chris Lattner authored May 05, 2006

  // fold (and (trunc x), (trunc y)) -> (trunc (and x, y))
  // fold (or  (trunc x), (trunc y)) -> (trunc (or  x, y))
  // fold (xor (trunc x), (trunc y)) -> (trunc (xor x, y))

llvm-svn: 28120

8d6fc201

Fix VC++ compilation error. · 78a7f0e0
Jeff Cohen authored May 05, 2006
```
llvm-svn: 28117
```
78a7f0e0

Sink noop copies into the basic block that uses them. This reduces the number · 7a3ecf79

Chris Lattner authored May 05, 2006

of cross-block live ranges, and allows the bb-at-a-time selector to always
coallesce these away, at isel time.

This reduces the load on the coallescer and register allocator. For example
on a codec on X86, we went from:

1643 asm-printer - Number of machine instrs printed
419 liveintervals - Number of loads/stores folded into instructions
1144 liveintervals - Number of identity moves eliminated after coalescing
1022 liveintervals - Number of interval joins performed
282 liveintervals - Number of intervals after coalescing
1304 liveintervals - Number of original intervals
86 regalloc - Number of times we had to backtrack
1.90232 regalloc - Ratio of intervals processed over total intervals
40 spiller - Number of values reused
182 spiller - Number of loads added
121 spiller - Number of stores added
132 spiller - Number of register spills
6 twoaddressinstruction - Number of instructions commuted to coalesce
360 twoaddressinstruction - Number of two-address instructions

to:

1636 asm-printer - Number of machine instrs printed
403 liveintervals - Number of loads/stores folded into instructions
1155 liveintervals - Number of identity moves eliminated after coalescing
1033 liveintervals - Number of interval joins performed
279 liveintervals - Number of intervals after coalescing
1312 liveintervals - Number of original intervals
76 regalloc - Number of times we had to backtrack
1.88998 regalloc - Ratio of intervals processed over total intervals
1 spiller - Number of copies elided
41 spiller - Number of values reused
191 spiller - Number of loads added
114 spiller - Number of stores added
128 spiller - Number of register spills
4 twoaddressinstruction - Number of instructions commuted to coalesce
356 twoaddressinstruction - Number of two-address instructions

On this testcase, this change provides a modest reduction in spill code,
regalloc iterations, and total instructions emitted. It increases the number
of register coallesces.

llvm-svn: 28115

7a3ecf79

May 04, 2006

Final pass of minor cleanups for MachineInstr · abdf4d56
Chris Lattner authored May 04, 2006
```
llvm-svn: 28110
```
abdf4d56

Initial support for register pressure aware scheduling. The register reduction · 9add8805

Evan Cheng authored May 04, 2006

scheduler can go into a "vertical mode" (i.e. traversing up the two-address
chain, etc.) when the register pressure is low.
This does seem to reduce the number of spills in the cases I've looked at. But
with x86, it's no guarantee the performance of the code improves.
It can be turned on with -sched-vertically option.

llvm-svn: 28108

9add8805