- Apr 06, 2004
Chris Lattner authored
llvm-svn: 12710
Chris Lattner authored
Enable folding of long seteq/setne comparisons into branches and select instructions. Implement unfolded long relational comparisons against constants a bit more efficiently.

Folding comparisons changes code that looks like this:

        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        mov %ECX, %EAX
        or %ECX, %EDX
        sete %CL
        test %CL, %CL
        je .LBB2 # PC rel: F

into code that looks like this:

        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        mov %ECX, %EAX
        or %ECX, %EDX
        jne .LBB2 # PC rel: F

This speeds up 186.crafty by 6% with llc-ls.

llvm-svn: 12702
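For context, a minimal C-level sketch of the pattern this folding targets (the function names are illustrative, not from the commit): a 64-bit equality test whose only use is a branch.

        /* The two 32-bit halves are OR'd together; when the compare's
           only user is the branch, the sete/test/je sequence folds
           into a single conditional jump on the OR's flags. */
        void test(long long x, void (*f)(void)) {
            if (x != 0)     /* or %lo, %hi ; jne */
                f();
        }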
Chris Lattner authored
Comparing a long against zero got us this:

        sub %ESP, 8
        mov DWORD PTR [%ESP + 4], %ESI
        mov DWORD PTR [%ESP], %EDI
        mov %EAX, DWORD PTR [%ESP + 12]
        mov %EDX, DWORD PTR [%ESP + 16]
        mov %ECX, 0
        mov %ESI, 0
        mov %EDI, %EAX
        xor %EDI, %ECX
        mov %ECX, %EDX
        xor %ECX, %ESI
        or %EDI, %ECX
        sete %CL
        test %CL, %CL
        je .LBB2 # PC rel: F

Now it gets us this:

        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        mov %ECX, %EAX
        or %ECX, %EDX
        sete %CL
        test %CL, %CL
        je .LBB2 # PC rel: F

llvm-svn: 12696
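The source being compiled here boils down to a 64-bit compare against zero; a hypothetical C equivalent:

        /* x == 0 for a 64-bit value: both 32-bit halves must be zero,
           which is one OR plus a flag test */
        int is_zero(long long x) {
            return x == 0;   /* or %lo, %hi ; sete */
        }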
Chris Lattner authored
For example, multiplying X*(1 + (1LL << 32)) now produces:

test:
        mov %ECX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        mov %EAX, %ECX
        add %EDX, %ECX
        ret

[[[Note to Alkis: why isn't linear scan generating this code?? This might be a problem with your intervals being too conservative:

test:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        add %EDX, %EAX
        ret

end note]]]

Whereas GCC produces this:

T:
        sub %esp, 12
        mov %edx, DWORD PTR [%esp+16]
        mov DWORD PTR [%esp+8], %edi
        mov %ecx, DWORD PTR [%esp+20]
        xor %edi, %edi
        mov DWORD PTR [%esp], %ebx
        mov %ebx, %edi
        mov %eax, %edx
        mov DWORD PTR [%esp+4], %esi
        add %ebx, %edx
        mov %edi, DWORD PTR [%esp+8]
        lea %edx, [%ecx+%ebx]
        mov %esi, DWORD PTR [%esp+4]
        mov %ebx, DWORD PTR [%esp]
        add %esp, 12
        ret

I'm not sure what GCC is smoking here, but it looks like it has just confused itself with a bunch of stack slots or something. The Intel compiler is better, but still not good:

T:
        movl 4(%esp), %edx       #2.11
        movl 8(%esp), %eax       #2.11
        lea (%eax,%edx), %ecx    #3.12
        movl $1, %eax            #3.12
        mull %edx                #3.12
        addl %ecx, %edx          #3.12
        ret                      #3.12

llvm-svn: 12693
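The word-level arithmetic behind the three-instruction sequence, as a sketch in C (32-bit unsigned words assumed; the function name is illustrative):

        /* X * (1 + 2^32) == X + (X << 32) mod 2^64, so in words:
             result.lo = x.lo
             result.hi = x.hi + x.lo     (the shifted-in low word) */
        unsigned long long mul_magic(unsigned long long x) {
            unsigned lo = (unsigned)x, hi = (unsigned)(x >> 32);
            return ((unsigned long long)(hi + lo) << 32) | lo;
        }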
Chris Lattner authored
For this testcase:

        long %test(long %X) {
                %Y = mul long %X, 123
                ret long %Y
        }

we used to generate:

test:
        sub %ESP, 12
        mov DWORD PTR [%ESP + 8], %ESI
        mov DWORD PTR [%ESP + 4], %EDI
        mov DWORD PTR [%ESP], %EBX
        mov %ECX, DWORD PTR [%ESP + 16]
        mov %ESI, DWORD PTR [%ESP + 20]
        mov %EDI, 123
        mov %EBX, 0
        mov %EAX, %ECX
        mul %EDI
        imul %ESI, %EDI
        add %ESI, %EDX
        imul %ECX, %EBX
        add %ESI, %ECX
        mov %EDX, %ESI
        mov %EBX, DWORD PTR [%ESP]
        mov %EDI, DWORD PTR [%ESP + 4]
        mov %ESI, DWORD PTR [%ESP + 8]
        add %ESP, 12
        ret

Now we emit:

test:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %ECX, DWORD PTR [%ESP + 8]
        mov %EDX, 123
        mul %EDX
        imul %ECX, %ECX, 123
        add %ECX, %EDX
        mov %EDX, %ECX
        ret

Which, incidentally, is substantially nicer than what GCC manages:

T:
        sub %esp, 8
        mov %eax, 123
        mov DWORD PTR [%esp], %ebx
        mov %ebx, DWORD PTR [%esp+16]
        mov DWORD PTR [%esp+4], %esi
        mov %esi, DWORD PTR [%esp+12]
        imul %ecx, %ebx, 123
        mov %ebx, DWORD PTR [%esp]
        mul %esi
        mov %esi, DWORD PTR [%esp+4]
        add %esp, 8
        lea %edx, [%ecx+%edx]
        ret

llvm-svn: 12692
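The decomposition that lets a single widening multiply suffice, sketched in C (hypothetical helper name, 32-bit words assumed):

        /* X * 123 mod 2^64, with X = lo + hi*2^32:
             p         = lo * 123            (widening mul -> EDX:EAX)
             result.lo = low32(p)
             result.hi = high32(p) + hi*123  (imul + add)             */
        unsigned long long mul123(unsigned long long x) {
            unsigned lo = (unsigned)x, hi = (unsigned)(x >> 32);
            unsigned long long p = (unsigned long long)lo * 123u;
            unsigned rhi = (unsigned)(p >> 32) + hi * 123u;
            return ((unsigned long long)rhi << 32) | (unsigned)p;
        }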
Chris Lattner authored
On this testcase:

        long %test(long %X) {
                %Y = shr long %X, ubyte 32
                ret long %Y
        }

instead of:

t:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EAX, DWORD PTR [%ESP + 8]
        sar %EAX, 0
        mov %EDX, 0
        ret

we now emit:

test:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EAX, DWORD PTR [%ESP + 8]
        mov %EDX, 0
        ret

llvm-svn: 12688
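At the word level, a 64-bit shift right by exactly 32 needs no shift instruction at all; a sketch with an explicit word pair (illustrative struct, not LLVM's representation):

        struct i64 { unsigned lo, hi; };
        /* shift right by exactly 32: the high word simply becomes the
           low word, and the new high word is the fill value (zero
           here, matching the emitted code) */
        struct i64 shr_by_32(struct i64 x) {
            struct i64 r;
            r.lo = x.hi;   /* mov %EAX, <high word> */
            r.hi = 0;      /* mov %EDX, 0            */
            return r;
        }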
Chris Lattner authored
llvm-svn: 12687
Chris Lattner authored
For example, on this instruction:

        call void %test(long 1234)

Instead of this:

        mov %EAX, 1234
        mov %ECX, 0
        mov DWORD PTR [%ESP], %EAX
        mov DWORD PTR [%ESP + 4], %ECX
        call test

We now emit this:

        mov DWORD PTR [%ESP], 1234
        mov DWORD PTR [%ESP + 4], 0
        call test

llvm-svn: 12686
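The corresponding source is just a call that passes a 64-bit constant; a hypothetical C equivalent:

        void test(long long);
        /* both 32-bit halves of the constant are stored straight to
           the outgoing argument slots as immediates, using no
           registers at all */
        void caller(void) {
            test(1234LL);   /* mov [%ESP], 1234 ; mov [%ESP + 4], 0 */
        }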
Chris Lattner authored
Avoid emitting an AND for a word of the result when the corresponding word of the constant is all zeros. For example:

        Y = and long X, 1234

now generates:

        Yl = and Xl, 1234
        Yh = 0

instead of:

        Yl = and Xl, 1234
        Yh = and Xh, 0

llvm-svn: 12685
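A word-level sketch of the same fold in C (illustrative struct; LLVM performs this on the two halves of the long):

        struct i64 { unsigned lo, hi; };
        /* the constant's high word is all zeros, so the high half of
           the result is known to be zero and no AND is emitted for it */
        struct i64 and_1234(struct i64 x) {
            struct i64 r;
            r.lo = x.lo & 1234u;   /* Yl = and Xl, 1234 */
            r.hi = 0;              /* Yh = 0            */
            return r;
        }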
Chris Lattner authored
llvm-svn: 12684
Chris Lattner authored
This allows us to handle code like 'add long %X, 123456789012' more efficiently. llvm-svn: 12683
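The lowering pattern in question, sketched in C with explicit 32-bit words (hypothetical names; x86 implements this as add followed by adc):

        struct i64 { unsigned lo, hi; };
        /* 64-bit add of a constant, one 32-bit word at a time:
           add the low words, then add the high words plus the carry */
        struct i64 add_big_const(struct i64 x) {
            const unsigned clo = (unsigned)123456789012ULL;         /* low word  */
            const unsigned chi = (unsigned)(123456789012ULL >> 32); /* high word */
            struct i64 r;
            r.lo = x.lo + clo;                   /* add */
            r.hi = x.hi + chi + (r.lo < x.lo);   /* adc: carry from low word */
            return r;
        }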
Chris Lattner authored
llvm-svn: 12682
Chris Lattner authored
For this testcase:

        long %test(long %X) {
                %Y = sub long 0, %X
                ret long %Y
        }

We used to generate:

test:
        sub %ESP, 4
        mov DWORD PTR [%ESP], %ESI
        mov %ECX, DWORD PTR [%ESP + 8]
        mov %ESI, DWORD PTR [%ESP + 12]
        mov %EAX, 0
        mov %EDX, 0
        sub %EAX, %ECX
        sbb %EDX, %ESI
        mov %ESI, DWORD PTR [%ESP]
        add %ESP, 4
        ret

Now we generate:

test:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        neg %EAX
        adc %EDX, 0
        neg %EDX
        ret

llvm-svn: 12681
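Why neg / adc / neg is a correct two-word negation, as a word-level identity sketched in C (illustrative struct):

        struct i64 { unsigned lo, hi; };
        /* two's-complement negation in 32-bit words:
             r.lo = -x.lo
             r.hi = -(x.hi + (x.lo != 0))
           neg sets the carry flag to (x.lo != 0), adc adds it in, and
           the second neg finishes the high word */
        struct i64 neg64(struct i64 x) {
            struct i64 r;
            r.lo = 0u - x.lo;                  /* neg %EAX */
            r.hi = 0u - (x.hi + (x.lo != 0));  /* adc %EDX, 0 ; neg %EDX */
            return r;
        }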
Chris Lattner authored
llvm-svn: 12680
Chris Lattner authored
* In promote32, if we can just promote a constant value, do so instead of promoting a constant dynamically.
* In visitReturnInst, actually USE the promote32 argument that takes a Value*

The end result of this is that we now generate this:

test:
        mov %EAX, 0
        ret

instead of...

test:
        mov %AX, 0
        movzx %EAX, %AX
        ret

for:

        ushort %test() {
                ret ushort 0
        }

llvm-svn: 12679
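A C analogue of the constant case (illustrative only): when the value being widened is a known constant, the promotion happens at compile time and no runtime extension is needed.

        /* returning a 16-bit constant as 32 bits: the widened constant
           is known statically, so a single 'mov %EAX, 0' suffices and
           the mov %AX / movzx pair disappears */
        unsigned test(void) {
            unsigned short k = 0;
            return k;
        }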
- Apr 05, 2004
Chris Lattner authored
llvm-svn: 12674
Chris Lattner authored
llvm-svn: 12673
Chris Lattner authored
llvm-svn: 12670
Misha Brukman authored
llvm-svn: 12669
Chris Lattner authored
llvm-svn: 12668
Chris Lattner authored
llvm-svn: 12659
Chris Lattner authored
llvm-svn: 12658
Chris Lattner authored
Support getelementptr instructions which use uint's to index into structure types and can have arbitrary 32- and 64-bit integer types indexing into sequential types. llvm-svn: 12653
Chris Lattner authored
Support getelementptr instructions which use uint's to index into structure types and can have arbitrary 32- and 64-bit integer types indexing into sequential types. Auto-upgrade .ll files that use ubytes to index into structures to use uint's. llvm-svn: 12652
Chris Lattner authored
Allow uint's to index into structure types and allow arbitrary 32- and 64-bit integer types to index into sequential types. llvm-svn: 12651
Chris Lattner authored
llvm-svn: 12648
- Apr 04, 2004
Chris Lattner authored
The V9 backend cannot handle getelementptr instructions which have non-long indices for sequential types. In order to avoid trying to figure out how the v9 backend works, we'll just hack it in the preselection pass. llvm-svn: 12647
Chris Lattner authored
llvm-svn: 12646
Chris Lattner authored
llvm-svn: 12644
Chris Lattner authored
Remove support for the prerelease format for LLVM bytecode files. Now we are only compatible with LLVM 1.0+. llvm-svn: 12643
- Apr 03, 2004
Chris Lattner authored
llvm-svn: 12641
Chris Lattner authored
llvm-svn: 12639
- Apr 02, 2004
Brian Gaeke authored
llvm-svn: 12638
Chris Lattner authored
llvm-svn: 12633
Brian Gaeke authored
The methods for eliminating call-frame pseudo instrs and frame indices are still stubs. Flesh out the emitPrologue method based on better ABI knowledge. llvm-svn: 12632
Brian Gaeke authored
Fix up comments. llvm-svn: 12631
Brian Gaeke authored
the CALL instruction). llvm-svn: 12630
Brian Gaeke authored
llvm-svn: 12629
Chris Lattner authored
llvm-svn: 12623
Chris Lattner authored
This also implements some new features for the indvars pass, including linear function test replacement and exit value substitution, and it works with a much more general class of induction variables and loops. llvm-svn: 12620
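To illustrate one of the features named here (hypothetical C, not from the commit): exit value substitution replaces post-loop uses of an induction variable with a closed-form expression, so the loop result need not be carried out of the loop.

        /* the value of i after the loop has the closed form
           max(n, 0), so indvars can rewrite the final use */
        int count_up(int n) {
            int i = 0;
            while (i < n)
                i = i + 1;
            return i;   /* can become: return n > 0 ? n : 0; */
        }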