Commits · 5201004ef90c8db352c0690ff8d62af9f2a92b73 · Roger Ferrer / llvm-epi-0.8

Apr 07, 2004
- Don't include InstrSelectionSupport.h. · 69ee7e13
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12766
```
  69ee7e13
- Move ChooseRegOrImmed() prototype here, from InstrSelectionSupport.h. · c1256649
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12765
```
  c1256649
- Don't include InstrSelectionSupport.h. · 5c801183
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12764
```
  5c801183
- Fix insertion of SelectInsts. · 8931345f
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12760
```
  8931345f
- Don't print [%reg + 0], just print [%reg] · 85521d70
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12759
```
  85521d70
- First version of code to handle loads. Stub function for handling stores. · 6d62df54
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12758
```
  6d62df54
- Support loading arguments from %I0...%I5 into virtual registers in · 989c04ab
  Brian Gaeke authored Apr 07, 2004
```
function prologues, and fix an off-by-one in visitCallInst that was
putting call args into the wrong registers.

llvm-svn: 12757
```
  989c04ab
- It's setting up the call args right now, but on the callee side, it's · 7985e56c
  Brian Gaeke authored Apr 07, 2004
```
trying to get incoming args off the stack, instead of the %i0...%i5 regs,
which is wrong.

llvm-svn: 12756
```
  7985e56c
- This is a start on handling setcc instructions. As the comment notes, we · bd58b3fb
  Chris Lattner authored Apr 07, 2004
```
have no good way of handling this until the code generator is improved.
We should probably just emit V9 instructions in the meantime.

llvm-svn: 12745
```
  bd58b3fb
- Add subcc instruction, which is used to create the 'cmp' pseudo instruction · bb22d5a5
  Chris Lattner authored Apr 07, 2004
```
llvm-svn: 12744
```
  bb22d5a5
- Avoid emitting an extra copy on each 32-bit operation · f6245bc8
  Chris Lattner authored Apr 07, 2004
```
llvm-svn: 12743
```
  f6245bc8
- Make generation of stack-slot loads and copies less ugly. · 4aac8143
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12742
```
  4aac8143
- Fix bug in printing loads. · 3675c308
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12741
```
  3675c308
- Add support for shift instructions, wrap some long lines · 42ffd2e3
  Chris Lattner authored Apr 07, 2004
```
llvm-svn: 12740
```
  42ffd2e3
- Fix encoding of existing shift instructions, add rr shifts · 8406cf30
  Chris Lattner authored Apr 07, 2004
```
llvm-svn: 12739
```
  8406cf30
- Add a bunch more instructions · fcdf82a1
  Chris Lattner authored Apr 07, 2004
```
llvm-svn: 12737
```
  fcdf82a1
- Merge my changes with Brian's · fd8212ef
  Chris Lattner authored Apr 07, 2004
```
llvm-svn: 12736
```
  fd8212ef
- Add in some things I forgot, which Chris helpfully reminded me of... · 37f92b53
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12735
```
  37f92b53
- Add support for the "Y" register, used by MUL & DIV. · 32242318
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12734
```
  32242318
- Add UDIV, SDIV, and a few variants of WR. · 5524d54c
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12733
```
  5524d54c
- Preliminary support for getting 64-bit integer constants into registers. · cfbfb8ac
  Brian Gaeke authored Apr 07, 2004
```
Preliminary support for division. It's gross because you have to initialize
the "Y" register, which is the top 32 bits of the thing you're dividing.

llvm-svn: 12732
```
  cfbfb8ac
- Prune unnecessary #includes · 589bf05b
  Brian Gaeke authored Apr 06, 2004
```
llvm-svn: 12731
```
  589bf05b
- Simple delay slot filler pass. · b3deed9f
  Brian Gaeke authored Apr 06, 2004
```
llvm-svn: 12730
```
  b3deed9f
- Add references to delay slot filler pass. · 610c685e
  Brian Gaeke authored Apr 06, 2004
```
Fill in addPassesToJITCompile method.

llvm-svn: 12729
```
  610c685e
- First attempt at handling frame index elimination. · 4bd246ae
  Brian Gaeke authored Apr 06, 2004
```
llvm-svn: 12728
```
  4bd246ae
- First attempt at special-casing printing of [%reg + offset] for · 3915ad7c
  Brian Gaeke authored Apr 06, 2004
```
ld/st instructions - doesn't seem to work yet, but I think it's
just a typo or something somewhere.

llvm-svn: 12727
```
  3915ad7c
- Delete reference to "the Mach-O Runtime ABI". · 5e624b82
  Brian Gaeke authored Apr 06, 2004
```
llvm-svn: 12726
```
  5e624b82
- Deal with call return values. · 2e91a3d6
  Brian Gaeke authored Apr 06, 2004
```
Don't put NOPs in delay slots at all. We'll have a fix-up pass later.

llvm-svn: 12725
```
  2e91a3d6
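The division commit above (cfbfb8ac) notes that the "Y" register holds the top 32 bits of the value being divided. A minimal C sketch of that arithmetic follows, assuming the SPARC V8 behaviour in which the divide instruction takes a 64-bit dividend formed from Y and a source register. The function name `udiv_with_y` is made up for illustration, and overflow handling is left out.

```c
#include <stdint.h>

/* Hypothetical model of an unsigned 32-bit divide that reads the Y register.
 * y   : contents of Y, the top 32 bits of the dividend
 * rs1 : low 32 bits of the dividend
 * rs2 : divisor, must be nonzero
 * Overflow of the 32-bit quotient is not modelled; the result is truncated. */
static uint32_t udiv_with_y(uint32_t y, uint32_t rs1, uint32_t rs2)
{
    uint64_t dividend = ((uint64_t)y << 32) | rs1;
    return (uint32_t)(dividend / rs2);
}
```

For a plain 32-bit unsigned divide, Y would be cleared first, so `udiv_with_y(0, 100, 7)` yields 14. Stale data left in Y changes the quotient, which is why the register has to be initialized before every divide.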
Apr 06, 2004

· b8955205

Jakub Staszak authored Apr 06, 2004

file based off InstSelectSimple.cpp, slowly being replaced by generated code from the really simple X86 instruction selector tablegen backend

llvm-svn: 12715

b8955205

· de647007

Jakub Staszak authored Apr 06, 2004

TableGen files for really simple instruction selector

llvm-svn: 12714

de647007

Fix PR313: [x86] JIT miscompiles unsigned short to floating point · 4b936125
Chris Lattner authored Apr 06, 2004
```
llvm-svn: 12711
```
4b936125
Fix incorrect encoding of some ADC and SBB instructions · ba33ae58
Chris Lattner authored Apr 06, 2004
```
llvm-svn: 12710
```
ba33ae58

Fix a minor bug in previous checking · 19c8b13e

Chris Lattner authored Apr 06, 2004

Enable folding of long seteq/setne comparisons into branches and select instructions
Implement unfolded long relational comparisons against constants a bit more efficiently

Folding comparisons changes code that looks like this:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        mov %ECX, %EAX
        or %ECX, %EDX
        sete %CL
        test %CL, %CL
        je .LBB2 # PC rel: F

into code that looks like this:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        mov %ECX, %EAX
        or %ECX, %EDX
        jne .LBB2 # PC rel: F

This speeds up 186.crafty by 6% with llc-ls.

llvm-svn: 12702

19c8b13e

Improve codegen of long == and != comparisons against constants. Before, · f2ee88eb

Chris Lattner authored Apr 06, 2004

comparing a long against zero got us this:

        sub %ESP, 8
        mov DWORD PTR [%ESP + 4], %ESI
        mov DWORD PTR [%ESP], %EDI
        mov %EAX, DWORD PTR [%ESP + 12]
        mov %EDX, DWORD PTR [%ESP + 16]
        mov %ECX, 0
        mov %ESI, 0
        mov %EDI, %EAX
        xor %EDI, %ECX
        mov %ECX, %EDX
        xor %ECX, %ESI
        or %EDI, %ECX
        sete %CL
        test %CL, %CL
        je .LBB2 # PC rel: F

Now it gets us this:

        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        mov %ECX, %EAX
        or %ECX, %EDX
        sete %CL
        test %CL, %CL
        je .LBB2 # PC rel: F

llvm-svn: 12696

f2ee88eb
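The two listings in this commit correspond to a simple identity on the two 32-bit halves of a long. The C sketch below is only an illustration of that identity, not the code generator's implementation; the function names are hypothetical.

```c
#include <stdbool.h>
#include <stdint.h>

/* Equality of a 64-bit value (given as two 32-bit halves) with a constant:
 * XOR each half with the matching constant half, OR the results, and test
 * for zero. This mirrors the xor/xor/or sequence in the "before" listing. */
static bool eq_const64(uint32_t lo, uint32_t hi, uint64_t c)
{
    uint32_t clo = (uint32_t)c, chi = (uint32_t)(c >> 32);
    return ((lo ^ clo) | (hi ^ chi)) == 0;
}

/* Specialisation for c == 0: XOR with zero is a no-op, so only the OR
 * remains, as in the "after" listing. */
static bool eq_zero64(uint32_t lo, uint32_t hi)
{
    return (lo | hi) == 0;
}
```

Dropping the XORs also removes the two registers that held the zero constant, which is consistent with the shorter listing no longer saving and restoring %ESI and %EDI.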

Handle various other important cases of multiplying a long constant immediate. For · 6c3bf13f

Chris Lattner authored Apr 06, 2004

example, multiplying X*(1 + (1LL << 32)) now produces:

test:
        mov %ECX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        mov %EAX, %ECX
        add %EDX, %ECX
        ret

[[[Note to Alkis: why isn't linear scan generating this code??  This might be a
 problem with your intervals being too conservative:

test:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        add %EDX, %EAX
        ret

end note]]]

Whereas GCC produces this:

T:
        sub     %esp, 12
        mov     %edx, DWORD PTR [%esp+16]
        mov     DWORD PTR [%esp+8], %edi
        mov     %ecx, DWORD PTR [%esp+20]
        xor     %edi, %edi
        mov     DWORD PTR [%esp], %ebx
        mov     %ebx, %edi
        mov     %eax, %edx
        mov     DWORD PTR [%esp+4], %esi
        add     %ebx, %edx
        mov     %edi, DWORD PTR [%esp+8]
        lea     %edx, [%ecx+%ebx]
        mov     %esi, DWORD PTR [%esp+4]
        mov     %ebx, DWORD PTR [%esp]
        add     %esp, 12
        ret

I'm not sure exactly what GCC is smoking here, but it looks like it has just
confused itself with a bunch of stack slots or something.  The Intel compiler
is better, but still not good:

T:
        movl      4(%esp), %edx                                 #2.11
        movl      8(%esp), %eax                                 #2.11
        lea       (%eax,%edx), %ecx                             #3.12
        movl      $1, %eax                                      #3.12
        mull      %edx                                          #3.12
        addl      %ecx, %edx                                    #3.12
        ret                                                     #3.12

llvm-svn: 12693

6c3bf13f
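The short sequence at the top of this commit works because multiplying by 1 + 2^32 leaves the low word unchanged and adds the low word into the high word. A minimal C sketch of that arithmetic, with a hypothetical function name, assuming wraparound modulo 2^64:

```c
#include <stdint.h>

/* x * (1 + (1 << 32)) computed on 32-bit halves.
 * x * 2^32 contributes only to the high word, so no carry can arise
 * from the low word. */
static uint64_t mul_by_one_plus_2_32(uint64_t x)
{
    uint32_t xlo = (uint32_t)x, xhi = (uint32_t)(x >> 32);
    uint32_t lo = xlo;          /* mov %EAX, %ECX */
    uint32_t hi = xhi + xlo;    /* add %EDX, %ECX (wraps modulo 2^32) */
    return ((uint64_t)hi << 32) | lo;
}
```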

Efficiently handle a long multiplication by a constant. For this testcase: · 1f6024cb

Chris Lattner authored Apr 06, 2004

long %test(long %X) {
        %Y = mul long %X, 123
        ret long %Y
}

we used to generate:

test:
        sub %ESP, 12
        mov DWORD PTR [%ESP + 8], %ESI
        mov DWORD PTR [%ESP + 4], %EDI
        mov DWORD PTR [%ESP], %EBX
        mov %ECX, DWORD PTR [%ESP + 16]
        mov %ESI, DWORD PTR [%ESP + 20]
        mov %EDI, 123
        mov %EBX, 0
        mov %EAX, %ECX
        mul %EDI
        imul %ESI, %EDI
        add %ESI, %EDX
        imul %ECX, %EBX
        add %ESI, %ECX
        mov %EDX, %ESI
        mov %EBX, DWORD PTR [%ESP]
        mov %EDI, DWORD PTR [%ESP + 4]
        mov %ESI, DWORD PTR [%ESP + 8]
        add %ESP, 12
        ret

Now we emit:
test:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %ECX, DWORD PTR [%ESP + 8]
        mov %EDX, 123
        mul %EDX
        imul %ECX, %ECX, 123
        add %ECX, %EDX
        mov %EDX, %ECX
        ret

Which, incidentally, is substantially nicer than what GCC manages:
T:
        sub     %esp, 8
        mov     %eax, 123
        mov     DWORD PTR [%esp], %ebx
        mov     %ebx, DWORD PTR [%esp+16]
        mov     DWORD PTR [%esp+4], %esi
        mov     %esi, DWORD PTR [%esp+12]
        imul    %ecx, %ebx, 123
        mov     %ebx, DWORD PTR [%esp]
        mul     %esi
        mov     %esi, DWORD PTR [%esp+4]
        add     %esp, 8
        lea     %edx, [%ecx+%edx]
        ret

llvm-svn: 12692

1f6024cb
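The new sequence needs only one widening multiply because the constant fits in 32 bits. The C sketch below shows the same decomposition under that assumption; `mul_by_const32` is a hypothetical name, and the comments map each step to the emitted instructions.

```c
#include <stdint.h>

/* 64-bit multiply by a 32-bit constant, done on 32-bit halves.
 * Result is taken modulo 2^64, as for the IR mul instruction. */
static uint64_t mul_by_const32(uint64_t x, uint32_t c)
{
    uint32_t xlo = (uint32_t)x, xhi = (uint32_t)(x >> 32);

    uint64_t wide = (uint64_t)xlo * c;             /* mul:  EDX:EAX = xlo * c */
    uint32_t lo = (uint32_t)wide;                  /* low word of the result  */
    uint32_t carry = (uint32_t)(wide >> 32);       /* high half left in EDX   */

    uint32_t hi = xhi * c + carry;                 /* imul, then add; wraps   */
    return ((uint64_t)hi << 32) | lo;
}
```

The older listing also multiplied the low word by the constant's high word, which is always zero here, so that multiply and its add contribute nothing and can be dropped.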

Improve code generation of long shifts by 32. · 2448baea

Chris Lattner authored Apr 06, 2004

On this testcase:

long %test(long %X) {
        %Y = shr long %X, ubyte 32
        ret long %Y
}

instead of:
t:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EAX, DWORD PTR [%ESP + 8]
        sar %EAX, 0
        mov %EDX, 0
        ret


we now emit:
test:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EAX, DWORD PTR [%ESP + 8]
        mov %EDX, 0
        ret

llvm-svn: 12688

2448baea

Bugfixes: inc/dec don't set the carry flag! · 7332d4c5
Chris Lattner authored Apr 06, 2004
```
llvm-svn: 12687
```
7332d4c5

Improve code for passing constant longs as arguments to function calls. · decce5bc

Chris Lattner authored Apr 06, 2004

For example, on this instruction:

        call void %test(long 1234)

Instead of this:
        mov %EAX, 1234
        mov %ECX, 0
        mov DWORD PTR [%ESP], %EAX
        mov DWORD PTR [%ESP + 4], %ECX
        call test

We now emit this:
        mov DWORD PTR [%ESP], 1234
        mov DWORD PTR [%ESP + 4], 0
        call test

llvm-svn: 12686

decce5bc

Emit more efficient 64-bit operations when the RHS is a constant, and one · 5fc6f77b

Chris Lattner authored Apr 06, 2004

of the words of the constant is zero.  For example:
  Y = and long X, 1234

now generates:
  Yl = and Xl, 1234
  Yh = 0

instead of:
  Yl = and Xl, 1234
  Yh = and Xh, 0

llvm-svn: 12685

5fc6f77b
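The AND example in this commit can be written out on explicit 32-bit halves. This is a rough C sketch that models only the AND case described in the message; the type and function names are assumptions for illustration.

```c
#include <stdint.h>

/* A 64-bit value held as two 32-bit words, as on a 32-bit target. */
typedef struct { uint32_t lo, hi; } word_pair;

static word_pair split64(uint64_t v)
{
    word_pair p = { (uint32_t)v, (uint32_t)(v >> 32) };
    return p;
}

static uint64_t join64(word_pair p)
{
    return ((uint64_t)p.hi << 32) | p.lo;
}

/* AND with a constant: when one word of the constant is zero, the matching
 * result word is just zero and no AND needs to be issued for it. */
static word_pair and_const64(word_pair x, uint64_t c)
{
    word_pair k = split64(c), y;
    y.lo = k.lo ? (x.lo & k.lo) : 0;   /* Yl = and Xl, 1234 */
    y.hi = k.hi ? (x.hi & k.hi) : 0;   /* Yh = 0 when the high word is zero */
    return y;
}
```

The result is the same as a full 64-bit AND; the saving is that the zero word never has to read the corresponding half of X.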