Commits · 10db062d4121785bd2d4d0999211767a146df1a8 · Roger Ferrer / llvm-epi-0.8

Apr 08, 2004

Added the llvm.readport and llvm.writeport intrinsics for x86. These do · 10db062d

John Criswell authored Apr 08, 2004

I/O port instructions on x86.  The specific code sequence is tailored to
the parameters and return value of the intrinsic call.
Added the ability for implicit definitions to be printed in the Instruction
Printer.
Added the ability for RawFrm instructions to print implicit uses and
definitions with correct comma output.  This required adjustment to some
methods so that a leading comma would or would not be printed.

llvm-svn: 12782

10db062d

Apr 07, 2004
- Don't include InstrSelectionSupport.h. · 69ee7e13
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12766
```
  69ee7e13
- Move ChooseRegOrImmed() prototype here, from InstrSelectionSupport.h. · c1256649
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12765
```
  c1256649
- Don't include InstrSelectionSupport.h. · 5c801183
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12764
```
  5c801183
- Fix insertion of SelectInsts. · 8931345f
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12760
```
  8931345f
- Don't print [%reg + 0], just print [%reg] · 85521d70
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12759
```
  85521d70
- First version of code to handle loads. Stub function for handling stores. · 6d62df54
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12758
```
  6d62df54
- Support loading arguments from %I0...%I5 into virtual registers in · 989c04ab
  Brian Gaeke authored Apr 07, 2004
```
function prologues, and fix an off-by-one in visitCallInst that was
putting call args into the wrong registers.

llvm-svn: 12757
```
  989c04ab
- It's setting up the call args right now, but on the callee side, it's · 7985e56c
  Brian Gaeke authored Apr 07, 2004
```
trying to get incoming args off the stack, instead of the %i0...%i6 regs,
which is wrong.

llvm-svn: 12756
```
  7985e56c
- This is a start on handling setcc instructions. As the comment notes, we · bd58b3fb
  Chris Lattner authored Apr 07, 2004
```
have no good way of handling this until the code generator is improved.
We should probably just emit V9 instructions in the meantime.

llvm-svn: 12745
```
  bd58b3fb
- Add the subcc instruction, which is used to create the 'cmp' pseudo instruction · bb22d5a5
  Chris Lattner authored Apr 07, 2004
```
llvm-svn: 12744
```
  bb22d5a5
- Avoid emitting an extra copy on each 32-bit operation · f6245bc8
  Chris Lattner authored Apr 07, 2004
```
llvm-svn: 12743
```
  f6245bc8
- Make generation of stack-slot loads and copies less ugly. · 4aac8143
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12742
```
  4aac8143
- Fix bug in printing loads. · 3675c308
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12741
```
  3675c308
- Add support for shift instructions, wrap some long lines · 42ffd2e3
  Chris Lattner authored Apr 07, 2004
```
llvm-svn: 12740
```
  42ffd2e3
- Fix encoding of existing shift instructions, add rr shifts · 8406cf30
  Chris Lattner authored Apr 07, 2004
```
llvm-svn: 12739
```
  8406cf30
- Add a bunch more instructions · fcdf82a1
  Chris Lattner authored Apr 07, 2004
```
llvm-svn: 12737
```
  fcdf82a1
- Merge my changes with Brian's · fd8212ef
  Chris Lattner authored Apr 07, 2004
```
llvm-svn: 12736
```
  fd8212ef
- Add in some things I forgot, which Chris helpfully reminded me of... · 37f92b53
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12735
```
  37f92b53
- Add support for the "Y" register, used by MUL & DIV. · 32242318
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12734
```
  32242318
- Add UDIV, SDIV, and a few variants of WR. · 5524d54c
  Brian Gaeke authored Apr 07, 2004
```
llvm-svn: 12733
```
  5524d54c
- Preliminary support for getting 64-bit integer constants into registers. · cfbfb8ac
  Brian Gaeke authored Apr 07, 2004
```
Preliminary support for division. It's gross because you have to initialize
the "Y" register, which is the top 32 bits of the thing you're dividing.

llvm-svn: 12732
```
  cfbfb8ac
- Prune unnecessary #includes · 589bf05b
  Brian Gaeke authored Apr 06, 2004
```
llvm-svn: 12731
```
  589bf05b
- Simple delay slot filler pass. · b3deed9f
  Brian Gaeke authored Apr 06, 2004
```
llvm-svn: 12730
```
  b3deed9f
- Add references to delay slot filler pass. · 610c685e
  Brian Gaeke authored Apr 06, 2004
```
Fill in addPassesToJITCompile method.

llvm-svn: 12729
```
  610c685e
- First attempt at handling frame index elimination. · 4bd246ae
  Brian Gaeke authored Apr 06, 2004
```
llvm-svn: 12728
```
  4bd246ae
- First attempt at special-casing printing of [%reg + offset] for · 3915ad7c
  Brian Gaeke authored Apr 06, 2004
```
ld/st instructions - doesn't seem to work yet, but I think it's
just a typo or something somewhere.

llvm-svn: 12727
```
  3915ad7c
- Delete reference to "the Mach-O Runtime ABI". · 5e624b82
  Brian Gaeke authored Apr 06, 2004
```
llvm-svn: 12726
```
  5e624b82
- Deal with call return values. · 2e91a3d6
  Brian Gaeke authored Apr 06, 2004
```
Don't put NOPs in delay slots at all. We'll have a fix-up pass later.

llvm-svn: 12725
```
  2e91a3d6
Apr 06, 2004

· b8955205

Jakub Staszak authored Apr 06, 2004

file based off InstSelectSimple.cpp, slowly being replaced by generated code from the really simple X86 instruction selector tablegen backend

llvm-svn: 12715

b8955205

· de647007

Jakub Staszak authored Apr 06, 2004

TableGen files for really simple instruction selector

llvm-svn: 12714

de647007

Fix PR313: [x86] JIT miscompiles unsigned short to floating point · 4b936125
Chris Lattner authored Apr 06, 2004
```
llvm-svn: 12711
```
4b936125
Fix incorrect encoding of some ADC and SBB instructions · ba33ae58
Chris Lattner authored Apr 06, 2004
```
llvm-svn: 12710
```
ba33ae58

Fix a minor bug in previous checking · 19c8b13e

Chris Lattner authored Apr 06, 2004

Enable folding of long seteq/setne comparisons into branches and select instructions
Implement unfolded long relational comparisons against constants a bit more efficiently

Folding comparisons changes code that looks like this:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        mov %ECX, %EAX
        or %ECX, %EDX
        sete %CL
        test %CL, %CL
        je .LBB2 # PC rel: F

into code that looks like this:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        mov %ECX, %EAX
        or %ECX, %EDX
        jne .LBB2 # PC rel: F

This speeds up 186.crafty by 6% with llc-ls.

llvm-svn: 12702

19c8b13e
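
The folding described in 19c8b13e above can be checked with a small model. This is only a sketch: the function names are made up for illustration, and the flag handling is simplified to the zero flag, which is all the two sequences depend on. It shows that `sete` / `test` / `je` and a single `jne` take the branch in exactly the same cases.

```python
MASK32 = 0xFFFFFFFF

def branch_taken_unfolded(lo, hi):
    """Model of: or %ECX, %EDX ; sete %CL ; test %CL, %CL ; je target."""
    ecx = (lo | hi) & MASK32
    cl = 1 if ecx == 0 else 0   # sete copies ZF from the `or` into CL
    zf = (cl == 0)              # test %CL, %CL sets ZF when CL is zero
    return zf                   # je jumps when ZF is set

def branch_taken_folded(lo, hi):
    """Model of: or %ECX, %EDX ; jne target."""
    ecx = (lo | hi) & MASK32
    zf = (ecx == 0)             # ZF comes straight from the `or`
    return not zf               # jne jumps when ZF is clear
```

In both models the branch is taken exactly when the 64-bit value is non-zero, so dropping the `sete` and `test` does not change behaviour.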

Improve codegen of long == and != comparisons against constants. Before, · f2ee88eb

Chris Lattner authored Apr 06, 2004

comparing a long against zero got us this:

        sub %ESP, 8
        mov DWORD PTR [%ESP + 4], %ESI
        mov DWORD PTR [%ESP], %EDI
        mov %EAX, DWORD PTR [%ESP + 12]
        mov %EDX, DWORD PTR [%ESP + 16]
        mov %ECX, 0
        mov %ESI, 0
        mov %EDI, %EAX
        xor %EDI, %ECX
        mov %ECX, %EDX
        xor %ECX, %ESI
        or %EDI, %ECX
        sete %CL
        test %CL, %CL
        je .LBB2 # PC rel: F

Now it gets us this:

        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        mov %ECX, %EAX
        or %ECX, %EDX
        sete %CL
        test %CL, %CL
        je .LBB2 # PC rel: F

llvm-svn: 12696

f2ee88eb
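
The before/after sequences in f2ee88eb above correspond to two ways of testing a 64-bit value held in two 32-bit halves. The sketch below models both in Python; the helper names are hypothetical, and the code mirrors only the arithmetic, not register allocation or spilling.

```python
MASK32 = 0xFFFFFFFF

def split64(x):
    """Return the (low, high) 32-bit halves of a 64-bit value."""
    return x & MASK32, (x >> 32) & MASK32

def eq_const_general(x, c):
    """Old shape: xor each half with the constant's half, then or the results."""
    lo, hi = split64(x)
    clo, chi = split64(c)
    return ((lo ^ clo) | (hi ^ chi)) == 0

def eq_zero(x):
    """New shape for a zero constant: the xors are no-ops, so just or the halves."""
    lo, hi = split64(x)
    return (lo | hi) == 0
```

When the constant is zero both xors leave their operand unchanged, which is why the two `mov reg, 0` and two `xor` instructions can be dropped.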

Handle various other important cases of multiplying a long constant immediate. For · 6c3bf13f

Chris Lattner authored Apr 06, 2004

example, multiplying X*(1 + (1LL << 32)) now produces:

test:
        mov %ECX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        mov %EAX, %ECX
        add %EDX, %ECX
        ret

[[[Note to Alkis: why isn't linear scan generating this code??  This might be a
 problem with your intervals being too conservative:

test:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EDX, DWORD PTR [%ESP + 8]
        add %EDX, %EAX
        ret

end note]]]

Whereas GCC produces this:

T:
        sub     %esp, 12
        mov     %edx, DWORD PTR [%esp+16]
        mov     DWORD PTR [%esp+8], %edi
        mov     %ecx, DWORD PTR [%esp+20]
        xor     %edi, %edi
        mov     DWORD PTR [%esp], %ebx
        mov     %ebx, %edi
        mov     %eax, %edx
        mov     DWORD PTR [%esp+4], %esi
        add     %ebx, %edx
        mov     %edi, DWORD PTR [%esp+8]
        lea     %edx, [%ecx+%ebx]
        mov     %esi, DWORD PTR [%esp+4]
        mov     %ebx, DWORD PTR [%esp]
        add     %esp, 12
        ret

I'm not sure exactly what GCC is smoking here, but it looks like it has just
confused itself with a bunch of stack slots or something.  The Intel compiler
is better, but still not good:

T:
        movl      4(%esp), %edx                                 #2.11
        movl      8(%esp), %eax                                 #2.11
        lea       (%eax,%edx), %ecx                             #3.12
        movl      $1, %eax                                      #3.12
        mull      %edx                                          #3.12
        addl      %ecx, %edx                                    #3.12
        ret                                                     #3.12

llvm-svn: 12693

6c3bf13f
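
As a sanity check on the sequence in 6c3bf13f above, here is a minimal Python model of multiplying by `1 + (1LL << 32)` on 32-bit halves. The function name is made up for illustration; the comments map each step to the instruction shown in the commit message.

```python
MASK32 = 0xFFFFFFFF
MASK64 = 0xFFFFFFFFFFFFFFFF

def mul_by_one_plus_2_to_32(lo, hi):
    """Return (low, high) halves of X * (1 + 2**32), modulo 2**64."""
    new_lo = lo                     # mov %EAX, %ECX
    new_hi = (hi + lo) & MASK32     # add %EDX, %ECX
    return new_lo, new_hi
```

This works because X * (1 + 2**32) = X + (X << 32): the low half is unchanged, and the high half gains the old low half, with any overflow past bit 63 discarded.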

Efficiently handle a long multiplication by a constant. For this testcase: · 1f6024cb

Chris Lattner authored Apr 06, 2004

long %test(long %X) {
        %Y = mul long %X, 123
        ret long %Y
}

we used to generate:

test:
        sub %ESP, 12
        mov DWORD PTR [%ESP + 8], %ESI
        mov DWORD PTR [%ESP + 4], %EDI
        mov DWORD PTR [%ESP], %EBX
        mov %ECX, DWORD PTR [%ESP + 16]
        mov %ESI, DWORD PTR [%ESP + 20]
        mov %EDI, 123
        mov %EBX, 0
        mov %EAX, %ECX
        mul %EDI
        imul %ESI, %EDI
        add %ESI, %EDX
        imul %ECX, %EBX
        add %ESI, %ECX
        mov %EDX, %ESI
        mov %EBX, DWORD PTR [%ESP]
        mov %EDI, DWORD PTR [%ESP + 4]
        mov %ESI, DWORD PTR [%ESP + 8]
        add %ESP, 12
        ret

Now we emit:
test:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %ECX, DWORD PTR [%ESP + 8]
        mov %EDX, 123
        mul %EDX
        imul %ECX, %ECX, 123
        add %ECX, %EDX
        mov %EDX, %ECX
        ret

Which, incidentally, is substantially nicer than what GCC manages:
T:
        sub     %esp, 8
        mov     %eax, 123
        mov     DWORD PTR [%esp], %ebx
        mov     %ebx, DWORD PTR [%esp+16]
        mov     DWORD PTR [%esp+4], %esi
        mov     %esi, DWORD PTR [%esp+12]
        imul    %ecx, %ebx, 123
        mov     %ebx, DWORD PTR [%esp]
        mul     %esi
        mov     %esi, DWORD PTR [%esp+4]
        add     %esp, 8
        lea     %edx, [%ecx+%edx]
        ret

llvm-svn: 12692

1f6024cb
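
The shorter sequence in 1f6024cb above relies on the constant fitting in 32 bits, so its high half is zero and one of the cross products disappears. The following is a sketch of that arithmetic in Python, with a hypothetical function name and the assumption that the constant is a non-negative 32-bit value.

```python
MASK32 = 0xFFFFFFFF
MASK64 = 0xFFFFFFFFFFFFFFFF

def mul64_by_small_const(lo, hi, c=123):
    """Return (low, high) halves of X * c modulo 2**64, for 0 <= c < 2**32."""
    full = lo * c                 # mul %EDX: 32x32 -> 64-bit result in EDX:EAX
    eax = full & MASK32
    edx = full >> 32
    ecx = (hi * c) & MASK32       # imul %ECX, %ECX, 123 (low 32 bits only)
    ecx = (ecx + edx) & MASK32    # add %ECX, %EDX
    return eax, ecx               # mov %EDX, %ECX puts ecx in the high half
```

The older sequence also computed `lo * 0` for the constant's high half, which is the `imul %ECX, %EBX` that the new code no longer needs.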

Improve code generation of long shifts by 32. · 2448baea

Chris Lattner authored Apr 06, 2004

On this testcase:

long %test(long %X) {
        %Y = shr long %X, ubyte 32
        ret long %Y
}

instead of:
t:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EAX, DWORD PTR [%ESP + 8]
        sar %EAX, 0
        mov %EDX, 0
        ret


we now emit:
test:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %EAX, DWORD PTR [%ESP + 8]
        mov %EDX, 0
        ret

llvm-svn: 12688

2448baea
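
The emitted code in 2448baea above amounts to moving the high half into the low half and zeroing the high half. A minimal Python model, assuming the value is treated as unsigned (sign filling for negative values is not modeled here) and using a made-up function name:

```python
MASK32 = 0xFFFFFFFF

def shift_right_by_32(lo, hi):
    """Return (low, high) halves of a 64-bit value shifted right by 32."""
    new_lo = hi     # mov %EAX, DWORD PTR [%ESP + 8] loads the old high half
    new_hi = 0      # mov %EDX, 0
    return new_lo, new_hi
```

No shift instruction is needed at all, which is why the `sar %EAX, 0` from the old output disappears.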

Bugfixes: inc/dec don't set the carry flag! · 7332d4c5
Chris Lattner authored Apr 06, 2004
```
llvm-svn: 12687
```
7332d4c5
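
The commit message for 7332d4c5 above does not say where the bug showed up. One scenario where the carry flag matters is multi-word arithmetic, and the sketch below assumes that context. On x86, `add` updates CF while `inc` leaves it unchanged, so an `adc` that follows an `inc` sees a stale carry. All names here are illustrative.

```python
MASK32 = 0xFFFFFFFF

def add32(a, b, carry_in=0):
    """Model of add/adc: returns (result, carry_out)."""
    total = a + b + carry_in
    return total & MASK32, total >> 32

def inc32(a, cf):
    """Model of inc: the result wraps, but CF is passed through unchanged."""
    return (a + 1) & MASK32, cf

def increment64_with_add(lo, hi):
    lo, cf = add32(lo, 1)        # add sets CF on wrap-around
    hi, _ = add32(hi, 0, cf)     # adc hi, 0 picks up the carry
    return lo, hi

def increment64_with_inc(lo, hi, stale_cf=0):
    lo, cf = inc32(lo, stale_cf) # inc does not report the wrap-around in CF
    hi, _ = add32(hi, 0, cf)     # adc sees whatever CF held before
    return lo, hi
```

With `lo = 0xFFFFFFFF` and `hi = 0`, the `add` version carries into the high half while the `inc` version loses the carry.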

Improve code for passing constant longs as arguments to function calls. · decce5bc

Chris Lattner authored Apr 06, 2004

For example, on this instruction:

        call void %test(long 1234)

Instead of this:
        mov %EAX, 1234
        mov %ECX, 0
        mov DWORD PTR [%ESP], %EAX
        mov DWORD PTR [%ESP + 4], %ECX
        call test

We now emit this:
        mov DWORD PTR [%ESP], 1234
        mov DWORD PTR [%ESP + 4], 0
        call test

llvm-svn: 12686

decce5bc
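
The improvement in decce5bc above comes from splitting the 64-bit constant into two 32-bit immediates at compile time and storing each directly. Here is a small sketch of that split in Python; the function name and the returned (offset, immediate) pairs are illustrative, not part of the code generator.

```python
MASK32 = 0xFFFFFFFF
MASK64 = 0xFFFFFFFFFFFFFFFF

def split_long_constant(value):
    """Return [(stack_offset, immediate)] stores for a 64-bit constant argument."""
    bits = value & MASK64          # two's complement view of the constant
    low = bits & MASK32            # stored at [%ESP]
    high = bits >> 32              # stored at [%ESP + 4]
    return [(0, low), (4, high)]
```

For the constant 1234 this gives the stores `[%ESP] = 1234` and `[%ESP + 4] = 0`, matching the two `mov` instructions in the new output.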