  8. Oct 17, 2004
    • Don't print stuff out from the code generator. This broke the JIT horribly last night. :)  bork! · 06855531
      Chris Lattner authored
      
      llvm-svn: 17093
    • Rewrite support for cast uint -> FP. In particular, we used to compile this: · 839abf57
      Chris Lattner authored
      double %test(uint %X) {
              %tmp.1 = cast uint %X to double         ; <double> [#uses=1]
              ret double %tmp.1
      }
      
      into:
      
      test:
              sub %ESP, 8
              mov %EAX, DWORD PTR [%ESP + 12]
              mov %ECX, 0
              mov DWORD PTR [%ESP], %EAX
              mov DWORD PTR [%ESP + 4], %ECX
              fild QWORD PTR [%ESP]
              add %ESP, 8
              ret
      
      ... which basically zero extends to 8 bytes, then does an fild for an
      8-byte signed int.
      
      Now we generate this:
      
      test:
              sub %ESP, 4
              mov %EAX, DWORD PTR [%ESP + 8]
              mov DWORD PTR [%ESP], %EAX
              fild DWORD PTR [%ESP]
              shr %EAX, 31
              fadd DWORD PTR [.CPItest_0 + 4*%EAX]
              add %ESP, 4
              ret
      
              .section .rodata
              .align  4
      .CPItest_0:
              .quad   5728578726015270912
      
      This does a 32-bit signed integer load, then adds in an offset if the sign
      bit of the integer was set.
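      
      In C terms, the new sequence amounts to the sketch below (a minimal illustration, not the compiler code itself; the function name is made up, and the two-entry float table mirrors .CPItest_0, whose quad is the pair {0.0f, 2^32} laid out little-endian):
      
      #include <stdint.h>
      
      /* {0.0f, 2^32}: the same pair of floats .CPItest_0 holds */
      static const float bias[2] = { 0.0f, 4294967296.0f };
      
      double uint_to_double(uint32_t x) {
          double d = (double)(int32_t)x;  /* fild of a 32-bit signed load */
          return d + bias[x >> 31];       /* fadd indexed by the sign bit */
      }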
      
      It turns out that this is substantially faster than the preceding sequence.
      Consider this testcase:
      
      unsigned a[2]={1,2};
      volatile double G;
      
      void main() {
          int i;
          for (i=0; i<100000000; ++i )
              G += a[i&1];
      }
      
      On zion (a P4 Xeon, 3GHz), this patch speeds up the testcase from 2.140s
      to 0.94s.
      
      On apoc, an Athlon MP 2100+, this patch speeds up the testcase from 1.72s
      to 1.34s.
      
      Note that the program takes 2.5s/1.97s on zion/apoc with GCC 3.3 -O3
      -fomit-frame-pointer.
      
      llvm-svn: 17083
    • Unify handling of constant pool indexes with the other code paths, allowing us to use index registers for CPIs · 112fd88a
      Chris Lattner authored
      
      llvm-svn: 17082
    • Give the asmprinter the ability to print memrefs with a constant pool index, index reg, and scale · af19d396
      Chris Lattner authored
      
      llvm-svn: 17081
    • fold: · 653d8663
      Chris Lattner authored
        %X = and Y, constantint
        %Z = setcc %X, 0
      
      instead of emitting:
      
              and %EAX, 3
              test %EAX, %EAX
              je .LBBfoo2_2   # UnifiedReturnBlock
      
      We now emit:
      
              test %EAX, 3
              je .LBBfoo2_2   # UnifiedReturnBlock
      
      This triggers 581 times on 176.gcc for example.
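      
      For reference, a C fragment that gives rise to this pattern (a hypothetical function, named after the .LBBfoo2_2 label above; it is not from the commit):
      
      int foo2(unsigned y) {
          /* %X = and Y, 3 ; %Z = setcc %X, 0. Now folded into a single
             test-against-immediate instead of and + test. */
          return (y & 3) == 0;
      }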
      
      llvm-svn: 17080
  15. Oct 06, 2004
    • Remove debugging code, fix encoding problem. This fixes the problems the JIT had last night. · 93867e51
      Chris Lattner authored
      
      llvm-svn: 16766
    • Codegen signed mod by 2 or -2 more efficiently. Instead of generating: · 6835dedb
      Chris Lattner authored
      t:
              mov %EDX, DWORD PTR [%ESP + 4]
              mov %ECX, 2
              mov %EAX, %EDX
              sar %EDX, 31
              idiv %ECX
              mov %EAX, %EDX
              ret
      
      Generate:
      t:
              mov %ECX, DWORD PTR [%ESP + 4]
      ***     mov %EAX, %ECX
              cdq
              and %ECX, 1
              xor %ECX, %EDX
              sub %ECX, %EDX
      ***     mov %EAX, %ECX
              ret
      
      Note that the two marked moves are redundant, and should be eliminated by the
      register allocator, but aren't.
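      
      In C, the new sequence is equivalent to the sketch below (a minimal illustration with a made-up name; it assumes arithmetic right shift on signed ints, which is what cdq implements):
      
      #include <stdint.h>
      
      int32_t srem2(int32_t x) {
          int32_t s = x >> 31;        /* cdq: 0, or -1 if x is negative */
          return ((x & 1) ^ s) - s;   /* and, xor, sub: low bit, sign restored */
      }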
      
      Compare this to GCC, which generates:
      
      t:
              mov     %eax, DWORD PTR [%esp+4]
              mov     %edx, %eax
              shr     %edx, 31
              lea     %ecx, [%edx+%eax]
              and     %ecx, -2
              sub     %eax, %ecx
              ret
      
      or ICC 8.0, which generates:
      
      t:
              movl      4(%esp), %ecx                                 #3.5
              movl      $-2147483647, %eax                            #3.25
              imull     %ecx                                          #3.25
              movl      %ecx, %eax                                    #3.25
              sarl      $31, %eax                                     #3.25
              addl      %ecx, %edx                                    #3.25
              subl      %edx, %eax                                    #3.25
              addl      %eax, %eax                                    #3.25
              negl      %eax                                          #3.25
              subl      %eax, %ecx                                    #3.25
              movl      %ecx, %eax                                    #3.25
              ret                                                     #3.25
      
      We would be in great shape if not for the moves.
      
      llvm-svn: 16763
    • Fix a scary bug with signed division by a power of two. We used to generate: · 7bd8f133
      Chris Lattner authored
      s:   ;; X / 4
              mov %EAX, DWORD PTR [%ESP + 4]
              mov %ECX, %EAX
              sar %ECX, 1
              shr %ECX, 30
              mov %EDX, %EAX
              add %EDX, %ECX
              sar %EAX, 2
              ret
      
      When we really meant:
      
      s:
              mov %EAX, DWORD PTR [%ESP + 4]
              mov %ECX, %EAX
              sar %ECX, 1
              shr %ECX, 30
              add %EAX, %ECX
              sar %EAX, 2
              ret
      
      Hey, this also reduces register pressure :)
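      
      The corrected sequence, in C terms (an illustrative sketch, assuming arithmetic right shift on signed ints): bias a negative dividend by 3 before the shift so the division by 4 rounds toward zero.
      
      #include <stdint.h>
      
      int32_t sdiv4(int32_t x) {
          /* 3 if x is negative, 0 otherwise (the sar/shr pair above) */
          int32_t bias = (int32_t)((uint32_t)(x >> 1) >> 30);
          return (x + bias) >> 2;     /* add, then sar 2 */
      }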
      
      llvm-svn: 16761
    • Codegen signed divides by 2 and -2 more efficiently. In particular, instead of: · 147edd2f
      Chris Lattner authored
      
      s:   ;; X / 2
              movl 4(%esp), %eax
              movl %eax, %ecx
              shrl $31, %ecx
              movl %eax, %edx
              addl %ecx, %edx
              sarl $1, %eax
              ret
      
      t:   ;; X / -2
              movl 4(%esp), %eax
              movl %eax, %ecx
              shrl $31, %ecx
              movl %eax, %edx
              addl %ecx, %edx
              sarl $1, %eax
              negl %eax
              ret
      
      Emit:
      
      s:
              movl 4(%esp), %eax
              cmpl $-2147483648, %eax
              sbbl $-1, %eax
              sarl $1, %eax
              ret
      
      t:
              movl 4(%esp), %eax
              cmpl $-2147483648, %eax
              sbbl $-1, %eax
              sarl $1, %eax
              negl %eax
              ret
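      
      The cmp/sbb pair adds 1 to the dividend exactly when it is negative, so the arithmetic shift rounds toward zero. A minimal C sketch (illustrative name; assumes arithmetic right shift on signed ints):
      
      #include <stdint.h>
      
      int32_t sdiv2(int32_t x) {
          return (x + (x < 0)) >> 1;  /* cmpl/sbbl fold in the +1 branch-free */
      }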
      
      llvm-svn: 16760
    • Add some new instructions. Fix the asm string for sbb32rr · e9bfa5a2
      Chris Lattner authored
      llvm-svn: 16759