- Jun 20, 2004
Chris Lattner authored
llvm-svn: 14266

- Jun 18, 2004
Chris Lattner authored
        mov REG, C
        sub REG, X
generate:
        neg X
        add X, C
which uses one less reg

llvm-svn: 14213
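(For reference: the rewrite exploits the identity C - X == -X + C, so the constant can become the add's immediate operand instead of occupying a register. A minimal C sketch of the pattern, with a made-up constant; the asm in the comments mirrors the before/after above.)

        /* Hypothetical illustration: compute C - X for a constant C (here 5).
         *   before: mov REG, 5 ; sub REG, x    -- the 5 ties up a register
         *   after:  neg x      ; add x, 5      -- the 5 is just an immediate */
        int sub_from_const(int x) {
            return 5 - x;
        }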
Chris Lattner authored
the setcc. llvm-svn: 14212
Chris Lattner authored
we do not want to fold the load in cases like this:

        X = load
          = add A, X
          = add B, X

llvm-svn: 14204

- Jun 17, 2004
Chris Lattner authored
llvm-svn: 14201

- Jun 15, 2004
Chris Lattner authored
llvm-svn: 14189
Chris Lattner authored
llvm-svn: 14185

- Jun 11, 2004
Chris Lattner authored
comparisons.  In an 'isunordered' predicate, which looks like this at the
LLVM level:

        %a = call bool %llvm.isnan(double %X)
        %b = call bool %llvm.isnan(double %Y)
        %COM = or bool %a, %b

We used to generate this code:

        fxch %ST(1)
        fucomip %ST(0), %ST(0)
        setp %AL
        fucomip %ST(0), %ST(0)
        setp %AH
        or %AL, %AH

With this patch, we generate this code:

        fucomip %ST(0), %ST(1)
        fstp %ST(0)
        setp %AL

Which should make alkis happy.  Tested as X86/compare_folding.llx:test1

llvm-svn: 14148
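(At the source level this predicate is what C99 spells isunordered(X, Y): true when either operand is NaN, i.e. the two values do not compare ordered. A rough C sketch of the pattern being folded, assuming the standard math.h isnan; the commit itself matches the %llvm.isnan intrinsic form shown above.)

        #include <math.h>

        /* True iff x and y do not compare ordered, i.e. either is NaN.
         * This is the or-of-isnan pattern the patch now compiles down to
         * a single fucomip + setp. */
        int is_unordered(double x, double y) {
            return isnan(x) || isnan(y);
        }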
Chris Lattner authored
llvm-svn: 14146
Chris Lattner authored
llvm-svn: 14145
Chris Lattner authored
we can get rid of the FpUCOM/FpUCOMi pseudo instructions, which makes stuff simpler and faster. llvm-svn: 14144
Chris Lattner authored
twoarg cases. llvm-svn: 14143
Chris Lattner authored
testcase llvm-svn: 14141
Chris Lattner authored
llvm-svn: 14140
Chris Lattner authored
This makes the code much simpler, and the two cases really do belong apart. Once we do it, it's pretty obvious how flawed the logic was for the A != A case, so I fixed it (fixing PR369). This also uses freeStackSlotAfter instead of inserting an fxch then popStackAfter'ing in the case where there is a dead result (unlikely, but possible), producing better code.

llvm-svn: 14139

- Jun 10, 2004
Chris Lattner authored
llvm-svn: 14110

- Jun 09, 2004
John Criswell authored
that cast to bool. llvm-svn: 14096

- Jun 04, 2004
Chris Lattner authored
llvm-svn: 14005

- Jun 02, 2004
Chris Lattner authored
llvm-svn: 13952

- May 23, 2004
Chris Lattner authored
llvm-svn: 13696
Chris Lattner authored
llvm-svn: 13695
Chris Lattner authored
llvm-svn: 13694

- May 14, 2004
Brian Gaeke authored
MachineBasicBlocks instead. llvm-svn: 13568
Brian Gaeke authored
Get rid of separate numbering for LLVM BasicBlocks; use the automatically generated MachineBasicBlock numbering. llvm-svn: 13567
Brian Gaeke authored
LLVM BasicBlock operands. llvm-svn: 13566

- May 13, 2004
Chris Lattner authored
and passing a null pointer into a function.  For this testcase:

void %test(int** %X) {
        store int* null, int** %X
        call void %test(int** null)
        ret void
}

we now generate this:

test:
        sub %ESP, 12
        mov %EAX, DWORD PTR [%ESP + 16]
        mov DWORD PTR [%EAX], 0
        mov DWORD PTR [%ESP], 0
        call test
        add %ESP, 12
        ret

instead of this:

test:
        sub %ESP, 12
        mov %EAX, DWORD PTR [%ESP + 16]
        mov %ECX, 0
        mov DWORD PTR [%EAX], %ECX
        mov %EAX, 0
        mov DWORD PTR [%ESP], %EAX
        call test
        add %ESP, 12
        ret

llvm-svn: 13558
Chris Lattner authored
the alloca address into common operations like loads/stores.

In a simple testcase like this (which is just designed to exercise the
alloca A, nothing more):

int %test(int %X, bool %C) {
        %A = alloca int
        store int %X, int* %A
        store int* %A, int** %G
        br bool %C, label %T, label %F
T:
        call int %test(int 1, bool false)
        %V = load int* %A
        ret int %V
F:
        call int %test(int 123, bool true)
        %V2 = load int* %A
        ret int %V2
}

We now generate:

test:
        sub %ESP, 12
        mov %EAX, DWORD PTR [%ESP + 16]
        mov %CL, BYTE PTR [%ESP + 20]
***     mov DWORD PTR [%ESP + 8], %EAX
        mov %EAX, OFFSET G
        lea %EDX, DWORD PTR [%ESP + 8]
        mov DWORD PTR [%EAX], %EDX
        test %CL, %CL
        je .LBB2 # PC rel: F
.LBB1:  # T
        mov DWORD PTR [%ESP], 1
        mov DWORD PTR [%ESP + 4], 0
        call test
***     mov %EAX, DWORD PTR [%ESP + 8]
        add %ESP, 12
        ret
.LBB2:  # F
        mov DWORD PTR [%ESP], 123
        mov DWORD PTR [%ESP + 4], 1
        call test
***     mov %EAX, DWORD PTR [%ESP + 8]
        add %ESP, 12
        ret

Instead of:

test:
        sub %ESP, 20
        mov %EAX, DWORD PTR [%ESP + 24]
        mov %CL, BYTE PTR [%ESP + 28]
***     lea %EDX, DWORD PTR [%ESP + 16]
***     mov DWORD PTR [%EDX], %EAX
        mov %EAX, OFFSET G
        mov DWORD PTR [%EAX], %EDX
        test %CL, %CL
***     mov DWORD PTR [%ESP + 12], %EDX
        je .LBB2 # PC rel: F
.LBB1:  # T
        mov DWORD PTR [%ESP], 1
        mov %EAX, 0
        mov DWORD PTR [%ESP + 4], %EAX
        call test
***     mov %EAX, DWORD PTR [%ESP + 12]
***     mov %EAX, DWORD PTR [%EAX]
        add %ESP, 20
        ret
.LBB2:  # F
        mov DWORD PTR [%ESP], 123
        mov %EAX, 1
        mov DWORD PTR [%ESP + 4], %EAX
        call test
***     mov %EAX, DWORD PTR [%ESP + 12]
***     mov %EAX, DWORD PTR [%EAX]
        add %ESP, 20
        ret

llvm-svn: 13557
Chris Lattner authored
sized allocas in the entry block).  Instead of generating code like this:

entry:
        reg1024 = ESP+1234
... (much later)
        *reg1024 = 17

Generate code that looks like this:

entry:
        (no code generated)
... (much later)
        t = ESP+1234
        *t = 17

The advantage being that we DRAMATICALLY reduce the register pressure for
these silly temporaries (they were all being spilled to the stack, resulting
in very silly code).  This is actually a manual implementation of
rematerialization :)

I have a patch to fold the alloca address computation into loads & stores,
which will make this much better still, but just getting this right took
way too much time and I'm sleepy.

llvm-svn: 13554

- May 12, 2004
Chris Lattner authored
        mov DWORD PTR [%ESP + 4], 1

instead of:

        mov %EAX, 1
        mov DWORD PTR [%ESP + 4], %EAX

llvm-svn: 13494

- May 10, 2004
Chris Lattner authored
compiling things like 'add long %X, 1'. The problem is that we were switching the order of the operands for longs even though we can't fold them yet. llvm-svn: 13451
Chris Lattner authored
llvm-svn: 13440
Chris Lattner authored
llvm-svn: 13439

- May 07, 2004
Chris Lattner authored
allows us to compile:

        store float 10.0, float* %P

into:

        mov DWORD PTR [%EAX], 1092616192

instead of:

.CPItest_0:                                     # float 0x4024000000000000
        .long   1092616192      # float 10
        ...
        fld DWORD PTR [.CPItest_0]
        fstp DWORD PTR [%EAX]

llvm-svn: 13409
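(The magic number is simply the IEEE-754 single-precision bit pattern of 10.0: 0x41200000 == 1092616192, so storing the integer is bit-for-bit identical to storing the float. A quick stand-alone check, not part of the commit:)

        #include <stdio.h>
        #include <string.h>

        int main(void) {
            float f = 10.0f;
            unsigned int bits;                 /* assumes 32-bit unsigned int */
            memcpy(&bits, &f, sizeof bits);    /* reinterpret the float's bytes */
            printf("%u 0x%x\n", bits, bits);   /* prints: 1092616192 0x41200000 */
            return 0;
        }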
Chris Lattner authored
against zero.  In particular, don't emit:

        mov %ESI, 0
        cmp %ECX, %ESI

instead, emit:

        test %ECX, %ECX

llvm-svn: 13407

- May 04, 2004
Chris Lattner authored
llvm-svn: 13355
Chris Lattner authored
div:
        mov %EDX, DWORD PTR [%ESP + 4]
        mov %ECX, 64
        mov %EAX, %EDX
        sar %EDX, 31
        idiv %ECX
        ret

to this:

div:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %ECX, %EAX
        sar %ECX, 5
        shr %ECX, 26
        mov %EDX, %EAX
        add %EDX, %ECX
        sar %EAX, 6
        ret

Note that the intel compiler is currently making this:

div:
        movl      4(%esp), %edx         #3.5
        movl      %edx, %eax            #4.14
        sarl      $5, %eax              #4.14
        shrl      $26, %eax             #4.14
        addl      %edx, %eax            #4.14
        sarl      $6, %eax              #4.14
        ret                             #4.14

Which has one less register->register copy. (hint hint alkis :)

llvm-svn: 13354
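(The shift sequence is the standard bias trick for signed division by a power of two: a bare arithmetic shift rounds toward negative infinity, so for negative dividends the code first derives a bias of divisor-1 from the sign bit, which is what the sar/shr pair computes, and adds it in before shifting. That recovers C's round-toward-zero semantics. A C sketch of the idea, not taken from the commit:)

        /* Signed x/64 without idiv.  Assumes 32-bit int and an arithmetic
         * right shift of negative values (true of x86 compilers). */
        int div64(int x) {
            int bias = (x >> 31) & 63;  /* 63 if x < 0, else 0 */
            return (x + bias) >> 6;     /* == x / 64, rounding toward zero */
        }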
Chris Lattner authored
llvm-svn: 13342

- May 01, 2004
Chris Lattner authored
llvm-svn: 13304
Chris Lattner authored
Look at all of the pretty minuses. :) llvm-svn: 13303

- Apr 28, 2004
Brian Gaeke authored
In InsertFPRegKills(), just check the MachineBasicBlock for successors instead of its corresponding BasicBlock. llvm-svn: 13213