- Apr 24, 2007
-
Owen Anderson authored
my approach to this, so hopefully I'll find a way to do this without making this slower. llvm-svn: 36392
-
Devang Patel authored
http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20070423/048333.html llvm-svn: 36380
-
- Apr 21, 2007
-
Owen Anderson authored
llvm-svn: 36300
-
Owen Anderson authored
llvm-svn: 36299
-
Jeff Cohen authored
llvm-svn: 36287
-
- Apr 20, 2007
-
Devang Patel authored
llvm-svn: 36272
-
Owen Anderson authored
llvm-svn: 36271
-
- Apr 19, 2007
-
Zhou Sheng authored
llvm-svn: 36261
-
Zhou Sheng authored
llvm-svn: 36260
-
Evan Cheng authored
llvm-svn: 36258
-
- Apr 18, 2007
-
Owen Anderson authored
llvm-svn: 36255
-
Owen Anderson authored
llvm-svn: 36254
-
Owen Anderson authored
llvm-svn: 36252
-
Owen Anderson authored
llvm-svn: 36249
-
Owen Anderson authored
llvm-svn: 36248
-
Owen Anderson authored
llvm-svn: 36247
-
- Apr 17, 2007
-
Dan Gohman authored
gets called. llvm-svn: 36208
-
Chris Lattner authored
llvm-svn: 36205
-
Chris Lattner authored
llvm-svn: 36202
-
Chris Lattner authored
llvm-svn: 36200
-
Chris Lattner authored
llvm-svn: 36199
-
Devang Patel authored
Fix http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20070416/047888.html llvm-svn: 36182
-
- Apr 16, 2007
-
Anton Korobeynikov authored
target for tabs checking. llvm-svn: 36146
-
- Apr 15, 2007
-
Chris Lattner authored
llvm-svn: 36090
-
Owen Anderson authored
Remove ImmediateDominator analysis. The same information can be obtained from DomTree. A lot of code for constructing ImmediateDominator is now folded into DomTree construction. This is part of the ongoing work for PR217. llvm-svn: 36063
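For orientation, a minimal sketch of the replacement query (written against the present-day DominatorTree interface; the exact spelling at this revision may differ): the immediate dominator that ImmediateDominator used to provide is just the parent of a block's node in the dominator tree.

#include "llvm/IR/Dominators.h"
using namespace llvm;

// Hedged sketch, not code from this commit: read the immediate dominator
// of BB off the dominator tree instead of a separate ImmediateDominator
// analysis.
static BasicBlock *getIDomBlock(DominatorTree &DT, BasicBlock *BB) {
  if (DomTreeNode *N = DT.getNode(BB))        // tree node for BB
    if (DomTreeNode *IDom = N->getIDom())     // parent node in the dom tree
      return IDom->getBlock();                // its basic block
  return nullptr;  // entry block or unreachable block: no idom
}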
-
Chris Lattner authored
llvm-svn: 36047
-
Chris Lattner authored
This sinks the two stores in this example into a single store in cond_next. In this case, it allows elimination of the load as well:

        store double 0.000000e+00, double* @s.3060
        %tmp3 = fcmp ogt double %tmp1, 5.000000e-01            ; <i1> [#uses=1]
        br i1 %tmp3, label %cond_true, label %cond_next

cond_true:              ; preds = %entry
        store double 1.000000e+00, double* @s.3060
        br label %cond_next

cond_next:              ; preds = %entry, %cond_true
        %tmp6 = load double* @s.3060            ; <double> [#uses=1]

This implements Transforms/InstCombine/store-merge.ll:test2

llvm-svn: 36040
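Roughly, the C-level shape of the case above is the following (an illustrative sketch only; the function and variable names are invented, not taken from the test case):

// Illustrative analogue of the IR above; names are invented for the sketch.
double s;                    // stands in for @s.3060

double f(double x) {         // x stands in for %tmp1
  s = 0.0;                   // unconditional store in entry
  if (x > 0.5)
    s = 1.0;                 // conditional store in cond_true
  // After the transform, the two stores become a single store of either
  // 0.0 or 1.0 at the join point, and the reload below can then be
  // simplified to that stored value.
  return s;                  // reload in cond_next (%tmp6)
}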
-
Chris Lattner authored
llvm-svn: 36037
-
Chris Lattner authored
llvm-svn: 36031
-
Chris Lattner authored
define i32 @test(float %f) {
        %tmp7 = insertelement <4 x float> undef, float %f, i32 0
        %tmp17 = bitcast <4 x float> %tmp7 to <4 x i32>
        %tmp19 = extractelement <4 x i32> %tmp17, i32 0
        ret i32 %tmp19
}

into:

define i32 @test(float %f) {
        %tmp19 = bitcast float %f to i32                ; <i32> [#uses=1]
        ret i32 %tmp19
}

On PPC, this is the difference between:

_test:
        mfspr r2, 256
        oris r3, r2, 8192
        mtspr 256, r3
        stfs f1, -16(r1)
        addi r3, r1, -16
        addi r4, r1, -32
        lvx v2, 0, r3
        stvx v2, 0, r4
        lwz r3, -32(r1)
        mtspr 256, r2
        blr

and:

_test:
        stfs f1, -4(r1)
        nop
        nop
        nop
        lwz r3, -4(r1)
        blr

llvm-svn: 36025
-
Chris Lattner authored
unsigned test(float f) {
        return _mm_cvtsi128_si32( (__m128i) _mm_set_ss( f*f ));
}

into:

_test:
        movss 4(%esp), %xmm0
        mulss %xmm0, %xmm0
        movd %xmm0, %eax
        ret

instead of:

_test:
        movss 4(%esp), %xmm0
        mulss %xmm0, %xmm0
        xorps %xmm1, %xmm1
        movss %xmm0, %xmm1
        movd %xmm1, %eax
        ret

GCC gets:

_test:
        subl $28, %esp
        movss 32(%esp), %xmm0
        mulss %xmm0, %xmm0
        xorps %xmm1, %xmm1
        movss %xmm0, %xmm1
        movaps %xmm1, %xmm0
        movd %xmm0, 12(%esp)
        movl 12(%esp), %eax
        addl $28, %esp
        ret

llvm-svn: 36020
-
Chris Lattner authored
llvm-svn: 36017
-
- Apr 14, 2007
-
Chris Lattner authored
llvm-svn: 36002
-
Jeff Cohen authored
llvm-svn: 35998
-
Jeff Cohen authored
llvm-svn: 35996
-
Chris Lattner authored
printf("") -> noop. Still need to do the xforms for fprintf. This implements Transforms/SimplifyLibCalls/Printf.ll llvm-svn: 35984
-
Chris Lattner authored
in order to clean up after simplifylibcalls. llvm-svn: 35982
-
Chris Lattner authored
llvm-svn: 35981
-
Chris Lattner authored
llvm-svn: 35979
-
- Apr 13, 2007
-
Chris Lattner authored
out to do! :) This fixes a problem where LSR would insert a bunch of code into each MBB that uses a particular subexpression (e.g. IV+base+C). The problem is that this code cannot be CSE'd back together if inserted into different blocks.

This patch changes LSR to attempt to insert a single copy of this code and share it, allowing codegenprepare to duplicate the code if it can be sunk into various addressing modes. On CodeGen/ARM/lsr-code-insertion.ll, for example, this gives us code like:

        add r8, r0, r5
        str r6, [r8, #+4]
        ..
        ble LBB1_4      @cond_next
LBB1_3: @cond_true
        str r10, [r8, #+4]
LBB1_4: @cond_next
        ...
LBB1_5: @cond_true55
        ldr r6, LCPI1_1
        str r6, [r8, #+4]

instead of:

        add r10, r0, r6
        str r8, [r10, #+4]
        ...
        ble LBB1_4      @cond_next
LBB1_3: @cond_true
        add r8, r0, r6
        str r10, [r8, #+4]
LBB1_4: @cond_next
        ...
LBB1_5: @cond_true55
        add r8, r0, r6
        ldr r10, LCPI1_1
        str r10, [r8, #+4]

Besides being smaller and more efficient, this makes it immediately obvious that it is profitable to predicate LBB1_3 now :)

llvm-svn: 35972
-