- Oct 30, 2007
Evan Cheng authored
It's not safe to tell SplitCriticalEdge to merge identical edges. It may delete the phi instruction that's being processed. llvm-svn: 43524
- Oct 29, 2007
Evan Cheng authored
- Allow icmp rewrite using an iv / stride of a smaller integer type. llvm-svn: 43480
Dan Gohman authored
llvm-svn: 43463
Dan Gohman authored
llvm-svn: 43462
Dan Gohman authored
llvm-svn: 43461
Dan Gohman authored
llvm-svn: 43460
- Oct 27, 2007
Evan Cheng authored
- ChangeCompareStride only reuses strides that are larger than the current stride. It will let the general reuse mechanism try to reuse a smaller stride.
- Watch out for multiplication overflow in ChangeCompareStride.
- Replace std::set with SmallPtrSet.
llvm-svn: 43408
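The overflow point above is easy to miss: rewriting a compare onto a larger stride means scaling the compare's constant by the stride ratio, and that product can wrap. A minimal sketch of the kind of guard this implies, with made-up names (scaleCompareConstant is hypothetical, not the actual LSR code):

    #include <cstdint>
    #include <limits>

    // Hypothetical helper: scale a compare constant from an old stride to a
    // larger one, refusing the rewrite whenever the scaled value would
    // overflow a signed 64-bit integer.
    static bool scaleCompareConstant(int64_t CmpVal, int64_t OldStride,
                                     int64_t NewStride, int64_t &NewCmpVal) {
      if (OldStride == 0 || NewStride % OldStride != 0)
        return false;                     // only exact stride multiples work
      int64_t Ratio = NewStride / OldStride;
      if (Ratio <= 1)
        return false;                     // only reuse a *larger* stride
      // CmpVal * Ratio must not overflow; check by division first.
      if (CmpVal > std::numeric_limits<int64_t>::max() / Ratio ||
          CmpVal < std::numeric_limits<int64_t>::min() / Ratio)
        return false;
      NewCmpVal = CmpVal * Ratio;
      return true;
    }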
- Oct 26, 2007
Evan Cheng authored
llvm-svn: 43384
Evan Cheng authored
Loosen up iv reuse to allow reuse of the same stride but a larger type when truncating from the larger type to the smaller type is free. e.g. Turns this loop:

  LBB1_1: # entry.bb_crit_edge
      xorl %ecx, %ecx
      xorw %dx, %dx
      movw %dx, %si
  LBB1_2: # bb
      movl L_X$non_lazy_ptr, %edi
      movw %si, (%edi)
      movl L_Y$non_lazy_ptr, %edi
      movw %dx, (%edi)
      addw $4, %dx
      incw %si
      incl %ecx
      cmpl %eax, %ecx
      jne LBB1_2 # bb

into

  LBB1_1: # entry.bb_crit_edge
      xorl %ecx, %ecx
      xorw %dx, %dx
  LBB1_2: # bb
      movl L_X$non_lazy_ptr, %esi
      movw %cx, (%esi)
      movl L_Y$non_lazy_ptr, %esi
      movw %dx, (%esi)
      addw $4, %dx
      incl %ecx
      cmpl %eax, %ecx
      jne LBB1_2 # bb

llvm-svn: 43375
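At the source level the pattern is roughly a loop whose 32-bit counter also feeds 16-bit stores; a hypothetical reconstruction (the names X, Y and the exact strides are invented for illustration):

    // The 16-bit value stored to X tracks the 32-bit counter exactly, so on
    // x86 it can be read from the low half of the counter's register (a free
    // truncation) instead of being carried in a separate 16-bit IV (%si above).
    short X, Y;

    void f(int n) {
      short y = 0;
      for (int i = 0; i < n; ++i) {
        X = (short)i;   // reuses i; previously a distinct 16-bit counter
        Y = y;
        y += 4;
      }
    }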
Evan Cheng authored
stride may be rewritten using the stride of the compare instruction. llvm-svn: 43367
- Oct 25, 2007
Evan Cheng authored
llvm-svn: 43356
Evan Cheng authored
and the comparison is against a constant value, try to eliminate the stride by moving the compare instruction to another stride and changing its constant operand accordingly. e.g.

  loop:
    ...
    v1 = v1 + 3
    v2 = v2 + 1
    if (v2 < 10) goto loop
  =>
  loop:
    ...
    v1 = v1 + 3
    if (v1 < 30) goto loop

llvm-svn: 43336
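In source terms this folds the exit test of one counter into another that grows proportionally; a hypothetical illustration (both counters start at 0, so v1 == 3*v2 holds on every iteration and the two exit tests are equivalent):

    // Before: two induction variables; the loop test uses the unit-stride one.
    void before(int *acc) {
      int v1 = 0;
      for (int v2 = 0; v2 < 10; ++v2) {
        *acc += v1;   // some use that keeps the stride-3 IV alive
        v1 += 3;
      }
    }

    // After: the compare is rewritten onto the stride-3 IV (v1 < 30), and the
    // unit-stride counter disappears entirely.
    void after(int *acc) {
      for (int v1 = 0; v1 < 30; v1 += 3)
        *acc += v1;
    }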
- Oct 22, 2007
Dan Gohman authored
- Avoid attempting stride-reuse in the case that there are users that aren't addresses. In that case, there will be places where the multiplications won't be folded away, so it's better to try to strength-reduce them.
- Several SSE intrinsics have operands that strength-reduction can treat as addresses. The previous item makes this more visible, as any non-address use of an IV can inhibit stride-reuse.
- Make ValidStride aware of whether there's likely to be a base register in the address computation. This prevents it from thinking that things like stride 9 are valid on x86 when the base register is already occupied.

Also, XFAIL the 2007-08-10-LEA16Use32.ll test; the new logic to avoid stride-reuse eliminates the LEA in the loop, so the test is no longer testing what it was intended to test.
llvm-svn: 43231
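The first bullet hinges on whether an IV's scaled value is only ever folded into an address; a hypothetical pair of loops showing the distinction:

    // In sum_addresses, i's scaling by 8 is consumed only by addressing, so it
    // folds into x86's [base + index*8] mode for free and stride-reuse is fine.
    // In sum_scaled, i*8 is also needed as a plain integer value, so a real
    // multiply (or its strength-reduced equivalent) must exist somewhere;
    // reusing a larger stride for the address no longer avoids that work.
    long sum_addresses(const long *p, int n) {
      long s = 0;
      for (int i = 0; i < n; ++i)
        s += p[i];              // address-only use of i*8
      return s;
    }

    long sum_scaled(const long *p, int n) {
      long s = 0;
      for (int i = 0; i < n; ++i)
        s += p[i] + i * 8;      // i*8 also used as a value
      return s;
    }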
Dan Gohman authored
SCEV subclasses to being non-static member functions of the ScalarEvolution class. llvm-svn: 43224
- Oct 02, 2007
Dale Johannesen authored
llvm-svn: 42508
- Aug 02, 2007
Chris Lattner authored
gvn, gvnpre, dse, and predsimplify. To see these, use: make check-line-length llvm-svn: 40738
- Aug 01, 2007
Dan Gohman authored
llvm-svn: 40673
- Jul 31, 2007
Dan Gohman authored
directly, because the insert point used by the SCEVExpander may vary from what LSR originally computes. llvm-svn: 40641
- Jun 19, 2007
Dan Gohman authored
deleteValueFromRecords and loosen the types to allow it to accept Value* instead of just Instruction*, since this is what ScalarEvolution uses internally anyway. This allows more flexibility for future uses. llvm-svn: 37657
- Jun 15, 2007
Dan Gohman authored
This created an ambiguity for expandInTy to decide when to use sign-extension or zero-extension, but it turns out that most of its callers don't actually need a type conversion, now that LLVM types don't have explicit signedness. Drop expandInTy in favor of plain expand, and change the few places that actually need a type conversion to do it themselves. llvm-svn: 37591
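The ambiguity is the standard one for widening a signless value; a small numeric illustration (not LLVM code, just the underlying arithmetic):

    #include <cstdint>
    #include <cstdio>

    int main() {
      uint16_t bits = 0xFFFB;            // a 16-bit pattern with the top bit set
      int32_t sext = (int16_t)bits;      // sign-extend: -5
      int32_t zext = bits;               // zero-extend: 65531
      // With signless IR types nothing in the value itself says which widening
      // is meant, so an expander that converted types implicitly had to guess.
      std::printf("sext=%d zext=%d\n", sext, zext);
      return 0;
    }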
- Jun 07, 2007
Devang Patel authored
llvm-svn: 37499
- Jun 06, 2007
Chris Lattner authored
llvm-svn: 37454
- May 19, 2007
Chris Lattner authored
into:

  _t:
      movl 8(%esp), %ecx
      movl 4(%esp), %eax
      cmpl %ecx, %eax
      je LBB1_3   #bb17
  LBB1_1: #bb
      cmpl %ecx, %eax
      jg LBB1_4   #cond_true
  LBB1_2: #cond_false
      subl %eax, %ecx
      cmpl %ecx, %eax
      jne LBB1_1  #bb
  LBB1_3: #bb17
      ret
  LBB1_4: #cond_true
      subl %ecx, %eax
      cmpl %ecx, %eax
      jne LBB1_1  #bb
      jmp LBB1_3  #bb17

instead of:

  _t:
      subl $4, %esp
      movl %esi, (%esp)
      movl 12(%esp), %ecx
      movl 8(%esp), %eax
      cmpl %ecx, %eax
      je LBB1_4   #bb17
  LBB1_1: #bb.outer
      movl %ecx, %edx
      negl %edx
  LBB1_2: #bb
      cmpl %ecx, %eax
      jle LBB1_5  #cond_false
  LBB1_3: #cond_true
      addl %edx, %eax
      cmpl %ecx, %eax
      jne LBB1_2  #bb
  LBB1_4: #bb17
      movl (%esp), %esi
      addl $4, %esp
      ret
  LBB1_5: #cond_false
      movl %ecx, %edx
      subl %eax, %edx
      movl %eax, %esi
      addl %esi, %esi
      cmpl %ecx, %esi
      je LBB1_4   #bb17
  LBB1_6: #cond_false.bb.outer_crit_edge
      movl %edx, %ecx
      jmp LBB1_1  #bb.outer

llvm-svn: 37252
- May 12, 2007
Chris Lattner authored
llvm-svn: 36996
- May 04, 2007
Dan Gohman authored
slightly nicer than using CallInst with an extra check; thanks Chris. llvm-svn: 36743
Dan Gohman authored
address operand in a prefetch intrinsic. llvm-svn: 36713
- May 03, 2007
Devang Patel authored
llvm-svn: 36662
- May 02, 2007
Devang Patel authored
Due to a darwin gcc bug, one version of the darwin linker coalesces static const int, which defeats PassID-based pass identification. llvm-svn: 36652
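The identification being defeated relies on the address of a per-pass static being unique to that pass; if the linker coalesces identical constants, two passes can end up sharing one address. A stripped-down sketch of the idea (simplified; not the real PassInfo machinery):

    #include <cassert>

    // Each pass type owns one static object, and the *address* of that object
    // (not its value) identifies the pass. Coalescing identical "static const
    // int" definitions would therefore make distinct passes compare equal.
    struct PassA { static const int ID; };
    struct PassB { static const int ID; };
    const int PassA::ID = 0;
    const int PassB::ID = 0;

    int main() {
      assert(&PassA::ID != &PassB::ID &&
             "distinct pass types must have distinct identities");
      return 0;
    }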
- May 01, 2007
Devang Patel authored
llvm-svn: 36632
- Apr 24, 2007
Devang Patel authored
http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20070423/048333.html llvm-svn: 36380
- Apr 15, 2007
Owen Anderson authored
Remove ImmediateDominator analysis. The same information can be obtained from DomTree. A lot of code for constructing ImmediateDominator is now folded into DomTree construction. This is part of the ongoing work for PR217. llvm-svn: 36063
- Apr 13, 2007
Chris Lattner authored
out to do! :) This fixes a problem where LSR would insert a bunch of code into each MBB that uses a particular subexpression (e.g. IV+base+C). The problem is that this code cannot be CSE'd back together if inserted into different blocks. This patch changes LSR to attempt to insert a single copy of this code and share it, allowing codegenprepare to duplicate the code if it can be sunk into various addressing modes. On CodeGen/ARM/lsr-code-insertion.ll, for example, this gives us code like:

      add r8, r0, r5
      str r6, [r8, #+4]
      ..
      ble LBB1_4 @cond_next
  LBB1_3: @cond_true
      str r10, [r8, #+4]
  LBB1_4: @cond_next
      ...
  LBB1_5: @cond_true55
      ldr r6, LCPI1_1
      str r6, [r8, #+4]

instead of:

      add r10, r0, r6
      str r8, [r10, #+4]
      ...
      ble LBB1_4 @cond_next
  LBB1_3: @cond_true
      add r8, r0, r6
      str r10, [r8, #+4]
  LBB1_4: @cond_next
      ...
  LBB1_5: @cond_true55
      add r8, r0, r6
      ldr r10, LCPI1_1
      str r10, [r8, #+4]

Besides being smaller and more efficient, this makes it immediately obvious that it is profitable to predicate LBB1_3 now :)
llvm-svn: 35972
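The shared "IV+base+C" expression corresponds to the same field of the current element being stored from several branches of the loop body; a hypothetical source-level shape:

    // Every store below addresses base + i*8 + 4. Emitting that address
    // computation once per iteration (the single "add r8, r0, r5" above) lets
    // each branch contribute only its str; inserting the computation into each
    // block separately produces copies that cannot be CSE'd back together.
    struct Pair { int key; int val; };

    void update(Pair *base, int n, int x, int y, int z) {
      for (int i = 0; i < n; ++i) {
        base[i].val = x;                  // unconditional store
        if (base[i].key > 0)
          base[i].val = y;                // same address, different block
        else if (base[i].key < -10)
          base[i].val = z;                // same address again
      }
    }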
- Apr 10, 2007
Chris Lattner authored
llvm-svn: 35837
- Apr 07, 2007
Owen Anderson authored
llvm-svn: 35731
- Apr 03, 2007
Chris Lattner authored
llvm-svn: 35615
Chris Lattner authored
llvm-svn: 35607
- Apr 02, 2007
Chris Lattner authored
target hook. This allows us to codegen a loop as:

  LBB1_1: @cond_next
      mov r2, #0
      str r2, [r0, +r3, lsl #2]
      add r3, r3, #1
      cmn r3, #1
      bne LBB1_1 @cond_next

instead of:

  LBB1_1: @cond_next
      mov r2, #0
      str r2, [r0], #+4
      add r3, r3, #1
      cmn r3, #1
      bne LBB1_1 @cond_next

This looks the same, but has one fewer induction variable (and therefore, one fewer register) live in the loop.
llvm-svn: 35592
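The loop in the example is essentially an array-zeroing loop; keeping the store on the scaled-index form lets the loop counter double as the address index, while the post-increment form keeps a separate pointer induction variable alive. A hypothetical source-level equivalent:

    // With the scaled-index store ([r0, +r3, lsl #2]) only the counter i lives
    // across iterations; with post-increment addressing ([r0], #+4) a running
    // pointer is a second induction variable occupying another register.
    void zero(int *p, int n) {
      for (int i = 0; i < n; ++i)
        p[i] = 0;
    }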
Chris Lattner authored
llvm-svn: 35563
- Mar 26, 2007
Dale Johannesen authored
llvm-svn: 35347
- Mar 20, 2007
Dale Johannesen authored
instructions (that would have to be split later) llvm-svn: 35227