Commits · 877aaa589bad47b76c48688bd322ddfdc32fe217 · Roger Ferrer / llvm-epi-0.8

Jan 08, 2011

fix an issue in IsPointerOffset that prevented us from recognizing that · 5120ebf1
Chris Lattner authored Jan 08, 2011
```
P and P+1 are relative to the same base pointer.

llvm-svn: 123087
```
5120ebf1
enhance memcpyopt to merge a store and a subsequent · 4dc1fd93
Chris Lattner authored Jan 08, 2011
```
memset into a single larger memset.

llvm-svn: 123086
```
4dc1fd93

constify TargetData references. · c638147e

Chris Lattner authored Jan 08, 2011

Split memset formation logic out into its own
"tryMergingIntoMemset" helper function.

llvm-svn: 123081

c638147e

When loop rotation happens, it is *very* common for the duplicated condbr · 59c82f85

Chris Lattner authored Jan 08, 2011

to be foldable into an uncond branch.  When this happens, we can make a
much simpler CFG for the loop, which is important for nested loop cases
where we want the outer loop to be aggressively optimized.

Handle this case more aggressively.  For example, previously on
phi-duplicate.ll we would get this:


define void @test(i32 %N, double* %G) nounwind ssp {
entry:
  %cmp1 = icmp slt i64 1, 1000
  br i1 %cmp1, label %bb.nph, label %for.end

bb.nph:                                           ; preds = %entry
  br label %for.body

for.body:                                         ; preds = %bb.nph, %for.cond
  %j.02 = phi i64 [ 1, %bb.nph ], [ %inc, %for.cond ]
  %arrayidx = getelementptr inbounds double* %G, i64 %j.02
  %tmp3 = load double* %arrayidx
  %sub = sub i64 %j.02, 1
  %arrayidx6 = getelementptr inbounds double* %G, i64 %sub
  %tmp7 = load double* %arrayidx6
  %add = fadd double %tmp3, %tmp7
  %arrayidx10 = getelementptr inbounds double* %G, i64 %j.02
  store double %add, double* %arrayidx10
  %inc = add nsw i64 %j.02, 1
  br label %for.cond

for.cond:                                         ; preds = %for.body
  %cmp = icmp slt i64 %inc, 1000
  br i1 %cmp, label %for.body, label %for.cond.for.end_crit_edge

for.cond.for.end_crit_edge:                       ; preds = %for.cond
  br label %for.end

for.end:                                          ; preds = %for.cond.for.end_crit_edge, %entry
  ret void
}

Now we get the much nicer:

define void @test(i32 %N, double* %G) nounwind ssp {
entry:
  br label %for.body

for.body:                                         ; preds = %entry, %for.body
  %j.01 = phi i64 [ 1, %entry ], [ %inc, %for.body ]
  %arrayidx = getelementptr inbounds double* %G, i64 %j.01
  %tmp3 = load double* %arrayidx
  %sub = sub i64 %j.01, 1
  %arrayidx6 = getelementptr inbounds double* %G, i64 %sub
  %tmp7 = load double* %arrayidx6
  %add = fadd double %tmp3, %tmp7
  %arrayidx10 = getelementptr inbounds double* %G, i64 %j.01
  store double %add, double* %arrayidx10
  %inc = add nsw i64 %j.01, 1
  %cmp = icmp slt i64 %inc, 1000
  br i1 %cmp, label %for.body, label %for.end

for.end:                                          ; preds = %for.body
  ret void
}

With all of these recent changes, we are now able to compile:

void foo(char *X) {
 for (int i = 0; i != 100; ++i) 
   for (int j = 0; j != 100; ++j)
     X[j+i*100] = 0;
}

into a single memset of 10000 bytes.  This series of changes
should also be helpful for other nested loop scenarios as well.

llvm-svn: 123079

59c82f85

split ssa updating code out to its own helper function. Don't bother · 30f318e5
Chris Lattner authored Jan 08, 2011
```
moving the OrigHeader block anymore: we just merge it away anyway so
its code layout doesn't matter.

llvm-svn: 123077
```
30f318e5

Implement a TODO: Enhance loopinfo to merge away the unconditional branch · 2615130e

Chris Lattner authored Jan 08, 2011

that it was leaving in loops after rotation (between the original latch
block and the original header.

With this change, it is possible for rotated loops to have just a single
basic block, which is useful.

llvm-svn: 123075

2615130e

various code cleanups, enhance MergeBlockIntoPredecessor to preserve · 930b716e
Chris Lattner authored Jan 08, 2011
```
loop info.

llvm-svn: 123074
```
930b716e
inline preserveCanonicalLoopForm now that it is simple. · fee37c5f
Chris Lattner authored Jan 08, 2011
```
llvm-svn: 123073
```
fee37c5f

Three major changes: · 063dca0f

Chris Lattner authored Jan 08, 2011

1. Rip out LoopRotate's domfrontier updating code.  It isn't
   needed now that LICM doesn't use DF and it is super complex
   and gross.
2. Make DomTree updating code a lot simpler and faster.  The 
   old loop over all the blocks was just to find a block??
3. Change the code that inserts the new preheader to just use
   SplitCriticalEdge instead of doing an overcomplex 
   reimplementation of it.

No behavior change, except for the name of the inserted preheader.

llvm-svn: 123072

063dca0f

reduce nesting. · 30d95f9f
Chris Lattner authored Jan 08, 2011
```
llvm-svn: 123071
```
30d95f9f
LoopRotate requires canonical loop form, so it always has preheaders · 7fab23bc
Chris Lattner authored Jan 08, 2011
```
and latch blocks.  Reorder entry conditions to make hte pass faster
and more logical.

llvm-svn: 123069
```
7fab23bc
use the LI ivar. · d62691f4
Chris Lattner authored Jan 08, 2011
```
llvm-svn: 123068
```
d62691f4
some cleanups: remove dead arguments and eliminate ivars · 385f2ec6
Chris Lattner authored Jan 08, 2011
```
that are just passed to one function.

llvm-svn: 123067
```
385f2ec6
fix an issue duncan pointed out, which could cause loop rotate · 25ba40a0
Chris Lattner authored Jan 08, 2011
```
to violate LCSSA form

llvm-svn: 123066
```
25ba40a0
Fix coding style issues. · b4ab257b
Cameron Zwarich authored Jan 08, 2011
```
llvm-svn: 123065
```
b4ab257b

Make more passes preserve dominators (or state that they preserve dominators if · 84986b29

Cameron Zwarich authored Jan 08, 2011

they all ready do). This removes two dominator recomputations prior to isel,
which is a 1% improvement in total llc time for 403.gcc.

The only potentially suspect thing is making GCStrategy recompute dominators if
it used a custom lowering strategy.

llvm-svn: 123064

84986b29

Contract subloop bodies. However, it is still important to visit the phis at the · 80bd9af7
Cameron Zwarich authored Jan 08, 2011
```
top of subloop headers, as the phi uses logically occur outside of the subloop.

llvm-svn: 123062
```
80bd9af7
Fix a bug in r123034 (trying to sext/zext non-integers) and clean up a little. · 6a1fb8f2
Frits van Bommel authored Jan 08, 2011
```
llvm-svn: 123061
```
6a1fb8f2

Have loop-rotate simplify instructions (yay instsimplify!) as it clones · 8c5defd0

Chris Lattner authored Jan 08, 2011

them into the loop preheader, eliminating silly instructions like
"icmp i32 0, 100" in fixed tripcount loops. This also better exposes the
bigger problem with loop rotate that I'd like to fix: once this has been
folded, the duplicated conditional branch *often* turns into an uncond branch.

Not aggressively handling this is pessimizing later loop optimizations
somethin' fierce by making "dominates all exit blocks" checks fail.

llvm-svn: 123060

8c5defd0

Revamp the ValueMapper interfaces in a couple ways: · 43f8d164

Chris Lattner authored Jan 08, 2011

1. Take a flags argument instead of a bool.  This makes
   it more clear to the reader what it is used for.
2. Add a flag that says that "remapping a value not in the
   map is ok".
3. Reimplement MapValue to share a bunch of code and be a lot
   more efficient.  For lookup failures, don't drop null values
   into the map.
4. Using the new flag a bunch of code can vaporize in LinkModules
   and LoopUnswitch, kill it.

No functionality change.

llvm-svn: 123058

43f8d164

two minor changes: switch to the standard ValueToValueMapTy · 2b3f20e6

Chris Lattner authored Jan 08, 2011

map from ValueMapper.h (giving us access to its utilities)
and add a fastpath in the loop rotation code, avoiding expensive
ssa updator manipulation for values with nothing to update.

llvm-svn: 123057

2b3f20e6

Jan 07, 2011

InstCombine: Match min/max hidden by sext/zext · fc3d7f66

Tobias Grosser authored Jan 07, 2011

X = sext x; x >s c ? X : C+1 --> X = sext x; X <s C+1 ? C+1 : X
X = sext x; x <s c ? X : C-1 --> X = sext x; X >s C-1 ? C-1 : X
X = zext x; x >u c ? X : C+1 --> X = zext x; X <u C+1 ? C+1 : X
X = zext x; x <u c ? X : C-1 --> X = zext x; X >u C-1 ? C-1 : X
X = sext x; x >u c ? X : C+1 --> X = sext x; X <u C+1 ? C+1 : X
X = sext x; x <u c ? X : C-1 --> X = sext x; X >u C-1 ? C-1 : X

Instead of calculating this with mixed types promote all to the
larger type. This enables scalar evolution to analyze this
expression. PR8866

llvm-svn: 123034

fc3d7f66

Some whitespace fixes · 411e6eed
Tobias Grosser authored Jan 07, 2011
```
llvm-svn: 123033
```
411e6eed
Revert 122959, it needs more thought. Add it back to README.txt with additional notes. · 134cde91
Benjamin Kramer authored Jan 07, 2011
```
llvm-svn: 123030
```
134cde91
Remove all uses of the "ugly" method BranchInst::setUnconditionalDest(). · 89afb43b
Jay Foad authored Jan 07, 2011
```
llvm-svn: 123025
```
89afb43b

Jan 06, 2011
- InstCombine: Turn _chk functions into the "unsafe" variant if length and max langth are equal. · ae67cc13
  Benjamin Kramer authored Jan 06, 2011
```
This happens when we take the (non-constant) length from a malloc.

llvm-svn: 122961
```
  ae67cc13
- InstCombine: If we call llvm.objectsize on a malloc call we can replace it... · 799b0112
  Benjamin Kramer authored Jan 06, 2011
```
InstCombine: If we call llvm.objectsize on a malloc call we can replace it with the size passed to malloc.

llvm-svn: 122959
```
  799b0112
- InstCombine: Teach llvm.objectsize folding to look through GEPs. · a76cc117
  Benjamin Kramer authored Jan 06, 2011
```
llvm-svn: 122958
```
  a76cc117
- Add the CallInst optimizations that don't involve expanding inline assembly to · 9ec19ea0
  Cameron Zwarich authored Jan 06, 2011
```
OptimizeInst() so that they can be used on a worklist instruction.

llvm-svn: 122945
```
  9ec19ea0
- Move the GEP handling in CodeGenPrepare to OptimizeInst(). · d28c78eb
  Cameron Zwarich authored Jan 06, 2011
```
llvm-svn: 122944
```
  d28c78eb
- Split the optimizations in CodeGenPrepare that don't manipulate the iterators · 14ac865c
  Cameron Zwarich authored Jan 06, 2011
```
into a separate function, so that it can be called from a loop using a worklist
rather than a loop traversing a whole basic block.

llvm-svn: 122943
```
  14ac865c
- Zap the last two -Wself-assign warnings in llvm. · 70be93a2
  Jakob Stoklund Olesen authored Jan 06, 2011
```
Simplify RALinScan::DowngradeRegister with TRI::getOverlaps while we are there.

llvm-svn: 122940
```
  70be93a2
- Stop reallocating SunkAddrs for each basic block. When we move to an instruction · ce3b930a
  Cameron Zwarich authored Jan 06, 2011
```
worklist, the key will need to become std::pair<BasicBlock*, Value*>.

llvm-svn: 122932
```
  ce3b930a
Jan 05, 2011
- Add some more statistics to CodeGenPrepare. · b62ccb24
  Cameron Zwarich authored Jan 05, 2011
```
llvm-svn: 122891
```
  b62ccb24
- Add some stats to CodeGenPrepare to make it easier to speed it up without · ced753fa
  Cameron Zwarich authored Jan 05, 2011
```
regressing code quality.

llvm-svn: 122887
```
  ced753fa
- Use pop_back_val instead of back followed by pop_back. · 6a789953
  Cameron Zwarich authored Jan 05, 2011
```
llvm-svn: 122876
```
  6a789953
- Use a worklist for later iterations just like ordinary instsimplify. The next · 5a2bb998
  Cameron Zwarich authored Jan 05, 2011
```
step is to only process instructions in subloops if they have been modified by
an earlier simplification.

llvm-svn: 122869
```
  5a2bb998
- Change LoopInstSimplify back to a LoopPass. It revisits subloops rather than · 4c51d122
  Cameron Zwarich authored Jan 05, 2011
```
skipping them, but it should probably use a worklist and only revisit those
instructions in subloops that have actually changed. It should probably also
use a worklist after the first iteration like instsimplify now does. Regardless,
it's only 0.3% of opt -O2 time on 403.gcc if it replaces the instcombine placed
in the middle of the loop passes.

llvm-svn: 122868
```
  4c51d122
Jan 04, 2011

Don't bother value numbering instructions with void types in GVN. In theory... · 7b25ff04

Owen Anderson authored Jan 04, 2011

Don't bother value numbering instructions with void types in GVN. In theory this should allow us to insert
fewer things into the value numbering maps, but any speedup is beneath the noise threshold on my machine
on 403.gcc.

llvm-svn: 122844

7b25ff04

Complete the NumberTable --> LeaderTable rename. · e39cb57b
Owen Anderson authored Jan 04, 2011
```
llvm-svn: 122828
```
e39cb57b