- Nov 21, 2010
-
Chris Lattner authored
allowing the memcpy to be eliminated. Unfortunately, the requirements on byval arguments without explicit alignment are really weak and impossible to predict in the mid-level optimizer, so this doesn't kick in much with current frontends. The fix is to change clang to set alignment on all byval arguments. llvm-svn: 119916
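
A rough sketch of the kind of IR involved (the struct type, function names and alignment value below are invented for illustration):

    ; sketch: with an explicit "align" on the byval argument, the optimizer
    ; knows the alignment of the callee's implicit copy and can forward the
    ; original memory instead of keeping a separately memcpy'd temporary
    %struct.S = type { i32, [60 x i8] }

    declare void @use(%struct.S* byval align 4)

    define void @caller(%struct.S* %p) {
      call void @use(%struct.S* byval align 4 %p)
      ret void
    }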
-
- Nov 20, 2010
-
Benjamin Kramer authored
llvm-svn: 119908
-
- Nov 19, 2010
-
Owen Anderson authored
llvm-svn: 119865
-
Owen Anderson authored
if all the operands of the PHI are equivalent. This allows CodeGenPrepare to undo unprofitable PRE transforms. llvm-svn: 119853
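
For example (a contrived sketch; block and value names are invented), a phi all of whose incoming values are the same is just that value:

    define i32 @example(i1 %c, i32 %x) {
    entry:
      br i1 %c, label %then, label %else
    then:
      br label %merge
    else:
      br label %merge
    merge:
      ; every incoming value is %x, so the phi simplifies to %x
      %p = phi i32 [ %x, %then ], [ %x, %else ]
      ret i32 %p
    }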
-
- Nov 18, 2010
-
Duncan Sands authored
preserves LCSSA form out of ScalarEvolution and into the LoopInfo class. Use it to check that SimplifyInstruction simplifications are not breaking LCSSA form. Fixes PR8622. llvm-svn: 119727
-
Owen Anderson authored
Completely rework the data structure GVN uses to represent the value number to leader mapping. Previously, this was a tree of hashtables, and a query recursed into the table for the immediate dominator ad infinitum if the initial lookup failed. This led to really bad performance on tall, narrow CFGs.

We can instead replace it with what is conceptually a multimap of value numbers to leaders (actually represented by a hashtable with a list of Value*'s as the value type), and then determine which leader from that set to use very cheaply thanks to the DFS numberings maintained by DominatorTree. Because there are typically few duplicates of a given value, this scan tends to be quite fast. Additionally, we use a custom linked list and BumpPtr allocation to avoid any unnecessary allocation in representing the value-side of the multimap.

This change brings with it a 15% (!) improvement in the total running time of GVN on 403.gcc, which I think is pretty good considering that includes all the "real work" being done by MemDep as well.

The one downside to this approach is that we can no longer use GVN to perform simple conditional propagation, but that seems like an acceptable loss since we now have LVI and CorrelatedValuePropagation to pick up the slack. If you see conditional propagation that's not happening, please file bugs against LVI or CVP. llvm-svn: 119714
-
Chris Lattner authored
saying "it would be bad", give an example of what is going on. llvm-svn: 119695
-
Chris Lattner authored
refusing to optimize two memcpy's like this:

    copy A <- B
    copy C <- A

if it couldn't prove that noalias(B,C). We can eliminate the copy by producing a memmove instead of memcpy. llvm-svn: 119694
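
Roughly, in IR terms (buffer size, names and the intrinsic signature shown here are only illustrative):

    ; sketch: the intermediate buffer %A can be bypassed; since %B and %C were
    ; not proven noalias, the second copy is rewritten as a memmove from %B
    declare void @llvm.memcpy.p0i8.p0i8.i64(i8*, i8*, i64, i32, i1)

    define void @example(i8* %B, i8* %C) {
      %A = alloca [64 x i8]
      %a = getelementptr inbounds [64 x i8]* %A, i32 0, i32 0
      call void @llvm.memcpy.p0i8.p0i8.i64(i8* %a, i8* %B, i64 64, i32 1, i1 false)
      call void @llvm.memcpy.p0i8.p0i8.i64(i8* %C, i8* %a, i64 64, i32 1, i1 false)
      ; after the transform the second call becomes, in effect:
      ;   call void @llvm.memmove.p0i8.p0i8.i64(i8* %C, i8* %B, i64 64, i32 1, i1 false)
      ret void
    }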
-
Chris Lattner authored
source and dest are known to not overlap. llvm-svn: 119692
-
Chris Lattner authored
there is no need to check whether the source and dest of a memcpy are noalias; behavior is undefined if they are not. llvm-svn: 119691
-
Chris Lattner authored
llvm-svn: 119690
-
Chris Lattner authored
out of processMemCpy into its own function. llvm-svn: 119687
-
Chris Lattner authored
if it is passed as a byval argument. The byval argument will just be a read, so it is safe to read from the original global instead. This allows us to promote away the %agg.tmp alloca in PR8582 llvm-svn: 119686
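
A minimal sketch of the idea (the type, global and function names are invented; the real testcase is in PR8582):

    %pair = type { i32, i32 }
    @gv = constant %pair { i32 1, i32 2 }

    declare void @callee(%pair* byval)

    define void @caller() {
      ; before: an %agg.tmp-style alloca plus a memcpy from @gv, passed byval;
      ; after: the byval copy reads @gv directly and the alloca goes away
      call void @callee(%pair* byval @gv)
      ret void
    }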
-
Chris Lattner authored
to ignore calls that obviously can't modify the alloca because they are readonly/readnone. llvm-svn: 119683
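
For instance (a made-up declaration), a readonly callee cannot write through the pointer, so a call like this does not count as a modification of the alloca:

    declare void @peek(i8*) readonly

    define void @example() {
      %buf = alloca [16 x i8]
      %p = getelementptr inbounds [16 x i8]* %buf, i32 0, i32 0
      ; readonly: this call may read %buf but cannot modify it
      call void @peek(i8* %p)
      ret void
    }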
-
Chris Lattner authored
optimization. If the alloca that is "memcpy'd from constant" also has a memcpy from *it*, ignore it: it is a load. We now optimize the testcase to:

    define void @test2() {
      %B = alloca %T
      %a = bitcast %T* @G to i8*
      %b = bitcast %T* %B to i8*
      call void @llvm.memcpy.p0i8.p0i8.i64(i8* %b, i8* %a, i64 124, i32 4, i1 false)
      call void @bar(i8* %b)
      ret void
    }

previously we would generate:

    define void @test() {
      %B = alloca %T
      %b = bitcast %T* %B to i8*
      %G.0 = getelementptr inbounds %T* @G, i32 0, i32 0
      %tmp3 = load i8* %G.0, align 4
      %G.1 = getelementptr inbounds %T* @G, i32 0, i32 1
      %G.15 = bitcast [123 x i8]* %G.1 to i8*
      %1 = bitcast [123 x i8]* %G.1 to i984*
      %srcval = load i984* %1, align 1
      %B.0 = getelementptr inbounds %T* %B, i32 0, i32 0
      store i8 %tmp3, i8* %B.0, align 4
      %B.1 = getelementptr inbounds %T* %B, i32 0, i32 1
      %B.12 = bitcast [123 x i8]* %B.1 to i8*
      %2 = bitcast [123 x i8]* %B.1 to i984*
      store i984 %srcval, i984* %2, align 1
      call void @bar(i8* %b)
      ret void
    }

llvm-svn: 119682
-
- Nov 17, 2010
-
Dan Gohman authored
llvm-svn: 119570
-
Dan Gohman authored
functions of ScalarEvolution, in preparation for memoization and other optimizations. llvm-svn: 119562
-
Dan Gohman authored
to avoid an unneeded dependence. llvm-svn: 119557
-
Benjamin Kramer authored
llvm-svn: 119538
-
Duncan Sands authored
instructions out of InstCombine and into InstructionSimplify. While there, introduce an m_AllOnes pattern to simplify matching with integers and vectors with all bits equal to one. llvm-svn: 119536
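
The kinds of folds involved look roughly like this (contrived examples; the all-ones vector constant is the sort of operand m_AllOnes is meant to match):

    define i32 @and_allones(i32 %x) {
      ; x & -1  -->  x
      %r = and i32 %x, -1
      ret i32 %r
    }

    define <2 x i32> @or_allones(<2 x i32> %x) {
      ; x | <-1, -1>  -->  <-1, -1>
      %r = or <2 x i32> %x, <i32 -1, i32 -1>
      ret <2 x i32> %r
    }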
-
Duncan Sands authored
hasConstantValue. I was leery of using SimplifyInstruction while the IR was still in a half-baked state, which is the reason for delaying the simplification until the IR is fully cooked. llvm-svn: 119494
-
Duncan Sands authored
rather than hasConstantValue. llvm-svn: 119457
-
Duncan Sands authored
systematically, CollapsePhi will always return null here. Note that CollapsePhi did an extra check, isSafeReplacement, which the SimplifyInstruction logic does not do. I think that check was bogus - I guess we will soon find out! (It was originally added in commit 41998 without a testcase). llvm-svn: 119456
-
- Nov 16, 2010
-
Duncan Sands authored
rather than calling hasConstantValue. No intended functionality change. llvm-svn: 119352
-
- Nov 14, 2010
-
Duncan Sands authored
it to get better phi node simplification. llvm-svn: 119055
-
Duncan Sands authored
offload the work to hasConstantValue rather than do something more complicated (such as handling mutually recursive phis) because (1) it is not clear it is worth it; and (2) if it is worth it, maybe such logic would be better placed in hasConstantValue. Adjust some GVN tests which are now cleaned up much further (eg: all phi nodes are removed). llvm-svn: 119043
-
- Nov 13, 2010
-
Duncan Sands authored
SimplifyAssociativeOrCommutative) "(A op C1) op C2" -> "A op (C1 op C2)", which previously was only done if C1 and C2 were constants, to occur whenever "C1 op C2" simplifies (a la InstructionSimplify). Since the simplifying operand combination can no longer be assumed to be the right-hand terms, consider all of the possible permutations. When compiling "gcc as one big file", transform 2 (i.e. using right-hand operands) fires about 4000 times but it has to be said that most of the time the simplifying operands are both constants. Transforms 3, 4 and 5 each fired once. Transform 6, which is an existing transform that I didn't change, never fired. With this change, the testcase is now optimized perfectly with one run of instcombine (previously it required instcombine + reassociate + instcombine, and it may just have been luck that this worked). llvm-svn: 119002
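
As a small made-up example of the generalized transform firing with non-constant operands:

    define i32 @example(i32 %x, i32 %y) {
      %t = or i32 %x, %y
      ; (%x | %y) | %y  -->  %x | (%y | %y)  -->  %x | %y, i.e. just %t
      %r = or i32 %t, %y
      ret i32 %r
    }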
-
- Nov 12, 2010
-
Duncan Sands authored
"%z = %x and %y". If GVN can prove that %y equals %x, then it turns this into "%z = %x and %x". With the new code, %z will be replaced with %x everywhere (and then deleted). Previously %z would be value numbered too, which is a waste of time. Also, while a clever value numbering algorithm would give %z the same value number as %x, our current one doesn't do so (at least I don't think it does). The new logic has an essentially equivalent effect to what you would get if %z was given the same value number as %x, i.e. it should make value numbering smarter. While there, get hold of target data once at the start rather than a gazillion times all over the place. llvm-svn: 118923
-
Dan Gohman authored
one store dead. This is especially noticeable in SingleSource/Benchmarks/Shootout/objinst. llvm-svn: 118875
-
- Nov 11, 2010
-
Dan Gohman authored
and vaarg instructions. llvm-svn: 118845
-
Dan Gohman authored
testing for dereferenceable pointers into a helper function, isDereferenceablePointer. Teach it how to reason about GEPs with simple non-zero indices. Also eliminate ArgumentPromotion's IsAlwaysValidPointer, which didn't check for weak externals or out-of-range gep indices. llvm-svn: 118840
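
A sketch of the sort of pointer this now covers (sizes and names invented): a constant, in-bounds offset from an alloca is still known dereferenceable.

    define i32 @example() {
      %buf = alloca [4 x i32]
      ; non-zero but in-range index off a dereferenceable base
      %p = getelementptr inbounds [4 x i32]* %buf, i32 0, i32 2
      %v = load i32* %p
      ret i32 %v
    }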
-
Dan Gohman authored
llvm-svn: 118804
-
Dan Gohman authored
llvm-svn: 118788
-
Dan Gohman authored
execute. Make Sink's predicate more precise. llvm-svn: 118787
-
- Nov 10, 2010
-
Dan Gohman authored
references. For example, this allows gvn to eliminate the load in this example:

    void foo(int n, int* p, int *q) {
      p[0] = 0;
      p[1] = 1;
      if (n) {
        *q = p[0];
      }
    }

llvm-svn: 118714
-
Dan Gohman authored
instructions instead of hard-coding operand numbers. llvm-svn: 118698
-
Dan Gohman authored
it, and to be consistent. llvm-svn: 118692
-
Dan Gohman authored
arbitrary memory into a helper function, and adjust some comments. llvm-svn: 118687
-
Dale Johannesen authored
order to reduce ((x<<30)>>24) to x<<6, check the correct bits. PR 8547. llvm-svn: 118665
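
A quick made-up illustration of why the bit check matters: the two forms only agree when the bits discarded by the first shift are known to be zero.

    define i32 @shifts(i32 %x) {
      ; (%x << 30) >> 24 keeps only bits 1:0 of %x, landing at bits 7:6,
      ; so it equals %x << 6 only if bits 2..25 of %x are known zero;
      ; e.g. for %x = 4: ((4 << 30) lshr 24) = 0, but 4 << 6 = 256
      %a = shl i32 %x, 30
      %b = lshr i32 %a, 24
      ret i32 %b
    }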
-
Dan Gohman authored
chaining and simplify FunctionAttrs' GetModRefBehavior logic. llvm-svn: 118660
-