Commits · b4df1d5a3ecf9981b63e28a14a97ddb93992e3af · Roger Ferrer / llvm-epi-0.8

Nov 30, 2010
- prune an llvmcontext include and simplify some code. · b4df1d5a
  Chris Lattner authored Nov 29, 2010
```
llvm-svn: 120347
```
  b4df1d5a
Nov 29, 2010
- fix PR8677, patch by Jakub Staszak! · 2e879348
  Chris Lattner authored Nov 29, 2010
```
llvm-svn: 120325
```
  2e879348
- Transform (extractvalue (load P), ...) to (load (gep P, 0, ...)) if the load... · 28218aa8
  Frits van Bommel authored Nov 29, 2010
```
Transform (extractvalue (load P), ...) to (load (gep P, 0, ...)) if the load has no other uses, shrinking the load.

llvm-svn: 120323
```
  28218aa8
Nov 27, 2010

Second attempt at fixing the performance regressions introduced · 8ba5f39f

Owen Anderson authored Nov 27, 2010

by my recent GVN improvement.  Looking through a single layer of
PHI nodes when attempting to sink GEPs, we need to iteratively
look through arbitrary PHI nests.

llvm-svn: 120202

8ba5f39f

Nov 24, 2010
- Treat a call of function pointer like a load of the pointer when considering · b8de00ee
  Nick Lewycky authored Nov 24, 2010
```
whether the pointer can be replaced with the global variable it is a copy of.
Fixes PR8680.

llvm-svn: 120126
```
  b8de00ee
Nov 23, 2010

Rename SimplifyDistributed to the more meaningfull name SimplifyByFactorizing. · 0488d564
Duncan Sands authored Nov 23, 2010
```
llvm-svn: 120051
```
0488d564
The srem -> urem transform is not safe for any divisor that's not a power of two. · 94a622af
Benjamin Kramer authored Nov 23, 2010
```
E.g. -5 % 5 is 0 with srem and 1 with urem.

Also addresses Frits van Bommel's comments.

llvm-svn: 120049
```
94a622af
Replace calls to ConstantFoldInstruction with calls to SimplifyInstruction · 433c1679
Duncan Sands authored Nov 23, 2010
```
in two places that are really interested in simplified instructions, not
constants.

llvm-svn: 120044
```
433c1679
Constant folding here is pointless, because InstructionSimplify · bb2cd025
Duncan Sands authored Nov 23, 2010
```
(which does constant folding and more) is called a few lines
later.

llvm-svn: 120042
```
bb2cd025
InstCombine: Reduce "X shift (A srem B)" to "X shift (A urem B)" iff B is positive. · b5afa65b
Benjamin Kramer authored Nov 23, 2010
```
This allows to transform the rem in "1 << ((int)x % 8);" to an and.

llvm-svn: 120028
```
b5afa65b
Propagate LeftDistributes and RightDistributes into their only uses. · 60813f96
Duncan Sands authored Nov 23, 2010
```
Stylistic improvement suggested by Frits van Bommel.

llvm-svn: 120026
```
60813f96
Fix typo pointed out by Frits van Bommel and Marius Wachtler. · 22df7416
Duncan Sands authored Nov 23, 2010
```
llvm-svn: 120025
```
22df7416

Exploit distributive laws (eg: And distributes over Or, Mul over Add, etc) in a · adc7771f

Duncan Sands authored Nov 23, 2010

fairly systematic way in instcombine. Some of these cases were already dealt
with, in which case I removed the existing code. The case of Add has a bunch of
funky logic which covers some of this plus a few variants (considers shifts to be
a form of multiplication), which I didn't touch. The simplification performed is:
A*B+A*C -> A*(B+C). The improvement is to do this in cases that were not already
handled [such as A*B-A*C -> A*(B-C), which was reported on the mailing list], and
also to do it more often by not checking for "only one use" if "B+C" simplifies.

llvm-svn: 120024

adc7771f

duncan's spider sense was right, I completely reversed the condition · e5afa15b
Chris Lattner authored Nov 23, 2010
```
on this instcombine xform.  This fixes a miscompilation of 403.gcc.

llvm-svn: 119988
```
e5afa15b

Nov 22, 2010
- InstCombine: Implement X - A*-B -> X + A*B. · f1ebb631
  Benjamin Kramer authored Nov 22, 2010
```
llvm-svn: 119984
```
  f1ebb631
- If a GEP index simply advances by multiples of a type of zero size, · c133c544
  Duncan Sands authored Nov 22, 2010
```
then replace the index with zero.

llvm-svn: 119974
```
  c133c544
- Move the "gep undef" -> "undef" transform from instcombine to · 8a0f486e
  Duncan Sands authored Nov 22, 2010
```
InstructionSimplify.

llvm-svn: 119970
```
  8a0f486e
- Don't keep track of inserted phis in PromoteMemoryToRegister: the information · c6648eb4
  Duncan Sands authored Nov 22, 2010
```
is never used.  Patch by Cameron Zwarich.

llvm-svn: 119963
```
  c6648eb4
Nov 21, 2010

fix comment · fc9aead6
Chris Lattner authored Nov 21, 2010
```
llvm-svn: 119948
```
fc9aead6

rework some DSE paths to use the newly-public "getPointerDependencyFrom" · 59572296

Chris Lattner authored Nov 21, 2010

method in MemDep instead of inserting an instruction, doing a query,
then removing it.  Neither operation is effectively cached.

llvm-svn: 119930

59572296

implement PR8576, deleting dead stores with intervening may-alias stores. · e48c31ce
Chris Lattner authored Nov 21, 2010
```
llvm-svn: 119927
```
e48c31ce

optimize: · f7e89613

Chris Lattner authored Nov 21, 2010

void a(int x) { if (((1<<x)&8)==0) b(); }

into "x != 3", which occurs over 100 times in 403.gcc but in no
other program in llvm-test.

llvm-svn: 119922

f7e89613

Implement PR8644: forwarding a memcpy value to a byval, · 58f9f587

Chris Lattner authored Nov 21, 2010

allowing the memcpy to be eliminated.

Unfortunately, the requirements on byval's without explicit 
alignment are really weak and impossible to predict in the 
mid-level optimizer, so this doesn't kick in much with current
frontends.  The fix is to change clang to set alignment on all
byval arguments.

llvm-svn: 119916

58f9f587

Nov 20, 2010
- Simplify code. No change in functionality. · ddd1b7b8
  Benjamin Kramer authored Nov 20, 2010
```
llvm-svn: 119908
```
  ddd1b7b8
Nov 19, 2010
- Document the new GVN number table structure. · ea326db4
  Owen Anderson authored Nov 19, 2010
```
llvm-svn: 119865
```
  ea326db4
- When folding addressing modes in CodeGenPrepare, attempt to look through PHI nodes · dfb8c3bb
  Owen Anderson authored Nov 19, 2010
```
if all the operands of the PHI are equivalent.  This allows CodeGenPrepare to undo
unprofitable PRE transforms.

llvm-svn: 119853
```
  dfb8c3bb
Nov 18, 2010

Factor code for testing whether replacing one value with another · aef146b8

Duncan Sands authored Nov 18, 2010

preserves LCSSA form out of ScalarEvolution and into the LoopInfo
class.  Use it to check that SimplifyInstruction simplifications
are not breaking LCSSA form.  Fixes PR8622.

llvm-svn: 119727

aef146b8

Completely rework the datastructure GVN uses to represent the value number to... · c21c100f

Owen Anderson authored Nov 18, 2010

Completely rework the datastructure GVN uses to represent the value number to leader mapping. Previously,
this was a tree of hashtables, and a query recursed into the table for the immediate dominator ad infinitum
if the initial lookup failed. This led to really bad performance on tall, narrow CFGs.

We can instead replace it with what is conceptually a multimap of value numbers to leaders (actually
represented by a hashtable with a list of Value*'s as the value type), and then
determine which leader from that set to use very cheaply thanks to the DFS numberings maintained by
DominatorTree. Because there are typically few duplicates of a given value, this scan tends to be
quite fast. Additionally, we use a custom linked list and BumpPtr allocation to avoid any unnecessary
allocation in representing the value-side of the multimap.

This change brings with it a 15% (!) improvement in the total running time of GVN on 403.gcc, which I
think is pretty good considering that includes all the "real work" being done by MemDep as well.

The one downside to this approach is that we can no longer use GVN to perform simple conditional progation,
but that seems like an acceptable loss since we now have LVI and CorrelatedValuePropagation to pick up
the slack. If you see conditional propagation that's not happening, please file bugs against LVI or CVP.

llvm-svn: 119714

c21c100f

slightly simplify code and substantially improve comment. Instead of · 1385dff8
Chris Lattner authored Nov 18, 2010
```
saying "it would be bad", give an example of what is going on.

llvm-svn: 119695
```
1385dff8

remove a pointless restriction from memcpyopt. It was · 731caac7

Chris Lattner authored Nov 18, 2010

refusing to optimize two memcpy's like this:

copy A <- B
copy C <- A

if it couldn't prove that noalias(B,C).  We can eliminate
the copy by producing a memmove instead of memcpy.

llvm-svn: 119694

731caac7

remove another pointless noalias check: M is a memcpy, so the · c274a834
Chris Lattner authored Nov 18, 2010
```
source and dest are known to not overlap.

llvm-svn: 119692
```
c274a834
use AA::isNoAlias instead of open coding it. Remove an extraneous noalias check: · 75cfe985
Chris Lattner authored Nov 18, 2010
```
there is no need to check to see if the source and dest of a memcpy are noalias,
behavior is undefined if not.

llvm-svn: 119691
```
75cfe985
finish a thought. · 1e37bbaf
Chris Lattner authored Nov 18, 2010
```
llvm-svn: 119690
```
1e37bbaf
rearrange some code, splitting memcpy/memcpy optimization · 7e9b2ea3
Chris Lattner authored Nov 18, 2010
```
out of processMemCpy into its own function.

llvm-svn: 119687
```
7e9b2ea3

allow eliminating an alloca that is just copied from an constant global · ac570131

Chris Lattner authored Nov 18, 2010

if it is passed as a byval argument.  The byval argument will just be a
read, so it is safe to read from the original global instead.  This allows
us to promote away the %agg.tmp alloca in PR8582

llvm-svn: 119686

ac570131

enhance the "alloca is just a memcpy from constant global" · f183d5c4
Chris Lattner authored Nov 18, 2010
```
to ignore calls that obviously can't modify the alloca
because they are readonly/readnone.

llvm-svn: 119683
```
f183d5c4

fix a small oversight in the "eliminate memcpy from constant global" · 7aeae25c

Chris Lattner authored Nov 18, 2010

optimization.  If the alloca that is "memcpy'd from constant" also has
a memcpy from *it*, ignore it: it is a load.  We now optimize the testcase to:

define void @test2() {
  %B = alloca %T
  %a = bitcast %T* @G to i8*
  %b = bitcast %T* %B to i8*
  call void @llvm.memcpy.p0i8.p0i8.i64(i8* %b, i8* %a, i64 124, i32 4, i1 false)
  call void @bar(i8* %b)
  ret void
}

previously we would generate:

define void @test() {
  %B = alloca %T
  %b = bitcast %T* %B to i8*
  %G.0 = getelementptr inbounds %T* @G, i32 0, i32 0
  %tmp3 = load i8* %G.0, align 4
  %G.1 = getelementptr inbounds %T* @G, i32 0, i32 1
  %G.15 = bitcast [123 x i8]* %G.1 to i8*
  %1 = bitcast [123 x i8]* %G.1 to i984*
  %srcval = load i984* %1, align 1
  %B.0 = getelementptr inbounds %T* %B, i32 0, i32 0
  store i8 %tmp3, i8* %B.0, align 4
  %B.1 = getelementptr inbounds %T* %B, i32 0, i32 1
  %B.12 = bitcast [123 x i8]* %B.1 to i8*
  %2 = bitcast [123 x i8]* %B.1 to i984*
  store i984 %srcval, i984* %2, align 1
  call void @bar(i8* %b)
  ret void
}

llvm-svn: 119682

7aeae25c

Nov 17, 2010
- Move SCEV::dominates and properlyDominates to ScalarEvolution. · 20d9ce21
  Dan Gohman authored Nov 17, 2010
```
llvm-svn: 119570
```
  20d9ce21
- Move SCEV::isLoopInvariant and hasComputableLoopEvolution to be member · afd6db99
  Dan Gohman authored Nov 17, 2010
```
functions of ScalarEvolution, in preparation for memoization and
other optimizations.

llvm-svn: 119562
```
  afd6db99
- Reference ScalarEvolution by name rather than directly in LICM, · 1ee6d240
  Dan Gohman authored Nov 17, 2010
```
to avoid an unneeded dependence.

llvm-svn: 119557
```
  1ee6d240