  1. Apr 22, 2008
    • Dig through multiple levels of AND to thread jumps if needed. · d5425e8f
      Chris Lattner authored
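      A sketch of the shape this now handles (hypothetical value names): the
      phi of i1 constants may sit several ANDs below the branch rather than
      feeding the condition directly, and the pass now digs through them.
      
      	%a = and i1 %x, %phival		; %phival is a phi of i1 constants
      	%b = and i1 %a, %y
      	br i1 %b, label %L1, label %L2
      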
      llvm-svn: 50106
    • Teach jump threading to thread through blocks like: · 3df4c15d
      Chris Lattner authored
        br (and X, phi(Y, Z, false)), label L1, label L2
      
      This triggers once on 252.eon and 6 times on 176.gcc.  The blocks
      in question often look like this:
      
      bb262:		; preds = %bb261, %bb248
      	%iftmp.251.0 = phi i1 [ true, %bb261 ], [ false, %bb248 ]		; <i1> [#uses=4]
      	%tmp270 = icmp eq %struct.rtx_def* %tmp.0.i, null		; <i1> [#uses=1]
      	%bothcond = or i1 %iftmp.251.0, %tmp270		; <i1> [#uses=1]
      	br i1 %bothcond, label %bb288, label %bb273
      
      In this case, it is clear that it doesn't matter whether tmp.0.i is null when coming from bb261: on that path %iftmp.251.0 is true, so the branch always goes to bb288.  When coming from bb248, the null test is all that matters.
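      
      After threading, the result looks roughly like this (a sketch; the
      exact block layout may differ): the %bb261 edge is redirected straight
      to %bb288, and the remaining block, now reached only from %bb248,
      folds the phi to false so the OR reduces to the compare.
      
      bb262:		; preds = %bb248
      	%tmp270 = icmp eq %struct.rtx_def* %tmp.0.i, null
      	br i1 %tmp270, label %bb288, label %bb273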
      
      
      Another random example:
      
      check_asm_operands.exit:		; preds = %check_asm_operands.exit.thr_comm, %bb30.i, %bb12.i, %bb6.i413
      	%tmp.0.i420 = phi i1 [ true, %bb6.i413 ], [ true, %bb12.i ], [ true, %bb30.i ], [ false, %check_asm_operands.exit.thr_comm ]		; <i1> [#uses=1]
      	call void @llvm.stackrestore( i8* %savedstack ) nounwind 
      	%tmp4389 = icmp eq i32 %added_sets_1.0, 0		; <i1> [#uses=1]
      	%tmp4394 = icmp eq i32 %added_sets_2.0, 0		; <i1> [#uses=1]
      	%bothcond80 = and i1 %tmp4389, %tmp4394		; <i1> [#uses=1]
      	%bothcond81 = and i1 %bothcond80, %tmp.0.i420		; <i1> [#uses=1]
      	br i1 %bothcond81, label %bb4398, label %bb4397
      
      Here is the case from 252.eon:
      
      bb290.i.i:		; preds = %bb23.i57.i.i, %bb8.i39.i.i, %bb100.i.i, %bb100.i.i, %bb85.i.i110
      	%myEOF.1.i.i = phi i1 [ true, %bb100.i.i ], [ true, %bb100.i.i ], [ true, %bb85.i.i110 ], [ true, %bb8.i39.i.i ], [ false, %bb23.i57.i.i ]		; <i1> [#uses=2]
      	%i.4.i.i = phi i32 [ %i.1.i.i, %bb85.i.i110 ], [ %i.0.i.i, %bb100.i.i ], [ %i.0.i.i, %bb100.i.i ], [ %i.3.i.i, %bb8.i39.i.i ], [ %i.3.i.i, %bb23.i57.i.i ]		; <i32> [#uses=3]
      	%tmp292.i.i = load i8* %tmp16.i.i100, align 1		; <i8> [#uses=1]
      	%tmp293.not.i.i = icmp ne i8 %tmp292.i.i, 0		; <i1> [#uses=1]
      	%bothcond.i.i = and i1 %tmp293.not.i.i, %myEOF.1.i.i		; <i1> [#uses=1]
      	br i1 %bothcond.i.i, label %bb202.i.i, label %bb301.i.i
      (The pass reports: Factoring out 3 common predecessors.)
      
      On the path from bb23.i57.i.i, %myEOF.1.i.i is known false, so the
      branch must go to bb301.i.i and the load and compare are dead.
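      
      Sketched result of threading that edge (the name of the duplicated
      block is hypothetical): %bothcond.i.i folds to false there, the load
      and icmp drop out, and %i.4.i.i is simply %i.3.i.i on this path.
      
      bb290.i.i.thread:		; preds = %bb23.i57.i.i
      	br label %bb301.i.i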
      
      llvm-svn: 50096
    • refactor some code, no functionality change. · e369c35a
      Chris Lattner authored
      llvm-svn: 50094
    • remove dead code. · 8fb13cbe
      Chris Lattner authored
      llvm-svn: 50080
    • optimize "p != gep p, ..." better. This allows us to compile · c3a43935
      Chris Lattner authored
      getelementptr-seteq.ll into:
      
      define i1 @test(i64 %X, %S* %P) {
      	%C = icmp eq i64 %X, -1		; <i1> [#uses=1]
      	ret i1 %C
      }
      
      instead of:
      
      define i1 @test(i64 %X, %S* %P) {
      	%A.idx.mask = and i64 %X, 4611686018427387903		; <i64> [#uses=1]
      	%C = icmp eq i64 %A.idx.mask, 4611686018427387903		; <i1> [#uses=1]
      	ret i1 %C
      }
      
      And fixes the second half of PR2235.  This speeds up the insertion sort
      case by 45%, from 1.12s to 0.77s.  In practice, this will significantly
      speed up for loops structured like:
      
      for (double *P = Base + N; P != Base; --P)
        ...
      
      Which happens frequently for C++ iterators.
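      
      The core fold, in minimal form (a sketch using a hypothetical i32
      element type; the actual test uses a struct): a pointer compared
      against a getelementptr off that same pointer becomes a compare of
      the index, with no address arithmetic at all.
      
      define i1 @cmp(i64 %X, i32* %P) {
      	%A = getelementptr i32* %P, i64 %X		; &P[X]
      	%C = icmp eq i32* %A, %P		; &P[X] == &P[0]
      	ret i1 %C
      }
      
      Instcombine can now reduce this to a direct compare of %X against 0
      instead of materializing the scaled, masked offset.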
      
      llvm-svn: 50079
  2. Apr 10, 2008
    • Teach InstCombine's ComputeMaskedBits to handle pointer expressions · 99b7b3f0
      Dan Gohman authored
      in addition to integer expressions. Rewrite GetOrEnforceKnownAlignment
      as a ComputeMaskedBits problem, moving all of its special alignment
      knowledge to ComputeMaskedBits as low-zero-bits knowledge.
      
      Also, teach ComputeMaskedBits a few basic things about Mul and PHI
      instructions.
      
      This improves ComputeMaskedBits-based simplifications in a few cases,
      but more noticeably it significantly improves instcombine's alignment
      detection for loads, stores, and memory intrinsics.
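      
      For instance (a hypothetical illustration): with a 16-byte-aligned
      base and an index scaled by 4, ComputeMaskedBits now knows the low
      two bits of the address are zero, so instcombine can raise the
      alignment on the load.
      
      @G = global [8 x i32] zeroinitializer, align 16
      
      define i32 @f(i64 %i) {
      	%p = getelementptr [8 x i32]* @G, i64 0, i64 %i
      	%v = load i32* %p, align 1		; can be retagged align 4
      	ret i32 %v
      }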
      
      llvm-svn: 49492
  3. Apr 02, 2008
    • · 586740f4
      David Greene authored
      Iterators following an erased SmallVector element are invalidated, so
      don't access cached iterators that point after the erased element.
      
      Re-apply 49056 with SmallVector support.
      
      llvm-svn: 49106
    • Reverting 49056 due to the build being broken. · 052838c5
      Tanya Lattner authored
      llvm-svn: 49060
    • · 7f7edc38
      David Greene authored
      Iterators following an erased SmallVector element are invalidated, so
      don't access cached iterators that point after the erased element.
      
      llvm-svn: 49056
  4. Mar 28, 2008
    • make memset inference significantly more powerful: it can now handle · d62964a7
      Chris Lattner authored
      memsets that initialize "structs of arrays" and other store sequences
      that are not sequential.  This is still only enabled if you pass 
      -form-memset-from-stores.  The flag is not heavily tested, and I haven't
      analyzed the performance impact of passing it either, but it causes no
      make check regressions.
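      
      A sketch of the new capability (hypothetical IR; this only fires with
      -form-memset-from-stores): stores that together cover a contiguous
      zeroed region, but are not emitted in address order, can be merged.
      
      define void @zero4(i32* %P) {
      	%p3 = getelementptr i32* %P, i64 3
      	store i32 0, i32* %p3
      	%p0 = getelementptr i32* %P, i64 0
      	store i32 0, i32* %p0
      	%p2 = getelementptr i32* %P, i64 2
      	store i32 0, i32* %p2
      	%p1 = getelementptr i32* %P, i64 1
      	store i32 0, i32* %p1
      	ret void
      }
      
      These four stores can become a single 16-byte llvm.memset even though
      they never appear in sequential order.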
      
      llvm-svn: 48909