Commits · 4755d9df786f8fd699fc4aee988c74af4c8ea8f6 · Lorenzo Albano / LLVM bpEVL

Jan 09, 2009
- Adjustments to last patch based on review. · 4755d9df
  Dale Johannesen authored Jan 09, 2009
```
llvm-svn: 61969
```
  4755d9df
Jan 08, 2009

Do not inline functions with (dynamic) alloca into · b48fc71f

Dale Johannesen authored Jan 08, 2009

functions that don't already have a (dynamic) alloca.
Dynamic allocas cause inefficient codegen and we shouldn't
propagate this (behavior follows gcc).  Two existing tests
assumed such inlining would be done; they are hacked by
adding an alloca in the caller, preserving the point of
the tests.

llvm-svn: 61946

b48fc71f
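
The rule in this commit message can be read as a simple predicate on caller and callee. The sketch below is only an illustration of that reading: `FunctionSummary` and `mayInline` are hypothetical names, not LLVM's inliner API, and a real inliner weighs many other factors.

```cpp
#include <cassert>

// Hypothetical per-function summary; not an LLVM data structure.
struct FunctionSummary {
    bool hasDynamicAlloca;  // true if the function contains a dynamically sized alloca
};

// Returns false only in the case the commit message rules out: the callee has
// a dynamic alloca and the caller does not already have one.
bool mayInline(const FunctionSummary &caller, const FunctionSummary &callee) {
    if (callee.hasDynamicAlloca && !caller.hasDynamicAlloca)
        return false;
    return true;
}
```

Under this reading, adding an alloca to the caller (as the two adjusted tests do) makes the predicate return true again.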

This implements the second half of the fix for PR3290, handling · c518dfd1

Chris Lattner authored Jan 08, 2009

loads from allocas that cover the entire aggregate.  This handles
some memcpy/byval cases that are produced by llvm-gcc.  This triggers
a few times in kc++ (with std::pair<std::_Rb_tree_const_iterator
<kc::impl_abstract_phylum*>,bool>) and once in 176.gcc (with %struct..0anon).

llvm-svn: 61915

c518dfd1
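
To illustrate the load case, the sketch below rebuilds one wide integer from the individual fields of a small struct, which is roughly what a whole-aggregate integer load has to become once the aggregate has been split into scalars. It assumes a little-endian layout with two 32-bit fields; `TwoInts` and `mergeFields` are hypothetical names, not LLVM code.

```cpp
#include <cassert>
#include <cstdint>

struct TwoInts {
    uint32_t first;   // occupies the low 32 bits on a little-endian target
    uint32_t second;  // occupies the high 32 bits
};

// Rebuild the 64-bit value that a single load of the whole struct would see.
uint64_t mergeFields(const TwoInts &v) {
    return (static_cast<uint64_t>(v.second) << 32) | v.first;
}
```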

Jan 07, 2009

Whitespace - correct formatting. · 0bcf0858
Duncan Sands authored Jan 07, 2009
```
llvm-svn: 61879
```
0bcf0858

Remove alloca tracking from nocapture analysis. Not only · 289f59f2

Duncan Sands authored Jan 07, 2009

was it not very helpful, it was also wrong!  The problem
is shown in the testcase: the alloca might be passed to
a nocapture callee which dereferences it and returns the
original pointer.  Because it was a nocapture call we
thought we didn't need to track uses of its result, but we do.

llvm-svn: 61876

289f59f2
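
One way to picture the problem described in this message, written as plain C++ rather than LLVM IR. This is an interpretation of the commit message, not the actual testcase; `g_escaped`, `loadFromSlot`, and `leaksArgument` are hypothetical names.

```cpp
#include <cassert>

int *g_escaped = nullptr;  // stands in for any location that outlives the call

// Reads the pointer held in *slot and returns it. The slot itself (the
// "alloca") is not captured, but the pointer stored in it comes back out.
int *loadFromSlot(int **slot) {
    return *slot;
}

// p is stored to a local slot, and the slot is passed to loadFromSlot.
// An analysis that stops at that call misses that the result is p itself,
// which is then captured by the store to g_escaped.
void leaksArgument(int *p) {
    int *slot = p;
    g_escaped = loadFromSlot(&slot);
}
```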

Reorder these. · 94bcbbab
Duncan Sands authored Jan 07, 2009
```
llvm-svn: 61873
```
94bcbbab
Use a switch rather than a sequence of "isa" tests. · 02599850
Duncan Sands authored Jan 07, 2009
```
llvm-svn: 61872
```
02599850
The verifier checks that the aliasee is not null. · 187c5716
Duncan Sands authored Jan 07, 2009
```
llvm-svn: 61870
```
187c5716

Implement the first half of PR3290: if there is a store of an · f2b8c82a

Chris Lattner authored Jan 07, 2009

integer to a (transitive) bitcast of the alloca and if that integer
has the full size of the alloca, then it clobbers the whole thing.
Handle this by extracting pieces out of the stored integer and 
filing them away in the SROA'd elements.

This triggers fairly frequently because the CFE uses integers to
pass small structs by value and the inliner exposes these.  For 
example, in kimwitu++, I see a bunch of these with i64 stores to
"%struct.std::pair<std::_Rb_tree_const_iterator<kc::impl_abstract_phylum*>,bool>"

In 176.gcc I see a few i32 stores to "%struct..0anon".

In the testcase, this is the difference between compiling test1 to:

_test1:
	subl	$12, %esp
	movl	20(%esp), %eax
	movl	%eax, 4(%esp)
	movl	16(%esp), %eax
	movl	%eax, (%esp)
	movl	(%esp), %eax
	addl	4(%esp), %eax
	addl	$12, %esp
	ret

vs:

_test1:
	movl	8(%esp), %eax
	addl	4(%esp), %eax
	ret

The second half of this will be to handle loads of the same form.

llvm-svn: 61853

f2b8c82a
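
The "extracting pieces out of the stored integer" step can be sketched in plain C++ as a shift-and-truncate per field. This is a minimal illustration assuming a little-endian layout and a struct of two 32-bit fields; `TwoInts` and `splitStoredInteger` are hypothetical names, and the real transformation operates on LLVM IR rather than on C++ values.

```cpp
#include <cassert>
#include <cstdint>

struct TwoInts {
    uint32_t first;
    uint32_t second;
};

// Split a 64-bit store that covers the whole struct into per-field values.
TwoInts splitStoredInteger(uint64_t wide) {
    TwoInts v;
    v.first = static_cast<uint32_t>(wide);         // low 32 bits
    v.second = static_cast<uint32_t>(wide >> 32);  // high 32 bits
    return v;
}
```

Once each field is an independent scalar, a function that only adds the two fields no longer needs a stack slot, which matches the shorter assembly listed in the message.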

Factor a bunch of code out into a helper method. · 9a2de65f
Chris Lattner authored Jan 07, 2009
```
llvm-svn: 61852
```
9a2de65f
use continue to simplify code and reduce nesting, no functionality · db561146
Chris Lattner authored Jan 07, 2009
```
change.

llvm-svn: 61851
```
db561146
Get TargetData once up front and cache as an ivar instead of · 938b54f3
Chris Lattner authored Jan 07, 2009
```
requerying it all over the place.

llvm-svn: 61850
```
938b54f3
Use the hasAllZeroIndices predicate to simplify some · a63dba9e
Chris Lattner authored Jan 07, 2009
```
code, no functionality change.

llvm-svn: 61849
```
a63dba9e

Jan 06, 2009

Change m_ConstantInt and m_SelectCst to take their constant integers · 2fdcc59b

Chris Lattner authored Jan 05, 2009

as template arguments instead of as instance variables, exposing more
optimization opportunities to the compiler earlier.

llvm-svn: 61776

2fdcc59b
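
The difference between the two styles can be sketched with a pair of toy matchers. These are hypothetical stand-ins written for illustration, not the definitions in LLVM's pattern-matching header.

```cpp
#include <cassert>
#include <cstdint>

// Constant held as an instance variable: the comparison reads a member.
struct RuntimeConstantMatcher {
    int64_t expected;
    bool match(int64_t v) const { return v == expected; }
};

// Constant supplied as a template argument: it is part of the type, so the
// compiler sees a comparison against a literal.
template <int64_t Expected>
struct StaticConstantMatcher {
    bool match(int64_t v) const { return v == Expected; }
};

// Hypothetical factory in the style of the m_... helpers.
template <int64_t Expected>
StaticConstantMatcher<Expected> matchConstant() {
    return StaticConstantMatcher<Expected>();
}
```

Both forms answer the same question; the template form simply makes the constant visible at compile time, which is the optimization opportunity the message refers to.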

Jan 05, 2009
- Teach the internalize pass to also internalize · 582c53d1
  Duncan Sands authored Jan 05, 2009
```
global aliases.

llvm-svn: 61754
```
  582c53d1
- Find loop back edges only after empty blocks are eliminated. · 8804293f
  Evan Cheng authored Jan 05, 2009
```
llvm-svn: 61752
```
  8804293f
- Not having an aliasee is a theoretical possibility. · 52e5deec
  Duncan Sands authored Jan 05, 2009
```
llvm-svn: 61745
```
  52e5deec
- Format more neatly. · 821d13cf
  Duncan Sands authored Jan 05, 2009
```
llvm-svn: 61744
```
  821d13cf
- Remove trailing spaces. · d24b93f3
  Duncan Sands authored Jan 05, 2009
```
llvm-svn: 61743
```
  d24b93f3
- Delete unused global aliases with internal linkage. · f5dbbae4
  Duncan Sands authored Jan 05, 2009
```
In fact this also deletes those with linkonce linkage;
however, this is currently dead code because for the moment
aliases aren't allowed to have this linkage type.

llvm-svn: 61742
```
  f5dbbae4
- Tidy up #includes, deleting a bunch of unnecessary #includes. · 906152a2
  Dan Gohman authored Jan 05, 2009
```
llvm-svn: 61715
```
  906152a2
- Move the libcall annotating part from doFinalization to doInitialization. · e4e5532e
  Nick Lewycky authored Jan 05, 2009
```
Finalization occurs after all the FunctionPasses in the group have run, which
is clearly not what we want.

This also means that we have to make sure that we apply the right param 
attributes when creating a new function.

Also, add a missed optimization for strdup and strndup: NoCapture
argument and NoAlias return!

llvm-svn: 61658
```
  e4e5532e
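
The ordering issue behind the doFinalization-to-doInitialization move can be shown with a toy pass group that only records when each hook fires. `PassGroup` is a hypothetical stand-in, not LLVM's pass manager.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for a group of function passes; it only records the
// order in which the hooks fire.
struct PassGroup {
    std::vector<std::string> trace;

    void doInitialization() { trace.push_back("init"); }
    void runOnFunction(const std::string &name) { trace.push_back("run:" + name); }
    void doFinalization() { trace.push_back("final"); }

    // Initialization runs before any function is visited, finalization only
    // after every function has been visited.
    void runAll(const std::vector<std::string> &functions) {
        doInitialization();
        for (const std::string &name : functions)
            runOnFunction(name);
        doFinalization();
    }
};
```

Anything done in the finalization hook arrives after all per-function work, so annotations that later passes in the group should see have to be applied in the initialization hook.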
Jan 04, 2009
- Run a post-pass that marks known function declarations by name. · 959af7ba
  Nick Lewycky authored Jan 04, 2009
```
llvm-svn: 61632
```
  959af7ba
- Revert this transform. It was causing some dramatic slowdowns in a few tests. See PR3266. · 0c04f9fd
  Bill Wendling authored Jan 04, 2009
```
llvm-svn: 61623
```
  0c04f9fd
Jan 03, 2009
- Any void readonly functions are provably dead, don't waste time adding · 1d805c62
  Nick Lewycky authored Jan 03, 2009
```
nocapture attributes to them.

llvm-svn: 61610
```
  1d805c62
Jan 02, 2009

Load tracking means that the value analyzed may · c7affb0a

Duncan Sands authored Jan 02, 2009

not have pointer type.  In particular, it may
be the condition argument for a select or a GEP
index.  While I was unable to construct a testcase
for which some bits of the original pointer are
captured due to one of these, it's very very close
to being possible - so play safe and exclude these
possibilities.

llvm-svn: 61580

c7affb0a

When calculating 'nocapture' argument attributes, allow · b193a37c

Duncan Sands authored Jan 02, 2009

the argument to be stored to an alloca by tracking uses
of the alloca.  This occurs 4 times (out of 7121, 0.05%)
in MultiSource/Applications, so may not be worth it.  On
the other hand, it is easy to do and fairly cheap.  The
functions it helps are: W_addcom and W_addlit in spiff;
process_args (argv) in d (make_dparser); ercPixConcealIMB
in JM/ldecod.

llvm-svn: 61570

b193a37c

Improve comments and reorganize a bit - no functionality · cefc8604
Duncan Sands authored Jan 02, 2009
```
change.

llvm-svn: 61569
```
cefc8604

Make adding nocapture a bit stronger. FreeInst is nocapture. Also, · 7e82055e

Nick Lewycky authored Jan 02, 2009

functions that don't write can't leak a pointer except through 
the return value, so a void readonly function is implicitly nocapture.

Test these, and add a test that verifies that f1 calling f2 with an 
otherwise dead pointer gets both of them marked nocapture.

llvm-svn: 61552

7e82055e

Jan 01, 2009
- Mention that this pass does escape analysis in the · 1f11d2bb
  Duncan Sands authored Jan 01, 2009
```
leading comments.

llvm-svn: 61548
```
  1f11d2bb
- Fix comment. · 0fcff2c2
  Bill Wendling authored Jan 01, 2009
```
llvm-svn: 61538
```
  0fcff2c2
- Add transformation: · aedb54a9
  Bill Wendling authored Jan 01, 2009
```
 xor (or (icmp, icmp), true) -> and(icmp', icmp')

where icmp' is icmp with its predicate inverted.  This is possible
because of De Morgan's law.

llvm-svn: 61537
```
  aedb54a9
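
As a quick check of the De Morgan step, the sketch below writes both sides of the rewrite as ordinary C++ functions, using `<` as an example comparison. The right-hand side uses the inverted comparison `>=`. The function names are hypothetical and the example says nothing about how the transform is implemented in LLVM.

```cpp
#include <cassert>

// Left-hand side: (a < b || c < d) xor true, i.e. the negated "or".
bool beforeTransform(int a, int b, int c, int d) {
    return ((a < b) || (c < d)) != true;
}

// Right-hand side: "and" of the two comparisons with inverted predicates.
bool afterTransform(int a, int b, int c, int d) {
    return (a >= b) && (c >= d);
}
```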
Dec 31, 2008
- Look through phi nodes and select instructions when · 16384802
  Duncan Sands authored Dec 31, 2008
```
calculating nocapture attributes.

llvm-svn: 61535
```
  16384802
- Don't analyze arguments already marked 'nocapture'. · df128eb4
  Duncan Sands authored Dec 31, 2008
```
llvm-svn: 61532
```
  df128eb4
- Rename AddReadAttrs to FunctionAttrs, and teach it how · 44c8cd97
  Duncan Sands authored Dec 31, 2008
```
to work out (in a very simplistic way) which function
arguments (pointer arguments only) are only dereferenced
and so do not escape.  Mark such arguments 'nocapture'.

llvm-svn: 61525
```
  44c8cd97
Dec 29, 2008

Experiments show that looking through phi nodes · f6069577

Duncan Sands authored Dec 29, 2008

and select instructions doesn't buy anything here
except extra complexity: the only difference in
the entire testsuite was that a readonly function
became readnone in MiBench/consumer-typeset.  Add
a comment about this.

llvm-svn: 61478

f6069577

Allow readnone functions to read (and write!) global · c125d6a3

Duncan Sands authored Dec 29, 2008

constants, since doing so is irrelevant for aliasing
purposes.  While this doesn't increase the total number
of functions marked readonly or readnone in MultiSource/
Applications (3089), it does result in 12 functions being
marked readnone rather than readonly.
Before:
  readnone: 820
  readonly: 2269
After:
  readnone: 832
  readonly: 2257

llvm-svn: 61469

c125d6a3

Dec 24, 2008
- Revert 61362 and 61402 until SPEC breakage is fixed. · 656237be
  Dale Johannesen authored Dec 23, 2008
```
llvm-svn: 61403
```
  656237be
- This fixes the bug in 175.vpr. It doesn't fix the · f8b161bc
  Dale Johannesen authored Dec 23, 2008
```
other SPEC breakage.  I'll be reverting all recent
changes shortly; this checkin is mostly so this
change doesn't get lost.

llvm-svn: 61402
```
  f8b161bc
Dec 23, 2008

Fix the time regression I introduced in 464.h264ref with · 93b9aa87

Dale Johannesen authored Dec 23, 2008

my last patch to this file.

The issue there was that all uses of an IV inside a loop
are actually references to Base[IV*2], and there was one
use outside that was the same but LSR didn't see the base
or the scaling because it didn't recurse into uses outside
the loop; thus, it used base+IV*scale mode inside the loop
instead of pulling base out of the loop.  This was extra bad
because register pressure later forced both base and IV into
memory.  Doing that recursion, at least enough
to figure out addressing modes, is a good idea in general;
the change in AddUsersIfInteresting does this.  However,
there were side effects....

It is also possible for recursing outside the loop to
introduce another IV where there was only 1 before (if
the refs inside are not scaled and the ref outside is).
I don't think this is a common case, but it's in the testsuite.
It is right to be very aggressive about getting rid of
such introduced IVs (CheckForIVReuse and the handling of
nonzero RewriteFactor in StrengthReduceStridedIVUsers).
In the testcase in question the new IV produced this way
has both a nonconstant stride and a nonzero base, neither
of which was handled before.  And when inserting 
new code that feeds into a PHI, it's right to put such 
code at the original location rather than in the PHI's 
immediate predecessor(s) when the original location is outside 
the loop (a case that couldn't happen before)
(RewriteInstructionToUseNewBase); better to avoid making
multiple copies of it in this case.

Also, the mechanism for keeping SCEVs corresponding to GEPs
no longer works, as the GEP might change after its SCEV
is remembered, invalidating the SCEV, and we might get a bad
SCEV value when looking up the GEP again for a later loop.
This also couldn't happen before, as we weren't recursing
into GEPs outside the loop.

I owe some testcases for this; I want to get it in for the nightly runs.

llvm-svn: 61362

93b9aa87
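
For readers unfamiliar with the Base[IV*2] pattern, the sketch below shows the two shapes a loop over such references can take: recomputing a scaled address from the induction variable on every iteration, or stepping a pointer. It is a generic strength-reduction illustration with hypothetical function names, not the code this commit changes, and it assumes `base` points to at least `2 * n` elements.

```cpp
#include <cassert>
#include <cstddef>

// Every access is base[iv * 2]: each iteration forms a base + scaled-index address.
long sumScaledIndex(const int *base, std::size_t n) {
    long sum = 0;
    for (std::size_t iv = 0; iv < n; ++iv)
        sum += base[iv * 2];
    return sum;
}

// Strength-reduced form: a pointer advances by two elements per iteration,
// so no multiplication or scaled addressing is needed inside the loop.
long sumSteppedPointer(const int *base, std::size_t n) {
    long sum = 0;
    const int *p = base;
    for (std::size_t iv = 0; iv < n; ++iv, p += 2)
        sum += *p;
    return sum;
}
```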