Commits · 27dfb1e1a4e6d3c36e0ade15792dbd1487c7a931 · Roger Ferrer / llvm-epi-0.8

Feb 05, 2010

Do not reassociate expressions with i1 type. SimplifyCFG converts some · 27dfb1e1

Bob Wilson authored Feb 04, 2010

short-circuited conditions to AND/OR expressions, and those expressions
are often converted back to a short-circuited form in code gen.  The
original source order may have been optimized to take advantage of the
expected values, and if we reassociate them, we change the order and
subvert that optimization.  Radar 7497329.

llvm-svn: 95333

27dfb1e1

Feb 03, 2010

Adjust the heuristics used to decide when SROA is likely to be profitable. · 04365c5f

Bob Wilson authored Feb 03, 2010

The SRThreshold value makes perfect sense for checking if an entire aggregate
should be promoted to a scalar integer, but it is not so good for splitting
an aggregate into its separate elements. A struct may contain a large embedded
array along with some scalar fields that would benefit from being split apart
by SROA. Even if the total aggregate size is large, it may still be good to
perform SROA. Thus, the most important piece of this patch is simply moving
the aggregate size comparison vs. SRThreshold so that it guards only the
aggregate promotion.

We have also been checking the number of elements to decide if an aggregate
should be split up. The limit of "SRThreshold/4" seemed rather arbitrary,
and I don't think it's very useful to derive this limit from SRThreshold
anyway. I've collected some data showing that the current default limit of
32 (since SRThreshold defaults to 128) is a reasonable cutoff for struct
types. One thing suggested by the data is that distinguishing between structs
and arrays might be useful. There are (obviously) a lot more large arrays
than large structs (as measured by the number of elements and not the total
size -- a large array inside a struct still counts as a single element given
the way we do SROA right now). Out of 8377 arrays where we successfully
performed SROA while compiling a large set of benchmarks, only 16 of them had
more than 8 elements. And, for those 16 arrays, it's not at all clear that
SROA was actually beneficial. So, to offset the compile time cost of
investigating more large structs for SROA, the patch lowers the limit on array
elements to 8.

This fixes Apple Radar 7563690.

llvm-svn: 95224

04365c5f

Revert 94937 and move the noreturn check to codegen. · 27a41d54
Evan Cheng authored Feb 03, 2010
```
llvm-svn: 95198
```
27a41d54
Fix some comment typos. · 76e8c595
Bob Wilson authored Feb 03, 2010
```
llvm-svn: 95170
```
76e8c595
Recommit this, looks like it wasn't the cause. · d86233c1
Eric Christopher authored Feb 03, 2010
```
llvm-svn: 95165
```
d86233c1
Hopefully temporarily revert this. · e67d01a9
Eric Christopher authored Feb 02, 2010
```
llvm-svn: 95154
```
e67d01a9

Feb 02, 2010
- Re-add strcmp and known size object size checking optimization. · 4264e7e4
  Eric Christopher authored Feb 02, 2010
```
Passed bootstrap and nightly test run here.

llvm-svn: 95145
```
  4264e7e4
- fix a crash in loop unswitch on a loop invariant vector condition. · 302240d7
  Chris Lattner authored Feb 02, 2010
```
llvm-svn: 95055
```
  302240d7
- Don't need to check the last argument since it'll always be bool. We also · 14dfc3f6
  Eric Christopher authored Feb 02, 2010
```
don't use TargetData here.

llvm-svn: 95040
```
  14dfc3f6
- More indentation/tabification fixes. · 9afa9732
  Eric Christopher authored Feb 02, 2010
```
llvm-svn: 95036
```
  9afa9732
- Untabify previous commit. · 14082347
  Eric Christopher authored Feb 02, 2010
```
llvm-svn: 95035
```
  14082347
- Formatting. · 56e4182c
  Eric Christopher authored Feb 01, 2010
```
llvm-svn: 95027
```
  56e4182c
Feb 01, 2010

Add an option to GVN to remove all partially redundant loads. This is currently · d517b520

Bob Wilson authored Feb 01, 2010

disabled by default.  This divides the existing load PRE code into 2 phases:
first it checks that it is safe to move the load to each of the predecessors
where it is unavailable, and then if it is safe, the code is changed to move
the load.  Radar 7571861.

llvm-svn: 95007

d517b520

Jan 31, 2010

Do not mark no-return calls tail calls. It'll screw up special calls like... · d86d3fe0

Evan Cheng authored Jan 31, 2010

Do not mark no-return calls tail calls. It'll screw up special calls like longjmp and it doesn't make much sense for performance reason. If my logic is faulty, please let me know.

llvm-svn: 94937

d86d3fe0

Jan 30, 2010

Check alignment of loads when deciding whether it is safe to execute them · 56600a15

Bob Wilson authored Jan 30, 2010

unconditionally.  Besides checking the offset, also check that the underlying
object is aligned as much as the load itself.

llvm-svn: 94875

56600a15

Jan 29, 2010
- Revert my last couple of patches. They appear to have broken bison. · 5a0e1748
  Eric Christopher authored Jan 29, 2010
```
llvm-svn: 94841
```
  5a0e1748
- Improve isSafeToLoadUnconditionally to recognize that GEPs with constant · 7c42b9d5
  Bob Wilson authored Jan 29, 2010
```
indices are safe if the result is known to be within the bounds of the
underlying object.

llvm-svn: 94829
```
  7c42b9d5
- Make strcpy_chk lower to strcpy if we have a safe size. · 9b3c02b7
  Eric Christopher authored Jan 29, 2010
```
llvm-svn: 94783
```
  9b3c02b7
- Generic reformatting and comment fixing. No functionality change. · 48816a0b
  Bill Wendling authored Jan 29, 2010
```
llvm-svn: 94771
```
  48816a0b
- Add newline to debugging output, and fix some grammar-os in comment. · 8277838c
  Bill Wendling authored Jan 29, 2010
```
llvm-svn: 94765
```
  8277838c
Jan 27, 2010
- Use the less expensive getName function instead of getNameStr. · 40582a89
  Benjamin Kramer authored Jan 27, 2010
```
llvm-svn: 94683
```
  40582a89
Jan 25, 2010

Remove check for an impossible condition: the condition of the while loop has · 70c8fe5e
Bob Wilson authored Jan 25, 2010
```
already checked that TmpBB->getSinglePredecessor() is non-null.

llvm-svn: 94451
```
70c8fe5e

Change Value::getUnderlyingObject to have the MaxLookup value specified as a · fc060e43

Bob Wilson authored Jan 25, 2010

parameter with a default value, instead of just hardcoding it in the
implementation.  The limit of MaxLookup = 6 was introduced in r69151 to fix
a performance problem with O(n^2) behavior in instcombine, but the scalarrepl
pass is relying on getUnderlyingObject to go all the way back to an AllocaInst.
Making the limit part of the method signature makes it clear that by default
the result is limited and should help avoid similar problems in the future.
This fixes pr6126.

llvm-svn: 94433

fc060e43

Jan 24, 2010
- make -fno-rtti the default unless a directory builds with REQUIRES_RTTI. · 823aed16
  Chris Lattner authored Jan 24, 2010
```
llvm-svn: 94378
```
  823aed16
Jan 23, 2010
- third bug from PR6119: the xor dupe extension allows · 29b15c5c
  Chris Lattner authored Jan 23, 2010
```
for arbitrary terminators in predecessors, don't assume
it is a conditional or uncond branch.  The testcase shows
an example where they can happen with switches.

llvm-svn: 94323
```
  29b15c5c
- add an early out to ProcessBranchOnXOR to speed it up, · ba2d0b89
  Chris Lattner authored Jan 23, 2010
```
handle the case when we can infer an input to the xor
from all inputs that agree, instead of going into an
infinite loop.  Another part of PR6199

llvm-svn: 94321
```
  ba2d0b89
- fix a crash in jump threading, PR6119 · de5ab486
  Chris Lattner authored Jan 23, 2010
```
llvm-svn: 94319
```
  de5ab486
- Reapply 94059 while fixing the calling convention setup · ba7cd4c3
  Eric Christopher authored Jan 23, 2010
```
for strcpy.

llvm-svn: 94287
```
  ba7cd4c3
Jan 22, 2010

Revert 94059. It is breaking the MultiSource/Benchmarks/Prolangs-C/bison · 6c0c8d41
Bob Wilson authored Jan 22, 2010
```
test on ARM.

llvm-svn: 94198
```
6c0c8d41

Stop building RTTI information for *most* llvm libraries. Notable · 7ba0661f

Chris Lattner authored Jan 22, 2010

missing ones are libsupport, libsystem and libvmcore.  libvmcore is
currently blocked on bugpoint, which uses EH.  Once it stops using
EH, we can switch it off.

This #if 0's out 3 unit tests, because gtest requires RTTI information.
Suggestions welcome on how to fix this.

llvm-svn: 94164

7ba0661f

Revert LoopStrengthReduce.cpp to pre-r94061 for now. · 045f8198
Dan Gohman authored Jan 22, 2010
```
llvm-svn: 94123
```
045f8198

DbgInfoIntrinsics no longer appear in an instruction's use list; so clean up... · 1df65186

Victor Hernandez authored Jan 21, 2010

DbgInfoIntrinsics no longer appear in an instruction's use list; so clean up looking for them in use iterations and remove OnlyUsedByDbgInfoIntrinsics()

llvm-svn: 94111

1df65186

When inserting expressions for post-increment users which contain · b1ee154b

Dan Gohman authored Jan 21, 2010

loop-variant components, adds must be inserted after the increment.
Keep track of the increment position for this case, and insert
these adds in the correct location.

llvm-svn: 94110

b1ee154b

Jan 21, 2010

Include IVUsers information in LSR's debug output. · cb8d577e
Dan Gohman authored Jan 21, 2010
```
llvm-svn: 94108
```
cb8d577e

Prune the search for candidate formulae if the number of register · 29916e02

Dan Gohman authored Jan 21, 2010

operands exceeds the number of registers used in the initial
solution, as that wouldn't lead to a profitable solution anyway.

llvm-svn: 94107

29916e02

Add a comment. · c903499f
Dan Gohman authored Jan 21, 2010
```
llvm-svn: 94104
```
c903499f

Re-implement the main strength-reduction portion of LoopStrengthReduction. · 51ad99d2

Dan Gohman authored Jan 21, 2010

This new version is much more aggressive about doing "full" reduction in
cases where it reduces register pressure, and also more aggressive about
rewriting induction variables to count down (or up) to zero when doing so
reduces register pressure.

It currently uses fairly simplistic algorithms for finding reuse
opportunities, but it introduces a new framework allows it to combine
multiple strategies at once to form hybrid solutions, instead of doing
all full-reduction or all base+index.

llvm-svn: 94061

51ad99d2

Add strcpy_chk -> strcpy support for "don't know" object size · fa863258
Eric Christopher authored Jan 21, 2010
```
answers.  This will update as object size checking gets better information.

llvm-svn: 94059
```
fa863258

Jan 19, 2010

When doing address-mode sinking, expand the base register first, rather · ca19445d

Dan Gohman authored Jan 19, 2010

than the scaled register. This makes it more likely that subsequent
AddrModeMatcher queries will match the new address the same way as the
old, instead of accidentally matching what had been the base register
as the new scaled register, and then failing to match the scaled register.
This fixes some problems with address-mode sinking multiple muls into a
block, which will be a lot more common with some upcoming
LoopStrengthReduction changes.

llvm-svn: 93935

ca19445d

Fix a crash in scalarrepl for memcpy/memmove where the source and destination · 58d59fe3

Bob Wilson authored Jan 19, 2010

are the same.  I had already fixed a similar problem where the source and
destination were different bitcasts derived from the same alloca, but the
previous fix still did not handle the case where both operands are exactly
the same value.  Radar 7552893.

llvm-svn: 93848

58d59fe3