Commits · 27dfb1e1a4e6d3c36e0ade15792dbd1487c7a931 · Roger Ferrer / llvm-epi-0.8

Feb 05, 2010

Do not reassociate expressions with i1 type. SimplifyCFG converts some · 27dfb1e1

Bob Wilson authored Feb 04, 2010

short-circuited conditions to AND/OR expressions, and those expressions
are often converted back to a short-circuited form in code gen.  The
original source order may have been optimized to take advantage of the
expected values, and if we reassociate them, we change the order and
subvert that optimization.  Radar 7497329.

llvm-svn: 95333

27dfb1e1

Feb 04, 2010

Increase inliner thresholds by 25. · 113fb54b

Jakob Stoklund Olesen authored Feb 04, 2010

This makes the inliner about as agressive as it was before my changes to the
inliner cost calculations. These levels give the same performance and slightly
smaller code than before.

llvm-svn: 95320

113fb54b

Temporarily revert this since it appears to have caused a build · 107a1fbf
Eric Christopher authored Feb 04, 2010
```
failure.

llvm-svn: 95294
```
107a1fbf

Rework constant expr and array handling for objectsize instcombining. · 42fa84a8

Eric Christopher authored Feb 04, 2010

Fix bugs where we would compute out of bounds as in bounds, and where
we couldn't know that the linker could override the size of an array.

Add a few new testcases, change existing testcase to use a private
global array instead of extern.

llvm-svn: 95283

42fa84a8

If we're dealing with a zero-length array, don't lower to any · f12e18db
Eric Christopher authored Feb 03, 2010
```
particular size, we just don't know what the length is yet.

llvm-svn: 95266
```
f12e18db

Feb 03, 2010

Adjust the heuristics used to decide when SROA is likely to be profitable. · 04365c5f

Bob Wilson authored Feb 03, 2010

The SRThreshold value makes perfect sense for checking if an entire aggregate
should be promoted to a scalar integer, but it is not so good for splitting
an aggregate into its separate elements. A struct may contain a large embedded
array along with some scalar fields that would benefit from being split apart
by SROA. Even if the total aggregate size is large, it may still be good to
perform SROA. Thus, the most important piece of this patch is simply moving
the aggregate size comparison vs. SRThreshold so that it guards only the
aggregate promotion.

We have also been checking the number of elements to decide if an aggregate
should be split up. The limit of "SRThreshold/4" seemed rather arbitrary,
and I don't think it's very useful to derive this limit from SRThreshold
anyway. I've collected some data showing that the current default limit of
32 (since SRThreshold defaults to 128) is a reasonable cutoff for struct
types. One thing suggested by the data is that distinguishing between structs
and arrays might be useful. There are (obviously) a lot more large arrays
than large structs (as measured by the number of elements and not the total
size -- a large array inside a struct still counts as a single element given
the way we do SROA right now). Out of 8377 arrays where we successfully
performed SROA while compiling a large set of benchmarks, only 16 of them had
more than 8 elements. And, for those 16 arrays, it's not at all clear that
SROA was actually beneficial. So, to offset the compile time cost of
investigating more large structs for SROA, the patch lowers the limit on array
elements to 8.

This fixes Apple Radar 7563690.

llvm-svn: 95224

04365c5f

Revert 94937 and move the noreturn check to codegen. · 27a41d54
Evan Cheng authored Feb 03, 2010
```
llvm-svn: 95198
```
27a41d54
Fix some comment typos. · 76e8c595
Bob Wilson authored Feb 03, 2010
```
llvm-svn: 95170
```
76e8c595
Recommit this, looks like it wasn't the cause. · d86233c1
Eric Christopher authored Feb 03, 2010
```
llvm-svn: 95165
```
d86233c1
Hopefully temporarily revert this. · e67d01a9
Eric Christopher authored Feb 02, 2010
```
llvm-svn: 95154
```
e67d01a9

Feb 02, 2010
- Reformat my last patch slightly. · f9553572
  Eric Christopher authored Feb 02, 2010
```
llvm-svn: 95147
```
  f9553572
- Re-add strcmp and known size object size checking optimization. · 4264e7e4
  Eric Christopher authored Feb 02, 2010
```
Passed bootstrap and nightly test run here.

llvm-svn: 95145
```
  4264e7e4
- don't turn (A & (C0?-1:0)) | (B & ~(C0?-1:0)) -> C0 ? A : B · 8e2c4716
  Chris Lattner authored Feb 02, 2010
```
for vectors.  Codegen is generating awful code or segfaulting
in various cases (e.g. PR6204).

llvm-svn: 95058
```
  8e2c4716
- fix a crash in loop unswitch on a loop invariant vector condition. · 302240d7
  Chris Lattner authored Feb 02, 2010
```
llvm-svn: 95055
```
  302240d7
- LangRef.html says that inttoptr and ptrtoint always use zero-extension · 949458d0
  Dan Gohman authored Feb 02, 2010
```
when the cast is extending.

llvm-svn: 95046
```
  949458d0
- Don't need to check the last argument since it'll always be bool. We also · 14dfc3f6
  Eric Christopher authored Feb 02, 2010
```
don't use TargetData here.

llvm-svn: 95040
```
  14dfc3f6
- More indentation/tabification fixes. · 9afa9732
  Eric Christopher authored Feb 02, 2010
```
llvm-svn: 95036
```
  9afa9732
- Untabify previous commit. · 14082347
  Eric Christopher authored Feb 02, 2010
```
llvm-svn: 95035
```
  14082347
- Formatting. · 56e4182c
  Eric Christopher authored Feb 01, 2010
```
llvm-svn: 95027
```
  56e4182c
Feb 01, 2010

Add an option to GVN to remove all partially redundant loads. This is currently · d517b520

Bob Wilson authored Feb 01, 2010

disabled by default.  This divides the existing load PRE code into 2 phases:
first it checks that it is safe to move the load to each of the predecessors
where it is unavailable, and then if it is safe, the code is changed to move
the load.  Radar 7571861.

llvm-svn: 95007

d517b520

cleanups. · 9306ffa0
Chris Lattner authored Feb 01, 2010
```
llvm-svn: 94995
```
9306ffa0

fix rdar://7590304 , a miscompilation of objc apps on arm. The caller · 846a52e2

Chris Lattner authored Feb 01, 2010

of objc message send was getting marked arm_apcscc, but the prototype
isn't.  This is fine at runtime because objcmsgsend is implemented in
assembly.  Only turn a mismatched caller and callee into 'unreachable'
if the callee is a definition.

llvm-svn: 94986

846a52e2

fix rdar://7590304 , an infinite loop in instcombine. In the invoke · 2cecedf0

Chris Lattner authored Feb 01, 2010

case, instcombine can't zap the invoke for fear of changing the CFG.
However, we have to do something to prevent the next iteration of
instcombine from inserting another store -> undef before the invoke
thereby getting into infinite iteration between dead store elim and
store insertion.

Just zap the callee to null, which will prevent the next iteration
from doing anything.

llvm-svn: 94985

2cecedf0

Fix pr6198 by moving the isSized() check to an outer conditional. · f65ba356

Bob Wilson authored Feb 01, 2010

The testcase from pr6198 does not crash for me -- I don't know what's up with
that -- so I'm not adding it to the tests.

llvm-svn: 94984

f65ba356

Jan 31, 2010
- Simplify/generalize the xor+add->sign-extend instcombine. · a2cc2875
  Eli Friedman authored Jan 31, 2010
```
llvm-svn: 94943
```
  a2cc2875
- Add a small transform: transform -(X<<Y) to (-X<<Y) when the shift has a single · 37a8197b
  Eli Friedman authored Jan 31, 2010
```
use and X is free to negate.

llvm-svn: 94941
```
  37a8197b
- Do not mark no-return calls tail calls. It'll screw up special calls like... · d86d3fe0
  Evan Cheng authored Jan 31, 2010
```
Do not mark no-return calls tail calls. It'll screw up special calls like longjmp and it doesn't make much sense for performance reason. If my logic is faulty, please let me know.

llvm-svn: 94937
```
  d86d3fe0
Jan 30, 2010
- Check alignment of loads when deciding whether it is safe to execute them · 56600a15
  Bob Wilson authored Jan 30, 2010
```
unconditionally.  Besides checking the offset, also check that the underlying
object is aligned as much as the load itself.

llvm-svn: 94875
```
  56600a15
- Use more specific types to avoid casts. No functionality change. · 4b71b6c1
  Bob Wilson authored Jan 30, 2010
```
llvm-svn: 94863
```
  4b71b6c1
- Keep iterating over all uses when meeting a phi node in AllUsesOfValueWillTrapIfNull(). · e27dc727
  Jakob Stoklund Olesen authored Jan 29, 2010
```
This bug was exposed by my inliner cost changes in r94615, and caused failures
of lencod on most architectures when building with LTO.

This patch fixes lencod and 464.h264ref on x86-64 (and likely others).

llvm-svn: 94858
```
  e27dc727
Jan 29, 2010
- Preserve load alignment in instcombine transformations. I've been unable to · 1b845306
  Bob Wilson authored Jan 29, 2010
```
create a testcase where this matters.  The select+load transformation only
occurs when isSafeToLoadUnconditionally is true, and in those situations,
instcombine also changes the underlying objects to be aligned.  This seems
like a good idea regardless, and I've verified that it doesn't pessimize
the subsequent realignment.

llvm-svn: 94850
```
  1b845306
- Revert my last couple of patches. They appear to have broken bison. · 5a0e1748
  Eric Christopher authored Jan 29, 2010
```
llvm-svn: 94841
```
  5a0e1748
- Use uint64_t instead of unsigned for offsets and sizes. · 34e10c22
  Bob Wilson authored Jan 29, 2010
```
llvm-svn: 94835
```
  34e10c22
- Improve isSafeToLoadUnconditionally to recognize that GEPs with constant · 7c42b9d5
  Bob Wilson authored Jan 29, 2010
```
indices are safe if the result is known to be within the bounds of the
underlying object.

llvm-svn: 94829
```
  7c42b9d5
- Having RHSKnownZero and RHSKnownOne be alternative names for KnownZero and KnownOne · c8a3e568
  Duncan Sands authored Jan 29, 2010
```
(via APInt &RHSKnownZero = KnownZero, etc) seems dangerous and confusing to me: it
is easy not to notice this, and then wonder why KnownZero/RHSKnownZero changed
underneath you when you modified RHSKnownZero/KnownZero etc.  So get rid of this.
No intended functionality change (tested with "make check" + llvm-gcc bootstrap).

llvm-svn: 94802
```
  c8a3e568
- Make strcpy_chk lower to strcpy if we have a safe size. · 9b3c02b7
  Eric Christopher authored Jan 29, 2010
```
llvm-svn: 94783
```
  9b3c02b7
- Add constant support to object size handling and remove default · 997f7ca8
  Eric Christopher authored Jan 29, 2010
```
lowering. We'll either figure it out, or not and be lowered by
SelectionDAGBuild.

Add test.

llvm-svn: 94775
```
  997f7ca8
- Generic reformatting and comment fixing. No functionality change. · 48816a0b
  Bill Wendling authored Jan 29, 2010
```
llvm-svn: 94771
```
  48816a0b
- Add newline to debugging output, and fix some grammar-os in comment. · 8277838c
  Bill Wendling authored Jan 29, 2010
```
llvm-svn: 94765
```
  8277838c
- mem2reg erases the dbg.declare intrinsics that it converts to dbg.val intrinsics · 006b53f1
  Victor Hernandez authored Jan 29, 2010
```
llvm-svn: 94763
```
  006b53f1