Commits · 97d44349c94470d41d8ecf36b81409ee3304e560 · Roger Ferrer / llvm-epi-0.8

Sep 13, 2012

Dmitri Gribenko authored Sep 13, 2012

* wrap code blocks in \code ... \endcode;
* refer to parameter names in paragraphs correctly (\arg is not what most
  people want -- it starts a new paragraph).

llvm-svn: 163790

2bc1d483

Sep 12, 2012
- Detect overflow in the path count computation. rdar://12277446. · 7c84dad8
  Dan Gohman authored Sep 12, 2012
```
llvm-svn: 163739
```
  7c84dad8
- Release build: guard dump functions with · 49d684e1
  Manman Ren authored Sep 12, 2012
```
"#if !defined(NDEBUG) || defined(LLVM_ENABLE_DUMP)"

No functional change. Update r163344.

llvm-svn: 163679
```
  49d684e1
Sep 10, 2012
- Move spaces to the right places. No functionality change. · 12d825d9
  Nick Lewycky authored Sep 09, 2012
```
llvm-svn: 163485
```
  12d825d9
Sep 09, 2012
- DSE: Poking holes into a SetVector is expensive, avoid it if possible. · 2b11eb07
  Benjamin Kramer authored Sep 09, 2012
```
llvm-svn: 163480
```
  2b11eb07
Sep 06, 2012
- Release build: guard dump functions with "ifndef NDEBUG" · c3366cce
  Manman Ren authored Sep 06, 2012
```
No functional change.

llvm-svn: 163344
```
  c3366cce
- Update function names to conform to guidelines. · 30c4282f
  Jim Grosbach authored Sep 06, 2012
```
No functional change.

llvm-svn: 163279
```
  30c4282f
Sep 05, 2012

Make provenance checking conservative in cases when · df476e5e

Dan Gohman authored Sep 04, 2012

pointers-to-strong-pointers may be in play. These can lead to retains and
releases happening in unstructured ways, foiling the optimizer. This fixes
rdar://12150909.

llvm-svn: 163180

df476e5e

Sep 04, 2012

Generic Bypass Slow Div · cdf540d5

Preston Gurd authored Sep 04, 2012

- CodeGenPrepare pass for identifying div/rem ops
- Backend specifies the type mapping using addBypassSlowDivType
- Enabled only for Intel Atom with O2 32-bit -> 8-bit
- Replace IDIV with instructions which test its value and use DIVB if the value
is positive and less than 256.
- In the case when the quotient and remainder of a divide are used a DIV
and a REM instruction will be present in the IR. In the non-Atom case
they are both lowered to IDIVs and CSE removes the redundant IDIV instruction,
using the quotient and remainder from the first IDIV. However,
due to this optimization CSE is not able to eliminate redundant
IDIV instructions because they are located in different basic blocks.
This is overcome by calculating both the quotient (DIV) and remainder (REM)
in each basic block that is inserted by the optimization and reusing the result
values when a subsequent DIV or REM instruction uses the same operands.
- Test cases check for the presents of the optimization when calculating
either the quotient, remainder,  or both.

Patch by Tyler Nowicki!

llvm-svn: 163150

cdf540d5

LICM may hoist an instruction with undefined behavior above a trap. · 03dcd85b

Nadav Rotem authored Sep 04, 2012

Scan the body of the loop and find instructions that may trap.
Use this information when deciding if it is safe to hoist or sink instructions.
Notice that we can optimize the search of instructions that may throw in the case of nested loops.

rdar://11518836

llvm-svn: 163132

03dcd85b

Sep 02, 2012

Not all targets have efficient ISel code generation for select instructions. · 9d832026

Nadav Rotem authored Sep 02, 2012

For example, the ARM target does not have efficient ISel handling for vector
selects with scalar conditions. This patch adds a TLI hook which allows the
different targets to report which selects are supported well and which selects
should be converted to CF duting codegen prepare.

llvm-svn: 163093

9d832026

LoopRotation: Make the brute force DomTree update more brute force. · 599a4bb6

Benjamin Kramer authored Sep 02, 2012

We update until we hit a fixpoint. This is probably slow but also
slightly simplifies the code. It should also fix the occasional
invalid domtrees observed when building with expensive checking.

I couldn't find a case where this had a measurable slowdown, but
if someone finds a pathological case where it does we may have
to find a cleverer way of updating dominators here.

Thanks to Duncan for the test case.

llvm-svn: 163091

599a4bb6

Sep 01, 2012
- LoopRotation: Check some invariants of the dominator updating code. · 3be6a480
  Benjamin Kramer authored Sep 01, 2012
```
llvm-svn: 163058
```
  3be6a480
Aug 30, 2012

LoopRotate: Also rotate loops with multiple exits. · afdfdb5c

Benjamin Kramer authored Aug 30, 2012

The old PHI updating code in loop-rotate was replaced with SSAUpdater a while
ago, it has no problems with comples PHIs. What had to be fixed is detecting
whether a loop was already rotated and updating dominators when multiple exits
were present.

This change increases overall code size a bit, mostly due to additional loop
unrolling opportunities. Passes test-suite and selfhost with -verify-dom-info.
Fixes PR7447.

Thanks to Andy for the input on the domtree updating code.

llvm-svn: 162912

afdfdb5c

Aug 29, 2012

Make MemoryBuiltins aware of TargetLibraryInfo. · 8bcc9711

Benjamin Kramer authored Aug 29, 2012

This disables malloc-specific optimization when -fno-builtin (or -ffreestanding)
is specified. This has been a problem for a long time but became more severe
with the recent memory builtin improvements.

Since the memory builtin functions are used everywhere, this required passing
TLI in many places. This means that functions that now have an optional TLI
argument, like RecursivelyDeleteTriviallyDeadFunctions, won't remove dead
mallocs anymore if the TLI argument is missing. I've updated most passes to do
the right thing.

Fixes PR13694 and probably others.

llvm-svn: 162841

8bcc9711

Aug 27, 2012
- Don't use for loops for code that is only intended to execute once. No · 10c82cee
  Dan Gohman authored Aug 27, 2012
```
intended functionality change. Thanks to Ahmed Charles for spotting it.

llvm-svn: 162686
```
  10c82cee
Aug 24, 2012

GVN: Fix quadratic runtime on the number of switch cases. · dd62d6b6

Benjamin Kramer authored Aug 24, 2012

No intended behavior change.  This was introduced in r162023.  With the fixed
algorithm a Release build of ARMInstPrinter.cpp goes from 16s to 10s on a
2011 MBP.

llvm-svn: 162559

dd62d6b6

Aug 22, 2012
- SimplifyLibCalls: Give all safely-shrinkable libcalls the same treatment. · e07728b9
  Benjamin Kramer authored Aug 22, 2012
```
llvm-svn: 162383
```
  e07728b9
- Add a few float shrinking optimizations to SimplifyLibCalls. Unsafe · 0122909d
  Chad Rosier authored Aug 22, 2012
```
optimizations are guarded by the -enable-double-float-shrink LLVM option.
Last bit of PR13574.  Patch by Weiming Zhao <weimingz@codeaurora.org>.

llvm-svn: 162368
```
  0122909d
- Add a new helper function, AddOpt(F1, F1, Opt), as part of PR13574. No · b2f5c1cd
  Chad Rosier authored Aug 22, 2012
```
functional change intended.  Patch by Weiming Zhao <weimingz@codeaurora.org>.

llvm-svn: 162363
```
  b2f5c1cd
Aug 21, 2012
- Don't bind a reference to a dereferenced null pointer (for return value of WeakVH::operator*). · ad9c8e83
  Richard Smith authored Aug 21, 2012
```
llvm-svn: 162309
```
  ad9c8e83
- Port the global copy optimization from the SROA pass to InstCombine. · c908ca17
  Chandler Carruth authored Aug 21, 2012
```
This optimization is really just replacing allocas wholesale with
globals, there is no scalarization.

The underlying motivation for this patch is to simplify the SROA pass
and focus it on splitting and promoting allocas.

llvm-svn: 162271
```
  c908ca17
- revise debug output to avoid dangling pointer · 6e12d128
  Michael Liao authored Aug 21, 2012
```
llvm-svn: 162256
```
  6e12d128
Aug 18, 2012
- SimplifyLibcalls: Add fabs and trunc to the list of libcalls that are safe to... · 00013245
  Benjamin Kramer authored Aug 18, 2012
```
SimplifyLibcalls: Add fabs and trunc to the list of libcalls that are safe to shrink from double to float.

llvm-svn: 162173
```
  00013245
Aug 16, 2012

Teach GVN to reason about edges dominating uses. This allows it to handle cases · cc80cdeb

Rafael Espindola authored Aug 16, 2012

where some fact lake a=b dominates a use in a phi, but doesn't dominate the
basic block itself.

This feature could also be implemented by splitting critical edges, but at least
with the current algorithm reasoning about the dominance directly is faster.

The time for running "opt -O2" in the testcase in pr10584 is 1.003 times slower
and on gcc as a single file it is 1.0007 times faster.

llvm-svn: 162023

cc80cdeb

Aug 15, 2012
- Remove dead flag. · 4d5150d9
  Bill Wendling authored Aug 15, 2012
```
llvm-svn: 161990
```
  4d5150d9
Aug 14, 2012

Change greater than to greater than or equal so that an identical sized store... · 2a40418a

Craig Topper authored Aug 14, 2012

Change greater than to greater than or equal so that an identical sized store to the same offset is treated as completing overwriting.

llvm-svn: 161857

2a40418a

During the CodeGenPrepare we often lower intrinsics (such as objsize) · 70409991

Nadav Rotem authored Aug 14, 2012

and allow some optimizations to turn conditional branches into unconditional.
This commit adds a simple control-flow optimization which merges two consecutive
basic blocks which are connected by a single edge. This allows the codegen to
operate on larger basic blocks.

rdar://11973998

llvm-svn: 161852

70409991

Aug 10, 2012

Constify some basic blocks, no functionality change. · 64e7b570
Rafael Espindola authored Aug 10, 2012
```
llvm-svn: 161668
```
64e7b570

Fix crash when when do lto on Bullet. Dynamic GEPs in SROA were incorrectly... · 0deca6be

Pete Cooper authored Aug 10, 2012

Fix crash when when do lto on Bullet.  Dynamic GEPs in SROA were incorrectly being applied to all accesses to an alloca, not just the ones which read from the GEP.  Thanks to Evan for reducing the test.  rdar://11861001

llvm-svn: 161654

0deca6be

Aug 08, 2012

isAllocLikeFn is allowed to return true for functions which read memory; make · 08ec0a81
Eli Friedman authored Aug 08, 2012
```
sure we account for that correctly in DeadStoreElimination.  Fixes a regression
from r158919.  PR13547.

llvm-svn: 161468
```
08ec0a81

Avoid recomputing the unique exit blocks and their insert points when doing · b9487360

Dan Gohman authored Aug 08, 2012

multiple scalar promotions on a single loop. This also has the effect of
preserving the order of stores sunk out of loops, which is aesthetically
pleasing, and it happens to fix the testcase in PR13542, though it doesn't
fix the underlying problem.

llvm-svn: 161459

b9487360

Jul 27, 2012
- Teach CodeGenPrep to look past bitcast when it's duplicating return instruction · 249716e8
  Evan Cheng authored Jul 27, 2012
```
into predecessor blocks to enable tail call optimization.

rdar://11958338

llvm-svn: 160894
```
  249716e8
Jul 26, 2012
- do null checks for a few more Emit*() functions. · 5940c4a1
  Nuno Lopes authored Jul 26, 2012
```
Thanks Eli for noticing.

llvm-svn: 160787
```
  5940c4a1
- Stop reassociate from looking through expressions of arbitrary complexity. This · 56514520
  Duncan Sands authored Jul 26, 2012
```
is a temporary measure until my fix for PR13021 is ready.

llvm-svn: 160778
```
  56514520
Jul 25, 2012

make all Emit*() functions consult the TargetLibraryInfo information before... · 89702e94

Nuno Lopes authored Jul 25, 2012

make all Emit*() functions consult the TargetLibraryInfo information before creating a call to a library function.
Update all clients to pass the TLI information around.
Previous draft reviewed by Eli.

llvm-svn: 160733

89702e94

Jul 24, 2012
- Clean whitespaces. · 465834c8
  Nadav Rotem authored Jul 24, 2012
```
llvm-svn: 160668
```
  465834c8
Jul 23, 2012
- An objc_retain can serve as a may-use for a different pointer. · f64ff8ed
  Dan Gohman authored Jul 23, 2012
```
rdar://11931823.

llvm-svn: 160637
```
  f64ff8ed
- Suppress a warning. · 1088811c
  Nadav Rotem authored Jul 23, 2012
```
llvm-svn: 160629
```
  1088811c
- Fix a typo (the the => the) · 35521e23
  Sylvestre Ledru authored Jul 23, 2012
```
llvm-svn: 160621
```
  35521e23