Commits · bf4b9afbebbf98428ebbb97d261bad2cdff23aab · Lorenzo Albano / LLVM bpEVL

Apr 30, 2012

Bill Wendling authored Apr 30, 2012

Allow the "SplitCriticalEdge" function to split the edge to a landing pad. If
the pass is *sure* that it thinks it knows what it's doing, then it may go ahead
and specify that the landing pad can have its critical edge split. The loop
unswitch pass is one of these passes. It will split the critical edges of all
edges coming from a loop to a landing pad not within the loop. Doing so will
retain important loop analysis information, such as loop simplify.

llvm-svn: 155817

bf4b9afb

Use an ArrayRef instead of explicit vector type. · 325e6cd9
Bill Wendling authored Apr 30, 2012
```
llvm-svn: 155816
```
325e6cd9
Remove hack from r154987. The problem persists even with it, so it's not even a good hack. · 712d85a8
Bill Wendling authored Apr 30, 2012
```
llvm-svn: 155813
```
712d85a8
Make sure HoistInsertPosition finds a position that is dominated by all · dd489314
Rafael Espindola authored Apr 30, 2012
```
inputs.

llvm-svn: 155809
```
dd489314

Apr 27, 2012
- Don't vectorize target-specific types (ppc_fp128, x86_fp80, etc.). · 27c32461
  Hal Finkel authored Apr 27, 2012
```
Target specific types should not be vectorized. As a practical matter,
these types are already register matched (at least in the x86 case),
and codegen does not always work correctly (at least in the ppc case,
and this is not worth fixing because ppc_fp128 is currently broken and
will probably go away soon).

llvm-svn: 155729
```
  27c32461
- Change recurse depth limit to uint32 to fix warning. · 84e4b399
  David Blaikie authored Apr 27, 2012
```
llvm-svn: 155727
```
  84e4b399
- Miscellaneous accumulated cleanups. · dae3349a
  Dan Gohman authored Apr 27, 2012
```
llvm-svn: 155725
```
  dae3349a
- Add an early bailout to IsValueFullyAvailableInBlock from deeply nested blocks. · 6120cfb8
  Mon P Wang authored Apr 27, 2012
```
The limit is set to an arbitrary 1000 recursion depth to avoid stack overflow
issues. <rdar://problem/11286839>.

llvm-svn: 155722
```
  6120cfb8
- [asan] small optimization: do not emit "x+0" instructions · 5a464f03
  Kostya Serebryany authored Apr 27, 2012
```
llvm-svn: 155701
```
  5a464f03
- [tsan] Atomic support for ThreadSanitizer, patch by Dmitry Vyukov · a1259778
  Kostya Serebryany authored Apr 27, 2012
```
llvm-svn: 155698
```
  a1259778
- Break up getProfitableChainIncrement(). · c90abc89
  Jakob Stoklund Olesen authored Apr 26, 2012
```
The required checks are moved to ChainInstruction() itself and the
policy decisions are moved to IVChain::isProfitableInc().

Also cache the ExprBase in IVChain to avoid frequent recomputations.

No functional change intended.

llvm-svn: 155676
```
  c90abc89
- Turn IVChain into a struct. · a0337d7b
  Jakob Stoklund Olesen authored Apr 26, 2012
```
No functional change intended.

llvm-svn: 155675
```
  a0337d7b
- Add instcombine patterns for the following transformations: · 7813dcee
  Chad Rosier authored Apr 26, 2012
```
 (x & y) | (x ^ y) -> x | y 
 (x & y) + (x ^ y) -> x | y 

Patch by Manman Ren.
rdar://10770603

llvm-svn: 155674
```
  7813dcee
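
The two instcombine patterns in r155674 hold because `x & y` and `x ^ y` never have a set bit in common, so OR-ing them and adding them give the same result, namely `x | y`. The following is a minimal sketch that checks both identities exhaustively for 8-bit values. The function name `identitiesHoldForAllBytes` is made up for illustration and is not part of the LLVM patch.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not from the LLVM patch): returns true if both
// identities hold for every pair of 8-bit values.
//   (x & y) | (x ^ y) == x | y
//   (x & y) + (x ^ y) == x | y
// (x & y) and (x ^ y) have disjoint set bits, so the addition never carries
// and behaves exactly like a bitwise OR.
bool identitiesHoldForAllBytes() {
  for (unsigned x = 0; x < 256; ++x) {
    for (unsigned y = 0; y < 256; ++y) {
      const uint8_t a = static_cast<uint8_t>(x);
      const uint8_t b = static_cast<uint8_t>(y);
      const uint8_t expected = static_cast<uint8_t>(a | b);
      const uint8_t viaOr = static_cast<uint8_t>((a & b) | (a ^ b));
      const uint8_t viaAdd = static_cast<uint8_t>((a & b) + (a ^ b));
      if (viaOr != expected || viaAdd != expected)
        return false;
    }
  }
  return true;
}
```

A caller can check the result with `assert(identitiesHoldForAllBytes());`. The same bit-disjointness argument applies at any integer width.
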
Apr 26, 2012

Teach the reassociate pass to fold chains of multiplies with repeated · 739ef80f

Chandler Carruth authored Apr 26, 2012

elements to minimize the number of multiplies required to compute the
final result. This uses a heuristic to attempt to form near-optimal
binary exponentiation-style multiply chains. While there are some cases
it misses, it seems to do at least a decent job on a very diverse range of
inputs.

Initial benchmarks show no interesting regressions, and an 8%
improvement on SPASS. Let me know if any other interesting results (in
either direction) crop up!

Credit to Richard Smith for the core algorithm, and helping code the
patch itself.

llvm-svn: 155616

739ef80f
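
To illustrate the idea of a binary exponentiation-style multiply chain mentioned in r155616, here is a minimal sketch of plain left-to-right square-and-multiply that also counts the multiplies it performs. It is not the heuristic used in the reassociate pass; `PowResult` and `powBySquaring` are hypothetical names, and arithmetic wraps modulo 2^64.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical result type: the computed power and how many multiplies
// were needed to produce it.
struct PowResult {
  uint64_t value;
  unsigned multiplies;
};

// Computes x^n by scanning the bits of n from the highest set bit down.
// Each lower bit costs one squaring, plus one extra multiply by x when the
// bit is set. Unsigned arithmetic wraps modulo 2^64.
PowResult powBySquaring(uint64_t x, unsigned n) {
  if (n == 0)
    return PowResult{1, 0};

  // Index of the highest set bit of n.
  unsigned bit = 0;
  for (unsigned t = n; t > 1; t >>= 1)
    ++bit;

  PowResult r{x, 0}; // the highest set bit contributes x itself
  while (bit-- > 0) {
    r.value *= r.value; // square
    ++r.multiplies;
    if ((n >> bit) & 1u) {
      r.value *= x; // multiply in one more factor of x
      ++r.multiplies;
    }
  }
  return r;
}
```

For example, `powBySquaring(3, 8)` needs 3 multiplies where a naive chain `x*x*x*x*x*x*x*x` needs 7. For `n = 15` this scheme uses 6 multiplies, while a 5-multiply chain exists (x^2, x^3, x^6, x^12, x^15), which is the kind of gap a heuristic can try to close.
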

Apr 25, 2012
- Print IV chain numbers while collecting them. · 293673d7
  Jakob Stoklund Olesen authored Apr 25, 2012
```
llvm-svn: 155567
```
  293673d7
- Reverting r155468. Chris and Chandler have convinced me that it's dangerous and · 2fd0c691
  Lang Hames authored Apr 25, 2012
```
in poor taste.

Talking through some alternate solutions with Chandler.

llvm-svn: 155530
```
  2fd0c691
- Simplify the known retain count tracking; use a boolean state instead · 62079b43
  Dan Gohman authored Apr 25, 2012
```
of a precise count. Also, move RRInfo's Partial field into PtrState,
now that it won't increase the size.

llvm-svn: 155513
```
  62079b43
- Build custom predecessor and successor lists for each basic block. · c24c66f2
  Dan Gohman authored Apr 24, 2012
```
These lists exclude invoke unwind edges and loop backedges which
are being ignored. This makes it easier to ignore them
consistently.

llvm-svn: 155500
```
  c24c66f2
Apr 24, 2012
- Add support for llvm.arm.neon.vmull* intrinsics to InstCombine. This fixes · 84531c2b
  Lang Hames authored Apr 24, 2012
```
<rdar://problem/11291436>.

llvm-svn: 155468
```
  84531c2b
Apr 23, 2012

Reapply r155136 after fixing PR12599. · 43bcb970

Jakob Stoklund Olesen authored Apr 23, 2012

Original commit message:

Defer some shl transforms to DAGCombine.

The shl instruction is used to represent multiplication by a constant
power of two as well as bitwise left shifts. Some InstCombine
transformations would turn an shl instruction into a bit mask operation,
making it difficult for later analysis passes to recognize the
constant multiplication.

Disable those shl transformations, deferring them to DAGCombine time.
An 'shl X, C' instruction is now treated mostly the same way as 'mul X, C'.

These transformations are deferred:

  (X >>? C) << C   --> X & (-1 << C)  (When X >> C has multiple uses)
  (X >>? C1) << C2 --> X << (C2-C1) & (-1 << C2)   (When C2 > C1)
  (X >>? C1) << C2 --> X >>? (C1-C2) & (-1 << C2)  (When C1 > C2)

The corresponding exact transformations are preserved, just like
div-exact + mul:

  (X >>?,exact C) << C   --> X
  (X >>?,exact C1) << C2 --> X << (C2-C1)
  (X >>?,exact C1) << C2 --> X >>?,exact (C1-C2)

The disabled transformations could also prevent the instruction selector
from recognizing rotate patterns in hash functions and cryptographic
primitives. I have a test case for that, but it is too fragile.

llvm-svn: 155362

43bcb970
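
The deferred rewrites listed in r155362 can be sanity-checked on ordinary 32-bit unsigned integers. The sketch below covers only the logical-shift (`lshr`) reading of `>>?` and assumes a 32-bit `unsigned int`; the names `highMask`, `deferredShlIdentitiesHold`, and `exactShiftRoundTrips` are invented for illustration and do not appear in LLVM.

```cpp
#include <cassert>
#include <cstdint>

// Mask with the low c bits cleared, i.e. (-1 << c) for 0 <= c < 32.
uint32_t highMask(unsigned c) {
  return static_cast<uint32_t>(0xFFFFFFFFu << c);
}

// Checks, for every pair of shift amounts in [0, 32), that
//   (x >> c1) << c2 == (x << (c2 - c1)) & (-1 << c2)   when c2 >= c1
//   (x >> c1) << c2 == (x >> (c1 - c2)) & (-1 << c2)   when c1 >  c2
// The c1 == c2 case reduces to (x >> c) << c == x & (-1 << c).
bool deferredShlIdentitiesHold(uint32_t x) {
  for (unsigned c1 = 0; c1 < 32; ++c1) {
    for (unsigned c2 = 0; c2 < 32; ++c2) {
      const uint32_t lhs = (x >> c1) << c2;
      const uint32_t rhs = (c2 >= c1)
                               ? ((x << (c2 - c1)) & highMask(c2))
                               : ((x >> (c1 - c2)) & highMask(c2));
      if (lhs != rhs)
        return false;
    }
  }
  return true;
}

// The "exact" case: if the right shift drops no set bits, shifting back
// restores x, so no mask is needed. Returns true when x's low c bits are
// nonzero, because the exact precondition does not apply then.
bool exactShiftRoundTrips(uint32_t x, unsigned c) {
  if ((x & ~highMask(c)) != 0)
    return true;
  return ((x >> c) << c) == x;
}
```

Calling `deferredShlIdentitiesHold(0xDEADBEEFu)` exercises all 1024 shift-amount pairs for one input. The mask on the right-hand side is exactly the bit-mask operation that, per the commit message, hides the constant multiplication from later passes.
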

Fix issue 67 by checking that the interface functions weren't redefined in the... · 056e27ea
Alexander Potapenko authored Apr 23, 2012
```
Fix issue 67 by checking that the interface functions weren't redefined in the compiled source file.

llvm-svn: 155346
```
056e27ea
[tsan] use llvm/ADT/Statistic.h for tsan stats · 5a4b7a23
Kostya Serebryany authored Apr 23, 2012
```
llvm-svn: 155341
```
5a4b7a23

Apr 20, 2012

Revert r155136 "Defer some shl transforms to DAGCombine." · 205ee3b3

Jakob Stoklund Olesen authored Apr 20, 2012

While the patch was perfect and defect free, it exposed a really nasty
bug in X86 SelectionDAG that caused an llc crash when compiling lencod.

I'll put the patch back in after fixing the SelectionDAG problem.

llvm-svn: 155181

205ee3b3

Put this expensive check below the less expensive ones. · 9f975952
Bill Wendling authored Apr 19, 2012
```
llvm-svn: 155166
```
9f975952

Apr 19, 2012

Avoid a bug in the path count computation, preventing an infinite · 26aa8274
Dan Gohman authored Apr 19, 2012
```
loop repeatedly making the same change. This is for rdar://11256239.

llvm-svn: 155160
```
26aa8274

Defer some shl transforms to DAGCombine. · 6b6c81e6

Jakob Stoklund Olesen authored Apr 19, 2012

The shl instruction is used to represent multiplication by a constant
power of two as well as bitwise left shifts. Some InstCombine
transformations would turn an shl instruction into a bit mask operation,
making it difficult for later analysis passes to recognize the
constant multiplication.

Disable those shl transformations, deferring them to DAGCombine time.
An 'shl X, C' instruction is now treated mostly the same way as 'mul X, C'.

These transformations are deferred:

  (X >>? C) << C   --> X & (-1 << C)  (When X >> C has multiple uses)
  (X >>? C1) << C2 --> X << (C2-C1) & (-1 << C2)   (When C2 > C1)
  (X >>? C1) << C2 --> X >>? (C1-C2) & (-1 << C2)  (When C1 > C2)

The corresponding exact transformations are preserved, just like
div-exact + mul:

  (X >>?,exact C) << C   --> X
  (X >>?,exact C1) << C2 --> X << (C2-C1)
  (X >>?,exact C1) << C2 --> X >>?,exact (C1-C2)

The disabled transformations could also prevent the instruction selector
from recognizing rotate patterns in hash functions and cryptographic
primitives. I have a test case for that, but it is too fragile.

llvm-svn: 155136

6b6c81e6

Don't crash on code where the user put __attribute__((constructor)) on · 22fbe8d7
Dan Gohman authored Apr 18, 2012
```
a function with arguments. This fixes rdar://11265785.

llvm-svn: 155073
```
22fbe8d7

Apr 18, 2012

Use a heavy hammer to fix PR12573. · 4d4d0257

Bill Wendling authored Apr 18, 2012

If the loop contains invoke instructions, whose unwind edge escapes the loop,
then don't try to unswitch the loop. Doing so may cause the unwind edge to be
split, which not only is non-trivial but doesn't preserve loop simplify
information.

Fixes PR12573

llvm-svn: 154987

4d4d0257

loop-reduce: Add an early bailout to catch extremely large loops. · 19f80c1e

Andrew Trick authored Apr 18, 2012

This introduces a threshold of 200 IV Users, which is very
conservative but should be sufficient to avoid serious compile time
sink or stack overflow. The llvm test-suite with LTO never exceeds 190
users per loop.

The bug doesn't relate to a specific type of loop. Checking in an
arbitrary giant loop as a unit test would be silly.

Fixes rdar://11262507.

llvm-svn: 154983

19f80c1e

fix pr12559: mark unavailable win32 math libcalls · a81bcbb9

Joe Groff authored Apr 17, 2012

also fix SimplifyLibCalls to use TLI rather than compile-time conditionals to enable optimizations on floor, ceil, round, rint, and nearbyint

llvm-svn: 154960

a81bcbb9

Apr 16, 2012
- Fix style violation in BBVectorize (pointed out by Bill Wendling) · 52ba49f3
  Hal Finkel authored Apr 16, 2012
```
llvm-svn: 154810
```
  52ba49f3
- Add a Fixme. · 82b90a38
  Bill Wendling authored Apr 16, 2012
```
llvm-svn: 154793
```
  82b90a38
- Simplify checking for pointer types in BBVectorize (this change was suggested by Duncan). · 8ee309d9
  Hal Finkel authored Apr 16, 2012
```
llvm-svn: 154787
```
  8ee309d9
Apr 14, 2012

Fix an error in BBVectorize important for vectorizing pointer types. · 83c97960

Hal Finkel authored Apr 14, 2012

When vectorizing pointer types it is important to realize that potential
pairs cannot be connected via the address pointer argument of a load or store.
This is because even after vectorization, the address is still a scalar because
the address of the higher half of the pair is implicit from the address of the
lower half (it need not be, and should not be, explicitly computed).

llvm-svn: 154735

83c97960

Enhance BBVectorize to more-properly handle pointer values and vectorize GEPs. · f589519a
Hal Finkel authored Apr 14, 2012
```
llvm-svn: 154734
```
f589519a

Apr 13, 2012

Add support to BBVectorize for vectorizing selects. · b2336a79
Hal Finkel authored Apr 13, 2012
```
llvm-svn: 154700
```
b2336a79
Add some comments, and fix a few places that missed setting Changed. · 670f9374
Dan Gohman authored Apr 13, 2012
```
llvm-svn: 154687
```
670f9374
Consider ObjC runtime calls objc_storeWeak and others which make a copy of · e1e352af
Dan Gohman authored Apr 13, 2012
```
their argument as "escape" points for objc_retainBlock optimization.
This fixes rdar://11229925.

llvm-svn: 154682
```
e1e352af

By default, use Early-CSE instead of GVN for vectorization cleanup. · 204bf535

Hal Finkel authored Apr 13, 2012

As has been suggested by Duncan and others, Early-CSE and GVN should
do similar redundancy elimination, but Early-CSE is much less expensive.
Most of my autovectorization benchmarks show a performance regression, but
all of these are < 0.1%, and so I think that it is still worth using
the less expensive pass.

llvm-svn: 154673

204bf535

Use the new Use-aware dominates method to apply the objc runtime · de8d2c44

Dan Gohman authored Apr 13, 2012

library return value optimization for phi uses. Even when the
phi itself is not dominated, the specific use may be dominated.

llvm-svn: 154647

de8d2c44