Commits · a0653a3e6cbc820632050bdcf27704ac52fe0559 · Lorenzo Albano / LLVM bpEVL

Apr 29, 2014

Add optimization remarks to the loop unroller and vectorizer. · 34fc8a7c

Diego Novillo authored Apr 29, 2014

Summary:
This calls emitOptimizationRemark from the loop unroller and vectorizer
at the point where they make a positive transformation. For the
vectorizer, it reports vectorization and interleave factors. For the
loop unroller, it reports all the different supported types of
unrolling.

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D3456

llvm-svn: 207528

34fc8a7c

Apr 25, 2014
- [C++] Use 'nullptr'. Transforms edition. · f40110f4
  Craig Topper authored Apr 25, 2014
```
llvm-svn: 207196
```
  f40110f4
Apr 22, 2014

[Modules] Fix potential ODR violations by sinking the DEBUG_TYPE · 964daaaf

Chandler Carruth authored Apr 22, 2014

definition below all of the header #include lines, lib/Transforms/...
edition.

This one is tricky for two reasons. We again have a couple of passes
that define something else before the includes as well. I've sunk their
name macros with the DEBUG_TYPE.

Also, InstCombine contains headers that need DEBUG_TYPE, so now those
headers #define and #undef DEBUG_TYPE around their code, leaving them
well formed modular headers. Fixing these headers was a large motivation
for all of these changes, as "leaky" macros of this form are hard on the
modules implementation.

llvm-svn: 206844

964daaaf

Jan 28, 2014

[LPM] Fix PR18616 where the shifts to the loop pass manager to extract · d84f776e

Chandler Carruth authored Jan 28, 2014

LCSSA from it caused a crasher with the LoopUnroll pass.

This crasher is really nasty. We destroy LCSSA form in a suprising way.
When unrolling a loop into an outer loop, we not only need to restore
LCSSA form for the outer loop, but for all children of the outer loop.
This is somewhat obvious in retrospect, but hey!

While this seems pretty heavy-handed, it's not that bad. Fundamentally,
we only do this when we unroll a loop, which is already a heavyweight
operation. We're unrolling all of these hypothetical inner loops as
well, so their size and complexity is already on the critical path. This
is just adding another pass over them to re-canonicalize.

I have a test case from PR18616 that is great for reproducing this, but
pretty useless to check in as it relies on many 10s of nested empty
loops that get unrolled and deleted in just the right order. =/ What's
worse is that investigating this has exposed another source of failure
that is likely to be even harder to test. I'll try to come up with test
cases for these fixes, but I want to get the fixes into the tree first
as they're causing crashes in the wild.

llvm-svn: 200273

d84f776e

Jan 23, 2014

[LPM] Make LoopSimplify no longer a LoopPass and instead both a utility · aa7fa5e4

Chandler Carruth authored Jan 23, 2014

function and a FunctionPass.

This has many benefits. The motivating use case was to be able to
compute function analysis passes *after* running LoopSimplify (to avoid
invalidating them) and then to run other passes which require
LoopSimplify. Specifically passes like unrolling and vectorization are
critical to wire up to BranchProbabilityInfo and BlockFrequencyInfo so
that they can be profile aware. For the LoopVectorize pass the only
things in the way are LoopSimplify and LCSSA. This fixes LoopSimplify
and LCSSA is next on my list.

There are also a bunch of other benefits of doing this:
- It is now very feasible to make more passes *preserve* LoopSimplify
  because they can simply run it after changing a loop. Because
  subsequence passes can assume LoopSimplify is preserved we can reduce
  the runs of this pass to the times when we actually mutate a loop
  structure.
- The new pass manager should be able to more easily support loop passes
  factored in this way.
- We can at long, long last observe that LoopSimplify is preserved
  across SCEV. This *halves* the number of times we run LoopSimplify!!!

Now, getting here wasn't trivial. First off, the interfaces used by
LoopSimplify are all over the map regarding how analysis are updated. We
end up with weird "pass" parameters as a consequence. I'll try to clean
at least some of this up later -- I'll have to have it all clean for the
new pass manager.

Next up I discovered a really frustrating bug. LoopUnroll *claims* to
preserve LoopSimplify. That's actually a lie. But the way the
LoopPassManager ends up running the passes, it always ran LoopSimplify
on the unrolled-into loop, rectifying this oversight before any
verification could kick in and point out that in fact nothing was
preserved. So I've added code to the unroller to *actually* simplify the
surrounding loop when it succeeds at unrolling.

The only functional change in the test suite is that we now catch a case
that was previously missed because SCEV and other loop transforms see
their containing loops as simplified and thus don't miss some
opportunities. One test case has been converted to check that we catch
this case rather than checking that we miss it but at least don't get
the wrong answer.

Note that I have #if-ed out all of the verification logic in
LoopSimplify! This is a temporary workaround while extracting these bits
from the LoopPassManager. Currently, there is no way to have a pass in
the LoopPassManager which preserves LoopSimplify along with one which
does not. The LPM will try to verify on each loop in the nest that
LoopSimplify holds but the now-Function-pass cannot distinguish what
loop is being verified and so must try to verify all of them. The inner
most loop is clearly no longer simplified as there is a pass which
didn't even *attempt* to preserve it. =/ Once I get LCSSA out (and maybe
LoopVectorize and some other fixes) I'll be able to re-enable this check
and catch any places where we are still failing to preserve
LoopSimplify. If this causes problems I can back this out and try to
commit *all* of this at once, but so far this seems to work and allow
much more incremental progress.

llvm-svn: 199884

aa7fa5e4

Jan 13, 2014

[PM] Split DominatorTree into a concrete analysis result object which · 73523021

Chandler Carruth authored Jan 13, 2014

can be used by both the new pass manager and the old.

This removes it from any of the virtual mess of the pass interfaces and
lets it derive cleanly from the DominatorTreeBase<> template. In turn,
tons of boilerplate interface can be nuked and it turns into a very
straightforward extension of the base DominatorTree interface.

The old analysis pass is now a simple wrapper. The names and style of
this split should match the split between CallGraph and
CallGraphWrapperPass. All of the users of DominatorTree have been
updated to match using many of the same tricks as with CallGraph. The
goal is that the common type remains the resulting DominatorTree rather
than the pass. This will make subsequent work toward the new pass
manager significantly easier.

Also in numerous places things became cleaner because I switched from
re-running the pass (!!! mid way through some other passes run!!!) to
directly recomputing the domtree.

llvm-svn: 199104

73523021

[cleanup] Move the Dominators.h and Verifier.h headers into the IR · 5ad5f15c

Chandler Carruth authored Jan 13, 2014

directory. These passes are already defined in the IR library, and it
doesn't make any sense to have the headers in Analysis.

Long term, I think there is going to be a much better way to divide
these matters. The dominators code should be fully separated into the
abstract graph algorithm and have that put in Support where it becomes
obvious that evn Clang's CFGBlock's can use it. Then the verifier can
manually construct dominance information from the Support-driven
interface while the Analysis library can provide a pass which both
caches, reconstructs, and supports a nice update API.

But those are very long term, and so I don't want to leave the really
confusing structure until that day arrives.

llvm-svn: 199082

5ad5f15c

Dec 07, 2013
- Don't #include heavy Dominators.h file in LoopInfo.h. This change reduces · 3ab283c1
  Jakub Staszak authored Dec 07, 2013
```
overall time of LLVM compilation by ~1%.

llvm-svn: 196667
```
  3ab283c1
Nov 17, 2013
- Utils/LoopUnroll.cpp: Tweak (StringRef)OldName to be valid until it is used, since r194601. · f9c8339a
  NAKAMURA Takumi authored Nov 17, 2013
```
eraseFromParent() invalidates OldName.

llvm-svn: 194970
```
  f9c8339a
Nov 13, 2013
- Use StringRef instead of std::string · 86a7492f
  Jakub Staszak authored Nov 13, 2013
```
llvm-svn: 194601
```
  86a7492f
Sep 16, 2013
- Replace some unnecessary vector copies with references. · 7d605268
  Benjamin Kramer authored Sep 15, 2013
```
llvm-svn: 190770
```
  7d605268
Jan 02, 2013

Move all of the header files which are involved in modelling the LLVM IR · 9fb823bb

Chandler Carruth authored Jan 02, 2013

into their new header subdirectory: include/llvm/IR. This matches the
directory structure of lib, and begins to correct a long standing point
of file layout clutter in LLVM.

There are still more header files to move here, but I wanted to handle
them in separate commits to make tracking what files make sense at each
layer easier.

The only really questionable files here are the target intrinsic
tablegen files. But that's a battle I'd rather not fight today.

I've updated both CMake and Makefile build systems (I think, and my
tests think, but I may have missed something).

I've also re-sorted the includes throughout the project. I'll be
committing updates to Clang, DragonEgg, and Polly momentarily.

llvm-svn: 171366

9fb823bb

Dec 03, 2012

Use the new script to sort the includes of every file under lib. · ed0881b2

Chandler Carruth authored Dec 03, 2012

Sooooo many of these had incorrect or strange main module includes.
I have manually inspected all of these, and fixed the main module
include to be the nearest plausible thing I could find. If you own or
care about any of these source files, I encourage you to take some time
and check that these edits were sensible. I can't have broken anything
(I strictly added headers, and reordered them, never removed), but they
may not be the headers you'd really like to identify as containing the
API being implemented.

Many forward declarations and missing includes were added to a header
files to allow them to parse cleanly when included first. The main
module rule does in fact have its merits. =]

llvm-svn: 169131

ed0881b2

Jun 05, 2012
- LoopUnroll: always check for NULL LoopPassManager · a6fb910f
  Andrew Trick authored Jun 05, 2012
```
llvm-svn: 158007
```
  a6fb910f
May 08, 2012
- Allow NULL LoopPassManager argument in UnrollLoop. PR12734. · d29cd732
  Andrew Trick authored May 08, 2012
```
llvm-svn: 156358
```
  d29cd732
Apr 10, 2012

Fix 12513: Loop unrolling breaks with indirect branches. · 4442bfe5

Andrew Trick authored Apr 10, 2012

Take this opportunity to generalize the indirectbr bailout logic for
loop transformations. CFG transformations will never get indirectbr
right, and there's no point trying.

llvm-svn: 154386

4442bfe5

Dec 16, 2011
- Avoid a confusing assert for silly options: -unroll-runtime -unroll-count=1. · ca3417e9
  Andrew Trick authored Dec 16, 2011
```
No need for an explicit test case for an unsupported combination of options.

llvm-svn: 146721
```
  ca3417e9
Dec 09, 2011

Add -unroll-runtime for unrolling loops with run-time trip counts. · d04d1529

Andrew Trick authored Dec 09, 2011

Patch by Brendon Cahoon!

This extends the existing LoopUnroll and LoopUnrollPass. Brendon
measured no regressions in the llvm test suite with -unroll-runtime
enabled. This implementation works by using the existing loop
unrolling code to unroll the loop by a power-of-two (default 8). It
generates an if-then-else sequence of code prior to the loop to
execute the extra iterations before entering the unrolled loop.

llvm-svn: 146245

d04d1529

Aug 10, 2011

Comments. Thanks for the spell check Nick! · 6dbb0607

Andrew Trick authored Aug 10, 2011

Also, my apologies for spoiling the autocomplete on SimplifyInstructions.cpp. I couldn't think of a better filename.

llvm-svn: 137229

6dbb0607

Invoke SimplifyIndVar when we partially unroll a loop. Fixes PR10534. · 4d0040ba
Andrew Trick authored Aug 10, 2011
```
llvm-svn: 137203
```
4d0040ba
Cleanup. Added LoopBlocksDFS::perform for simple clients. · 78b40c3f
Andrew Trick authored Aug 10, 2011
```
llvm-svn: 137195
```
78b40c3f

Fix the LoopUnroller to handle nontrivial loops and partial unrolling. · b72bbe2a

Andrew Trick authored Aug 10, 2011

These are not individual bug fixes. I had to rewrite a good chunk of
the unroller to make it sane. I think it was getting lucky on trivial
completely unrolled loops with no early exits. I included some fairly
simple unit tests for partial unrolling. I didn't do much stress
testing, so it may not be perfect, but should be usable now.

llvm-svn: 137190

b72bbe2a

Aug 09, 2011
- LoopUnroll looks like it has some stale code. Remove it to prove my sanity and... · 5e0ee1c7
  Andrew Trick authored Aug 09, 2011
```
LoopUnroll looks like it has some stale code. Remove it to prove my sanity and avoid further confusion.

llvm-svn: 137106
```
  5e0ee1c7
Aug 03, 2011
- SCEV: Use AssertingVH to catch dangling BasicBlock* when passes forget · bf69d033
  Andrew Trick authored Aug 03, 2011
```
to notify SCEV of a change. Add forgetLoop in a couple of those places.

llvm-svn: 136797
```
  bf69d033
Jul 26, 2011
- Add clarifying comments for the new arguments to UnrollLoop. · 990f771a
  Andrew Trick authored Jul 25, 2011
```
llvm-svn: 135988
```
  990f771a
Jul 23, 2011
- Move trip count discovery outside of the generic LoopUnroll helper. This · 1cabe54f
  Andrew Trick authored Jul 23, 2011
```
removes its dependence on canonical induction variables.

llvm-svn: 135829
```
  1cabe54f
- whitespace · 279e7a6c
  Andrew Trick authored Jul 23, 2011
```
llvm-svn: 135828
```
  279e7a6c
Jun 23, 2011

Reinstate r133513 (reverted in r133700) with an additional fix for a · 61ea0e46
Jay Foad authored Jun 23, 2011
```
-Wshorten-64-to-32 warning in Instructions.h.

llvm-svn: 133708
```
61ea0e46

Revert r133513: · 96513120

Eric Christopher authored Jun 23, 2011

"Reinstate r133435 and r133449 (reverted in r133499) now that the clang
self-hosted build failure has been fixed (r133512)."

Due to some additional warnings.

llvm-svn: 133700

96513120

Jun 21, 2011
- Remove unused variables. · ccbb77f2
  Benjamin Kramer authored Jun 21, 2011
```
llvm-svn: 133514
```
  ccbb77f2
- Reinstate r133435 and r133449 (reverted in r133499) now that the clang · a97a2c99
  Jay Foad authored Jun 21, 2011
```
self-hosted build failure has been fixed (r133512).

llvm-svn: 133513
```
  a97a2c99
- Revert r133435 and r133449 to appease buildbots. · 184f3b37
  Chad Rosier authored Jun 21, 2011
```
llvm-svn: 133499
```
  184f3b37
Jun 20, 2011

Change how PHINodes store their operands. · e03c05c3

Jay Foad authored Jun 20, 2011

Change PHINodes to store simple pointers to their incoming basic blocks,
instead of full-blown Uses.

Note that this loses an optimization in SplitCriticalEdge(), because we
can no longer walk the use list of a BasicBlock to find phi nodes. See
the comment I removed starting "However, the foreach loop is slow for
blocks with lots of predecessors".

Extend replaceAllUsesWith() on a BasicBlock to also update any phi
nodes in the block's successors. This mimics what would have happened
when PHINodes were proper Users of their incoming blocks. (Note that
this only works if OldBB->replaceAllUsesWith(NewBB) is called when
OldBB still has a terminator instruction, so it still has some
successors.)

llvm-svn: 133435

e03c05c3

Feb 18, 2011

Don't unroll loops whose header block's address is taken. · 4a14fbc5

Chris Lattner authored Feb 18, 2011

This is part of a futile attempt to not "break" bizzaro
code like this:

 l1:
  printf("l1: %p\n", &&l1);
  ++x;

  if( x < 3 ) goto l1;


Previously we'd fold &&l1 to 1, which is fine per our semantics
but not helpful to the user.

llvm-svn: 125827

4a14fbc5

Jan 11, 2011
- random cleanups · dfcfcb49
  Chris Lattner authored Jan 11, 2011
```
llvm-svn: 123221
```
  dfcfcb49
Jan 07, 2011
- Remove all uses of the "ugly" method BranchInst::setUnconditionalDest(). · 89afb43b
  Jay Foad authored Jan 07, 2011
```
llvm-svn: 123025
```
  89afb43b
Nov 23, 2010
- Replace calls to ConstantFoldInstruction with calls to SimplifyInstruction · 433c1679
  Duncan Sands authored Nov 23, 2010
```
in two places that are really interested in simplified instructions, not
constants.

llvm-svn: 120044
```
  433c1679
Oct 13, 2010
- Be more consistent in using ValueToValueMapTy. · 229e38f0
  Rafael Espindola authored Oct 13, 2010
```
llvm-svn: 116387
```
  229e38f0
Jul 26, 2010
- Preserve ScalarEvolution in the loop unroller. · a7908ae3
  Dan Gohman authored Jul 26, 2010
```
llvm-svn: 109412
```
  a7908ae3
Jun 24, 2010

Use ValueMap instead of DenseMap. · 0dc3c2d3

Devang Patel authored Jun 24, 2010

The ValueMapper used by various cloning utility maps MDNodes also.

llvm-svn: 106706

0dc3c2d3