Commits · 19e291aac04f3fcaf3de55763e496b187158c938 · Lorenzo Albano / LLVM bpEVL

Oct 12, 2015

GlobalOpt does not treat externally_initialized globals correctly · 939724cd

Oliver Stannard authored Oct 12, 2015

GlobalOpt currently merges stores into the initialisers of internal,
externally_initialized globals, but should not do so as the value of the global
may change between the initialiser and any code in the module being run.

llvm-svn: 250035

939724cd

Oct 09, 2015

[PlaceSafeopints] Extract out `callsGCLeafFunction`, NFC · c21a05a3

Sanjoy Das authored Oct 08, 2015

Summary:
This will be used in a later change to RewriteStatepointsForGC.

Reviewers: reames, swaroop.sridhar

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D13490

llvm-svn: 249777

c21a05a3

Oct 07, 2015

[IndVars] Preserve LCSSA in `eliminateIdentitySCEV` · 0015e5a0

Sanjoy Das authored Oct 07, 2015

Summary:
After r249211, SCEV can see through some LCSSA phis.  Add a
`replacementPreservesLCSSAForm` check before replacing uses of these phi
nodes with a simplified use of the induction variable to avoid breaking
LCSSA.

Fixes 25047.

Depends on D13460.

Reviewers: atrick, hfinkel

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D13461

llvm-svn: 249575

0015e5a0

Fix Clang-tidy modernize-use-nullptr warnings in source directories and... · 083ca9bb

Hans Wennborg authored Oct 06, 2015

Fix Clang-tidy modernize-use-nullptr warnings in source directories and generated files; other minor cleanups.

Patch by Eugene Zelenko!

Differential Revision: http://reviews.llvm.org/D13321

llvm-svn: 249482

083ca9bb

Oct 06, 2015

[IndVars] Don't break dominance in `eliminateIdentitySCEV` · 5c8bead4

Sanjoy Das authored Oct 06, 2015

Summary:
After r249211, `getSCEV(X) == getSCEV(Y)` does not guarantee that X and
Y are related in the dominator tree, even if X is an operand to Y (I've
included a toy example in comments, and a real example as a test case).

This commit changes `SimplifyIndVar` to require a `DominatorTree`.  I
don't think this is a problem because `ScalarEvolution` requires it
anyway.

Fixes PR25051.

Depends on D13459.

Reviewers: atrick, hfinkel

Subscribers: joker.eph, llvm-commits, sanjoy

Differential Revision: http://reviews.llvm.org/D13460

llvm-svn: 249471

5c8bead4

[IndVars] Extract out eliminateIdentitySCEV, NFC · 088bb0ea

Sanjoy Das authored Oct 06, 2015

Summary:
Reflow a comment while at it.

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D13459

llvm-svn: 249470

088bb0ea

Oct 03, 2015

inariant.group handling in GVN · dc9b2cfc

Piotr Padlewski authored Oct 02, 2015

The most important part required to make clang
devirtualization works ( ͡°͜ʖ ͡°).
The code is able to find non local dependencies, but unfortunatelly
because the caller can only handle local dependencies, I had to add
some restrictions to look for dependencies only in the same BB.

http://reviews.llvm.org/D12992

llvm-svn: 249196

dc9b2cfc

Oct 02, 2015

[SimplifyLibCalls] Fix instruction misplacement in string/memory libcall optimization · b491a2d6

Bruno Cardoso Lopes authored Oct 01, 2015

When trying to optimize fortified library functions use the right
location to insert new instructions in order to preserve correct
def-use order.

This fixes an issue where a misplaced instruction definition would
happen to be *after* one of its use after a RAUW, forming invalid IR.
This behavior was introduced by r227250.

Differential Revision: http://reviews.llvm.org/D13301

rdar://problem/22802369

llvm-svn: 249092

b491a2d6

Sep 30, 2015
- Fix debug info with SafeStack. · f608111d
  Evgeniy Stepanov authored Sep 30, 2015
```
llvm-svn: 248933
```
  f608111d
Sep 29, 2015

Move dbg.declare intrinsics when merging and replacing allocas. · d8b86f7c

Evgeniy Stepanov authored Sep 29, 2015

Place new and update dbg.declare calls immediately after the
corresponding alloca.

Current code in replaceDbgDeclareForAlloca puts the new dbg.declare
at the end of the basic block. LLVM codegen has problems emitting
debug info in a situation when dbg.declare appears after all uses of
the variable. This usually kinda works for inlining and ASan (two
users of this function) but not for SafeStack (see the pending change
in http://reviews.llvm.org/D13178).

llvm-svn: 248769

d8b86f7c

Sep 28, 2015

Improve performance of SimplifyInstructionsInBlock · f74cc40e

Fiona Glaser authored Sep 28, 2015

1. Use a worklist, not a recursive approach, to avoid needless
   revisitation and being repeatedly forced to jump back to the
   start of the BB if a handle is invalidated.

2. Only insert operands to the worklist if they become unused
   after a dead instruction is removed, so we don’t have to
   visit them again in most cases.

3. Use a SmallSetVector to track the worklist.

4. Instead of pre-initting the SmallSetVector like in
   DeadCodeEliminationPass, only put things into the worklist
   if they have to be revisited after the first run-through.
   This minimizes how much the actual SmallSetVector gets used,
   which saves a lot of time.

llvm-svn: 248727

f74cc40e

Sep 27, 2015

[EH] Create removeUnwindEdge utility · 09af67ab

Joseph Tremoulet authored Sep 27, 2015

Summary:
Factor the code that rewrites invokes to calls and rewrites WinEH
terminators to their "unwind to caller" equivalents into a helper in
Utils/Local, and use it in the three places I'm aware of that need to do
this.


Reviewers: andrew.w.kaylor, majnemer, rnk

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D13152

llvm-svn: 248677

09af67ab

Sep 24, 2015

[Unroll] When completely unrolling the loop, replace conditinal branches with unconditional. · d56ee06d

Michael Zolotukhin authored Sep 23, 2015

Nothing is expected to change, except we do less redundant work in
clean-up.

Reviewers: hfinkel

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D12951

llvm-svn: 248444

d56ee06d

Sep 23, 2015

[Inline] Use AssumptionCache from the right Function · ff08e926

Vedant Kumar authored Sep 23, 2015

This changes the behavior of AddAligntmentAssumptions to match its
comment. I.e, prove the asserted alignment in the context of the caller,
not the callee.

Thanks to Mehdi Amini for seeing the issue here! Also to Artur Pilipenko
who also saw a fix for the issue.

rdar://22521387

Differential Revision: http://reviews.llvm.org/D12997

llvm-svn: 248390

ff08e926

[SCEV] Introduce ScalarEvolution::getOne and getZero. · 2aacc0ec

Sanjoy Das authored Sep 23, 2015

Summary:
It is fairly common to call SE->getConstant(Ty, 0) or
SE->getConstant(Ty, 1); this change makes such uses a little bit
briefer.

I've refactored the call sites I could find easily to use getZero /
getOne.

Reviewers: hfinkel, majnemer, reames

Subscribers: sanjoy, llvm-commits

Differential Revision: http://reviews.llvm.org/D12947

llvm-svn: 248362

2aacc0ec

Sep 21, 2015

[LoopUtils,LV] Propagate fast-math flags on generated FCmp instructions · 50a4c27f

James Molloy authored Sep 21, 2015

We're currently losing any fast-math flags when synthesizing fcmps for
min/max reductions. In LV, make sure we copy over the scalar inst's
flags. In LoopUtils, we know we only ever match patterns with
hasUnsafeAlgebra, so apply that to any synthesized ops.

llvm-svn: 248201

50a4c27f

Sep 16, 2015
- don't repeat function names in comments; NFC · 815adacd
  Sanjay Patel authored Sep 16, 2015
```
llvm-svn: 247813
```
  815adacd
Sep 15, 2015
- more space; NFC · f9b77635
  Sanjay Patel authored Sep 15, 2015
```
llvm-svn: 247699
```
  f9b77635
Sep 14, 2015

[opaque pointer types] Switch a few cases of getElementType over, since I had... · 6614d8d2
David Blaikie authored Sep 14, 2015
```
[opaque pointer types] Switch a few cases of getElementType over, since I had them lying around anyway

llvm-svn: 247610
```
6614d8d2

Revert "[opaque pointer type] Pass GlobalAlias the actual pointer type rather... · 16a2f3e3

David Blaikie authored Sep 14, 2015

Revert "[opaque pointer type] Pass GlobalAlias the actual pointer type rather than decomposing it into pointee type + address space"

This was a flawed change - it just caused the getElementType call to be
deferred until later, when we really need to remove it. Now that the IR
for GlobalAliases has been updated, the root cause is addressed that way
instead and this change is no longer needed (and in fact gets in the way
- because we want to pass the pointee type directly down further).

Follow up patches to push this through GlobalValue, bitcode format, etc,
will come along soon.

This reverts commit 236160.

llvm-svn: 247585

16a2f3e3

Sep 11, 2015
- Remove gcc warning when comparing an unsigned var for >= 0 · 48b090a3
  Filipe Cabecinhas authored Sep 10, 2015
```
llvm-svn: 247352
```
  48b090a3
Sep 10, 2015

[LV] Relax Small Size Reduction Type Requirement · 29dc0f70

Matthew Simpson authored Sep 10, 2015

This patch enables small size reductions in which the source types are smaller
than the reduction type (e.g., computing an i16 sum from the values in an i8
array). The previous behavior was to only allow small size reductions if the
source types and reduction type were the same. The change accounts for the fact
that the existing sign- and zero-extend instructions in these cases should
still be included in the cost model.

Differential Revision: http://reviews.llvm.org/D12770

llvm-svn: 247337

29dc0f70

[SimplifyCFG] Use known bits to eliminate dead switch defaults · 05370139

Philip Reames authored Sep 10, 2015

This is a follow up to http://reviews.llvm.org/D11995 implementing the suggestion by Hans.

If we know some of the bits of the value being switched on, we know that the maximum number of unique cases covers the unknown bits. This allows to eliminate switch defaults for large integers (i32) when most bits in the value are known.

Note that I had to make the transform contingent on not having any dead cases. This is conservatively correct with the old code, but required for the new code since we might have a dead case which varies one of the known bits. Counting that towards our number of covering cases would be bad. If we do have dead cases, we'll eliminate them first, then revisit the possibly dead default.

Differential Revision: http://reviews.llvm.org/D12497

llvm-svn: 247309

05370139

80-cols; NFC · 9361d355
Sanjay Patel authored Sep 10, 2015
```
llvm-svn: 247295
```
9361d355
use range-based for loop; NFCI · f4b34b76
Sanjay Patel authored Sep 10, 2015
```
llvm-svn: 247294
```
f4b34b76
use range-based for loop; NFCI · 5e7bd918
Sanjay Patel authored Sep 10, 2015
```
llvm-svn: 247293
```
5e7bd918
fix typo; NFC · 59661459
Sanjay Patel authored Sep 10, 2015
```
llvm-svn: 247287
```
59661459

Sep 09, 2015

[PM/AA] Rebuild LLVM's alias analysis infrastructure in a way compatible · 7b560d40

Chandler Carruth authored Sep 09, 2015

with the new pass manager, and no longer relying on analysis groups.

This builds essentially a ground-up new AA infrastructure stack for
LLVM. The core ideas are the same that are used throughout the new pass
manager: type erased polymorphism and direct composition. The design is
as follows:

- FunctionAAResults is a type-erasing alias analysis results aggregation
  interface to walk a single query across a range of results from
  different alias analyses. Currently this is function-specific as we
  always assume that aliasing queries are *within* a function.

- AAResultBase is a CRTP utility providing stub implementations of
  various parts of the alias analysis result concept, notably in several
  cases in terms of other more general parts of the interface. This can
  be used to implement only a narrow part of the interface rather than
  the entire interface. This isn't really ideal, this logic should be
  hoisted into FunctionAAResults as currently it will cause
  a significant amount of redundant work, but it faithfully models the
  behavior of the prior infrastructure.

- All the alias analysis passes are ported to be wrapper passes for the
  legacy PM and new-style analysis passes for the new PM with a shared
  result object. In some cases (most notably CFL), this is an extremely
  naive approach that we should revisit when we can specialize for the
  new pass manager.

- BasicAA has been restructured to reflect that it is much more
  fundamentally a function analysis because it uses dominator trees and
  loop info that need to be constructed for each function.

All of the references to getting alias analysis results have been
updated to use the new aggregation interface. All the preservation and
other pass management code has been updated accordingly.

The way the FunctionAAResultsWrapperPass works is to detect the
available alias analyses when run, and add them to the results object.
This means that we should be able to continue to respect when various
passes are added to the pipeline, for example adding CFL or adding TBAA
passes should just cause their results to be available and to get folded
into this. The exception to this rule is BasicAA which really needs to
be a function pass due to using dominator trees and loop info. As
a consequence, the FunctionAAResultsWrapperPass directly depends on
BasicAA and always includes it in the aggregation.

This has significant implications for preserving analyses. Generally,
most passes shouldn't bother preserving FunctionAAResultsWrapperPass
because rebuilding the results just updates the set of known AA passes.
The exception to this rule are LoopPass instances which need to preserve
all the function analyses that the loop pass manager will end up
needing. This means preserving both BasicAAWrapperPass and the
aggregating FunctionAAResultsWrapperPass.

Now, when preserving an alias analysis, you do so by directly preserving
that analysis. This is only necessary for non-immutable-pass-provided
alias analyses though, and there are only three of interest: BasicAA,
GlobalsAA (formerly GlobalsModRef), and SCEVAA. Usually BasicAA is
preserved when needed because it (like DominatorTree and LoopInfo) is
marked as a CFG-only pass. I've expanded GlobalsAA into the preserved
set everywhere we previously were preserving all of AliasAnalysis, and
I've added SCEVAA in the intersection of that with where we preserve
SCEV itself.

One significant challenge to all of this is that the CGSCC passes were
actually using the alias analysis implementations by taking advantage of
a pretty amazing set of loop holes in the old pass manager's analysis
management code which allowed analysis groups to slide through in many
cases. Moving away from analysis groups makes this problem much more
obvious. To fix it, I've leveraged the flexibility the design of the new
PM components provides to just directly construct the relevant alias
analyses for the relevant functions in the IPO passes that need them.
This is a bit hacky, but should go away with the new pass manager, and
is already in many ways cleaner than the prior state.

Another significant challenge is that various facilities of the old
alias analysis infrastructure just don't fit any more. The most
significant of these is the alias analysis 'counter' pass. That pass
relied on the ability to snoop on AA queries at different points in the
analysis group chain. Instead, I'm planning to build printing
functionality directly into the aggregation layer. I've not included
that in this patch merely to keep it smaller.

Note that all of this needs a nearly complete rewrite of the AA
documentation. I'm planning to do that, but I'd like to make sure the
new design settles, and to flesh out a bit more of what it looks like in
the new pass manager first.

Differential Revision: http://reviews.llvm.org/D12080

llvm-svn: 247167

7b560d40

Sep 07, 2015
- Prune utf8 chars in comments. · 0d72539d
  NAKAMURA Takumi authored Sep 07, 2015
```
llvm-svn: 246953
```
  0d72539d
Sep 05, 2015
- Fix build warning. · 02a55d70
  Craig Topper authored Sep 05, 2015
```
llvm-svn: 246908
```
  02a55d70
- Fix build warning · 2a9a6d8c
  Andrew Kaylor authored Sep 05, 2015
```
llvm-svn: 246903
```
  2a9a6d8c
- Fix build warning · a212aba6
  Andrew Kaylor authored Sep 04, 2015
```
llvm-svn: 246899
```
  a212aba6
- [WinEH] Teach SimplfyCFG to eliminate empty cleanup pads. · 50e4e86c
  Andrew Kaylor authored Sep 04, 2015
```
Differential Revision: http://reviews.llvm.org/D12434

llvm-svn: 246896
```
  50e4e86c
Sep 03, 2015

[WinEH] Add cleanupendpad instruction · 9ce71f76

Joseph Tremoulet authored Sep 03, 2015

Summary:
Add a `cleanupendpad` instruction, used to mark exceptional exits out of
cleanups (for languages/targets that can abort a cleanup with another
exception).  The `cleanupendpad` instruction is similar to the `catchendpad`
instruction in that it is an EH pad which is the target of unwind edges in
the handler and which itself has an unwind edge to the next EH action.
The `cleanupendpad` instruction, similar to `cleanupret` has a `cleanuppad`
argument indicating which cleanup it exits.  The unwind successors of a
`cleanuppad`'s `cleanupendpad`s must agree with each other and with its
`cleanupret`s.

Update WinEHPrepare (and docs/tests) to accomodate `cleanupendpad`.

Reviewers: rnk, andrew.w.kaylor, majnemer

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D12433

llvm-svn: 246751

9ce71f76

Sep 02, 2015

Constant propagation after hitting assume(cmp) bugfix · 28ffcbe1

Piotr Padlewski authored Sep 02, 2015

Last time code run into assertion `BBE.isSingleEdge()` in
lib/IR/Dominators.cpp:200.

http://reviews.llvm.org/D12170

llvm-svn: 246696

28ffcbe1

[RemoveDuplicatePHINodes] Start over after removing a PHI. · f175e044

Benjamin Kramer authored Sep 02, 2015

This makes RemoveDuplicatePHINodes more effective and fixes an assertion
failure. Triggering the assertions requires a DenseSet reallocation
so this change only contains a constructive test.

I'll explain the issue with a small example. In the following function
there's a duplicate PHI, %4 and %5 are identical. When this is found
the DenseSet in RemoveDuplicatePHINodes contains %2, %3 and %4.

define void @F() {
  br label %1

; <label>:1                                       ; preds = %1, %0
  %2 = phi i32 [ 42, %0 ], [ %4, %1 ]
  %3 = phi i32 [ 42, %0 ], [ %5, %1 ]
  %4 = phi i32 [ 42, %0 ], [ 23, %1 ]
  %5 = phi i32 [ 42, %0 ], [ 23, %1 ]
  br label %1
}

after RemoveDuplicatePHINodes runs the function looks like this. %3 has
changed and is now identical to %2, but RemoveDuplicatePHINodes never
saw this.

define void @F() {
  br label %1

; <label>:1                                       ; preds = %1, %0
  %2 = phi i32 [ 42, %0 ], [ %4, %1 ]
  %3 = phi i32 [ 42, %0 ], [ %4, %1 ]
  %4 = phi i32 [ 42, %0 ], [ 23, %1 ]
  br label %1
}

If the DenseSet does a reallocation now it will reinsert all
keys and stumble over %3 now having a different hash value than it had
when inserted into the map for the first time. This change clears the
set whenever a PHI is deleted and starts the progress from the
beginning, allowing %3 to be deleted and avoiding inconsistent DenseSet
state. This potentially has a negative performance impact because
it rescans all PHIs, but I don't think that this ever makes a difference
in practice.

llvm-svn: 246694

f175e044

Aug 28, 2015
- Optimize memcmp(x,y,n)==0 for small n and suitably aligned x/y. · dc65532f
  Chad Rosier authored Aug 28, 2015
```
http://reviews.llvm.org/D6952
PR20673

llvm-svn: 246313
```
  dc65532f
- Revert r246244 and r246243 · 61db34d1
  Steven Wu authored Aug 28, 2015
```
These two commits cause clang/llvm bootstrap to hang.

llvm-svn: 246279
```
  61db34d1
- Constant propagation after hitting assume(cmp) bugfix · 3f81ec1e
  Piotr Padlewski authored Aug 28, 2015
```
Last time code run into assertion `BBE.isSingleEdge()` in
lib/IR/Dominators.cpp:200.

http://reviews.llvm.org/D12170

llvm-svn: 246244
```
  3f81ec1e
Aug 27, 2015

[LoopVectorize] Add Support for Small Size Reductions. · c94f8e29

Chad Rosier authored Aug 27, 2015

Unlike scalar operations, we can perform vector operations on element types that
are smaller than the native integer types. We type-promote scalar operations if
they are smaller than a native type (e.g., i8 arithmetic is promoted to i32
arithmetic on Arm targets). This patch detects and removes type-promotions
within the reduction detection framework, enabling the vectorization of small
size reductions.

In the legality phase, we look through the ANDs and extensions that InstCombine
creates during promotion, keeping track of the smaller type. In the
profitability phase, we use the smaller type and ignore the ANDs and extensions
in the cost model. Finally, in the code generation phase, we truncate the result
of the reduction to allow InstCombine to rewrite the entire expression in the
smaller type.

This fixes PR21369.
http://reviews.llvm.org/D12202

Patch by Matt Simpson <mssimpso@codeaurora.org>!

llvm-svn: 246149

c94f8e29