Commits · e211e204da3afd42dcb544a865a0930b04bd779b · Lorenzo Albano / LLVM bpEVL

Sep 28, 2015

[GlobalOpt] Sort members of llvm.used deterministically · ace7818c

Sean Silva authored Sep 28, 2015

Patch by Jake VanAdrighem!

Summary:
Fix the way we sort the llvm.used and llvm.compiler.used members.

This bug seems to have been introduced in rL183756 through a set of improper casts to GlobalValue*. In subsequent patches this problem was missed and transformed into a getName call on a ConstantExpr.

Reviewers: silvas

Subscribers: silvas, llvm-commits

Differential Revision: http://reviews.llvm.org/D12851

llvm-svn: 248728

ace7818c
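As a rough illustration of why sorting by name is deterministic, here is a minimal sketch on a toy type. `ToyGlobal` and `sortByName` are hypothetical names, not LLVM APIs, and the real patch works on the members of llvm.used rather than on a std::vector. The sketch assumes names are unique, so the resulting order does not depend on where the objects happen to live in memory.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Toy stand-in for a named global (hypothetical, not llvm::GlobalValue).
struct ToyGlobal {
  std::string name;
};

// Order the members by name. Unlike ordering by pointer value, the result
// is the same on every run, whatever addresses the objects were given.
void sortByName(std::vector<const ToyGlobal *> &members) {
  std::sort(members.begin(), members.end(),
            [](const ToyGlobal *a, const ToyGlobal *b) {
              return a->name < b->name;
            });
}
```

Two lists holding the same members in different initial orders end up identical after `sortByName`.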

Improve performance of SimplifyInstructionsInBlock · f74cc40e

Fiona Glaser authored Sep 28, 2015

1. Use a worklist, not a recursive approach, to avoid needless
   revisitation and being repeatedly forced to jump back to the
   start of the BB if a handle is invalidated.

2. Only insert operands to the worklist if they become unused
   after a dead instruction is removed, so we don’t have to
   visit them again in most cases.

3. Use a SmallSetVector to track the worklist.

4. Instead of pre-initting the SmallSetVector like in
   DeadCodeEliminationPass, only put things into the worklist
   if they have to be revisited after the first run-through.
   This minimizes how much the actual SmallSetVector gets used,
   which saves a lot of time.

llvm-svn: 248727

f74cc40e
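The four points above can be sketched on a toy instruction model. Everything here (`ToyInst`, `ToySetVector`, `removeDeadInBlock`) is hypothetical and only mimics the shape of the approach; it is not the code from the patch, and it only removes dead instructions rather than simplifying them.

```cpp
#include <cstddef>
#include <unordered_set>
#include <vector>

// Toy stand-in for an instruction (hypothetical, not llvm::Instruction).
struct ToyInst {
  std::vector<std::size_t> operands; // indices of instructions this one reads
  bool hasSideEffects = false;       // e.g. a store; never removed
  bool erased = false;
  unsigned numUses = 0;              // live instructions that read this one
};

// Insertion-ordered set, playing the role SmallSetVector plays in the patch.
class ToySetVector {
  std::vector<std::size_t> order;
  std::unordered_set<std::size_t> seen;

public:
  bool insert(std::size_t v) {
    if (!seen.insert(v).second)
      return false;
    order.push_back(v);
    return true;
  }
  std::size_t size() const { return order.size(); }
  std::size_t operator[](std::size_t i) const { return order[i]; }
};

// Erase block[idx] if it is dead, and queue only those operands whose
// last use just disappeared (point 2).
static bool eraseIfDead(std::vector<ToyInst> &block, std::size_t idx,
                        ToySetVector &worklist) {
  ToyInst &inst = block[idx];
  if (inst.erased || inst.hasSideEffects || inst.numUses != 0)
    return false;
  inst.erased = true;
  for (std::size_t op : inst.operands)
    if (--block[op].numUses == 0)
      worklist.insert(op);
  return true;
}

// Returns the number of instructions erased.
std::size_t removeDeadInBlock(std::vector<ToyInst> &block) {
  for (ToyInst &inst : block)
    inst.numUses = 0;
  for (const ToyInst &inst : block)
    if (!inst.erased)
      for (std::size_t op : inst.operands)
        ++block[op].numUses;

  ToySetVector worklist; // starts empty instead of being pre-filled (point 4)
  std::size_t erased = 0;
  // First run-through, in block order.
  for (std::size_t i = 0; i < block.size(); ++i)
    erased += eraseIfDead(block, i, worklist);
  // Revisit only what the first pass queued; the list may grow as we go,
  // so iterate by index rather than by iterator.
  for (std::size_t i = 0; i < worklist.size(); ++i)
    erased += eraseIfDead(block, worklist[i], worklist);
  return erased;
}
```

In a chain where instruction 2 reads 1 and 1 reads 0, erasing 2 queues 1 for a single revisit; nothing forces a restart from the top of the block.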

[LoopReroll] Ignore debug intrinsics · 310770a9

Weiming Zhao authored Sep 28, 2015

Previously, debug intrinsics and annotation intrinsics could prevent
the loop from being rerolled; now they are ignored.

Differential Revision: http://reviews.llvm.org/D13150

llvm-svn: 248718

310770a9

Sep 27, 2015

[InstCombine] fold zexts and constants into a phi (PR24766) · 95334075

Sanjay Patel authored Sep 27, 2015

This is one step towards solving PR24766:
https://llvm.org/bugs/show_bug.cgi?id=24766

We were not producing the same IR for these two C functions because the store
to the temp bool causes extra zexts:

#include <stdbool.h>

bool switchy(char x1, char x2, char condition) {
   bool conditionMet = false;
   switch (condition) {
   case 0: conditionMet = (x1 == x2); break;
   case 1: conditionMet = (x1 <= x2); break;
   }
   return conditionMet;
}

bool switchy2(char x1, char x2, char condition) {
   switch (condition) {
   case 0: return (x1 == x2);
   case 1: return (x1 <= x2);
   }
   return false;
}

As noted in the code comments, this test case manages to avoid the more general existing
phi optimizations where there are only 2 phi inputs or where there are no constant phi
args mixed in with the cast ops. It seems like a corner case, but if we don't catch it,
then I don't think we can get SimplifyCFG to further optimize towards the canonical form
for this function shown in the bug report.

Differential Revision: http://reviews.llvm.org/D12866

llvm-svn: 248689

95334075

[EH] Create removeUnwindEdge utility · 09af67ab

Joseph Tremoulet authored Sep 27, 2015

Summary:
Factor the code that rewrites invokes to calls and rewrites WinEH
terminators to their "unwind to caller" equivalents into a helper in
Utils/Local, and use it in the three places I'm aware of that need to do
this.


Reviewers: andrew.w.kaylor, majnemer, rnk

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D13152

llvm-svn: 248677

09af67ab

Sep 26, 2015

[InstCombine] match De Morgan's Law hidden by zext ops (PR22723) · e1b09caa

Sanjay Patel authored Sep 25, 2015

This is a fix for PR22723:
https://llvm.org/bugs/show_bug.cgi?id=22723

My first attempt at this was to change what I thought was the root problem:

xor (zext i1 X to i32), 1 --> zext (xor i1 X, true) to i32

...but we create the opposite pattern in InstCombiner::visitZExt(), so infinite loop!

My next idea was to fix the matchIfNot() implementation in PatternMatch, but that would
mean potentially returning a different size for the match than what was input. I think
this would require all users of m_Not to check the size of the returned match, so I 
abandoned that idea.

I settled on just fixing the exact case presented in the PR. This patch does allow the
2 functions in PR22723 to compile identically (x86):

bool test(bool x, bool y) { return !x | !y; }
bool test(bool x, bool y) { return !x || !y; }
...
andb	%sil, %dil
xorb	$1, %dil
movb	%dil, %al
retq

Differential Revision: http://reviews.llvm.org/D12705

llvm-svn: 248634

e1b09caa
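The identity behind this fold is De Morgan's law restricted to booleans: `!x | !y` equals `!(x & y)`, which is the single AND plus XOR-with-1 seen in the assembly above. A small sketch that checks all four input combinations follows; the function names are made up for illustration.

```cpp
// Source-level form from the PR: OR of two negations.
constexpr bool notOrNot(bool x, bool y) { return !x | !y; }

// Equivalent form after applying De Morgan's law: one AND, one negation.
constexpr bool notOfAnd(bool x, bool y) { return !(x & y); }

// Exhaustive compile-time check over the four possible inputs.
static_assert(notOrNot(false, false) == notOfAnd(false, false), "00");
static_assert(notOrNot(false, true) == notOfAnd(false, true), "01");
static_assert(notOrNot(true, false) == notOfAnd(true, false), "10");
static_assert(notOrNot(true, true) == notOfAnd(true, true), "11");
```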

Sep 25, 2015

ADCE: Fix typo in file comment. NFC · 0638b7ba

Justin Bogner authored Sep 25, 2015

llvm-svn: 248613

0638b7ba

Sep 24, 2015

[InstCombine] Recognize another bswap idiom. · 2720593a

Charlie Turner authored Sep 24, 2015

Summary:
The byte-swap recognizer can now notice that this

```
uint32_t bswap(uint32_t x)
{
  x = (x & 0x0000FFFF) << 16 | (x & 0xFFFF0000) >> 16;
  x = (x & 0x00FF00FF) << 8 | (x & 0xFF00FF00) >> 8;
  return x;
}
```
    
is a bswap. Fixes PR23863.

Reviewers: nlewycky, hfinkel, hans, jmolloy, rengolin

Subscribers: majnemer, rengolin, llvm-commits

Differential Revision: http://reviews.llvm.org/D12637

llvm-svn: 248482

2720593a
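To see that the two-step idiom really is a byte swap, it can be compared against a straightforward byte-by-byte reference. This is only a sanity-check sketch; `bswapIdiom` and `bswapReference` are illustrative names and have nothing to do with how the recognizer itself is implemented.

```cpp
#include <cstdint>

// The idiom from the commit message: swap the 16-bit halves, then swap
// the bytes inside each half.
std::uint32_t bswapIdiom(std::uint32_t x) {
  x = (x & 0x0000FFFFu) << 16 | (x & 0xFFFF0000u) >> 16;
  x = (x & 0x00FF00FFu) << 8 | (x & 0xFF00FF00u) >> 8;
  return x;
}

// Reference: move each of the four bytes to its mirrored position.
std::uint32_t bswapReference(std::uint32_t x) {
  return (x << 24) | ((x << 8) & 0x00FF0000u) |
         ((x >> 8) & 0x0000FF00u) | (x >> 24);
}
```

For example, both functions map 0x11223344 to 0x44332211.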

Add CFG Simplification pass after Loop Unswitching. · 74621cce

Michael Zolotukhin authored Sep 24, 2015

Loop unswitching produces conditional branches with constant condition,
and it's beneficial for later passes to clean this up with simplify-cfg.
We do this after the second invocation of loop-unswitch, but not after
the first one. Not doing so might cause problems for passes like
LoopUnroll, whose estimate of loop body size would be less accurate.

Reviewers: hfinkel

Differential Revision: http://reviews.llvm.org/D13064

llvm-svn: 248460

74621cce

[safestack] Fix compiler crash in the presence of stack restores. · 8685daf2

Evgeniy Stepanov authored Sep 24, 2015

A use can be emitted before def in a function with stack restore
points but no static allocas.

llvm-svn: 248455

8685daf2

[Unroll] When completely unrolling the loop, replace conditional branches with unconditional. · d56ee06d

Michael Zolotukhin authored Sep 23, 2015

Nothing is expected to change, except we do less redundant work in
clean-up.

Reviewers: hfinkel

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D12951

llvm-svn: 248444

d56ee06d

Put profile variables of COMDAT functions into their own COMDAT group. · 3cc9204a

Wei Mi authored Sep 23, 2015

In -fprofile-instr-generate compilation, to remove the redundant profile
variables for the COMDAT functions, these variables are placed in the same
COMDAT group as their associated function. This way, when the COMDAT function
is not picked by the linker, those profile variables will also not be
output in the final binary. This may cause warnings when linking a mix of
objects built with and without -fprofile-instr-generate.

This patch puts the profile variables for COMDAT functions into their own
COMDAT group to avoid the problem.

Patch by xur.
Differential Revision: http://reviews.llvm.org/D12248

llvm-svn: 248440

3cc9204a

Sep 23, 2015

Swap loop invariant GEP with loop variant GEP to allow more LICM. · cac0b892

Lawrence Hu authored Sep 23, 2015

    This patch changes the order of GEPs generated by the Splitting GEPs
    pass. Specifically, when one of the GEPs has a constant offset and the
    base is loop invariant, we now generate the GEP with the constant first
    when beneficial, to expose more cases for LICM.

    If originally Splitting GEPs generated the following:
      do.body.i:
        %idxprom.i = sext i32 %shr.i to i64
        %2 = bitcast %typeD* %s to i8*
        %3 = shl i64 %idxprom.i, 2
        %uglygep = getelementptr i8, i8* %2, i64 %3
        %uglygep7 = getelementptr i8, i8* %uglygep, i64 1032
      ...
    Now it generates:
      do.body.i:
        %idxprom.i = sext i32 %shr.i to i64
        %2 = bitcast %typeD* %s to i8*
        %3 = shl i64 %idxprom.i, 2
        %uglygep = getelementptr i8, i8* %2, i64 1032
        %uglygep7 = getelementptr i8, i8* %uglygep, i64 %3
      ...

    For no-loop cases, the original way of generating GEPs seems to
    expose more CSE cases, so we don't change the logic for no-loop
    cases, and only limit our change to the specific case we are
    interested in.

llvm-svn: 248420

cac0b892
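The effect of the reordering can be sketched at the source level. With the constant applied first, `base + 1032` no longer depends on the loop index and can be computed once outside the loop. The functions below are a hand-written illustration with hypothetical names; the hoisting is done manually here, whereas in LLVM it would be LICM's job.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Constant offset matching the 1032 in the IR above.
constexpr std::size_t kFieldOffset = 1032;

// Old order: variable offset first, constant second. Both address
// computations depend on the index, so neither can leave the loop.
std::uint64_t sumVariantFirst(const unsigned char *base,
                              const std::vector<std::size_t> &indices) {
  std::uint64_t sum = 0;
  for (std::size_t i : indices) {
    const unsigned char *p = base + (i << 2);
    sum += *(p + kFieldOffset);
  }
  return sum;
}

// New order: base + constant is loop invariant, so it is computed once
// before the loop and only the variable part stays inside.
std::uint64_t sumInvariantFirst(const unsigned char *base,
                                const std::vector<std::size_t> &indices) {
  const unsigned char *fieldBase = base + kFieldOffset;
  std::uint64_t sum = 0;
  for (std::size_t i : indices)
    sum += *(fieldBase + (i << 2));
  return sum;
}
```

Both functions read the same addresses and return the same sum, provided the buffer is large enough for the largest index; only the amount of per-iteration address arithmetic differs.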

[InstCombine] Preserve metadata when merging loads that are phi · f6afd115

Akira Hatanaka authored Sep 23, 2015

arguments.

Make sure InstCombiner::FoldPHIArgLoadIntoPHI doesn't drop the following
metadata:

MD_tbaa
MD_alias_scope
MD_noalias
MD_invariant_load
MD_nonnull
MD_range

rdar://problem/17617709

Differential Revision: http://reviews.llvm.org/D12710

llvm-svn: 248419

f6afd115

Android support for SafeStack. · a2002b08

Evgeniy Stepanov authored Sep 23, 2015

Add two new ways of accessing the unsafe stack pointer:

* At a fixed offset from the thread TLS base. This is very similar to
  StackProtector cookies, but we plan to extend it to other backends
  (ARM in particular) soon. Bionic-side implementation here:
  https://android-review.googlesource.com/170988.
* Via a function call, as a fallback for platforms that provide
  neither a fixed TLS slot, nor a reasonable TLS implementation (i.e.
  not emutls).

This is a re-commit of a change in r248357 that was reverted in
r248358.

llvm-svn: 248405

a2002b08

[Inline] Use AssumptionCache from the right Function · ff08e926

Vedant Kumar authored Sep 23, 2015

This changes the behavior of AddAlignmentAssumptions to match its
comment. I.e., prove the asserted alignment in the context of the caller,
not the callee.

Thanks to Mehdi Amini for seeing the issue here! Also to Artur Pilipenko
who also saw a fix for the issue.

rdar://22521387

Differential Revision: http://reviews.llvm.org/D12997

llvm-svn: 248390

ff08e926

[DeadArgElim] Split the invoke successor edge · fa36bde2

David Majnemer authored Sep 23, 2015

Invoking a function which returns an aggregate can sometimes be
transformed to return a scalar value.  However, this means that we need
to create one or more insertvalue instructions to recreate the correct
aggregate type.  We achieved this by inserting an insertvalue
instruction at the invoke's normal successor.  However, this is not
feasible if the normal successor uses the invoke's return value inside a
PHI node.

Instead, split the edge between the invoke and the normal successor and
create the insertvalue instruction in the new basic block.  The new
basic block's successor will be the old invoke successor which leaves
us with IR which is well behaved.

This fixes PR24906.

llvm-svn: 248387

fa36bde2

[DeadStoreElimination] Remove dead zero store to calloc initialized memory · 029bd93c

Igor Laevsky authored Sep 23, 2015

This change allows dead store elimination to remove zero and null stores into memory freshly allocated with a calloc-like function.

Differential Revision: http://reviews.llvm.org/D13021

llvm-svn: 248374

029bd93c
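A source-level picture of the kind of store this targets, as a minimal sketch: the memory comes back from calloc already zeroed, so writing zero into it again changes nothing. `makeZeroedCounters` is a made-up function for illustration.

```cpp
#include <cstddef>
#include <cstdlib>

// Allocate n zero-initialized counters. The explicit store of 0 below is
// redundant because calloc has already zeroed the memory; it is the kind
// of store the change lets dead store elimination remove.
int *makeZeroedCounters(std::size_t n) {
  int *counters = static_cast<int *>(std::calloc(n, sizeof(int)));
  if (counters != nullptr && n > 0)
    counters[0] = 0; // redundant zero store into fresh calloc memory
  return counters;
}
```

The caller owns the returned buffer and releases it with `std::free`.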

[X86][SSE] Replace 128-bit SSE41 PMOVSX intrinsics with native IR · 9cb018b6

Simon Pilgrim authored Sep 23, 2015

This patch removes the x86.sse41.pmovsx* intrinsics, provides a suitable upgrade path and updates relevant tests to sign extend a subvector instead.

LLVM counterpart to D12835

Differential Revision: http://reviews.llvm.org/D13002

llvm-svn: 248368

9cb018b6

[SCEV] Introduce ScalarEvolution::getOne and getZero. · 2aacc0ec

Sanjoy Das authored Sep 23, 2015

Summary:
It is fairly common to call SE->getConstant(Ty, 0) or
SE->getConstant(Ty, 1); this change makes such uses a little bit
briefer.

I've refactored the call sites I could find easily to use getZero /
getOne.

Reviewers: hfinkel, majnemer, reames

Subscribers: sanjoy, llvm-commits

Differential Revision: http://reviews.llvm.org/D12947

llvm-svn: 248362

2aacc0ec

Revert "Android support for SafeStack." · 8d0e3011

Evgeniy Stepanov authored Sep 23, 2015

test/Transforms/SafeStack/abi.ll breaks when target is not supported;
needs refactoring.

llvm-svn: 248358

8d0e3011

Android support for SafeStack. · ce2e16f0

Evgeniy Stepanov authored Sep 23, 2015

Add two new ways of accessing the unsafe stack pointer:

* At a fixed offset from the thread TLS base. This is very similar to
  StackProtector cookies, but we plan to extend it to other backends
  (ARM in particular) soon. Bionic-side implementation here:
  https://android-review.googlesource.com/170988.
* Via a function call, as a fallback for platforms that provide
  neither a fixed TLS slot, nor a reasonable TLS implementation (i.e.
  not emutls).

llvm-svn: 248357

ce2e16f0

[Unroll] Do not crash trying to propagate a value to vector load. · deade196

Michael Zolotukhin authored Sep 22, 2015

llvm-svn: 248333

deade196

Sep 22, 2015

[Unroll] Follow-up for r247769: fix a bug in UnrolledInstAnalyzer::visitLoad. · 8bb31dd0

Michael Zolotukhin authored Sep 22, 2015

Apart from checking that GlobalVariable is a constant, we should check
that it's not a weak constant, in which case we can't propagate its
value.

llvm-svn: 248327

8bb31dd0

Prune trailing whitespaces. · 10c80e79

NAKAMURA Takumi authored Sep 22, 2015

llvm-svn: 248265

10c80e79

Untabify. · 0a7d0ad9

NAKAMURA Takumi authored Sep 22, 2015

llvm-svn: 248264

0a7d0ad9

Reformat blank lines. · a9cb538a

NAKAMURA Takumi authored Sep 22, 2015

llvm-svn: 248263

a9cb538a

Reformat comment lines. · 84965031

NAKAMURA Takumi authored Sep 22, 2015

llvm-svn: 248262

84965031

Reformat. · 70ad98ac

NAKAMURA Takumi authored Sep 22, 2015

llvm-svn: 248261

70ad98ac

Remove unused TargetTransformInfo dependency from SafeStack pass. · 3c9c8338

Evgeniy Stepanov authored Sep 22, 2015

llvm-svn: 248233

3c9c8338

[LoopUnswitch] Require DominatorTree info. · 9f3aea6e

Michael Zolotukhin authored Sep 22, 2015

Summary:
We should either require the DT info to be available, or check if it's
available in every place we use DT (and we already miss such a check in
one place, which causes failures in some cases). As other loop passes
preserve DT and it's usually available, it makes sense to just require
it here.

There is no regression test, because the bug only shows up if the pass
manager decides to clean DT info right before LoopUnswitch. If
loop-unswitch is run separately, DT is available, so the bug isn't exposed.

Reviewers: chandlerc, hfinkel

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D13036

llvm-svn: 248230

9f3aea6e

[LICM] Hoist calls to readonly argmemonly functions even with stores in the loop · 5f99423d

Philip Reames authored Sep 21, 2015

We know that an argmemonly function can only access memory pointed to by its pointer arguments. Rather than needing to consider all possible stores as aliasing (as we do for a readonly function), we need only consider the aliasing of the pointer arguments.

Note that this change only addresses hoisting. I'm thinking about how to address speculation safety as well, but that will be a different change.

FYI, argmemonly disallows accessing memory through non-pointer typed arguments.

Differential Revision: http://reviews.llvm.org/D12771

llvm-svn: 248220

5f99423d

Sep 21, 2015

Fix UB: can't bind a reference to nullptr (NFC) · 24e20583

Mehdi Amini authored Sep 21, 2015

From: Mehdi Amini <mehdi.amini@apple.com>

llvm-svn: 248213

24e20583

[LoopUtils,LV] Propagate fast-math flags on generated FCmp instructions · 50a4c27f

James Molloy authored Sep 21, 2015

We're currently losing any fast-math flags when synthesizing fcmps for
min/max reductions. In LV, make sure we copy over the scalar inst's
flags. In LoopUtils, we know we only ever match patterns with
hasUnsafeAlgebra, so apply that to any synthesized ops.

llvm-svn: 248201

50a4c27f

[FunctionAttrs] Extract a helper function for the core logic used to · 7542d376

Chandler Carruth authored Sep 21, 2015

evaluate whether 'readonly' or 'readnone' apply to a given function.
This both reduces indentation and will make it easy to share the logic
with a new pass manager implementation.

llvm-svn: 248181

7542d376

add ShouldChangeType() variant that takes bitwidths · 55dcd40d

Sanjay Patel authored Sep 21, 2015

This is more efficient for cases like D12965 where we already have widths.

llvm-svn: 248170

55dcd40d

don't repeat function names in comments; NFC · 84dca494

Sanjay Patel authored Sep 21, 2015

llvm-svn: 248166

84dca494

Sep 20, 2015

[IndVars] Use C++11 style field initialization; NFCI. · 7cc2cfec

Sanjoy Das authored Sep 20, 2015

llvm-svn: 248131

7cc2cfec

[IndVars] Don't add a level of indentation for namespace {. NFC. · e1e352d5

Sanjoy Das authored Sep 20, 2015

Whitespace-only change.

llvm-svn: 248130

e1e352d5

[IndVars] Don't repeat function names in comment; NFC. · 9119bf4c

Sanjoy Das authored Sep 20, 2015

Only changes comments.

llvm-svn: 248112

9119bf4c