Commits · 4bf5c4b7460ca7e663d0ae8f3011a019812544b3 · Roger Ferrer / llvm-epi-0.8

Jun 08, 2013

Fix an assertion in MemCpyOpt pass. · bd254f26

Shuxin Yang authored Jun 07, 2013

  The MemCpyOpt pass is capable of optimizing:
      callee(&S); copy N bytes from S to D.
    into:
      callee(&D);
subject to some legality constraints. 

  Assertion is triggered when the compiler tries to evalute "sizeof(typeof(D))",
while D is an opaque-typed, 'sret' formal argument of function being compiled.
i.e. the signature of the func being compiled is something like this:
  T caller(...,%opaque* noalias nocapture sret %D, ...)

  The fix is that when come across such situation, instead of calling some
utility functions to get the size of D's type (which will crash), we simply
assume D has at least N bytes as implified by the copy-instruction.

rdar://14073661 

llvm-svn: 183584

bd254f26

Jun 07, 2013

[objc-arc] Ensure that the cfg path count does not overflow when we multiply... · 9e7261c8

Michael Gottesman authored Jun 07, 2013

[objc-arc] Ensure that the cfg path count does not overflow when we multiply TopDownPathCount/BottomUpPathCount.

rdar://12480535

llvm-svn: 183489

9e7261c8

Simplify code. No functionality change. · 96ff4d6d
Jakub Staszak authored Jun 06, 2013
```
llvm-svn: 183461
```
96ff4d6d

Jeffrey Yasskin volunteered to benchmark the vectorizer on -O2 or -Os when... · 99e529ea

Nadav Rotem authored Jun 06, 2013

Jeffrey Yasskin volunteered to benchmark the vectorizer on -O2 or -Os when compiling chrome. This patch adds a new flag to enable vectorization on all levels and not only on -O3. It should go away once we make a decision.

llvm-svn: 183456

99e529ea

Jun 06, 2013
- Re-apply "Use IRBuilder instead of ConstantInt methods." with the fixed issues. · bddea11b
  Jakub Staszak authored Jun 06, 2013
```
llvm-svn: 183439
```
  bddea11b
- Revert "Use IRBuilder instead of ConstantInt methods. It simplifies code a little bit." · a7bbc0b7
  Rafael Espindola authored Jun 06, 2013
```
This reverts commit 183328. It caused pr16244 and broke the bots.

llvm-svn: 183422
```
  a7bbc0b7
- Remove unneeded cast<>. · 9de494e0
  Jakub Staszak authored Jun 06, 2013
```
llvm-svn: 183363
```
  9de494e0
- Use IRBuilder instead of ConstantInt methods. · 461d1fe6
  Jakub Staszak authored Jun 06, 2013
```
llvm-svn: 183360
```
  461d1fe6
Jun 05, 2013
- Use IRBuilder instead of ConstantInt methods. It simplifies code a little bit. · 2f390b75
  Jakub Staszak authored Jun 05, 2013
```
llvm-svn: 183328
```
  2f390b75
Jun 04, 2013

IndVarSimplify: check if loop invariant expansion can trap · 29130c5e

David Majnemer authored Jun 04, 2013

IndVarSimplify is willing to move divide instructions outside of their
loop bodies if they are invariant of the loop.  However, it may not be
safe to expand them if we do not know if they can trap.

Instead, check to see if it is not safe to expand the instruction and
skip the expansion.

This fixes PR16041.

Testcase by Rafael Ávila de Espíndola.

llvm-svn: 183239

29130c5e

Second part of pr16069 · a5e536ab

Rafael Espindola authored Jun 04, 2013

The problem this time seems to be a thinko. We were assuming that in the CFG

A
| \
|  B
| /
C

speculating the basic block B would cause only the phi value for the B->C edge
to be speculated. That is not true, the phi's are semantically in the edges, so
if the A->B->C path is taken, any code needed for A->C is not executed and we
have to consider it too when deciding to speculate B.

llvm-svn: 183226

a5e536ab

Typo: s/caes/cases/ in SimplifyCFG · 5cf30be6
Hans Wennborg authored Jun 04, 2013
```
llvm-svn: 183219
```
5cf30be6
Delete dead safety check. · 688d668e
Nick Lewycky authored Jun 03, 2013
```
llvm-svn: 183167
```
688d668e

Jun 03, 2013

SimplifyCFG: Do not transform PHI to select if doing so would be unsafe · c82f27af

David Majnemer authored Jun 03, 2013

PR16069 is an interesting case where an incoming value to a PHI is a
trap value while also being a 'ConstantExpr'.

We do not consider this case when performing the 'HoistThenElseCodeToIf'
optimization.

Instead, make our modifications more conservative if we detect that we
cannot transform the PHI to a select.

llvm-svn: 183152

c82f27af

SimplifyCFG: Small cleanup, use ICmpInst::isEquality() · 8e7dd2f6
David Majnemer authored Jun 03, 2013
```
llvm-svn: 183151
```
8e7dd2f6
[asan] ASan Linux MIPS32 support (llvm part), patch by Jyun-Yan Y · 9e62b301
Kostya Serebryany authored Jun 03, 2013
```
llvm-svn: 183104
```
9e62b301

Jun 01, 2013
- When determining the new index for an insertelement, we may not assume that an · 3f715e26
  Nick Lewycky authored Jun 01, 2013
```
index greater than the size of the vector is invalid. The shuffle may be
shrinking the size of the vector. Fixes a crash!

Also drop the maximum recursion depth of the safety check for this
optimization to five.

llvm-svn: 183080
```
  3f715e26
- SimplifyCFG: Fix typo in comment for ComputeSpeculationCost · 91142c48
  David Majnemer authored Jun 01, 2013
```
llvm-svn: 183078
```
  91142c48
- Move getRealLinkageName to a common place and remove all the duplicates of it. · 7c275640
  Benjamin Kramer authored Jun 01, 2013
```
Also simplify code a bit while there. No functionality change.

llvm-svn: 183076
```
  7c275640
May 31, 2013

LoopVectorize: Change API call to get the backedge taken count · 7b1b4db3

Arnold Schwaighofer authored May 31, 2013

Use ScalarEvolution's getBackedgeTakenCount API instead of getExitCount since
that is really what we want to know. Using the more specific getExitCount was
safe because we made sure that there is only one exiting block.

No functionality change.

llvm-svn: 183047

7b1b4db3

Loop Strength Reduce: Scaling factor cost. · bf490d4a

Quentin Colombet authored May 31, 2013

Account for the cost of scaling factor in Loop Strength Reduce when rating the
formulae. This uses a target hook.

The default implementation of the hook is: if the addressing mode is legal, the
scaling factor is free.

<rdar://problem/13806271>

llvm-svn: 183045

bf490d4a

LoopVectorize: PHIs with only outside users should prevent vectorization · 70a9be52

Arnold Schwaighofer authored May 31, 2013

We check that instructions in the loop don't have outside users (except if
they are reduction values). Unfortunately, we skipped this check for
if-convertable PHIs.

Fixes PR16184.

llvm-svn: 183035

70a9be52

Modify how the formulae are rated in Loop Strength Reduce. · 8aa7abe2

Quentin Colombet authored May 31, 2013

Namely, check if the target allows to fold more that one register in the
addressing mode and if yes, adjust the cost accordingly.

Prior to this commit, reg1 + scale * reg2 accesses were artificially preferred
to reg1 + reg2 accesses. Indeed, the cost model wrongly assumed that reg1 + reg2
needs a temporary register for the computation, whereas it was correctly
estimated for reg1 + scale * reg2.

<rdar://problem/13973908>

llvm-svn: 183021

8aa7abe2

Simplify multiplications by vectors whose elements are powers of 2. · 65281bf3
Rafael Espindola authored May 31, 2013
```
Patch by Andrea Di Biagio.

llvm-svn: 183005
```
65281bf3

[msan] Handle mixed track-origins and keep-going settings (llvm part). · 888385e4

Evgeniy Stepanov authored May 31, 2013

Before this change, each module defined a weak_odr global __msan_track_origins
with a value of 1 if origin tracking is enabled, 0 if disabled. If there are
modules with different values, any of them may win. If 0 wins, and there is at
least one module with 1, the program will most likely crash.

With this change, __msan_track_origins is only emitted if origin tracking is
on. Then runtime library detects if there is at least one module with origin
tracking, and enables runtime support for it.

llvm-svn: 182997

888385e4

Reapply with r182909 with a fix to the calculation of the new indices for · a2b77206
Nick Lewycky authored May 31, 2013
```
insertelement instructions.

llvm-svn: 182976
```
a2b77206

May 30, 2013
- Revert r182909. · 2c142698
  Evgeniy Stepanov authored May 30, 2013
```
PR/16177

llvm-svn: 182919
```
  2c142698
- Swizzle vector inputs if it helps us eliminate shuffles. · d7f27094
  Nick Lewycky authored May 30, 2013
```
llvm-svn: 182909
```
  d7f27094
May 29, 2013
- LoopVectorize.cpp: Fix abuse of StringRef on Twine. Twine captures the pointer of StringRef. · d11b42aa
  NAKAMURA Takumi authored May 29, 2013
```
llvm-svn: 182820
```
  d11b42aa
- Whitespace. · d57ea870
  NAKAMURA Takumi authored May 29, 2013
```
llvm-svn: 182819
```
  d57ea870
May 28, 2013

Add support for llvm.vectorizer metadata · 5fdf836b

Paul Redmond authored May 28, 2013

- llvm.loop.parallel metadata has been renamed to llvm.loop to be more generic
  by making the root of additional loop metadata.
  - Loop::isAnnotatedParallel now looks for llvm.loop and associated
    llvm.mem.parallel_loop_access
  - document llvm.loop and update llvm.mem.parallel_loop_access
- add support for llvm.vectorizer.width and llvm.vectorizer.unroll
  - document llvm.vectorizer.* metadata
  - add utility class LoopVectorizerHints for getting/setting loop metadata
  - use llvm.vectorizer.width=1 to indicate already vectorized instead of
    already_vectorized
- update existing tests that used llvm.loop.parallel and
  llvm.vectorizer.already_vectorized

Reviewed by: Nadav Rotem

llvm-svn: 182802

5fdf836b

Extend RemapInstruction and friends to take an optional new parameter, a ValueMaterializer. · f6f121e2

James Molloy authored May 28, 2013

Extend LinkModules to pass a ValueMaterializer to RemapInstruction and friends to lazily create Functions for lazily linked globals. This is a big win when linking small modules with large (mostly unused) library modules.

llvm-svn: 182776

f6f121e2

[msan] Fix argument shadow alignment. · fca01233
Evgeniy Stepanov authored May 28, 2013
```
llvm-svn: 182771
```
fca01233

May 25, 2013
- Replace Count{Leading,Trailing}Zeros_{32,64} with count{Leading,Trailing}Zeros. · df1ecbd7
  Michael J. Spencer authored May 24, 2013
```
llvm-svn: 182680
```
  df1ecbd7
May 24, 2013

[objc-arc] KnownSafe does not imply that it is safe to perform code motion... · e67f40c5

Michael Gottesman authored May 24, 2013

[objc-arc] KnownSafe does not imply that it is safe to perform code motion across CFG edges since even if it is safe to remove RR pairs, we may still be able to move a retain/release into a loop.

rdar://13949644

llvm-svn: 182670

e67f40c5

[objc-arc] Make sure that multiple owners is propogated correctly through the... · 5a91bbf3

Michael Gottesman authored May 24, 2013

[objc-arc] Make sure that multiple owners is propogated correctly through the pass via the usage of a global data structure.

rdar://13750319

llvm-svn: 182669

5a91bbf3

LoopVectorize: LoopSimplify can't canonicalize loops with an indirectbr in it,... · 6ac1e623

Benjamin Kramer authored May 24, 2013

LoopVectorize: LoopSimplify can't canonicalize loops with an indirectbr in it, don't assert on those cases.

Fixes PR16139.

llvm-svn: 182656

6ac1e623

Run clang-format over the scalarizePHI function. · b34294d0
Joey Gouly authored May 24, 2013
```
llvm-svn: 182640
```
b34294d0

scalarizePHI needs to insert the next ExtractElement in the same block · 83699284

Joey Gouly authored May 24, 2013

as the BinaryOperator, *not* in the block where the IRBuilder is currently
inserting into. Fixes a bug where scalarizePHI would create instructions
that would not dominate all uses.

llvm-svn: 182639

83699284

Re-implement DebugIR in a way that does not subclass AssemblyWriter: · fddddbea

Daniel Malea authored May 23, 2013

- move AsmWriter.h from public headers into lib
- marked all AssemblyWriter functions as non-virtual; no need to override them
- DebugIR now "plugs into" AssemblyWriter with an AssemblyAnnotationWriter helper
- exposed flags to control hiding of a) debug metadata b) debug intrinsic calls

C/R: Paul Redmond

llvm-svn: 182617

fddddbea