Commits · be4fe32eb8ad20e47fecf7e85e57c7e522ec5143 · Lorenzo Albano / LLVM bpEVL

Apr 15, 2014

Add lifetime markers for allocas created to hold byval arguments, make them · be4fe32e
Julien Lerouge authored Apr 15, 2014
```
appear in the InlineFunctionInfo.

llvm-svn: 206308
```
be4fe32e
Split byval argument initialization so the memcpy(s) are injected at the · 957e91c4
Julien Lerouge authored Apr 15, 2014
```
beginning of the first new block after inlining.

llvm-svn: 206307
```
957e91c4

LTO: Add more loop simplification passes to LTO · 2b69189c

Duncan P. N. Exon Smith authored Apr 15, 2014

Similar to r202051, add missing loop simplification passes to the LTO
optimization pipeline.

Patch by Rafael Espindola.

llvm-svn: 206306

2b69189c

verify-di: Implement DebugInfoVerifier · 6ef5f284

Duncan P. N. Exon Smith authored Apr 15, 2014

Implement DebugInfoVerifier, which steals verification relying on
DebugInfoFinder from Verifier.

  - Adds LegacyDebugInfoVerifierPassPass, a ModulePass which wraps
    DebugInfoVerifier.  Uses -verify-di command-line flag.

  - Change verifyModule() to invoke DebugInfoVerifier as well as
    Verifier.

  - Add a call to createDebugInfoVerifierPass() wherever there was a
    call to createVerifierPass().

This implementation as a module pass should sidestep efficiency issues,
allowing us to turn debug info verification back on.

<rdar://problem/15500563>

llvm-svn: 206300

6ef5f284

D3348 - [BUG] "Rotate Loop" pass kills "llvm.vectorizer.enable" metadata · b97f9e86
Alexey Bataev authored Apr 15, 2014
```
llvm-svn: 206266
```
b97f9e86

Apr 14, 2014
- Revert "Revert r206045, "Fix shift by constants for vector."" · fed3dc8d
  Matt Arsenault authored Apr 14, 2014
```
Fix cases where the Value itself is used, and not the constant value.

llvm-svn: 206214
```
  fed3dc8d
- Whitespace. · 58ad0c87
  NAKAMURA Takumi authored Apr 14, 2014
```
llvm-svn: 206154
```
  58ad0c87
- Revert r206045, "Fix shift by constants for vector." · 26afa982
  NAKAMURA Takumi authored Apr 14, 2014
```
It broke some builders, at least, i686.

llvm-svn: 206153
```
  26afa982
- Use APInt arithmetic, fixed typo. Thanks to Benjamin Kramer for noticing that. · b5f3ddc7
  Serge Pavlov authored Apr 14, 2014
```
llvm-svn: 206144
```
  b5f3ddc7
Apr 13, 2014

Recognize test for overflow in integer multiplication. · 4bb54d51

Serge Pavlov authored Apr 13, 2014

If multiplication involves zero-extended arguments and the result is
compared as in the patterns:

    %mul32 = trunc i64 %mul64 to i32
    %zext = zext i32 %mul32 to i64
    %overflow = icmp ne i64 %mul64, %zext
or
    %overflow = icmp ugt i64 %mul64 , 0xffffffff

then the multiplication may be replaced by call to umul.with.overflow.
This change fixes PR4917 and PR4918.

Differential Revision: http://llvm-reviews.chandlerc.com/D2814

llvm-svn: 206137

4bb54d51

Apr 11, 2014

Fix shift by constants for vector. · 173a1e57
Matt Arsenault authored Apr 11, 2014
```
ashr <N x iM>, <N x iM> M -> undef

llvm-svn: 206045
```
173a1e57

Implement depth_first and inverse_depth_first range factory functions. · ceec2bda

David Blaikie authored Apr 11, 2014

Also updated as many loops as I could find using df_begin/idf_begin -
strangely I found no uses of idf_begin. Is that just used out of tree?

Also a few places couldn't use df_begin because either they used the
member functions of the depth first iterators or had specific ordering
constraints (I added a comment in the latter case).

Based on a patch by Jim Grosbach. (Jim - you just had iterator_range<T>
where you needed iterator_range<idf_iterator<T>>)

llvm-svn: 206016

ceec2bda

Apr 10, 2014
- Reapply "SLPVectorizer: Ignore users that are insertelements we can reschedule them" · b373e01d
  Arnold Schwaighofer authored Apr 10, 2014
```
This commit reapplies 205018. After 205855 we should correctly vectorize
intrinsics.

llvm-svn: 205965
```
  b373e01d
Apr 09, 2014
- Fix some doc and comment typos · 16f98b25
  Alp Toker authored Apr 09, 2014
```
llvm-svn: 205899
```
  16f98b25
- SLPVectorizer: Only vectorize intrinsics whose operands are widened equally · fd0bf5d6
  Arnold Schwaighofer authored Apr 09, 2014
```
The vectorizer only knows how to vectorize intrinics by widening all operands by
the same factor.

Patch by Tyler Nowicki!

llvm-svn: 205855
```
  fd0bf5d6
Apr 08, 2014

Add support for optimization reports. · a9298b22

Diego Novillo authored Apr 08, 2014

Summary:
This patch adds backend support for -Rpass=, which indicates the name
of the optimization pass that should emit remarks stating when it
made a transformation to the code.

Pass names are taken from their DEBUG_NAME definitions.

When emitting an optimization report diagnostic, the lack of debug
information causes the diagnostic to use "<unknown>:0:0" as the
location string.

This is the back end counterpart for

http://llvm-reviews.chandlerc.com/D3226

Reviewers: qcolombet

CC: llvm-commits

Differential Revision: http://llvm-reviews.chandlerc.com/D3227

llvm-svn: 205774

a9298b22

Apr 07, 2014
- Add NDEBUG markers around debug only function. · fcd222aa
  Eric Christopher authored Apr 07, 2014
```
llvm-svn: 205706
```
  fcd222aa
- Add debug location information to the vectorizer debug statements. · 6ab6a064
  Eric Christopher authored Apr 07, 2014
```
Patch by Zinovy Nis.

llvm-svn: 205705
```
  6ab6a064
Apr 05, 2014

Fixing typo. · 64256968

David Blaikie authored Apr 05, 2014

Differential Revision: http://reviews.llvm.org/D3154

llvm-svn: 205674

64256968

Apr 03, 2014
- Fix PR19270 - type mismatch caused by invalid optimization. · 9966b26d
  Eli Bendersky authored Apr 03, 2014
```
Patch by Jingyue Wu.

llvm-svn: 205547
```
  9966b26d
- Revert "[Constant Hoisting] Lazily compute the idom and cache the result." · c81000b8
  Juergen Ributzka authored Apr 03, 2014
```
This code is no longer usefull, because we only compute and use the
IDom once. There is no benefit in caching it anymore.

llvm-svn: 205498
```
  c81000b8
- Revert "Reapply "LTO: add API to set strategy for -internalize"" · 4680f40d
  Duncan P. N. Exon Smith authored Apr 02, 2014
```
This reverts commit r199244.

Conflicts:
	include/llvm-c/lto.h
	include/llvm/LTO/LTOCodeGenerator.h
	lib/LTO/LTOCodeGenerator.cpp

llvm-svn: 205471
```
  4680f40d
Apr 02, 2014

SLPVectorizer: compare entire intrinsic for SLP compatibility. · 670df3d9

Tim Northover authored Apr 02, 2014

Some Intrinsics are overloaded to the extent that return type equality (all
that's been checked up to now) does not guarantee that the arguments are the
same. In these cases SLP vectorizer should not recurse into the operands, which
can be achieved by comparing them as "Function *" rather than simply the ID.

llvm-svn: 205424

670df3d9

[LoopVectorizer] Count dependencies of consecutive pointers as uniforms · b0ebdc0f

Hal Finkel authored Apr 02, 2014

For the purpose of calculating the cost of the loop at various vectorization
factors, we need to count dependencies of consecutive pointers as uniforms
(which means that the VF = 1 cost is used for all overall VF values).

For example, the TSVC benchmark function s173 has:
  ...
  %3 = add nsw i64 %indvars.iv, 16000
  %arrayidx8 = getelementptr inbounds %struct.GlobalData* @global_data, i64 0, i32 0, i64 %3
  ...
and we must realize that the add will be a scalar in order to correctly deduce
it to be profitable to vectorize this on PowerPC with VSX enabled. In fact, all
dependencies of a consecutive pointer must be a scalar (uniform), and so we
simply need to add all consecutive pointers to the worklist that currently
detects collects uniforms.

Fixes PR19296.

llvm-svn: 205387

b0ebdc0f

Apr 01, 2014

Add some additional fields to TTI::UnrollingPreferences · 6386cb8d

Hal Finkel authored Apr 01, 2014

In preparation for an upcoming commit implementing unrolling preferences for
x86, this adds additional fields to the UnrollingPreferences structure:

 - PartialThreshold and PartialOptSizeThreshold - Like Threshold and
   OptSizeThreshold, but used when not fully unrolling. These are necessary
   because we need different thresholds for full unrolling from those used when
   partially unrolling (the full unrolling thresholds are generally going to be
   larger).

 - MaxCount - A cap on the unrolling factor when partially unrolling. This can
   be used by a target to prevent the unrolled loop from exceeding some
   resource limit independent of the loop size (such as number of branches).

There should be no functionality change for any in-tree targets.

llvm-svn: 205347

6386cb8d

Move partial/runtime unrolling late in the pipeline · 86b3064f

Hal Finkel authored Mar 31, 2014

The generic (concatenation) loop unroller is currently placed early in the
standard optimization pipeline. This is a good place to perform full unrolling,
but not the right place to perform partial/runtime unrolling. However, most
targets don't enable partial/runtime unrolling, so this never mattered.

However, even some x86 cores benefit from partial/runtime unrolling of very
small loops, and follow-up commits will enable this. First, we need to move
partial/runtime unrolling late in the optimization pipeline (importantly, this
is after SLP and loop vectorization, as vectorization can drastically change
the size of a loop), while keeping the full unrolling where it is now. This
change does just that.

llvm-svn: 205264

86b3064f

Revert "SLPVectorizer: Ignore users that are insertelements we can reschedule them" · 15262e67

Arnold Schwaighofer authored Mar 31, 2014

This reverts commit r205018.

Conflicts:
	lib/Transforms/Vectorize/SLPVectorizer.cpp
	test/Transforms/SLPVectorizer/X86/insert-element-build-vector.ll

This is breaking libclc build.

llvm-svn: 205260

15262e67

Mar 30, 2014

Add a missing break. · 5e66a7e6

Rafael Espindola authored Mar 30, 2014

Patch by Tobias Güntner.

I tried to write a test, but the only difference is the Changed value that
gets returned. It can be tested with "opt -debug-pass=Executions -functionattrs,
but that doesn't seem worth it.

llvm-svn: 205121

5e66a7e6

Mar 29, 2014

ARM64: initial backend import · 00ed9964

Tim Northover authored Mar 29, 2014

This adds a second implementation of the AArch64 architecture to LLVM,
accessible in parallel via the "arm64" triple. The plan over the
coming weeks & months is to merge the two into a single backend,
during which time thorough code review should naturally occur.

Everything will be easier with the target in-tree though, hence this
commit.

llvm-svn: 205090

00ed9964

Mar 28, 2014
- SLPVectorizer: Take credit for free extractelement instructions · c9d58e8d
  Arnold Schwaighofer authored Mar 28, 2014
```
Extract element instructions that will be removed when vectorzing lower the
cost.

Patch by Arch D. Robison!

llvm-svn: 205020
```
  c9d58e8d
- SLPVectorizer: Fix typos · b0d3bcdd
  Arnold Schwaighofer authored Mar 28, 2014
```
Patch by Arch D. Robison!

llvm-svn: 205019
```
  b0d3bcdd
- SLPVectorizer: Ignore users that are insertelements we can reschedule them · b190cb30
  Arnold Schwaighofer authored Mar 28, 2014
```
Patch by Arch D. Robison!

llvm-svn: 205018
```
  b190cb30
- Revert "InstCombine: merge constants in both operands of icmp." · 5e1bac3a
  Erik Verbruggen authored Mar 28, 2014
```
This reverts commit r204912, and follow-up commit r204948.

This introduced a performance regression, and the fix is not completely
clear yet.

llvm-svn: 205010
```
  5e1bac3a
- Revert "GVN: merge overflow intrinsics with non-overflow instructions." · 2074ebd8
  Erik Verbruggen authored Mar 28, 2014
```
This reverts commit r203553, and follow-up commits r203558 and r203574.

I will follow this up on the mailinglist to do it in a way that won't
cause subtle PRE bugs.

llvm-svn: 205009
```
  2074ebd8
- C++11: convert verbose loops to range-based loops. · 79c8e8f0
  Adrian Prantl authored Mar 27, 2014
```
llvm-svn: 204981
```
  79c8e8f0
Mar 27, 2014

InstCombine: Don't combine constants on unsigned icmps · 3bdf9bc4

Reid Kleckner authored Mar 27, 2014

Fixes a miscompile introduced in r204912.  It would miscompile code like
(unsigned)(a + -49) <= 5U.  The transform would turn this into
(unsigned)a < 55U, which would return true for values in [0, 49], when
it should not.

llvm-svn: 204948

3bdf9bc4

Prevent alias from pointing to weak aliases. · 24a669d2

Rafael Espindola authored Mar 27, 2014

This adds back r204781.

Original message:

Aliases are just another name for a position in a file. As such, the
regular symbol resolutions are not applied. For example, given

define void @my_func() {
  ret void
}
@my_alias = alias weak void ()* @my_func
@my_alias2 = alias void ()* @my_alias

We produce without this patch:

        .weak   my_alias
my_alias = my_func
        .globl  my_alias2
my_alias2 = my_alias

That is, in the resulting ELF file my_alias, my_func and my_alias are
just 3 names pointing to offset 0 of .text. That is *not* the
semantics of IR linking. For example, linking in a

@my_alias = alias void ()* @other_func

would require the strong my_alias to override the weak one and
my_alias2 would end up pointing to other_func.

There is no way to represent that with aliases being just another
name, so the best solution seems to be to just disallow it, converting
a miscompile into an error.

llvm-svn: 204934

24a669d2

InstCombine: merge constants in both operands of icmp. · 59a12198

Erik Verbruggen authored Mar 27, 2014

Transform:
    icmp X+Cst2, Cst
into:
    icmp X, Cst-Cst2
when Cst-Cst2 does not overflow, and the add has nsw.

llvm-svn: 204912

59a12198

Treat lifetime.start'd memory like we treat freshly alloca'd memory. Patch by Björn Steinbrink! · 77d5fb40
Nick Lewycky authored Mar 26, 2014
```
llvm-svn: 204876
```
77d5fb40

Mar 26, 2014

CloneFunction: Clone all attributes, including the CC · 23798a97

Reid Kleckner authored Mar 26, 2014

Summary:
Tested with a unit test because we don't appear to have any transforms
that use this other than ASan, I think.

Fixes PR17935.

Reviewers: nicholas

CC: llvm-commits

Differential Revision: http://llvm-reviews.chandlerc.com/D3194

llvm-svn: 204866

23798a97