Commits · 45a9b282c2d9914ea11fb29e4cd053b53b991cdf · Roger Ferrer / llvm-epi

Oct 27, 2015
- Switch ownership of miscellaneous ARM target to myself. · 45a9b282
  Tim Northover authored Oct 26, 2015
```
llvm-svn: 251367
```
  45a9b282
Oct 26, 2015

[x86] Make the vselect-minmax test 2x to 3x faster by deleting all the · 5a14186b
Chandler Carruth authored Oct 26, 2015
```
instructions that aren't relevant for instruction selection of vector
min and max.

llvm-svn: 251366
```
5a14186b
Use Twin instead of std::to_string. · 6c2403f3
Oleksiy Vyalov authored Oct 26, 2015
```
http://reviews.llvm.org/D14095

llvm-svn: 251365
```
6c2403f3
Fix indents. It's a follow up to r251353. · 465fbe25
Ivan Krasin authored Oct 26, 2015
```
llvm-svn: 251364
```
465fbe25

[LLVMSymbolize] Don't use LLVMSymbolizer::Options in ModuleInfo. NFC. · 0fb6451a

Alexey Samsonov authored Oct 26, 2015

LLVMSymbolizer::Options is mostly used in LLVMSymbolizer class anyway.
Let's keep their usage restricted to that class, especially given that
it's worth to move ModuleInfo to a different header, independent from
the symbolizer class.

llvm-svn: 251363

0fb6451a

reorganize logic; NFCI (retry r251349) · e9b500f7

Sanjay Patel authored Oct 26, 2015

This is a preliminary step before adding another optimization
to PerformBITCASTCombine().

..and I really hope it's NFC this time!

llvm-svn: 251357

e9b500f7

Move imported entities into DwarfCompilationUnit to speed up LTO linking. · 298639a5

Ivan Krasin authored Oct 26, 2015

Summary:
In particular, this CL speeds up the official Chrome linking with LTO by
1.8x.

See more details in https://crbug.com/542426

Reviewers: dblaikie

Subscribers: jevinskie

Differential Revision: http://reviews.llvm.org/D13918

llvm-svn: 251353

298639a5

ARM: make sure VFP loads and stores are properly aligned. · 939f0892

Tim Northover authored Oct 26, 2015

Both VLDRS and VLDRD fault if the memory is not 4 byte aligned, which wasn't
really being checked before, leading to faults at runtime.

llvm-svn: 251352

939f0892

revert r251349; it included code for a functional change · f29fed42
Sanjay Patel authored Oct 26, 2015
```
llvm-svn: 251350
```
f29fed42

reorganize logic; NFCI · fdf75452

Sanjay Patel authored Oct 26, 2015

This is a preliminary step before adding another optimization
to PerformBITCASTCombine().

llvm-svn: 251349

fdf75452

Initialize BasicAAWrapperPass in it's constructor · 277bfaef

Keno Fischer authored Oct 26, 2015

Summary: This idiom is used elsewhere in LLVM, but was overlooked here.

Reviewers: chandlerc

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D13628

llvm-svn: 251348

277bfaef

Fix build failure on GCC 4.7 (old libstdc++ doesn't have std::map::emplace). · ff8a80b4
Alexey Samsonov authored Oct 26, 2015
```
llvm-svn: 251347
```
ff8a80b4
Remove use of std::map<>::emplace which is not supported on some older versions of libstdc++ · efbb2915
David Blaikie authored Oct 26, 2015
```
llvm-svn: 251346
```
efbb2915
Remove unused local variable. NFC. · e822b636
Diego Novillo authored Oct 26, 2015
```
llvm-svn: 251344
```
e822b636
Fix tests. · 56fff8d3
Peter Collingbourne authored Oct 26, 2015
```
llvm-svn: 251343
```
56fff8d3
ARM/ELF: Restore original (pre-r251322) logic for deciding whether to use GOT. · 99fac80d
Peter Collingbourne authored Oct 26, 2015
```
Unbreaks linking with gold, which cannot resolve direct relocations referring
to global symbols.

llvm-svn: 251342
```
99fac80d
[LLVMSymbolize] Use symbol table only if function linkage name was requested. · f3ecfd3a
Alexey Samsonov authored Oct 26, 2015
```
Now it's enough to just specify -functions=short without additionally
providing -use-symbol-table=false.

llvm-svn: 251339
```
f3ecfd3a
Fix build error by fully qualifying llvm::make_unique. · 1d3f3271
Alexey Samsonov authored Oct 26, 2015
```
llvm-svn: 251338
```
1d3f3271

Optimize StringTableBuilder. · df94852a

Rui Ueyama authored Oct 26, 2015

This is a patch to improve StringTableBuilder's performance. That class'
finalize function is very hot particularly in LLD because the function
does tail-merge strings in string tables or SHF_MERGE sections.

Generic std::sort-style sorter is not efficient for sorting strings.
The function implemented in this patch seems to be more efficient.

Here's a benchmark of LLD to link Clang with or without this patch.
The numbers are medians of 50 runs.

-O0
real 0m0.455s
real 0m0.430s (5.5% faster)

-O3
real 0m0.487s
real 0m0.452s (7.2% faster)

Since that is a benchmark of the whole linker, the speedup of
StringTableBuilder itself is much more than that.

http://reviews.llvm.org/D14053

llvm-svn: 251337

df94852a

[LLVMSymbolize] Use std::unique_ptr more extensively to clarify ownership. · 7a952e53
Alexey Samsonov authored Oct 26, 2015
```
llvm-svn: 251336
```
7a952e53

[RS4GC] Strip noalias attribute after statepoint rewrite · 1ef06559

Igor Laevsky authored Oct 26, 2015

We should remove noalias along with dereference and dereference_or_null attributes 
because statepoint could potentially touch the entire heap including noalias objects.

Differential Revision: http://reviews.llvm.org/D14032

llvm-svn: 251333

1ef06559

SamplePGO - Add optimization reports. · 7963ea19

Diego Novillo authored Oct 26, 2015

This adds a couple of optimization remarks to the SamplePGO
transformation. When it decides to inline a hot function (to mimic the
inline stack and repeat useful inline decisions in the original build).

It will also report branch destinations. For instance, given the code
fragment:

     6      if (i < 1000)
     7        sum -= i;
     8      else
     9        sum += -i * rand();

If the 'else' branch is taken most of the time, building this code with
-Rpass=sample-profile will produce:

a.cc:9:14: remark: most popular destination for conditional branches at small.cc:6:9 [-Rpass=sample-profile]
      sum += -i * rand();
             ^

llvm-svn: 251330

7963ea19

Remove assert(false) in favor of asserting the if conditional it is contained within. · 7b54b525
David Blaikie authored Oct 26, 2015
```
Also adjust the code to avoid 3 redundant map lookups.

llvm-svn: 251327
```
7b54b525
Move the canonical header to the top of its matching cpp file as per coding convention · 94c83370
David Blaikie authored Oct 26, 2015
```
This ensures that the header will be verified to be standalone (and
avoid mistakes like the one fixed in r251178)

llvm-svn: 251326
```
94c83370

Add an (optional) identification block in the bitcode · 5d303285

Mehdi Amini authored Oct 26, 2015

Processing bitcode from a different LLVM version can lead to
unexpected behavior. The LLVM project guarantees autoupdating
bitcode from a previous minor revision for the same major, but
can't make any promise when reading bitcode generated from a
either a non-released LLVM, a vendor toolchain, or a "future"
LLVM release. This patch aims at being more user-friendly and
allows a bitcode produce to emit an optional block at the
beginning of the bitcode that will contains an opaque string
intended to describe the bitcode producer information. The
bitcode reader will dump this information alongside any error it
reports.

The optional block also includes an "epoch" number, monotonically
increasing when incompatible changes are made to the bitcode. The
reader will reject bitcode whose epoch is different from the one
expected.

Differential Revision: http://reviews.llvm.org/D13666

From: Mehdi Amini <mehdi.amini@apple.com>
llvm-svn: 251325

5d303285

[safestack] Fast access to the unsafe stack pointer on AArch64/Android. · d1aad265

Evgeniy Stepanov authored Oct 26, 2015

Android libc provides a fixed TLS slot for the unsafe stack pointer,
and this change implements direct access to that slot on AArch64 via
__builtin_thread_pointer() + offset.

This change also moves more code into TargetLowering and its
target-specific subclasses to get rid of target-specific codegen
in SafeStackPass.

This change does not touch the ARM backend because ARM lowers
builting_thread_pointer as aeabi_read_tp, which is not available
on Android.

The previous iteration of this change was reverted in r250461. This
version leaves the generic, compiler-rt based implementation in
SafeStack.cpp instead of moving it to TargetLoweringBase in order to
allow testing without a TargetMachine.

llvm-svn: 251324

d1aad265

BitstreamWriter: Fix integer overflow. · 693499ce

Peter Collingbourne authored Oct 26, 2015

We were previously overflowing a 32-bit multiply operation when emitting large
(>512MB) bitcode files, resulting in corrupted bitcode. Fix by extending
one of the operands to 64 bits.

There are a few other 32-bit integer types in this code that seem like they
also ought to be extended to 64 bits; this will be done separately.

llvm-svn: 251323

693499ce

ARM/ELF: Better codegen for global variable addresses. · 97aae408

Peter Collingbourne authored Oct 26, 2015

In PIC mode we were previously computing global variable addresses (or GOT
entry addresses) by adding the PC, the PC-relative GOT displacement and
the GOT-relative symbol/GOT entry displacement. Because the latter two
displacements are fixed, we ended up performing one more addition than
necessary.

This change causes us to compute addresses using a single PC-relative
displacement, resulting in a shorter code sequence. This reduces code size
by about 4% in a recent build of Chromium for Android.

As a result of this change we no longer need to compute the GOT base address
in the ARM backend, which allows us to remove the Global Base Reg pass and
SDAG lowering for the GOT.

We also now no longer use the GOT when addressing a symbol which is known
to be defined in the same linkage unit. Specifically, the symbol must have
either hidden visibility or a strong definition in the current module in
order to not use the the GOT.

This is a change from the previous behaviour where we would use the GOT to
address externally visible symbols defined in the same module. I think the
only cases where this could matter are cases involving symbol interposition,
but we don't really support that well anyway.

Differential Revision: http://reviews.llvm.org/D13650

llvm-svn: 251322

97aae408

Cleanup test case debug info. NFC. · 5eb5ad09
Diego Novillo authored Oct 26, 2015
```
llvm-svn: 251320
```
5eb5ad09

Refactor: Simplify boolean conditional return statements in lib/Transforms/Instrumentation · 145b0fd2

Alexey Samsonov authored Oct 26, 2015

Summary: Use clang-tidy to simplify boolean conditional return statements.

Differential Revision: http://reviews.llvm.org/D9996

Patch by Richard (legalize@xmission.com)!

llvm-svn: 251318

145b0fd2

Check the case that the numerator and denominator are both zeros when getting... · fff8ccf5

Cong Hou authored Oct 26, 2015

Check the case that the numerator and denominator are both zeros when getting edge probabilities in BPI and return 100% in this case.

This issue is triggered in PGO mode when bootstrapping LLVM. It seems that it is not guaranteed that edge weights are always greater than zero which are read from profile data.

llvm-svn: 251317

fff8ccf5

Move parts of llvm-symbolizer tool into LLVMSymbolize library. · 57f8837a

Alexey Samsonov authored Oct 26, 2015

Summary: See http://lists.llvm.org/pipermail/llvm-dev/2015-October/091624.html

Reviewers: echristo

Subscribers: llvm-commits, aizatsky

Differential Revision: http://reviews.llvm.org/D13998

llvm-svn: 251316

57f8837a

[SystemZ] LTGFR use regclass should be GR32, not GR64. · 83553d0c
Jonas Paulsson authored Oct 26, 2015
```
Discovered by testing int-cmp-44.ll with -verify-machineinstrs (added to
test run).

llvm-svn: 251299
```
83553d0c
[SystemZ] Also clear kill flag for index reg in splitMove(). · 7da38208
Jonas Paulsson authored Oct 26, 2015
```
Discovered by running fp-move-05.ll with -verify-machineinstrs (added
to test case run).

llvm-svn: 251298
```
7da38208
[SystemZ] Don't forget the CC def op on LTEBRCompare pseudos · 9525b2c0
Jonas Paulsson authored Oct 26, 2015
```
Discovered by running fp-cmp-02.ll with -verify-machineinstrs (now added
to test run).

llvm-svn: 251297
```
9525b2c0

[SystemZ] Tie operands in SystemZShorteInst if MI becomes 2-address. · dab74072

Jonas Paulsson authored Oct 26, 2015

Discovered by testing fp-add-02.ll with -verify-machineinstrs.

Test case updated to always run with -verify-machineinstrs.

llvm-svn: 251296

dab74072

[mips] Check for the correct error message in tests for interrupt attributes. · 165121f3

Vasileios Kalintiris authored Oct 26, 2015

Instead of XFAIL-ing the tests with the wrong usage of the "interrupt"
attribute, we should check that we emit the correct error messages to
the user.

llvm-svn: 251295

165121f3

[ValueTracking] Extend r251146 to catch a fairly common case · 493e57de

James Molloy authored Oct 26, 2015

Even though we may not know the value of the shifter operand, it's possible we know the shifter operand is non-zero. This can allow us to infer more known bits - for example:

  %1 = load %p !range {1, 5}
  %2 = shl %q, %1

We don't know %1, but we do know that it is nonzero so %2[0] is known zero, and importantly %2 is known non-zero.

Calling isKnownNonZero is nontrivially expensive so use an Optional to run it lazily and cache its result.

llvm-svn: 251294

493e57de

[SCEV] Fix issues found during the review of r251283. NFC. · cb68efea

Silviu Baranga authored Oct 26, 2015

Summary:
Replace (const SCEVAddRecExpr *) with cast<SCEVAddRecExpr>.

Rename SCEVApplyRewriter to SCEVLoopAddRecRewriter (which is a more
appropriate name) since the description is "takes a scalar evolution
expression and applies the Map (Loop -> SCEV) to all AddRecExprs."

Subscribers: llvm-commits, sanjoy

Differential Revision: http://reviews.llvm.org/D14065

llvm-svn: 251292

cb68efea

Loop Vectorizer - skipping "bitcast" before GEP · 7a771493

Elena Demikhovsky authored Oct 26, 2015

Vectorization of memory instruction (Load/Store) is possible when the pointer is coming from GEP. The GEP analysis allows to estimate the profit.
In some cases we have a "bitcast" between GEP and memory instruction.
I added code that skips the "bitcast".

http://reviews.llvm.org/D13886

llvm-svn: 251291

7a771493