- Sep 27, 2016
-
-
Chris Bieneman authored
Summary: The previous output was confusing as it would output "Target triple: x86_64-unknown-linux-gnu" even when LLVM_HOST_TRIPLE or LLVM_DEFAULT_TARGET_TRIPLE were set on the CMake command line. Patch by: Alex Richardson! Reviewers: beanz Subscribers: Eugene.Zelenko Differential Revision: https://reviews.llvm.org/D17067 llvm-svn: 282516
-
Sanjoy Das authored
We can do this now thanks to C++11 lambdas. llvm-svn: 282515
-
Sanjoy Das authored
We don't need the extra generality here. llvm-svn: 282514
-
Sanjoy Das authored
llvm-svn: 282513
-
Sanjoy Das authored
Instead use the pre-existing `scope_exit` class. llvm-svn: 282512
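For context, a minimal sketch of how llvm::make_scope_exit (from llvm/ADT/ScopeExit.h) is typically used; the function and variable names below are illustrative, not the code touched by this commit:

    #include "llvm/ADT/ScopeExit.h"

    void processWithCleanup() {
      bool Active = true;
      // The lambda runs when Guard is destroyed, on every exit path,
      // replacing a hand-rolled RAII cleanup struct.
      auto Guard = llvm::make_scope_exit([&] { Active = false; });
      // ... work that may return early; Active is still reset on exit.
    }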
-
Sanjoy Das authored
I don't expect `PendingLoopPredicates` to have very many elements (e.g. when -O3'ing the sqlite3 amalgamation, `PendingLoopPredicates` has at most 3 elements). So now we use a `SmallPtrSet` for it instead of the more heavyweight `DenseSet`. llvm-svn: 282511
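A small illustrative sketch of the container choice described above (the element type and inline-size hint are placeholders, not the actual ScalarEvolution field):

    #include "llvm/ADT/SmallPtrSet.h"

    struct Predicate; // stand-in for the real predicate type

    // A SmallPtrSet keeps up to N elements inline with no heap allocation,
    // which is cheaper than a DenseSet when the set stays tiny (here, ~3).
    llvm::SmallPtrSet<const Predicate *, 4> PendingLoopPredicates;

    bool markPending(const Predicate *P) {
      // insert() returns {iterator, bool}; the bool is true if newly added.
      return PendingLoopPredicates.insert(P).second;
    }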
-
Chris Bieneman authored
NFC. This is just a little code cleanup to make things easier to read and understand. llvm-svn: 282510
-
Keith Walker authored
Variables are sometimes missing their debug location information in blocks in which the variables should be available. This would occur when one or more predecessor blocks had not yet been visited by the routine which propagated the information from predecessor blocks. This is addressed by only considering predecessor blocks which have already been visited. The solution to this problem was suggested by Daniel Berlin on the LLVM developer mailing list. Differential Revision: https://reviews.llvm.org/D24927 llvm-svn: 282506
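A hedged sketch of the idea, not the actual LiveDebugValues code: when joining variable-location information at a block, predecessors that have not been visited yet are skipped so they cannot spuriously drop locations that are in fact available:

    #include <set>
    #include <vector>

    struct Block {
      std::vector<const Block *> Preds;
      bool Visited = false;
      std::set<int> AvailableVars; // stand-in for variable locations
    };

    // Intersect available locations over *visited* predecessors only.
    std::set<int> joinPredecessors(const Block &B) {
      std::set<int> Result;
      bool First = true;
      for (const Block *P : B.Preds) {
        if (!P->Visited)
          continue; // an unvisited block would wrongly report nothing available
        if (First) {
          Result = P->AvailableVars;
          First = false;
        } else {
          std::set<int> Tmp;
          for (int V : Result)
            if (P->AvailableVars.count(V))
              Tmp.insert(V);
          Result = std::move(Tmp);
        }
      }
      return Result;
    }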
-
Adam Nemet authored
This reverts commit r282499. The GCC bots are failing llvm-svn: 282503
-
Zachary Turner authored
llvm::join_items is similar to llvm::join, which produces a string by concatenating a sequence of values together, separated by a given separator. It differs in that the arguments to llvm::join() are same-type members of a container, whereas the arguments to llvm::join_items are arbitrary types passed into a variadic template. The only requirement on parameters to llvm::join_items (including the separator itself) is that they be implicitly convertible to std::string or have an overload of std::string::operator+. Differential Revision: https://reviews.llvm.org/D24880 llvm-svn: 282502
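A usage sketch based on the description above; the exact header location and accepted argument types depend on the final form of the patch, so treat the call below as illustrative:

    #include "llvm/ADT/StringExtras.h"
    #include <string>

    std::string example() {
      std::string Dir = "usr";
      // join_items takes the separator first, then arbitrarily many items of
      // possibly different types, as long as each can be appended to a
      // std::string.
      return llvm::join_items("/", Dir, "local", std::string("bin"));
      // -> "usr/local/bin"
    }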
-
Daniel Dunbar authored
llvm-svn: 282501
-
Adam Nemet authored
This allows various presentations of this data using an external tool. This was first recommended here [1].

As an example, consider this module:

    1 int foo();
    2 int bar();
    3
    4 int baz() {
    5   return foo() + bar();
    6 }

The inliner generates these missed-optimization remarks today (the hotness information is pulled from PGO):

    remark: /tmp/s.c:5:10: foo will not be inlined into baz (hotness: 30)
    remark: /tmp/s.c:5:18: bar will not be inlined into baz (hotness: 30)

Now with -pass-remarks-output=<yaml-file>, we generate this YAML file:

    --- !Missed
    Pass:     inline
    Name:     NotInlined
    DebugLoc: { File: /tmp/s.c, Line: 5, Column: 10 }
    Function: baz
    Hotness:  30
    Args:
      - Callee: foo
      - String: will not be inlined into
      - Caller: baz
    ...
    --- !Missed
    Pass:     inline
    Name:     NotInlined
    DebugLoc: { File: /tmp/s.c, Line: 5, Column: 18 }
    Function: baz
    Hotness:  30
    Args:
      - Callee: bar
      - String: will not be inlined into
      - Caller: baz
    ...

This is a summary of the high-level decisions:

* There is a new streaming interface to emit optimization remarks, e.g. for the inliner remark above:

      ORE.emit(DiagnosticInfoOptimizationRemarkMissed(DEBUG_TYPE, "NotInlined", &I)
               << NV("Callee", Callee) << " will not be inlined into "
               << NV("Caller", CS.getCaller()) << setIsVerbose());

  NV stands for "named value" and allows the YAML client to process a remark using its name (NotInlined) and the named arguments (Callee and Caller) without parsing the text of the message. Subsequent patches will update ORE users to use the new streaming API.

* I am using YAML I/O for writing the YAML file. YAML I/O requires you to specify reading and writing at once, but reading is highly non-trivial for some of the more complex LLVM types. Since it's not clear that we (ever) want to use LLVM to parse this YAML file, the code supports and asserts that we're writing only. On the other hand, I did experiment with mapping the class hierarchy starting at DiagnosticInfoOptimizationBase back from the YAML generated here (see D24479).

* The YAML stream is stored in the LLVM context.

* In the example, we can probably further specify the IR value used, i.e. print "Function" rather than "Value".

* As before, hotness is computed in the analysis pass instead of DiagnosticInfo. This avoids the layering problem since BFI is in Analysis while DiagnosticInfo is in IR.

[1] https://reviews.llvm.org/D19678#419445

Differential Revision: https://reviews.llvm.org/D24587

llvm-svn: 282499
-
Adam Nemet authored
llvm-svn: 282498
-
Manuel Klimek authored
Patch by Eitan Adler. llvm-svn: 282494
-
Rafael Espindola authored
It will be used for fast fingerprinting in lld at least. llvm-svn: 282493
-
Alexander Kornienko authored
llvm-svn: 282490
-
Konstantin Zhuravlyov authored
subtarget. This is a prerequisite for the coming waitcnt changes. Differential Revision: https://reviews.llvm.org/D24939 llvm-svn: 282489
-
Simon Dardis authored
Disable tail calls while the remaining bugs are fixed. Enable only for tests. Reviewers: vkalintiris Differential Review: https://reviews.llvm.org/D24912 llvm-svn: 282487
-
Simon Dardis authored
Add rsqrt.[ds], recip.[ds] for MIPS. Correct the microMIPS definitions for architecture support and register usage. Reviewers: vkalintiris, zoran.jovanoic Differential Review: https://reviews.llvm.org/D24499 llvm-svn: 282485
-
Andrey Bokhanko authored
This patch updates WritingAnLLVMPass.rst to bring it in line with the current state of things. Specifically:
  * Makefile instructions replaced with CMake ones
  * Filenames replaced with correct ones
  * Example reformatted a bit to make it less confusing and more conforming to the LLVM Coding Standards
  * opt tool output updated with what it actually prints nowadays
  * "gcse" (which doesn't exist anymore) replaced with "gvn" (which still does)
Differential Revision: https://reviews.llvm.org/D24233 llvm-svn: 282482
-
Dimitar Vlahovski authored
llvm-svn: 282479
-
Nemanja Ivanovic authored
This patch corresponds to review: https://reviews.llvm.org/D24396 This patch adds support for the "vector count trailing zeroes", "vector compare not equal" and "vector compare not equal or zero instructions" as well as "scalar count trailing zeroes" instructions. It also changes the vector negation to use XXLNOR (when VSX is enabled) so as not to increase register pressure (previously this was done with a splat immediate of all ones followed by an XXLXOR). This was done because the altivec.h builtins (patch to follow) use vector negation and the use of an additional register for the splat immediate is not optimal. llvm-svn: 282478
-
Craig Topper authored
llvm-svn: 282473
-
Craig Topper authored
llvm-svn: 282472
-
Craig Topper authored
[X86] Use std::max to calculate alignment instead of assuming RC->getSize() will not return a value greater than 32. I think it theoretically could be 64 for AVX-512. llvm-svn: 282471
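A generic illustration of the pattern (not the actual X86InstrInfo code): take the larger of two alignment requirements rather than assuming one never exceeds the other:

    #include <algorithm>
    #include <cstdint>

    // Hedged sketch: AVX-512 ZMM registers are 64 bytes wide, so assuming a
    // 32-byte cap on the register-class size could under-align spill slots;
    // std::max avoids baking in that assumption.
    uint64_t chooseAlignment(uint64_t RegClassSize, uint64_t BaseAlign) {
      return std::max(RegClassSize, BaseAlign);
    }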
-
Kostya Serebryany authored
llvm-svn: 282469
-
Kostya Serebryany authored
llvm-svn: 282467
-
Kostya Serebryany authored
llvm-svn: 282465
-
Ivan Krasin authored
Summary: We don't currently need this facility for CFI. Disabling individual hot methods proved to be a better strategy in Chrome. Also, the design of the feature is suboptimal, as pointed out by Peter Collingbourne. Reviewers: pcc Subscribers: kcc Differential Revision: https://reviews.llvm.org/D24948 llvm-svn: 282461
-
Kostya Serebryany authored
llvm-svn: 282460
-
Kostya Serebryany authored
[libFuzzer] add -exit_on_src_pos to test libFuzzer itself, add a test script for RE2 that uses this flag llvm-svn: 282458
-
Peter Collingbourne authored
llvm-svn: 282456
-
Peter Collingbourne authored
LowerTypeTests: Create LowerTypeTestsModule class and move implementation there. Related simplifications. llvm-svn: 282455
-
Daniel Dunbar authored
- This is primarily useful as a "fail fast" mode for lit, where it will stop running tests after the first failure. - Patch by Max Moiseev. llvm-svn: 282452
-
Davide Italiano authored
PR: 30494 llvm-svn: 282451
-
Davide Italiano authored
llvm-svn: 282450
-
- Sep 26, 2016
-
-
Derek Schuff authored
When we have dynamic allocas we have a frame pointer, and when we're lowering frame indexes we should make sure we use it. Patch by Jacob Gravelle Differential Revision: https://reviews.llvm.org/D24889 llvm-svn: 282442
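A hedged sketch of the rule described above, not the actual WebAssembly lowering code: with dynamic allocas the stack pointer moves at run time, so frame indexes must be lowered against the frame pointer:

    enum BaseReg { StackPointer, FramePointer }; // illustrative stand-ins

    // With dynamic allocas the offset from SP is unknown at compile time,
    // so the frame pointer is the only stable base for frame indexes.
    BaseReg frameIndexBase(bool HasDynamicAllocas) {
      return HasDynamicAllocas ? FramePointer : StackPointer;
    }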
-
Kevin Enderby authored
other load commands that use the Mach::linkedit_data_command type that are not used in the llvm libObject code but are used in llvm tool code. This includes the LC_FUNCTION_STARTS, LC_SEGMENT_SPLIT_INFO and LC_DYLIB_CODE_SIGN_DRS load commands. llvm-svn: 282441
-
Aditya Kumar authored
Reviewers: rafael spatel Differential Revision: https://reviews.llvm.org/D24843 llvm-svn: 282440
-
Piotr Padlewski authored
Summary: This patch improves the ThinLTO importer by importing functions up to 3x larger when they are called from a hot block.

I compared performance with trunk on SPEC, and there were improvements of about 2% on povray and 3.33% on milc. These results seem to be consistent and match the results Teresa got with her simple heuristic. Some benchmarks got slower, but I think they are just noisy (mcf, xalancbmk, omnetpp); I'm running the benchmarks again with more iterations to confirm. The geomean of all benchmarks, including the noisy ones, was about +0.02%.

I see a much better improvement on the Google branch with Easwaran's patch for PGO callsite inlining (the inliner actually inlines those big functions). Overall I see a +0.5% improvement, and I get +8.65% on povray. So I guess we will see a much bigger change when Easwaran's patch lands (it depends on the new pass manager), but it is still worth putting this in trunk before it.

Implementation details changes:
  - Removed CallsiteCount.
  - ProfileCount got replaced by Hotness.
  - hot-import-multiplier is set to 3.0 for now; I didn't have time to tune it, but I see that we get most of the interesting functions with 3, so there is not much performance difference with higher values, and binary size doesn't grow as much as with 10.0.

Reviewers: eraman, mehdi_amini, tejohnson Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D24638 llvm-svn: 282437
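A minimal sketch of the threshold scaling this summary describes; the function name and the exact place where the scaling happens are assumptions, the real logic lives in the ThinLTO function importer:

    // -hot-import-multiplier defaults to 3.0 per the summary above, so a
    // callee reached from a hot block may be up to 3x larger than the base
    // import threshold allows and still be imported.
    unsigned adjustImportThreshold(unsigned BaseThreshold, bool CallerBlockIsHot,
                                   double HotMultiplier = 3.0) {
      return CallerBlockIsHot
                 ? static_cast<unsigned>(BaseThreshold * HotMultiplier)
                 : BaseThreshold;
    }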
-