Commits · f74cc40e34f5cf2b42e94b0c2e13d89d7fce14e0 · Lorenzo Albano / LLVM bpEVL

Sep 28, 2015

Improve performance of SimplifyInstructionsInBlock · f74cc40e

Fiona Glaser authored Sep 28, 2015

1. Use a worklist, not a recursive approach, to avoid needless
   revisitation and being repeatedly forced to jump back to the
   start of the BB if a handle is invalidated.

2. Only insert operands to the worklist if they become unused
   after a dead instruction is removed, so we don’t have to
   visit them again in most cases.

3. Use a SmallSetVector to track the worklist.

4. Instead of pre-initting the SmallSetVector like in
   DeadCodeEliminationPass, only put things into the worklist
   if they have to be revisited after the first run-through.
   This minimizes how much the actual SmallSetVector gets used,
   which saves a lot of time.
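
The four points above can be sketched on a toy model. This is a minimal illustration, not the code from this commit: `ToyInst`, `ToyWorklist`, `countUses`, `eraseIfDead`, and `removeDeadInstructions` are hypothetical names, `ToyWorklist` only stands in for SmallSetVector, and the sketch covers dead-instruction removal only, not simplification.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_set>
#include <vector>

// Toy instruction: operands are indices of earlier instructions in the block.
struct ToyInst {
  std::vector<std::size_t> operands;
  bool hasSideEffects = false;
  int numUses = 0;      // filled in by countUses()
  bool erased = false;
};

// Hypothetical stand-in for SmallSetVector: ordered and free of duplicates.
class ToyWorklist {
  std::vector<std::size_t> order;
  std::unordered_set<std::size_t> seen;

public:
  void insert(std::size_t i) {
    if (seen.insert(i).second)
      order.push_back(i);
  }
  bool empty() const { return order.empty(); }
  std::size_t pop() {
    assert(!order.empty() && "pop() on an empty worklist");
    std::size_t i = order.back();
    order.pop_back();
    seen.erase(i);
    return i;
  }
};

// Recompute use counts from the operand lists of all live instructions.
void countUses(std::vector<ToyInst> &block) {
  for (ToyInst &inst : block)
    inst.numUses = 0;
  for (const ToyInst &inst : block)
    if (!inst.erased)
      for (std::size_t op : inst.operands)
        ++block[op].numUses;
}

// Erase instruction i if it is dead. Only operands whose last use just
// disappeared are queued for another visit (point 2).
static bool eraseIfDead(std::vector<ToyInst> &block, std::size_t i,
                        ToyWorklist &worklist) {
  ToyInst &inst = block[i];
  if (inst.erased || inst.hasSideEffects || inst.numUses != 0)
    return false;
  inst.erased = true;
  for (std::size_t op : inst.operands)
    if (--block[op].numUses == 0)
      worklist.insert(op);
  return true;
}

// One linear pass over the block, then drain the worklist. The worklist is
// never pre-filled (point 4); it stays empty unless something must be revisited.
std::size_t removeDeadInstructions(std::vector<ToyInst> &block) {
  countUses(block);
  ToyWorklist worklist;
  std::size_t removed = 0;
  for (std::size_t i = 0; i < block.size(); ++i)
    removed += eraseIfDead(block, i, worklist);
  while (!worklist.empty())
    removed += eraseIfDead(block, worklist.pop(), worklist);
  return removed;
}
```

In this model an operand always precedes its users, so by the time a use dies the operand has already been passed by the linear scan; the worklist is the only way it gets a second look.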

llvm-svn: 248727

f74cc40e

Add support for local absolute symbols. · dfc7200b
Rafael Espindola authored Sep 28, 2015
```
llvm-svn: 248726
```
dfc7200b

[mips][p5600] Added P5600 processor and initial scheduler. · 7727e109

Daniel Sanders authored Sep 28, 2015

Summary:
The P5600 is an out-of-order, superscalar implementation of the MIPS32R5
architecture.

The scheduler has a few missing details (see the 'Tricky Instructions'
section) and some quirks of the P5600 are deliberately omitted due to
implementation difficulty and low chance of significant benefit (e.g. the
predicate on P5600WriteEitherALU). However, testing on SingleSource is
showing significant performance benefits on some apps (seven in the 10-30%
range) and only one significant regression (12%) when
-pre-RA-sched=linearize is given. Without -pre-RA-sched=linearize the
results are more variable. Some do even better (up to 55% improvement) but
increased numbers of copies are slowing others down (up to 12%).

Overall, the scheduler as it currently stands is a 2.4% win with
-pre-RA-sched=linearize and a 2.7% win without -pre-RA-sched=linearize.
I'm sure we can improve on this further.

For completeness, the FPGA this was tested on shows some failures with and
without the P5600 scheduler. These appear to be scheduling related since
the two test runs have fairly different sets of failing tests even after
accounting for other factors (e.g. spurious connection failures). However,
it's not P5600-specific, since we also get some for the generic scheduler.

Reviewers: vkalintiris

Subscribers: mpf, llvm-commits, atrick, vkalintiris

Differential Revision: http://reviews.llvm.org/D12193

llvm-svn: 248725

7727e109

ELF2: Include file names in error messages. · c5e22d90
Rui Ueyama authored Sep 28, 2015
```
llvm-svn: 248724
```
c5e22d90

[clang-tidy] add option to specify build path · 68b59107

Guillaume Papin authored Sep 28, 2015

Summary:
compile_commands.json is usually generated in the build directory.
Projects like LLVM/Clang enforce out-of-source builds.
This option allows such projects to work out of the box, without
moving the compilation database manually.

The naming of the option is similar to the one used by other tools:

    clang-{check,modernize,query,rename,tidy} -p=<build_path> <...>

Reviewers: alexfh

Differential Revision: http://reviews.llvm.org/D13199

llvm-svn: 248723

68b59107

Bind listener to 127.0.0.1 to make sure that loopback address is used. · 36c42b59
Oleksiy Vyalov authored Sep 28, 2015
```
llvm-svn: 248722
```
36c42b59
Introduce !align metadata for load instruction · b4d00904
Artur Pilipenko authored Sep 28, 2015
```
Reviewed By: hfinkel

Differential Revision: http://reviews.llvm.org/D12853

llvm-svn: 248721
```
b4d00904
[CMake] [Darwin] Make darwin_filter_builtin_sources support both whitelist and blacklist filtering. · 8aba4b05
Chris Bieneman authored Sep 28, 2015
```
llvm-svn: 248720
```
8aba4b05

[InstSimplify] Fold simple known implications to true · 13f023c0

Philip Reames authored Sep 28, 2015

This was split off of http://reviews.llvm.org/D13040 to make it easier to test the correctness of the implication logic. For the moment, this only handles a single easy case which shows up when eliminating and combining range checks. In the (near) future, I plan to extend this for other cases which show up in range checks, but I wanted to make those changes incrementally once the framework was in place.

At the moment, the implication logic will be used in three places: one in InstSimplify (this review) and two in SimplifyCFG (http://reviews.llvm.org/D13040 & http://reviews.llvm.org/D13070). Can anyone think of other locations where this style of reasoning would make sense?

Differential Revision: http://reviews.llvm.org/D13074

llvm-svn: 248719

13f023c0

[LoopReroll] Ignore debug intrinsics · 310770a9

Weiming Zhao authored Sep 28, 2015

Previously, debug intrinsics and annotation intrinsics could prevent
the loop from being rerolled; now they are ignored.

Differential Revision: http://reviews.llvm.org/D13150

llvm-svn: 248718

310770a9

OpenMP: Name addresses in subfunction structure · 95e59aaa

Tobias Grosser authored Sep 28, 2015

When debugging, this makes it easier to see which memory reference
caused these stores to be introduced.

llvm-svn: 248717

95e59aaa

[WebAssembly] Support for direct call and call_indirect. · 05a17aa8
Dan Gohman authored Sep 28, 2015
```
llvm-svn: 248716
```
05a17aa8

[ELF2] Add --sysroot command line switch · 1309fc03

Igor Kudrin authored Sep 28, 2015

Reviewers: rafael, ruiu

Subscribers: llvm-commits

Projects: #lld

Differential Revision: http://reviews.llvm.org/D13209

llvm-svn: 248715

1309fc03

clang-format: [JS] Support pseudo-keywords · ba52fcb7

Daniel Jasper authored Sep 28, 2015

JavaScript allows keywords to appear in IdentifierName positions, e.g.
fields, or object literal members, but not as plain identifiers.

Patch by Martin Probst. Thank you!

llvm-svn: 248714

ba52fcb7

clang-format: [JS] handle let (ES6) · 9f642f7d
Daniel Jasper authored Sep 28, 2015
```
Patch by Martin Probst. Thank you!

llvm-svn: 248713
```
9f642f7d

BlockGenerator: Generate synthesisable instructions only on-demand · 28b9a14b

Tobias Grosser authored Sep 28, 2015



Instructions which we can synthesize from a SCEV expression are not generated
directly, but only when they are used as an operand of another instruction. This
avoids generating unnecessary instructions and works more reliably than first
inserting them and then deleting them later on.

Suggested-by: Johannes Doerfert <doerfert@cs.uni-saarland.de>

Differential Revision: http://reviews.llvm.org/D13208

llvm-svn: 248712

28b9a14b

Remove XTIMEOUT from TestProcessAttach on linux · 8ff61200
Pavel Labath authored Sep 28, 2015
```
llvm-svn: 248711
```
8ff61200

Install clang-query by default. · 55dc5df5

Manuel Klimek authored Sep 28, 2015

It is already installed by the autotools build, and it is useful for
developers who are not working on LLVM/Clang itself.

llvm-svn: 248710

55dc5df5

Trying to fix the windows build. · eb990af3
Rafael Espindola authored Sep 28, 2015
```
llvm-svn: 248709
```
eb990af3
Add support for -L and -l command line switches. · abb7b286
Rafael Espindola authored Sep 28, 2015
```
Patch by Igor Kudrin!

llvm-svn: 248708
```
abb7b286
Enable the aarch64 tests. · 83af95d9
Rafael Espindola authored Sep 28, 2015
```
llvm-svn: 248707
```
83af95d9
[mips] Handling of immediates bigger than 16 bits · cdb64566
Zoran Jovanovic authored Sep 28, 2015
```
Differential Revision: http://reviews.llvm.org/D10539

llvm-svn: 248706
```
cdb64566
Improve comments related to MemoryAccess::MemoryOrigin; NFC · 5c0f97d5
Michael Kruse authored Sep 28, 2015
```
llvm-svn: 248705
```
5c0f97d5
[NFC] Add accidentally removed comment line · 58a7c75c
Johannes Doerfert authored Sep 28, 2015
```
llvm-svn: 248704
```
58a7c75c

[ARM] Avoid redundant checks for isThumb1Only() after supportsTailCall() · ad8a0638

Artyom Skrobov authored Sep 28, 2015

supportsTailCall() has two callers. Both of them double-check isThumb1Only(),
and refuse to proceed with tail-calling in that case.
Therefore, it makes sense to move this check to
ARMSubtarget::initSubtargetFeatures, where SupportsTailCall is initialized;
and to eliminate the extra checks at the call sites.

Following a review comment, added an "assert(supportsTailCall())"
in IsEligibleForTailCall.
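
The shape of this refactoring can be sketched as follows. It is a simplified model, not the ARM backend code: `ToySubtarget`, its `baseSupport` parameter, `isEligibleForTailCall`, and `shouldEmitTailCall` are hypothetical stand-ins.

```cpp
#include <cassert>

// Toy subtarget: the Thumb1 restriction is folded into the flag once, at
// initialization, instead of being re-checked by every caller.
class ToySubtarget {
  bool thumb1Only;
  bool tailCallSupported;

public:
  // `baseSupport` stands in for whatever other conditions the real
  // initialization applies; it is an assumption of this sketch.
  ToySubtarget(bool isThumb1Only, bool baseSupport)
      : thumb1Only(isThumb1Only),
        tailCallSupported(baseSupport && !isThumb1Only) {}

  bool isThumb1Only() const { return thumb1Only; }
  bool supportsTailCall() const { return tailCallSupported; }
};

// The eligibility check relies on the caller having checked
// supportsTailCall(), and documents that with an assert.
bool isEligibleForTailCall(const ToySubtarget &subtarget,
                           bool conventionsCompatible) {
  assert(subtarget.supportsTailCall() &&
         "callers must check supportsTailCall() first");
  return conventionsCompatible;
}

// A call site no longer needs its own isThumb1Only() test.
bool shouldEmitTailCall(const ToySubtarget &subtarget,
                        bool conventionsCompatible) {
  if (!subtarget.supportsTailCall())
    return false;
  return isEligibleForTailCall(subtarget, conventionsCompatible);
}
```

Because the flag already accounts for Thumb1, behaviour at the call sites is unchanged, which is what makes the change NFC.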

NFC.

llvm-svn: 248703

ad8a0638

Revert "Fix race condition during process detach" · c8c77d46

Pavel Labath authored Sep 28, 2015

This fix is not correct on its own until D12968 is resolved. Will resubmit once that is done.

llvm-svn: 248702

c8c77d46

Allow switch instructions in SCoPs · 9a132f36

Johannes Doerfert authored Sep 28, 2015

  This patch allows switch instructions with affine conditions in the
  SCoP. Also switch instructions in non-affine subregions are allowed.
  Neither required many changes to the code, though there was some
  refactoring needed to integrate them without code duplication.

  In the llvm-test suite the number of profitable SCoPs increased from
  135 to 139 but more importantly we can handle more benchmarks and user
  inputs without preprocessing.

Differential Revision: http://reviews.llvm.org/D13200

llvm-svn: 248701

9a132f36

[clang-tidy] Code factorization and cleanup in IdentifierNamingCheck · 3d77768e

Alexander Kornienko authored Sep 28, 2015

This is to level the ground a little bit, in preparation for the changes in http://reviews.llvm.org/D13081.

Code factorization replaces all insertions to the NamingCheckFailures map with a single addUsage function that does the job.
There is also no longer any difference between the declaration of and the references to a given identifier; both cases are treated as ranges in the Usage vector. There is also a check to avoid inserting duplicate ranges, which sometimes triggered erroneous replacements.

References can now also be added before the declaration of the identifier is actually found; this looks to be the case for example when a templated class uses its parameters to specialize its templated base class.

Patch by Beren Minor!

Differential revision: http://reviews.llvm.org/D13079

llvm-svn: 248700

3d77768e

[clang-tidy] Removed a stray empty line in the docs. · 501e6cd2
Alexander Kornienko authored Sep 28, 2015
```
llvm-svn: 248699
```
501e6cd2

[DAGCombine] Fix getStoreMergeAndAliasCandidates's AA-enabled chain walking · bd582581

Hal Finkel authored Sep 28, 2015

When AA is being used, non-aliasing stores are canonicalized to use the same
chain, and DAGCombiner::getStoreMergeAndAliasCandidates can take advantage of
this by looking only at users of a store's chain operand. However, user
iteration is not result-number specific, so we need to check that the use is as a
chain operand, and not via some other operand. It is certainly possible to have
another potentially-aliasing store, which shares the first's base pointer, and
uses the first's chain's node via some other operand.

Failure to catch this situation caused, at least in the included test case, an
assert later because the relative sequence-number ordering caused later
replacement to create a cycle in the DAG.
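
The check described above can be illustrated on a toy model. This is a simplified sketch, not the DAGCombiner code: `ToyStoreNode`, `usersOf`, and `storesSharingChain` are hypothetical, and each node here has a single result, whereas real SelectionDAG nodes can have several.

```cpp
#include <cassert>
#include <vector>

// Toy store node; each field holds the id of the node feeding that operand.
struct ToyStoreNode {
  int id;
  int chainOperand;
  int valueOperand;
  int pointerOperand;
};

// Mimics a use list: every store that references `node` through ANY operand.
std::vector<const ToyStoreNode *>
usersOf(const std::vector<ToyStoreNode> &stores, int node) {
  std::vector<const ToyStoreNode *> users;
  for (const ToyStoreNode &s : stores)
    if (s.chainOperand == node || s.valueOperand == node ||
        s.pointerOperand == node)
      users.push_back(&s);
  return users;
}

// Keeps only the users that reach `chain` through their chain operand.
std::vector<const ToyStoreNode *>
storesSharingChain(const std::vector<ToyStoreNode> &stores, int chain) {
  std::vector<const ToyStoreNode *> candidates;
  for (const ToyStoreNode *user : usersOf(stores, chain)) {
    assert(user != nullptr && "use list entries are never null");
    if (user->chainOperand == chain) // the extra check the message describes
      candidates.push_back(user);
  }
  return candidates;
}
```

With stores `{1, 10, 30, 20}` and `{2, 11, 10, 20}`, both are users of node 10, but only store 1 uses it as its chain; store 2 shares the base pointer and reaches node 10 through its value operand, so it must not be treated as being on the same chain.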

llvm-svn: 248698

bd582581

[tests] Add memory writes to make this scop not trivially empty · f223cdf1
Tobias Grosser authored Sep 28, 2015
```
llvm-svn: 248697
```
f223cdf1

[OPENMP 4.1] Add 'simd' clause for 'ordered' directive. · d14d1e6f

Alexey Bataev authored Sep 28, 2015

Parsing and sema analysis for 'simd' clause in 'ordered' directive.

Description:
If the simd clause is specified, the ordered regions encountered by any thread
will use only a single SIMD lane to execute the ordered regions in the order of
the loop iterations.

Restrictions:
An ordered construct with the simd clause is the only OpenMP construct that can
appear in the simd region.
llvm-svn: 248696

d14d1e6f

Remove obsolete check · f32f5f23

Johannes Doerfert authored Sep 28, 2015

  This check was needed at some point but no longer seems useful. Only
  one adjustment in the domain generation was needed to cope with the
  cases this check previously prevented.

llvm-svn: 248695

f32f5f23

[NFC] Remove unused SCoP diagnostic · 91ad092b
Johannes Doerfert authored Sep 28, 2015
```
llvm-svn: 248694
```
91ad092b
Remove 'const' from some ArrayRefs. ArrayRefs are already immutable. NFC · 862d5d83
Craig Topper authored Sep 28, 2015
```
llvm-svn: 248693
```
862d5d83

AsmWriter: Print the argument names in declarations while debugging · d7d1a72f

Justin Bogner authored Sep 27, 2015

When llvm declarations have argument names, it's helpful to actually
print those names when debugging. Arguably, it'd be nice to print them
all the time, but that would mean the IR we output wouldn't round trip
through bitcode, which doesn't store the names.

Make the various print() methods in AsmWriter optionally print "for
debug" and set that flag in the dump() methods. The only thing this
does differently for now is print the argument names in declarations.
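
The pattern can be sketched like this. It is a toy model rather than the AsmWriter API: `ToyDeclaration`, its fields, the `isForDebug` parameter name, and `dumpToString` are all assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <ostream>
#include <sstream>
#include <string>
#include <vector>

// Toy function declaration with optional argument names.
struct ToyDeclaration {
  std::string name;
  std::vector<std::string> argTypes;
  std::vector<std::string> argNames; // an empty string means "unnamed"

  // Regular printing omits argument names; debug printing includes them.
  void print(std::ostream &os, bool isForDebug = false) const {
    assert(argNames.size() == argTypes.size() &&
           "one (possibly empty) name per argument");
    os << "declare void @" << name << "(";
    for (std::size_t i = 0; i < argTypes.size(); ++i) {
      if (i != 0)
        os << ", ";
      os << argTypes[i];
      if (isForDebug && !argNames[i].empty())
        os << " %" << argNames[i];
    }
    os << ")";
  }

  // Plays the role of dump(): always prints "for debug". It returns a
  // string instead of writing to stderr so the result is easy to inspect.
  std::string dumpToString() const {
    std::ostringstream os;
    print(os, /*isForDebug=*/true);
    return os.str();
  }
};
```

Keeping the flag off by default leaves the regular textual output unchanged, so it still matches what survives a round trip through a format that drops the names.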

llvm-svn: 248692

d7d1a72f

Sep 27, 2015

Silence clang warning: variable ‘Status’ set but not used. · e5a9dc2f
Yaron Keren authored Sep 27, 2015
```
llvm-svn: 248691
```
e5a9dc2f

[SCEV] identical instructions don't compute equal values · f1090b60

Sanjoy Das authored Sep 27, 2015

Before this change `HasSameValue` would return true for distinct
`alloca` instructions if they happened to be allocating the same
type (`alloca` instructions are not specified as reading memory).  This
change adds an explicit whitelist of instruction types for which
"identical" instructions compute the same value.
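
A toy version of the whitelist idea, for illustration only: `ToyOpcode`, `ToyValue`, `isIdentical`, and `computesEqualValueWhenIdentical` are hypothetical, and the exact set of whitelisted opcodes below is an assumption of this sketch, not taken from the patch.

```cpp
#include <cassert>
#include <vector>

enum class ToyOpcode { Add, Mul, GetElementPtr, Alloca, Call };

struct ToyValue {
  ToyOpcode opcode;
  std::vector<int> operandIds;
};

// Structural identity: same opcode and same operands.
bool isIdentical(const ToyValue &a, const ToyValue &b) {
  return a.opcode == b.opcode && a.operandIds == b.operandIds;
}

// Explicit whitelist: only for these opcodes does structural identity
// imply that both instructions compute the same value.
bool computesEqualValueWhenIdentical(ToyOpcode opcode) {
  switch (opcode) {
  case ToyOpcode::Add:
  case ToyOpcode::Mul:
  case ToyOpcode::GetElementPtr:
    return true;
  case ToyOpcode::Alloca: // two allocas of one type are distinct objects
  case ToyOpcode::Call:
    return false;
  }
  assert(false && "unhandled opcode");
  return false;
}

bool hasSameValue(const ToyValue &a, const ToyValue &b) {
  if (&a == &b)
    return true; // the very same instruction
  return isIdentical(a, b) && computesEqualValueWhenIdentical(a.opcode);
}
```

A whitelist fails safe: an opcode nobody has thought about yet is treated as not computing equal values, rather than being assumed equal by default.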

Fixes PR24952.

llvm-svn: 248690

f1090b60

[InstCombine] fold zexts and constants into a phi (PR24766) · 95334075

Sanjay Patel authored Sep 27, 2015

This is one step towards solving PR24766:
https://llvm.org/bugs/show_bug.cgi?id=24766

We were not producing the same IR for these two C functions because the store
to the temp bool causes extra zexts:

#include <stdbool.h>

bool switchy(char x1, char x2, char condition) {
   bool conditionMet = false;
   switch (condition) {
   case 0: conditionMet = (x1 == x2); break;
   case 1: conditionMet = (x1 <= x2); break;
   }
   return conditionMet;
}

bool switchy2(char x1, char x2, char condition) {
   switch (condition) {
   case 0: return (x1 == x2);
   case 1: return (x1 <= x2);
   }
   return false;
}

As noted in the code comments, this test case manages to avoid the more general existing
phi optimizations where there are only 2 phi inputs or where there are no constant phi
args mixed in with the cast ops. It seems like a corner case, but if we don't catch it,
then I don't think we can get SimplifyCFG to further optimize towards the canonical form
for this function shown in the bug report.

Differential Revision: http://reviews.llvm.org/D12866

llvm-svn: 248689

95334075

BlockGenerator: Be less aggressive with deleting dead instructions · 0722a1e5

Tobias Grosser authored Sep 27, 2015

We now only delete trivially dead instructions in the BB we copy (copyBB), but
not in any other BB. Only for copyBB we know that there will _never_ be any
future uses of instructions that have no use after copyBB has been generated.
Other instructions in the AST that have been generated by IslNodeBuilder may
look dead at the moment, but may possibly still be referenced by GlobalMaps. If
we delete them now, later uses would break surprisingly.

We do not have a test case that breaks due to us deleting too many instructions.
This issue was found by inspection.

llvm-svn: 248688

0722a1e5