Commits · 3a622a14f9eb62b9f8aae5f5e937b5a62dab06a4 · Lorenzo Albano / LLVM bpEVL

Aug 17, 2017

[AVX512] Don't switch unmasked subvector insert/extract instructions when AVX512DQI is enabled. · 3a622a14

Craig Topper authored Aug 17, 2017

There's no reason to switch instructions with and without DQI. It just creates extra isel patterns and test divergences.

There is however value in enabling the masked version of the instructions with DQI.

This required introducing some new multiclasses to enabling this splitting.

Differential Revision: https://reviews.llvm.org/D36661

llvm-svn: 311091

3a622a14

[X86] Remove memopmmx pattern fragment · 59608480

Craig Topper authored Aug 17, 2017

Summary: Just like the FIXME says, there is no alignment requirement for MMX.

Reviewers: RKSimon, zvi, igorb

Reviewed By: RKSimon

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D36815

llvm-svn: 311090

59608480

Mark Verifier/invalid-eh.ll as unsupported on windows · bcfd8e28

Victor Leschuk authored Aug 17, 2017

Mark this unsupported for now as it causes tests hangs on buildbot.
Will place it back when the problem is debugged.

llvm-svn: 311089

bcfd8e28

Unguarded availability diagnoser should use TraverseStmt instead of · 98f9fcdb

Alex Lorenz authored Aug 17, 2017

Base::TraverseStmt when visiting the then/else branches of if statements

This ensures that the statement stack is correctly tracked and correct
multi-statement fixit is generated inside of an if (@available)

llvm-svn: 311088

98f9fcdb

[dfsan] Add explicit zero extensions for shadow parameters in function wrappers. · b5205c69

Simon Dardis authored Aug 17, 2017

In the case where dfsan provides a custom wrapper for a function,
shadow parameters are added for each parameter of the function.
These parameters are i16s. For targets which do not consider this
a legal type, the lack of sign extension information would cause
LLVM to generate anyexts around their usage with phi variables
and calling convention logic.

Address this by introducing zero exts for each shadow parameter.

Reviewers: pcc, slthakur

Differential Revision: https://reviews.llvm.org/D33349

llvm-svn: 311087

b5205c69

[clang-tidy] Ignore statements inside a template instantiation. · 80330ffc

Haojian Wu authored Aug 17, 2017

Reviewers: alexfh

Reviewed By: alexfh

Subscribers: JDevlieghere, xazax.hun, cfe-commits

Differential Revision: https://reviews.llvm.org/D36822

llvm-svn: 311086

80330ffc

Print enum constant values using the original source formatting · 36070ed8
Alex Lorenz authored Aug 17, 2017
```
if possible when creating "Declaration" nodes in XML comments

rdar://14765746

llvm-svn: 311085
```
36070ed8

[globalisel][tablegen] Generate TypeObject table. NFC · 032e7f2c

Daniel Sanders authored Aug 17, 2017

Summary:
Generate the type table from the types used by a target rather than hard-coding
the union of types used by all targets.

Depends on D36084

Reviewers: ab, t.p.northover, qcolombet, rovka, aditya_nandakumar

Reviewed By: rovka

Subscribers: kristof.beyls, igorb, llvm-commits

Differential Revision: https://reviews.llvm.org/D36085

llvm-svn: 311084

032e7f2c

[DAGCombiner] Add support for non-uniform constant vectors to (mul x, (1 << c)) -> x << c · 8be9f4af
Simon Pilgrim authored Aug 17, 2017
```
llvm-svn: 311083
```
8be9f4af
[X86] Refactoring of X86TargetLowering::EmitLoweredSelect. NFC. · 19f15843
Amjad Aboud authored Aug 17, 2017
```
Authored by aivchenk
Differential Revision: https://reviews.llvm.org/D35685

llvm-svn: 311082
```
19f15843

[Verifier] Avoid visiting DIGlobalVariables twice. · 903fd3ea

Davide Italiano authored Aug 17, 2017

We currently visit them twice.
Once, through `visitMDNode()` -> (the code generated by)
  `../include/llvm/IR/Metadata.def:109` -> `visitDIGlobalVariable()`
Then, through `visitMDNode()` -> `visitDIGlobalVariableExpression()`
  -> `visitDIGlobalVariable()`

This results in verification failures printed twice, e.g.:

  $ ./opt -verify ../../test/DebugInfo/pr34186.ll
  missing global variable type
  !4 = distinct !DIGlobalVariable(name: "pat", scope: !0,
    file: !1, line: 27, isLocal: true, isDefinition: true)
  missing global variable type
  !4 = distinct !DIGlobalVariable(name: "pat", scope: !0,
    file: !1, line: 27, isLocal: true, isDefinition: true)
  ./opt: ../../test/DebugInfo/pr34186.ll: error: input module is broken!

The patch removes one call so we ensure each GV is visited exactly once.

Differential Revision:  https://reviews.llvm.org/D36797

llvm-svn: 311081

903fd3ea

[ManagedMemoryRewrite] Learn how to rewrite global arrays, allocas. · 8a2c07f6

Siddharth Bhat authored Aug 17, 2017

- If we have global arrays, we would like to rewrite them to global
  pointers which are allocated using `cudaMallocManaged`.

- If we have allocas in a function, we would like to rewrite them to
  heap-allocations with `cudaMallocManaged` and `cudaFree`.

- With these rewrite mechanisms, we can offload _any_ function to the
  GPU with no code rewrite whatsover.

Differential Revision: https://reviews.llvm.org/D36516

llvm-svn: 311080

8a2c07f6

[clang-tidy] Don't generate fixes for initializer_list constructor in make_unique check. · 4b956cb7

Haojian Wu authored Aug 17, 2017

Summary:
The current fix will break the compilation -- because braced list is not
deducible in std::make_unique (with the use of forwarding) without
specifying the type explicitly.

We could support it in the future.

Reviewers: alexfh

Reviewed By: alexfh

Subscribers: JDevlieghere, xazax.hun, cfe-commits

Differential Revision: https://reviews.llvm.org/D36786

llvm-svn: 311078

4b956cb7

[LV] Using VPlan to model the vectorized code and drive its transformation · 66278833

Ayal Zaks authored Aug 17, 2017

VPlan is an ongoing effort to refactor and extend the Loop Vectorizer. This
patch introduces the VPlan model into LV and uses it to represent the vectorized
code and drive the generation of vectorized IR.

In this patch VPlan models the vectorized loop body: the vectorized control-flow
is represented using VPlan's Hierarchical CFG, with predication refactored from
being a post-vectorization-step into a vectorization planning step modeling
if-then VPRegionBlocks, and generating code inline with non-predicated code. The
vectorized code within each VPBasicBlock is represented as a sequence of
Recipes, each responsible for modelling and generating a sequence of IR
instructions. To keep the size of this commit manageable the Recipes in this
patch are coarse-grained and capture large chunks of LV's code-generation logic.
The constructed VPlans are dumped in dot format under -debug.

This commit retains current vectorizer output, except for minor instruction
reorderings; see associated modifications to lit tests.

For further details on the VPlan model see docs/Proposals/VectorizationPlan.rst
and its references.

Authors: Gil Rapaport and Ayal Zaks

Differential Revision: https://reviews.llvm.org/D32871

llvm-svn: 311077

66278833

Re-commit: [globalisel][tablegen] Support zero-instruction emission. · edd0784b

Daniel Sanders authored Aug 17, 2017

Summary:
Support the case where an operand of a pattern is also the whole of the
result pattern. In this case the original result and all its uses must be
replaced by the operand. However, register class restrictions can require
a COPY. This patch handles both cases by always emitting the copy and
leaving it for the register allocator to optimize.

The previous commit failed on Windows machines due to a flaw in the sort
predicate which allowed both A < B < C and B == C to be satisfied
simultaneously. The cause of this was some sloppiness in the priority order of
G_CONSTANT instructions compared to other instructions. These had equal priority
because it makes no difference, however there were operands had higher priority
than G_CONSTANT but lower priority than any other instruction. As a result, a
priority order between G_CONSTANT and other instructions must be enforced to
ensure the predicate defines a strict weak order.

Reviewers: ab, t.p.northover, qcolombet, rovka, aditya_nandakumar

Subscribers: javed.absar, kristof.beyls, igorb, llvm-commits

Differential Revision: https://reviews.llvm.org/D36084

llvm-svn: 311076

edd0784b

[SystemZ] Also wrap TII with #ifndef NDEBUG in constructor initilizer list. · 593d49c0
Jonas Paulsson authored Aug 17, 2017
```
TII needs to be wrapped with #ifndef NDEBUG to silece compiler warnings.

llvm-svn: 311075
```
593d49c0

[SystemZ] Add a wrapping with #ifndef NDEBUG to silence warning. · d346924a

Jonas Paulsson authored Aug 17, 2017

SystemZHazardRecognizer::TII is only used for debug output, so it needs
also to be wrapped with #ifndef NDEBUG.

llvm-svn: 311074

d346924a

[ELF] - Don't segfault when accessing location counter inside MEMORY command. · 7e5b0a59

George Rimar authored Aug 17, 2017

We would previously crash on next script:
MEMORY { name : ORIGIN = .; }

Patch fixes that.

Differential revision: https://reviews.llvm.org/D36138

llvm-svn: 311073

7e5b0a59

[SystemZ, MachineScheduler] Improve post-RA scheduling. · 57a705d9

Jonas Paulsson authored Aug 17, 2017

The idea of this patch is to continue the scheduler state over an MBB boundary
in the case where the successor block has only one predecessor. This means
that the scheduler will continue in the successor block (after emitting any
branch instructions) with e.g. maintained processor resource counters.
Benchmarks have been confirmed to benefit from this.

The algorithm in MachineScheduler.cpp that extracts scheduling regions of an
MBB has been extended so that the strategy may optionally reverse the order
of processing the regions themselves. This is controlled by a new method
doMBBSchedRegionsTopDown(), which defaults to false.

Handling the top-most region of an MBB first also means that a top-down
scheduler can continue the scheduler state across any scheduling boundary
between to regions inside MBB.

Review: Ulrich Weigand, Matthias Braun, Andy Trick.
https://reviews.llvm.org/D35053

llvm-svn: 311072

57a705d9

[SelectionDAG] Teach the vector-types operand scalarizer about SETCC · 124d3282

Elad Cohen authored Aug 17, 2017

When v1i1 is legal (e.g. AVX512) the legalizer can reach
a case where a v1i1 SETCC with an illgeal vector type operand
wasn't scalarized (since v1i1 is legal) but its operands does
have to be scalarized. This used to assert because SETCC was
missing from the vector operand scalarizer.

This patch attemps to teach the legalizer to handle these cases
by scalazring the operands, converting the node into a scalar
SETCC node.

Differential revision: https://reviews.llvm.org/D36651

llvm-svn: 311071

124d3282

Fix undefined behavior that is caused by not always initializing a bool. · a7e061f0

Daniel Jasper authored Aug 17, 2017

The fix in r310994 is incomplete, as moveFromAndCancel can set the
pointer without initializing OldIsSpeculativelyEvaluating.

llvm-svn: 311070

a7e061f0

[llvm-dlltool] Improve an error message when unable to open files. NFC. · caff3268
Martin Storsjö authored Aug 17, 2017
```
Differential Revision: https://reviews.llvm.org/D36818

llvm-svn: 311069
```
caff3268
[llvm-dlltool] Don't crash if no def file is provided or it can't be opened · 9d8ecb43
Martin Storsjö authored Aug 17, 2017
```
Differential Revision: https://reviews.llvm.org/D36780

llvm-svn: 311068
```
9d8ecb43

[CGP] Fix the rematerialization of gc.relocates · 9e5604db

Serguei Katkov authored Aug 17, 2017

If we want to substitute the relocation of derived pointer with gep of base then
we must ensure that relocation of base dominates the relocation of derived pointer.

Currently only check for basic block is present. However it is possible that both
relocation are in the same basic block but relocation of derived pointer is defined
earlier.

The patch moves the relocation of base pointer right before relocation of derived
pointer in this case.

Reviewers: sanjoy,artagnon,igor-laevsky,reames
Reviewed By: reames
Subscribers: llvm-commits
Differential Revision: https://reviews.llvm.org/D36462

llvm-svn: 311067

9e5604db

Add rewrite by-reference parameter pass · ed6a4acc

Tobias Grosser authored Aug 17, 2017

Summary:
This pass detangles induction variables from functions, which take variables by
reference. Most fortran functions compiled with gfortran pass variables by
reference. Unfortunately a common pattern, printf calls of induction variables,
prevent in this situation the promotion of the induction variable to a register,
which again inhibits any kind of loop analysis. To work around this issue
we developed a specialized pass which introduces separate alloca slots for
known-read-only references, which indicate the mem2reg pass that the induction
variables can be promoted to registers and consquently enable SCEV to work.

We currently hardcode the information that a function
_gfortran_transfer_integer_write does not read its second parameter, as
dragonegg does not add the right annotations and we cannot change old dragonegg
releases. Hopefully flang will produce the right annotations.

Reviewers: Meinersbur, bollu, singam-sanjay

Reviewed By: bollu

Subscribers: mgorny, pollydev, llvm-commits

Tags: #polly

Differential Revision: https://reviews.llvm.org/D36800

llvm-svn: 311066

ed6a4acc

Further refactoring of the constant emitter. NFC. · 99e5e98e
John McCall authored Aug 17, 2017
```
llvm-svn: 311065
```
99e5e98e

[analyzer] Add support for reference counting of parameters on the callee side · a179e171

Devin Coughlin authored Aug 17, 2017

This commit adds the functionality of performing reference counting on the
callee side for Integer Set Library (ISL) to Clang Static Analyzer's
RetainCountChecker.

Reference counting on the callee side can be extensively used to perform
debugging within a function (For example: Finding leaks on error paths).

Patch by Malhar Thakkar!

Differential Revision: https://reviews.llvm.org/D36441

llvm-svn: 311063

a179e171

Revert "[MachineCopyPropagation] Extend pass to do COPY source forwarding" · 4e38e02e

Geoff Berry authored Aug 17, 2017

This reverts commit r311038.

Several buildbots are breaking, and at least one appears to be due to
the forwarding of physical regs enabled by this change.  Reverting while
I investigate further.

llvm-svn: 311062

4e38e02e

ARM: mark CPSR as clobbered for Windows VLAs · dd8c16b5

Saleem Abdulrasool authored Aug 17, 2017

When lowering a VLA, we emit a __chstk call.  However, this call can
internally clobber CPSR.  We did not mark this register as an ImpDef,
which could potentially allow a comparison to be hoisted above the call
to `__chkstk`.  In such a case, the CPSR could be clobbered, and the
check invalidated.  When the support was initially added, it seemed that
the call would take care of preventing CPSR from being clobbered, but
this is not the case.  Mark the register as clobbered to fix a possible
state corruption.

llvm-svn: 311061

dd8c16b5

[X86] Exchange the memory op predicate for PALIGNR/VPALIGNR. I accidentally swapped them. · 2f9743d2
Craig Topper authored Aug 17, 2017
```
llvm-svn: 311060
```
2f9743d2

[X86] Cleanup multiclasses for SSE/AVX2 PALIGNR. Add missing load patterns. · 5357526c

Craig Topper authored Aug 17, 2017

We used to have a separate multiclass for AVX2 and SSE/AVX. Now we have one multiclass and pass the relevant differences.

We were also missing load patterns, though we had them for the AVX-512 version.

llvm-svn: 311059

5357526c

[X86] Remove patterns for PALIGNR with non-vXi8 types. · bbe3e46b
Craig Topper authored Aug 17, 2017
```
llvm-svn: 311058
```
bbe3e46b

Reapply: [ADCE][Dominators] Teach ADCE to preserve dominators · fd5c5c91

Jakub Kuderski authored Aug 17, 2017

Summary:
This patch teaches ADCE to preserve both DominatorTrees and PostDominatorTrees.

I didn't notice any performance impact when bootstrapping clang with this patch.

The patch was originally committed in r311039 and reverted in r311049.
This revision fixes the problem with not adding a dependency on the
DominatorTreeWrapperPass for the LegacyPassManager.

Reviewers: dberlin, chandlerc, sanjoy, davide, grosser, brzycki

Reviewed By: davide

Subscribers: grandinj, zhendongsu, llvm-commits, david2050

Differential Revision: https://reviews.llvm.org/D35869

llvm-svn: 311057

fd5c5c91

Remove a lock and use a std::unique_ptr instead. · 314a0050

Rui Ueyama authored Aug 17, 2017

We had a lock to guard BAlloc from being used concurrently, but that
is not very easy to understand. This patch replaces it with a
std::unique_ptr.

llvm-svn: 311056

314a0050

[X86] Put multiclass closer to its use and simplify slightly. NFC · 42a53535
Craig Topper authored Aug 16, 2017
```
llvm-svn: 311055
```
42a53535
[X86] Use a static array instead of a SmallVector for a small fixed size array. NFC · 9025579e
Craig Topper authored Aug 16, 2017
```
llvm-svn: 311054
```
9025579e

[index] Add indexing for unresolved-using declarations · fd6e39c4

Ben Langmuir authored Aug 16, 2017

In dependent contexts we end up referencing these, so make sure they
have USRs, and have their declarations indexed. For the most part they
behave like typedefs, but we also need to worry about having multiple
using declarations with the same "name".

rdar://problem/33883650

llvm-svn: 311053

fd6e39c4

[x86] add cmov promotion tests for D36711; NFC · 4abc3f60

Sanjay Patel authored Aug 16, 2017

This way we can see what the current codegen looks like.
I've also explicitly added/removed the cmov attribute from the RUN lines,
so we know exactly what we're checking in the runs.

llvm-svn: 311052

4abc3f60

Fix typos in comments; NFC · 03d5db48
George Burgess IV authored Aug 16, 2017
```
llvm-svn: 311051
```
03d5db48

[InstCombine] Teach canEvaluateTruncated to handle arithmetic shift (including... · 86111c66

Amjad Aboud authored Aug 16, 2017

[InstCombine] Teach canEvaluateTruncated to handle arithmetic shift (including those with vector splat shift amount)

Differential Revision: https://reviews.llvm.org/D36784

llvm-svn: 311050

86111c66