Commits · 87867988f9a57bae14a1d14865cca926205685ba · Lorenzo Albano / LLVM bpEVL

Oct 11, 2017

Convert a couple of ErrorOr to Expected. NFC. · 87867988
Rafael Espindola authored Oct 11, 2017
```
llvm-svn: 315475
```
87867988

Convert an ErrorOr to Expected. · 1a0e5a19

Rafael Espindola authored Oct 11, 2017

getRelocationAddend should never be called on non-SHT_RELA sections,
but changing that requires changing RelocVisitor.h.

llvm-svn: 315473

1a0e5a19

[Hexagon] Handle non-immediate operands to A2_addi in getIncrementValue · bf626195
Krzysztof Parzyszek authored Oct 11, 2017
```
llvm-svn: 315472
```
bf626195
Spelling mistake in comment. NFCI. · 7db36663
Simon Pilgrim authored Oct 11, 2017
```
llvm-svn: 315471
```
7db36663

[X86] Remove MVT::i1 handling code from LowerTRUNCATE · 3dc22bba

Craig Topper authored Oct 11, 2017

Summary: I don't think this is necessary with i1 being illegal now.

Reviewers: RKSimon, zvi, guyblank

Reviewed By: RKSimon

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D38784

llvm-svn: 315469

3dc22bba

[Pipeliner] Fix offset value for instrs dependent on post-inc load/stores · 12bdcab5

Krzysztof Parzyszek authored Oct 11, 2017

The software pipeliner and the packetizer try to break dependence
between the post-increment instruction and the dependent memory
instructions by changing the base register and the offset value.
However, in some cases, the existing logic didn't work properly
and produced an incorrect offset value.

Patch by Jyotsna Verma.

llvm-svn: 315468

12bdcab5

[Pipeliner] Improve serialization order for post-increments · 8f174dde

Krzysztof Parzyszek authored Oct 11, 2017

The pipeliner is generating a serial sequence that causes poor
register allocation when a post-increment instruction appears
prior to the use of the post-increment register. This occurs when
there is a circular set of dependences involved with a sequence
of instructions in the same cycle. In this case, there is no
serialization of the parallel semantics that will not cause an
additional register to be allocated.

This patch fixes the problem by changing the instructions so that
the post-increment instruction is used by the subsequent
instruction, which enables the register allocator to make a
better decision and not require another register.

Patch by Brendon Cahoon.

llvm-svn: 315466

8f174dde

[DAGCombiner] convert insertelement of bitcasted vector into shuffle · 34fd5eaa

Sanjay Patel authored Oct 11, 2017

Eg:
insert v4i32 V, (v2i16 X), 2 --> shuffle v8i16 V', X', {0,1,2,3,8,9,6,7}

This is a generalization of the IR fold in D38316 to handle insertion into a non-undef vector.
We may want to abandon that one if we can't find value in squashing the more specific pattern sooner.

We're using the existing legal shuffle target hook to avoid AVX512 horror with vXi1 shuffles.

There may be room for improvement in the shuffle lowering here, but that would be follow-up work.
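
The mask in the example above follows from viewing both operands at the narrower element width. Below is a minimal C++ sketch of that index arithmetic. It assumes X' is X widened to the same element count as V', and it ignores endianness and the shuffle-legality check. `buildInsertAsShuffleMask` is a hypothetical helper, not an LLVM function.

```cpp
#include <cassert>
#include <vector>

// Hypothetical helper (not LLVM API). A vector with `wideElts` wide elements
// is viewed as `wideElts * subElts` narrow elements (V'). The inserted value
// contributes `subElts` narrow elements, taken from the start of the second
// shuffle operand (X'), whose indices begin at the narrow element count.
std::vector<int> buildInsertAsShuffleMask(int wideElts, int subElts, int insertIdx) {
    assert(wideElts > 0 && subElts > 0);
    assert(insertIdx >= 0 && insertIdx < wideElts);
    const int narrowElts = wideElts * subElts;
    std::vector<int> mask(narrowElts);
    for (int i = 0; i < narrowElts; ++i)
        mask[i] = i;                                    // identity: keep V'[i]
    for (int j = 0; j < subElts; ++j)
        mask[insertIdx * subElts + j] = narrowElts + j; // take X'[j]
    return mask;
}
```

For the example in this message, `buildInsertAsShuffleMask(4, 2, 2)` covers a v4i32 vector viewed as v8i16 with insertion at index 2, and yields {0,1,2,3,8,9,6,7}.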

Differential Revision: https://reviews.llvm.org/D38388

llvm-svn: 315460

34fd5eaa

[TargetLowering] Correctly track NumFixedArgs field of CallLoweringInfo · 4d275f0d

Alex Bradbury authored Oct 11, 2017

The NumFixedArgs field of CallLoweringInfo is used by
TargetLowering::LowerCallTo to determine whether a given argument is passed
using the vararg calling convention or not (specifically, to set IsFixed for
each ISD::OutputArg).

Firstly, CallLoweringInfo::setLibCallee and CallLoweringInfo::setCallee both
incorrectly set NumFixedArgs based on the _previous_ args list. Secondly,
TargetLowering::LowerCallTo failed to increment NumFixedArgs when modifying
the argument list so a pointer is passed for the return value.
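
The first problem described above can be illustrated with a small C++ sketch. `CallInfoSketch` and both setters are hypothetical stand-ins, not the real CallLoweringInfo. The sketch assumes the bug has the shape the message describes: the count is read from the stored list before the new list is assigned.

```cpp
#include <cassert>
#include <utility>
#include <vector>

// Hypothetical, heavily simplified stand-in for CallLoweringInfo.
struct CallInfoSketch {
    std::vector<int> Args;      // stand-in for the argument list
    unsigned NumFixedArgs = 0;

    // Buggy shape: the count comes from the *previous* argument list.
    CallInfoSketch &setCalleeBuggy(std::vector<int> &&ArgsList) {
        NumFixedArgs = static_cast<unsigned>(Args.size());
        Args = std::move(ArgsList);
        return *this;
    }

    // Corrected shape: the count comes from the list being installed.
    CallInfoSketch &setCalleeFixed(std::vector<int> &&ArgsList) {
        NumFixedArgs = static_cast<unsigned>(ArgsList.size());
        Args = std::move(ArgsList);
        assert(NumFixedArgs == Args.size());
        return *this;
    }
};
```

With the buggy setter, a freshly constructed object reports zero fixed arguments regardless of how many are passed, so every argument would be treated as variadic.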

If your backend uses the IsFixed property or directly accesses NumFixedArgs,
it is _possible_ this change could result in codegen changes (although the
previous behaviour would have been incorrect). No such cases have been
identified during code review for any in-tree architecture.

Differential Revision: https://reviews.llvm.org/D37898

llvm-svn: 315457

4d275f0d

[RISCV] Fix build after r315327 · 5c1eef46

Alex Bradbury authored Oct 11, 2017

Differential Revision: https://reviews.llvm.org/D38779
Patch by Chih-Mao Chen.

llvm-svn: 315455

5c1eef46

[mips] Add support for parsing target specific flags for MIR · 41851e35
Simon Dardis authored Oct 11, 2017
```
Reviewers: atanasyan

Differential Revision: https://reviews.llvm.org/D38620

llvm-svn: 315451
```
41851e35
[NFC] Fix variables used only for assert in GVN · fecaff1b
Max Kazantsev authored Oct 11, 2017
```
llvm-svn: 315448
```
fecaff1b

[Asm] Add debug tracing in table-generated assembly matcher · 4191b9ea

Oliver Stannard authored Oct 11, 2017

This adds debug tracing to the table-generated assembly instruction matcher,
enabled by the -debug-only=asm-matcher option.

The changes in the target AsmParsers are to add an MCInstrInfo reference under
a consistent name, so that we can use it from table-generated code. This was
already being used this way for targets that use deprecation warnings, but 5
targets did not have it, and Hexagon had it under a different name to the other
backends.

llvm-svn: 315445

4191b9ea

[GVN] Prevent LoadPRE from hoisting across instructions that don't pass control flow to successors · 3b81809e

Max Kazantsev authored Oct 11, 2017

This patch fixes the miscompile that happens when PRE hoists loads across guards and
other instructions that don't always pass control flow to their successors. PRE is now prohibited
from hoisting across such instructions because there is no guarantee that a load located after such
an instruction is still valid before it. For example, a load from under a guard may be
invalid before the guard in the following case:
  int array[LEN];
  ...
  guard(0 <= index && index < LEN);
  use(array[index]);

Differential Revision: https://reviews.llvm.org/D37460

llvm-svn: 315440

3b81809e

[LICM] Disallow sinking of unordered atomic loads into loops · 0c8dd052

Max Kazantsev authored Oct 11, 2017

Sinking an unordered atomic load into a loop must be disallowed because it turns
a single load into multiple loads. The relevant section of the documentation
is: http://llvm.org/docs/Atomics.html#unordered, specifically the Notes for
Optimizers section. Here is the full text of this section:

> Notes for optimizers
> In terms of the optimizer, this **prohibits any transformation that
> transforms a single load into multiple loads**, transforms a store into
> multiple stores, narrows a store, or stores a value which would not be
> stored otherwise. Some examples of unsafe optimizations are narrowing
> an assignment into a bitfield, rematerializing a load, and turning loads
> and stores into a memcpy call. Reordering unordered operations is safe,
> though, and optimizers should take advantage of that because unordered
> operations are common in languages that need them.
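
To make the "single load into multiple loads" point concrete, here is a minimal C++ sketch. C++ has no "unordered" ordering, so `std::memory_order_relaxed` is used as the closest analogue. `CountedAtomic` and both functions are hypothetical and exist only to count how many loads each loop shape issues.

```cpp
#include <atomic>
#include <cassert>

// Hypothetical wrapper that counts the loads issued against the atomic.
struct CountedAtomic {
    std::atomic<int> Value{0};
    mutable int Loads = 0;
    int load() const {
        ++Loads;
        return Value.load(std::memory_order_relaxed); // stand-in for "unordered"
    }
};

// Original shape: a single load before the loop.
int sumLoadOutsideLoop(const CountedAtomic &A, int N) {
    assert(N >= 0);
    const int V = A.load();
    int Sum = 0;
    for (int I = 0; I < N; ++I)
        Sum += V;
    return Sum;
}

// Shape after sinking: the load is repeated on every iteration, so a
// concurrent writer could make different iterations observe different values.
int sumLoadInsideLoop(const CountedAtomic &A, int N) {
    assert(N >= 0);
    int Sum = 0;
    for (int I = 0; I < N; ++I)
        Sum += A.load();
    return Sum;
}
```

Single-threaded, both functions return the same sum. The difference is the number of loads: one in the first shape, N in the second.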

Patch by Daniil Suchkov!

Reviewed By: reames
Differential Revision: https://reviews.llvm.org/D38392

llvm-svn: 315438

0c8dd052

[IRCE] Do not process empty safe ranges · 25d8655d

Max Kazantsev authored Oct 11, 2017

IRCE should not apply when the safe iteration range is proved to be empty.
In that case we do unnecessary work creating pre/post loops and then never
enter the main loop.

This patch makes IRCE skip empty safe ranges, adds a test for this
situation, and slightly modifies one of the existing tests where this used
to happen.
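
A rough C++ sketch of the idea follows. It uses concrete integers, whereas the pass itself reasons about symbolic bounds. `IterRange`, `intersectSafeRanges` and `shouldApplyIRCE` are hypothetical names, and the half-open range convention is an assumption made for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <optional>

// Hypothetical half-open iteration range [Begin, End).
struct IterRange {
    long Begin;
    long End;
};

// Intersect two safe ranges; return nothing when the result is empty.
std::optional<IterRange> intersectSafeRanges(IterRange A, IterRange B) {
    assert(A.Begin <= A.End && B.Begin <= B.End);
    IterRange R{std::max(A.Begin, B.Begin), std::min(A.End, B.End)};
    if (R.Begin >= R.End)
        return std::nullopt; // empty: no iteration is safe for both checks
    return R;
}

// Splitting into pre/main/post loops only pays off if the main loop can run.
bool shouldApplyIRCE(IterRange A, IterRange B) {
    return intersectSafeRanges(A, B).has_value();
}
```

For example, the ranges [0, 5) and [5, 9) have an empty intersection, so the sketch reports that the transformation should be skipped.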

Reviewed By: anna
Differential Revision: https://reviews.llvm.org/D38577

llvm-svn: 315437

25d8655d

[GVN] Don't replace constants with constants. · e2138fe4

Davide Italiano authored Oct 11, 2017

This fixes PR34908. Patch by Alex Crichton!

Differential Revision: https://reviews.llvm.org/D38765

llvm-svn: 315429

e2138fe4

WIN32_FIND_DATA -> WIN32_FIND_DATAW. · b4f1b885
Peter Collingbourne authored Oct 11, 2017
```
Should fix mingw bot.

llvm-svn: 315413
```
b4f1b885

[MC] Have MCObjectStreamer take its MCAsmBackend argument via unique_ptr. · 02d33054

Lang Hames authored Oct 11, 2017

MCObjectStreamer owns its MCAsmBackend -- this fixes the types to reflect that,
and allows us to remove another instance of MCObjectStreamer's weird "holding
ownership via someone else's reference" trick.
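
The ownership change can be sketched in plain C++ as follows. `BackendSketch` and `StreamerSketch` are hypothetical stand-ins for MCAsmBackend and MCObjectStreamer, and the real constructor takes more parameters than shown here.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>

// Hypothetical stand-in for the backend type.
struct BackendSketch {
    std::string Name;
};

// Hypothetical stand-in for the streamer: taking the backend by unique_ptr
// makes the ownership transfer visible in the constructor's signature.
class StreamerSketch {
    std::unique_ptr<BackendSketch> Backend;

public:
    explicit StreamerSketch(std::unique_ptr<BackendSketch> B)
        : Backend(std::move(B)) {
        assert(Backend && "streamer needs a backend");
    }

    BackendSketch &getBackend() { return *Backend; }
};
```

A caller has to write `StreamerSketch S(std::move(B));`, after which `B` is null and the streamer destroys the backend when it is itself destroyed.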

llvm-svn: 315410

02d33054

Silence MSVC warnings about unsigned wrapping without UB · 51b2cd8f

Reid Kleckner authored Oct 11, 2017

Of course, casting an unsigned value too large for 'int' is UB. So,
write out the ternary. LLVM folds it to ADD anyway.

Fixes the warning from r303693 a different way.

Thanks to Erich Keane for pointing this out!

llvm-svn: 315406

51b2cd8f

[X86] Remove temporary std::string creation from shuffle comment printing. We... · 85b1da1d

Craig Topper authored Oct 11, 2017

[X86] Remove temporary std::string creation from shuffle comment printing. We can just write directly to the raw_ostream.

llvm-svn: 315399

85b1da1d

[X86] Add 128-bit version of vbroadcasti32x2 to shuffle comment decoding. · 6ce20bd1
Craig Topper authored Oct 11, 2017
```
llvm-svn: 315395
```
6ce20bd1

CodeGen: Minor cleanups to use MachineInstr::getMF. NFC · fdf9bf4f

Justin Bogner authored Oct 10, 2017

Since r315388 we have a shorter way to say this, so we'll replace
MI->getParent()->getParent() with MI->getMF() in a few places.

llvm-svn: 315390

fdf9bf4f

CodeGen: Add MachineInstr::getMF(). NFC · ec7cba53

Justin Bogner authored Oct 10, 2017

Similarly to how Instruction has getFunction, this adds a less verbose
way to write MI->getParent()->getParent(). I'll follow up shortly with
a change that changes a bunch of the uses.
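
A minimal sketch of the shorthand is shown below, using simplified stand-in types rather than the real CodeGen classes. `SketchMF`, `SketchMBB` and `SketchMI` are hypothetical, and the real accessor's return type and const-qualification may differ.

```cpp
#include <cassert>

// Hypothetical stand-ins for MachineFunction, MachineBasicBlock, MachineInstr.
struct SketchMF {
    int Id;
};

struct SketchMBB {
    SketchMF *Parent;
    SketchMF *getParent() const { return Parent; }
};

struct SketchMI {
    SketchMBB *Parent;
    SketchMBB *getParent() const { return Parent; }

    // Shorthand for getParent()->getParent(), analogous to getMF().
    SketchMF *getMF() const {
        assert(Parent && "instruction must be inside a basic block");
        return getParent()->getParent();
    }
};
```

With this in place, `MI.getMF()` and `MI.getParent()->getParent()` name the same function object.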

llvm-svn: 315388

ec7cba53

[Transforms] Fix some Clang-tidy modernize and Include What You Use warnings;... · e9ea08a0
Eugene Zelenko authored Oct 10, 2017
```
[Transforms] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC).

llvm-svn: 315383
```
e9ea08a0
[X86] Add broadcast patterns that allow a scalar_to_vector between the broadcast and the load. · bb0e316d
Craig Topper authored Oct 10, 2017
```
We already have these patterns for AVX512VL, but not AVX1 or 2.

llvm-svn: 315382
```
bb0e316d
[CodeGen] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). · 149178d9
Eugene Zelenko authored Oct 10, 2017
```
llvm-svn: 315380
```
149178d9

Support: Have directory_iterator::status() return FindFirstFileEx/FindNextFile results on Windows. · 0dfdb447

Peter Collingbourne authored Oct 10, 2017

This allows clients to avoid an unnecessary fs::status() call on each
directory entry. Because the information returned by FindFirstFileEx
is a subset of the information returned by a regular status() call,
I needed to extract a base class from file_status that contains only
that information.

On my machine, this reduces the time required to enumerate a ThinLTO
cache directory containing 520k files from almost 4 minutes to less
than 2 seconds.

Differential Revision: https://reviews.llvm.org/D38716

llvm-svn: 315378

0dfdb447

Oct 10, 2017

Make the ELFObjectFile constructor private. · ef421f9c

Rafael Espindola authored Oct 10, 2017

This forces every user to use the new create method that returns an
Expected. This in turn propagates better error messages.
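
The pattern can be sketched with the standard library alone. `ExpectedLike` is a hypothetical stand-in for llvm::Expected built on `std::variant`, and `ObjectFileSketch` with its magic-number check is illustrative only, not the real ELFObjectFile validation.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <variant>

// Hypothetical stand-in for llvm::Expected<T>: a value or an error message.
template <typename T>
using ExpectedLike = std::variant<T, std::string>;

class ObjectFileSketch {
    std::string Data;

    // Private: callers cannot bypass the validation done in create().
    explicit ObjectFileSketch(std::string D) : Data(std::move(D)) {
        assert(Data.size() >= 4 && "create() validates the buffer first");
    }

public:
    // The only way to obtain an object; failures carry a message.
    static ExpectedLike<ObjectFileSketch> create(std::string Buffer) {
        if (Buffer.size() < 4 || Buffer.compare(0, 4, "\x7f" "ELF") != 0)
            return std::string("not an ELF file: bad magic");
        return ObjectFileSketch(std::move(Buffer));
    }

    std::size_t size() const { return Data.size(); }
};
```

Because construction can only happen through `create`, every caller has to inspect the result before touching the object, which is what lets the error message propagate.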

llvm-svn: 315371

ef421f9c

Use the first instruction's count to estimate the function's entry frequency. · 3f56a05a

Dehao Chen authored Oct 10, 2017

Summary: In the current implementation, we only have an accurate profile count for standalone symbols. For inlined functions, we do not have entry count data because it's not available in LBR. In this patch, we use the first instruction's frequency to estimate the function's entry count, especially for inlined functions. This may be inaccurate due to debug info in optimized code. However, this is a better estimate than the static 80/20 estimation we have in the current implementation.

Reviewers: tejohnson, davidxl

Reviewed By: tejohnson

Subscribers: sanjoy, llvm-commits, aprantl

Differential Revision: https://reviews.llvm.org/D38478

llvm-svn: 315369

3f56a05a

[X86] Fix some patterns that select VLX instructions, but were incorrectly... · ad3d0319

Craig Topper authored Oct 10, 2017

[X86] Fix some patterns that select VLX instructions, but were incorrectly also checking presence of BWI instructions.

The EVEX->VEX pass probably obscures this.

llvm-svn: 315365

ad3d0319

Simplify. NFC. · 04e4dbab
Rafael Espindola authored Oct 10, 2017
```
llvm-svn: 315364
```
04e4dbab

[mips] Correct the instruction predicates for microMIPSr3 · b994128d

Simon Dardis authored Oct 10, 2017

Rather than using the AdditionalPredicates mechanism to guard
the microMIPS instructions, use the existing predicates to properly
guard those instructions.

This also resolves a case where an instruction pattern was incorrectly
available for microMIPS32R6, which caused a register allocation failure
as the registers specified in the pattern were not available.

Reviewers: nitesh.jain, atanasyan

Differential Revision: https://reviews.llvm.org/D38451

llvm-svn: 315362

b994128d

AMDGPU: Fix missing skipFunction calls · f42074b6
Matt Arsenault authored Oct 10, 2017
```
llvm-svn: 315361
```
f42074b6

AMDGPU: Fix failure to select branch with optnone · d674e0ac

Matt Arsenault authored Oct 10, 2017

opt-bisect/optnone disable the AMDGPUUniformAnnotateValues pass.
The heuristic in the custom selector for brcond deferred the
branch uniformity check to the pattern, which would fail.

llvm-svn: 315360

d674e0ac

Convert condition to an early exit (NFC). · 3a3ba77b
Adrian Prantl authored Oct 10, 2017
```
<rdar://problem/34689604>

llvm-svn: 315359
```
3a3ba77b
AMDGPU: Fix incorrect selection of pseudo-branches · cc85223f
Matt Arsenault authored Oct 10, 2017
```
These should only be used if the machine structurizer is enabled.

llvm-svn: 315357
```
cc85223f
Convert two uses of ErrorOr to Expected. · 12db383e
Rafael Espindola authored Oct 10, 2017
```
llvm-svn: 315354
```
12db383e

[AMDGPU] Lower enqueued blocks and generate runtime metadata · de4b88d9

Yaxun Liu authored Oct 10, 2017

This patch adds a post-linking pass which replaces the function pointer of an enqueued
block kernel with a global variable (runtime handle) and adds a
runtime-handle attribute to the enqueued block kernel.

In LLVM CodeGen the runtime-handle metadata will be translated to
RuntimeHandle metadata in the code object. The runtime allocates a global buffer
for each kernel with RuntimeHandle metadata and saves the kernel address
required for the AQL packet into the buffer. The __enqueue_kernel function
in the device library knows that the invoke function pointer in the block
literal is actually a runtime handle; it loads the kernel address from it
and puts it into the AQL packet for dispatching.

This cannot be done in the FE since the FE cannot create a unique global variable
with external linkage across LLVM modules. A global variable with internal
linkage does not work since optimization passes will try to replace loads
of the global variable with its initialization value.

Differential Revision: https://reviews.llvm.org/D38610

llvm-svn: 315352

de4b88d9

Support: On Windows, use CreateFileW to delete files in sys::fs::remove(). · 0f9e8898
Peter Collingbourne authored Oct 10, 2017
```
This saves a call to stat().

Differential Revision: https://reviews.llvm.org/D38715

llvm-svn: 315351
```
0f9e8898