Commits · 066e73762df8c5d8a6d8b4fb919b71cc89f28dc8 · Roger Ferrer / llvm-epi

Jan 25, 2018

[X86] Name the MMX phaddd instruction with 3 Ds instead of just 2. NFC · 066e7376
Craig Topper authored Jan 25, 2018
```
llvm-svn: 323403
```

[X86] Remove 64/128/256 from MMX/SSE/AVX instruction names for overall consistency. NFC · dbddac09

Craig Topper authored Jan 25, 2018

MMX instructions all start with MMX_ so the 64 isn't needed for disambiguation.
SSE/AVX1 instructions are assumed 128-bit so we don't need to say 128.
AVX2 instructions should use a Y to indicate 256-bits.

llvm-svn: 323402


[X86] Remove unnecessary '_alt' and '_Int' from scheduler model regular expressions. · 81c87092

Craig Topper authored Jan 25, 2018

These were treated as optional suffixes, but the regular expressions are already prefix matches so this is unnecessary. It breaks the binary search optimization in tablegen due to the top level question mark.
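A minimal sketch of why the optional suffix is redundant, using Python's `re.match` (which anchors only at the start of the string) as a stand-in for the prefix matching described above. The instruction names are illustrative and not taken from the scheduler models.

```python
import re

# Illustrative instruction names (assumed, not from the actual models).
names = ["ADD32rr", "ADD32rr_alt", "ADD32rr_Int", "SUB32rr"]

def prefix_matches(pattern, candidates):
    # re.match anchors at the start only, so this behaves as a prefix match.
    return [n for n in candidates if re.match(pattern, n)]

with_suffix = prefix_matches(r"ADD32rr(_alt|_Int)?", names)
without_suffix = prefix_matches(r"ADD32rr", names)
```

Both patterns select the same names, so the suffix group adds nothing under prefix matching.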

llvm-svn: 323401


Add support for pattern matching MachineInsts. · 2036f446

Aditya Nandakumar authored Jan 25, 2018

https://reviews.llvm.org/D42439

Add InstCombine-like matchers for MachineInstructions. There are only
GlobalISel matchers for now.

llvm-svn: 323400


[ORC] Refactor the various lookupFlags methods to return the flags map via the · c8a74a04

Lang Hames authored Jan 25, 2018

first argument.

This makes lookupFlags more consistent with lookup (which takes the query as the
first argument) and composes better in practice, since lookups are usually
linearly chained: Each lookupFlags can populate the result map based on the
symbols not found in the previous lookup. (If the maps were returned rather than
passed by reference there would have to be a merge step at the end).
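The chaining described above can be sketched as follows. This is a Python model with hypothetical names, not the ORC API: each lookup fills the shared flags map passed as its first argument and returns the symbols it could not find, which feed the next lookup.

```python
def make_lookup_flags(table):
    # Build a lookupFlags-style function for one symbol table (hypothetical).
    def lookup_flags(flags, symbols):
        not_found = set()
        for name in symbols:
            if name in table:
                flags[name] = table[name]   # populate the shared result map
            else:
                not_found.add(name)
        return not_found
    return lookup_flags

first = make_lookup_flags({"foo": "exported"})
second = make_lookup_flags({"bar": "weak"})

flags = {}
remaining = first(flags, {"foo", "bar", "baz"})
remaining = second(flags, remaining)
```

Because every stage writes into the same map, no merge step is needed once the chain finishes.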

llvm-svn: 323398


[GISel]: Fix modules build by including <cassert> · 7cff1908
Aditya Nandakumar authored Jan 25, 2018
```
llvm-svn: 323394
```

[ORC] Try to silence compiler error at · 357b88dc

Lang Hames authored Jan 25, 2018

http://lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/17264

NFC.

llvm-svn: 323393


[GISel]: Implement GlobalISel combiner API. · 81c81b64

Aditya Nandakumar authored Jan 25, 2018

https://reviews.llvm.org/D41373

The various components are:

GICombinerHelper contains transformations that are common to all
targets. Targets can pick and choose which transformations (at
function/opcode granularity) each pass uses via configuring a
GICombinerInfo.

GICombiner contains some common code and it does the traversal,
driving of combines, worklist management and iterating until
convergence.

GICombinerInfo is an interface with a virtual method called combine.
The combiner info will allow targets to pick and choose (or
implement their own specific combines). CombineInfos can make
use of available combines in GICombinerHelper to configure the
transformations for a particular pass. Currently this approach allows
cherry picking transformations from helpers (at function/opcode
granularity) and also allows early returning on specific
transformations. Targets also get to prioritize whether target specific
combines run before/after the opt-in generic combines. Ideally we would
like this part to be configured by both C++ and Tablegen. The
CombinerInfo also has a field which indicates how to deal with
IllegalOps (i.e., should we allow creating them or legalize them?).

A CombinerPass would configure a CombinerInfo, create the GICombiner
with the Info, and call
GICombiner::combineMachineInstrs(MachineFunction&).
This organization is very similar to the GISelLegalizer.
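A rough Python model of this organization, under stated assumptions: the class names mirror the description, but the toy instruction tuples and the `FoldAddZeroInfo` combine are hypothetical and only illustrate the info/combiner split and the worklist loop.

```python
class CombinerInfo:
    # Interface: return True if the instruction at `index` was changed.
    def combine(self, instrs, index):
        raise NotImplementedError

class FoldAddZeroInfo(CombinerInfo):
    # Hypothetical target info that opts in to one toy combine:
    # ("add", x, 0) -> ("copy", x).
    def combine(self, instrs, index):
        op = instrs[index]
        if op[0] == "add" and op[2] == 0:
            instrs[index] = ("copy", op[1])
            return True
        return False

class Combiner:
    def __init__(self, info):
        self.info = info

    def combine_instrs(self, instrs):
        # Worklist of instruction indices; changed instructions are queued
        # again until nothing changes any more.
        worklist = list(range(len(instrs)))
        changed_any = False
        while worklist:
            index = worklist.pop()
            if self.info.combine(instrs, index):
                changed_any = True
                worklist.append(index)
        return changed_any

function = [("add", "x", 0), ("mul", "y", 2)]
changed = Combiner(FoldAddZeroInfo()).combine_instrs(function)
```

A pass would construct the info, hand it to the combiner, and run it over the function, as in the last two lines.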

llvm-svn: 323392


[GlobalISel][TableGen] Fix the statistics for emitted patterns · 4f3fa798

Volkan Keles authored Jan 25, 2018

Collected statistics for the number of patterns emitted can be
incorrect because rules can be grouped if OptimizeMatchTable
is enabled. Increase the counter in RuleMatcher::emit(...)
to avoid that.

llvm-svn: 323391


[ORC] Add helpers for building orc::SymbolResolvers from legacy findSymbol-style · d78ba0d4

Lang Hames authored Jan 24, 2018

functions/methods that return JITSymbols.

lookupFlagsWithLegacyFn takes a SymbolNameSet and a legacy lookup function and
returns a LookupFlagsResult. It uses the legacy lookup function to search for
each symbol. If found, getFlags is called on the symbol and the flags added to
the SymbolFlags map. If not found, the symbol is added to the SymbolsNotFound
set.

lookupWithLegacyFn takes an AsynchronousSymbolQuery, a SymbolNameSet and a
legacy lookup function. Each symbol in the SymbolNameSet is searched for via the
legacy lookup function. If it is found, its getAddress function is called
(triggering materialization if it has not happened already) and the resulting
mapping stored in the query. If it is not found the symbol is added to the
unresolved symbols set which is returned at the end of the function. If an
error occurs during legacy lookup or materialization it is passed to the
query via setFailed and the function returns immediately.
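The control flow of lookupWithLegacyFn can be sketched in Python as below. `Query` is a hypothetical stand-in for AsynchronousSymbolQuery, errors are modelled as `LookupError`, and returning the partially built unresolved set on failure is an assumption of this sketch.

```python
class Query:
    # Hypothetical stand-in for AsynchronousSymbolQuery.
    def __init__(self):
        self.resolved = {}
        self.error = None

    def set_definition(self, name, address):
        self.resolved[name] = address

    def set_failed(self, error):
        self.error = error

def lookup_with_legacy_fn(query, symbols, find_symbol):
    unresolved = set()
    for name in sorted(symbols):
        try:
            address = find_symbol(name)   # may raise to signal an error
        except LookupError as err:
            query.set_failed(err)         # report the error and stop early
            return unresolved
        if address is None:
            unresolved.add(name)          # not found: hand back to the caller
        else:
            query.set_definition(name, address)
    return unresolved

query = Query()
left = lookup_with_legacy_fn(query, {"main", "missing"}, {"main": 0x1000}.get)
```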

llvm-svn: 323388


Jan 24, 2018

[GlobalISel] Add a requires: asserts to a test. · 5ee03988
Amara Emerson authored Jan 24, 2018
```
llvm-svn: 323384
```

[TableGen] Add a way of getting the number of generic opcodes without... · 4890a71f

Benjamin Kramer authored Jan 24, 2018

[TableGen] Add a way of getting the number of generic opcodes without including modular CodeGen headers.

This is a bit of a hack, but removes a cycle that broke modular builds
of LLVM. Of course the cycle is still there in the form of a dependency
on the .def file.

llvm-svn: 323383


[InstCombine] fix datalayout in test file · 60c13c77

Sanjay Patel authored Jan 24, 2018

The only part of the datalayout that should matter for these tests
is the part that specifies the legal int widths ('n*'). But there
was a bug - that part of the string was not correctly separated with
the expected '-' character, so we were testing as if there were no
legal int widths at all. Removed the leading cruft so we have some 
legal ints to test with.
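To illustrate the failure mode, here is a simplified Python reader for the 'n' spec. It is not LLVM's parser, and both layout strings are made up for the example.

```python
def legal_int_widths(datalayout):
    # Specs are separated by '-'; the native integer widths spec starts with 'n'.
    widths = []
    for spec in datalayout.split("-"):
        if spec.startswith("n"):
            widths.extend(int(w) for w in spec[1:].split(":"))
    return widths

broken = "e-p:64:64:64n8:16:32:64"   # 'n' spec glued onto the previous spec
fixed = "n8:16:32:64"                # leading cruft removed

broken_widths = legal_int_widths(broken)
fixed_widths = legal_int_widths(fixed)
```

With the missing separator no spec starts with 'n', so the broken string yields no legal widths at all.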

I noticed this while testing a potential change to the way we 
transform shifts and sexts in D42424.

llvm-svn: 323377


[ORC] Add a LambdaSymbolResolver convenience class and docs for SymbolResolver. · 7f20eacf

Lang Hames authored Jan 24, 2018

This patch adds a LambdaSymbolResolver convenience utility that can create an
orc::SymbolResolver from a pair of function objects that supply the behavior for
the lookupFlags and lookup methods.
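The idea can be modelled in a few lines of Python. This is a hypothetical analogue that only shows the forwarding of both methods to caller-supplied function objects; it is not the C++ class.

```python
class LambdaSymbolResolver:
    # Hypothetical Python analogue: forwards both methods to callables.
    def __init__(self, lookup_flags_fn, lookup_fn):
        self._lookup_flags_fn = lookup_flags_fn
        self._lookup_fn = lookup_fn

    def lookup_flags(self, flags, symbols):
        return self._lookup_flags_fn(flags, symbols)

    def lookup(self, query, symbols):
        return self._lookup_fn(query, symbols)

# A resolver that finds nothing: every symbol is reported back as missing.
resolver = LambdaSymbolResolver(
    lambda flags, symbols: set(symbols),
    lambda query, symbols: set(symbols),
)
missing = resolver.lookup_flags({}, {"a", "b"})
```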

This class plays the same role for orc::SymbolResolver as the legacy
LambdaResolver class plays for LegacyJITSymbolResolver, and will replace the
latter class once all ORC APIs are migrated to orc::SymbolResolver.

This patch also adds some documentation for the orc::SymbolResolver class as
this was left out of the original commit.

llvm-svn: 323375


[Hexagon] Replace EmitFunctionEntryCode with a DAG preprocessing code · 14f3ef1f

Krzysztof Parzyszek authored Jan 24, 2018

The code in EmitFunctionEntryCode needs to know the maximum stack
alignment, but it runs very early in the selection process (before
lowering). The final stack alignment may change during lowering, so
the code needs to be moved to where the alignment is known.

llvm-svn: 323374


[globalisel] Fix long lines from r323342 · 538921dc

Daniel Sanders authored Jan 24, 2018

They would be fixed in a later patch but they shouldn't have been introduced.

llvm-svn: 323372


[AArch64][GlobalISel] Fall back during AArch64 isel if we have a volatile load. · 4f84f886

Amara Emerson authored Jan 24, 2018

The tablegen imported patterns for sext(load(a)) don't check for single uses
of the load or delete the original after matching. As a result two loads are
left in the generated code. This particular issue will be fixed by adding
support for a G_SEXTLOAD opcode in future.

There are however other potential issues around this that wouldn't be fixed by
a G_SEXTLOAD, so until we have a proper solution we don't try to handle volatile
loads at all in the AArch64 selector.

Fixes/works around PR36018.

llvm-svn: 323371


[GlobalISel] Don't fall back to FastISel. · f386e2b0

Amara Emerson authored Jan 24, 2018

Apparently checking the pass structure isn't enough to ensure that we don't fall
back to FastISel, as it's set up as part of the SelectionDAGISel.

llvm-svn: 323369


[X86][SSE] Aggressively use PMADDWD for v4i32 multiplies with 17 or more leading zeros · 9f551ad6

Simon Pilgrim authored Jan 24, 2018

As discussed in D41484, PMADDWD for 'zero extended' vXi32 is nearly always a better option than PMULLD:
On SNB the resulting code is no faster but also no slower, so we may as well keep it.
On KNL it has only half the throughput, so I've disabled it there; ideally there would be a better way to do this.
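The arithmetic behind the 17-leading-zeros condition can be checked with a small Python emulation of one 32-bit lane. This is a sketch of the instruction semantics as commonly documented, not code from the patch.

```python
def to_i16(value):
    # Interpret the low 16 bits as a signed 16-bit integer.
    value &= 0xFFFF
    return value - 0x10000 if value & 0x8000 else value

def pmaddwd_lane(a, b):
    # One 32-bit result lane: multiply signed 16-bit halves pairwise and add.
    lo = to_i16(a) * to_i16(b)
    hi = to_i16(a >> 16) * to_i16(b >> 16)
    return (lo + hi) & 0xFFFFFFFF

def pmulld_lane(a, b):
    # Low 32 bits of the 32-bit product.
    return (a * b) & 0xFFFFFFFF

a, b = 0x7FFF, 0x1234   # both below 2**15: at least 17 leading zeros
c = 0x8000              # only 16 leading zeros
```

With 17 or more leading zeros the high half is zero and the low half is non-negative as a signed 16-bit value, so both lanes agree. With only 16 leading zeros (`c`), the low half reads as negative and the results differ.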

Differential Revision: https://reviews.llvm.org/D42258

llvm-svn: 323367


Simplify. NFC. · 349fe0aa
Rafael Espindola authored Jan 24, 2018
```
Thanks to Teresa Johnson for the suggestion.

llvm-svn: 323365
```
[X86][SSE] Add slow-pmulld attribute (silvermont-style) test · 21f17d40
Simon Pilgrim authored Jan 24, 2018
```
Requested by @zvi on D42258

llvm-svn: 323364
```
Revert "[SLP] Fix for PR32086: Count InsertElementInstr of the same elements as shuffle." · 0affccc8
Alexey Bataev authored Jan 24, 2018
```
This reverts commit r323348 because of the broken buildbots.

llvm-svn: 323359
```
Revert "[ThinLTO] Add call edges' relative block frequency to per-module summary." · bf38deef
Easwaran Raman authored Jan 24, 2018
```
Causes buildbot regressions.

llvm-svn: 323358
```
Fix up and document controlling ccache via CMake options. · 052f14ef
Paul Robinson authored Jan 24, 2018
```
Patch by Matthew Davis!

Differential Revision: https://reviews.llvm.org/D41757

llvm-svn: 323357
```

[AMDGPU] Make sure all super regs of reserved regs are marked reserved. · c4796d47

Geoff Berry authored Jan 24, 2018

Summary:
Move reserveRegisterTuples into AMDGPURegisterInfo and use it in
R600RegisterInfo::getReservedRegs and
R600InstrInfo::reserveIndirectRegisters to ensure that all super
registers of reserved registers are also marked as reserved.

Before this change, under certain circumstances, the registers %t1_x and
%t1_xyzw would be marked as reserved, but %t1_xy and %t1_xyz would not
be, leading to the register allocator sometimes assigning a register to
%t1_xy, which is invalid since %t1_x is reserved.
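The invariant can be sketched in Python as follows. The register hierarchy is a hypothetical table built from the names in the example above, not the actual R600 register definitions.

```python
# Hypothetical register hierarchy: each tuple register lists its sub-registers.
SUB_REGS = {
    "t1_xy": ["t1_x", "t1_y"],
    "t1_xyz": ["t1_x", "t1_y", "t1_z"],
    "t1_xyzw": ["t1_x", "t1_y", "t1_z", "t1_w"],
}

def reserve_with_super_regs(reserved):
    # A register containing a reserved sub-register must be reserved too.
    result = set(reserved)
    for reg, subs in SUB_REGS.items():
        if any(sub in reserved for sub in subs):
            result.add(reg)
    return result

reserved = reserve_with_super_regs({"t1_x"})
```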

Reviewers: arsenm, tstellar, MatzeB, qcolombet

Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, mcrosier, llvm-commits

Differential Revision: https://reviews.llvm.org/D42448

llvm-svn: 323356


Revert r321751, "StructurizeCFG: Fix broken backedge detection" · 4afb64e4

Nicolai Haehnle authored Jan 24, 2018

It causes regressions in various OpenGL test suites.

Keep the test cases introduced by r321751 as XFAIL, and add a test case
for the regression.

Change-Id: I90b4cc354f68cebe5fcef1f2422dc8fe1c6d3514
Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=36015
llvm-svn: 323355


[ARM] Expand long shifts for Thumb1 to __aeabi_ calls · 665784f1

Weiming Zhao authored Jan 24, 2018

Summary: For long shifts, the inlined version takes about 20 instructions on Thumb1. To avoid the code bloat, expand to __aeabi_ calls if target is Thumb1.
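For a sense of what the inlined sequence has to compute, here is a Python sketch of a 64-bit left shift built only from 32-bit operations. It illustrates the work involved and is not the code the backend emits. A run-time helper such as the ARM RTABI's `__aeabi_llsl` performs the same operation behind a call.

```python
MASK32 = 0xFFFFFFFF

def shl64_from_halves(lo, hi, amount):
    # 64-bit left shift using only 32-bit pieces, for 0 <= amount < 64.
    if amount == 0:
        return lo, hi
    if amount >= 32:
        # The low word moves entirely into the high word.
        return 0, (lo << (amount - 32)) & MASK32
    new_hi = ((hi << amount) | (lo >> (32 - amount))) & MASK32
    new_lo = (lo << amount) & MASK32
    return new_lo, new_hi
```

The three cases and the cross-word carry are what make the inlined form long on a target without wide shift support.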

Reviewers: samparker

Reviewed By: samparker

Subscribers: samparker, aemerson, javed.absar, kristof.beyls, llvm-commits

Differential Revision: https://reviews.llvm.org/D42401

llvm-svn: 323354


[X86] Fix some inconsistencies in the itineraries and Sched for (V)PEXTRW/(V)PINSRW · 05af43fb
Craig Topper authored Jan 24, 2018
```
The weirdest being that PEXTRWrr was tagged as a memory operation.

llvm-svn: 323353
```

[X86] Adjust names of PINSRW/PEXTRW instructions between MMX/SSE/AVX/AVX512 for... · b85b484f

Craig Topper authored Jan 24, 2018

[X86] Adjust names of PINSRW/PEXTRW instructions between MMX/SSE/AVX/AVX512 for consistency and to maybe enable more regular expression compaction in the scheduler models. NFCI

llvm-svn: 323352


[X86] Remove '(_REV)?' from a bunch of scheduler regular expressions. NFC · 23cc866c

Craig Topper authored Jan 24, 2018

The regexes are already treated as a prefix match, so checking for optional text at the end provides no value. Instead, it prevents the binary search optimization in tablegen from kicking in due to the top-level question mark.
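One plausible reading of the binary search optimization, sketched in Python: when a pattern is a plain literal prefix, the matching names form one contiguous run in a sorted list, so the start of that run can be found by bisection instead of testing every name. This is an illustration with made-up names, not tablegen's implementation.

```python
from bisect import bisect_left

def names_with_prefix(sorted_names, prefix):
    # Binary search for the first candidate, then walk the contiguous run
    # of names that start with the prefix.
    lo = bisect_left(sorted_names, prefix)
    hi = lo
    while hi < len(sorted_names) and sorted_names[hi].startswith(prefix):
        hi += 1
    return sorted_names[lo:hi]

names = sorted(["MOV32rr", "MOV32rr_REV", "MOV64rr", "ADD32rr"])
matches = names_with_prefix(names, "MOV32rr")
```

A pattern such as `MOV32rr(_REV)?` is no longer a plain literal, which is why the shortcut would not apply to it.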

llvm-svn: 323351


[ThinLTO] Add call edges' relative block frequency to per-module summary. · 5f7aff9a

Easwaran Raman authored Jan 24, 2018

Summary:
This allows relative block frequency of call edges to be passed to the
thinlink stage where it will be used to compute synthetic entry counts
of functions.

Reviewers: tejohnson, pcc

Subscribers: mehdi_amini, llvm-commits, inglorion

Differential Revision: https://reviews.llvm.org/D42212

llvm-svn: 323349


[SLP] Fix for PR32086: Count InsertElementInstr of the same elements as shuffle. · 4bd8e533

Alexey Bataev authored Jan 24, 2018

Summary:
If the same value is going to be vectorized several times in the same
tree entry, this entry is considered to be a gather entry and cost of
this gather is counted as the cost of InsertElementInstrs for each gathered
value. But we can consider these elements as ShuffleInstr with
SK_PermuteSingle shuffle kind.

Reviewers: spatel, RKSimon, mkuper, hfinkel

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D38697

llvm-svn: 323348


[Hexagon] Run late copy propagation and dead code elimination passes · cf3ad584
Krzysztof Parzyszek authored Jan 24, 2018
```
llvm-svn: 323346
```
Handle R_386_PLT32 in RuntimeDyldELF. · fc16f76e
Rafael Espindola authored Jan 24, 2018
```
This should fix the 32 bit buildbots.

llvm-svn: 323344
```

InstSimplify: If divisor element is undef simplify to undef · 51f0d64b

Zvi Rackover authored Jan 24, 2018

Summary:
If any vector divisor element is undef, we can arbitrarily choose it to be
zero, which would make the div/rem an undef value by definition.

Reviewers: spatel, reames

Reviewed By: spatel

Subscribers: magabari, llvm-commits

Differential Revision: https://reviews.llvm.org/D42485

llvm-svn: 323343


[globalisel] Introduce LegalityQuery to better encapsulate the legalizer decisions. NFC. · 262ed0ec

Daniel Sanders authored Jan 24, 2018

Summary:
`getAction(const InstrAspect &) const` breaks encapsulation by exposing
the smaller components that are used to decide how to legalize an
instruction.

This is a problem because we need to change the implementation of
LegalizerInfo so that it's able to describe particular type combinations
rather than just cartesian products of types.

For example, declaring the following
  setAction({..., 0, s32}, Legal)
  setAction({..., 0, s64}, Legal)
  setAction({..., 1, s32}, Legal)
  setAction({..., 1, s64}, Legal)
currently declares these type combinations as legal:
  {s32, s32}
  {s64, s32}
  {s32, s64}
  {s64, s64}
but we currently have no means to say that, for example, {s64, s32} is
not legal. Some operations such as G_INSERT/G_EXTRACT/G_MERGE_VALUES/
G_UNMERGE_VALUES have relationships between the types that are currently
described incorrectly.

Additionally, G_LOAD/G_STORE currently have no means to legalize non-atomics
differently to atomics. The necessary information is in the MMO but we have no
way to use this in the legalizer. Similarly, there is currently no way for the
register type and the memory type to differ so there is no way to cleanly
represent extending-load/truncating-store in a way that can't be broken by
optimizers (resulting in illegal MIR).

This patch introduces LegalityQuery which provides all the information
needed by the legalizer to make a decision on whether something is legal
and how to legalize it.
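The cartesian-product limitation can be shown with a small Python model. Both functions are hypothetical: `legal_by_index` mimics the per-index setAction declarations from the example, while `legal_by_query` stands in for a rule that sees all types of an instruction at once.

```python
from itertools import product

# Per-index declarations, as in the setAction example: (type index, type).
per_index_legal = {(0, "s32"), (0, "s64"), (1, "s32"), (1, "s64")}

def legal_by_index(types):
    # Legal when every (index, type) pair was declared legal on its own.
    return all((i, t) in per_index_legal for i, t in enumerate(types))

def legal_by_query(types):
    # Hypothetical query-style rule that can reject one specific combination.
    return legal_by_index(types) and tuple(types) != ("s64", "s32")

combos = list(product(["s32", "s64"], repeat=2))
```

The per-index form accepts all four combinations, whereas the query form can exclude {s64, s32} while keeping the other three.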

Reviewers: ab, t.p.northover, qcolombet, rovka, aditya_nandakumar, volkan, reames, bogner

Reviewed By: bogner

Subscribers: bogner, llvm-commits, kristof.beyls

Differential Revision: https://reviews.llvm.org/D42244

llvm-svn: 323342


[NFC] Make magic number for DJB hash function customizable. · 5803a674

Jonas Devlieghere authored Jan 24, 2018

This allows us to specify the magic number for the DJB hash function.
This feature is needed by dsymutil to emit Apple types accelerator
table.
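A minimal Python sketch of a DJB-style hash with a configurable starting value, assuming the "magic number" refers to the conventional seed 5381. The function name and signature are illustrative, not LLVM's.

```python
def djb_hash(data, seed=5381):
    # Classic DJB hash over bytes: h = h * 33 + c, truncated to 32 bits.
    h = seed
    for byte in data:
        h = (h * 33 + byte) & 0xFFFFFFFF
    return h

default_hash = djb_hash(b"a")
custom_hash = djb_hash(b"a", seed=0)
```

Callers that need a different table format pass their own seed, and everyone else keeps the default.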

llvm-svn: 323341


[dsymutil] Make NonRelocatableStringPool a wrapper around DwarfStringPoolEntry. NFC · e7d3d907

Jonas Devlieghere authored Jan 24, 2018

This is needed in order to use our StringPool entries in the Apple
accelerator tables.

As this is NFC we rely on the existing tests for correctness.

llvm-svn: 323339


[ValueTracking] add recursion depth param to matchSelectPattern · 1d91ec34

Sanjay Patel authored Jan 24, 2018

We're getting bug reports:
https://bugs.llvm.org/show_bug.cgi?id=35807
https://bugs.llvm.org/show_bug.cgi?id=35840
https://bugs.llvm.org/show_bug.cgi?id=36045
...where we blow up the stack in value tracking because other passes are sending 
in selects that have an operand that is itself the select.

We don't currently have a reliable way to avoid analyzing dead code that may take 
non-standard forms, so bail out when things go too far.

This mimics the recursion depth limitations in other parts of value tracking.
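The bail-out can be sketched in Python with a toy representation of selects. The depth limit of 6 and the data layout are assumptions of this sketch, not values taken from the patch.

```python
MAX_DEPTH = 6  # assumed limit for this sketch

def match_select_pattern(node, selects, depth=0):
    # `selects` maps a select's name to its (true, false) operand names.
    # Returns a leaf name, or None when the depth limit is hit.
    if depth >= MAX_DEPTH:
        return None  # bail out instead of recursing forever
    if node not in selects:
        return node
    true_operand, _ = selects[node]
    return match_select_pattern(true_operand, selects, depth + 1)

self_referential = {"s1": ("s1", "x")}   # a select whose operand is itself
result = match_select_pattern("s1", self_referential)
```

Without the depth check the self-referential case would recurse until the stack overflows.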

Unfortunately, this pushes the underlying problems for other passes (jump-threading,
simplifycfg, correlated-propagation) into hiding. If someone wants to uncover those
again, the first draft of this patch on Phab would do that (it would assert rather
than bail out).

Differential Revision: https://reviews.llvm.org/D42442

llvm-svn: 323331


X86 Tests: Add more sdiv combine cases. NFC · 22bfa7e5
Zvi Rackover authored Jan 24, 2018
```
Add cases with vector non-splat pow2 constant divisor.

llvm-svn: 323329
```