Commits · 435d650ef4391ac081b7e0c3eb9a4497e4e74a80 · Roger Ferrer / llvm-epi

Jan 19, 2018

[cmake] Fix typo in LLVM_UTILS_INSTALL_DIR definition. · 435d650e
Don Hinton authored Jan 19, 2018
```
Differential Revision: https://reviews.llvm.org/D41804

llvm-svn: 322959
```
435d650e
Test commit · 22c49c64
Carey Williams authored Jan 19, 2018
```
llvm-svn: 322958
```
22c49c64

[x86] shrink 'and' immediate values by setting the high bits (PR35907) · 74a1eef7

Sanjay Patel authored Jan 19, 2018

  
Try to reverse the constant-shrinking that happens in SimplifyDemandedBits()
for 'and' masks when it results in a smaller sign-extended immediate.

We are also able to detect dead 'and' ops here (the mask is all ones). In
that case, we replace and return without selecting the 'and'.

Other targets might want to share some of this logic by enabling this under a
target hook, but I didn't see diffs for simple cases with PowerPC or AArch64,
so they may already have some specialized logic for this kind of thing or have
different needs.

This should solve PR35907:
https://bugs.llvm.org/show_bug.cgi?id=35907

Differential Revision: https://reviews.llvm.org/D42088

llvm-svn: 322957

74a1eef7

[InstSimplify] use m_Specific and commutative matcher to reduce code; NFCI · 33cb8457
Sanjay Patel authored Jan 19, 2018
```
llvm-svn: 322955
```
33cb8457

[X86] Extend load-op-store fusion merge to ADC/SBB. · 72d32f24

Nirav Dave authored Jan 19, 2018

Summary: Add handling of EFLAG input to X86 Load-op-store fusion checking.

Reviewers: craig.topper, RKSimon

Subscribers: llvm-commits, hiraditya

Differential Revision: https://reviews.llvm.org/D42128

llvm-svn: 322952

72d32f24

[AArch64][SVE] Asm: Add support for RDVL/ADDVL/ADDPL instructions · 909cf956

Sander de Smalen authored Jan 19, 2018

Reviewers: fhahn, rengolin, t.p.northover, echristo, olista01, SjoerdMeijer

Reviewed By: SjoerdMeijer

Subscribers: SjoerdMeijer, aemerson, javed.absar, tschuett, kristof.beyls, llvm-commits

Differential Revision: https://reviews.llvm.org/D41900

llvm-svn: 322951

909cf956

[X86][AVX] Add more variable permute tests for source vectors smaller than destination · 586b31b8
Simon Pilgrim authored Jan 19, 2018
```
llvm-svn: 322948
```
586b31b8

[SLP] Fix vectorization for tree with trunc to minimum required bit width. · fa80c47c

Alexey Bataev authored Jan 19, 2018

Summary:
If the vectorized tree has truncate to minimum required bit width and
the vector type of the cast operation after the truncation is the same
as the vector type of the cast operands, count cost of the vector cast
operation as 0, because this cast will be later removed.
Also, if the vectorization tree root operations are integer cast operations, do not consider them as candidates for truncation. It will just create extra number of the same vector/scalar operations, which will be removed by instcombiner.

Reviewers: RKSimon, spatel, mkuper, hfinkel, mssimpso

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D41948

llvm-svn: 322946

fa80c47c

[Support] - Check nullptr after allocation with malloc in MallocAllocator -... · b065dabd

Klaus Kretzschmar authored Jan 19, 2018

[Support] - Check nullptr after allocation with malloc in MallocAllocator - Differential Revision: http://reviews.llvm.org/D34753

llvm-svn: 322944

b065dabd

[AMDGPU][MC] Corrected parsing of image modifiers and encoding of image atomics · 0e074e34

Dmitry Preobrazhensky authored Jan 19, 2018

See bugs
    35962: https://bugs.llvm.org/show_bug.cgi?id=35962
    35963: https://bugs.llvm.org/show_bug.cgi?id=35963

Differential Revision: https://reviews.llvm.org/D42184

Reviewers: vpykhtin, artem.tamazov, arsenm
llvm-svn: 322942

0e074e34

Fix line endings. NFCI. · 37d977bc
Simon Pilgrim authored Jan 19, 2018
```
llvm-svn: 322940
```
37d977bc
[X86] Add KNL target to slow PMULLD tests · 65a565bf
Simon Pilgrim authored Jan 19, 2018
```
llvm-svn: 322939
```
65a565bf
[X86] Add RDPID schedule test · 852abd1a
Simon Pilgrim authored Jan 19, 2018
```
llvm-svn: 322938
```
852abd1a
[X86] Regenerate RDPMC intrinsic test · 9b839ef3
Simon Pilgrim authored Jan 19, 2018
```
llvm-svn: 322937
```
9b839ef3
[CodeGen] Unify printing format of debug-location in both MIR and -debug · 548add99
Francis Visoiu Mistrih authored Jan 19, 2018
```
Use "debug-location" instead of "; dbg:" in MI::print.

llvm-svn: 322936
```
548add99
[NFC] fix trivial typos in comments · d24ddcd6
Hiroshi Inoue authored Jan 19, 2018
```
"the the" -> "the"

llvm-svn: 322934
```
d24ddcd6

[ValueLattice] Use getters instead of direct accesses (NFC). · 5045eaf9

Florian Hahn authored Jan 19, 2018

Reviewers: reames, davide, anna

Reviewed By: reames, davide

Differential Revision: https://reviews.llvm.org/D42270

llvm-svn: 322933

5045eaf9

[ModRefInfo] Return NoModRef for Must and NoModRef. · df26cf81

Alina Sbirlea authored Jan 19, 2018

Summary:
In ModRefInfo "Must" was introduced to track presence of MustAlias, but we still want to return NoModRef when there is neither Mod or Ref, even when MustAlias is found. Patch has small fixes to ensure this happens.
Minor cleanup to remove nesting for 2 if statements when calling getModRefInfo for 2 ImmutableCallSites.

Reviewers: sanjoy

Subscribers: jlebar, llvm-commits

Differential Revision: https://reviews.llvm.org/D42209

llvm-svn: 322932

df26cf81

[InstCombine] Make foldSelectOpOp able to handle two-operand getelementptr · 2867bd72

John Brawn authored Jan 19, 2018

Three (or more) operand getelementptrs could plausibly also be handled, but
handling only two-operand fits in easily with the existing BinaryOperator
handling.

Differential Revision: https://reviews.llvm.org/D39958

llvm-svn: 322930

2867bd72

Split MachineLICM into EarlyMachineLICM and MachineLICM; NFC · 4a7c8e7a

Matthias Braun authored Jan 19, 2018

This avoids playing games with pseudo pass IDs and avoids using an
unreliable MRI::isSSA() check to determine whether register allocation
has happened.

Note that this renames:
- MachineLICMID -> EarlyMachineLICM
- PostRAMachineLICMID -> MachineLICMID
to be consistent with the EarlyTailDuplicate/TailDuplicate naming.

llvm-svn: 322927

4a7c8e7a

Split TailDuplicatePass into pre- and post-RA variant; NFC · 3ab9fcb9

Matthias Braun authored Jan 19, 2018

Split TailDuplicatePass into EarlyTailDuplicate and TailDuplicate. This
avoids playing games with fake pass IDs and using MRI::isSSA() to
determine pre-/post-RA state.

llvm-svn: 322926

3ab9fcb9

Move tests to the correct place · 8bb5228d

Matthias Braun authored Jan 19, 2018

test/CodeGen/MIR is for testing the MIR parser/printer. Tests for passes
and targets belong to test/CodeGen/TARGETNAME.

llvm-svn: 322925

8bb5228d

[X86] Make better use of instregex for cmovcc/setcc/jcc instructions in the Intel scheduler models. · f4cd9083
Craig Topper authored Jan 19, 2018
```
Combine all the separate condition codes into a singular expression when possible.

llvm-svn: 322924
```
f4cd9083
Revert [CGP] Re-enable Select in complex addressing mode · 22bb1c0e
Serguei Katkov authored Jan 19, 2018
```
One of buildbots failed. Revert for now till fix the issue.

llvm-svn: 322923
```
22bb1c0e

AArch64: Fix emergency spillslot being out of reach for large callframes · 5c290dc2

Matthias Braun authored Jan 19, 2018

Re-commit of r322200: The testcase shouldn't hit machineverifiers
anymore with r322917 in place.

Large callframes (calls with several hundreds or thousands or
parameters) could lead to situations in which the emergency spillslot is
out of range to be addressed relative to the stack pointer.
This commit forces the use of a frame pointer in the presence of large
callframes.

This commit does several things:
- Compute max callframe size at the end of instruction selection.
- Add mirFileLoaded target callback. Use it to compute the max callframe size
  after loading a .mir file when the size wasn't specified in the file.
- Let TargetFrameLowering::hasFP() return true if there exists a
  callframe > 255 bytes.
- Always place the emergency spillslot close to FP if we have a frame
  pointer.
- Note that `useFPForScavengingIndex()` would previously return false
  when a base pointer was available leading to the emergency spillslot
  getting allocated late (that's the whole effect of this callback).
  Which made no sense to me so I took this case out: Even though the
  emergency spillslot is technically not referenced by FP in this case
  we still want it allocated early.

Differential Revision: https://reviews.llvm.org/D40876

llvm-svn: 322919

5c290dc2

AArch64: Omit callframe setup/destroy when not necessary · dc4b3e87

Matthias Braun authored Jan 19, 2018

Do not create CALLSEQ_START/CALLSEQ_END when there is no callframe to
setup and the callframe size is 0.

- Fixes an invalid callframe nesting for byval arguments, which would
  look like this before this patch (as in `big-byval.ll`):
    ...
    ADJCALLSTACKDOWN 32768, 0, ...   # Setup for extfunc
    ...
    ADJCALLSTACKDOWN 0, 0, ...  # setup for memcpy
    ...
    BL &memcpy ...
    ADJCALLSTACKUP 0, 0, ...    # destroy for memcpy
    ...
    BL &extfunc
    ADJCALLSTACKUP 32768, 0, ...   # destroy for extfunc

- Saves us two instructions in the common case of zero-sized stackframes.
- Remove an unnecessary scheduling barrier (hence the small unittest
  changes).

Differential Revision: https://reviews.llvm.org/D42006

llvm-svn: 322917

dc4b3e87

[WebAssembly] Add test expectations for gcc C++ tests (gcc/testsuite/g++.dg) · b6c5bc27
Sam Clegg authored Jan 19, 2018
```
Differential Revision: https://reviews.llvm.org/D42226

llvm-svn: 322915
```
b6c5bc27
[ORC] Revert r322913 while I investigate an ASan failure. · 44efd042
Lang Hames authored Jan 19, 2018
```
llvm-svn: 322914
```
44efd042

[ORC] Redesign the JITSymbolResolver interface to support bulk queries. · 817df9fa

Lang Hames authored Jan 19, 2018

Bulk queries reduce IPC/RPC overhead for cross-process JITing and expose
opportunities for parallel compilation.

The two new query methods are lookupFlags, which finds the flags for each of a
set of symbols; and lookup, which finds the address and flags for each of a
set of symbols. (See doxygen comments for more details.)

The existing JITSymbolResolver class is renamed LegacyJITSymbolResolver, and
modified to extend the new JITSymbolResolver class using the following scheme:

- lookupFlags is implemented by calling findSymbolInLogicalDylib for each of the
symbols, then returning the result of calling getFlags() on each of these
symbols. (Importantly: lookupFlags does NOT call getAddress on the returned
symbols, so lookupFlags will never trigger materialization, and lookupFlags will
never call findSymbol, so only symbols that are part of the logical dylib will
return results.)

- lookup is implemented by calling findSymbolInLogicalDylib for each symbol and
falling back to findSymbol if findSymbolInLogicalDylib returns a null result.
Assuming a symbol is found its getAddress method is called to materialize it and
the result (if getAddress succeeds) is stored in the result map, or the error
(if getAddress fails) is returned immediately from lookup. If any symbol is not
found then lookup returns immediately with an error.

This change will break any out-of-tree derivatives of JITSymbolResolver. This
can be fixed by updating those classes to derive from LegacyJITSymbolResolver
instead.

llvm-svn: 322913

817df9fa

[X86] Add intrinsic support for the RDPID instruction · 84b26b90

Craig Topper authored Jan 18, 2018

This adds a new instrinsic to support the rdpid instruction. The implementation is a bit weird because the intrinsic is defined as always returning 32-bits, but the assembler support thinks the instruction produces a 64-bit register in 64-bit mode. But really it zeros the upper 32 bits. So I had to add separate patterns where 64-bit mode uses an extract_subreg.

Differential Revision: https://reviews.llvm.org/D42205

llvm-svn: 322910

84b26b90

[InstSimplify] regenerate checks and add tests for commutes; NFC · a19b748f
Sanjay Patel authored Jan 18, 2018
```
llvm-svn: 322907
```
a19b748f

Jan 18, 2018

AMDGPU/SI: Fix typos in d16 support patch the buffer intrinsics. · ba6240cc
Changpeng Fang authored Jan 18, 2018
```
llvm-svn: 322906
```
ba6240cc

[CodeView] Add line numbers for inlined call sites · 7897a789

Reid Kleckner authored Jan 18, 2018

We did this for inline call site line tables, but we hadn't done it for
regular function line tables yet. This patch copies that logic from
encodeInlineLineTable.

llvm-svn: 322905

7897a789

[CodeView] Sink complex inline functions to .cpp file, NFC · b5258722
Reid Kleckner authored Jan 18, 2018
```
I'm cleaning up this code before I attempt to fix a line table bug.

llvm-svn: 322904
```
b5258722

AMDGPU/SI: Add d16 support for image intrinsics. · 4737e892

Changpeng Fang authored Jan 18, 2018

Summary:
  This patch implements d16 support for image load, image store and image sample intrinsics.

Reviewers:
  Matt, Brian.

Differential Revision:
  https://reviews.llvm.org/D3991

llvm-svn: 322903

4737e892

Typo fix SIBABRT -> SIGABRT. · 668e6b4b
Eric Christopher authored Jan 18, 2018
```
Based on a patch by Henry Wong!

llvm-svn: 322902
```
668e6b4b

[test] Actually check the common parts in CodeGen/ARM/global-merge-external.ll. NFC. · d96be854

Martin Storsjö authored Jan 18, 2018

Previously, these parts weren't ever checked. The label patterns
need to be extended to match successfully on macho.

Differential Revision: https://reviews.llvm.org/D42126

llvm-svn: 322900

d96be854

Support: Add missing #include. · 719c1f74

Peter Collingbourne authored Jan 18, 2018

This #include is necessary to provide the definitions of _fpclass
and _FPCLASS_NZ when building with libc++.

llvm-svn: 322885

719c1f74

[DWARFv5] Number the line-table's directory array correctly. · 8181d23b

Paul Robinson authored Jan 18, 2018

The compilation directory has always been #0, but as of DWARF v5 it is
explicitly listed in the line-table section instead of implicitly
being a reference to the compile_unit DIE's DW_AT_comp_dir attribute.
This means the dumper should number the dumped array starting with 0
or 1 depending on the DWARF version of the line table.

References in the generated DWARF are correct, it's just the dumper
that was wrong.  Also some assembler-coded tests were similarly
confused about directory numbers.

llvm-svn: 322884

8181d23b

we have now https support for apt.llvm.org. Updating the URL · 6dce59bc
Sylvestre Ledru authored Jan 18, 2018
```
llvm-svn: 322881
```
6dce59bc