Commits · 8faafa4fb1452b45f7b5bcf77f4e44a6846f6f70 · Lorenzo Albano / LLVM bpEVL

Oct 23, 2017

Add R_PPC_ADDR16_HI relocation support · 8faafa4f

Rui Ueyama authored Oct 22, 2017

The support of R_PPC_ADDR16_HI improves ld compatibility and makes
things on par with RuntimeDyldELF that already implements this
relocation.

Patch by vit9696.

llvm-svn: 316306

8faafa4f
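
As a rough illustration of what this relocation computes: in the PowerPC ELF ABI, R_PPC_ADDR16_HI stores the #hi part (bits 16..31) of S + A in a 16-bit field. The helper names below are hypothetical and are not taken from lld or RuntimeDyldELF; this is a minimal sketch, not the patch itself.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helpers (names are assumptions, not lld's API).
// #lo(x): low 16 bits of x.
constexpr uint16_t ppcLo(uint32_t v) { return v & 0xffff; }
// #hi(x): bits 16..31 of x, with no adjustment.
constexpr uint16_t ppcHi(uint32_t v) { return (v >> 16) & 0xffff; }
// #ha(x): like #hi, but adjusted for a sign-extended low half.
constexpr uint16_t ppcHa(uint32_t v) { return ((v + 0x8000) >> 16) & 0xffff; }

// Value written for R_PPC_ADDR16_HI given symbol value S and addend A.
constexpr uint16_t relocAddr16Hi(uint32_t s, int32_t a) {
  return ppcHi(s + static_cast<uint32_t>(a));
}
```

The #ha variant differs from #hi only when bit 15 of the value is set, which is why the two relocations need separate handling.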

Remove a fast lookup table from MergeInputSection. · d96724db

Rui Ueyama authored Oct 22, 2017

We used to have a map from section piece offsets to section pieces
as a cache for binary search. But I found that the map took quite a
large amount of memory and didn't make linking faster. So, in this
patch, I removed the map.

This patch saves 566 MiB of RAM (2.019 GiB -> 1.453 GiB) when linking
clang with debug info, and the link time is 4% faster in that test case.

Thanks to Sean Silva for pointing this out.

llvm-svn: 316305

d96724db
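
The binary search that remains once the cache map is gone can be sketched as follows. `Piece` and `findPiece` are hypothetical stand-ins (lld's real types carry more fields); the sketch assumes the pieces are sorted by input offset and that the first piece starts at offset 0.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical, simplified section piece: only the start offset matters here.
struct Piece {
  uint64_t inputOff;
};

// Returns the index of the piece containing `offset`.
// Assumes `pieces` is sorted by inputOff and pieces[0].inputOff <= offset.
inline std::size_t findPiece(const std::vector<Piece>& pieces, uint64_t offset) {
  assert(!pieces.empty() && pieces.front().inputOff <= offset);
  // First piece that starts strictly after `offset`; the one before it
  // is the piece that contains `offset`.
  auto it = std::upper_bound(
      pieces.begin(), pieces.end(), offset,
      [](uint64_t off, const Piece& p) { return off < p.inputOff; });
  return static_cast<std::size_t>(it - pieces.begin()) - 1;
}
```

Each lookup costs O(log n) comparisons and needs no memory beyond the piece vector itself.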

[c++2a] Update cxx_status with __VA_OPT__ marked as completed in SVN. · 39ff4010
Faisal Vali authored Oct 22, 2017
```
llvm-svn: 316304
```
39ff4010

Oct 22, 2017

ExecutionEngine: make COFF Thumb2 assertions non-tautological · 9e802eaf

Saleem Abdulrasool authored Oct 22, 2017

The overflow detection assertions were tautological due to truncation.
Adjust them to no longer be tautological.

Patch by Alex Langford!

llvm-svn: 316303

9e802eaf
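
To illustrate the kind of problem described (not the actual RuntimeDyld code), the sketch below contrasts a range check performed after truncation, which can never fail, with one performed on the wide value. All function names are hypothetical.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Tautological: the cast has already discarded the high bits, so the
// comparison is always true regardless of the input.
constexpr bool fitsTautological(uint64_t v) {
  return static_cast<uint32_t>(v) <= std::numeric_limits<uint32_t>::max();
}

// Meaningful: compare the wide value before truncating it.
constexpr bool fitsChecked(uint64_t v) {
  return v <= std::numeric_limits<uint32_t>::max();
}

// An assertion written against the untruncated value can actually fire.
inline uint32_t truncateChecked(uint64_t v) {
  assert(fitsChecked(v) && "value does not fit in 32 bits");
  return static_cast<uint32_t>(v);
}
```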

Fix invalid ptrtoint in InstCombine · 92c11ee3

Yichao Yu authored Oct 22, 2017

Summary:
It's unclear if this is the only thing we can do but at least this is consistent with the check
of address space agreement in `isBitCastable`.

The code is used at least in both instcombine and jumpthreading though
I could only find a way to trigger the invalid cast in instcombine.

Reviewers: loladiro, sanjoy, majnemer

Reviewed By: sanjoy

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D34335

llvm-svn: 316302

92c11ee3

Create fewer copies of StringMaps. No functionality change intended. · 24952ce5
Benjamin Kramer authored Oct 22, 2017
```
llvm-svn: 316301
```
24952ce5

Make HIDDEN_DIRECTIVE a function-like macro. NFCI. · c5115b9e

Martin Storsjö authored Oct 22, 2017

This avoids a hack for making it a no-op for windows.

Also explicitly check for _WIN32 instead of assuming it.

Differential Revision: https://reviews.llvm.org/D39156

llvm-svn: 316300

c5115b9e
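
A function-like macro can expand to nothing on one platform and to a directive on another, which is the general idea behind this change. The sketch below is an assumption about the shape of such a macro, not a copy of the patched header; the `.hidden name` expansion, the `STR` helpers, and `my_symbol` are illustrative only.

```cpp
#include <cassert>
#include <string>

// Two-level stringification so the argument is macro-expanded first.
#define STR_(x) #x
#define STR(x) STR_(x)

#if defined(_WIN32)
// Explicit check for Windows: the directive is a no-op there.
#define HIDDEN_DIRECTIVE(name)
#else
// Assumed ELF-style expansion, for illustration.
#define HIDDEN_DIRECTIVE(name) .hidden name
#endif

// Shows what the macro expands to for a hypothetical symbol.
inline std::string hiddenDirectiveText() {
  return STR(HIDDEN_DIRECTIVE(my_symbol));
}
```

Because the macro takes the symbol as an argument, call sites look the same on every platform and no separate workaround is needed to swallow the symbol name on Windows.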

[X86] Add missing override. NFC. · a7c822a2
Benjamin Kramer authored Oct 22, 2017
```
llvm-svn: 316299
```
a7c822a2

[SimplifyCFG] delay switch condition forwarding to -latesimplifycfg · b80daf0b

Sanjay Patel authored Oct 22, 2017

As discussed in D39011:
https://reviews.llvm.org/D39011
...replacing constants with a variable is inverting the transform done
by other IR passes, so we definitely don't want to do this early. 
In fact, it's questionable whether this transform belongs in SimplifyCFG 
at all. I'll look at moving this to codegen as a follow-up step.

llvm-svn: 316298

b80daf0b

[utils] Support -mtriple=powerpc64 · dc168722

Fangrui Song authored Oct 22, 2017

Summary: test/CodeGen/PowerPC/pr33093.ll uses both powerpc64 (big-endian) and powerpc64le while the former was unsupported.

Subscribers: nemanjai

Differential Revision: https://reviews.llvm.org/D39164

llvm-svn: 316297

dc168722

Strip trailing whitespace. NFCI. · ce55eab9
Simon Pilgrim authored Oct 22, 2017
```
llvm-svn: 316296
```
ce55eab9

Add logic to greedy reg alloc to avoid bad eviction chains · f9371d82

Marina Yatsina authored Oct 22, 2017

This fixes bugzilla 26810
https://bugs.llvm.org/show_bug.cgi?id=26810

This is intended to prevent sequences like:
movl %ebp, 8(%esp) # 4-byte Spill
movl %ecx, %ebp
movl %ebx, %ecx
movl %edi, %ebx
movl %edx, %edi
cltd
idivl %esi
movl %edi, %edx
movl %ebx, %edi
movl %ecx, %ebx
movl %ebp, %ecx
movl 16(%esp), %ebp # 4-byte Reload

Such sequences are created in 2 scenarios:

Scenario #1:
vreg0 is evicted from physreg0 by vreg1
Evictee vreg0 is intended for region splitting with split candidate physreg0 (the reg vreg0 was evicted from)
Region splitting creates a local interval because of interference with the evictor vreg1 (normally region splitting creates 2 intervals, the "by reg" and "by stack" intervals; a local interval is created when interference occurs).
one of the split intervals ends up evicting vreg2 from physreg1
Evictee vreg2 is intended for region splitting with split candidate physreg1
one of the split intervals ends up evicting vreg3 from physreg2 etc.. until someone spills

Scenario #2:
vreg0 is evicted from physreg0 by vreg1
vreg2 is evicted from physreg2 by vreg3 etc
Evictee vreg0 is intended for region splitting with split candidate physreg1
Region splitting creates a local interval because of interference with the evictor vreg1
one of the split intervals ends up evicting the original evictor vreg1 back from physreg0 (the reg vreg0 was evicted from)
Another evictee vreg2 is intended for region splitting with split candidate physreg1
one of the split intervals ends up evicting vreg3 from physreg2 etc.. until someone spills

As compile time was a concern, I've added a flag to control whether we do cost calculations for local intervals we expect to be created (it's on by default for the X86 target, off for the rest).

Differential Revision: https://reviews.llvm.org/D35816

Change-Id: Id9411ff7bbb845463d289ba2ae97737a1ee7cc39
llvm-svn: 316295

f9371d82

[X86] More correctly support LIG and WIG for EVEX instructions in the disassembler tables. · dac20263

Craig Topper authored Oct 22, 2017

This is similar to how we generate the VEX tables.

More fixes are still needed for the instructions that use EVEX.b (broadcast and embedded rounding).

llvm-svn: 316294

dac20263

[SimplifyCFG] try harder to forward switch condition to phi (PR34471) · 24226504

Sanjay Patel authored Oct 22, 2017

The missed canonicalization/optimization in the motivating test from PR34471 leads to very different codegen:

  int switcher(int x) {
    switch(x) {
    case 17: return 17;
    case 19: return 19;
    case 42: return 42;
    default: break;
    }
    return 0;
  }

  int comparator(int x) {
    if (x == 17) return 17;
    if (x == 19) return 19;
    if (x == 42) return 42;
    return 0;
  }

For the first example, we use a bit-test optimization to avoid a series of compare-and-branch:
https://godbolt.org/g/BivDsw

Differential Revision: https://reviews.llvm.org/D39011

llvm-svn: 316293

24226504
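
The bit-test form mentioned for the switch version can be written by hand roughly as below. This is a sketch of the idea, not the code the compiler emits; `switcherBitTest` and the mask constant are illustrative names. It also shows the forwarding itself: once x is known to be one of the case values, the function can return x instead of a per-case constant.

```cpp
#include <cassert>
#include <cstdint>

// Original form from the commit message.
inline int switcher(int x) {
  switch (x) {
  case 17: return 17;
  case 19: return 19;
  case 42: return 42;
  default: break;
  }
  return 0;
}

// Hand-written equivalent: one range check plus one bit test.
inline int switcherBitTest(int x) {
  constexpr uint64_t mask = (1ULL << 17) | (1ULL << 19) | (1ULL << 42);
  unsigned u = static_cast<unsigned>(x);  // negatives become large values
  if (u < 64 && ((mask >> u) & 1))
    return x;  // the switch condition is forwarded as the result
  return 0;
}
```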

[C++17] Fix PR34970 - tweak overload resolution for class template... · 81b756e6

Faisal Vali authored Oct 22, 2017

[C++17] Fix PR34970 - tweak overload resolution for class template deduction-guides in line with WG21's p0620r0.

In order to identify the copy deduction candidate, I considered two approaches:
  - attempt to determine whether an implicit guide is a copy deduction candidate by checking certain properties of its substituted parameter during overload-resolution.
  - using one of the many bits (WillHaveBody) from FunctionDecl (that CXXDeductionGuideDecl inherits from) that are otherwise irrelevant for deduction guides

After some brittle gymnastics with the first strategy, I settled on the second, although to avoid confusion and to give that bit a better name, I turned it into a member of an anonymous union.

Given this identification 'bit', the tweak to overload resolution was a simple reordering of the deduction guide checks (in SemaOverload.cpp::isBetterOverloadCandidate), in-line with Jason Merrill's p0620r0 drafting which made it into the working paper.  Concordant with that, I made sure the copy deduction candidate is always added.


References:
See https://bugs.llvm.org/show_bug.cgi?id=34970 
See http://wg21.link/p0620r0

llvm-svn: 316292

81b756e6
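
For readers unfamiliar with the copy deduction candidate, here is a small C++17 example of the behaviour it exists to provide. It is not the test case from PR34970; `Box` and `copyDeductionDemo` are hypothetical names, and the example needs a compiler in C++17 mode or later.

```cpp
#include <cassert>
#include <type_traits>

// A hypothetical wrapper with a single converting constructor.
template <class T>
struct Box {
  Box(T v) : value(v) {}
  T value;
};

inline int copyDeductionDemo() {
  Box a(42);  // deduced as Box<int> from the Box(T) constructor
  Box b(a);   // copies a: deduced as Box<int>, not Box<Box<int>>
  static_assert(std::is_same_v<decltype(b), Box<int>>,
                "copying is preferred over wrapping");
  return b.value;
}
```

Without a preference for copying, `Box b(a)` could also be read as wrapping `a` in a `Box<Box<int>>`; the ordering of the deduction-guide checks is what settles such cases.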

shared: Implement aligned vector stores (vstorea_half) · 7ab2d0bd

Jan Vesely authored Oct 22, 2017



Float version passes newly posted piglit tests on turks, float and double pass on carrizo.
v2: scalar vstorea_half
v3: fix typo

Reviewer: Aaron Watry
Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
llvm-svn: 316291

7ab2d0bd

shared: Implement aligned vector loads (vloada_half) · 12061c71

Jan Vesely authored Oct 22, 2017



Passes newly posted piglits on turks and carrizo
v2: add scalar vloada_half
v3: fix typo

Reviewer: Aaron Watry
Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
llvm-svn: 316290

12061c71

[ARM] Dynamic stack alignment for 16-bit Thumb · d6a4ab3d

Momchil Velikov authored Oct 22, 2017

This patch implements dynamic stack (re-)alignment for 16-bit Thumb. When
targeting processors that support only the 16-bit Thumb instruction set,
the compiler ignores the alignment attributes of automatic variables and may
silently generate incorrect code.

Differential revision: https://reviews.llvm.org/D38143

llvm-svn: 316289

d6a4ab3d
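
Realigning the stack amounts to rounding the stack pointer down to a multiple of the requested alignment. The sketch below shows that arithmetic in two equivalent forms; the assumption that a shift pair is the convenient form for 16-bit Thumb (rather than a mask with an immediate) is mine and is not stated in the commit message. Function names are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Round sp down to a multiple of (1 << log2Align) using a mask.
// Requires log2Align < 32.
constexpr uint32_t alignDownMask(uint32_t sp, unsigned log2Align) {
  return sp & ~((uint32_t{1} << log2Align) - 1);
}

// Same result using a right shift followed by a left shift, which
// clears the low log2Align bits without needing a mask constant.
constexpr uint32_t alignDownShift(uint32_t sp, unsigned log2Align) {
  return (sp >> log2Align) << log2Align;
}
```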

[X86] Add a pass to convert instruction chains between domains. · 92d5ce3b

Guy Blank authored Oct 22, 2017

The pass scans the function to find instruction chains that define
registers in the same domain (closures).
It then calculates the cost of converting the closure to another domain.
If found profitable, the instructions are converted to instructions in
the other domain and the register classes are changed accordingly.

This commit adds the pass infrastructure and a simple conversion from
the GPR domain to the Mask domain.

Differential Revision:
https://reviews.llvm.org/D37251

Change-Id: Ic2cf1d76598110401168326d411128ae2580a604
llvm-svn: 316288

92d5ce3b

[mips] Adds support for R_MIPS_26, HIGHER, HIGHEST relocations in RuntimeDyld. · 757f74c2

Nitesh Jain authored Oct 22, 2017

Reviewers: sdardis

Subscribers: jaydeep, bhushan, llvm-commits

Differential Revision: https://reviews.llvm.org/D38314

llvm-svn: 316287

757f74c2

[Compiler-rt][MIPS] Fix cross build for XRAY. · cf8a5c26

Nitesh Jain authored Oct 22, 2017

Reviewers: dberris, sdardis

Subscribers: jaydeep, bhushan, llvm-commits

Differential Revision: https://reviews.llvm.org/D38021

llvm-svn: 316286

cf8a5c26

[X86] Teach the disassembler that some instructions use VEX.W==0 without a... · e975127d

Craig Topper authored Oct 22, 2017

[X86] Teach the disassembler that some instructions use VEX.W==0 without a corresponding VEX.W==1 instruction and we shouldn't treat them as if VEX.W is ignored.

Fixes PR11304.

llvm-svn: 316285

e975127d

[X86] Add VEX_WIG to applicable AVX512 instructions. · a33846ac
Craig Topper authored Oct 22, 2017
```
This should be NFC. Will be used in future patches to fix disassembler bugs.

llvm-svn: 316284
```
a33846ac
[X86] Add VEX_WIG to VROUNDSSrr/VROUNDSSrm/VROUNDSDrr/VROUNDSDrm · 1bcb0d8a
Craig Topper authored Oct 22, 2017
```
llvm-svn: 316283
```
1bcb0d8a
[X86] Don't allow gather/scatter to disassemble if memory operand does not use a SIB byte. · 158bc647
Craig Topper authored Oct 22, 2017
```
Fixes PR34998.

llvm-svn: 316282
```
158bc647
Simplify. · 53a9aff9
Rui Ueyama authored Oct 22, 2017
```
llvm-svn: 316281
```
53a9aff9

Assume that mergeable input sections are smaller than 4 GiB. · 95bf5098

Rui Ueyama authored Oct 21, 2017

By assuming that mergeable input sections are smaller than 4 GiB,
lld's memory usage when linking clang with debug info drops from
2.788 GiB to 2.019 GiB (measured by valgrind, and that does not include
memory space for mmap'ed files). I think that's a reasonable assumption
given such a large RAM savings, so this patch.

According to valgrind, gold needs 3.54 GiB of RAM to do the same thing.

NB: This patch does not introduce a limitation on the size of
output sections. You can still create sections larger than 4 GiB.

llvm-svn: 316280

95bf5098
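
The saving comes from per-piece bookkeeping: if input offsets are known to fit in 32 bits, each record can be smaller. The two layouts below are hypothetical and only illustrate the effect; they are not lld's actual structures, and exact sizes depend on the ABI.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical record with a 64-bit input offset.
struct PieceWide {
  uint64_t inputOff;
  uint64_t outputOff;
  uint32_t hash;
  bool live;
};

// Hypothetical record assuming input sections are smaller than 4 GiB.
// The output offset stays 64-bit, so output sections are not limited.
struct PieceNarrow {
  uint32_t inputOff;
  uint32_t live : 1;
  uint32_t hash : 31;
  uint64_t outputOff;
};

static_assert(sizeof(PieceNarrow) < sizeof(PieceWide),
              "narrow layout should be smaller");

// The assumption the narrow layout relies on.
constexpr bool fitsInputOffset(uint64_t sectionSize) {
  return sectionSize <= UINT32_MAX;
}
```

With millions of pieces in a debug-info link, a few bytes per record add up to a large difference in total memory.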

Oct 21, 2017

Reverting r316278 due to failing build bots. · 5b4f81ec

Aaron Ballman authored Oct 21, 2017

http://lab.llvm.org:8011/builders/clang-ppc64be-linux/builds/11896
http://lab.llvm.org:8011/builders/clang-s390x-linux/builds/12380

llvm-svn: 316279

5b4f81ec

[libclang, bindings]: add spelling location · 5b3fa2cd

Masud Rahman authored Oct 21, 2017

 o) Add a 'Location' class that represents the four properties of a
    physical location

 o) Enhance 'SourceLocation' to provide 'expansion' and 'spelling'
    locations, maintaining backwards compatibility with existing code by
    forwarding the four properties to 'expansion'.

 o) Update the implementation to use 'clang_getExpansionLocation'
    instead of the deprecated 'clang_getInstantiationLocation', which
    has been present since 2011.

 o) Update the implementation of 'clang_getSpellingLocation' to actually
    obtain spelling location instead of file location.

llvm-svn: 316278

5b3fa2cd

Strip trailing whitespace. NFCI. · ab6dbe2b
Simon Pilgrim authored Oct 21, 2017
```
llvm-svn: 316277
```
ab6dbe2b

Reverting r316270 due to failing build bots. · fc02869c

Aaron Ballman authored Oct 21, 2017

http://lab.llvm.org:8011/builders/clang-x86_64-linux-selfhost-modules-2/builds/12899
http://lab.llvm.org:8011/builders/clang-x86-windows-msvc2015/builds/7951

llvm-svn: 316276

fc02869c

Fix a typo with -fno-double-square-bracket-attributes and add a test to... · 61736556

Aaron Ballman authored Oct 21, 2017

Fix a typo with -fno-double-square-bracket-attributes and add a test to demonstrate that it works as expected in C++11 mode. Additionally corrected the handling of -fdouble-square-bracket-attributes to be properly passed down to the cc1 option.

llvm-svn: 316275

61736556

[X86][SSE] Add extractps/pextrd equivalence to domain tables · 3cb02449
Simon Pilgrim authored Oct 21, 2017
```
Differential Revision: https://reviews.llvm.org/D39135

llvm-svn: 316274
```
3cb02449

[X86] Fix disassembling of EVEX instructions to stop accidentally decoding the... · ca2382d8

Craig Topper authored Oct 21, 2017

[X86] Fix disassembling of EVEX instructions to stop accidentally decoding the SIB index register as an XMM/YMM/ZMM register.

This introduces a new operand type to encode whether the index register should be XMM/YMM/ZMM, and adds new code to fix up the results created by readSIB.

This has the nice effect of removing a bunch of code that hard coded the name of every GATHER and SCATTER instruction to map the index type.

This fixes PR32807.

llvm-svn: 316273

ca2382d8

Fix MSVC 'result of 32-bit shift implicitly converted to 64 bits' warning. NFCI. · cb028c73
Simon Pilgrim authored Oct 21, 2017
```
llvm-svn: 316271
```
cb028c73

[PPC CodeGen] Fix the bitreverse.i64 intrinsic. · c7b749bd

Fangrui Song authored Oct 21, 2017

Summary: The two 32-bit words were swapped.

Subscribers: nemanjai, kbarton

Differential Revision: https://reviews.llvm.org/D38705

llvm-svn: 316270

c7b749bd
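
When a 64-bit bit reversal is built from two 32-bit reversals, the halves must also trade places: the reversed low word becomes the high word and vice versa. The sketch below shows the correct composition in portable C++; it is an illustration of the arithmetic, not the PowerPC lowering from the patch, and the function names are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Reverse the bits of a 32-bit value by swapping progressively larger groups.
inline uint32_t bitreverse32(uint32_t v) {
  v = ((v >> 1) & 0x55555555u) | ((v & 0x55555555u) << 1);
  v = ((v >> 2) & 0x33333333u) | ((v & 0x33333333u) << 2);
  v = ((v >> 4) & 0x0f0f0f0fu) | ((v & 0x0f0f0f0fu) << 4);
  v = ((v >> 8) & 0x00ff00ffu) | ((v & 0x00ff00ffu) << 8);
  return (v >> 16) | (v << 16);
}

// Reverse 64 bits: reverse each half, then place the reversed LOW half
// in the high word and the reversed HIGH half in the low word.
inline uint64_t bitreverse64(uint64_t v) {
  uint32_t lo = static_cast<uint32_t>(v);
  uint32_t hi = static_cast<uint32_t>(v >> 32);
  return (static_cast<uint64_t>(bitreverse32(lo)) << 32) | bitreverse32(hi);
}
```

Leaving the halves in their original positions yields a value whose two 32-bit words are swapped relative to the correct result, which matches the symptom the summary describes.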

Add release notes for the recent -fdouble-square-bracket-attributes and... · 2b3bc4ce

Aaron Ballman authored Oct 21, 2017

Add release notes for the recent -fdouble-square-bracket-attributes and -fno-double-square-bracket-attributes compiler flags.

llvm-svn: 316269

2b3bc4ce

[Sema] Fixes for enum handling for tautological comparison diagnostics · ca1aaacc

Roman Lebedev authored Oct 21, 2017

Summary:
As Mattias Eriksson has reported in PR35009, in C, for enums, the underlying type should
be used when checking for the tautological comparison, unlike C++, where the enumerator
values define the value range. So if not in CPlusPlus mode, use the enum underlying type.

Also, I have discovered a problem (a crash) when evaluating tautological-ness of the following comparison:
```
enum A { A_a = 0 };
if (a < 0) // expected-warning {{comparison of unsigned enum expression < 0 is always false}}
  return 0;
```
This affects both C and C++, but after the first fix, only C++ code was affected.
That was also fixed, while preserving (I think?) the proper diagnostic output.

And while there, attempt to enhance the test coverage.
Yes, some tests got moved around, sorry about that :)

Fixes PR35009

Reviewers: aaron.ballman, rsmith, rjmccall

Reviewed By: aaron.ballman

Subscribers: Rakete1111, efriedma, materi, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D39122

llvm-svn: 316268

ca1aaacc

Fixing broken attribute documentation for __attribute__((noescape)); a code... · f45629d2

Aaron Ballman authored Oct 21, 2017

Fixing broken attribute documentation for __attribute__((noescape)); a code block was missing and the existing code block was missing a mandatory newline.

llvm-svn: 316267

f45629d2

[ValueTracking] Remove unnecessary temporary APInt from computeNumSignBitsVectorConstant. · 8e8b6efd
Craig Topper authored Oct 21, 2017
```
We can just use getNumSignBits instead of inverting negative numbers.

llvm-svn: 316266
```
8e8b6efd
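
The equivalence this relies on can be checked on plain 32-bit integers: counting the leading bits that equal the sign bit gives the same answer as inverting negative values and counting leading zeros. The functions below are a hypothetical sketch using `int32_t`, not APInt.

```cpp
#include <cassert>
#include <cstdint>

// Direct form: count how many top bits are copies of the sign bit.
inline unsigned numSignBits(int32_t v) {
  uint32_t u = static_cast<uint32_t>(v);
  uint32_t sign = u >> 31;
  unsigned n = 1;  // the sign bit itself
  for (int bit = 30; bit >= 0 && ((u >> bit) & 1) == sign; --bit)
    ++n;
  return n;
}

// Older style: invert negative values, then count leading zeros.
inline unsigned numSignBitsViaInvert(int32_t v) {
  uint32_t u = static_cast<uint32_t>(v < 0 ? ~v : v);
  unsigned zeros = 0;
  for (int bit = 31; bit >= 0 && ((u >> bit) & 1) == 0; --bit)
    ++zeros;
  return zeros;
}
```

The direct form avoids materialising the inverted value, which for an arbitrary-precision type means avoiding a temporary.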