Commits · 7174023f57309bc29ed3334c33f56bda33f9f4b2 · Lorenzo Albano / LLVM bpEVL

Feb 03, 2018

[InstCombine] Allow common type conversions to i8/i16/i32 · 7174023f

David Green authored Feb 03, 2018

This, in instcombine, allows conversions to i8/i16/i32 (very
common cases) even if the resulting type is not legal according
to the data layout. This can often open up extra combine
opportunities.

Differential Revision: https://reviews.llvm.org/D42424

llvm-svn: 324174

7174023f

[RISCV] Update two RISCV codegen tests after rL323991 · 7c11527b

Alex Bradbury authored Feb 03, 2018

From the discussion in D41835 it looks possible the change will be backed out, 
but for now let's fix the RISCV tests.

llvm-svn: 324172

7c11527b

Feb 02, 2018

[InstCombine] make sure tests are providing coverage for the stated pattern; NFC · a767ee5a

Sanjay Patel authored Feb 02, 2018

Without extra instructions and uses, swapMayExposeCSEOpportunities() would change
the icmp (as seen in the check lines), so we were not actually testing patterns 
that should be handled by D41480.

llvm-svn: 324143

a767ee5a

[X86] Add avx512 command line to ptest.ll to demonstrate that 512-bit vectors... · e7e147f5

Craig Topper authored Feb 02, 2018

[X86] Add avx512 command line to ptest.ll to demonstrate that 512-bit vectors are not handled by LowerVectorAllZeroTest.

llvm-svn: 324130

e7e147f5

Partially revert r324124 [X86] Add tests for missed opportunities to use ptest... · bd2f6e95

Craig Topper authored Feb 02, 2018

Partially revert r324124 [X86] Add tests for missed opportunities to use ptest for all ones comparison.

Turns out I misunderstood the flag behavior of PTEST because I read the documentation for KORTEST which is different than PTEST/KTEST and made a bad assumption.

Keep the test rename though cause that's useful.

llvm-svn: 324129

bd2f6e95

[X86] Add tests for missed opportunities to use ptest for all ones comparison. · 9c936f88
Craig Topper authored Feb 02, 2018
```
Also rename the test from pr12312.ll to ptest.ll so its more recognizable.

llvm-svn: 324124
```
9c936f88
[InstCombine] add baseline tests for unsigned saturated sub (D41480); NFC · 5b8cb26b
Sanjay Patel authored Feb 02, 2018
```
llvm-svn: 324109
```
5b8cb26b

[X86] Remove checks for FeatureAVX512 from the X86 assembly parser. Remove... · e538fc74

Craig Topper authored Feb 02, 2018

[X86] Remove checks for FeatureAVX512 from the X86 assembly parser. Remove mcpu/mattr from assembly test command lines.

Summary:
We should always be able to accept AVX512 registers and instructions in llvm-mc. The only subtarget mode that should be checked is 16-bit vs 32-bit vs 64-bit mode.

I've also removed all the mattr/mcpu lines from test RUN lines to be consistent with this. Most were due to AVX512, but a few were for other features.

Fixes PR36202

Reviewers: RKSimon, echristo, bkramer

Reviewed By: echristo

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D42824

llvm-svn: 324106

e538fc74

[AMDGPU] Switch to the new addr space mapping by default · 2a22c5de

Yaxun Liu authored Feb 02, 2018

This requires corresponding clang change.

Differential Revision: https://reviews.llvm.org/D40955

llvm-svn: 324101

2a22c5de

Add llc tests for comparison chains. · a43e9653
Clement Courbet authored Feb 02, 2018
```
See https://reviews.llvm.org/D42793#996098 for context.

llvm-svn: 324099
```
a43e9653
[X86][SSE] Force double domain for SHUFPD stack folding tests · 1cb9bc6b
Simon Pilgrim authored Feb 02, 2018
```
llvm-svn: 324094
```
1cb9bc6b

[Analysis] Support aggregate access types in TBAA · ab68bbe5

Ivan A. Kosarev authored Feb 02, 2018

This patch implements analysis for new-format TBAA access tags
with aggregate types as their final access types.

Differential Revision: https://reviews.llvm.org/D41501

llvm-svn: 324092

ab68bbe5

Add missing new files from r324077 · c2dfd502
James Henderson authored Feb 02, 2018
```
Differential Revision: https://reviews.llvm.org/D42481

llvm-svn: 324078
```
c2dfd502

[ThinLTO] - Fix for "ThinLTO inlines variables that should be discarded". · 76c5fae2

George Rimar authored Feb 02, 2018

This fixes PR36187.

Patch teaches ThinLTO to drop non-prevailing variables, 
just like we recently did for functions (in r323633).

Differential revision: https://reviews.llvm.org/D42798

llvm-svn: 324075

76c5fae2

[ARM] fixed some tabs/whitespaces in test. NFC. · 986d64ad
Sjoerd Meijer authored Feb 02, 2018
```
llvm-svn: 324074
```
986d64ad

[GlobalOpt] Include padding in debug fragments · b69e5b73

Mikael Holmen authored Feb 02, 2018

Summary:
When creating the debug fragments for a SRA'd variable, use the types'
allocation sizes. This fixes issues where the pass would emit too small
fragments, placed at the wrong offset, for padded types.

An example of this is long double on x86. The type is represented using
x86_fp80, which is 10 bytes, but the value is aligned to 12/16 bytes.
The padding is included in the type's DW_AT_byte_size attribute;
therefore, the fragments should also include that. Newer GCC releases
(I tested 7.2.0) emit 12/16-byte pieces for long double. Earlier
releases, e.g. GCC 5.5.0, behaved as LLVM did, i.e. by emitting a
10-byte piece, followed by an empty 2/6-byte piece for the padding.

Failing to cover all `DW_AT_byte_size' bytes of a value with non-empty
pieces results in the value being printed as <optimized out> by GDB.

Patch by: David Stenberg

Reviewers: aprantl, JDevlieghere

Reviewed By: aprantl, JDevlieghere

Subscribers: llvm-commits

Tags: #debug-info

Differential Revision: https://reviews.llvm.org/D42807

llvm-svn: 324066

b69e5b73

[SelectionDAG] Consider endianness in scalarizeVectorStore(). · 422dfbf7

Jonas Paulsson authored Feb 02, 2018

When handling vectors with non byte-sized elements, reverse the order of the
elements in the built integer if the target is Big-Endian.

SystemZ tests updated.

Review: Eli Friedman, Ulrich Weigand.
https://reviews.llvm.org/D42786

llvm-svn: 324063

422dfbf7

[SystemZ] Update test case (NFC) · 0e50b6ed

Jonas Paulsson authored Feb 02, 2018

test/CodeGen/SystemZ/vec-trunc-to-i1.ll was marked as a temporary
FAIL when it was previously updated when it needed one more COPY.
This was however wrong, since the loop body had been reduced
significantly, and it was actually an improvement.

Review: Ulrich Weigand.
llvm-svn: 324060

0e50b6ed

[RISCV] Add ELFObjectFileBase::getRISCVFeatures let llvm-objdump could get RISCV target feature · 53489ada

Shiva Chen authored Feb 02, 2018

llvm-objdump could get C feature by ELF::EF_RISCV_RVC e_flag,
so then we don't have to add -mattr=+c on the command line.

Differential Revision: https://reviews.llvm.org/D42629

llvm-svn: 324058

53489ada

[X86] Legalize (v64i1 (bitcast (i64 X))) on 32-bit targets by extracting... · 76c5ce51

Craig Topper authored Feb 02, 2018

[X86] Legalize (v64i1 (bitcast (i64 X))) on 32-bit targets by extracting 32-bit halves from i32, bitcasting each to v32i1, and concatenating.

This prevents the scalarization that would otherwise occur.

llvm-svn: 324057

76c5ce51

[X86] Legalize (i64 (bitcast (v64i1 X))) on 32-bit targets by extracting to... · 5570e03b

Craig Topper authored Feb 02, 2018

[X86] Legalize (i64 (bitcast (v64i1 X))) on 32-bit targets by extracting to v32i1 and bitcasting to i32.

This saves a trip through memory and seems to open up other combining opportunities.

llvm-svn: 324056

5570e03b

[RISCV] Fix c.addi and c.addi16sp immediate constraints which should be non-zero · b22c1d29
Shiva Chen authored Feb 02, 2018
```
Differential Revision: https://reviews.llvm.org/D42782

llvm-svn: 324055
```
b22c1d29

[RISCV] Define getSetCCResultType for setting vector setCC type · bbf4c5c2

Shiva Chen authored Feb 02, 2018

To avoid trigger "No default SetCC type for vectors!" Assertion

Differential Revision: https://reviews.llvm.org/D42675

llvm-svn: 324054

bbf4c5c2

[AArch64][GlobalISel] Fix old use of % sigil in test. · 572f6cec
Amara Emerson authored Feb 02, 2018
```
My rebase had missed the new $ sigil we're using.

llvm-svn: 324051
```
572f6cec

[GlobalISel] Constrain the dest reg of IMPLICT_DEF. · 58aea52b

Amara Emerson authored Feb 02, 2018

This fixes a crash where the user is a COPY, which deliberately does not
constrain its source operands, resulting in a vreg without a reg class escaping
selection.

Differential Revision: https://reviews.llvm.org/D42697

llvm-svn: 324047

58aea52b

SplitKit: Fix liveness recomputation in some remat cases. · ca0abaeb

Matthias Braun authored Feb 02, 2018

Example situation:
```
BB0:
  %0 = ...
  use %0
  ; ...
  condjump BB1
  jmp BB2

BB1:
  %0 = ...   ; rematerialized def from above (from earlier split step)
  jmp BB2

BB2:
  ; ...
  use %0
```

%0 will have a live interval with 3 value numbers (for the BB0, BB1 and
BB2 parts). Now SplitKit tries and succeeds in rematerializing the value
number in BB2 (This only works because it is a secondary split so
SplitKit is can trace this back to a single original def).

We need to recompute all live ranges affected by a value number that we
rematerialize. The case that we missed before is that when the value
that is rematerialized is at a join (Phi VNI) then we also have to
recompute liveness for the predecessor VNIs.

rdar://35699130

Differential Revision: https://reviews.llvm.org/D42667

llvm-svn: 324039

ca0abaeb

[cfi-verify] Add blame context printing, and improved print format. · b2c3ea76

Vlad Tsyrklevich authored Feb 01, 2018

Summary:
This update now allows users to specify `--blame-context` and `--blame-context-all` to print source file blame information for the source of the blame.

Also updates the inline printing to correctly identify the top of the inlining stack for blame information.

Patch by Mitch Phillips!

Reviewers: vlad.tsyrklevich

Subscribers: llvm-commits, kcc, pcc

Differential Revision: https://reviews.llvm.org/D40111

llvm-svn: 324035

b2c3ea76

Feb 01, 2018

Fix check-prefixes typo and line endings. · d1379c6d
Simon Pilgrim authored Feb 01, 2018
```
llvm-svn: 324024
```
d1379c6d
[X86][SSE] Add SSE41 to variable permute tests · 808a0e15
Simon Pilgrim authored Feb 01, 2018
```
llvm-svn: 324017
```
808a0e15
[X86][XOP] Add XOP to variable permute tests · 26bf8006
Simon Pilgrim authored Feb 01, 2018
```
llvm-svn: 324015
```
26bf8006

[InstCombine] allow multi-use values in canEvaluate* if all uses are in 1 inst · 3343fcef

Sanjay Patel authored Feb 01, 2018

This is the enhancement suggested in D42536 to fix a shortcoming in 
regular InstCombine's canEvaluate* functionality.
When we have multiple uses of a value, but they're all in one instruction, we can 
allow that expression to be narrowed or widened for the same cost as a single-use 
value.

AFAICT, this can only matter for multiply: sub/and/or/xor/select would be simplified 
away if the operands are the same value; add becomes shl; shifts with a variable shift 
amount aren't handled.

Differential Revision: https://reviews.llvm.org/D42739

llvm-svn: 324014

3343fcef

[PowerPC] Tell VSX swap removal that scalar conversions are lane-sensitive · 77e34f15

Nemanja Ivanovic authored Feb 01, 2018

This is a rather non-controversial change. We were missing these instructions
from the list of instructions that are lane-sensitive. These two put the result
into lane 0 (BE) or 3 (LE) regardless of the input. This patch fixes PR36068.

llvm-svn: 324005

77e34f15

[DAGCombiner] When folding (insert_subvector undef, (bitcast... · a5944aad

Craig Topper authored Feb 01, 2018

[DAGCombiner] When folding (insert_subvector undef, (bitcast (extract_subvector N1, Idx)), Idx) -> (bitcast N1) make sure that N1 has the same total size as the original output

We were only checking the element count, but not the total width. This could cause illegal bitcasts to be created if for example the output was 512-bits, but N1 is 256 bits, and the extraction size was 128-bits.

Fixes PR36199

Differential Revision: https://reviews.llvm.org/D42809

llvm-svn: 324002

a5944aad

[GlobalISel] Fix assert failure when legalizing non-power-2 loads. · cbc02c71

Amara Emerson authored Feb 01, 2018

Until we support extending loads properly we're going to fall back for these.
We already handle stores in the same way, so this is just being consistent.

llvm-svn: 324001

cbc02c71

[CodeView] Class record member counts should include base classes and ... · 4536c1f5

Brock Wyma authored Feb 01, 2018

Increment the field list member count for base classes and virtual base
classes.

Differential Revision: https://reviews.llvm.org/D41874

llvm-svn: 324000

4536c1f5

[MachineCopyPropagation] Extend pass to do COPY source forwarding · 94503c7b

Geoff Berry authored Feb 01, 2018

Summary:
This change extends MachineCopyPropagation to do COPY source forwarding
and adds an additional run of the pass to the default pass pipeline just
after register allocation.

This version of this patch uses the newly added
MachineOperand::isRenamable bit to avoid forwarding registers is such a
way as to violate constraints that aren't captured in the
Machine IR (e.g. ABI or ISA constraints).

This change is a continuation of the work started in D30751.

Reviewers: qcolombet, javed.absar, MatzeB, jonpa, tstellar

Subscribers: tpr, mgorny, mcrosier, nhaehnle, nemanjai, jyknight, hfinkel, arsenm, inouehrs, eraman, sdardis, guyblank, fedor.sergeev, aheejin, dschuff, jfb, myatsina, llvm-commits

Differential Revision: https://reviews.llvm.org/D41835

llvm-svn: 323991

94503c7b

AMDGPU/SI: Adjust the encoding family for D16 buffer instructions when the... · 29fcf883

Changpeng Fang authored Feb 01, 2018

AMDGPU/SI: Adjust the encoding family for D16 buffer instructions when the target has UnpackedD16VMem feature.

Reviewers:
  Matt and Brian

Differential Revision:
  https://reviews.llvm.org/D42548

llvm-svn: 323988

29fcf883

[X86][SSE] LowerBUILD_VECTORAsVariablePermute - add support for scaling index vectors · 1a8cefc3

Simon Pilgrim authored Feb 01, 2018

This allows us to use PSHUFB for v8i16/v4i32 and VPERMD/PERMPS for v4i64/v4f64 variable shuffles.

Differential Revision: https://reviews.llvm.org/D42487

llvm-svn: 323987

1a8cefc3

[AArch64] add tests with sqrt estimate and ieee denorms; NFC · 702c19cc
Sanjay Patel authored Feb 01, 2018
```
As noted in D42323, we're not checking for denorms as we should.

llvm-svn: 323985
```
702c19cc
[AArch64] auto-generate complete checks; NFC · f42381fd
Sanjay Patel authored Feb 01, 2018
```
llvm-svn: 323984
```
f42381fd