Commits · b8f084289e57bb3ee0268d1fbca44a80beb34de5 · Roger Ferrer / llvm-epi

Sep 19, 2017

[DAGCombiner] fold assertzexts separated by trunc · f31b1a00

Sanjay Patel authored Sep 18, 2017

If we have an AssertZext of a truncated value that has already been AssertZext'ed, 
we can assert on the wider source op to improve the zext-y knowledge:
 assert (trunc (assert X, i8) to iN), i1 --> trunc (assert X, i1) to iN

This moves a fold from being Mips-specific to general combining, and x86 shows
improvements.
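
The reasoning behind the fold can be checked with a small bit-level model. This is only a sketch, not LLVM code: `fits_zext` and `check_fold` are hypothetical helpers, and the model assumes the truncated type iN is at least as wide as the wider asserted type (i8 in the example above).

```python
def fits_zext(value: int, bits: int) -> bool:
    """True if every bit at position `bits` and above is zero,
    which is what an AssertZext from `bits` bits promises."""
    return value >> bits == 0


def check_fold(src_bits: int = 12, wide_assert: int = 8, narrow_assert: int = 1) -> bool:
    """Exhaustively check the implication behind the fold:
    if X is zero-extended from `wide_assert` bits and trunc(X) to iN is
    zero-extended from `narrow_assert` bits, then X itself is
    zero-extended from `narrow_assert` bits."""
    for n in range(wide_assert, src_bits):      # candidate truncated widths iN
        for x in range(1 << src_bits):
            truncated = x & ((1 << n) - 1)
            if fits_zext(x, wide_assert) and fits_zext(truncated, narrow_assert):
                if not fits_zext(x, narrow_assert):
                    return False
    return True
```

Because the truncation only drops bits that the wider assert already declared zero, the narrower assert can be moved onto X without losing information.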

Differential Revision: https://reviews.llvm.org/D37017

llvm-svn: 313577

f31b1a00

Sep 18, 2017

[InstCombine] auto-generate complete checks; NFC · 709a804a

Sanjay Patel authored Sep 18, 2017

The code responsible for these transforms has the potential to add 2 
instructions and break min/max patterns (PR33301).

llvm-svn: 313575

709a804a

llvm-dwarfdump: add a --show-parents options when selectively dumping DIEs. · c2bc7170
Adrian Prantl authored Sep 18, 2017
```
llvm-svn: 313567
```
c2bc7170
Fix typo in testcase. · f077b5b8
Adrian Prantl authored Sep 18, 2017
```
llvm-svn: 313566
```
f077b5b8
AMDGPU: Start selecting s_xnor_{b32, b64} · ca8946a3
Konstantin Zhuravlyov authored Sep 18, 2017
```
Differential Revision: https://reviews.llvm.org/D37981

llvm-svn: 313565
```
ca8946a3

[DAG, x86] allow store merging before and after legalization (PR34217) · 7765c93b

Sanjay Patel authored Sep 18, 2017

rL310710 allowed store merging to occur after legalization to catch stores that are created late,
but this exposes a logic hole seen in PR34217:
https://bugs.llvm.org/show_bug.cgi?id=34217

We will miss merging stores if the target lowers vector extracts into target-specific operations.
This patch allows store merging to occur both before and after legalization if the target chooses
to get maximum merging.

I don't think the potential regressions in the other tests are relevant. The tests are for
correctness of weird IR constructs rather than perf tests, and I think those are still correct.

Differential Revision: https://reviews.llvm.org/D37987

llvm-svn: 313564

7765c93b

[X86] Make sure we still emit zext for GR32 to GR64 when the source of the zext is AssertZext · 39cdb845

Craig Topper authored Sep 18, 2017

The AssertZext we might see in this case is only giving information about the lower 32 bits. It isn't providing information about the upper 32 bits. So we should emit a zext.

This fixes PR28540.
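
A minimal sketch of why the assertion is not enough, assuming a 64-bit register whose upper half holds stale bits. The helper names and the example register value are made up for illustration; this is not the X86 backend code.

```python
MASK32 = (1 << 32) - 1


def assert_zext_holds(reg64: int, from_bits: int) -> bool:
    """AssertZext on the i32 value: bits from_bits..31 are zero.
    Bits 32..63 of the containing register are not covered."""
    low32 = reg64 & MASK32
    return low32 >> from_bits == 0


def zext32_to_64(reg64: int) -> int:
    """Explicit zero extension: clear the upper 32 bits."""
    return reg64 & MASK32


# Low 32 bits satisfy an AssertZext from i8, but the upper half is stale.
reg = 0xDEADBEEF_0000007F
```

Here `assert_zext_holds(reg, 8)` is true, yet reading `reg` as an i64 does not give `0x7F`; only `zext32_to_64(reg)` does.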

Differential Revision: https://reviews.llvm.org/D37729

llvm-svn: 313563

39cdb845

[SLP] Add a test for PR34635, NFC. · 286fe620
Alexey Bataev authored Sep 18, 2017
```
llvm-svn: 313559
```
286fe620
[x86] add tests for PR34217; NFC · 74d12b56
Sanjay Patel authored Sep 18, 2017
```
llvm-svn: 313548
```
74d12b56

[X86][AVX] Improve (i8 bitcast (v8i1 x)) handling for 256-bit vector compare results. · 4aa28b97

Simon Pilgrim authored Sep 18, 2017

As commented on D37849, AVX1 targets were missing a chance to use vmovmskps for v8f32/v8i32 results for bool vector bitcasts.

llvm-svn: 313547

4aa28b97

[x86] regenerate checks; NFC · 078d5d97
Sanjay Patel authored Sep 18, 2017
```
llvm-svn: 313545
```
078d5d97

[LoopVectorizer] Add more testcases for PR33804. · 7476f629

Manoj Gupta authored Sep 18, 2017

Summary:
Add test cases where a float <-> pointer type conversion is triggered
in the presence of load instructions.

Reviewers: Ayal, srhines, mkuper, rengolin

Reviewed By: rengolin

Subscribers: javed.absar, llvm-commits

Differential Revision: https://reviews.llvm.org/D37967

llvm-svn: 313544

7476f629

[SelectionDAG] Add BITCAST handling to ComputeNumSignBits for splatted sign bits. · 0b21ef1f

Simon Pilgrim authored Sep 18, 2017

When we are BITCASTing to a vector of smaller elements, if the entire source was a splatted sign (src's NumSignBits == SrcBitWidth), we can say that the dst's NumSignBits == DstBitWidth, as we're just splitting those sign bits across multiple elements.

We could generalize this but at the moment the only use case I have is to peek through bitcasts to vector comparison results.
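
The splat condition can be illustrated with a small integer model. This is a sketch under the assumption that one wide element is split into equal-width smaller elements; `num_sign_bits` and `bitcast_split` are hypothetical helpers, not the SelectionDAG API.

```python
def num_sign_bits(value: int, width: int) -> int:
    """Number of leading bits equal to the sign bit of a `width`-bit value."""
    value &= (1 << width) - 1
    sign = value >> (width - 1)
    count = 0
    for bit in range(width - 1, -1, -1):
        if (value >> bit) & 1 != sign:
            break
        count += 1
    return count


def bitcast_split(value: int, src_width: int, dst_width: int) -> list:
    """Split one src_width-bit element into smaller elements, low part first."""
    mask = (1 << dst_width) - 1
    return [(value >> shift) & mask for shift in range(0, src_width, dst_width)]
```

For an all-sign-bits source such as `0xFFFFFFFF`, every 8-bit piece also has 8 sign bits. A source like `0xFFFFFF80` has 25 sign bits, but its low byte `0x80` has only one, which is why the rule is limited to the fully splatted case.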

Differential Revision: https://reviews.llvm.org/D37849

llvm-svn: 313543

0b21ef1f

[X86] Fix two more places to prefer VPERMQ/PD over VPERM2X128 when AVX2 is enabled · 77d7f331

Craig Topper authored Sep 18, 2017

The shuffle combining and lowerVectorShuffleAsLanePermuteAndBlend were both still trying to use VPERM2X128 for unary shuffles when AVX2 is enabled. VPERM2X128 takes two inputs, meaning that when we use it for a unary shuffle, one of those inputs is left undefined, creating a false dependency on whatever register gets allocated there.

If we have VPERMQ/PD we should prefer those since they only have a single input.
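
As a rough illustration, the two instructions can be modeled on vectors of four 64-bit elements. This is a simplified sketch: the helper names are hypothetical, and the zeroing bits of the VPERM2X128 immediate (bits 3 and 7) are ignored.

```python
def vperm2x128(src1, src2, imm8):
    """Model of VPERM2F128/VPERM2I128: each result half is one of the
    four 128-bit halves of the two sources."""
    halves = [src1[0:2], src1[2:4], src2[0:2], src2[2:4]]
    return halves[imm8 & 0x3] + halves[(imm8 >> 4) & 0x3]


def vpermq(src, imm8):
    """Model of VPERMQ/VPERMPD: each 2-bit field of imm8 picks one
    64-bit element of the single source."""
    return [src[(imm8 >> (2 * i)) & 0x3] for i in range(4)]


x = [10, 11, 12, 13]
swapped_two_input = vperm2x128(x, [0, 0, 0, 0], 0x01)  # second input unused
swapped_one_input = vpermq(x, 0x4E)
```

Both calls swap the 128-bit halves of `x`, but only the first form names a second register that it never reads.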

Differential Revision: https://reviews.llvm.org/D37947

llvm-svn: 313542

77d7f331

[AArch64] Add V8_2aOps feature to Cortex-A55 and 75 · 3fa0ccff

Sam Parker authored Sep 18, 2017

Add the missing hardware features to the ProcA55 and ProcA75 features.
These are already enabled via the target parser, but I had missed
them in the backend.

Differential Revision: https://reviews.llvm.org/D37974

llvm-svn: 313535

3fa0ccff

[ARM] Implement isTruncateFree · 71efbe4c

Sam Parker authored Sep 18, 2017

Implement the isTruncateFree hooks, lifted from AArch64, that are
used by TargetTransformInfo. This allows simplifycfg to reduce the
test case into a single basic block.

Differential Revision: https://reviews.llvm.org/D37516

llvm-svn: 313533

71efbe4c

[X86][SSE] Improve support for vselect(Cond, 0, X) -> ANDN(Cond, X) · 00161c99
Simon Pilgrim authored Sep 18, 2017
```
As discussed on PR28925 and D37849.

Differential Revision: https://reviews.llvm.org/D37975

llvm-svn: 313532
```
00161c99
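
A small model of the identity in the title, assuming each condition element is a full mask (all ones for true, zero for false), as vector compare results are. The helpers are illustrative only and are not the X86 lowering code.

```python
def vselect(cond, if_true, if_false):
    """Per-element select; each cond element is all-ones (true) or zero (false)."""
    return [t if c else f for c, t, f in zip(cond, if_true, if_false)]


def andn(a, b, width=32):
    """Per-element (~a) & b, the operation that x86 ANDN/PANDN computes."""
    mask = (1 << width) - 1
    return [(~x & mask) & y for x, y in zip(a, b)]


cond = [0xFFFFFFFF, 0, 0xFFFFFFFF, 0]
x = [1, 2, 3, 4]
```

With these inputs, `vselect(cond, [0] * 4, x)` and `andn(cond, x)` both give `[0, 2, 0, 4]`.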

[ARM] Fix for indexed dot product instruction descriptions · 4e6df159

Sjoerd Meijer authored Sep 18, 2017

The indexed dot product instructions only accept the lower 16 D-registers as
the indexed register, but we were e.g. incorrectly accepting:

vudot.u8 d16,d16,d18[0]

Differential Revision: https://reviews.llvm.org/D37968

llvm-svn: 313531

4e6df159

[dwarfdump] Make .eh_frame an alias for .debug_frame · c0a758d8

Jonas Devlieghere authored Sep 18, 2017

This patch makes the `.eh_frame` extension an alias for `.debug_frame`.
Up till now it was only possible to dump the section using objdump, but
not with dwarfdump. Since the two are essentially interchangeable, we
dump whichever of the two is present.

As a workaround, this patch also adds parsing for 3 currently
unimplemented CFA instructions: `DW_CFA_def_cfa_expression`,
`DW_CFA_expression`, and `DW_CFA_val_expression`. Because I lack the
required knowledge, I just parse the fields without actually creating
the instructions.

Finally, this also fixes the typo in the `.debug_frame` section name
which incorrectly contained a trailing `s`.

Differential revision: https://reviews.llvm.org/D37852

llvm-svn: 313530

c0a758d8

[X86][SSE] Add vselect with zero tests (PR28925) · 360629d1
Simon Pilgrim authored Sep 18, 2017
```
llvm-svn: 313529
```
360629d1

[X86FixupBWInsts] More precise register liveness if no <imp-use> on MOVs. · 84af99b3

Nikolai Bozhenov authored Sep 18, 2017

Summary:
Subregister liveness tracking is not implemented for X86 backend, so
sometimes the whole super register is said to be live, when only a
subregister is really live. That might happen if the def and the use
are located in different MBBs, see the added fixup-bw-inst.mir test.

However, using knowledge of the specific instructions handled by the
bw-fixup-pass we can get more precise liveness information which this
change does.

Reviewers: MatzeB, DavidKreitzer, ab, andrew.w.kaylor, craig.topper

Reviewed By: craig.topper

Subscribers: n.bozhenov, myatsina, llvm-commits, hiraditya

Patch by Andrei Elovikov <andrei.elovikov@intel.com>

Differential Revision: https://reviews.llvm.org/D37559

llvm-svn: 313524

84af99b3

[X86][Codegen] adding masked gathers tests for avx2 · 77cb080c

Mohammed Agabaria authored Sep 18, 2017

Related to patch https://reviews.llvm.org/D35772.
Adds LLVM gather tests before gather codegen support.

Differential Revision: https://reviews.llvm.org/D37800

llvm-svn: 313516

77cb080c

[XRay][tools] Support tail-call exits before we write them in the runtime · 0f84a7d3

Dean Michael Berris authored Sep 18, 2017

Summary:
This change adds support for explicit tail-exit records to be written by
the XRay runtime. This lets us differentiate the tail exit
records/events in the log, and allows us to treat those exit events
specially in the future. For now we allow printing those out in YAML
(and reading them in).

Reviewers: kpw, pelikan

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D37964

llvm-svn: 313514

0f84a7d3

[X86] Teach the execution domain fixing tables to use movlhps in place of... · a6054328

Craig Topper authored Sep 18, 2017

[X86] Teach the execution domain fixing tables to use movlhps in place of unpcklpd for the packed single domain.

MOVLHPS has a smaller encoding than UNPCKLPD in the legacy encodings. With VEX and EVEX encodings it doesn't matter.
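
The substitution is possible because the two instructions move the same bits. A lane-level sketch, with hypothetical helper names, viewing MOVLHPS as four 32-bit lanes and UNPCKLPD as two 64-bit lanes:

```python
def movlhps(dst, src):
    """Four 32-bit lanes: result is [dst0, dst1, src0, src1]."""
    return [dst[0], dst[1], src[0], src[1]]


def unpcklpd(dst, src):
    """Two 64-bit lanes: result is [dst0, src0]."""
    return [dst[0], src[0]]


def as_u64_lanes(lanes32):
    """Reinterpret four 32-bit lanes as two 64-bit lanes (little-endian)."""
    return [lanes32[0] | (lanes32[1] << 32), lanes32[2] | (lanes32[3] << 32)]


# Legacy (non-VEX) register-register encodings, without a REX prefix:
MOVLHPS_BYTES = 3   # 0F 16 /r
UNPCKLPD_BYTES = 4  # 66 0F 14 /r
```

For any inputs `a` and `b`, `as_u64_lanes(movlhps(a, b))` equals `unpcklpd(as_u64_lanes(a), as_u64_lanes(b))`, so the shorter encoding can be used when the packed single domain is acceptable.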

llvm-svn: 313509

a6054328

[X86] Teach execution domain fixing to convert between FP and int unpack instructions. · 87f7381e
Craig Topper authored Sep 18, 2017
```
llvm-svn: 313508
```
87f7381e
[X86] Teach execution domain fixing to convert between VPERMILPS and VPSHUFD. · d4341920
Craig Topper authored Sep 18, 2017
```
llvm-svn: 313507
```
d4341920
[X86] Teach shuffle lowering to use MOVLHPS/MOVHLPS for lowering v4f32 unary... · ee6646d7
Craig Topper authored Sep 17, 2017
```
[X86] Teach shuffle lowering to use MOVLHPS/MOVHLPS for lowering v4f32 unary shuffles with SSE1 only.

llvm-svn: 313504
```
ee6646d7
[X86] Add a couple more unary shuffles to the sse1 shuffle test. · 6c221690
Craig Topper authored Sep 17, 2017
```
These can be implemented with movlhps and movhlps.

llvm-svn: 313503
```
6c221690

Sep 17, 2017

Adding test cases for PR34629 & PR34634. · 356e3e2c
Jatin Bhateja authored Sep 17, 2017
```
Differential Revision: https://reviews.llvm.org/D37962

llvm-svn: 313490
```
356e3e2c

[RISCV] Add support for disassembly · 8ab4a969

Alex Bradbury authored Sep 17, 2017

Disassembly support allows for 'round-trip' testing, and rv32i-valid.s
has been updated appropriately.

Differential Revision: https://reviews.llvm.org/D23567

llvm-svn: 313486

8ab4a969

[RISCV] Add support for all RV32I instructions · 6758ecb9

Alex Bradbury authored Sep 17, 2017

This patch supports all RV32I instructions as described in the RISC-V manual.
A future patch will add support for pseudoinstructions and other instruction
expansions (e.g. 0-arg fence -> fence iorw, iorw).
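
As an illustration of the fence expansion mentioned above, the sketch below encodes FENCE and treats omitted operands as `iorw`. It is not the LLVM implementation; `fence_set` and `encode_fence` are hypothetical helpers, and the fm, rs1 and rd fields are assumed to be zero.

```python
# Bit values of the I/O/R/W flags within a 4-bit predecessor or successor set.
FENCE_BITS = {"i": 8, "o": 4, "r": 2, "w": 1}


def fence_set(spec: str) -> int:
    """Turn a string such as 'rw' into its 4-bit set encoding."""
    value = 0
    for ch in spec:
        value |= FENCE_BITS[ch]
    return value


def encode_fence(pred: str = "iorw", succ: str = "iorw") -> int:
    """Encode FENCE with fm=0, rs1=x0, rd=x0.
    Omitted operands default to iorw, mirroring the 0-arg form."""
    opcode = 0b0001111  # MISC-MEM
    return (fence_set(pred) << 24) | (fence_set(succ) << 20) | opcode
```

Under this model, `encode_fence()` and `encode_fence("iorw", "iorw")` produce the same word, `0x0FF0000F`.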

Differential Revision: https://reviews.llvm.org/D23566

llvm-svn: 313485

6758ecb9

[GlobalISel][X86] Legalize i1 G_ADD/G_SUB/G_MUL/G_XOR/G_OR/G_AND instructions. · f1d388a5
Igor Breger authored Sep 17, 2017
```
llvm-svn: 313483
```
f1d388a5
[GlobalISel][X86] Use correct physical register in MIR tests. NFC. · 0f382ccb
Igor Breger authored Sep 17, 2017
```
llvm-svn: 313479
```
0f382ccb

[GlobalISel][X86] G_FCONSTANT support. · 21200ed7

Igor Breger authored Sep 17, 2017

Summary: G_FCONSTANT support, port the implementation from X86FastIsel.

Reviewers: zvi, delena, guyblank

Reviewed By: delena

Subscribers: rovka, llvm-commits, kristof.beyls

Differential Revision: https://reviews.llvm.org/D37734

llvm-svn: 313478

21200ed7

Sep 16, 2017

[llvm-symbolizer] Fix coff-dwarf.test · 6326d561

Zachary Turner authored Sep 16, 2017

This was a bug in the test that was only exposed as a result of
refactoring some code in lit configuration files.  Previously,
llvm's lit configuration would only set the target-windows feature
if the system was also windows.  Since cross-compilation is
a thing, this isn't correct.  target-windows should be set
independently of system-windows.

Adding to that bug, this particular test then checked for
target-windows when it really meant "can I call a certain API on
the host machine", which is what system-windows is for.

Ultimately, this test only works if *both* the target and host
are Windows, so I've updated the test to reflect that.
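
The distinction between the two features can be sketched as follows. This is a hypothetical model, not the actual lit configuration: the function names and the triple matching are assumptions made for illustration, and only the feature names come from the message above.

```python
def available_features(host_os: str, target_triple: str) -> set:
    """Hypothetical feature computation: host and target are independent."""
    features = set()
    if host_os == "Windows":
        features.add("system-windows")
    if "windows" in target_triple or "win32" in target_triple:
        features.add("target-windows")
    return features


def can_run_coff_dwarf(features: set) -> bool:
    """The test needs a Windows host API and a Windows target."""
    return {"system-windows", "target-windows"} <= features
```

A Linux host cross-compiling for `x86_64-pc-windows-msvc` gets `target-windows` but not `system-windows`, so the test is skipped there.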

llvm-svn: 313468

6326d561

Resubmit "Add a shared llvm.lit module that all test suites can use." · c3023d1b

Zachary Turner authored Sep 16, 2017

There were some issues surrounding Py2 / Py3 compatibility, but
I've now tested with both Py2 and Py3 and everything seems to
work.

llvm-svn: 313467

c3023d1b

llvm-dwarfdump: support a --show-children option · 597aa48d

Adrian Prantl authored Sep 16, 2017

This will print all children of a DIE when selectively printing only
one DIE at a given offset.

llvm-svn: 313464

597aa48d

llvm-dwarfdump: Add support for -debug-types=<offset>. · 099d7e45
Adrian Prantl authored Sep 16, 2017
```
llvm-svn: 313463
```
099d7e45

[llvm-readobj] - Teach tool to report error if some section is in multiple COMDAT groups at once. · 762abff6

George Rimar authored Sep 16, 2017

The readelf tool reports an error when the output contains the same section
in multiple COMDAT groups, which can be useful.
This patch teaches llvm-readobj to do the same.
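
The check itself amounts to finding section indices that are members of more than one group. A minimal sketch, assuming groups are given as a mapping from group name to member section indices; this layout is an illustration and not llvm-readobj's data structures.

```python
from collections import defaultdict


def sections_in_multiple_groups(groups: dict) -> dict:
    """Map each section index that appears in more than one group
    to the names of the groups that contain it."""
    owners = defaultdict(list)
    for name, sections in groups.items():
        for index in sections:
            owners[index].append(name)
    return {index: names for index, names in owners.items() if len(names) > 1}
```

For example, `sections_in_multiple_groups({"g1": [3, 4], "g2": [4, 5]})` reports section 4 as belonging to both `g1` and `g2`.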

Differential revision: https://reviews.llvm.org/D37567

llvm-svn: 313459

762abff6

[x86] enable storeOfVectorConstantIsCheap() target hook · 65d67807

Sanjay Patel authored Sep 16, 2017

This allows vector-sized store merging of constants in DAGCombiner using the existing code in MergeConsecutiveStores().
All of the twisted logic that decides exactly what vector operations are legal and fast for each particular CPU is
handled separately in there using the appropriate hooks.

For the motivating tests in merge-store-constants.ll, we already produce the same vector code in IR via the SLP vectorizer.
So this is just providing a backend backstop for code that doesn't go through that pass (-O1). More details in PR24449:
https://bugs.llvm.org/show_bug.cgi?id=24449 (this change should be the last step to resolve that bug)
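
The memory effect of the merge can be modeled in a few lines. This sketch only shows that one vector-sized store of four constants writes the same bytes as four adjacent scalar stores; the helper names are hypothetical and it does not model MergeConsecutiveStores() or any legality checks.

```python
import struct


def store_scalars(memory: bytearray, offset: int, values) -> None:
    """Four separate 32-bit little-endian stores to consecutive addresses."""
    for i, value in enumerate(values):
        struct.pack_into("<I", memory, offset + 4 * i, value)


def store_vector(memory: bytearray, offset: int, values) -> None:
    """One 128-bit store of the same four constants."""
    struct.pack_into("<4I", memory, offset, *values)


constants = [1, 2, 3, 4]
scalar_mem = bytearray(16)
vector_mem = bytearray(16)
store_scalars(scalar_mem, 0, constants)
store_vector(vector_mem, 0, constants)
```

After both calls, `scalar_mem` and `vector_mem` hold identical bytes.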

Differential Revision: https://reviews.llvm.org/D37451

llvm-svn: 313458

65d67807