Commits · 674421e4de11b0aeba37fa1435465aff86e9dce9 · Roger Ferrer / llvm-epi

Jun 22, 2017

[Testing/Support] Remove the const_cast in TakeExpected · 674421e4

Pavel Labath authored Jun 22, 2017

Summary:
The const_cast in the "const" version of TakeExpected was quite
dangerous, as the function does indeed modify the apparently const
argument.

I assume the reason the const overload was added was to make the
function bind to xvalues(temporaries). That can be also achieved with
rvalue references, so I use that instead.

Using the ASSERT macros on const Expected objects will now become
illegal, but I believe that is correct, as it is not actually possible
to inspect the error stored in an Expected object without modifying it.

Reviewers: zturner, chandlerc

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D34405

llvm-svn: 306001

674421e4

Revert [mips] Adds support for R_MIPS_26, HIGHER, HIGHEST relocations in RuntimeDyld · 15126308
Sagar Thakur authored Jun 22, 2017
```
Reverting due to build bot failures

llvm-svn: 306000
```
15126308

[AMDGPU] SDWA: remove support for VOP2 instructions that have only 64-bit encoding · ca5a30ed

Sam Kolton authored Jun 22, 2017

Summary:
Despite that this instructions are listed in VOP2, they are treated as VOP3 in specs. They should not support SDWA.
There are no real instructions for them, but there are pseudo instructions.

Reviewers: arsenm, vpykhtin, cfang

Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye

Differential Revision: https://reviews.llvm.org/D34403

llvm-svn: 305999

ca5a30ed

Don't conditionalize Neon instructions, even in IT blocks. · 9665249f

Kristof Beyls authored Jun 22, 2017

This has been deprecated since ARMARM v7-AR, release C.b, published back
in 2012.

This also removes test/CodeGen/Thumb2/ifcvt-neon.ll that originally was
introduced to check that conditionalization of Neon instructions did
happen when generating Thumb2. However, the test had evolved and was no
longer testing that. Rather than trying to adapt that test, this commit
introduces test/CodeGen/Thumb2/ifcvt-neon-deprecated.mir, since we can
now use the MIR framework to write nicer/more maintainable tests.

llvm-svn: 305998

9665249f

[mips] Adds support for R_MIPS_26, HIGHER, HIGHEST relocations in RuntimeDyld · f8858d09

Sagar Thakur authored Jun 22, 2017

After the N64 static relocation model support was added to llvm it is required to add its support in RuntimeDyld also because lldb uses ExecutionEngine for evaluating expressions.

Reviewed by sdardis
Differential: D31649

llvm-svn: 305997

f8858d09

[index] Add the "SpecializationOf" relation to the forward declarations · b6e03aa9

Alex Lorenz authored Jun 22, 2017

of class template specializations

This commit fixes an issue where a forward declaration of a class template
specialization was not related to the base template. We need to relate even
forward declarations because specializations don't have to be defined.

rdar://32869409

Differential Revision: https://reviews.llvm.org/D34462

llvm-svn: 305996

b6e03aa9

[mips] Implement the ".rdata" MIPS assembly directive. · 1c73fcc1

Simon Dardis authored Jun 22, 2017

Rather than creating a separate ".rdata" section distinct from the
customary ".rodata" in ELF, ".rdata" switches to the ".rodata" section.

This patch relands r305949 and r305950 with the correct commit message
and addresses nit raised during review.

Patch By: John Baldwin!

Differential Revision: https://reviews.llvm.org/D34452

llvm-svn: 305995

1c73fcc1

Test commit · 2d5ab693
Ekaterina Vaartis authored Jun 22, 2017
```
llvm-svn: 305994
```
2d5ab693

[ARM] Add .w aliases of MOV with shifted operand · ed78aaf0

John Brawn authored Jun 22, 2017

These appear to have been simply missing.

Differential Revision: https://reviews.llvm.org/D34461

llvm-svn: 305993

ed78aaf0

[ARM] Clean up choice of narrow instructions in ARMAsmParser, NFC · 192f74a8

John Brawn authored Jun 22, 2017

This patch makes a couple of changes to how we decide whether to use the narrow
or wide encoding of thumb2 instructions:
 * Common out the detection of the .w qualifier
 * Check for the CPSR operand in a consistent way

Differential Revision: https://reviews.llvm.org/D34460

llvm-svn: 305992

192f74a8

[analyzer] Do not continue to analyze a path if the constraints contradict with builtin assume · b3bcddf7
Gabor Horvath authored Jun 22, 2017
```
Differential Revision: https://reviews.llvm.org/D34502

llvm-svn: 305991
```
b3bcddf7
Revert "Enable vectorizer-maximize-bandwidth by default." · b512e915
Diana Picus authored Jun 22, 2017
```
This reverts commit r305960 because it broke self-hosting on AArch64.

llvm-svn: 305990
```
b512e915

[GlobalISel][X86] Support vector type G_INSERT legalization/selection. · 1c29be7e

Igor Breger authored Jun 22, 2017

Summary:
Support vector type G_INSERT legalization/selection.
Split from https://reviews.llvm.org/D33665

Reviewers: qcolombet, t.p.northover, zvi, guyblank

Reviewed By: guyblank

Subscribers: guyblank, rovka, llvm-commits, kristof.beyls

Differential Revision: https://reviews.llvm.org/D33956

llvm-svn: 305989

1c29be7e

[ARM] Add macro fusion for AES instructions. · b489e56a

Florian Hahn authored Jun 22, 2017

Summary:
This patch adds a macro fusion using CodeGen/MacroFusion.cpp to pair AES
instructions back to back and adds FeatureFuseAES to enable the feature.

Reviewers: evandro, javed.absar, rengolin, t.p.northover

Reviewed By: javed.absar

Subscribers: aemerson, mgorny, kristof.beyls, llvm-commits

Differential Revision: https://reviews.llvm.org/D34142

llvm-svn: 305988

b489e56a

AVX-512: Lowering Masked Gather intrinsic - fixed a bug · 2dac0b4d

Elena Demikhovsky authored Jun 22, 2017

Masked gather for vector length 2 is lowered incorrectly for element type i32.
The type <2 x i32> was automatically extended to <2 x i64> and we generated VPGATHERQQ instead of VPGATHERQD.
The type <2 x float> is extended to <4 x float>, so there is no bug for this type, but the sequence may be more optimal.

In this patch I'm fixing <2 x i32>bug and optimizing <2 x float> sequence for GATHERs only. The same fix should be done for Scatters as well.

Differential revision: https://reviews.llvm.org/D34343

llvm-svn: 305987

2dac0b4d

[AMDGPU] SDWA: add support for GFX9 in peephole pass · 3c4933fc

Sam Kolton authored Jun 22, 2017

Summary:
Added support based on merged SDWA pseudo instructions. Now peephole allow one scalar operand, omod and clamp modifiers.
Added several subtarget features for GFX9 SDWA.
This diff also contains changes from D34026.
Depends D34026

Reviewers: vpykhtin, rampitec, arsenm

Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye

Differential Revision: https://reviews.llvm.org/D34241

llvm-svn: 305986

3c4933fc

[InstCombine] Add test cases to demonstrate that and->xnor and or->xnor... · 71e2c161

Craig Topper authored Jun 22, 2017

[InstCombine] Add test cases to demonstrate that and->xnor and or->xnor folding can create more instructions than it removed when there are multiple uses. NFC

llvm-svn: 305985

71e2c161

[PowerPC] fix potential verification errors · 1d5693c9

Hiroshi Inoue authored Jun 22, 2017

This patch fixes trivial mishandling of 32-bit/64-bit instructions that may cause verification errors with -verify-machineinstrs.

llvm-svn: 305984

1d5693c9

[ELF] Add an apostrophe after a file name when reporting discarded sections. · 393563a0
Igor Kudrin authored Jun 22, 2017
```
Differential Revision: https://reviews.llvm.org/D34442

llvm-svn: 305983
```
393563a0

[llvm-readobj] Dump the COFF image load config · b7d716c0

Reid Kleckner authored Jun 22, 2017

This includes the safe SEH tables and the control flow guard function
table. LLD will emit the guard table soon, and I need a tool that dumps
them for testing.

llvm-svn: 305979

b7d716c0

[wasm] Fix WebAssembly asm backend after r305968 · ef581757
Reid Kleckner authored Jun 22, 2017
```
llvm-svn: 305978
```
ef581757

Add some catch(...) blocks to the tests so that if they fail, we get a good... · f74609b1

Marshall Clow authored Jun 22, 2017

Add some catch(...) blocks to the tests so that if they fail, we get a good error message. No functional change.

llvm-svn: 305977

f74609b1

Also test thumb. · f9df4290
Rafael Espindola authored Jun 22, 2017
```
llvm-svn: 305976
```
f9df4290
Revert "[Target] Implement the ".rdata" MIPS assembly directive." · 7a6c5c12
Davide Italiano authored Jun 22, 2017
```
This reverts commit r305949 and r305950 as they didn't have the
correct commit message.

llvm-svn: 305973
```
7a6c5c12

[Sanitizers] 32 bit allocator respects allocator_may_return_null flag · f3cc7cc3

Alex Shlyapnikov authored Jun 22, 2017

Summary:
Make SizeClassAllocator32 return nullptr when it encounters OOM, which
allows the entire sanitizer's allocator to follow allocator_may_return_null=1
policy, even for small allocations (LargeMmapAllocator is already fixed
by D34243).

Will add a test for OOM in primary allocator later, when
SizeClassAllocator64 can gracefully handle OOM too.

Reviewers: eugenis

Subscribers: kubamracek, llvm-commits

Differential Revision: https://reviews.llvm.org/D34433

llvm-svn: 305972

f3cc7cc3

[WebAssembly] Cleanup WasmObjectWriter.cpp. NFC · fe6414b0

Sam Clegg authored Jun 21, 2017

- Use auto where appropriate
- Use early return to reduce nesting
- Remove stray comment line
- Use C++ foreach over explicit iterator

Differential Revision: https://reviews.llvm.org/D34477

llvm-svn: 305971

fe6414b0

[AMDGPU] Add FP_CLASS to the add/setcc combine · 3ed38c60

Stanislav Mekhanoshin authored Jun 21, 2017

This is one of the nodes which also compile as v_cmp_*.

Differential Revision: https://reviews.llvm.org/D34485

llvm-svn: 305970

3ed38c60

[ProfileData, Support] Fix some Clang-tidy modernize-use-using and Include... · 72208a82

Eugene Zelenko authored Jun 21, 2017

[ProfileData, Support] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC).

llvm-svn: 305969

72208a82

Use a MutableArrayRef. NFC. · 88d9e37e
Rafael Espindola authored Jun 21, 2017
```
llvm-svn: 305968
```
88d9e37e
Fix build. · 6da25f4f
Rafael Espindola authored Jun 21, 2017
```
llvm-svn: 305967
```
6da25f4f

[codeview] respect signedness of APSInts when printing to YAML · 4d2711fb

Bob Haarman authored Jun 21, 2017

Summary:
This fixes a bug where we always treat APSInts in Codeview as
signed when writing them to YAML. One symptom of this problem is that
llvm-pdbdump raw would show Enumerator Values that differ between the
original PDB and a PDB that has been round-tripped through YAML.

Reviewers: zturner

Reviewed By: zturner

Subscribers: llvm-commits, fhahn

Differential Revision: https://reviews.llvm.org/D34013

llvm-svn: 305965

4d2711fb

[AMDGPU] Combine add and adde, sub and sube · a8b26936

Stanislav Mekhanoshin authored Jun 21, 2017

If one of the arguments of adde/sube is zero we can fold another
add/sub into it.

Differential Revision: https://reviews.llvm.org/D34374

llvm-svn: 305964

a8b26936

Mark dump() methods as const. NFC · 705f798b

Sam Clegg authored Jun 21, 2017

Add const qualifier to any dump() method where adding one
was trivial.

Differential Revision: https://reviews.llvm.org/D34481

llvm-svn: 305963

705f798b

[AMDGPU] simplify add x, *ext (setcc) => addc|subb x, 0, setcc · e3eb42ce

Stanislav Mekhanoshin authored Jun 21, 2017

This simplification allows to avoid generating v_cndmask_b32
to serialize condition code between compare and use.

Differential Revision: https://reviews.llvm.org/D34300

llvm-svn: 305962

e3eb42ce

TableGen.cmake: Use DEPFILE for Ninja Generator with CMake>=3.7. · 1b587358

NAKAMURA Takumi authored Jun 21, 2017

CMake emits build targets as relative paths (from build.ninja) but Ninja doesn't identify absolute path (in *.d) as relative path (in build.ninja).
So, let file names, in the command line, relative from ${CMAKE_BINARY_DIR}, where build.ninja is.

Note that tblgen is executed on ${CMAKE_BINARY_DIR} as working directory.

Differential Revision: https://reviews.llvm.org/D33707

llvm-svn: 305961

1b587358

Enable vectorizer-maximize-bandwidth by default. · 014db29b

Dehao Chen authored Jun 21, 2017

Summary:
vectorizer-maximize-bandwidth is generally useful in terms of performance. I've tested the impact of changing this to default on speccpu benchmarks on sandybridge machines. The result shows non-negative impact:

spec/2006/fp/C++/444.namd 26.84 -0.31%
spec/2006/fp/C++/447.dealII 46.19 +0.89%
spec/2006/fp/C++/450.soplex 42.92 -0.44%
spec/2006/fp/C++/453.povray 38.57 -2.25%
spec/2006/fp/C/433.milc 24.54 -0.76%
spec/2006/fp/C/470.lbm 41.08 +0.26%
spec/2006/fp/C/482.sphinx3 47.58 -0.99%
spec/2006/int/C++/471.omnetpp 22.06 +1.87%
spec/2006/int/C++/473.astar 22.65 -0.12%
spec/2006/int/C++/483.xalancbmk 33.69 +4.97%
spec/2006/int/C/400.perlbench 33.43 +1.70%
spec/2006/int/C/401.bzip2 23.02 -0.19%
spec/2006/int/C/403.gcc 32.57 -0.43%
spec/2006/int/C/429.mcf 40.35 +0.27%
spec/2006/int/C/445.gobmk 26.96 +0.06%
spec/2006/int/C/456.hmmer 24.4 +0.19%
spec/2006/int/C/458.sjeng 27.91 -0.08%
spec/2006/int/C/462.libquantum 57.47 -0.20%
spec/2006/int/C/464.h264ref 46.52 +1.35%

geometric mean +0.29%

The regression on 453.povray seems real, but is due to secondary effects as all hot functions are bit-identical with and without the flag.

I started this patch to consult upstream opinions on this. It will be greatly appreciated if the community can help test the performance impact of this change on other architectures so that we can decided if this should be target-dependent.

Reviewers: hfinkel, mkuper, davidxl, chandlerc

Reviewed By: chandlerc

Subscribers: rengolin, sanjoy, javed.absar, bjope, dorit, magabari, RKSimon, llvm-commits, mzolotukhin

Differential Revision: https://reviews.llvm.org/D33341

llvm-svn: 305960

014db29b

Jun 21, 2017

SwiftCC: Perform physical layout when computing coercion types · 7b871611

Arnold Schwaighofer authored Jun 21, 2017

We need to take type alignment padding into account whe computing physical
layouts.

The layout must be compatible with the input layout, offsets are defined in
terms of offsets within a packed struct which are computed in terms of the alloc
size of a type.

Usingthe store size we would insert padding for the following type for example:

struct {

  int3 v;
  long long l;
} __attribute((packed))

On x86-64 int3 is padded to int4 alignment. The swiftcc type would be
<{ <3 x float>, [4 x i8], i64 }> which is not compatible with <{ <3 x float>,
i64 }>.

The latter has i64 at offset 16 and the former at offset 20.

rdar://32618125

llvm-svn: 305956

7b871611

Attempt to avoid static init ordering issues with globalMemCounter · 05092380
Eric Fiselier authored Jun 21, 2017
```
llvm-svn: 305955
```
05092380

ELF: Don't dereference Repl in MarkLive. NFCI. · bac3570d

Peter Collingbourne authored Jun 21, 2017

This is unnecessary because --gc-sections runs before ICF.

Differential Revision: https://reviews.llvm.org/D34465

llvm-svn: 305954

bac3570d

[Hexagon] Use MachineInstrBuilder instead of changing instruction in place · 5b933fee
Krzysztof Parzyszek authored Jun 21, 2017
```
llvm-svn: 305953
```
5b933fee