Commits · d3c7b8e303e96fae3ab0c3fc6a42f7f8b87a17e4 · Roger Ferrer / llvm-epi

Aug 01, 2016

[msf] Teach LLVM to parse a split Fpm. · d3c7b8e3

Zachary Turner authored Aug 01, 2016

The FPM is split at regular intervals across the MSF file, as the MS code
suggests. It turns out that the value of the interval is precisely the
block size. If the block size is 4096, then there are two Fpm pages every
4096 blocks.

So here we teach the PDBFile class to parse a split FPM, and also add more
options when dumping the FPM to display some additional information such
as orphaned pages (pages which the FPM says are allocated, but which
nothing appears to use), use after free pages (pages which the FPM says
are not allocated, but which are referenced by a stream), and multiple use
pages (pages which the FPM says are allocated but are used more than
once).

Reviewed By: ruiu
Differential Revision: https://reviews.llvm.org/D23022

llvm-svn: 277388

d3c7b8e3

[ExecutionEngine][MCJIT][Orc] Replace RuntimeDyld::SymbolInfo with JITSymbol. · ad4a911f

Lang Hames authored Aug 01, 2016

This patch replaces RuntimeDyld::SymbolInfo with JITSymbol: A symbol class
that is capable of lazy materialization (i.e. the symbol definition needn't be
emitted until the address is requested). This can be used to support common
and weak symbols in the JIT (though this is not implemented in this patch).

For consistency, RuntimeDyld::SymbolResolver is renamed to JITSymbolResolver.

For space efficiency a new class, JITEvaluatedSymbol, is introduced that
behaves like the old RuntimeDyld::SymbolInfo - i.e. it is just a pair of an
address and symbol flags. Instances of JITEvaluatedSymbol can be used in
symbol-tables to avoid paying the space cost of the materializer.

llvm-svn: 277386

ad4a911f

[Hexagon] Tidy up some code, NFC: reapply r277372 with a fix · 317d42c1
Krzysztof Parzyszek authored Aug 01, 2016
```
llvm-svn: 277383
```
317d42c1
[Profile] IR profiling minor cleanup /nfc · d119761b
Xinliang David Li authored Aug 01, 2016
```
Differential Revision: http://reviews.llvm.org/D22995

llvm-svn: 277379
```
d119761b
[LV] Move isGatherOrScatterLegal into LoopVectorizationLegality (NFC) · 228f9731
Matthew Simpson authored Aug 01, 2016
```
llvm-svn: 277376
```
228f9731
[LV] Use getPointerOperand helper where appropriate (NFC) · 1ce88ff6
Matthew Simpson authored Aug 01, 2016
```
llvm-svn: 277375
```
1ce88ff6
Revert r277372, it is causing buildbot failures · d978ae23
Krzysztof Parzyszek authored Aug 01, 2016
```
llvm-svn: 277374
```
d978ae23
[Hexagon] Tidy up some code, NFC · 1f72abb5
Krzysztof Parzyszek authored Aug 01, 2016
```
llvm-svn: 277372
```
1f72abb5

[DAGCombine] Make sext(setcc) combine respect getBooleanContents · c97da7f3

Michael Kuperstein authored Aug 01, 2016

We used to combine "sext(setcc x, y, cc) -> (select (setcc x, y, cc), -1, 0)"
Instead, we should combine to (select (setcc x, y, cc), T, 0) where the value
of T is 1 or -1, depending on the type of the setcc, and getBooleanContents()
for the type if it is not i1.

This fixes PR28504.

llvm-svn: 277371

c97da7f3

[Hexagon] Generate vector printing instructions · 8123b966
Ron Lieberman authored Aug 01, 2016
```
llvm-svn: 277370
```
8123b966

[CFLAA] Remove modref queries from CFLAA. · 5f0e76dc

George Burgess IV authored Aug 01, 2016

As it turns out, modref queries are broken with CFLAA. Specifically,
the data source we were using for determining modref behaviors
explicitly ignores operations on non-pointer values. So, it wouldn't
note e.g. storing an i32 to an i32* (or loading an i64 from an i64*).
It also ignores external function calls, rather than acting
conservatively for them.

(N.B. These operations, where necessary, *are* tracked by CFLAA; we just
use a different mechanism to do so. Said mechanism is relatively
imprecise, so it's unlikely that we can provide reasonably good modref
answers with it as implemented.)

Patch by Jia Chen.

Differential Revision: https://reviews.llvm.org/D22978

llvm-svn: 277366

5f0e76dc

[AArch64] Add support for Samsung Exynos M2 (NFC). · 82e245a2
Evandro Menezes authored Aug 01, 2016
```
llvm-svn: 277364
```
82e245a2

[CFLAA] Make CFLAnders more conservative with new Values. · 4c582660

George Burgess IV authored Aug 01, 2016

Currently, CFLAnders assumes that values it hasn't seen don't alias
anything. This patch fixes that. Given that the only way for this to
happen is to query AA, rely on specific transformations happening, then
query AA again (looking for a specific set of queries), lit testing is a
bit difficult. If someone really wants a test, I'm happy to add one.

Patch by Jia Chen.

Differential Revision: https://reviews.llvm.org/D22981

llvm-svn: 277362

4c582660

Included test for r277360. · d1548eaa
David Majnemer authored Aug 01, 2016
```
llvm-svn: 277361
```
d1548eaa
[Verifier] Resume instructions can only be in functions w/ a personality · ba6665d8
David Majnemer authored Aug 01, 2016
```
This fixes PR28799.

llvm-svn: 277360
```
ba6665d8
Replace MachineInstr* with MachineInstr& in TargetInstrInfo, NFC · 8fb181ca
Krzysztof Parzyszek authored Aug 01, 2016
```
There were a few cases introduced with the modulo scheduler.

llvm-svn: 277358
```
8fb181ca

[Hexagon] Check for offset overflow when reserving scavenging slots · ddafa2cd

Krzysztof Parzyszek authored Aug 01, 2016

Scavenging slots were only reserved when pseudo-instruction expansion in
frame lowering created new virtual registers. It is possible to still
need a scavenging slot even if no virtual registers were created, in cases
where the stack is large enough to overflow instruction offsets.

llvm-svn: 277355

ddafa2cd

Add removed inline-assembly-comment test from r277146 · 6e0b7320
Nirav Dave authored Aug 01, 2016
```
llvm-svn: 277349
```
6e0b7320

[mips][fastisel] Correct argument lowering for (f64, f64, i32) and similar. · b3ae33c7

Daniel Sanders authored Aug 01, 2016

Summary:
Allocating an AFGR64 shadows two GPR32's instead of just one.

This fixes an LNT regression detected by our internal buildbots.

Reviewers: sdardis

Subscribers: dsanders, sdardis, llvm-commits

Differential Revision: https://reviews.llvm.org/D23012

llvm-svn: 277348

b3ae33c7

[AMDGPU] refactor DS instruction definitions. NFC. · 902db310
Valery Pykhtin authored Aug 01, 2016
```
Differential revision: https://reviews.llvm.org/D22522

llvm-svn: 277344
```
902db310
[X86] Use implicit masking of SHLD/SHRD shift double instructions · 46f119a5
Simon Pilgrim authored Aug 01, 2016
```
Similar to the regular shift instructions, SHLD/SHRD only use the bottom bits of the shift value

llvm-svn: 277341
```
46f119a5
Fixed test check ordering issue on windows buildbots · 7fd4ad68
Simon Pilgrim authored Aug 01, 2016
```
llvm-svn: 277337
```
7fd4ad68
Fixed MSVC out of range shift warning · 2ddeee17
Simon Pilgrim authored Aug 01, 2016
```
llvm-svn: 277333
```
2ddeee17

[SimplifyCFG] Fix nasty RAUW bug from r277325 · bade86ce

James Molloy authored Aug 01, 2016

Using RAUW was wrong here; if we have a switch transform such as:
  18 -> 6 then
  6 -> 0

If we use RAUW, while performing the second transform the  *transformed* 6
from the first will be also replaced, so we end up with:
  18 -> 0
  6 -> 0

Found by clang stage2 bootstrap; testcase added.

llvm-svn: 277332

bade86ce

[AArch64] Return the correct size for TLSDESC_CALLSEQ · ab5a4c7d

Diana Picus authored Aug 01, 2016

The branch relaxation pass is computing the wrong offsets because it assumes
TLSDESC_CALLSEQ eats up 4 bytes, when in fact it is lowered to an instruction
sequence taking up 16 bytes. This can become a problem in huge files with lots
of TLS accesses, as it may slowly move branch targets out of the range computed
by the branch relaxation pass.

Fixes PR24234 https://llvm.org/bugs/show_bug.cgi?id=24234

Differential Revision: https://reviews.llvm.org/D22870

llvm-svn: 277331

ab5a4c7d

[AVX-512] Fix a test missed in r277327. · d2b2d745
Craig Topper authored Aug 01, 2016
```
llvm-svn: 277330
```
d2b2d745

[SimplifyCFG] Try and pacify buildbots after r277325 · 91821bd0

James Molloy authored Aug 01, 2016

It looks like the two independent parts of the rotate operation (a lshr and shl) are being reordered on some bots. Add CHECK-DAGs to account for this.

llvm-svn: 277329

91821bd0

[AVX-512] Fix duplicate column in AVX512 execution dependency table that was... · c48c0296

Craig Topper authored Aug 01, 2016

[AVX-512] Fix duplicate column in AVX512 execution dependency table that was preventing VMOVDQU32/VMOVDQA32 from being recognized. Fix a bug in the code that stops execution dependency fix from turning operations on 32-bit integer element types into operations on 64-bit integer element types.

llvm-svn: 277327

c48c0296

[X86] Regenerate a test to pick up shuffle comments that were added at some point. · ddc96cd3
Craig Topper authored Aug 01, 2016
```
llvm-svn: 277326
```
ddc96cd3

[SimplifyCFG] Range reduce switches · b2e436de

James Molloy authored Aug 01, 2016

If a switch is sparse and all the cases (once sorted) are in arithmetic progression, we can extract the common factor out of the switch and create a dense switch. For example:

    switch (i) {
    case 5: ...
    case 9: ...
    case 13: ...
    case 17: ...
    }

can become:

    if ( (i - 5) % 4 ) goto default;
    switch ((i - 5) / 4) {
    case 0: ...
    case 1: ...
    case 2: ...
    case 3: ...
    }

or even better:

   switch ( ROTR(i - 5, 2) {
   case 0: ...
   case 1: ...
   case 2: ...
   case 3: ...
   }

The division and remainder operations could be costly so we only do this if the factor is a power of two, and emit a right-rotate instead of a divide/remainder sequence. Dense switches can be lowered significantly better than sparse switches and can even be transformed into lookup tables.

llvm-svn: 277325

b2e436de

[mips] Clang generates unaligned offset for MSA instruction st.d · 00d96ee7
Hrvoje Varga authored Aug 01, 2016
```
Differential Revision: https://reviews.llvm.org/D19475

llvm-svn: 277323
```
00d96ee7

[AArch64] Register passes so they can be run by llc · 850043b2

Diana Picus authored Aug 01, 2016

Initialize all AArch64-specific passes in the TargetMachine so they can be run
by llc. This can lead to conflicts in opt with some command line options that
share the same name as the pass, so I took this opportunity to do some cleanups:
* rename all relevant command line options from "aarch64-blah" to
  "aarch64-enable-blah" and update the tests accordingly
* run clang-format on their declarations
* move all these declarations to a common place (the TargetMachine) as opposed
  to having them scattered around (AArch64BranchRelaxation and
  AArch64AddressTypePromotion were the only offenders)

llvm-svn: 277322

850043b2

[AVX-512] Teach X86InstrInfo::getLargestLegalSuperClass to inflate to... · 749a111f

Craig Topper authored Aug 01, 2016

[AVX-512] Teach X86InstrInfo::getLargestLegalSuperClass to inflate to FR32X/FR64X if AVX512 is supported and VR128X/VR256X if VLX is supported.

Had to update a stack folding test to clobber the other 16 registers since this now made them get used instead of spilling.

llvm-svn: 277321

749a111f

[AVX512] Replace scalar fp arithmetic intrinsics with native IR in an AVX512... · 9161e4ec

Craig Topper authored Aug 01, 2016

[AVX512] Replace scalar fp arithmetic intrinsics with native IR in an AVX512 test. The intrinsics aren't lowered to AVX512 instructions.

The intrinsics really should be removed and autoupgraded.

llvm-svn: 277320

9161e4ec

[AVX-512] Use FR32X/FR64X/VR128X/VR256X register classes in addRegisterClass... · 39461763

Craig Topper authored Aug 01, 2016

[AVX-512] Use FR32X/FR64X/VR128X/VR256X register classes in addRegisterClass if AVX512(for FR32X/FR64) or VLX(for VR128X/VR256) is supported. This is a minimal requirement to be able to allocate all 32 registers.

llvm-svn: 277319

39461763

[X86] Move mask register handling into the main switch of... · da50eec2

Craig Topper authored Aug 01, 2016

[X86] Move mask register handling into the main switch of getLoadStoreRegOpcode. No functional change intended.

llvm-svn: 277318

da50eec2

Revert r277313 and r277314. · 423c7149

Sean Silva authored Aug 01, 2016

They seem to trigger an LSan failure:
http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/15140/steps/check-llvm%20asan/logs/stdio

Revert "Add the tests for r277313"

This reverts commit r277314.

Revert "CodeExtractor : Add ability to preserve profile data."

This reverts commit r277313.

llvm-svn: 277317

423c7149

Move this test to x86-specific directory. · e5a5c966

Sean Silva authored Aug 01, 2016

No bots have yelled yet, but this test references an x86 intrinsic.
Also, it invokes llc on x86 IR.

Fixup to r277315.

llvm-svn: 277316

e5a5c966

Fix - CodeExtractor : Inherit Target Dependent Attributes from the parent function. · a0a802ab

Sean Silva authored Aug 01, 2016

When extracting a set of blocks make sure to inherit all of the target
dependent attributes to make sure that the function will be valid for
lowering. One example is the "target-features" attribute for x86, if the
extracted region has functionality that relies on a specific feature it
will fail to be lowered.
This also allows for extracted functions to be valid for inlining, at
least back into the parent function, as the target attributes are tested
when inlining for compatibility.

Patch by River Riddle!

Differential Revision: https://reviews.llvm.org/D22713

llvm-svn: 277315

a0a802ab

Add the tests for r277313 · 72be9a69
Sean Silva authored Aug 01, 2016
```
Forgot to `git add` them.

llvm-svn: 277314
```
72be9a69