Commits · dd7e3bd6335804985f4543bf3bec217ca43d5d47 · Roger Ferrer / llvm-epi

Jul 15, 2015

[LAA] Turn RuntimePointerChecking into a class, start hiding things, NFC · 3f3fd8d5
Adam Nemet authored Jul 14, 2015
```
The goal is to start hiding internal APIs.

llvm-svn: 242220
```
3f3fd8d5

[LAA] Introduce RuntimePointerChecking::PointerInfo, NFC · 9f7dedc3

Adam Nemet authored Jul 14, 2015

Turn this structure-of-arrays (i.e. the various pointer attributes) into
array-of-structures.

llvm-svn: 242219

9f7dedc3

[LAA] Lift RuntimePointerCheck out of LoopAccessInfo, NFC · 7cdebac0

Adam Nemet authored Jul 14, 2015

I am planning to add more nested classes inside RuntimePointerCheck so
all these triple-nesting would be hard to follow.

Also rename it to RuntimePointerChecking (i.e. append 'ing').

llvm-svn: 242218

7cdebac0

[PowerPC] Use the ABI indirect-call protocol for patchpoints · 9bbad03b

Hal Finkel authored Jul 14, 2015

We used to take the address specified as the direct target of the patchpoint
and did no TOC-pointer handling. This, however, as not all that useful,
because MCJIT tends to create a lot of modules, and they have their own TOC
sections. Thus, to call from the generated code to other generated code, you
really need to switch TOC pointers. Make this work as expected, and under
ELFv1, tread the address as the function descriptor address so that the correct
TOC pointer can be loaded.

llvm-svn: 242217

9bbad03b

Add support for reading members out of thin archives. · 4b83cb53

Rafael Espindola authored Jul 14, 2015

For now the Archive owns the buffers of the thin archive members.
This makes for a simple API, but all the buffers are destructed
only when the archive is destructed. This should be fine since we
close the files after mmap so we should not hit an open file
limit.

llvm-svn: 242215

4b83cb53

[ExecutionEngine] Re-apply r241962 with fixes for ARM. · 6f7012c2
Lang Hames authored Jul 14, 2015
```
Patch by Pierre-Andre Saulais. Thanks Pierre-Andre!

llvm-svn: 242213
```
6f7012c2

Add allnodes() iterator range to SelectionDAG. NFC. · 65c69407

Pete Cooper authored Jul 14, 2015

SelectionDAG already had begin/end methods for iterating over all
the nodes, but didn't define an iterator_range for us in foreach
loops.

This adds such a method and uses it in some of the eligible places
throughout the backends.

llvm-svn: 242212

65c69407

Jul 14, 2015

Move SDNode::IROrder in to padding to save space. NFC. · 8c20b4eb

Pete Cooper authored Jul 14, 2015

There was a 32-bit padding gap between 'unsigned short NumOperands, NumValues;' and 'DebugLoc debugLoc. Move 'unsigned IROrder' in to that gap.

This trims the size of SDNode's from 76 bytes (really 80 due to alignment) to 72 bytes.

llvm-svn: 242211

8c20b4eb

Constify parameters in SelectionDAG methods. NFC · 06e249e7
Pete Cooper authored Jul 14, 2015
```
llvm-svn: 242210
```
06e249e7

Remove unnecessary .getNode() in SelectionDAG. NFC. · cf17e18f

Pete Cooper authored Jul 14, 2015

The simplify_type specialisation allows us to cast directly from
SDValue to an SDNode* subclass so we don't need to pass a SDNode*
to cast<>.

llvm-svn: 242209

cf17e18f

Use more foreach loops in SelectionDAG. NFC · e89ba67f
Pete Cooper authored Jul 14, 2015
```
llvm-svn: 242208
```
e89ba67f
MIR Serialization: Serialize the machine basic block live in registers. · 9fab370d
Alex Lorenz authored Jul 14, 2015
```
llvm-svn: 242204
```
9fab370d

MIR Printer: move the function 'printReg'. NFC. · 15a00a85

Alex Lorenz authored Jul 14, 2015

This commit moves the function 'printReg' towards the start of the file so that
it can be used by the conversion methods in MIRPrinter and not just the printing
methods in MIPrinter.

llvm-svn: 242203

15a00a85

GVN: use a static array instead of regenerating it each time. NFC. · 586b7419
Tim Northover authored Jul 14, 2015
```
llvm-svn: 242202
```
586b7419

WebAssembly: add basic int/fp instruction codegen. · d9767a36

JF Bastien authored Jul 14, 2015

Summary: This patch has the most basic instruction codegen for 32 and 64 bit int/fp.

Reviewers: sunfish

Subscribers: llvm-commits, jfb

Differential Revision: http://reviews.llvm.org/D11193

llvm-svn: 242201

d9767a36

Fix NDEBUG build warning · fdfaae42
Krzysztof Parzyszek authored Jul 14, 2015
```
llvm-svn: 242200
```
fdfaae42

GVN: tolerate an instruction being replaced without existing in the leaderboard · d5fdef01

Tim Northover authored Jul 14, 2015

Sometimes an incidentally created instruction can duplicate a Value used
elsewhere. It then often doesn't end up in the leader table. If it's later
removed, we attempt to remove it from the leader table and segfault.

Instead we should just ignore the removal request, which won't cause any
problems. The reverse situation, where the original instruction is replaced by
the new one (which you might think could leave the leader table empty) cannot
occur, because the incidental instruction will never be found in the first
place.

llvm-svn: 242199

d5fdef01

test-release.sh: Remove the InstallDir parameter from configure_llvmCore · 923860d8
Hans Wennborg authored Jul 14, 2015
```
After r242187, it's never set.

llvm-svn: 242194
```
923860d8
Fix Windows build: replace __func__ with LLVM_FUNCTION_NAME · 2e828836
Krzysztof Parzyszek authored Jul 14, 2015
```
llvm-svn: 242192
```
2e828836

[MMX] Use the appropriate instructions for GR64 <-> VR64 copies. · 9e6dea1d

Bruno Cardoso Lopes authored Jul 14, 2015

MOVSDto64rr and MOV64toSDrr are defined to convert between FR64 (%xmm)
<-> GR64 registers, not VR64 (%mm) <-> GR64. This is wrong.

I found this by inspection and could not find a suitable testcase for it
since (1) we don't handle MMX bitcasts in Peephole optimizer as to
generate COPYs that (2) could be expanded back to the appropriate x86
instruction in ExpandPostRA.

Switch to use the appropriate instructions: MMX_MOVD64from64rr and
MMX_MOVD64to64rr here.

llvm-svn: 242191

9e6dea1d

[PowerPC] Fix the PPCInstrInfo::getInstrLatency implementation · 8acae527

Hal Finkel authored Jul 14, 2015

PowerPC uses itineraries to describe processor pipelines (and dispatch-group
restrictions for P7/P8 cores). Unfortunately, the target-independent
implementation of TII.getInstrLatency calls ItinData->getStageLatency, and that
looks for the largest cycle count in the pipeline for any given instruction.
This, however, yields the wrong answer for the PPC itineraries, because we
don't encode the full pipeline. Because the functional units are fully
pipelined, we only model the initial stages (there are no relevant hazards in
the later stages to model), and so the technique employed by getStageLatency
does not really work. Instead, we should take the maximum output operand
latency, and that's what PPCInstrInfo::getInstrLatency now does.

This caused some test-case churn, including two unfortunate side effects.
First, the new arrangement of copies we get from function parameters now
sometimes blocks VSX FMA mutation (a FIXME has been added to the code and the
test cases), and we have one significant test-suite regression:

SingleSource/Benchmarks/BenchmarkGame/spectral-norm
56.4185% +/- 18.9398%

In this benchmark we have a loop with a vectorized FP divide, and it with the
new scheduling both divides end up in the same dispatch group (which in this
case seems to cause a problem, although why is not exactly clear). The grouping
structure is hard to predict from the bottom of the loop, and there may not be
much we can do to fix this.

Very few other test-suite performance effects were really significant, but
almost all weakly favor this change. However, in light of the issues
highlighted above, I've left the old behavior available via a
command-line flag.

llvm-svn: 242188

8acae527

Fix several issues with the test-release.sh script · 7fa38a53

Dan Liew authored Jul 14, 2015

* Use the default install prefix (/usr/local) and use DESTDIR instead to
  set a temporary install location for tarballing. This is the correct
  way to package binary releases (otherwise the temporary install path
  ends up in files in the binary release).
* Remove ``-disable-clang`` option. It did not work correctly
  (tarballing assumed phase 3 was run) and when doing a release
  we should always be doing a three-phased build and test.

Note: Technically we should only be using DESTDIR for the third phase
and use --prefix for the first and second phase because we run the built
clang from phase 1 and 2 (and in general an application's behaviour
may depend on the install prefix). However in the case of clang it
seems to not care what the install prefix was so to simplify the script
we use DESTDIR for all three stages.

llvm-svn: 242187

7fa38a53

[Hexagon] Generate instructions for operations on predicate registers · 75874470

Krzysztof Parzyszek authored Jul 14, 2015

Convert logical operations on general-purpose registers to the correspon-
ding operations on predicate registers.

llvm-svn: 242186

75874470

[CodeGen] Force emission of personality directive if explicitly specified · aff703a2

Keno Fischer authored Jul 14, 2015

Summary:
Before this change, personality directives were not emitted
if there was no invoke left in the function (of course until
recently this also meant that we couldn't know what
the personality actually was). This patch forces personality directives
to still be emitted, unless it is known to be a noop in the absence of
invokes, or the user explicitly specified `nounwind` (and not
`uwtable`) on the function.

Reviewers: majnemer, rnk

Subscribers: rnk, llvm-commits

Differential Revision: http://reviews.llvm.org/D10884

llvm-svn: 242185

aff703a2

Add support for on-disk hash table lookup with a known hash, for situations... · 86a6f56c

Richard Smith authored Jul 14, 2015

Add support for on-disk hash table lookup with a known hash, for situations where the same key will be looked up in multiple tables.

llvm-svn: 242179

86a6f56c

Teach config.guess that MSYS exists. · 52511aa3

Yaron Keren authored Jul 14, 2015

We might not want to upgrade config.guess to the current
version due to the license change from GPL2 to GPL3.

llvm-svn: 242178

52511aa3

AMDGPU: Avoid using 64-bit shift for i64 (shl x, 32) · 24692118

Matt Arsenault authored Jul 14, 2015

This can be done only with moves which theoretically
will optimize better later.

Although this transform increases the instruction count,
it should be code size / cycle count neutral in the worst
VALU case. It also seems to slightly improve a couple
of testcases due to other DAG combines this exposes.

This is probably slightly worse for the SALU case, so
it might be better to handle this during moveToVALU,
although then you lose some simplifications like
the load width reducing in the simple testcase.

llvm-svn: 242177

24692118

AMDGPU/SI: Fix read2 merging into a super register. · 84db5d97

Matt Arsenault authored Jul 14, 2015

If the read2 produced was supposed to be writing into a
super register, it would use the wrong subregister indices.
Fix this by inserting copies, so we only ever write to a vreg_64.
Run the register coalescer again to clean this up, although this
isn't ideal and often does result in an extra move.

Also remove the assert that offset1 > offset0.

There isn't a real reason to not allow this other than a minor
convenience in the compiler, and it doesn't seem worth the effort
of avoiding it.

llvm-svn: 242174

84db5d97

MachineRegisterInfo: Remove UsedPhysReg infrastructure · 9912bb81

Matthias Braun authored Jul 14, 2015

We have a detailed def/use lists for every physical register in
MachineRegisterInfo anyway, so there is little use in maintaining an
additional bitset of which ones are used.

Removing it frees us from extra book keeping. This simplifies
VirtRegMap.

Differential Revision: http://reviews.llvm.org/D10911

llvm-svn: 242173

9912bb81

Avoid MSVC-incompatible use of init list. · 9f8ff9c4
David Blaikie authored Jul 14, 2015
```
llvm-svn: 242170
```
9f8ff9c4

RAGreedy: Keep track of allocated PhysRegs internally · 953393a7

Matthias Braun authored Jul 14, 2015

Do not use MachineRegisterInfo::setPhysRegUsed()/isPhysRegUsed()
anymore. This bitset changes function-global state and is set by the
VirtRegRewriter anyway.
Simply use a bitvector private to RAGreedy.

Differential Revision: http://reviews.llvm.org/D10910

llvm-svn: 242169

953393a7

Add missing builtins to the PPC back end for ABI compliance (vol. 4) · 984a3613

Nemanja Ivanovic authored Jul 14, 2015

This patch corresponds to review:
http://reviews.llvm.org/D11183

Back end portion of the fourth round of additions to altivec.h.

llvm-svn: 242167

984a3613

ARM: add at least one real test for r242123. · feabe2e2

Tim Northover authored Jul 14, 2015

The ones committed were orthogonal to the change and would have passed before
that revision. What it *did* do was prevent an assertion failure when
generating object files.

llvm-svn: 242166

feabe2e2

PrologEpilogInserter: Rewrite API to determine callee save regsiters. · 02564865

Matthias Braun authored Jul 14, 2015

This changes TargetFrameLowering::processFunctionBeforeCalleeSavedScan():

- Rename the function to determineCalleeSaves()
- Pass a bitset of callee saved registers by reference, thus avoiding
  the function-global PhysRegUsed bitset in MachineRegisterInfo.
- Without PhysRegUsed the implementation is fine tuned to not save
  physcial registers which are only read but never modified.

Related to rdar://21539507

Differential Revision: http://reviews.llvm.org/D10909

llvm-svn: 242165

02564865

AArch64: add rev64 alias for 64-bit rev instruction. · c962d4f2

Tim Northover authored Jul 14, 2015

It could be useful to assembly programmers and makes the permitted variants a
little more uniform.

llvm-svn: 242164

c962d4f2

[Hexagon] Generate "extract" instructions more aggressively · a0ecf07c

Krzysztof Parzyszek authored Jul 14, 2015

Generate extract instructions (via intrinsics) before the DAG combiner
folds shifts into unrecognizable forms.

llvm-svn: 242163

a0ecf07c

llvm-ar: Don't try to extract from thin archives. · e549b8c2
Rafael Espindola authored Jul 14, 2015
```
This matches the gnu ar behavior.

llvm-svn: 242162
```
e549b8c2
ARMAsmParser: Take MCInst param by const-ref · 61f9efe7
Hans Wennborg authored Jul 14, 2015
```
(Broken out from http://reviews.llvm.org/D11167)

llvm-svn: 242160
```
61f9efe7
Add default value for Args parameter of IRBuilder::CreateCall · c28708bc
David Blaikie authored Jul 14, 2015
```
Convenient for calls to zero-argument functions.

Patch by servuswiegehtz at yahoo.de

llvm-svn: 242159
```
c28708bc
Sleep for 2.1 seconds to see if that makes the test stable on windows. · 4ae78439
Rafael Espindola authored Jul 14, 2015
```
Might fix pr24106.

llvm-svn: 242158
```
4ae78439