Commits · f9701c6e8362dc85db1ac98d4e5773affe75b871 · Roger Ferrer / llvm-epi

Jan 10, 2015

[PowerPC] Mark zext of a small scalar load as free · 5d5d1539

Hal Finkel authored Jan 10, 2015

This initial implementation of PPCTargetLowering::isZExtFree marks as free
zexts of small scalar loads (that are not sign-extending). This callback is
used by SelectionDAGBuilder's RegsForValue::getCopyToRegs, and thus to
determine whether a zext or an anyext is used to lower illegally-typed PHIs.
Because later truncates of zero-extended values are nops, this allows for the
elimination of later unnecessary truncations.

Fixes the initial complaint associated with PR22120.

llvm-svn: 225584

5d5d1539

Remove some whitespace. · 17744c1e
Justin Hibbits authored Jan 10, 2015
```
llvm-svn: 225583
```
17744c1e
ConvertUTFTest: fix misleading empty line · cbc7ae25
Dmitri Gribenko authored Jan 10, 2015
```
llvm-svn: 225580
```
cbc7ae25

tests: fix previous commit · c552218e

Saleem Abdulrasool authored Jan 10, 2015

The previous commit accidentally missed changes to the test output checking,
resulting in an errant failure.

llvm-svn: 225577

c552218e

test: merge ARM relocations test · 48bbb6c8

Saleem Abdulrasool authored Jan 10, 2015

There is a fair number of relocations that are part of the AAELF specification.
Simply merge the tests into a single test file, otherwise, we will end up with
far too many test files to test each relocation type.  NFC.

llvm-svn: 225576

48bbb6c8

tests: convert a couple of ARM relocation tests to readobj · ff2da70f

Saleem Abdulrasool authored Jan 10, 2015

These tests are checking the relocation generation.  Use the readobj output as
it is much easier to follow when glancing over the tests.

llvm-svn: 225575

ff2da70f

Fully fix Bug #22115. · 654346e6

Justin Hibbits authored Jan 10, 2015

Summary:
In the previous commit, the register was saved, but space was not allocated.
This resulted in the parameter save area potentially clobbering r30, leading to
nasty results.

Test Plan: Tests updated

Reviewers: hfinkel

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D6906

llvm-svn: 225573

654346e6

Fix undefined behavior (shift of negative value) in RuntimeDyldMachOAArch64::encodeAddend. · 7c8a7251

Alexey Samsonov authored Jan 10, 2015

Test Plan: regression test suite with/without UBSan.

Reviewers: lhames, ributzka

Subscribers: aemerson, llvm-commits

Differential Revision: http://reviews.llvm.org/D6908

llvm-svn: 225568

7c8a7251

[PowerPC] Readjust the loop unrolling threshold · 611b127a

Hal Finkel authored Jan 10, 2015

Now that the way that the partial unrolling threshold for small loops is used
to compute the unrolling factor as been corrected, a slightly smaller threshold
is preferable. This is expected; other targets may need to re-tune as well.

llvm-svn: 225566

611b127a

[LoopUnroll] Fix the partial unrolling threshold for small loop sizes · 38dd5908

Hal Finkel authored Jan 10, 2015

When we compute the size of a loop, we include the branch on the backedge and
the comparison feeding the conditional branch. Under normal circumstances,
these don't get replicated with the rest of the loop body when we unroll. This
led to the somewhat surprising behavior that really small loops would not get
unrolled enough -- they could be unrolled more and the resulting loop would be
below the threshold, because we were assuming they'd take
(LoopSize * UnrollingFactor) instructions after unrolling, instead of
(((LoopSize-2) * UnrollingFactor)+2) instructions. This fixes that computation.

llvm-svn: 225565

38dd5908

Use the DiagnosticHandler to print diagnostics when reading bitcode. · d0b23bef

Rafael Espindola authored Jan 10, 2015

The bitcode reading interface used std::error_code to report an error to the
callers and it is the callers job to print diagnostics.

This is not ideal for error handling or diagnostic reporting:

* For error handling, all that the callers care about is 3 possibilities:
  * It worked
  * The bitcode file is corrupted/invalid.
  * The file is not bitcode at all.

* For diagnostic, it is user friendly to include far more information
  about the invalid case so the user can find out what is wrong with the
  bitcode file. This comes up, for example, when a developer introduces a
  bug while extending the format.

The compromise we had was to have a lot of error codes.

With this patch we use the DiagnosticHandler to communicate with the
human and std::error_code to communicate with the caller.

This allows us to have far fewer error codes and adds the infrastructure to
print better diagnostics. This is so because the diagnostics are printed when
he issue is found. The code that detected the problem in alive in the stack and
can pass down as much context as needed. As an example the patch updates
test/Bitcode/invalid.ll.

Using a DiagnosticHandler also moves the fatal/non-fatal error decision to the
caller. A simple one like llvm-dis can just use fatal errors. The gold plugin
needs a bit more complex treatment because of being passed non-bitcode files. An
hypothetical interactive tool would make all bitcode errors non-fatal.

llvm-svn: 225562

d0b23bef

Fix UBSan error reports in ValueMapCallbackVH and AssertingVH<T> empty/tombstone keys generation. · 29e464f0

Alexey Samsonov authored Jan 09, 2015

Summary:
One more attempt to fix UBSan reports: make sure DenseMapInfo::getEmptyKey()
and DenseMapInfo::getTombstoneKey() doesn't do any upcasts/downcasts to/from Value*.

Test Plan: check-llvm test suite with/without UBSan bootstrap

Reviewers: chandlerc, dexonsmith

Subscribers: llvm-commits, majnemer

Differential Revision: http://reviews.llvm.org/D6903

llvm-svn: 225558

29e464f0

Disable Go bindings test under UBSan. · 55acbc07
Alexey Samsonov authored Jan 09, 2015
```
llvm-svn: 225557
```
55acbc07

Jan 09, 2015

Fix the JIT event listeners and replace the associated tests. · a10379ad

Andrew Kaylor authored Jan 09, 2015

The changes to EventListenerCommon.h were contributed by Arch Robison.

This fixes bug 22095.

http://reviews.llvm.org/D6905

llvm-svn: 225554

a10379ad

Update comment. · d9ade185
Michael Zolotukhin authored Jan 09, 2015
```
llvm-svn: 225553
```
d9ade185

SimplifyCFG: check uses of constant-foldable instrs in switch destinations (PR20210) · dcc6e5bc

Hans Wennborg authored Jan 09, 2015

The previous code assumed that such instructions could not have any uses
outside CaseDest, with the motivation that the instruction could not
dominate CommonDest because CommonDest has phi nodes in it. That simply
isn't true; e.g., CommonDest could have an edge back to itself.

llvm-svn: 225552

dcc6e5bc

[X86][SSE] Avoid vector byte shuffles with zero by using pshufb to create zeros · ec1f2c2c

Simon Pilgrim authored Jan 09, 2015

pshufb can shuffle in zero bytes as well as bytes from a source vector - we can use this to avoid having to shuffle 2 vectors and ORing the result when the used inputs from a vector are all zeroable.

Differential Revision: http://reviews.llvm.org/D6878

llvm-svn: 225551

ec1f2c2c

Fix an ASAN failure introduced with r225537 (adding the -universal-headers to llvm-obdump). · 0512bd75
Kevin Enderby authored Jan 09, 2015
```
And a fly by fix to some formatting issues with the same commit.

llvm-svn: 225550
```
0512bd75
Add a testcase of llvm-lto error handling. · 1ea49d1b
Rafael Espindola authored Jan 09, 2015
```
llvm-svn: 225545
```
1ea49d1b
Remove duplicating code. NFC. · 1c38bc12
Michael Zolotukhin authored Jan 09, 2015
```
The removed condition is checked in the previous loop.

llvm-svn: 225542
```
1c38bc12
Add the option, -universal-headers, used with -macho to print the Mach-O... · 131d1770
Kevin Enderby authored Jan 09, 2015
```
Add the option, -universal-headers, used with -macho to print the Mach-O universal headers to llvm-objdump.

llvm-svn: 225537
```
131d1770

Re-reapply r221924: "[GVN] Perform Scalar PRE on gep indices that feed loads before · eb16112e

Tim Northover authored Jan 09, 2015

doing Load PRE"

It's not really expected to stick around, last time it provoked a weird LTO
build failure that I can't reproduce now, and the bot logs are long gone. I'll
re-revert it if the failures recur.

Original description: Perform Scalar PRE on gep indices that feed loads before
doing Load PRE.

llvm-svn: 225536

eb16112e

Recommit r224935 with a fix for the ObjC++/AArch64 bug that that revision · 1e923ec1
Lang Hames authored Jan 09, 2015
```
introduced.

A test case for the bug was already committed in r225385.

Patch by Rafael Espindola.

llvm-svn: 225534
```
1e923ec1

Revert "Bitcode: Move the DEBUG_LOC record to DEBUG_LOC_OLD" · 9ed19665

Duncan P. N. Exon Smith authored Jan 09, 2015

This reverts commit r225498 (but leaves r225499, which was a worthy
cleanup).

My plan was to change `DEBUG_LOC` to store the `MDNode` directly rather
than its operands (patch was to go out this morning), but on reflection
it's not clear that it's strictly better.  (I had missed that the
current code is unlikely to emit the `MDNode` at all.)

Conflicts:
	lib/Bitcode/Reader/BitcodeReader.cpp (due to r225499)

llvm-svn: 225531

9ed19665

[mips] Add support for accessing $gp as a named register. · 1440bb2a

Daniel Sanders authored Jan 09, 2015

Summary:
Mips Linux uses $gp to hold a pointer to thread info structure and accesses it
with a named register. This makes this work for LLVM.

The N32 ABI doesn't quite work yet since the frontend generates incorrect IR
for this case. It neglects to truncate the 64-bit GPR to a 32-bit value before
converting to a pointer. Given correct IR (as in the testcase in this patch),
it works correctly.

Reviewers: sstankovic, vmedic, atanasyan

Reviewed By: atanasyan

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D6893

llvm-svn: 225529

1440bb2a

fix typos; remove names from comments; NFC · 2ce8169e
Sanjay Patel authored Jan 09, 2015
```
llvm-svn: 225528
```
2ce8169e
remove names from comments; NFC · 2a385e24
Sanjay Patel authored Jan 09, 2015
```
llvm-svn: 225526
```
2a385e24
fix typos; NFC · 938e2790
Sanjay Patel authored Jan 09, 2015
```
llvm-svn: 225525
```
938e2790
fix typo; NFC · e6e58c1a
Sanjay Patel authored Jan 09, 2015
```
llvm-svn: 225524
```
e6e58c1a
more efficient use of a dyn_cast; no functional change intended · d729115f
Sanjay Patel authored Jan 09, 2015
```
llvm-svn: 225523
```
d729115f

[PowerPC] Enable late partial unrolling on the POWER7 · b359b735

Hal Finkel authored Jan 09, 2015

The P7 benefits from not have really-small loops so that we either have
multiple dispatch groups in the loop and/or the ability to form more-full
dispatch groups during scheduling. Setting the partial unrolling threshold to
44 seems good, empirically, for the P7. Compared to using no late partial
unrolling, this yields the following test-suite speedups:

SingleSource/Benchmarks/Adobe-C++/simple_types_constant_folding
	-66.3253% +/- 24.1975%
SingleSource/Benchmarks/Misc-C++/oopack_v1p8
	-44.0169% +/- 29.4881%
SingleSource/Benchmarks/Misc/pi
	-27.8351% +/- 12.2712%
SingleSource/Benchmarks/Stanford/Bubblesort
	-30.9898% +/- 22.4647%

I've speculatively added a similar setting for the P8. Also, I've noticed that
the unroller does not quite calculate the unrolling factor correctly for really
tiny loops because it neglects to account for the fact that not every loop body
replicant contains an ending branch and counter increment. I'll fix that later.

llvm-svn: 225522

b359b735

[mips] Add comment which explains why we need to change the assembler options... · 68e8a9c0

Toma Tabacu authored Jan 09, 2015

[mips] Add comment which explains why we need to change the assembler options before and after inline asm blocks. NFC.

llvm-svn: 225521

68e8a9c0

Assumption that "VectorizedValue" will always be an Instruction is not correct. · 85d04736

Suyog Sarda authored Jan 09, 2015

It can be a constant or a vector argument.

ex :

define i32 @hadd(<4 x i32> %a) #0 {
entry:
  %vecext = extractelement <4 x i32> %a, i32 0
  %vecext1 = extractelement <4 x i32> %a, i32 1
  %add = add i32 %vecext, %vecext1
  %vecext2 = extractelement <4 x i32> %a, i32 2
  %add3 = add i32 %add, %vecext2
  %vecext4 = extractelement <4 x i32> %a, i32 3
  %add5 = add i32 %add3, %vecext4
  ret i32 %add5
}

llvm-svn: 225517

85d04736

ARM: add support for R_ARM_ABS16 · b68fa3b5
Saleem Abdulrasool authored Jan 09, 2015
```
Add support for R_ARM_ABS16 relocation mapping.  Addresses PR22156.

llvm-svn: 225510
```
b68fa3b5

test: add additional test for SVN r225507 · 3e81ecfe

Saleem Abdulrasool authored Jan 09, 2015

Add an additional test case to ensure that we generate the relocation even if
the thumb target is used.

llvm-svn: 225509

3e81ecfe

ARM: add support for R_ARM_ABS8 relocations · 3c0f78a2
Saleem Abdulrasool authored Jan 09, 2015
```
Add support for R_ARM_ABS8 relocation.  Addresses PR22126.

llvm-svn: 225507
```
3c0f78a2

RegisterCoalescer: Fix removeCopyByCommutingDef with subreg liveness · 7e873845

Matthias Braun authored Jan 09, 2015

The code that eliminated additional coalescable copies in
removeCopyByCommutingDef() used MergeValueNumberInto() which internally
may merge A into B or B into A. In this case A and B had different Def
points, so we have to reset ValNo.Def to the intended one after merging.

llvm-svn: 225503

7e873845

RegisterCoalescer: Some cleanup in removeCopyByCommutingDef(), NFC · ea399e59
Matthias Braun authored Jan 09, 2015
```
llvm-svn: 225502
```
ea399e59
RegisterCoalescer: No need to set kill flags, they are recompute later anyway · 55586a2f
Matthias Braun authored Jan 09, 2015
```
llvm-svn: 225501
```
55586a2f
RegisterCoalescer: Turn some impossible conditions into asserts · 6588b145
Matthias Braun authored Jan 09, 2015
```
llvm-svn: 225500
```
6588b145