Commits · 4213c39e3c8a1e14efee8d0636266a26721cc27c · Roger Ferrer / llvm-epi-0.8

May 29, 2013

LTO+Debug Info: revert r182791. · 4213c39e

Manman Ren authored May 29, 2013

Since the testing case uses ref_addr, which requires version 3+ to work,
we will solve the dwarf version issue first.

This patch also causes failures in one of the bots. I will update the patch
accordingly in my next attempt.

rdar://13926659

llvm-svn: 182867

4213c39e

Tidy some register classes for ARM and Thumb · 13969d0a

JF Bastien authored May 29, 2013

Tidy up three places where the register class for ARM and Thumb wasn't
restrictive enough:
 - No PC dest for reg-reg add/orr/sub.
 - No PC dest for shifts.
 - No PC or SP for Thumb2 reg-imm add.

I encountered this while combining FastISel with
-verify-machineinstrs. These instructions defined registers whose
classes weren't restrictive enough, and the uses failed
verification. They're also undefined in the ISA, or would produce code
that FastISel wouldn't want. This doesn't fix the register class
narrowing issue (where uses should restrict definitions), and isn't
thorough, but it's a small step in the right direction.

llvm-svn: 182863

13969d0a

SparcFrameLowering.cpp: Mark verifyLeafProcRegUse() as UNUSED. [-Wunused-function] · dbd3bbe1
NAKAMURA Takumi authored May 29, 2013
```
llvm-svn: 182850
```
dbd3bbe1
[SystemZ] Immediate compare-and-branch support · e1d9f00f
Richard Sandiford authored May 29, 2013
```
This patch adds support for the CIJ and CGIJ instructions.

llvm-svn: 182846
```
e1d9f00f
Temporary fix to get rid of gcc warning. · ae8faf2e
Patrik Hagglund authored May 29, 2013
```
llvm-svn: 182832
```
ae8faf2e
[Sparc] Add support for leaf functions in sparc backend. · ca0fe2f5
Venkatraman Govindaraju authored May 29, 2013
```
llvm-svn: 182822
```
ca0fe2f5
LoopVectorize.cpp: Fix abuse of StringRef on Twine. Twine captures the pointer of StringRef. · d11b42aa
NAKAMURA Takumi authored May 29, 2013
```
llvm-svn: 182820
```
d11b42aa
Whitespace. · d57ea870
NAKAMURA Takumi authored May 29, 2013
```
llvm-svn: 182819
```
d57ea870

Mips assembler: Improve set register alias handling · 02593003

Jack Carter authored May 28, 2013

This patch solves the problem of numeric register values not being accepted:

../set_alias.s:1:11: error: expected valid expression after comma
        .set    r4,$4
                    ^
The parsing of .set directive is changed and handling of symbols in code 
as well to enable this feature. 

The test example is added.

Patch by Vladimir Medic

llvm-svn: 182807

02593003

May 28, 2013

AArch64: clarify -help message · 8a1aa518
Tim Northover authored May 28, 2013
```
llvm-svn: 182804
```
8a1aa518

Add support for llvm.vectorizer metadata · 5fdf836b

Paul Redmond authored May 28, 2013

- llvm.loop.parallel metadata has been renamed to llvm.loop to be more generic
  by making the root of additional loop metadata.
  - Loop::isAnnotatedParallel now looks for llvm.loop and associated
    llvm.mem.parallel_loop_access
  - document llvm.loop and update llvm.mem.parallel_loop_access
- add support for llvm.vectorizer.width and llvm.vectorizer.unroll
  - document llvm.vectorizer.* metadata
  - add utility class LoopVectorizerHints for getting/setting loop metadata
  - use llvm.vectorizer.width=1 to indicate already vectorized instead of
    already_vectorized
- update existing tests that used llvm.loop.parallel and
  llvm.vectorizer.already_vectorized

Reviewed by: Nadav Rotem

llvm-svn: 182802

5fdf836b

[APInt] Implement tcDecrement as a counterpart to tcIncrement. This is for use... · 9d406f4e

Michael Gottesman authored May 28, 2013

[APInt] Implement tcDecrement as a counterpart to tcIncrement. This is for use in APFloat IEEE-754R 2008 nextUp/nextDown function.

rdar://13852078

llvm-svn: 182801

9d406f4e

ARM: use pristine object file while processing relocations · 3b684d83

Tim Northover authored May 28, 2013

Previously we would read-modify-write the target bits when processing
relocations for the MCJIT. This had the problem that when relocations
were processed multiple times for the same object file (as they can
be), the result is not idempotent and the values became corrupted.

The solution to this is to take any bits used in the destination from
the pristine object file as LLVM emitted it.

This should fix PR16013 and remote MCJIT on ARM ELF targets.

llvm-svn: 182800

3b684d83

LTO+Debug Info: correctly emit inlined_subroutine when the inlined callee is · b5b5453e

Manman Ren authored May 28, 2013

from a different CU.

We used to print out an error message and fail to generate inlined_subroutine.

If we use ref_addr in the generated DWARF, the DWARF version should be 3 or
above.
rdar://13926659

llvm-svn: 182791

b5b5453e

Hexagon: Typo fix. · cceafb2d
Jyotsna Verma authored May 28, 2013
```
llvm-svn: 182790
```
cceafb2d
Simplify code. No functionality change. · 262b1542
Benjamin Kramer authored May 28, 2013
```
llvm-svn: 182779
```
262b1542
Remove double semicolons. · 351d53c2
Benjamin Kramer authored May 28, 2013
```
llvm-svn: 182778
```
351d53c2

Extend RemapInstruction and friends to take an optional new parameter, a ValueMaterializer. · f6f121e2

James Molloy authored May 28, 2013

Extend LinkModules to pass a ValueMaterializer to RemapInstruction and friends to lazily create Functions for lazily linked globals. This is a big win when linking small modules with large (mostly unused) library modules.

llvm-svn: 182776

f6f121e2

[msan] Fix argument shadow alignment. · fca01233
Evgeniy Stepanov authored May 28, 2013
```
llvm-svn: 182771
```
fca01233

[SystemZ] Register compare-and-branch support · 0fb90ab0

Richard Sandiford authored May 28, 2013

This patch adds support for the CRJ and CGRJ instructions.  Support for
the immediate forms will be a separate patch.

The architecture has a large number of comparison instructions.  I think
it's generally better to concentrate on using the "best" comparison
instruction first and foremost, then only use something like CRJ if
CR really was the natual choice of comparison instruction.  The patch
therefore opportunistically converts separate CR and BRC instructions
into a single CRJ while emitting instructions in ISelLowering.

llvm-svn: 182764

0fb90ab0

[SystemZ] Tweak SystemZInstrInfo::isBranch() interface · 53c9efd9
Richard Sandiford authored May 28, 2013
```
This is needed for the upcoming compare-and-branch patch.  No functional
change intended.

llvm-svn: 182762
```
53c9efd9

Make BasicAliasAnalysis recognize the fact a noalias argument cannot alias... · f3e663af

Michael Kuperstein authored May 28, 2013

Make BasicAliasAnalysis recognize the fact a noalias argument cannot alias another argument, even if the other argument is not itself marked noalias.

llvm-svn: 182755

f3e663af

Make it explicit that GlobalAlias are ok in llvm.used. · eaf53276
Rafael Espindola authored May 27, 2013
```
No functionality change.

llvm-svn: 182747
```
eaf53276
Make helper functions static. · f30f2cce
Rafael Espindola authored May 27, 2013
```
And remove header and cpp file that are empty after that.

llvm-svn: 182746
```
f30f2cce

May 27, 2013

Convert sqrt functions into sqrt instructions when -ffast-math is in effect. · 048f99de

Preston Gurd authored May 27, 2013

When -ffast-math is in effect (on Linux, at least), clang defines
__FINITE_MATH_ONLY__ > 0 when including <math.h>. This causes the
preprocessor to include <bits/math-finite.h>, which renames the sqrt functions.
For instance, "sqrt" is renamed as "__sqrt_finite". 

This patch adds the 3 new names in such a way that they will be treated
as equivalent to their respective original names.

llvm-svn: 182739

048f99de

PPC: Add a isConsecutiveLS utility function · 8ebfe6c2

Hal Finkel authored May 27, 2013

isConsecutiveLS is a slightly more general form of
SelectionDAG::isConsecutiveLoad. Aside from also handling stores, it also does
not assume equality of the chain operands is necessary. In the case of the PPC
backend, this chain condition is checked in a more general way by the
surrounding code.

Mostly, this part of the refactoring in preparation for supporting optimized
unaligned stores.

llvm-svn: 182723

8ebfe6c2

May 26, 2013

Prefer to duplicate PPC Altivec loads when expanding unaligned loads · 7d8a691b

Hal Finkel authored May 26, 2013

When expanding unaligned Altivec loads, we use the decremented offset trick to
prevent page faults. Unfortunately, if we have a sequence of consecutive
unaligned loads, this leads to suboptimal code generation because the 'extra'
load from the first unaligned load can be combined with the base load from the
second (but only if the decremented offset trick is not used for the first).
Search up and down the chain, through loads and token factors, looking for
consecutive loads, and if one is found, don't use the offset reduction trick.
These duplicate loads are later combined to yield the desired sequence (in the
future, we might want a more-powerful chain search, but that will require some
changes to allow the combiner routines to access the AA object).

This should complete the initial implementation of the optimized unaligned
Altivec load expansion. There is some refactoring that should be done, but
that will happen when the unaligned store expansion is added.

llvm-svn: 182719

7d8a691b

Fix PR16143: Insert DEBUG_VALUE before terminator. · c66d26ad
Andrew Trick authored May 26, 2013
```
llvm-svn: 182717
```
c66d26ad

May 25, 2013

Add support for DWARF line number table entries for values in the instruction · 80cbcd2d
Cameron Zwarich authored May 25, 2013
```
stream.

llvm-svn: 182712
```
80cbcd2d

PPC: Combine duplicate (offset) lvsl Altivec intrinsics · bc2ee4c4

Hal Finkel authored May 25, 2013

The lvsl permutation control instruction is a function only of the alignment of
the pointer operand (relative to the 16-byte natural alignment of Altivec
vectors). As a result, multiple lvsl intrinsics where the operands differ by a
multiple of 16 can be combined.

llvm-svn: 182708

bc2ee4c4

Track IR ordering of SelectionDAG nodes 3/4. · e2431c64

Andrew Trick authored May 25, 2013

Remove the old IR ordering mechanism and switch to new one.  Fix unit
test failures.

llvm-svn: 182704

e2431c64

Track IR ordering of SelectionDAG nodes 2/4. · ef9de2a7

Andrew Trick authored May 25, 2013

Change SelectionDAG::getXXXNode() interfaces as well as call sites of
these functions to pass in SDLoc instead of DebugLoc.

llvm-svn: 182703

ef9de2a7

Track IR ordering of SelectionDAG nodes 1/4. · 175143bf

Andrew Trick authored May 25, 2013

Use a field in the SelectionDAGNode object to track its IR ordering.
This adds fields and utility classes without changing existing
interfaces or functionality.

llvm-svn: 182701

175143bf

ArrayRef-ize MD5 and clean up a few variable names. · fcee6f0a

Eric Christopher authored May 24, 2013

Add a stringize method to make dumping a bit easier, and add a testcase
exercising a few different paths.

llvm-svn: 182692

fcee6f0a

PPC: Initial support for permutation-based unaligned Altivec loads · cf2e9080

Hal Finkel authored May 24, 2013

Altivec only directly supports aligned loads, but the loads have a strange
property: If given an unaligned address, they truncate the address to the next
lower aligned address, and load from there. This property, along with an extra
load and some special-purpose permutation-control instructions that generate
the appropriate permutations from the original unaligned address, allow
efficient lowering of aligned loads. This code uses the trick explained in the
Apple Velocity Engine optimization overview document to prevent the needed
extra load from possibly causing a page fault if the original address happens
to be aligned.

As noted in the FIXMEs, there are several additional optimizations that can be
performed to reduce the cost of these loads even more. These will be
implemented in future commits.

llvm-svn: 182691

cf2e9080

Follow up of the introduction of MCSymbolizer. · f482805c

Quentin Colombet authored May 24, 2013

- Ressurect old MCDisassemble API to soften transition.
- Extend MCTargetDesc to set target specific symbolizer.

llvm-svn: 182688

f482805c

Replace Count{Leading,Trailing}Zeros_{32,64} with count{Leading,Trailing}Zeros. · df1ecbd7
Michael J. Spencer authored May 24, 2013
```
llvm-svn: 182680
```
df1ecbd7

May 24, 2013

Add missing header for atexit. · 4fd69975
Michael J. Spencer authored May 24, 2013
```
llvm-svn: 182672
```
4fd69975

[objc-arc] KnownSafe does not imply that it is safe to perform code motion... · e67f40c5

Michael Gottesman authored May 24, 2013

[objc-arc] KnownSafe does not imply that it is safe to perform code motion across CFG edges since even if it is safe to remove RR pairs, we may still be able to move a retain/release into a loop.

rdar://13949644

llvm-svn: 182670

e67f40c5

[objc-arc] Make sure that multiple owners is propogated correctly through the... · 5a91bbf3

Michael Gottesman authored May 24, 2013

[objc-arc] Make sure that multiple owners is propogated correctly through the pass via the usage of a global data structure.

rdar://13750319

llvm-svn: 182669

5a91bbf3