Commits · d1c5a317213a7d5551701bf9e2adf9246ceb6283 · Roger Ferrer / llvm-epi-0.8

May 30, 2013
- Rename variable to be more descriptive. · d1c5a317
  Eric Christopher authored May 30, 2013
```
llvm-svn: 182903
```
  d1c5a317
- Formatting. · 1e1c7f1b
  Eric Christopher authored May 30, 2013
```
llvm-svn: 182902
```
  1e1c7f1b
- Reformat comments here. · 64ae44a0
  Eric Christopher authored May 30, 2013
```
llvm-svn: 182901
```
  64ae44a0
- Order CALLSEQ_START and CALLSEQ_END nodes. · ad6d08ac
  Andrew Trick authored May 29, 2013
```
Fixes PR16146: gdb.base__call-ar-st.exp fails after
pre-RA-sched=source fixes.

Patch by Xiaoyi Guo!

This also fixes an unsupported dbg.value test case. Codegen was
previously incorrect but the test was passing by luck.

llvm-svn: 182885
```
  ad6d08ac
May 29, 2013

X86: Fix Defs/Uses for insts that imp-def/imp-use both an A-register and EFLAGS. · 00e08db3

Ahmed Bougacha authored May 29, 2013

This corrects a problem where x86 instructions that implicitly define/use both
an A-register (RAX, EAX, ..) and EFLAGS were declared as only defining/using
EFLAGS, because the outer "let Defs/Uses = [EFLAGS]" in the various multiclasses
overrides the "let Defs/Uses = [areg]" in BinOpAI.

The instructions deriving from BinOpAI were moved out of the "let Defs", and a
BinOpAI_FF class was created, for instructions that implicitly define and use
EFLAGS and the A-register (SBC, ADC).

llvm-svn: 182883

00e08db3

Don't assume the registers will be enumerated sequentially. · 33b73662
Chad Rosier authored May 29, 2013
```
llvm-svn: 182879
```
33b73662

Enable FastISel on ARM for Linux and NaCl · f60e0e44

JF Bastien authored May 29, 2013

FastISel was only enabled for iOS ARM and Thumb2, this patch enables it
for ARM (not Thumb2) on Linux and NaCl.

Thumb2 support needs a bit more work, mainly around register class
restrictions.

The patch punts to SelectionDAG when doing TLS relocation on non-Darwin
targets. I will fix this and other FastISel-to-SelectionDAG failures in
a separate patch.

The patch also forces FastISel to retain frame pointers: iOS always
keeps them for backtracking (so emitted code won't change because of
this), but Linux was getting much worse code that was incorrect when
using big frames (such as test-suite's lencod). I'll also fix this in a
later patch, it will probably require a peephole so that FastISel
doesn't rematerialize frame pointers back-to-back.

The test changes are straightforward, similar to:
  http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130513/174279.html
They also add a vararg test that got dropped in that change.

I ran all of test-suite on A15 hardware with --optimize-option=-O0 and
all the tests pass.

llvm-svn: 182877

f60e0e44

Don't reach into the middle of TargetMachine and cache one of its ivars. · 70b1400e
Bill Wendling authored May 29, 2013
```
Not only does this break encapsulation, it's gross.

llvm-svn: 182876
```
70b1400e

Teach ReMaterialization to be more cunning about subregisters · b65f6b08

Tim Northover authored May 29, 2013

This allows rematerialization during register coalescing to handle
more cases involving operations like SUBREG_TO_REG which might need to
be rematerialized using sub-register indices.

For example, code like:
    v1(GPR64):sub_32 = MOVZ something
    v2(GPR64) = COPY v1(GPR64)
should be convertable to:
    v2(GPR64):sub_32 = MOVZ something

but previously we just gave up in places like this

llvm-svn: 182872

b65f6b08

Simplify logic by using the appropriate functions. · 4db8f644
Adrian Prantl authored May 29, 2013
```
llvm-svn: 182869
```
4db8f644

LTO+Debug Info: revert r182791. · 4213c39e

Manman Ren authored May 29, 2013

Since the testing case uses ref_addr, which requires version 3+ to work,
we will solve the dwarf version issue first.

This patch also causes failures in one of the bots. I will update the patch
accordingly in my next attempt.

rdar://13926659

llvm-svn: 182867

4213c39e

Tidy some register classes for ARM and Thumb · 13969d0a

JF Bastien authored May 29, 2013

Tidy up three places where the register class for ARM and Thumb wasn't
restrictive enough:
 - No PC dest for reg-reg add/orr/sub.
 - No PC dest for shifts.
 - No PC or SP for Thumb2 reg-imm add.

I encountered this while combining FastISel with
-verify-machineinstrs. These instructions defined registers whose
classes weren't restrictive enough, and the uses failed
verification. They're also undefined in the ISA, or would produce code
that FastISel wouldn't want. This doesn't fix the register class
narrowing issue (where uses should restrict definitions), and isn't
thorough, but it's a small step in the right direction.

llvm-svn: 182863

13969d0a

SparcFrameLowering.cpp: Mark verifyLeafProcRegUse() as UNUSED. [-Wunused-function] · dbd3bbe1
NAKAMURA Takumi authored May 29, 2013
```
llvm-svn: 182850
```
dbd3bbe1
[SystemZ] Immediate compare-and-branch support · e1d9f00f
Richard Sandiford authored May 29, 2013
```
This patch adds support for the CIJ and CGIJ instructions.

llvm-svn: 182846
```
e1d9f00f
Temporary fix to get rid of gcc warning. · ae8faf2e
Patrik Hagglund authored May 29, 2013
```
llvm-svn: 182832
```
ae8faf2e
[Sparc] Add support for leaf functions in sparc backend. · ca0fe2f5
Venkatraman Govindaraju authored May 29, 2013
```
llvm-svn: 182822
```
ca0fe2f5
LoopVectorize.cpp: Fix abuse of StringRef on Twine. Twine captures the pointer of StringRef. · d11b42aa
NAKAMURA Takumi authored May 29, 2013
```
llvm-svn: 182820
```
d11b42aa
Whitespace. · d57ea870
NAKAMURA Takumi authored May 29, 2013
```
llvm-svn: 182819
```
d57ea870

Mips assembler: Improve set register alias handling · 02593003

Jack Carter authored May 28, 2013

This patch solves the problem of numeric register values not being accepted:

../set_alias.s:1:11: error: expected valid expression after comma
        .set    r4,$4
                    ^
The parsing of .set directive is changed and handling of symbols in code 
as well to enable this feature. 

The test example is added.

Patch by Vladimir Medic

llvm-svn: 182807

02593003

May 28, 2013

AArch64: clarify -help message · 8a1aa518
Tim Northover authored May 28, 2013
```
llvm-svn: 182804
```
8a1aa518

Add support for llvm.vectorizer metadata · 5fdf836b

Paul Redmond authored May 28, 2013

- llvm.loop.parallel metadata has been renamed to llvm.loop to be more generic
  by making the root of additional loop metadata.
  - Loop::isAnnotatedParallel now looks for llvm.loop and associated
    llvm.mem.parallel_loop_access
  - document llvm.loop and update llvm.mem.parallel_loop_access
- add support for llvm.vectorizer.width and llvm.vectorizer.unroll
  - document llvm.vectorizer.* metadata
  - add utility class LoopVectorizerHints for getting/setting loop metadata
  - use llvm.vectorizer.width=1 to indicate already vectorized instead of
    already_vectorized
- update existing tests that used llvm.loop.parallel and
  llvm.vectorizer.already_vectorized

Reviewed by: Nadav Rotem

llvm-svn: 182802

5fdf836b

[APInt] Implement tcDecrement as a counterpart to tcIncrement. This is for use... · 9d406f4e

Michael Gottesman authored May 28, 2013

[APInt] Implement tcDecrement as a counterpart to tcIncrement. This is for use in APFloat IEEE-754R 2008 nextUp/nextDown function.

rdar://13852078

llvm-svn: 182801

9d406f4e

ARM: use pristine object file while processing relocations · 3b684d83

Tim Northover authored May 28, 2013

Previously we would read-modify-write the target bits when processing
relocations for the MCJIT. This had the problem that when relocations
were processed multiple times for the same object file (as they can
be), the result is not idempotent and the values became corrupted.

The solution to this is to take any bits used in the destination from
the pristine object file as LLVM emitted it.

This should fix PR16013 and remote MCJIT on ARM ELF targets.

llvm-svn: 182800

3b684d83

LTO+Debug Info: correctly emit inlined_subroutine when the inlined callee is · b5b5453e

Manman Ren authored May 28, 2013

from a different CU.

We used to print out an error message and fail to generate inlined_subroutine.

If we use ref_addr in the generated DWARF, the DWARF version should be 3 or
above.
rdar://13926659

llvm-svn: 182791

b5b5453e

Hexagon: Typo fix. · cceafb2d
Jyotsna Verma authored May 28, 2013
```
llvm-svn: 182790
```
cceafb2d
Simplify code. No functionality change. · 262b1542
Benjamin Kramer authored May 28, 2013
```
llvm-svn: 182779
```
262b1542
Remove double semicolons. · 351d53c2
Benjamin Kramer authored May 28, 2013
```
llvm-svn: 182778
```
351d53c2

Extend RemapInstruction and friends to take an optional new parameter, a ValueMaterializer. · f6f121e2

James Molloy authored May 28, 2013

Extend LinkModules to pass a ValueMaterializer to RemapInstruction and friends to lazily create Functions for lazily linked globals. This is a big win when linking small modules with large (mostly unused) library modules.

llvm-svn: 182776

f6f121e2

[msan] Fix argument shadow alignment. · fca01233
Evgeniy Stepanov authored May 28, 2013
```
llvm-svn: 182771
```
fca01233

[SystemZ] Register compare-and-branch support · 0fb90ab0

Richard Sandiford authored May 28, 2013

This patch adds support for the CRJ and CGRJ instructions.  Support for
the immediate forms will be a separate patch.

The architecture has a large number of comparison instructions.  I think
it's generally better to concentrate on using the "best" comparison
instruction first and foremost, then only use something like CRJ if
CR really was the natual choice of comparison instruction.  The patch
therefore opportunistically converts separate CR and BRC instructions
into a single CRJ while emitting instructions in ISelLowering.

llvm-svn: 182764

0fb90ab0

[SystemZ] Tweak SystemZInstrInfo::isBranch() interface · 53c9efd9
Richard Sandiford authored May 28, 2013
```
This is needed for the upcoming compare-and-branch patch.  No functional
change intended.

llvm-svn: 182762
```
53c9efd9

Make BasicAliasAnalysis recognize the fact a noalias argument cannot alias... · f3e663af

Michael Kuperstein authored May 28, 2013

Make BasicAliasAnalysis recognize the fact a noalias argument cannot alias another argument, even if the other argument is not itself marked noalias.

llvm-svn: 182755

f3e663af

Make it explicit that GlobalAlias are ok in llvm.used. · eaf53276
Rafael Espindola authored May 27, 2013
```
No functionality change.

llvm-svn: 182747
```
eaf53276
Make helper functions static. · f30f2cce
Rafael Espindola authored May 27, 2013
```
And remove header and cpp file that are empty after that.

llvm-svn: 182746
```
f30f2cce

May 27, 2013

Convert sqrt functions into sqrt instructions when -ffast-math is in effect. · 048f99de

Preston Gurd authored May 27, 2013

When -ffast-math is in effect (on Linux, at least), clang defines
__FINITE_MATH_ONLY__ > 0 when including <math.h>. This causes the
preprocessor to include <bits/math-finite.h>, which renames the sqrt functions.
For instance, "sqrt" is renamed as "__sqrt_finite". 

This patch adds the 3 new names in such a way that they will be treated
as equivalent to their respective original names.

llvm-svn: 182739

048f99de

PPC: Add a isConsecutiveLS utility function · 8ebfe6c2

Hal Finkel authored May 27, 2013

isConsecutiveLS is a slightly more general form of
SelectionDAG::isConsecutiveLoad. Aside from also handling stores, it also does
not assume equality of the chain operands is necessary. In the case of the PPC
backend, this chain condition is checked in a more general way by the
surrounding code.

Mostly, this part of the refactoring in preparation for supporting optimized
unaligned stores.

llvm-svn: 182723

8ebfe6c2

May 26, 2013

Prefer to duplicate PPC Altivec loads when expanding unaligned loads · 7d8a691b

Hal Finkel authored May 26, 2013

When expanding unaligned Altivec loads, we use the decremented offset trick to
prevent page faults. Unfortunately, if we have a sequence of consecutive
unaligned loads, this leads to suboptimal code generation because the 'extra'
load from the first unaligned load can be combined with the base load from the
second (but only if the decremented offset trick is not used for the first).
Search up and down the chain, through loads and token factors, looking for
consecutive loads, and if one is found, don't use the offset reduction trick.
These duplicate loads are later combined to yield the desired sequence (in the
future, we might want a more-powerful chain search, but that will require some
changes to allow the combiner routines to access the AA object).

This should complete the initial implementation of the optimized unaligned
Altivec load expansion. There is some refactoring that should be done, but
that will happen when the unaligned store expansion is added.

llvm-svn: 182719

7d8a691b

Fix PR16143: Insert DEBUG_VALUE before terminator. · c66d26ad
Andrew Trick authored May 26, 2013
```
llvm-svn: 182717
```
c66d26ad

May 25, 2013

Add support for DWARF line number table entries for values in the instruction · 80cbcd2d
Cameron Zwarich authored May 25, 2013
```
stream.

llvm-svn: 182712
```
80cbcd2d

PPC: Combine duplicate (offset) lvsl Altivec intrinsics · bc2ee4c4

Hal Finkel authored May 25, 2013

The lvsl permutation control instruction is a function only of the alignment of
the pointer operand (relative to the 16-byte natural alignment of Altivec
vectors). As a result, multiple lvsl intrinsics where the operands differ by a
multiple of 16 can be combined.

llvm-svn: 182708

bc2ee4c4