Commits · d8cadd6f174d3ba2cf74ea36ca4c825476920a2c · Roger Ferrer / llvm-epi-0.8

Jan 23, 2014

Make the use of DW_AT_ranges in the compile unit depend also upon · 1bca60d6
Eric Christopher authored Jan 23, 2014
```
the existence of comdat/special sections.

llvm-svn: 199954
```

Update the X86 assembler for .intel_syntax to produce an error for invalid base · bc570f28

Kevin Enderby authored Jan 23, 2014

registers in memory addresses that do not match the index register. As it does
for .att_syntax.

rdar://15887380

llvm-svn: 199948


Update the X86 assembler for .intel_syntax to produce an error for invalid · 9d11702f

Kevin Enderby authored Jan 23, 2014

scale factors in memory addresses. As it does for .att_syntax.

It was producing:
Assertion failed: (((Scale == 1 || Scale == 2 || Scale == 4 || Scale == 8)) && "Invalid scale!"), function CreateMem, file /Volumes/SandBox/llvm/lib/Target/X86/AsmParser/X86AsmParser.cpp, line 1133.

rdar://14967214

llvm-svn: 199942
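
The condition in the assertion above is the set of scale factors an x86 memory operand can encode. A minimal sketch of the check the parser can run before reporting an error instead of asserting; `isValidScale` is a hypothetical name, not the actual X86AsmParser code:

```cpp
// Hypothetical helper, not the real X86AsmParser implementation.
// An x86 SIB byte can only encode scale factors of 1, 2, 4, or 8,
// so anything else should be reported as a diagnostic.
constexpr bool isValidScale(unsigned Scale) {
  return Scale == 1 || Scale == 2 || Scale == 4 || Scale == 8;
}

static_assert(isValidScale(4), "4 is an encodable scale");
static_assert(!isValidScale(3), "3 must be diagnosed rather than asserted on");
```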


Replace vfmaddxx213 instructions with their 231-type equivalents in accumulator · 23de211c
Lang Hames authored Jan 23, 2014
```
loops. Writing back to the accumulator (231-type) allows the coalescer to
eliminate an extra copy.

llvm-svn: 199933
```
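
A scalar model of the two operand orders may make the difference clearer. This is only a sketch of the instruction semantics, with hypothetical function names; the first argument stands for the destination register, which is also a source:

```cpp
// Hypothetical scalar models of the FMA operand orders (not LLVM code).
// 213 form: dst = src2 * dst + src3, so the destination is a multiplicand.
constexpr double fma213(double Dst, double Src2, double Src3) {
  return Src2 * Dst + Src3;
}

// 231 form: dst = src2 * src3 + dst, so the destination is the addend.
// In a loop computing acc += x * y, the accumulator can be the destination
// directly, which is what lets the coalescer drop the extra copy.
constexpr double fma231(double Dst, double Src2, double Src3) {
  return Src2 * Src3 + Dst;
}

static_assert(fma231(10.0, 2.0, 3.0) == 16.0, "acc = 2 * 3 + acc");
static_assert(fma213(10.0, 2.0, 3.0) == 23.0, "dst = 2 * dst + 3");
```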
Note the PR number. · ff856f4c
Rafael Espindola authored Jan 23, 2014
```
llvm-svn: 199932
```

[Thumbv8] Fix the value of BLXOperandIndex of isV8EligibleForIT · 5930ae6c

Weiming Zhao authored Jan 23, 2014

Originally, BLX was passed as operand #0 in MachineInstr and as operand
#2 in MCInst. But now, it's operand #2 in both cases.

This patch also removes unnecessary FileCheck in the test case added by r199127.

llvm-svn: 199928


Move test to x86 directory. · 589d6c41
Eric Christopher authored Jan 23, 2014
```
llvm-svn: 199927
```
[AArch64] Added vselect patterns with float and double types · 5d31f694
Ana Pazos authored Jan 23, 2014
```
llvm-svn: 199925
```
Avoid emitting a DWARF type attribute for an ObjC property of type · 4c96056a
Eric Christopher authored Jan 23, 2014
```
void.

Patch by Scott Talbot.

llvm-svn: 199924
```

R600: Disable the BFE pattern · a2a4b8ee

Tom Stellard authored Jan 23, 2014

This pattern uses an SDNodeXForm, which isn't being emitted for some
reason.  I can get it to work by attaching the PatLeaf that has the
XForm to the argument in the output pattern, but this results in an
immediate being used in a register operand, which the backend can't
handle yet.

llvm-svn: 199918


R600: Correctly handle vertex fetch clauses that precede ENDIFs · 805890b2

Tom Stellard authored Jan 23, 2014

The control flow finalizer would sometimes use an ALU_POP_AFTER
instruction before the vertex fetch clause instead of using a POP
instruction after it.

llvm-svn: 199917


R600: Unconditionally unroll loops that contain GEPs with alloca pointers · 8cce9bdf

Tom Stellard authored Jan 23, 2014

Implement the getUnrollingPreferences() function for
AMDGPUTargetTransformInfo so that loops that do address calculations
on pointers derived from alloca are unconditionally unrolled.

Unrolling these loops makes it more likely that SROA will be able to
eliminate the allocas, which is a big win for R600 since memory
allocated by alloca (private memory) is really slow.

llvm-svn: 199916


Move a unit test into the correct dir. Sorry if it broke Mips-only builds. · 3cc534ac
Andrew Trick authored Jan 23, 2014
```
llvm-svn: 199911
```

Remove tail marker when changing an argument to an alloca. · 2a05ea5c

Rafael Espindola authored Jan 23, 2014

Argument promotion can replace an argument of a call with an alloca. This
requires clearing the tail marker as it is very likely that the callee is now
using an alloca in the caller.

This fixes pr14710.

llvm-svn: 199909


R600: Recommit 199842: Add work-around for the CF stack entry HW bug · 348273df

Tom Stellard authored Jan 23, 2014

The unit test is now disabled on non-asserts builds.

The CF stack can be corrupted if you use CF_ALU_PUSH_BEFORE,
CF_ALU_ELSE_AFTER, CF_ALU_BREAK, or CF_ALU_CONTINUE when the number of
sub-entries on the stack is greater than or equal to the stack entry
size and sub-entries modulo 4 is either 0 or 3 (on cedar the bug is
present when the number of sub-entries modulo 8 is either 7 or 0).

We choose to be conservative and always apply the work-around when the
number of sub-entries is greater than or equal to the stack entry size,
so that we can safely over-allocate the stack when we are unsure of the
stack allocation rules.

reviewed-by: Vincent Lejeune <vljn at ovi.com>
llvm-svn: 199905
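
The two conditions described in this message can be written as predicates. This is a sketch under stated assumptions, not the code from the patch: the function names, parameter names, and the `IsCedar` flag are hypothetical, and the stack entry size is passed in rather than derived from the hardware generation.

```cpp
// Hypothetical predicates illustrating the commit message (not the patch).

// Exact condition under which the hardware bug can corrupt the CF stack.
constexpr bool hitsStackEntryBug(unsigned SubEntries, unsigned EntrySize,
                                 bool IsCedar) {
  return SubEntries >= EntrySize &&
         (IsCedar ? (SubEntries % 8 == 7 || SubEntries % 8 == 0)
                  : (SubEntries % 4 == 3 || SubEntries % 4 == 0));
}

// Conservative condition actually used to decide on the work-around:
// it ignores the modulo test so over-allocating the stack stays safe.
constexpr bool needsWorkaround(unsigned SubEntries, unsigned EntrySize) {
  return SubEntries >= EntrySize;
}

static_assert(hitsStackEntryBug(8, 4, false), "8 % 4 == 0 triggers the bug");
static_assert(!hitsStackEntryBug(5, 4, false), "5 % 4 == 1 does not");
static_assert(needsWorkaround(5, 4), "but the work-around is applied anyway");
```

The conservative predicate is a superset of the exact one, so every case that can hit the bug also gets the work-around.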


[Object][ELF][Mips] Print symbol name for MIPS ELF relocations. · 793f1b22
Simon Atanasyan authored Jan 23, 2014
```
llvm-svn: 199898
```
AVX-512: added VPERM2D VPERM2Q VPERM2PS VPERM2PD instructions, · a5d38a39
Elena Demikhovsky authored Jan 23, 2014
```
they give better sequences than VPERMI

llvm-svn: 199893
```

ARM: use litpools for normal i32 imms when compiling minsize. · 55c625f2

Tim Northover authored Jan 23, 2014

With constant-sharing, litpool loads consume 4 + N*2 bytes of code, but
movw/movt pairs consume 8*N. This means litpools are better than movw/movt even
with just one use. Other materialisation strategies can still be better though,
so the logic is a little odd.

llvm-svn: 199891
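
The size comparison above can be checked with a little arithmetic. The helpers below are a sketch with hypothetical names, using only the two formulas from the message (N is the number of uses of the constant):

```cpp
// Hypothetical helpers restating the byte counts from the commit message.

// One shared 4-byte literal plus a 2-byte load per use.
constexpr unsigned litpoolBytes(unsigned Uses) { return 4 + 2 * Uses; }

// One 8-byte movw/movt pair per use.
constexpr unsigned movwMovtBytes(unsigned Uses) { return 8 * Uses; }

static_assert(litpoolBytes(1) == 6 && movwMovtBytes(1) == 8,
              "litpool already wins with a single use");
static_assert(litpoolBytes(3) == 10 && movwMovtBytes(3) == 24,
              "and the gap grows with each extra use");
```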


Prevent repetitive warnings for unrecognized processors and features · a5158963
Artyom Skrobov authored Jan 23, 2014
```
llvm-svn: 199886
```

[LPM] Make LoopSimplify no longer a LoopPass and instead both a utility · aa7fa5e4

Chandler Carruth authored Jan 23, 2014

function and a FunctionPass.

This has many benefits. The motivating use case was to be able to
compute function analysis passes *after* running LoopSimplify (to avoid
invalidating them) and then to run other passes which require
LoopSimplify. Specifically passes like unrolling and vectorization are
critical to wire up to BranchProbabilityInfo and BlockFrequencyInfo so
that they can be profile aware. For the LoopVectorize pass the only
things in the way are LoopSimplify and LCSSA. This fixes LoopSimplify
and LCSSA is next on my list.

There are also a bunch of other benefits of doing this:
- It is now very feasible to make more passes *preserve* LoopSimplify
  because they can simply run it after changing a loop. Because
  subsequent passes can assume LoopSimplify is preserved we can reduce
  the runs of this pass to the times when we actually mutate a loop
  structure.
- The new pass manager should be able to more easily support loop passes
  factored in this way.
- We can at long, long last observe that LoopSimplify is preserved
  across SCEV. This *halves* the number of times we run LoopSimplify!!!

Now, getting here wasn't trivial. First off, the interfaces used by
LoopSimplify are all over the map regarding how analyses are updated. We
end up with weird "pass" parameters as a consequence. I'll try to clean
at least some of this up later -- I'll have to have it all clean for the
new pass manager.

Next up I discovered a really frustrating bug. LoopUnroll *claims* to
preserve LoopSimplify. That's actually a lie. But the way the
LoopPassManager ends up running the passes, it always ran LoopSimplify
on the unrolled-into loop, rectifying this oversight before any
verification could kick in and point out that in fact nothing was
preserved. So I've added code to the unroller to *actually* simplify the
surrounding loop when it succeeds at unrolling.

The only functional change in the test suite is that we now catch a case
that was previously missed because SCEV and other loop transforms see
their containing loops as simplified and thus don't miss some
opportunities. One test case has been converted to check that we catch
this case rather than checking that we miss it but at least don't get
the wrong answer.

Note that I have #if-ed out all of the verification logic in
LoopSimplify! This is a temporary workaround while extracting these bits
from the LoopPassManager. Currently, there is no way to have a pass in
the LoopPassManager which preserves LoopSimplify along with one which
does not. The LPM will try to verify on each loop in the nest that
LoopSimplify holds but the now-Function-pass cannot distinguish what
loop is being verified and so must try to verify all of them. The
innermost loop is clearly no longer simplified as there is a pass which
didn't even *attempt* to preserve it. =/ Once I get LCSSA out (and maybe
LoopVectorize and some other fixes) I'll be able to re-enable this check
and catch any places where we are still failing to preserve
LoopSimplify. If this causes problems I can back this out and try to
commit *all* of this at once, but so far this seems to work and allow
much more incremental progress.

llvm-svn: 199884


[AArch64]Add CHECK for two test cases testing scalar_to_vector committed in r199461. · b920682e
Hao Liu authored Jan 23, 2014
```
llvm-svn: 199861
```

[Mips] TargetStreamer Support for .set mips16. · 39536724

Jack Carter authored Jan 22, 2014

This patch updates .set mips16 support which
affects the ELF ABI and its flags. In addition the patch uses
a common interface for both the MipsTargetStreamer and
MipsObjectStreamer that the assembler uses for
both ELF and ASCII output for these directives.

llvm-svn: 199851


Jan 22, 2014

Revert r162101 and replace it with a solution that works for targets where the... · 77e4d444

Owen Anderson authored Jan 22, 2014

Revert r162101 and replace it with a solution that works for targets where the pointer type is illegal.
This is a horrible bit of code. We're calling a simplification routine *in the middle* of type legalization. We tell the
simplification routine that it's running after legalization, but some of the types it will encounter will be illegal! The
fix is only to invoke the simplification if the types in question were legal, so that none of its invariants will be violated.

llvm-svn: 199847


Add CHECK-LABELs · 88b3cc70
Matt Arsenault authored Jan 22, 2014
```
llvm-svn: 199846
```

Revert "R600: Add work-around for the CF stack entry HW bug" · 31e16388

Tom Stellard authored Jan 22, 2014

This reverts commit 35b8331cad6eb512a2506adbc394201181da94ba.

The -debug-only flag for llc doesn't appear to be available in
all build configurations.

llvm-svn: 199845


Provide a dummy section to fix a crash with inline assembly in LTO. · 20fcda71
Rafael Espindola authored Jan 22, 2014
```
Fixes pr18508.

llvm-svn: 199843
```

R600: Add work-around for the CF stack entry HW bug · e89373e0

Tom Stellard authored Jan 22, 2014

The CF stack can be corrupted if you use CF_ALU_PUSH_BEFORE,
CF_ALU_ELSE_AFTER, CF_ALU_BREAK, or CF_ALU_CONTINUE when the number of
sub-entries on the stack is greater than or equal to the stack entry
size and sub-entries modulo 4 is either 0 or 3 (on cedar the bug is
present when the number of sub-entries modulo 8 is either 7 or 0).

We choose to be conservative and always apply the work-around when the
number of sub-entries is greater than or equal to the stack entry size,
so that we can safely over-allocate the stack when we are unsure of the
stack allocation rules.

reviewed-by: Vincent Lejeune <vljn at ovi.com>
llvm-svn: 199842


R600: Refactor stack size calculation · a40f9715
Tom Stellard authored Jan 22, 2014
```
reviewed-by: Vincent Lejeune <vljn at ovi.com>
llvm-svn: 199840
```
Handle an addrspacecast case in memcpyopt · 84de6114
Matt Arsenault authored Jan 22, 2014
```
llvm-svn: 199836
```
Eliminate inappropriate use of FindProgramByName() from lli · a1186382
Alp Toker authored Jan 22, 2014
```
llvm-svn: 199835
```
Add a testcase for r199430. · 5a8739e0
Quentin Colombet authored Jan 22, 2014
```
llvm-svn: 199831
```
R600: MOVA is vector only · 476437cb
Tom Stellard authored Jan 22, 2014
```
llvm-svn: 199827
```
R600: Take alignment into account when calculating the stack offset · 598f3945
Tom Stellard authored Jan 22, 2014
```
llvm-svn: 199826
```
R600: Add support for global addresses with constant initializers · 04c0e985
Tom Stellard authored Jan 22, 2014
```
llvm-svn: 199825
```

R600: Begin private memory at the second GPR. · 27982b1d

Tom Stellard authored Jan 22, 2014

This way private memory does not over-write work group information
stored in GPRs 0 and 1.

llvm-svn: 199824


R600/SI: Add support for i8 and i16 private loads/stores · e9373605
Tom Stellard authored Jan 22, 2014
```
llvm-svn: 199823
```

Bug 18228 - Fix accepting bitcasts between vectors of pointers with a · fc3c91d0

Matt Arsenault authored Jan 22, 2014

different number of elements.

Bitcasts between vectors of pointers with different numbers of elements
were being accepted because the element-count check compared
SrcTy->getVectorNumElements() == SrcTy->getVectorNumElements(), which
is always true. The addrspacecast check was also wrong, but that case
at least is caught by the verifier. Refactor bitcast and addrspacecast
handling in castIsValid to be more readable and fix this problem.

llvm-svn: 199821
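
The bug is a self-comparison, which is easy to reproduce in isolation. The sketch below uses a hypothetical `VecTy` stand-in and hypothetical function names; it is not the castIsValid code, only an illustration of why comparing the source type against itself accepts every cast:

```cpp
// Hypothetical stand-in for a vector type; only the element count matters.
struct VecTy {
  unsigned NumElements;
};

constexpr bool sameElementCount(VecTy A, VecTy B) {
  return A.NumElements == B.NumElements;
}

// Buggy shape: the source type is compared against itself, so the
// destination type is never inspected and the check always passes.
constexpr bool buggyCheck(VecTy Src, VecTy /*Dst*/) {
  return sameElementCount(Src, Src);
}

// Fixed shape: compare the source against the destination.
constexpr bool fixedCheck(VecTy Src, VecTy Dst) {
  return sameElementCount(Src, Dst);
}

static_assert(buggyCheck(VecTy{2}, VecTy{4}), "mismatch wrongly accepted");
static_assert(!fixedCheck(VecTy{2}, VecTy{4}), "mismatch now rejected");
```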


Fix inline assembly that switches between ARM and Thumb modes · 1f6a6086

Greg Fitzgerald authored Jan 22, 2014

This patch restores the ARM mode if the user's inline assembly
does not.  In the object streamer, it ensures that instructions
following the inline assembly are encoded correctly and that
correct mapping symbols are emitted.  For the asm streamer, it
emits a .arm or .thumb directive.

This patch does not ensure that the inline assembly contains
the ADR instruction to switch modes at runtime.

The problem we need to solve is code like this:

  int foo(int a, int b) {
    int r = a + b;
    asm volatile(
        ".align 2     \n"
        ".arm         \n"
        "add r0,r0,r0 \n"
    : : "r"(r));
    return r+1;
  }

If we compile this function in thumb mode then the inline assembly
will switch to arm mode. We need to make sure that we switch back to
thumb mode after emitting the inline assembly or we will incorrectly
encode the instructions that follow (i.e. the assembly instructions
for return r+1).

Based on patch by David Peixotto

Change-Id: Ib57f6d2d78a22afad5de8693fba6230ff56ba48b
llvm-svn: 199818


[x86] Allow segment and address-size overrides for INS[BWLQ] (PR9385) · 4ce66069
David Woodhouse authored Jan 22, 2014
```
llvm-svn: 199809
```
[x86] Allow segment and address-size overrides for OUTS[BWLQ] (PR9385) · c472b813
David Woodhouse authored Jan 22, 2014
```
llvm-svn: 199808
```