Commits · e6aafbb2424e013870852b0c7f1c64d1d8c4b516 · Roger Ferrer / llvm-epi

May 28, 2014

Don't hard-code ld when extracting host linker version, use ${LD} if · e6aafbb2
Joerg Sonnenberger authored May 28, 2014
```
it is set.

llvm-svn: 209742
```
e6aafbb2
[asancov] Don't emit extra runtime calls when compiling without coverage. · 386b58d0
Evgeniy Stepanov authored May 28, 2014
```
llvm-svn: 209721
```
386b58d0

[AArch64] Add store post-index update folding regression tests for the load/store optimizer. · 7c747fc7

Tilmann Scheller authored May 28, 2014

Add regression tests for the following transformation:

  str X, [x20]
   ...
  add x20, x20, #32
   ->
  str X, [x20], #32

with X being either w0, x0, s0, d0 or q0.

llvm-svn: 209715

7c747fc7

utils: Teach lldbDataFormatters to load automatically · dc30fbf7

Justin Bogner authored May 28, 2014

Add an __lldb_init_module function so that importing the
lldbDataFormatters script automatically adds the formatters.

llvm-svn: 209712

dc30fbf7

[AArch64] Add load post-index update folding regression tests for the load/store optimizer. · 35e45146

Tilmann Scheller authored May 28, 2014

Add regression tests for the following transformation:

 ldr X, [x20]
  ...
 add x20, x20, #32
  ->
 ldr X, [x20], #32

 with X being either w0, x0, s0, d0 or q0.

llvm-svn: 209711

35e45146

Change representation of instruction ranges where variable is accessible. · bb2990df

Alexey Samsonov authored May 27, 2014

Use more straightforward way to represent the set of instruction
ranges where the location of a user variable is defined - vector of pairs
of instructions (defining start/end of each range),
instead of a flattened vector of instructions where some instructions
are supposed to start the range, and the rest are supposed to "clobber" it.

Simplify the code which generates actual .debug_loc entries.

No functionality change.

llvm-svn: 209698

bb2990df

Factor out looking for prologue end into a function · 8a86d6da
Alexey Samsonov authored May 27, 2014
```
llvm-svn: 209697
```
8a86d6da

avoid type mismatch when building SCEVs · f93ef123

Sebastian Pop authored May 27, 2014

This is a corner case I have stumbled upon when dealing with ARM64 type
conversions. I was not able to extract a testcase for the community codebase to
fail on. The patch conservatively discards a division that would have ended up
in an ICE due to a type mismatch when building a multiply expression. I have
also added code to a place that builds add expressions and in which we should be
careful not to pass in operands of different types.

llvm-svn: 209694

f93ef123

do not use the GCD to compute the delinearization strides · e30bd351

Sebastian Pop authored May 27, 2014

We do not need to compute the GCD anymore after we removed the constant
coefficients from the terms: the terms are now all parametric expressions and
there is no need to recognize constant terms that divide only a subset of the
terms. We only rely on the size of the terms, i.e., the number of operands in
the multiply expressions, to sort the terms and recognize the parametric
dimensions.

llvm-svn: 209693

e30bd351

remove BasePointer before delinearizing · 28e6b97b

Sebastian Pop authored May 27, 2014

No functional change is intended: instead of relying on the delinearization to
come up with the base pointer as a remainder of the divisions in the
delinearization, we just compute it from the array access and use that value.
We substract the base pointer from the SCEV to be delinearized and that
simplifies the work of the delinearizer.

llvm-svn: 209692

28e6b97b

remove constant terms · a6e58605

Sebastian Pop authored May 27, 2014

The delinearization is needed only to remove the non linearity induced by
expressions involving multiplications of parameters and induction variables.
There is no problem in dealing with constant times parameters, or constant times
an induction variable.

For this reason, the current patch discards all constant terms and multipliers
before running the delinearization algorithm on the terms. The only thing
remaining in the term expressions are parameters and multiply expressions of
parameters: these simplified term expressions are passed to the array shape
recognizer that will not recognize constant dimensions anymore: these will be
recognized as different strides in parametric subscripts.

The only important special case of a constant dimension is the size of elements.
Instead of relying on the delinearization to infer the size of an element,
compute the element size from the base address type. This is a much more precise
way of computing the element size than before, as we would have mixed together
the size of an element with the strides of the innermost dimension.

llvm-svn: 209691

a6e58605

Don't pre-populate the set of keys in the map with variable locations history. · f0e0cca0

Alexey Samsonov authored May 27, 2014

Current implementation of calculateDbgValueHistory already creates the
keys in the expected order (user variables are listed in order of appearance),
and should do so later by contract.

No functionality change.

llvm-svn: 209690

f0e0cca0

No need for those tests to go thru llvm-as and/or llvm-dis. · de5ff268
Arnaud A. de Grandmaison authored May 27, 2014
```
opt can handle them by itself.

llvm-svn: 209689
```
de5ff268

May 27, 2014

Factor out comparison of Instruction "special" states. · 6a90dc4f
Arnaud A. de Grandmaison authored May 27, 2014
```
No functional change.

llvm-svn: 209688
```
6a90dc4f
Wording fix for llvm.global_dtors docs. · bffbcc59
Reid Kleckner authored May 27, 2014
```
llvm-svn: 209687
```
bffbcc59

DebugInfo: partially revert cleanup committed in r209680 · 6900674a

David Blaikie authored May 27, 2014

I'm not sure exactly where/how we end up with an abstract DbgVariable
with a null DIE, but we do... looking into it & will add a test and/or
fix when I figure it out.

Currently shows up in selfhost or compiler-rt builds.

llvm-svn: 209683

6900674a

DebugInfo: Simplify solution to avoid DW_AT_artificial on inlined parameters. · b85f0080

David Blaikie authored May 27, 2014

Originally committed in r207717, I clearly didn't look very closely at
the code to understand how existing things were working...

llvm-svn: 209680

b85f0080

[mips] Optimize long branch for MIPS64 by removing %higher and %highest. · e41db2fe

Sasa Stankovic authored May 27, 2014

%higher and %highest can have non-zero values only for offsets greater
than 2GB, which is highly unlikely, if not impossible when compiling a
single function. This makes long branch for MIPS64 3 instructions smaller.

Differential Revision: http://llvm-reviews.chandlerc.com/D3281.diff

llvm-svn: 209678

e41db2fe

DebugInfo: Create abstract function definitions even when concrete definitions... · 482097d0

David Blaikie authored May 27, 2014

DebugInfo: Create abstract function definitions even when concrete definitions preceed inline definitions.

After much puppetry, here's the major piece of the work to ensure that
even when a concrete definition preceeds all inline definitions, an
abstract definition is still created and referenced from both concrete
and inline definitions.

Variables are still broken in this case (see comment in
dbg-value-inlined-parameter.ll test case) and will be addressed in
follow up work.

llvm-svn: 209677

482097d0

DebugInfo: Avoid an extra map lookup when finding abstract subprogram DIEs. · 2910f620
David Blaikie authored May 27, 2014
```
llvm-svn: 209676
```
2910f620

DebugInfo: Lazily construct subprogram definition DIEs. · 3c2fff3f

David Blaikie authored May 27, 2014

A further step to correctly emitting concrete out of line definitions
preceeding inlined instances of the same program.

To do this, emission of subprograms must be delayed until required since
we don't know which (abstract only (if there's no out of line
definition), concrete only (if there are no inlined instances), or both)
DIEs are required at the start of the module.

To reduce the test churn in the following commit that actually fixes the
bug, this commit introduces the lazy DIE construction and cleans up test
cases that are impacted by the changes in the resulting DIE ordering.

llvm-svn: 209675

3c2fff3f

DebugInfo: Lazily attach definition attributes to definitions. · f7221adb

David Blaikie authored May 27, 2014

This is a precursor to fixing inlined debug info where the concrete,
out-of-line definition may preceed any inlined usage. To cope with this,
the attributes that may appear on the concrete definition or the
abstract definition are delayed until the end of the module. Then, if an
abstract definition was created, it is referenced (and no other
attributes are added to the out-of-line definition), otherwise the
attributes are added directly to the out-of-line definition.

In a couple of cases this causes not just reordering of attributes, but
reordering of types. When the creation of the attribute is delayed, if
that creation would create a type (such as for a DW_AT_type attribute)
then other top level DIEs may've been constructed during the delay,
causing the referenced type to be created and added after those
intervening DIEs. In the extreme case, in cross-cu-inlining.ll, this
actually causes the DW_TAG_basic_type for "int" to move from one CU to
another.

llvm-svn: 209674

f7221adb

DebugInfo: Separate out the addition of subprogram attribute additions so that... · 7f91686f

David Blaikie authored May 27, 2014

DebugInfo: Separate out the addition of subprogram attribute additions so that they can be added later depending on whether or not the function is inlined.

llvm-svn: 209673

7f91686f

Fixed a test in r209670 · 76cbea6b
Jingyue Wu authored May 27, 2014
```
The test was outdated with r209537.

llvm-svn: 209671
```
76cbea6b

Distribute sext/zext to the operands of and/or/xor · 80a738dc

Jingyue Wu authored May 27, 2014

This is an enhancement to SeparateConstOffsetFromGEP. With this patch, we can
extract a constant offset from "s/zext and/or/xor A, B".

Added a new test @ext_or to verify this enhancement.

Refactoring the code, I also extracted some common logic to function
Distributable. 

llvm-svn: 209670

80a738dc

DebugInfo: Fix argument ordering in test by adding argument numbering. · 66c30774

David Blaikie authored May 27, 2014

This old test didn't have the argument numbering that's now squirelled
away in the high bits of the line number in the DW_TAG_arg_variable
metadata.

Add the numbering and update the test to ensure arguments are in-order.

llvm-svn: 209669

66c30774

Post-commit fixes for r209643 · e8d6a1e8

Filipe Cabecinhas authored May 27, 2014

Detected by Daniel Jasper, Ilia Filippov, and Andrea Di Biagio
Fixed the argument order to select (the mask semantics to blendv* are the
inverse of select) and fixed the tests
Added parenthesis to the assert condition
Ran clang-format

llvm-svn: 209667

e8d6a1e8

AArch64: add test for NZCV cross-copy save. · 9a6217b6
Tim Northover authored May 27, 2014
```
llvm-svn: 209665
```
9a6217b6
AArch64: add AArch64-specific test for 'c' and 'n'. · de9402d3
Tim Northover authored May 27, 2014
```
llvm-svn: 209664
```
de9402d3

[PATCH] Correct type used for VADD_SPLAT optimization on PowerPC · 71dddd51

Bill Schmidt authored May 27, 2014

In PPCISelLowering.cpp: PPCTargetLowering::LowerBUILD_VECTOR(), there
is an optimization for certain patterns to generate one or two vector
splats followed by a vector add or subtract.  This operation is
represented by a VADD_SPLAT in the selection DAG.  Prior to this
patch, it was possible for the VADD_SPLAT to be assigned the wrong
data type, causing incorrect code generation.  This patch corrects the
problem.

Specifically, the code previously assigned the value type of the
BUILD_VECTOR node to the newly generated VADD_SPLAT node.  This is
correct much of the time, but not always.  The problem is that the
call to isConstantSplat() may return a SplatBitSize that is not the
same as the number of bits in the original element vector type.  The
correct type to assign is a vector type with the same element bit size
as SplatBitSize.

The included test case shows an example of this, where the
BUILD_VECTOR node has a type of v16i8.  The vector to be built is {0,
16, 0, 16, 0, 16, 0, 16, 0, 16, 0, 16, 0, 16, 0, 16}.  isConstantSplat
detects that we can generate a splat of 16 for type v8i16, which is
the type we must assign to the VADD_SPLAT node.  If we do not, we
generate a vspltisb of 8 and a vaddubm, which generates the incorrect
result {16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16,
16}.  The correct code generation is a vspltish of 8 and a vadduhm.

This patch also corrected code generation for
CodeGen/PowerPC/2008-07-10-SplatMiscompile.ll, which had been marked
as an XFAIL, so we can remove the XFAIL from the test case.

llvm-svn: 209662

71dddd51

[mips][mips64r6] Add Relocations R_MIPS_PCHI16, R_MIPS_PCLO16 · b355e8f6
Zoran Jovanovic authored May 27, 2014
```
Differential Revision: http://reviews.llvm.org/D3860

llvm-svn: 209659
```
b355e8f6
[ARM] Emit correct build attributes for the relocation models. · ceeb1c48
Amara Emerson authored May 27, 2014
```
Patch by Asiri Rathnayake.

llvm-svn: 209656
```
ceeb1c48
[mips][mips64r6] Add relocations R_MIPS_PC21_S2, R_MIPS_PC26_S2 · 10e06da0
Zoran Jovanovic authored May 27, 2014
```
Differential Revision: http://reviews.llvm.org/D3824

llvm-svn: 209655
```
10e06da0
[asancov] Emit an initializer passing number of coverage code locations in each module. · 47b1a95f
Evgeniy Stepanov authored May 27, 2014
```
llvm-svn: 209654
```
47b1a95f

AArch64: implement copies to/from NZCV as a last ditch effort. · 1bed9afd

Tim Northover authored May 27, 2014

A test in test/Generic creates a DAG where the NZCV output of an ADCS is used
by multiple nodes. This makes LLVM want to save a copy of NZCV for later, which
it couldn't do before.

This should be the last fix required for the aarch64 buildbot.

llvm-svn: 209651

1bed9afd

ARM: teach AAPCS-VFP to deal with Cortex-M4. · 4f1909f1

Tim Northover authored May 27, 2014

Cortex-M4 only has single-precision floating point support, so any LLVM
"double" type will have been split into 2 i32s by now. Fortunately, the
consecutive-register framework turns out to be precisely what's needed to
reconstruct the double and follow AAPCS-VFP correctly!

rdar://problem/17012966

llvm-svn: 209650

4f1909f1

Fix bad assert. · 73458c95
Daniel Jasper authored May 27, 2014
```
llvm-svn: 209648
```
73458c95

AArch64: support 'c' and 'n' inline asm modifiers. · 4719041d

Tim Northover authored May 27, 2014

These are tested by test/CodeGen/Generic, so we should probably know
how to deal with them. Fortunately generic code does it if asked.

llvm-svn: 209646

4719041d

Adding testcase for PR18886. · c0e67033
Dinesh Dwivedi authored May 27, 2014
```
Differential Revision: http://reviews.llvm.org/D3837

llvm-svn: 209645
```
c0e67033

Convert some X86 blendv* intrinsics into IR. · 82ac07c2

Filipe Cabecinhas authored May 27, 2014

Summary:
Implemented an InstCombine transformation that takes a blendv* intrinsic
call and translates it into an IR select, if the mask is constant.

This will eventually get lowered into blends with immediates if possible,
or pblendvb (with an option to further optimize if we can transform the
pblendvb into a blend+immediate instruction, depending on the selector).
It will also enable optimizations by the IR passes, which give up on
sight of the intrinsic.

Both the transformation and the lowering of its result to asm got shiny
new tests.

The transformation is a bit convoluted because of blendvp[sd]'s
definition:

Its mask is a floating point value! This forces us to convert it and get
the highest bit. I suppose this happened because the mask has type
__m128 in Intel's intrinsic and v4sf (for blendps) in gcc's builtin.

I will send an email to llvm-dev to discuss if we want to change this or
not.

Reviewers: grosbach, delena, nadav

Differential Revision: http://reviews.llvm.org/D3859

llvm-svn: 209643

82ac07c2