- Oct 15, 2013
-
Adrian Prantl authored
llvm-svn: 192731
-
Manman Ren authored
PR17309 llvm-svn: 192730
-
Michael Liao authored
- The type of the index used in extract_vector_elt or insert_vector_elt is supposed to be TLI.getVectorIdxTy(), which is the pointer type on most targets. It's better to truncate it (or zero-extend it, should that type ever change) to the mask element type to guarantee the types match, instead of asserting that they do. llvm-svn: 192722
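For illustration, a plain C++ analogue (a toy sketch, not the SelectionDAG code) of why the index width matters: when insert_vector_elt is lowered via a compare-and-select mask, the index arrives at pointer width (i64 here, per TLI.getVectorIdxTy()) and must be narrowed to the mask element type before the per-lane comparison.

```cpp
#include <cstdint>
#include <cstdio>

int main() {
  uint64_t idx = 5;                  // index at TLI.getVectorIdxTy() width
  uint16_t lane_idx = (uint16_t)idx; // trunc to the mask element type (i16)
  uint16_t vec[8] = {0, 1, 2, 3, 4, 5, 6, 7};
  uint16_t elt = 100;
  for (uint16_t lane = 0; lane < 8; ++lane) {
    uint16_t mask = (lane == lane_idx) ? 0xFFFFu : 0;  // per-lane mask
    vec[lane] = (vec[lane] & ~mask) | (elt & mask);    // vselect
  }
  for (int i = 0; i < 8; ++i)
    std::printf("%u ", vec[i]);  // 0 1 2 3 4 100 6 7
  std::printf("\n");
}
```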
-
Michael Liao authored
- Lower signed divisions by constant powers of 2 to target-independent DAG operators instead of target-dependent ones, to support them better on targets where the vector types are legal but shift operators on those types are illegal. E.g., on AVX, PSRAW is only available for <8 x i16> even though <16 x i16> is a legal type. llvm-svn: 192721
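A minimal sketch of what that expansion computes, in plain C++ rather than DAG-node construction: a signed division by 2^k (rounding toward zero) built only from the generic ADD, SRL, and SRA operators, which scale naturally to their vector forms. It assumes arithmetic right shift of negative values, which mainstream compilers provide.

```cpp
#include <cassert>
#include <cstdint>
#include <cstdio>

int32_t sdiv_pow2(int32_t x, unsigned k) {
  assert(k >= 1 && k <= 31);
  int32_t sign = x >> 31;                      // SRA: -1 if x < 0, else 0
  uint32_t bias = (uint32_t)sign >> (32 - k);  // SRL: 2^k - 1 if x < 0
  return (int32_t)((uint32_t)x + bias) >> k;   // ADD, then SRA by k
}

int main() {
  for (int32_t x : {37, -37, 1, -1, -64})
    std::printf("%d / 8 = %d (expected %d)\n", x, sdiv_pow2(x, 3), x / 8);
}
```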
-
Benjamin Kramer authored
llvm-svn: 192717
-
Pekka Jaaskelainen authored
llvm-svn: 192709
-
Pekka Jaaskelainen authored
MachineInstr::addOperand(). llvm-svn: 192707
-
Daniel Sanders authored
llvm-svn: 192699
-
Anders Waldenborg authored
Revert "Add AllTargetsBindings sublibrary" as it breaks cmake build on (atleast) windows and darwin. llvm-svn: 192697
-
Anders Waldenborg authored
This new library will be linked in when using the "all-targets" component and contains the LLVMInitializeAll* functions. This means that those functions will exist as real symbols in the shared library, and can therefore be called from bindings that use the shared library through FFI. llvm-svn: 192690
-
Richard Sandiford authored
llvm-svn: 192681
-
Job Noorman authored
llvm-svn: 192678
-
Craig Topper authored
Remove the x86_sse42_crc32_64_8 intrinsic. It has no functional difference from x86_sse42_crc32_32_8 and was not mapped to a clang builtin. I'm not sure why this form of the instruction is even called out explicitly in the docs. Also add AutoUpgrade support to convert it into the other intrinsic with an appropriate trunc and zext. llvm-svn: 192672
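A sketch of what the upgrade amounts to (the function name here is ours, for illustration): the CRC32 instruction only ever consumes the low 32 bits of its accumulator and zero-extends the result, so the removed 64-bit/8-bit intrinsic reduces to a trunc, the 32-bit form, and a zext.

```cpp
#include <cstdint>
#include <nmmintrin.h>  // SSE4.2 CRC32 intrinsics; build with -msse4.2

uint64_t crc32_64_8_upgraded(uint64_t crc, uint8_t v) {
  // trunc the accumulator, call the 32-bit form, zext the result
  return (uint64_t)_mm_crc32_u8((uint32_t)crc, v);
}
```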
-
Andrew Trick authored
rdar:15221834 False AVX register dependencies cause 5x slowdown on flops-5/6 and significant slowdown on several others. This was blocking the switch to MI-Sched. llvm-svn: 192669
-
Akira Hatanaka authored
parts of the accumulators and gets expanded post-RA. llvm-svn: 192667
-
Akira Hatanaka authored
of relying on AddedComplexity. llvm-svn: 192665
-
Akira Hatanaka authored
llvm-svn: 192663
-
Akira Hatanaka authored
llvm-svn: 192662
-
Akira Hatanaka authored
llvm-svn: 192661
-
Akira Hatanaka authored
llvm-svn: 192660
-
Michael Gottesman authored
Thanks to Shuxin Yang for catching this. llvm-svn: 192637
-
Quentin Colombet authored
through bitcast, ptrtoint, and inttoptr instructions. This is valid only if the related instructions are in the same basic block; otherwise we may reference variables that were not live across basic blocks, resulting in undefined virtual registers. The bug was exposed when both SDISel and FastISel were used within the same function, i.e., one basic block is issued with FastISel and another with SDISel, as demonstrated by the testcase. <rdar://problem/15192473> llvm-svn: 192636
-
Andrew Trick authored
This pass is needed to break false dependencies. Without it, unlucky register assignment can result in wild (5x) swings in performance. This pass was trying to handle AVX but not getting it right. AVX doesn't have partial register defs; it has unused register reads, in which the high bits of a source operand are copied into the unused bits of the dest.

Fixing this requires conservative liveness analysis. This is awkward because the pass already has its own pseudo-liveness. However, proper liveness is expensive, and we would like to use a generic utility to compute it. The fix only invokes liveness on demand. It is rare to detect a case that needs undef-read dependence breaking, but when it happens, it can be needed many times within a very large block.

I think the existing heuristic, which uses a register window of 16, is too conservative for loop-carried false dependencies: if the loop is a reduction, the out-of-order engine may be able to execute several loop iterations in parallel. However, I'll leave this tuning exercise for next time. llvm-svn: 192635
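For illustration only, a hand-written example of the kind of pattern involved (GCC/Clang extended asm, AT&T syntax; running it requires an AVX-capable CPU). vcvtsi2ss writes just the low lane and copies the upper bits from its xmm source, so without the vxorps a chain of these in a loop would stall on whatever last wrote xmm0; the dependency-breaking zero idiom severs that false dependence.

```cpp
#include <cstdio>

int main() {
  float out;
  int in = 42;
  asm("vxorps %%xmm0, %%xmm0, %%xmm0\n\t"  // dependency-breaking idiom
      "vcvtsi2ss %1, %%xmm0, %%xmm0\n\t"   // partial write / undef read
      "vmovss %%xmm0, %0"
      : "=m"(out)
      : "r"(in)
      : "xmm0");
  std::printf("%f\n", out);  // 42.000000
}
```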
-
Andrew Trick authored
llvm-svn: 192634
-
Andrew Trick authored
llvm-svn: 192633
-
- Oct 14, 2013
-
Eric Christopher authored
a) x86-64 TLS has been documented, and b) the code path should use movq for the correct relocation to be generated. I've also added a FIXME for the test case: we should improve the generated code; it should look something like what is documented in the TLS ABI document. llvm-svn: 192631
-
Eric Christopher authored
llvm-svn: 192630
-
Eric Christopher authored
llvm-svn: 192629
-
Andrew Trick authored
Clobbering is exclusive, not inclusive, on register units. For liveness, we need to consider all the preserved registers; e.g., a regmask that clobbers YMM0 may preserve XMM0. Units are only clobbered when all super-registers are clobbered. llvm-svn: 192623
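A toy model of that rule (the names and data layout here are ours, not TableGen's): a register unit is clobbered by a regmask only if every register containing that unit is clobbered; if any containing register is preserved, the unit stays live.

```cpp
#include <cstdio>
#include <map>
#include <set>
#include <string>
#include <vector>

int main() {
  // unit -> registers that contain it
  std::map<std::string, std::vector<std::string>> containing = {
      {"xmm0_lo", {"XMM0", "YMM0"}},  // low 128 bits, shared
      {"ymm0_hi", {"YMM0"}},          // high 128 bits, YMM0 only
  };
  std::set<std::string> clobbered_regs = {"YMM0"};  // mask preserves XMM0

  for (const auto &[unit, regs] : containing) {
    bool all_clobbered = true;
    for (const auto &r : regs)
      if (!clobbered_regs.count(r)) all_clobbered = false;
    std::printf("%s: %s\n", unit.c_str(),
                all_clobbered ? "clobbered" : "preserved");
  }
  // Output: xmm0_lo: preserved / ymm0_hi: clobbered
}
```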
-
Andrew Trick authored
Some clients may add block live ins and may track liveness over a large scope. This guarantees an efficient implementation in all cases with no memory allocation/deallocation, independent of the number of target registers. It could be slightly less convenient but is fine in the expected case. llvm-svn: 192622
-
Andrew Trick authored
llvm-svn: 192621
-
Andrew Trick authored
llvm-svn: 192619
-
Manman Ren authored
Clean up creation of static member DIEs. We can create static member DIEs from two places, so we call getOrCreateStaticMemberDIE from both. getOrCreateStaticMemberDIE will get or create the context DIE first; then it checks whether the DIE already exists, and if not, we create the static member DIE and add it to the context. Creation of static member DIEs is handled in a similar way to subprogram DIEs. llvm-svn: 192618
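A minimal sketch of that get-or-create shape (the types and names below are illustrative, not DwarfDebug's actual API): resolve or create the context DIE first, then return the cached DIE if one exists, otherwise create it and attach it to that context.

```cpp
#include <map>
#include <memory>
#include <string>
#include <vector>

struct DIE {
  std::string name;
  std::vector<DIE *> children;
};

std::map<std::string, std::unique_ptr<DIE>> DIECache;

DIE *getOrCreate(const std::string &name) {
  auto &slot = DIECache[name];
  if (!slot) slot = std::make_unique<DIE>(DIE{name, {}});
  return slot.get();
}

DIE *getOrCreateStaticMemberDIE(const std::string &scope,
                                const std::string &member) {
  DIE *context = getOrCreate(scope);   // get or create the context DIE first
  std::string key = scope + "::" + member;
  if (DIECache.count(key))             // DIE already exists?
    return DIECache[key].get();
  DIE *die = getOrCreate(key);         // create the static member DIE ...
  context->children.push_back(die);    // ... and add it to the context
  return die;
}

int main() {
  DIE *a = getOrCreateStaticMemberDIE("Foo", "kTable");
  DIE *b = getOrCreateStaticMemberDIE("Foo", "kTable");  // same DIE back
  return a == b ? 0 : 1;
}
```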
-
David Blaikie authored
That wasn't confusing /at all/... llvm-svn: 192617
-
Will Dietz authored
Per the original comment, the intention of this loop is to go ahead and break the critical edge (in order to sink this instruction) if there's reason to believe doing so might "unblock" the sinking of additional instructions that define registers used by this one. The idea is that if we have a few instructions to sink "together", breaking the edge might be worthwhile. This commit makes a few small changes to help better realize this goal.

First, modify the loop to ignore registers defined by this instruction. We don't sink definitions of physical registers, and sinking an SSA definition isn't going to unblock an upstream instruction.

Second, ignore uses of physical registers. Instructions that define physical registers are rejected for sinking, so moving this one won't enable moving any defining instructions. As an added bonus, while virtual register use-def chains are generally small due to SSA goodness, iterating over the uses and definitions (as hasOneNonDBGUse does) for physical registers like EFLAGS can be rather expensive in practice. (This is the original reason for looking at this.)

Finally, to keep things simple, continue to only consider this trick for registers that have a single use (via hasOneNonDBGUse), but to avoid spuriously breaking critical edges, only do so if the definition resides in the same MBB and therefore this instruction directly blocks it from being sunk as well. If sinking them together is meant to be, let the iterative nature of this pass sink the definition into this block first.

Update tests to accommodate this change; add a new testcase where sinking avoids pipeline stalls. llvm-svn: 192608
-
Rafael Espindola authored
They were leftover from the old profiling support. Patch by Alastair Murray. llvm-svn: 192605
-
Rafael Espindola authored
llvm-svn: 192604
-
Chris Lattner authored
avoid a heap allocation when this is the case. llvm-svn: 192602
-
Evgeniy Stepanov authored
Currently MSan checks that the arguments of *cvt* intrinsics are fully initialized. That's too much to ask: some of them only operate on the lower half, or even quarter, of the input register. llvm-svn: 192599
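An example of the class of intrinsics in question (this is our illustration, not MSan's code): cvtps2pd converts only the lower half (two floats) of its input, so the upper two lanes may legitimately be uninitialized and shouldn't trip the checker. Here they are just set to arbitrary junk to show they don't affect the result.

```cpp
#include <cstdio>
#include <emmintrin.h>  // SSE2, baseline on x86-64

int main() {
  __m128 a = _mm_setr_ps(1.5f, 2.5f, -777.0f, 999.0f);  // upper half unused
  __m128d d = _mm_cvtps_pd(a);  // reads lanes 0-1 only
  double lo[2];
  _mm_storeu_pd(lo, d);
  std::printf("%f %f\n", lo[0], lo[1]);  // 1.500000 2.500000
}
```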
-
Chad Rosier authored
llvm-svn: 192596
-