Commits · 2f6dfc7d0b6bb82ee561dc919218aef4fe651509 · Roger Ferrer / llvm-epi-0.8

Oct 13, 2012

Allow for loops in LiveIntervals::pruneValue(). · 2f6dfc7d

Jakob Stoklund Olesen authored Oct 13, 2012

It is possible that the live range of the value being pruned loops back
into the kill MBB where the search started. When that happens, make sure
that the beginning of KillMBB is also pruned.

Instead of starting a DFS at KillMBB and skipping the root of the
search, start a DFS at each KillMBB successor, and allow the search to
loop back to KillMBB.

This fixes PR14078.

llvm-svn: 165872

2f6dfc7d

X86: Fix accidentally swapped operands. · ecd15d7f
Benjamin Kramer authored Oct 13, 2012
```
llvm-svn: 165871
```
ecd15d7f

Teach SROA to cope with wrapper aggregates. These show up a lot in ABI · ba931992

Chandler Carruth authored Oct 13, 2012

type coercion code, especially when targetting ARM. Things like [1
x i32] instead of i32 are very common there.

The goal of this logic is to ensure that when we are picking an alloca
type, we look through such wrapper aggregates and across any zero-length
aggregate elements to find the simplest type possible to form a type
partition.

This logic should (generally speaking) rarely fire. It only ends up
kicking in when an alloca is accessed using two different types (for
instance, i32 and float), and the underlying alloca type has wrapper
aggregates around it. I noticed a significant amount of this occurring
looking at stepanov_abstraction generated code for arm, and suspect it
happens elsewhere as well.

Note that this doesn't yet address truly heinous IR productions such as
PR14059 is concerning. Those result in mismatched *sizes* of types in
addition to mismatched access and alloca types.

llvm-svn: 165870

ba931992

Speculatively harden the conversion logic. I have no idea if this will · 482c6178

Chandler Carruth authored Oct 13, 2012

help the dragonegg builders, and no test case at this point, but this
was one dimly plausible case I spotted by inspection. Hopefully will get
a testcase from those bots soon-ish, and will tidy this up with proper
testing.

llvm-svn: 165869

482c6178

X86: Promote i8 cmov when both operands are coming from truncates of the same width. · d6b9362f

Benjamin Kramer authored Oct 13, 2012

X86 doesn't have i8 cmovs so isel would emit a branch. Emitting branches at this
level is often not a good idea because it's too late for many optimizations to
kick in. This solution doesn't add any extensions (truncs are free) and tries
to avoid introducing partial register stalls by filtering direct copyfromregs.

I'm seeing a ~10% speedup on reading a random .png file with libpng15 via
graphicsmagick on x86_64/westmere, but YMMV depending on the microarchitecture.

llvm-svn: 165868

d6b9362f

Silence a warning in -assert builds. · 0fb8a778
Chandler Carruth authored Oct 13, 2012
```
llvm-svn: 165867
```
0fb8a778

Clean up how we rewrite loads and stores to the whole alloca. When these · 891fec0b

Chandler Carruth authored Oct 13, 2012

are single value types, the load and store should be directly based upon
the alloca and then bitcasting can fix the type as needed afterward.
This might in theory improve some of the IR coming out of SROA, but
I don't expect big changes yet and don't have any test cases on hand.
This is really just a cleanup/refactoring patch. The next patch will
cause this code path to be hit a lot more, actually get SROA to promote
more allocas and include several more test cases.

llvm-svn: 165864

891fec0b

[ms-inline asm] Remove the MatchInstruction() function. Previously, this was · 49963555

Chad Rosier authored Oct 13, 2012

the interface between the front-end and the MC layer when parsing inline
assembly.  Unfortunately, this is too deep into the parsing stack. Specifically,
we're unable to handle target-independent assembly (i.e., assembly directives,
labels, etc.).  Note the MatchAndEmitInstruction() isn't the correct
abstraction either.  I'll be exposing target-independent hooks shortly, so this
is really just a cleanup.

llvm-svn: 165858

49963555

Check section type rather than assuming it's code when emitting sections while... · 4732872b
Andrew Kaylor authored Oct 12, 2012
```
Check section type rather than assuming it's code when emitting sections while processing relocations.

llvm-svn: 165854
```
4732872b

ARM: tail-call inside a function where part of a byval argument is on caller's · 7e48b252

Manman Ren authored Oct 12, 2012

local frame causes problem.

For example:
void f(StructToPass s) {
  g(&s, sizeof(s));
}
will cause problem with tail-call since part of s is passed via registers and
saved in f's local frame. When g tries to access s, part of s may be corrupted
since f's local frame is popped out before the tail-call.

The current fix is to disable tail-call if getVarArgsRegSaveSize is not 0 for
the caller. This is a conservative approach, if we can prove the address of
s or part of s is not taken and passed to g, it should be okay to perform
tail-call.

rdar://12442472

llvm-svn: 165853

7e48b252

[ms-inline asm] Capitalize per coding standard. · 4453e845
Chad Rosier authored Oct 12, 2012
```
llvm-svn: 165847
```
4453e845

ARM: Mark VSELECT as 'expand'. · 30af442a

Jim Grosbach authored Oct 12, 2012

The backend already pattern matches to form VBSL when it can. We may want to
teach it to use the vbsl intrinsics at some point to prevent machine licm from
mucking with this, but using the Expand is completely correct.

http://llvm.org/bugs/show_bug.cgi?id=13831
http://llvm.org/bugs/show_bug.cgi?id=13961

Patch by Peter Couperus <peter.couperus@st.com>.

llvm-svn: 165845

30af442a

[ms-inline asm] Use the new API introduced in r165830 in lieu of the · 2f480a8a
Chad Rosier authored Oct 12, 2012
```
MapAndConstraints vector.  Also remove the unused Kind argument.

llvm-svn: 165833
```
2f480a8a

Oct 12, 2012

Use a transposed algorithm for handleMove(). · 1a87a29d

Jakob Stoklund Olesen authored Oct 12, 2012

Completely update one interval at a time instead of collecting live
range fragments to be updated. This avoids building data structures,
except for a single SmallPtrSet of updated intervals.

Also share code between handleMove() and handleMoveIntoBundle().

Add support for moving dead defs across other live values in the
interval. The MI scheduler can do that.

llvm-svn: 165824

1a87a29d

Fix coalescing with IMPLICIT_DEF values. · 1a3eb878

Jakob Stoklund Olesen authored Oct 12, 2012

PHIElimination inserts IMPLICIT_DEF instructions to guarantee that all
PHI predecessors have a live-out value. These IMPLICIT_DEF values are
not considered to be real interference when coalescing virtual
registers:

  %vreg1 = IMPLICIT_DEF
  %vreg2 = MOV32r0

When joining %vreg1 and %vreg2, the IMPLICIT_DEF instruction and its
value number should simply be erased since the %vreg2 value number now
provides a live-out value for the PHI predecesor block.

llvm-svn: 165813

1a3eb878

Fix big-endian codegen bug in DAGTypeLegalizer::ExpandRes_BITCAST · 9aa51d1a

Ulrich Weigand authored Oct 12, 2012

On PowerPC, a bitcast of <16 x i8> to i128 may run through a code
path in ExpandRes_BITCAST that attempts to do an intermediate
bitcast to a <4 x i32> vector, and then construct the Hi and Lo parts
of the resulting i128 by pairing up two of those i32 vector elements
each. The code already recognizes that on a big-endian system, the
first two vector elements form the Hi part, and the final two vector
elements form the Lo part (vice-versa from the little-endian situation).

However, we also need to take endianness into account when forming each
of those separate pairs: on a big-endian system, vector element 0 is
the *high* part of the pair making up the Hi part of the result, and
vector element 1 is the low part of the pair. The code currently always
uses vector element 0 as the low part and vector element 1 as the high
part, as is appropriate for little-endian platforms only.

This patch fixes this by swapping the vector elements as they are
paired up as appropriate.

llvm-svn: 165802

9aa51d1a

Add powerpc-ibm-aix to Triple. Patch by Kai. · d5772de0
Duncan Sands authored Oct 12, 2012
```
llvm-svn: 165792
```
d5772de0
Indenting. · ca2ff70e
Eric Christopher authored Oct 12, 2012
```
llvm-svn: 165785
```
ca2ff70e

fix warning · e9623261

Sebastian Pop authored Oct 12, 2012

DependenceAnalysis.cpp:1164:32: warning: implicit truncation from 'int' to bitfield changes value from -5 to 3
      [-Wconstant-conversion]
    Result.DV[Level].Direction &= ~Dependence::DVEntry::GT;
                               ^  ~~~~~~~~~~~~~~~~~~~~~~~~

Patch from Preston Briggs <preston.briggs@gmail.com>.

llvm-svn: 165784

e9623261

Div, Rem int/unsigned int · cf11c59e
Reed Kotler authored Oct 12, 2012
```
llvm-svn: 165783
```
cf11c59e

Legalizer optimize a pair of div / mod to a call to divrem libcall if they are · 21c4adcd

Evan Cheng authored Oct 12, 2012

not legal. However, it should use a div instruction + mul + sub if divide is
legal. The rem legalization code was missing a check and incorrectly uses a
divrem libcall even when div is legal.

rdar://12481395

llvm-svn: 165778

21c4adcd

Remove unnecessary classof()'s · 506a1c5a

Sean Silva authored Oct 11, 2012

isa<> et al. automatically infer when the cast is an upcast (including a
self-cast), so these are no longer necessary.

llvm-svn: 165767

506a1c5a

Remove buggy classof(). · bead14e9

Sean Silva authored Oct 11, 2012

This classof() is effectively saying that a MachineCodeEmitter "is-a"
JITEmitter, but JITEmitter is in fact a descendant of
MachineCodeEmitter, so this is not semantically correct. Consequently,
none of the assertions that rely on these classof() actualy check
anything.

Remove the RTTI (which didn't actually check anything) and use
static_cast<> instead.

Post-Mortem Bug Analysis
========================

Cause of the bug
----------------

r55022 appears to be the source of the classof() and assertions removed
by this commit. It aimed at removing some dynamic_cast<> that were
solely in the assertions. A typical diff hunk from that commit looked
like:

  -  assert(dynamic_cast<JITEmitter*>(MCE) && "Unexpected MCE?");
  -  JITEmitter *JE = static_cast<JITEmitter*>(getCodeEmitter());
  +  assert(isa<JITEmitter>(MCE) && "Unexpected MCE?");
  +  JITEmitter *JE = cast<JITEmitter>(getCodeEmitter());

Hence, the source of the bug then seems to be an attempt to replace
dynamic_cast<> with LLVM-style RTTI without properly setting up the
class hierarchy for LLVM-style RTTI. The bug therefore appears to be
simply a "thinko".

What initially indicated the presence of the bug
------------------------------------------------

After implementing automatic upcasting for isa<>, classof() functions of
the form

  static bool classof(const Foo *) { return true; }

were removed, since they only serve the purpose of optimizing
statically-OK upcasts. A subsequent recompilation triggered a build
failure on the isa<> tests within the removed asserts, since the
automatic upcasting (correctly) failed to substitute this classof().

Key to pinning down the root cause of the bug
---------------------------------------------

After being alerted to the presence of the bug, some thought about the
semantics which were being asserted by the buggy classof() revealed that
it was incorrect.

How the bug could have been prevented
-------------------------------------

This bug could have been prevented by better documentation for how to
set up LLVM-style RTTI. This should be solved by the recently added
documentation HowToSetUpLLVMStyleRTTI. However, this bug suggests that
the documentation should clearly explain the contract that classof()
must fulfill. The HowToSetUpLLVMStyleRTTI already explains this
contract, but it is a little tucked away. A future patch will expand
that explanation and make it more prominent.

There does not appear to be a simple way to have the compiler prevent
this bug, since fundamentally it boiled down to a spurious classof()
where the programmer made an erroneous statement about the conversion.
This suggests that perhaps the interface to LLVM-style RTTI of classof()
is not the best. There is already some evidence for this, since in a
number of places Clang has classof() forward to classofKind(Kind K)
which evaluates the cast in terms of just the Kind. This could probably
be generalized to simply a `static const Kind MyKind;` field in leaf
classes and `static const Kind firstMyKind, lastMyKind;` for non-leaf
classes, and have the rest of the work be done inside Casting.h,
assuming that the Kind enum is laid out in a preorder traversal of the
inheritance tree.

llvm-svn: 165764

bead14e9

PGO: create metadata for switch only if it has more than one targets. · 97c18762

Manman Ren authored Oct 11, 2012

When all cases of a switch statement are dead, the weights vector only has one
element, and we will get an ssertion failure when calling createBranchWeights.

llvm-svn: 165759

97c18762

Oct 11, 2012

Fix some typos 165739, spotted by Duncan. · 8c9e9411
Jan Wen Voung authored Oct 11, 2012
```
llvm-svn: 165753
```
8c9e9411
Revert 165732 for further review. · 0c61134d
Micah Villmow authored Oct 11, 2012
```
llvm-svn: 165747
```
0c61134d

Change encoding of instruction operands in bitcode binaries to be relative · afaced07

Jan Wen Voung authored Oct 11, 2012

to the instruction position.  The old encoding would give an absolute
ID which counts up within a function, and only resets at the next function.

I.e., Instead of having:

... = icmp eq i32 n-1, n-2
br i1 ..., label %bb1, label %bb2

it will now be roughly:

... = icmp eq i32 1, 2
br i1 1, label %bb1, label %bb2

This makes it so that ids remain relatively small and can be encoded
in fewer bits.

With this encoding, forward reference operands will be given
negative-valued IDs.  Use signed VBRs for the most common case
of forward references, which is phi instructions.

To retain backward compatibility we bump the bitcode version
from 0 to 1 to distinguish between the different encodings.

llvm-svn: 165739

afaced07

Add in the first iteration of support for llvm/clang/lldb to allow variable... · 08318973

Micah Villmow authored Oct 11, 2012

Add in the first iteration of support for llvm/clang/lldb to allow variable per address space pointer sizes to be optimized correctly.

llvm-svn: 165726

08318973

Pass an explicit operand number to addLiveIns. · d0d7860f

Jakob Stoklund Olesen authored Oct 11, 2012

Not all instructions define a virtual register in their first operand.
Specifically, INLINEASM has a different format.

<rdar://problem/12472811>

llvm-svn: 165721

d0d7860f

This patch addresses PR13947. · 22162470

Bill Schmidt authored Oct 11, 2012

For function calls on the 64-bit PowerPC SVR4 target, each parameter
is mapped to as many doublewords in the parameter save area as
necessary to hold the parameter.  The first 13 non-varargs
floating-point values are passed in registers; any additional
floating-point parameters are passed in the parameter save area.  A
single-precision floating-point parameter (32 bits) must be mapped to
the second (rightmost, low-order) word of its assigned doubleword
slot.

Currently LLVM violates this ABI requirement by mapping such a
parameter to the first (leftmost, high-order) word of its assigned
doubleword slot.  This is internally self-consistent but will not
interoperate correctly with libraries compiled with an ABI-compliant
compiler.

This patch corrects the problem by adjusting the parameter addressing
on both sides of the calling convention.

llvm-svn: 165714

22162470

Expose move to/from coprocessor instructions in MIPS64 mode. · 6a00ab4b

David Chisnall authored Oct 11, 2012

Note: [D]M{T,F}CP2 is just a recommended encoding. Vendors often provide a
custom CP2 that interprets instructions differently and may wish to add their
own instructions that use this opcode. We should ensure that this is easy to
do. I will probably add a 'has custom CP{0-3}' subtarget flag to make this
easy: We want to avoid the GCC situation where every MIPS vendor makes a custom
fork that breaks every other MIPS CPU and so can't be merged upstream.

llvm-svn: 165711

6a00ab4b

dependence analysis · 59b61b9e

Sebastian Pop authored Oct 11, 2012

Patch from Preston Briggs <preston.briggs@gmail.com>.

This is an updated version of the dependence-analysis patch, including an MIV
test based on Banerjee's inequalities.

It's a fairly complete implementation of the paper

    Practical Dependence Testing
    Gina Goff, Ken Kennedy, and Chau-Wen Tseng
    PLDI 1991

It cannot yet propagate constraints between coupled RDIV subscripts (discussed
in Section 5.3.2 of the paper).

It's organized as a FunctionPass with a single entry point that supports testing
for dependence between two instructions in a function. If there's no dependence,
it returns null. If there's a dependence, it returns a pointer to a Dependence
which can be queried about details (what kind of dependence, is it loop
independent, direction and distance vector entries, etc). I haven't included
every imaginable feature, but there's a good selection that should be adequate
for supporting many loop transformations. Of course, it can be extended as
necessary.

Included in the patch file are many test cases, commented with C code showing
the loops and array references.

llvm-svn: 165708

59b61b9e

Follow the same routine to add target float expansion hook · 6b49c2f6
Michael Liao authored Oct 11, 2012
```
llvm-svn: 165707
```
6b49c2f6
misched: Handle "transient" non-instructions. · 5f35afb0
Andrew Trick authored Oct 11, 2012
```
llvm-svn: 165701
```
5f35afb0
Don't crash when !tbaa.struct contents is invalid. · 49ac81ac
Nick Lewycky authored Oct 11, 2012
```
llvm-svn: 165693
```
49ac81ac
Revert r165661, "Patch by Shuxin Yang <shuxin.llvm@gmail.com>." · da0730c2
NAKAMURA Takumi authored Oct 11, 2012
```
It broke stage2 clang and test-suite/MultiSource/Benchmarks/mediabench/g721/g721encode.

llvm-svn: 165692
```
da0730c2
Add the LLVM context to this c'tor. It will be needed in the future. · 9d7ba8da
Bill Wendling authored Oct 11, 2012
```
llvm-svn: 165687
```
9d7ba8da
Support a common idiom on how to build an Attributes class with a single attribute. · 13410278
Bill Wendling authored Oct 11, 2012
```
llvm-svn: 165686
```
13410278

Don't crash if a .ll file contains a forward-reference that looks like a global · 686d7cbf

Nick Lewycky authored Oct 11, 2012

value but later turns out to be a function.

Unfortunately, we can't fold tests into a single file because we only get one
error out of llvm-as.

llvm-svn: 165680

686d7cbf

Change MachineInstrBuilder::addDisp to copy over target flags by default. · 60a25a57
Evan Cheng authored Oct 11, 2012
```
llvm-svn: 165677
```
60a25a57