Commits · 6bae2a57d5b7ef62ca950e62da2eaf01aeebfef9 · Roger Ferrer / llvm-epi-0.8

Aug 21, 2012

Fix a quadratic algorithm in MachineBranchProbabilityInfo. · 6bae2a57

Jakob Stoklund Olesen authored Aug 20, 2012

The getSumForBlock function was quadratic in the number of successors
because getSuccWeight would perform a linear search for an already known
iterator.

This patch was originally committed as r161460, but reverted again
because of assertion failures. Now that duplicate Machine CFG edges have
been eliminated, this works properly.

llvm-svn: 162233

6bae2a57

Aug 20, 2012

Don't add CFG edges for redundant conditional branches. · 7d33c573

Jakob Stoklund Olesen authored Aug 20, 2012

IR that hasn't been through SimplifyCFG can look like this:

  br i1 %b, label %r, label %r

Make sure we don't create duplicate Machine CFG edges in this case.

Fix the machine code verifier to accept conditional branches with a
single CFG edge.

llvm-svn: 162230

7d33c573

Add a verification pass after ExpandISelPseudos. · 1d026267

Jakob Stoklund Olesen authored Aug 20, 2012

This pass often has weird CFG hacks and hand-written MI building code
that can go wrong in many ways.

llvm-svn: 162224

1d026267

Add CFG checks to MachineVerifier. · de31b52c

Jakob Stoklund Olesen authored Aug 20, 2012

Verify that the predecessor and successor lists are consistent and free
of duplicates.

llvm-svn: 162223

de31b52c

Use a SmallPtrSet to dedup successors in EmitSjLjDispatchBlock. · 710093e3
Jakob Stoklund Olesen authored Aug 20, 2012
```
The test case ARM/2011-05-04-MultipleLandingPadSuccs.ll was creating
duplicate successor list entries.

llvm-svn: 162222
```
710093e3
fix HexagonSubtarget parsing of -mv flag · 1a0bef6d
Sebastian Pop authored Aug 20, 2012
```
llvm-svn: 162217
```
1a0bef6d
fix a case where all operands of BUILD_VECTOR are undefined · 10ff96ce
Michael Liao authored Aug 20, 2012
```
llvm-svn: 162214
```
10ff96ce
Fix coding style violations in 162135 and 162136. · 11dfbe19
Akira Hatanaka authored Aug 20, 2012
```
Patch by Petar Jovanovic.

llvm-svn: 162213
```
11dfbe19
DataExtractor: Fix integer truncation issues in LEB128 extraction. · 1b07ab51
Benjamin Kramer authored Aug 20, 2012
```
llvm-svn: 162201
```
1b07ab51

Fixed DAGCombiner bug (found and localized by James Malloy): · 6a638ec5

Stepan Dyatkovskiy authored Aug 20, 2012

The DAGCombiner tries to optimise a BUILD_VECTOR by checking if it
consists purely of get_vector_elts from one or two source vectors. If
so, it either makes a concat_vectors node or a shufflevector node.

However, it doesn't check the element type width of the underlying
vector, so if you have this sequence:

Node0: v4i16 = ...
Node1: i32 = extract_vector_elt Node0
Node2: i32 = extract_vector_elt Node0
Node3: v16i8 = BUILD_VECTOR Node1, Node2, ...

It will attempt to:

Node0:    v4i16 = ...
NewNode1: v16i8 = concat_vectors Node0, ...

Where this is actually invalid because the element width is completely
different. This causes an assertion failure on DAG legalization stage.

Fix:
If output item type of BUILD_VECTOR differs from input item type.
Make concat_vectors based on input element type and then bitcast it to the output vector type. So the case described above will transformed to:
Node0:    v4i16 = ...
NewNode1: v8i16 = concat_vectors Node0, ...
NewNode2: v16i8 = bitcast NewNode1

llvm-svn: 162195

6a638ec5

Remove FMA3 intrinsic instructions in favor of patterns. · b58eec4e
Craig Topper authored Aug 20, 2012
```
llvm-svn: 162194
```
b58eec4e
Use correct intrinsic for 256-bit VFMSUBADDPS. · 37eca549
Craig Topper authored Aug 20, 2012
```
llvm-svn: 162193
```
37eca549
Remove trailing white space and tab characters. No functional change. · 5122e9f1
Craig Topper authored Aug 19, 2012
```
llvm-svn: 162192
```
5122e9f1

Aug 19, 2012
- When unsafe math is used, we can use commutative FMAX and FMIN. In some cases · 178250ad
  Nadav Rotem authored Aug 19, 2012
```
this allows for better code generation.

Added a new DAGCombine transformation to convert FMAX and FMIN to FMANC and
FMINC, which are commutative.

For example:

  movaps  %xmm0, %xmm1
  movsd LC(%rip), %xmm0
  minsd %xmm1, %xmm0

becomes:

  minsd LC(%rip), %xmm0

llvm-svn: 162187
```
  178250ad
- Fabs folding is implemented. · fd4fe706
  Benjamin Kramer authored Aug 19, 2012
```
llvm-svn: 162186
```
  fd4fe706
- InstCombine: Fix a crasher when encountering a function pointer. · 9d03242f
  Benjamin Kramer authored Aug 18, 2012
```
llvm-svn: 162180
```
  9d03242f
Aug 18, 2012

Remove the CAND/COR/CXOR custom ISD nodes and their select code. · e1014e7b
Jakob Stoklund Olesen authored Aug 18, 2012
```
These nodes are no longer needed because the peephole pass can fold
CMOV+AND into ANDCC etc.

llvm-svn: 162179
```
e1014e7b

Remove virtual from many methods. These methods replace methods in the base... · fd1c9259

Craig Topper authored Aug 18, 2012

Remove virtual from many methods. These methods replace methods in the base class, but the base class methods aren't virtual so it just increased call overhead.

llvm-svn: 162178

fd1c9259

Also combine zext/sext into selects for ARM. · dded061f

Jakob Stoklund Olesen authored Aug 18, 2012

This turns common i1 patterns into predicated instructions:

  (add (zext cc), x) -> (select cc (add x, 1), x)
  (add (sext cc), x) -> (select cc (add x, -1), x)

For a function like:

  unsigned f(unsigned s, int x) {
    return s + (x>0);
  }

We now produce:

  cmp r1, #0
  it  gt
  addgt.w r0, r0, #1

Instead of:

  movs  r2, #0
  cmp r1, #0
  it  gt
  movgt r2, #1
  add r0, r2

llvm-svn: 162177

dded061f

Also pass logical ops to combineSelectAndUse. · aab43dbf

Jakob Stoklund Olesen authored Aug 18, 2012

Add these transformations to the existing add/sub ones:

  (and (select cc, -1, c), x) -> (select cc, x, (and, x, c))
  (or  (select cc, 0, c), x)  -> (select cc, x, (or, x, c))
  (xor (select cc, 0, c), x)  -> (select cc, x, (xor, x, c))

The selects can then be transformed to a single predicated instruction
by peephole.

This transformation will make it possible to eliminate the ISD::CAND,
COR, and CXOR custom DAG nodes.

llvm-svn: 162176

aab43dbf

Remove overly conservative hasOneUse check, this always expands into a single IR instruction. · 9282aef8
Benjamin Kramer authored Aug 18, 2012
```
llvm-svn: 162175
```
9282aef8
InstCombine: Add a couple of fabs identities for comparing with 0.0. · 8c2a733c
Benjamin Kramer authored Aug 18, 2012
```
llvm-svn: 162174
```
8c2a733c
SimplifyLibcalls: Add fabs and trunc to the list of libcalls that are safe to... · 00013245
Benjamin Kramer authored Aug 18, 2012
```
SimplifyLibcalls: Add fabs and trunc to the list of libcalls that are safe to shrink from double to float.

llvm-svn: 162173
```
00013245
Reapply r162160 with a fix: Optimize Arith->Trunc->SETCC sequence to allow... · a136939f
Nadav Rotem authored Aug 18, 2012
```
Reapply r162160 with a fix: Optimize Arith->Trunc->SETCC sequence to allow better compare/branch code.

llvm-svn: 162172
```
a136939f
fp16-to-fp32 conversion instructions are available in Thumb mode as well. · 1e28826a
Anton Korobeynikov authored Aug 18, 2012
```
Make sure the generic pattern is used.

llvm-svn: 162170
```
1e28826a
Refactor code a bit to reduce number of calls in the final compiled code. No... · 0128f9ba
Craig Topper authored Aug 18, 2012
```
Refactor code a bit to reduce number of calls in the final compiled code. No functional change intended.

llvm-svn: 162166
```
0128f9ba
Reorder initialization list to silence -Wreorder · 2bd9c7bd
Craig Topper authored Aug 18, 2012
```
llvm-svn: 162165
```
2bd9c7bd
Revert r162160 because it made a few buildbots fail. · c324af60
Nadav Rotem authored Aug 18, 2012
```
llvm-svn: 162164
```
c324af60

The X86 backend has a number of optimizations for SETCC nodes which use · 2cb14a5c

Nadav Rotem authored Aug 18, 2012

arithmetic instructions. However, when small data types are used, a truncate
node appears between the SETCC node and the arithmetic operation. This patch
adds support for this pattern.

Before:
  xorl  %esi, %edi
  testb %dil, %dil
  setne %al
  ret

After:
  xorb  %dil, %sil
  setne %al
  ret

rdar://12081007

llvm-svn: 162160

2cb14a5c

Make atomic load and store of pointers work. Tighten verification of atomic operations · 79a6b30d
Eli Friedman authored Aug 17, 2012
```
so other unexpected operations don't slip through.  Based on patch by Logan Chien.
PR11786/PR13186.

llvm-svn: 162146
```
79a6b30d

Aug 17, 2012

Fix undefined behavior (binding a reference to a dereferenced null pointer) if · 257c5f20
Richard Smith authored Aug 17, 2012
```
SSAUpdater was created and destroyed without being initialized.

llvm-svn: 162137
```
257c5f20
Add MipsELFWriterInfo.{h,cpp}. · fb21e842
Akira Hatanaka authored Aug 17, 2012
```
llvm-svn: 162136
```
fb21e842

Correct MCJIT functionality for MIPS32 architecture. · 111174be

Akira Hatanaka authored Aug 17, 2012

No new tests are added.
All tests in ExecutionEngine/MCJIT that have been failing pass after this patch
is applied (when "make check" is done on a mips board). 

Patch by Petar Jovanovic.

llvm-svn: 162135

111174be

Implement stack protectors for structures with character arrays in them. · bfb9b759
Bill Wendling authored Aug 17, 2012
```
<rdar://problem/10545247>

llvm-svn: 162131
```
bfb9b759

Avoid folding ADD instructions with FI operands. · 7b1a2e8f

Jakob Stoklund Olesen authored Aug 17, 2012

PEI can't handle the pseudo-instructions. This can be removed when the
pseudo-instructions are replaced by normal predicated instructions.

Fixes PR13628.

llvm-svn: 162130

7b1a2e8f

Add stub methods for mips assembly matcher. · 7605630c
Akira Hatanaka authored Aug 17, 2012
```
Patch by Vladimir Medic.

llvm-svn: 162124
```
7605630c

MemoryBuiltins: Properly guard ObjectSizeOffsetVisitor against cycles in the IR. · 34764fe2

Benjamin Kramer authored Aug 17, 2012

The previous fix only checked for simple cycles, use a set to catch longer
cycles too.

Drop the broken check from the ObjectSizeOffsetEvaluator. The BoundsChecking
pass doesn't have to deal with invalid IR like InstCombine does.

llvm-svn: 162120

34764fe2

Change the `linker_private_weak_def_auto' linkage to `linkonce_odr_auto_hide' to · 34bc34ec

Bill Wendling authored Aug 17, 2012

make it more consistent with its intended semantics.

The `linker_private_weak_def_auto' linkage type was meant to automatically hide
globals which never had their addresses taken. It has nothing to do with the
`linker_private' linkage type, which outputs the symbols with a `l' (ell) prefix
among other things.

The intended semantic is more like the `linkonce_odr' linkage type.

Change the name of the linkage type to `linkonce_odr_auto_hide'. And therefore
changing the semantics so that it produces the correct output for the linker.

Note: The old linkage name `linker_private_weak_def_auto' will still parse but
is not a synonym for `linkonce_odr_auto_hide'. This should be removed in 4.0.
<rdar://problem/11754934>

llvm-svn: 162114

34bc34ec

Assert that dominates is not given a multiple edge. Finding out if we have · 9a16735e

Rafael Espindola authored Aug 17, 2012

multiple edges between two blocks is linear. If the caller is iterating all
edges leaving a BB that would be a square time algorithm. It is more efficient
to have the callers handle that case.

Currently the only callers are:
* GVN: already avoids the multiple edge case.
* Verifier: could only hit this assert when looking at an invalid invoke. Since
it already rejects the invoke, just avoid computing the dominance for it.

llvm-svn: 162113

9a16735e

Add comment, clean up code. No functional change. · c1dee482
Jakob Stoklund Olesen authored Aug 17, 2012
```
llvm-svn: 162107
```
c1dee482