Commits · 6bae2a57d5b7ef62ca950e62da2eaf01aeebfef9 · Roger Ferrer / llvm-epi-0.8

Aug 21, 2012

Fix a quadratic algorithm in MachineBranchProbabilityInfo. · 6bae2a57

Jakob Stoklund Olesen authored Aug 20, 2012

The getSumForBlock function was quadratic in the number of successors
because getSuccWeight would perform a linear search for an already known
iterator.

This patch was originally committed as r161460, but reverted again
because of assertion failures. Now that duplicate Machine CFG edges have
been eliminated, this works properly.

llvm-svn: 162233

6bae2a57

Aug 20, 2012

Don't add CFG edges for redundant conditional branches. · 7d33c573

Jakob Stoklund Olesen authored Aug 20, 2012

IR that hasn't been through SimplifyCFG can look like this:

  br i1 %b, label %r, label %r

Make sure we don't create duplicate Machine CFG edges in this case.

Fix the machine code verifier to accept conditional branches with a
single CFG edge.

llvm-svn: 162230

7d33c573

Clarify that duplicate edges are not allowed in the Machine CFG. · 784973b8

Jakob Stoklund Olesen authored Aug 20, 2012

LLVM IR has labeled duplicate CFG edges, but since Machine CFG edges
don't have labels, it doesn't make sense to allow duplicates. There is
no way of telling what the edges mean.

Duplicate CFG edges cause confusion when dealing with edge weights. It
seems that code producing duplicate CFG edges usually does the wrong
thing with edge weights.

llvm-svn: 162227

784973b8

Add a verification pass after ExpandISelPseudos. · 1d026267

Jakob Stoklund Olesen authored Aug 20, 2012

This pass often has weird CFG hacks and hand-written MI building code
that can go wrong in many ways.

llvm-svn: 162224

1d026267

Add CFG checks to MachineVerifier. · de31b52c

Jakob Stoklund Olesen authored Aug 20, 2012

Verify that the predecessor and successor lists are consistent and free
of duplicates.

llvm-svn: 162223

de31b52c

Use a SmallPtrSet to dedup successors in EmitSjLjDispatchBlock. · 710093e3
Jakob Stoklund Olesen authored Aug 20, 2012
```
The test case ARM/2011-05-04-MultipleLandingPadSuccs.ll was creating
duplicate successor list entries.

llvm-svn: 162222
```
710093e3

enable cross compilation with cmake · faeca292

Sebastian Pop authored Aug 20, 2012

This patch allows us to use cmake to specify a cross compiler: target different
than host. In particular, it moves LLVM_DEFAULT_TARGET_TRIPLE and TARGET_TRIPLE
variables from cmake/config-ix.cmake to the toplevel CMakeLists.txt to make them
available at configure time.

Here is the command line that I have used to test my patches to create a Hexagon
cross compiler hosted on x86:

$ cmake -G Ninja -D LLVM_TARGETS_TO_BUILD:STRING=Hexagon -D TARGET_TRIPLE:STRING=hexagon-unknown-linux-gnu -D LLVM_DEFAULT_TARGET_TRIPLE:STRING=hexagon-unknown-linux-gnu -D LLVM_TARGET_ARCH:STRING=hexagon-unknown-linux-gnu ..
$ ninja check

llvm-svn: 162219

faeca292

enable Hexagon target from cmake · 8f4aec43

Sebastian Pop authored Aug 20, 2012

The patch adds a missing case for the Hexagon target in cmake/config-ix.cmake.

llvm-svn: 162218

8f4aec43

fix HexagonSubtarget parsing of -mv flag · 1a0bef6d
Sebastian Pop authored Aug 20, 2012
```
llvm-svn: 162217
```
1a0bef6d
fix a case where all operands of BUILD_VECTOR are undefined · 10ff96ce
Michael Liao authored Aug 20, 2012
```
llvm-svn: 162214
```
10ff96ce
Fix coding style violations in 162135 and 162136. · 11dfbe19
Akira Hatanaka authored Aug 20, 2012
```
Patch by Petar Jovanovic.

llvm-svn: 162213
```
11dfbe19
DataExtractor: Fix integer truncation issues in LEB128 extraction. · 1b07ab51
Benjamin Kramer authored Aug 20, 2012
```
llvm-svn: 162201
```
1b07ab51
Forget to add testcase for r162195. Sorry. · 6ee89aaf
Stepan Dyatkovskiy authored Aug 20, 2012
```
llvm-svn: 162196
```
6ee89aaf

Fixed DAGCombiner bug (found and localized by James Malloy): · 6a638ec5

Stepan Dyatkovskiy authored Aug 20, 2012

The DAGCombiner tries to optimise a BUILD_VECTOR by checking if it
consists purely of get_vector_elts from one or two source vectors. If
so, it either makes a concat_vectors node or a shufflevector node.

However, it doesn't check the element type width of the underlying
vector, so if you have this sequence:

Node0: v4i16 = ...
Node1: i32 = extract_vector_elt Node0
Node2: i32 = extract_vector_elt Node0
Node3: v16i8 = BUILD_VECTOR Node1, Node2, ...

It will attempt to:

Node0:    v4i16 = ...
NewNode1: v16i8 = concat_vectors Node0, ...

Where this is actually invalid because the element width is completely
different. This causes an assertion failure on DAG legalization stage.

Fix:
If output item type of BUILD_VECTOR differs from input item type.
Make concat_vectors based on input element type and then bitcast it to the output vector type. So the case described above will transformed to:
Node0:    v4i16 = ...
NewNode1: v8i16 = concat_vectors Node0, ...
NewNode2: v16i8 = bitcast NewNode1

llvm-svn: 162195

6a638ec5

Remove FMA3 intrinsic instructions in favor of patterns. · b58eec4e
Craig Topper authored Aug 20, 2012
```
llvm-svn: 162194
```
b58eec4e
Use correct intrinsic for 256-bit VFMSUBADDPS. · 37eca549
Craig Topper authored Aug 20, 2012
```
llvm-svn: 162193
```
37eca549
Remove trailing white space and tab characters. No functional change. · 5122e9f1
Craig Topper authored Aug 19, 2012
```
llvm-svn: 162192
```
5122e9f1

Aug 19, 2012
- When unsafe math is used, we can use commutative FMAX and FMIN. In some cases · 178250ad
  Nadav Rotem authored Aug 19, 2012
```
this allows for better code generation.

Added a new DAGCombine transformation to convert FMAX and FMIN to FMANC and
FMINC, which are commutative.

For example:

  movaps  %xmm0, %xmm1
  movsd LC(%rip), %xmm0
  minsd %xmm1, %xmm0

becomes:

  minsd LC(%rip), %xmm0

llvm-svn: 162187
```
  178250ad
- Fabs folding is implemented. · fd4fe706
  Benjamin Kramer authored Aug 19, 2012
```
llvm-svn: 162186
```
  fd4fe706
- InstCombine: Fix a crasher when encountering a function pointer. · 9d03242f
  Benjamin Kramer authored Aug 18, 2012
```
llvm-svn: 162180
```
  9d03242f
Aug 18, 2012

Remove the CAND/COR/CXOR custom ISD nodes and their select code. · e1014e7b
Jakob Stoklund Olesen authored Aug 18, 2012
```
These nodes are no longer needed because the peephole pass can fold
CMOV+AND into ANDCC etc.

llvm-svn: 162179
```
e1014e7b

Remove virtual from many methods. These methods replace methods in the base... · fd1c9259

Craig Topper authored Aug 18, 2012

Remove virtual from many methods. These methods replace methods in the base class, but the base class methods aren't virtual so it just increased call overhead.

llvm-svn: 162178

fd1c9259

Also combine zext/sext into selects for ARM. · dded061f

Jakob Stoklund Olesen authored Aug 18, 2012

This turns common i1 patterns into predicated instructions:

  (add (zext cc), x) -> (select cc (add x, 1), x)
  (add (sext cc), x) -> (select cc (add x, -1), x)

For a function like:

  unsigned f(unsigned s, int x) {
    return s + (x>0);
  }

We now produce:

  cmp r1, #0
  it  gt
  addgt.w r0, r0, #1

Instead of:

  movs  r2, #0
  cmp r1, #0
  it  gt
  movgt r2, #1
  add r0, r2

llvm-svn: 162177

dded061f

Also pass logical ops to combineSelectAndUse. · aab43dbf

Jakob Stoklund Olesen authored Aug 18, 2012

Add these transformations to the existing add/sub ones:

  (and (select cc, -1, c), x) -> (select cc, x, (and, x, c))
  (or  (select cc, 0, c), x)  -> (select cc, x, (or, x, c))
  (xor (select cc, 0, c), x)  -> (select cc, x, (xor, x, c))

The selects can then be transformed to a single predicated instruction
by peephole.

This transformation will make it possible to eliminate the ISD::CAND,
COR, and CXOR custom DAG nodes.

llvm-svn: 162176

aab43dbf

Remove overly conservative hasOneUse check, this always expands into a single IR instruction. · 9282aef8
Benjamin Kramer authored Aug 18, 2012
```
llvm-svn: 162175
```
9282aef8
InstCombine: Add a couple of fabs identities for comparing with 0.0. · 8c2a733c
Benjamin Kramer authored Aug 18, 2012
```
llvm-svn: 162174
```
8c2a733c
SimplifyLibcalls: Add fabs and trunc to the list of libcalls that are safe to... · 00013245
Benjamin Kramer authored Aug 18, 2012
```
SimplifyLibcalls: Add fabs and trunc to the list of libcalls that are safe to shrink from double to float.

llvm-svn: 162173
```
00013245
Reapply r162160 with a fix: Optimize Arith->Trunc->SETCC sequence to allow... · a136939f
Nadav Rotem authored Aug 18, 2012
```
Reapply r162160 with a fix: Optimize Arith->Trunc->SETCC sequence to allow better compare/branch code.

llvm-svn: 162172
```
a136939f
fp16-to-fp32 conversion instructions are available in Thumb mode as well. · 1e28826a
Anton Korobeynikov authored Aug 18, 2012
```
Make sure the generic pattern is used.

llvm-svn: 162170
```
1e28826a
Refactor code a bit to reduce number of calls in the final compiled code. No... · 0128f9ba
Craig Topper authored Aug 18, 2012
```
Refactor code a bit to reduce number of calls in the final compiled code. No functional change intended.

llvm-svn: 162166
```
0128f9ba
Reorder initialization list to silence -Wreorder · 2bd9c7bd
Craig Topper authored Aug 18, 2012
```
llvm-svn: 162165
```
2bd9c7bd
Revert r162160 because it made a few buildbots fail. · c324af60
Nadav Rotem authored Aug 18, 2012
```
llvm-svn: 162164
```
c324af60

The X86 backend has a number of optimizations for SETCC nodes which use · 2cb14a5c

Nadav Rotem authored Aug 18, 2012

arithmetic instructions. However, when small data types are used, a truncate
node appears between the SETCC node and the arithmetic operation. This patch
adds support for this pattern.

Before:
  xorl  %esi, %edi
  testb %dil, %dil
  setne %al
  ret

After:
  xorb  %dil, %sil
  setne %al
  ret

rdar://12081007

llvm-svn: 162160

2cb14a5c

Make atomic load and store of pointers work. Tighten verification of atomic operations · 79a6b30d
Eli Friedman authored Aug 17, 2012
```
so other unexpected operations don't slip through.  Based on patch by Logan Chien.
PR11786/PR13186.

llvm-svn: 162146
```
79a6b30d

Aug 17, 2012
- Fix undefined behavior (binding a reference to a dereferenced null pointer) if · 257c5f20
  Richard Smith authored Aug 17, 2012
```
SSAUpdater was created and destroyed without being initialized.

llvm-svn: 162137
```
  257c5f20
- Add MipsELFWriterInfo.{h,cpp}. · fb21e842
  Akira Hatanaka authored Aug 17, 2012
```
llvm-svn: 162136
```
  fb21e842
- Correct MCJIT functionality for MIPS32 architecture. · 111174be
  Akira Hatanaka authored Aug 17, 2012
```
No new tests are added.
All tests in ExecutionEngine/MCJIT that have been failing pass after this patch
is applied (when "make check" is done on a mips board). 

Patch by Petar Jovanovic.

llvm-svn: 162135
```
  111174be
- Implement stack protectors for structures with character arrays in them. · bfb9b759
  Bill Wendling authored Aug 17, 2012
```
<rdar://problem/10545247>

llvm-svn: 162131
```
  bfb9b759
- Avoid folding ADD instructions with FI operands. · 7b1a2e8f
  Jakob Stoklund Olesen authored Aug 17, 2012
```
PEI can't handle the pseudo-instructions. This can be removed when the
pseudo-instructions are replaced by normal predicated instructions.

Fixes PR13628.

llvm-svn: 162130
```
  7b1a2e8f
- Add stub methods for mips assembly matcher. · 7605630c
  Akira Hatanaka authored Aug 17, 2012
```
Patch by Vladimir Medic.

llvm-svn: 162124
```
  7605630c