Commits · 40534fe9a57ae27ba1d47834386d31c649c06972 · Roger Ferrer / llvm-epi-0.8

Nov 13, 2012

Added RegisterCoalescer support for joining global copies first. · 40534fe9

Andrew Trick authored Nov 13, 2012

This adds the -join-globalcopies option which can be enabled by
default once misched is also enabled.

Ideally, the register coalescer would be able to split local live
ranges in a way that produces copies that can be easily resolved by
the scheduler. Until then, this heuristic should be good enough to at
least allow the scheduler to run after coalescing.

llvm-svn: 167825

40534fe9

Figure out <size> argument of llvm.lifetime intrinsics at the moment they are... · cfd662f2
Alexey Samsonov authored Nov 13, 2012
```
Figure out <size> argument of llvm.lifetime intrinsics at the moment they are created (during function inlining)

llvm-svn: 167821
```
cfd662f2
Test commit. · ccfd77ef
Jyotsna Verma authored Nov 13, 2012
```
Add a blank line.

llvm-svn: 167819
```
ccfd77ef
BBVectorize: Remove temporary assert used for debugging · b51bdd20
Hal Finkel authored Nov 13, 2012
```
llvm-svn: 167817
```
b51bdd20

instcombine: Migrate math library call simplifications · 193e035b

Meador Inge authored Nov 13, 2012

This patch migrates the math library call simplifications from the
simplify-libcalls pass into the instcombine library call simplifier.

I have typically migrated just one simplifier at a time, but the math
simplifiers are interdependent because:

   1. CosOpt, PowOpt, and Exp2Opt all depend on UnaryDoubleFPOpt.
   2. CosOpt, PowOpt, Exp2Opt, and UnaryDoubleFPOpt all depend on
      the option -enable-double-float-shrink.

These two factors made migrating each of these simplifiers individually
more of a pain than it would be worth.  So, I migrated them all together.

llvm-svn: 167815

193e035b

BBVectorize: Don't vectorize vector-manipulation chains · 2a1df367

Hal Finkel authored Nov 13, 2012

Don't choose a vectorization plan containing only shuffles and
vector inserts/extracts. Due to inperfections in the cost model,
these can lead to infinite recusion.

llvm-svn: 167811

2a1df367

Revert r167759. Ben is right this isn't likely to help much. · 66dbd3fb
Evan Cheng authored Nov 13, 2012
```
llvm-svn: 167809
```
66dbd3fb

misched: Don't consider artificial edges weak edges. · 4b1f9e3b

Andrew Trick authored Nov 13, 2012

For now be more conservative in case other out-of-tree schedulers rely
on the old behavior of artificial edges.

llvm-svn: 167808

4b1f9e3b

Use the 'count' attribute instead of the 'upper_bound' attribute. · f454dfb6

Bill Wendling authored Nov 13, 2012

If we have a type 'int a[1]' and a type 'int b[0]', the generated DWARF is the
same for both of them because we use the 'upper_bound' attribute. Instead use
the 'count' attrbute, which gives the correct number of elements in the array.
<rdar://problem/12566646>

llvm-svn: 167806

f454dfb6

Cleanup the main RegisterCoalescer loop. · edac22a9
Andrew Trick authored Nov 13, 2012
```
Block priorities still apply outside loops.

llvm-svn: 167793
```
edac22a9
revert r167740 · c94c3bb5
Shuxin Yang authored Nov 13, 2012
```
llvm-svn: 167787
```
c94c3bb5
Cleanup -join-splitedges. Make the loop more obvious. · c25d3fe7
Andrew Trick authored Nov 12, 2012
```
llvm-svn: 167785
```
c25d3fe7

BBVectorize: Only some insert element operand pairs are free. · 3b79f55c

Hal Finkel authored Nov 12, 2012

This fixes another infinite recursion case when using target costs.
We can only replace insert element input chains that are pure (end
with inserting into an undef).

llvm-svn: 167784

3b79f55c

Nov 12, 2012

Add an option to enable prototype "fission" capabilities and debug changes. · 29424311
Eric Christopher authored Nov 12, 2012
```
llvm-svn: 167765
```
29424311

Cache size of PassVector to speed up getNumContainedPasses(). · 4b54c8ff

Evan Cheng authored Nov 12, 2012

getNumContainedPasses() used to compute the size of the vector on demand. It is
called repeated in loops (such as runOnFunction()) and it can be updated while
inside the loop.

llvm-svn: 167759

4b54c8ff

Added a temporary option to avoid critical edges splitting. · 22d688a2

Andrew Trick authored Nov 12, 2012

This teaches the register coalescer to be less prone to split critical
edges. I am currently benchmarking this with the new (post-coalescer)
scheduler. I plan to enable this by default and remove the option as
soon as misched is enabled.

llvm-svn: 167758

22d688a2

Rewrite DIContext interface to take an object. Update all callers. · 7370b552
Eric Christopher authored Nov 12, 2012
```
llvm-svn: 167757
```
7370b552
Revert r167620; this can be implemented using an existing CL option. · 2b2b38d3
Chad Rosier authored Nov 12, 2012
```
llvm-svn: 167755
```
2b2b38d3
misched: rename interfaceto avoid gcc warnings · ec369d53
Andrew Trick authored Nov 12, 2012
```
llvm-svn: 167753
```
ec369d53

BBVectorize: Use a more sophisticated check for input cost · 9cf33729

Hal Finkel authored Nov 12, 2012

The old checking code, which assumed that input shuffles and insert-elements
could always be folded (and thus were free) is too simple.
This can only happen in special circumstances.
Using the simple check caused infinite recursion.

llvm-svn: 167750

9cf33729

misched: Target-independent support for MacroFusion. · 26328024

Andrew Trick authored Nov 12, 2012

Uses the infrastructure from r167742 to support clustering instructure
that the target processor can "fuse". e.g. cmp+jmp.

Next step: target hook implementations with test cases, and enable.

llvm-svn: 167744

26328024

BBVectorize: Check the types of compare instructions · f8326b60

Hal Finkel authored Nov 12, 2012

The pass would previously assert when trying to compute the cost of
compare instructions with illegal vector types (like struct pointers).

llvm-svn: 167743

f8326b60

misched: Target-independent support for load/store clustering. · a7714a0f

Andrew Trick authored Nov 12, 2012

This infrastructure is generally useful for any target that wants to
strongly prefer two instructions to be adjacent after scheduling.

A following checkin will add target-specific hooks with unit
tests. Then this feature will be enabled by default with misched.

llvm-svn: 167742

a7714a0f

This change is to fix rdar://12571717 which is about assertion in Reassociate pass. · 1c442f5e

Shuxin Yang authored Nov 12, 2012

The assertion is trigged when the Reassociater tries to transform expression
     ... + 2 * n * 3 + 2 * m + ...
  into:
     ... + 2 * (n*3 + m).

In the process of the transformation, a helper routine folds the constant 2*3 into 6,
confusing optimizer which is trying the to eliminate the common factor 2, and cannot
find 2 any more. 

Review is pending. But I'd like commit first in order to help those who are waiting 
for this fix. 

llvm-svn: 167740

1c442f5e

misched: Infrastructure for weak DAG edges. · f1ff84c6

Andrew Trick authored Nov 12, 2012

This adds support for weak DAG edges to the general scheduling
infrastructure in preparation for MachineScheduler support for
heuristics based on weak edges.

llvm-svn: 167738

f1ff84c6

Make TOC order deterministic by using MapVector instead of DenseMap. · 2c93acdf
Ulrich Weigand authored Nov 12, 2012
```
llvm-svn: 167737
```
2c93acdf

BBVectorize: Check the input types of shuffles for legality · ef53df0f

Hal Finkel authored Nov 12, 2012

This fixes a bug where shuffles were being fused such that the
resulting input types were not legal on the target. This would
occur only when both inputs and dependencies were also foldable
operations (such as other shuffles) and there were other connected
pairs in the same block.

llvm-svn: 167731

ef53df0f

[ASan] fixup for r167725: Don't fetch name of StructType if it is literal · afc550d9
Alexey Samsonov authored Nov 12, 2012
```
llvm-svn: 167729
```
afc550d9

Fixup for r167558: Store raw pointer (instead of reference) to RelocMap in... · 9cb13d59

Alexey Samsonov authored Nov 12, 2012

Fixup for r167558: Store raw pointer (instead of reference) to RelocMap in DIContext. This is needed to prevent crashes because of dangling reference if the clients don't provide RelocMap to DIContext constructor.

llvm-svn: 167728

9cb13d59

Normalize memcmp constant folding results. · b3e91f6a

Meador Inge authored Nov 12, 2012

The library call simplifier folds memcmp calls with all constant arguments
to a constant.  For example:

  memcmp("foo", "foo", 3) ->  0
  memcmp("hel", "foo", 3) ->  1
  memcmp("foo", "hel", 3) -> -1

The folding is implemented in terms of the system memcmp that LLVM gets
linked with.  It currently just blindly uses the value returned from
the system memcmp as the folded constant.

This patch normalizes the values returned from the system memcmp to
(-1, 0, 1) so that we get consistent results across multiple platforms.
The test cases were adjusted accordingly.

llvm-svn: 167726

b3e91f6a

[ASan]: Add minimalistic support for turning off initialization-order checking... · 582d7de7

Alexey Samsonov authored Nov 12, 2012

[ASan]: Add minimalistic support for turning off initialization-order checking for globals of specified types. Tests for this behavior will go to ASan test suite in compiler-rt.

llvm-svn: 167725

582d7de7

Remove unused field. · 16631130
Eric Christopher authored Nov 12, 2012
```
llvm-svn: 167719
```
16631130

Fix PR14314 · d39c0fb1

Michael Liao authored Nov 12, 2012

- Fix operand order for atomic sub, where the minuend is the value
  loaded from memory and the subtrahend is the parameter specified.

llvm-svn: 167718

d39c0fb1

[NVPTX] Add more precise PTX/SM target attributes · 1812ee9a

Justin Holewinski authored Nov 12, 2012

Each SM and PTX version is modeled as a subtarget feature/CPU. Additionally,
PTX 3.1 is added as the default PTX version to be out-of-the-box compatible
with CUDA 5.0.

Available CPUs for this target:

  sm_10 - Select the sm_10 processor.
  sm_11 - Select the sm_11 processor.
  sm_12 - Select the sm_12 processor.
  sm_13 - Select the sm_13 processor.
  sm_20 - Select the sm_20 processor.
  sm_21 - Select the sm_21 processor.
  sm_30 - Select the sm_30 processor.
  sm_35 - Select the sm_35 processor.

Available features for this target:

  ptx30 - Use PTX version 3.0.
  ptx31 - Use PTX version 3.1.
  sm_10 - Target SM 1.0.
  sm_11 - Target SM 1.1.
  sm_12 - Target SM 1.2.
  sm_13 - Target SM 1.3.
  sm_20 - Target SM 2.0.
  sm_21 - Target SM 2.1.
  sm_30 - Target SM 3.0.
  sm_35 - Target SM 3.5.

llvm-svn: 167699

1812ee9a

Delete a stale comment. No functional change. · f963a8ff
Meador Inge authored Nov 12, 2012
```
llvm-svn: 167698
```
f963a8ff

Nov 11, 2012
- Move some helper methods to being static functions in the implementation file. · dd13d3fd
  Craig Topper authored Nov 11, 2012
```
llvm-svn: 167696
```
  dd13d3fd
- instcombine: Migrate memset optimizations · d4825780
  Meador Inge authored Nov 11, 2012
```
This patch migrates the memset optimizations from the simplify-libcalls
pass into the instcombine library call simplifier.

llvm-svn: 167689
```
  d4825780
- instcombine: Migrate memmove optimizations · 9cf328b5
  Meador Inge authored Nov 11, 2012
```
This patch migrates the memmove optimizations from the simplify-libcalls
pass into the instcombine library call simplifier.

llvm-svn: 167687
```
  9cf328b5
- instcombine: Migrate memcpy optimizations · dd9234a1
  Meador Inge authored Nov 11, 2012
```
This patch migrates the memcpy optimizations from the simplify-libcalls
pass into the instcombine library call simplifier.

llvm-svn: 167686
```
  dd9234a1
- Use the isTruncFree and isZExtFree API to figure out of these operations are free. Thanks Andy! · 3b99dc62
  Nadav Rotem authored Nov 11, 2012
```
llvm-svn: 167685
```
  3b99dc62