Commits · a0e5e54e8df318f1f2fd5c2a1811ec67a08eb484 · Roger Ferrer / llvm-epi-0.8

Nov 12, 2012

Revert r167620; this can be implemented using an existing CL option. · 2b2b38d3
Chad Rosier authored Nov 12, 2012
```
llvm-svn: 167755
```
2b2b38d3
misched: rename interfaceto avoid gcc warnings · ec369d53
Andrew Trick authored Nov 12, 2012
```
llvm-svn: 167753
```
ec369d53

BBVectorize: Use a more sophisticated check for input cost · 9cf33729

Hal Finkel authored Nov 12, 2012

The old checking code, which assumed that input shuffles and insert-elements
could always be folded (and thus were free) is too simple.
This can only happen in special circumstances.
Using the simple check caused infinite recursion.

llvm-svn: 167750

9cf33729

misched: Target-independent support for MacroFusion. · 26328024

Andrew Trick authored Nov 12, 2012

Uses the infrastructure from r167742 to support clustering instructure
that the target processor can "fuse". e.g. cmp+jmp.

Next step: target hook implementations with test cases, and enable.

llvm-svn: 167744

26328024

BBVectorize: Check the types of compare instructions · f8326b60

Hal Finkel authored Nov 12, 2012

The pass would previously assert when trying to compute the cost of
compare instructions with illegal vector types (like struct pointers).

llvm-svn: 167743

f8326b60

misched: Target-independent support for load/store clustering. · a7714a0f

Andrew Trick authored Nov 12, 2012

This infrastructure is generally useful for any target that wants to
strongly prefer two instructions to be adjacent after scheduling.

A following checkin will add target-specific hooks with unit
tests. Then this feature will be enabled by default with misched.

llvm-svn: 167742

a7714a0f

This change is to fix rdar://12571717 which is about assertion in Reassociate pass. · 1c442f5e

Shuxin Yang authored Nov 12, 2012

The assertion is trigged when the Reassociater tries to transform expression
     ... + 2 * n * 3 + 2 * m + ...
  into:
     ... + 2 * (n*3 + m).

In the process of the transformation, a helper routine folds the constant 2*3 into 6,
confusing optimizer which is trying the to eliminate the common factor 2, and cannot
find 2 any more. 

Review is pending. But I'd like commit first in order to help those who are waiting 
for this fix. 

llvm-svn: 167740

1c442f5e

misched: Infrastructure for weak DAG edges. · f1ff84c6

Andrew Trick authored Nov 12, 2012

This adds support for weak DAG edges to the general scheduling
infrastructure in preparation for MachineScheduler support for
heuristics based on weak edges.

llvm-svn: 167738

f1ff84c6

Make TOC order deterministic by using MapVector instead of DenseMap. · 2c93acdf
Ulrich Weigand authored Nov 12, 2012
```
llvm-svn: 167737
```
2c93acdf
fix a spelling mistake · 0767d177
Nadav Rotem authored Nov 12, 2012
```
llvm-svn: 167734
```
0767d177

BBVectorize: Check the input types of shuffles for legality · ef53df0f

Hal Finkel authored Nov 12, 2012

This fixes a bug where shuffles were being fused such that the
resulting input types were not legal on the target. This would
occur only when both inputs and dependencies were also foldable
operations (such as other shuffles) and there were other connected
pairs in the same block.

llvm-svn: 167731

ef53df0f

Don't use __cxa_demangle under MSVC (which doesn't have it) · 5a578119
Alexander Potapenko authored Nov 12, 2012
```
llvm-svn: 167730
```
5a578119
[ASan] fixup for r167725: Don't fetch name of StructType if it is literal · afc550d9
Alexey Samsonov authored Nov 12, 2012
```
llvm-svn: 167729
```
afc550d9

Fixup for r167558: Store raw pointer (instead of reference) to RelocMap in... · 9cb13d59

Alexey Samsonov authored Nov 12, 2012

Fixup for r167558: Store raw pointer (instead of reference) to RelocMap in DIContext. This is needed to prevent crashes because of dangling reference if the clients don't provide RelocMap to DIContext constructor.

llvm-svn: 167728

9cb13d59

Normalize memcmp constant folding results. · b3e91f6a

Meador Inge authored Nov 12, 2012

The library call simplifier folds memcmp calls with all constant arguments
to a constant.  For example:

  memcmp("foo", "foo", 3) ->  0
  memcmp("hel", "foo", 3) ->  1
  memcmp("foo", "hel", 3) -> -1

The folding is implemented in terms of the system memcmp that LLVM gets
linked with.  It currently just blindly uses the value returned from
the system memcmp as the folded constant.

This patch normalizes the values returned from the system memcmp to
(-1, 0, 1) so that we get consistent results across multiple platforms.
The test cases were adjusted accordingly.

llvm-svn: 167726

b3e91f6a

[ASan]: Add minimalistic support for turning off initialization-order checking... · 582d7de7

Alexey Samsonov authored Nov 12, 2012

[ASan]: Add minimalistic support for turning off initialization-order checking for globals of specified types. Tests for this behavior will go to ASan test suite in compiler-rt.

llvm-svn: 167725

582d7de7

do not play preprocessor tricks with 'private', use public interfaces instead;... · ea5fa100
Gabor Greif authored Nov 12, 2012
```
do not play preprocessor tricks with 'private', use public interfaces instead; this appeases the VC++ buildbots

llvm-svn: 167724
```
ea5fa100

[ASan] Add llvm-symbolizer from to tools/ · 8c07f555

Alexander Potapenko authored Nov 12, 2012

This is the second and last (2/2) part of a change that moves llvm-symbolizer to llvm/tools/, which will allow to build it
with both cmake and configure+make.

llvm-svn: 167723

8c07f555

add unit test for waymarking algorithm (Use::getUser) · fea6a551
Gabor Greif authored Nov 12, 2012
```
llvm-svn: 167720
```
fea6a551
Remove unused field. · 16631130
Eric Christopher authored Nov 12, 2012
```
llvm-svn: 167719
```
16631130

Fix PR14314 · d39c0fb1

Michael Liao authored Nov 12, 2012

- Fix operand order for atomic sub, where the minuend is the value
  loaded from memory and the subtrahend is the parameter specified.

llvm-svn: 167718

d39c0fb1

Add --enable-werror and --enable-cxx11 to projects/sample/ · b41000ed
Craig Topper authored Nov 12, 2012
```
llvm-svn: 167716
```
b41000ed

[NVPTX] Add more precise PTX/SM target attributes · 1812ee9a

Justin Holewinski authored Nov 12, 2012

Each SM and PTX version is modeled as a subtarget feature/CPU. Additionally,
PTX 3.1 is added as the default PTX version to be out-of-the-box compatible
with CUDA 5.0.

Available CPUs for this target:

  sm_10 - Select the sm_10 processor.
  sm_11 - Select the sm_11 processor.
  sm_12 - Select the sm_12 processor.
  sm_13 - Select the sm_13 processor.
  sm_20 - Select the sm_20 processor.
  sm_21 - Select the sm_21 processor.
  sm_30 - Select the sm_30 processor.
  sm_35 - Select the sm_35 processor.

Available features for this target:

  ptx30 - Use PTX version 3.0.
  ptx31 - Use PTX version 3.1.
  sm_10 - Target SM 1.0.
  sm_11 - Target SM 1.1.
  sm_12 - Target SM 1.2.
  sm_13 - Target SM 1.3.
  sm_20 - Target SM 2.0.
  sm_21 - Target SM 2.1.
  sm_30 - Target SM 3.0.
  sm_35 - Target SM 3.5.

llvm-svn: 167699

1812ee9a

Delete a stale comment. No functional change. · f963a8ff
Meador Inge authored Nov 12, 2012
```
llvm-svn: 167698
```
f963a8ff

Nov 11, 2012

Move some helper methods to being static functions in the implementation file. · dd13d3fd
Craig Topper authored Nov 11, 2012
```
llvm-svn: 167696
```
dd13d3fd

Remove hard-coded constant in Transforms/InstCombine/memcmp-1.ll · 9493eb9b

Meador Inge authored Nov 11, 2012

Transforms/InstCombine/memcmp-1.ll has a test case that looks like:

  @foo = constant [4 x i8] c"foo\00"
  @hel = constant [4 x i8] c"hel\00"

  ...

  %mem1 = getelementptr [4 x i8]* @hel, i32 0, i32 0
  %mem2 = getelementptr [4 x i8]* @foo, i32 0, i32 0
  %ret = call i32 @memcmp(i8* %mem1, i8* %mem2, i32 3)
  ret i32 %ret
  ; CHECK: ret i32 2

The folded return value (2 above) is computed using the system memcmp
that the compiler is linked with.  This can return different values on
different systems.  The test was originally written on an OS X 10.7.5
x86-64 box and passed.  However, it failed on one of the x86-64 FreeBSD
buildbots because the system memcpy on that machine returned a different
value (1 instead of 2).

I fixed the test by checking the folding constants with regexes.

llvm-svn: 167691

9493eb9b

instcombine: Migrate memset optimizations · d4825780

Meador Inge authored Nov 11, 2012

This patch migrates the memset optimizations from the simplify-libcalls
pass into the instcombine library call simplifier.

llvm-svn: 167689

d4825780

Update the vectorizer docs. · 91380570
Nadav Rotem authored Nov 11, 2012
```
llvm-svn: 167688
```
91380570

instcombine: Migrate memmove optimizations · 9cf328b5

Meador Inge authored Nov 11, 2012

This patch migrates the memmove optimizations from the simplify-libcalls
pass into the instcombine library call simplifier.

llvm-svn: 167687

9cf328b5

instcombine: Migrate memcpy optimizations · dd9234a1

Meador Inge authored Nov 11, 2012

This patch migrates the memcpy optimizations from the simplify-libcalls
pass into the instcombine library call simplifier.

llvm-svn: 167686

dd9234a1

Use the isTruncFree and isZExtFree API to figure out of these operations are free. Thanks Andy! · 3b99dc62
Nadav Rotem authored Nov 11, 2012
```
llvm-svn: 167685
```
3b99dc62
Fix a comment typo and add comments. · 12930749
Nadav Rotem authored Nov 11, 2012
```
llvm-svn: 167684
```
12930749

instcombine: Migrate memcmp optimizations · 4d2827c1

Meador Inge authored Nov 11, 2012

This patch migrates the memcmp optimizations from the simplify-libcalls
pass into the instcombine library call simplifier.

llvm-svn: 167683

4d2827c1

instcombine: Migrate strstr optimizations · 56edbc93

Meador Inge authored Nov 11, 2012

This patch migrates the strstr optimizations from the simplify-libcalls
pass into the instcombine library call simplifier.

llvm-svn: 167682

56edbc93

Add method for replacing instructions to LibCallSimplifier · 76fc1a47

Meador Inge authored Nov 11, 2012

In some cases the library call simplifier may need to replace instructions
other than the library call being simplified.  In those cases it may be
necessary for clients of the simplifier to override how the replacements
are actually done.  As such, a new overrideable method for replacing
instructions was added to LibCallSimplifier.

A new subclass of LibCallSimplifier is also defined which overrides
the instruction replacement method.  This is because the instruction
combiner defines its own replacement method which updates the worklist
when instructions are replaced.

llvm-svn: 167681

76fc1a47

Nov 10, 2012
- Provide definitions for all functions. · 933f4116
  Benjamin Kramer authored Nov 10, 2012
```
ICC refuses to compile a class in an anonymous namespace if some functions
aren't defined. Fixes PR13477.

llvm-svn: 167676
```
  933f4116
- instcombine: Migrate strcspn optimizations · bcd88ef7
  Meador Inge authored Nov 10, 2012
```
This patch migrates the strcspn optimizations from the simplify-libcalls
pass into the instcombine library call simplifier.

llvm-svn: 167675
```
  bcd88ef7
- Simplify the SmallVector pretty printer for LLDB a bit and make it work with reference types. · 91b014cd
  Benjamin Kramer authored Nov 10, 2012
```
llvm-svn: 167674
```
  91b014cd
- Remove unnecessary subtraction and addition by 1 around a couple for loops. · a43e2fd3
  Craig Topper authored Nov 10, 2012
```
llvm-svn: 167673
```
  a43e2fd3
- Tidy up spacing. No functional change. · 84afbf2b
  Craig Topper authored Nov 10, 2012
```
llvm-svn: 167671
```
  84afbf2b