Commits · 48f52e926d6e8273587f5befc997fc784b04855b · Roger Ferrer / llvm-epi-0.8

Feb 25, 2014

Factor out calls to AA.getDataLayout(). · 6d6e87be
Rafael Espindola authored Feb 25, 2014
```
llvm-svn: 202157
```
6d6e87be
Make a few more DataLayout variables const. · 43b5a51e
Rafael Espindola authored Feb 25, 2014
```
llvm-svn: 202155
```
43b5a51e

[SROA] Use the original load name with the SROA-prefixed IRB rather than · 25adb7b0

Chandler Carruth authored Feb 25, 2014

just "load". This helps avoid pointless de-duping with order-sensitive
numbers as we already have unique names from the original load. It also
makes the resulting IR quite a bit easier to read.

llvm-svn: 202140

25adb7b0

[SROA] Thread the ability to add a pointer-specific name prefix through · cb93cd2d

Chandler Carruth authored Feb 25, 2014

the pointer adjustment code. This is the primary code path that creates
totally new instructions in SROA and being able to lump them based on
the pointer value's name for which they were created causes
*significantly* fewer name collisions and general noise in the debug
output. This is particularly significant because it is making it much
harder to track down instability in the output of SROA, as name
de-duplication is a totally harmless form of instability that gets in
the way of seeing real problems.

The new fancy naming scheme tries to dig out the root "pre-SROA" name
for pointer values and associate that all the way through the pointer
formation instructions. Digging out the root is important to prevent the
multiple iterative rounds of SROA from just layering too much cruft on
top of cruft here. We already track the layers of SROAs iteration in the
alloca name prefix. We don't need to duplicate it here.

Should have no functionality change, and shouldn't have any really
measurable impact on NDEBUG builds, as most of the complex logic is
debug-only.

llvm-svn: 202139

cb93cd2d

[SROA] Rather than copying the logic for building a name prefix into the · 51175533
Chandler Carruth authored Feb 25, 2014
```
PHI-pointer builder, just copy the builder and clobber the obvious
fields.

llvm-svn: 202136
```
51175533

[SROA] Simplify some of the logic to dig out the old pointer value by · 8183a50f

Chandler Carruth authored Feb 25, 2014

using OldPtr more heavily. Lots of this code was written before the
rewriter had an OldPtr member setup ahead of time. There are already
asserts in place that should ensure this doesn't change any
functionality.

llvm-svn: 202135

8183a50f

[SROA] Adjust to new clang-format style. · 7625c54e
Chandler Carruth authored Feb 25, 2014
```
llvm-svn: 202134
```
7625c54e

[SROA] Fix a *glaring* bug in r202091: you have to actually *write* · a8c4cc68

Chandler Carruth authored Feb 25, 2014

the break statement, not just think it to yourself....

No idea how this worked at all, much less survived most bots, my
bootstrap, and some bot bootstraps!

The Polly one didn't survive, and this was filed as PR18959. I don't
have a reduced test case and honestly I'm not seeing the need. What we
probably need here are better asserts / debug-build behavior in
SmallPtrSet so that this madness doesn't make it so far.

llvm-svn: 202129

a8c4cc68

Silence GCC warning · 26af6f7f
Alexey Samsonov authored Feb 25, 2014
```
llvm-svn: 202119
```
26af6f7f
Fix typos · 70b36995
Alp Toker authored Feb 25, 2014
```
llvm-svn: 202107
```
70b36995

[SROA] Add a debugging tool which shuffles the slices sequence prior to · 83cee772

Chandler Carruth authored Feb 25, 2014

sorting it. This helps uncover latent reliance on the original ordering
which aren't guaranteed to be preserved by std::sort (but often are),
and which are based on the use-def chain orderings which also aren't
(technically) guaranteed.

Only available in C++11 debug builds, and behind a flag to prevent noise
at the moment, but this is generally useful so figured I'd put it in the
tree rather than keeping it out-of-tree.

llvm-svn: 202106

83cee772

[SROA] Use a more direct way of determining whether we are processing · bb2a9324

Chandler Carruth authored Feb 25, 2014

the destination operand or source operand of a memmove.

It so happens that it was impossible for SROA to try to rewrite
self-memmove where the operands are *identical*, because either such
a think is volatile (and we don't rewrite) or it is non-volatile, and we
don't even register it as a use of the alloca.

However, making the 'IsDest' test *rely* on this subtle fact is... Very
confusing for the reader. We should use the direct and readily available
test of the Use* which gives us concrete information about which operand
is being rewritten.

No functionality changed, I hope! ;]

llvm-svn: 202103

bb2a9324

[SROA] Fix another instability in SROA with respect to the slice · 3bf18ed5

Chandler Carruth authored Feb 25, 2014

ordering.

The fundamental problem that we're hitting here is that the use-def
chain ordering is *itself* not a stable thing to be relying on in the
rewriting for SROA. Further, we use a non-stable sort over the slices to
arrange them based on the section of the alloca they're operating on.
With a debugging STL implementation (or different implementations in
stage2 and stage3) this can cause stage2 != stage3.

The specific aspect of this problem fixed in this commit deals with the
rewriting and load-speculation around PHIs and Selects. This, like many
other aspects of the use-rewriting in SROA, is really part of the
"strong SSA-formation" that is doen by SROA where it works very hard to
canonicalize loads and stores in *just* the right way to satisfy the
needs of mem2reg[1]. When we have a select (or a PHI) with 2 uses of the
same alloca, we test that loads downstream of the select are
speculatable around it twice. If only one of the operands to the select
needs to be rewritten, then if we get lucky we rewrite that one first
and the select is immediately speculatable. This can cause the order of
operand visitation, and thus the order of slices to be rewritten, to
change an alloca from promotable to non-promotable and vice versa.

The fix is to defer all of the speculation until *after* the rewrite
phase is done. Once we've rewritten everything, we can accurately test
for whether speculation will work (once, instead of twice!) and the
order ceases to matter.

This also happens to simplify the other subtlety of speculation -- we
need to *not* speculate anything unless the result of speculating will
make the alloca fully promotable by mem2reg. I had a previous attempt at
simplifying this, but it was still pretty horrible.

There is actually already a *really* nice test case for this in
basictest.ll, but on multiple STL implementations and inputs, we just
got "lucky". Fortunately, the test case is very small and we can
essentially build it in exactly the opposite way to get reasonable
coverage in both directions even from normal STL implementations.

llvm-svn: 202092

3bf18ed5

Make some DataLayout pointers const. · aeff8a9c
Rafael Espindola authored Feb 24, 2014
```
No functionality change. Just reduces the noise of an upcoming patch.

llvm-svn: 202087
```
aeff8a9c

Feb 24, 2014

SLPVectorizer: Try vectorizing 'splat' stores · 9611d23d
Arnold Schwaighofer authored Feb 24, 2014
```
Vectorize sequential stores of a broadcasted value.
5% on eon.

radar://16124699

llvm-svn: 202067
```
9611d23d

Replace the F_Binary flag with a F_Text one. · 90c7f1cc

Rafael Espindola authored Feb 24, 2014

After this I will set the default back to F_None. The advantage is that
before this patch forgetting to set F_Binary would corrupt a file on windows.
Forgetting to set F_Text produces one that cannot be read in notepad, which
is a better failure mode :-)

llvm-svn: 202052

90c7f1cc

LTO: Add the loop vectorizer to the LTO pipeline. · 6ccda923

Arnold Schwaighofer authored Feb 24, 2014

During the LTO phase LICM will move loop invariant global variables out of loops
(informed by GlobalModRef). This makes more loops countable presenting
opportunity for the loop vectorizer.

Adding the loop vectorizer improves some TSVC benchmarks and twolf/ref dataset
(5%) on x86-64.

radar://15970632

llvm-svn: 202051

6ccda923

Don't make F_None the default. · 7dbcdd08

Rafael Espindola authored Feb 24, 2014

This will make it easier to switch the default to being binary files.

llvm-svn: 202042

7dbcdd08

[asan] simplify the code that compute the shadow offset; get rid of two... · cc92c795

Kostya Serebryany authored Feb 24, 2014

[asan] simplify the code that compute the shadow offset; get rid of two internal flags that allowed to override it. The tests pass, but still this change might break asan on some platform not covered by tests. If you see this, please submit a fix with a test.

llvm-svn: 202033

cc92c795

Feb 22, 2014

Include <cctype> for isdigit(). · 61c6df03
Logan Chien authored Feb 22, 2014
```
llvm-svn: 201930
```
61c6df03

[CodeGenPrepare] Move CodeGenPrepare into lib/CodeGen. · a349084a

Quentin Colombet authored Feb 22, 2014

CodeGenPrepare uses extensively TargetLowering which is part of libLLVMCodeGen.
This is a layer violation which would introduce eventually a dependence on
CodeGen in ScalarOpts.

Move CodeGenPrepare into libLLVMCodeGen to avoid that.

Follow-up of <rdar://problem/15519855>

llvm-svn: 201912

a349084a

Feb 21, 2014

Rename a few more DataLayout variables from TD to DL. · 5f57f462
Rafael Espindola authored Feb 21, 2014
```
llvm-svn: 201870
```
5f57f462
Rename a few more DataLayout variables. · 612886fc
Rafael Espindola authored Feb 21, 2014
```
llvm-svn: 201833
```
612886fc

Rename many DataLayout variables from TD to DL. · 37dc9e19

Rafael Espindola authored Feb 21, 2014

I am really sorry for the noise, but the current state where some parts of the
code use TD (from the old name: TargetData) and other parts use DL makes it
hard to write a patch that changes where those variables come from and how
they are passed along.

llvm-svn: 201827

37dc9e19

Make sure that value handle users see the transformation of an indirect call... · 75080ff2

Nick Lewycky authored Feb 20, 2014

Make sure that value handle users see the transformation of an indirect call to a direct call. This is important for the CallGraph iteration. Patch by Björn Steinbrink!

llvm-svn: 201822

75080ff2

Feb 19, 2014

Add back r201608, r201622, r201624 and r201625 · daeafb4c

Rafael Espindola authored Feb 19, 2014

r201608 made llvm corretly handle private globals with MachO. r201622 fixed
a bug in it and r201624 and r201625 were changes for using private linkage,
assuming that llvm would do the right thing.

They all got reverted because r201608 introduced a crash in LTO. This patch
includes a fix for that. The issue was that TargetLoweringObjectFile now has
to be initialized before we can mangle names of private globals. This is
trivially true during the normal codegen pipeline (the asm printer does it),
but LTO has to do it manually.

llvm-svn: 201700

daeafb4c

This reverts commit r201625 and r201624. · 21736038

Rafael Espindola authored Feb 19, 2014

Since r201608 got reverted, it is not safe to use private linkage in these cases
until it is committed back.

llvm-svn: 201688

21736038

X86 CodeGenPrep: sink shufflevectors before shifts · aeb8e06d

Tim Northover authored Feb 19, 2014

On x86, shifting a vector by a scalar is significantly cheaper than shifting a
vector by another fully general vector. Unfortunately, because SelectionDAG
operates on just one basic block at a time, the shufflevector instruction that
reveals whether the right-hand side of a shift *is* really a scalar is often
not visible to CodeGen when it's needed.

This adds another handler to CodeGenPrepare, to sink any useful shufflevector
instructions down to the basic block where they're used, predicated on a target
hook (since on other architectures, doing so will often just introduce extra
real work).

rdar://problem/16063505

llvm-svn: 201655

aeb8e06d

Now that llvm always does the right thing with private, use it. · 8b27c4ed
Rafael Espindola authored Feb 19, 2014
```
llvm-svn: 201625
```
8b27c4ed

Feb 18, 2014
- Rename some member variables from TD to DL. · 7c68bebb
  Rafael Espindola authored Feb 18, 2014
```
TargetData was renamed DataLayout back in r165242.

llvm-svn: 201581
```
  7c68bebb
- GlobalMerge: move "-global-merge" option to the pass itself. · f804c178
  Tim Northover authored Feb 18, 2014
```
It's rather odd to have the flag enabling and disabling this pass only affect a
single target.

llvm-svn: 201559
```
  f804c178
Feb 17, 2014
- fix for null VectorizedValue assertion in the SLP Vectorizer (in function... · 7a463d06
  Gerolf Hoflehner authored Feb 17, 2014
```
fix for null VectorizedValue assertion in the SLP Vectorizer (in function vectorizeTree()). radar://16064178

llvm-svn: 201501
```
  7a463d06
Feb 16, 2014
- fixed typo in comment as my test commit · 282949bf
  Gerolf Hoflehner authored Feb 16, 2014
```
llvm-svn: 201486
```
  282949bf
Feb 14, 2014
- [CodeGenPrepare][AddressingModeMatcher] Give up on type promotion if the · 867c5509
  Quentin Colombet authored Feb 14, 2014
```
transformation does not bring any immediate benefits and introduce an illegal
operation. 

llvm-svn: 201439
```
  867c5509
- Trivial cleanup: reuse existing variable. · 8eee97dd
  Rafael Espindola authored Feb 14, 2014
```
Extracted while trying to understand http://llvm-reviews.chandlerc.com/D1764.

Patch by Matt Arsenault.

llvm-svn: 201425
```
  8eee97dd
- Do more addrspacecast transforms that happen for bitcast. · aa689f50
  Matt Arsenault authored Feb 14, 2014
```
Makes addrspacecast (gep) do addrspacecast (gep) instead.

llvm-svn: 201376
```
  aa689f50
Feb 13, 2014

InstCombine: Replace custom constant folding code with ConstantExpr. · 92040958
Benjamin Kramer authored Feb 13, 2014
```
llvm-svn: 201352
```
92040958
Reduce code duplication resulting from the ConstantVector/ConstantDataVector split. · 989b9293
Benjamin Kramer authored Feb 13, 2014
```
No intended functionality change.

llvm-svn: 201344
```
989b9293

GlobalOpt: Aliases don't have sections, don't copy them when replacing · 22b19da9

Reid Kleckner authored Feb 13, 2014

As defined in LangRef, aliases do not have sections.  However, LLVM's
GlobalAlias class inherits from GlobalValue, which means we can read and
set its section.  We should probably ban that as a separate change,
since it doesn't make much sense for an alias to have a section that
differs from its aliasee.

Fixes PR18757, where the section was being lost on the global in code
from Clang like:

extern "C" {
__attribute__((used, section("CUSTOM"))) static int in_custom_section;
}

Reviewers: rafael.espindola

Differential Revision: http://llvm-reviews.chandlerc.com/D2758

llvm-svn: 201286

22b19da9

Remove a very old instcombine where we would turn sequences of selects into · 883b5add

Owen Anderson authored Feb 12, 2014

logical operations on the i1's driving them. This is a bad idea for every
target I can think of (confirmed with micro tests on all of: x86-64, ARM,
AArch64, Mips, and PowerPC) because it forces the i1 to be materialized into
a general purpose register, whereas consuming it directly into a select generally
allows it to exist only transiently in a predicate or flags register.

Chandler ran a set of performance tests with this change, and reported no
measurable change on x86-64.

llvm-svn: 201275

883b5add