Commits · 08e1b8742b8d84a01a70160ab6261163f1c270b1 · Roger Ferrer / llvm-epi-0.8

Jun 24, 2013

Add a flag to defer vectorization into a phase after the inliner and its · 08e1b874

Chandler Carruth authored Jun 24, 2013

CGSCC pass manager. This should insulate the inlining decisions from the
vectorization decisions, however it may have both compile time and code
size problems so it is just an experimental option right now.

Adding this based on a discussion with Arnold and it seems at least
worth having this flag for us to both run some experiments to see if
this strategy is workable. It may solve some of the regressions seen
with the loop vectorizer.

llvm-svn: 184698

08e1b874

Filter out dragonegg when checked out into a projects subdirectory. · 99c46b98

Chandler Carruth authored Jun 24, 2013

There is some hope of eventually supporting a unified build with it, but
until then this lets me (and others) check it out in this location
without things breaking.

llvm-svn: 184697

99c46b98

DebugInfo: enumerator values returned as int64 as they are stored · 62251374
David Blaikie authored Jun 24, 2013
```
llvm-svn: 184694
```
62251374
DebugInfo: add some testing from an overly broad end-to-end test in Clang · 3656123d
David Blaikie authored Jun 24, 2013
```
llvm-svn: 184692
```
3656123d

Revert "LoopVectorize: Use the dependence test utility class" · 58ca945f

Arnold Schwaighofer authored Jun 24, 2013

This reverts commit cbfa1ca993363ca5c4dbf6c913abc957c584cbac.

We are seeing a stage2 and stage3 miscompare on some dragonegg bots.

llvm-svn: 184690

58ca945f

[APFloat] Removed out of date comment from isNormal(). · d851ea06

Michael Gottesman authored Jun 24, 2013

I already finished the isIEEENormal => isNormal transition. So isNormal is now
IEEE-754R compliant.

llvm-svn: 184687

d851ea06

[APFloat] Rename llvm::exponent_t => llvm::APFloat::ExponentType. · 9dc98338

Michael Gottesman authored Jun 24, 2013

exponent_t is only used internally in APFloat and no exponent_t values are
exposed via the APFloat API. In light of such conditions it does not make any
sense to gum up the llvm namespace with said type. Plus it makes it clearer that
exponent_t is associated with APFloat.

llvm-svn: 184686

9dc98338

LoopVectorize: Use the dependence test utility class · b914a7e2

Arnold Schwaighofer authored Jun 24, 2013

We now no longer need alias analysis - the cases that alias analysis would
handle are now handled as accesses with a large dependence distance.

We can now vectorize loops with simple constant dependence distances.

  for (i = 8; i < 256; ++i) {
    a[i] = a[i+4] * a[i+8];
  }

  for (i = 8; i < 256; ++i) {
    a[i] = a[i-4] * a[i-8];
  }

We would be able to vectorize about 200 more loops (in many cases the cost model
instructs us no to) in the test suite now. Results on x86-64 are a wash.

I have seen one degradation in ammp. Interestingly, the function in which we
now vectorize a loop is never executed so we probably see some instruction
cache effects. There is a 2% improvement in h264ref. There is one or the other
TSCV loop kernel that speeds up.

radar://13681598

llvm-svn: 184685

b914a7e2

LoopVectorize: Add utility class for checking dependency among accesses · d5179767

Arnold Schwaighofer authored Jun 24, 2013

This class checks dependences by subtracting two Scalar Evolution access
functions allowing us to catch very simple linear dependences.

The checker assumes source order in determining whether vectorization is safe.
We currently don't reorder accesses.
Positive true dependencies need to be a multiple of VF otherwise we impede
store-load forwarding.

llvm-svn: 184684

d5179767

LoopVectorize: Add utility class for building sets of dependent accesses · d5741969

Arnold Schwaighofer authored Jun 24, 2013

Sets of dependent accesses are built by unioning sets based on underlying
objects. This class will be used by the upcoming dependence checker.

llvm-svn: 184683

d5741969

SLP Vectorizer: Add support for vectorizing parts of the tree. · 210e86d7

Nadav Rotem authored Jun 24, 2013

Untill now we detected the vectorizable tree and evaluated the cost of the
entire tree.  With this patch we can decide to trim-out branches of the tree
that are not profitable to vectorizer.

Also, increase the max depth from 6 to 12. In the worse possible case where all
of the code is made of diamond-shaped graph this can bring the cost to 2**10,
but diamonds are not very common.

llvm-svn: 184681

210e86d7

Fix tail merging to assign the (more) correct BasicBlock when splitting. · 97a1d7c4

Andrew Trick authored Jun 24, 2013

This makes it possible to write unit tests that are less susceptible
to minor code motion, particularly copy placement. block-placement.ll
covers this case with -pre-RA-sched=source which will soon be
default. One incorrectly named block is already fixed, but without
this fix, enabling new coalescing and scheduling would cause more
failures.

llvm-svn: 184680

97a1d7c4

Jun 23, 2013
- SLP Vectorizer: Fix a bug in the code that does CSE on the generated gather sequences. · 0323925d
  Nadav Rotem authored Jun 23, 2013
```
Make sure that we don't replace and RAUW two sequences if one does not dominate the other.

llvm-svn: 184674
```
  0323925d
- SLP Vectorizer: Erase instructions outside the vectorizeTree method. · 78428401
  Nadav Rotem authored Jun 23, 2013
```
The RAII builder location guard is saving a reference to instructions, so we can't erase instructions during vectorization.

llvm-svn: 184671
```
  78428401
- DebugInfo: PR14404: Avoid truncating 64 bit values into 32 bits for ULEB128/SLEB128 generation · 5acff7e6
  David Blaikie authored Jun 23, 2013
```
llvm-svn: 184669
```
  5acff7e6
- AArch64: fix overzealous NEXTing for Windows testing. · 295f049d
  Tim Northover authored Jun 23, 2013
```
llvm-svn: 184667
```
  295f049d
- Add MI-Sched support for x86 macro fusion. · 47740deb
  Andrew Trick authored Jun 23, 2013
```
This is an awful implementation of the target hook. But we don't have
abstractions yet for common machine ops, and I don't see any quick way
to make it table-driven.

llvm-svn: 184664
```
  47740deb
- SLP Vectorizer: Implement a simple CSE optimization for the gather sequences. · eb65e67e
  Nadav Rotem authored Jun 23, 2013
```
llvm-svn: 184660
```
  eb65e67e
Jun 22, 2013

SLP Vectorizer: Implement multi-block slp-vectorization. · 80de0a28

Nadav Rotem authored Jun 22, 2013

Rewrote the SLP-vectorization as a whole-function vectorization pass. It is now able to vectorize chains across multiple basic blocks.
It still does not vectorize PHIs, but this should be easy to do now that we scan the entire function.
I removed the support for extracting values from trees.
We are now able to vectorize more programs, but there are some serious regressions in many workloads (such as flops-6 and mandel-2).

llvm-svn: 184647

80de0a28

Replace with a shorter test case produced by Doug Gillmore. · de085b2a
Reed Kotler authored Jun 22, 2013
```
llvm-svn: 184645
```
de085b2a
DebugInfo: Support (using GNU extensions) for template template parameters and parameter packs · 2b380232
David Blaikie authored Jun 22, 2013
```
llvm-svn: 184643
```
2b380232
The getRegForInlineAsmConstraint function should only accept MVT value types. · 295bd43a
Chad Rosier authored Jun 22, 2013
```
llvm-svn: 184642
```
295bd43a
Revert "FunctionAttrs: Merge attributes once instead of doing it for every argument." · 40d7f354
Benjamin Kramer authored Jun 22, 2013
```
It doesn't work as I intended it to.  This reverts commit r184638.

llvm-svn: 184641
```
40d7f354
FunctionAttrs: Merge attributes once instead of doing it for every argument. · 76b7bd0e
Benjamin Kramer authored Jun 22, 2013
```
It has become an expensive operation. No functionality change.

llvm-svn: 184638
```
76b7bd0e
RelocVisitor: Add another PPC64 relocation that occurs in dwarf output. · b5ab3600
Benjamin Kramer authored Jun 22, 2013
```
Should bring the ppc64 buildbot back to life.

llvm-svn: 184633
```
b5ab3600
Create the file with the right permissions instead of setting it afterwards. · b046eedb
Rafael Espindola authored Jun 22, 2013
```
Removes the last use of PathV1.h in llvm-ar.

llvm-svn: 184630
```
b046eedb

[yaml2obj][ELF] Make symbol table top-level key. · 82177573

Sean Silva authored Jun 22, 2013

Although in reality the symbol table in ELF resides in a section, the
standard requires that there be no more than one SHT_SYMTAB. To enforce
this constraint, it is cleaner to group all the symbols under a
top-level `Symbols` key on the object file.

llvm-svn: 184627

82177573

[yaml2obj][ELF] Narrow parameter. · 7a0c3a6f
Sean Silva authored Jun 22, 2013
```
The full ELFYAML::Section isn't needed.

llvm-svn: 184626
```
7a0c3a6f

[yaml2obj][ELF] Don't special case writing these. · 7d617222

Sean Silva authored Jun 22, 2013

Just add them to the vector of section headers like the rest of the
section headers.

llvm-svn: 184624

7d617222

[yaml2obj][ELF] Make this "type switch" actually readable. · 11caebaa
Sean Silva authored Jun 22, 2013
```
llvm-svn: 184623
```
11caebaa

[yaml2obj][ELF] Align section contents in the output. · d93323f5

Sean Silva authored Jun 22, 2013

The improperly aligned section content in the output was causing
buildbot failures. This should fix them.

Incidentally, this results in a simpler and more robust API for
ContiguousBlobAccumulator.

llvm-svn: 184621

d93323f5

Prevent LiveRangeEdit from deleting bundled instructions. · cbd7305d

Andrew Trick authored Jun 22, 2013

We have no targets on trunk that bundle before regalloc. However, we
have been advertising regalloc as bundle safe for use with out-of-tree
targets. We need to at least contain the parts of the code that are
still unsafe.

llvm-svn: 184620

cbd7305d

Reapply documentation changes from r184584. · f51f7186
Benjamin Kramer authored Jun 21, 2013
```
llvm-svn: 184609
```
f51f7186

This was a nifty test, but remove it. · e5c41896

Sean Silva authored Jun 21, 2013

It wouldn't really test anything that doesn't already have a more
targeted test:
`yaml2obj-elf-section-basic.yaml`:
  Already tests that section content is correctly passed though.
`yaml2obj-elf-symbol-basic.yaml` (this file):
  Tests that the st_value and st_size attributes of `main` are set
  correctly.
Between those two tests, disassembling the file doesn't really add
anything, so just remove mention of disassembling the file.

llvm-svn: 184607

e5c41896

Revert "Put r184469 disassembler test back on X86" · 2d47ffd3

Sean Silva authored Jun 21, 2013

This reverts commit r184602. In an upcoming commit, I will just remove
the disassembler part of the test; it was mostly just a "nifty" thing
marking a milestone but it doesn't test anything that isn't tested
elsewhere.

llvm-svn: 184606

2d47ffd3

DebugInfo: Don't lose unreferenced non-trivial by-value parameters · 97c6c5bd

David Blaikie authored Jun 21, 2013

A FastISel optimization was causing us to emit no information for such
parameters & when they go missing we end up emitting a different
function type. By avoiding that shortcut we not only get types correct
(very important) but also location information (handy) - even if it's
only live at the start of a function & may be clobbered later.

Reviewed/discussion by Evan Cheng & Dan Gohman.

llvm-svn: 184604

97c6c5bd

Put r184469 disassembler test back on X86 · fe941943
Renato Golin authored Jun 21, 2013
```
llvm-svn: 184602
```
fe941943
Convert some uses of PathV1.h in ArchiveWriter.cpp. · e88d90ab
Rafael Espindola authored Jun 21, 2013
```
llvm-svn: 184599
```
e88d90ab

Jun 21, 2013

[yaml2obj][ELF] Don't do disassembly in this test. · 8068ca72

Sean Silva authored Jun 21, 2013

This was causing buildbot failures when build without X86 support.

Is there a way to conditionalize the test on the X86 target being
present?

llvm-svn: 184597

8068ca72

[objc-arc-opts] Make IsTrackingImpreciseReleases a const method. · 9799cf7f
Michael Gottesman authored Jun 21, 2013
```
Thanks to Bill Wendling for pointing this out!

llvm-svn: 184593
```
9799cf7f