Commits · f989929cf0c0f0c59fcdaaefc9ba2ebf552a5898 · Roger Ferrer / llvm-epi-0.8

Jun 24, 2013

[APFloat] Removed trailing whitespace from unittests. · f989929c
Michael Gottesman authored Jun 24, 2013
```
llvm-svn: 184715
```
f989929c
[APFloat] Added a large unittest for APFloat.add that checks that special... · e45b1083
Michael Gottesman authored Jun 24, 2013
```
[APFloat] Added a large unittest for APFloat.add that checks that special values are computed correctly.

llvm-svn: 184714
```
e45b1083
[APFloat] Added support for parsing float strings which contain {inf,-inf,NaN,-NaN}. · 40e8a187
Michael Gottesman authored Jun 24, 2013
```
llvm-svn: 184713
```
40e8a187
[APFloat] Added make{Zero,Inf} methods and implemented get{Zero,Inf} on top of them. · c4facdf3
Michael Gottesman authored Jun 24, 2013
```
llvm-svn: 184712
```
c4facdf3

[APFloat] Removed an assert from significandParts() which says that one can... · f0e8cd1a

Michael Gottesman authored Jun 24, 2013

[APFloat] Removed an assert from significandParts() which says that one can only access the significand of FiniteNonZero/NaN floats.

The method significandParts() is a helper method meant to ease access to
APFloat's significand by allowing the user to not need to be aware of whether or
not the APFloat is using memory allocated in the instance itself or in an
external array.

This assert says that one can only access the significand of FiniteNonZero/NaN
floats. This makes it cumbersome and, more importantly, dangerous when one wishes
to zero out the significand of a zero/infinity value, since one will have to deal
with the aforementioned quandary related to how the memory in APFloat is
allocated.

llvm-svn: 184711

f0e8cd1a

[APFloat] Rename macro convolve => PackCategoriesIntoKey so that it is clear... · 9b877e18

Michael Gottesman authored Jun 24, 2013

[APFloat] Rename macro convolve => PackCategoriesIntoKey so that it is clear what APFloat is actually using said macro for.

In the context of APFloat, seeing a macro called convolve suggests that APFloat
is using said value in some sort of convolution somewhere in the source code.
This is misleading.

I also added a documentation comment to the macro.
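
As a rough illustration of what such a macro is for, the sketch below packs two category values into a single integer so that one `switch` can dispatch on the pair. The enum values, the multiplier of 4, and the `describeAdd` function are assumptions made for this example and are not taken from the APFloat source.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for a four-valued floating-point category enum.
enum Category { Infinity = 0, NaN = 1, Normal = 2, Zero = 3 };

// Pack an (lhs, rhs) category pair into one integer key. With four
// categories, lhs * 4 + rhs is distinct for every pair.
#define PackCategoriesIntoKey(lhs, rhs) ((lhs) * 4 + (rhs))

// Illustrative only: describe the outcome of adding two values by category.
std::string describeAdd(Category lhs, Category rhs) {
  switch (PackCategoriesIntoKey(lhs, rhs)) {
  case PackCategoriesIntoKey(Infinity, Infinity):
    return "infinity or NaN, depending on signs";
  case PackCategoriesIntoKey(Infinity, Normal):
  case PackCategoriesIntoKey(Infinity, Zero):
  case PackCategoriesIntoKey(Normal, Infinity):
  case PackCategoriesIntoKey(Zero, Infinity):
    return "infinity";
  case PackCategoriesIntoKey(Zero, Zero):
    return "zero";
  case PackCategoriesIntoKey(Normal, Zero):
  case PackCategoriesIntoKey(Zero, Normal):
    return "the nonzero operand";
  case PackCategoriesIntoKey(Normal, Normal):
    return "requires significand arithmetic";
  default: // every remaining pair has at least one NaN operand
    return "NaN";
  }
}
```

The name makes the intent visible at each `case` label: the key is a lookup index for a pair of categories, not the result of any convolution.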

llvm-svn: 184710

9b877e18

Add -mcpu to some unit tests that only fail on certain hosts. · c08bd450
Andrew Trick authored Jun 24, 2013
```
llvm-svn: 184709
```
c08bd450

ARM: check predicate bits for thumb instructions · 8449c0d5

Amaury de la Vieuville authored Jun 24, 2013

When encoded in Thumb, VFP instructions and VMOV/VDUP between scalar and
core registers must have their predicate bits set to 0b1110.
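
A minimal sketch of such a check, assuming the 32-bit Thumb encoding is held as a single word with the first halfword in the upper 16 bits, so that the predicate field occupies bits 31-28. The helper name `hasAlwaysPredicate` is hypothetical and is not the decoder's actual function.

```cpp
#include <cassert>
#include <cstdint>

// The predicate value the commit message requires: 0b1110.
constexpr uint32_t kAlwaysPredicate = 0xE;

// Hypothetical helper: true when bits 31-28 of the word equal 0b1110.
// Assumes the first Thumb halfword sits in the upper 16 bits of insn.
constexpr bool hasAlwaysPredicate(uint32_t insn) {
  return (insn >> 28) == kAlwaysPredicate;
}

// Compile-time spot checks on two arbitrary bit patterns.
static_assert(hasAlwaysPredicate(0xEEB00A40u), "top nibble is 0b1110");
static_assert(!hasAlwaysPredicate(0x0EB00A40u), "top nibble is 0b0000");
```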

llvm-svn: 184707

8449c0d5

ARM: rGPR is meant to be unpredictable, not undefined · 8175bda3
Amaury de la Vieuville authored Jun 24, 2013
```
llvm-svn: 184706
```
8175bda3

Temporarily enable MI-Sched on X86. · 5a1e0af8

Andrew Trick authored Jun 24, 2013

Sorry for the unit test churn. I'll try to make the change permanent
next time.

llvm-svn: 184705

5a1e0af8

ARM: fix thumb1 nop decoding · f2f00b4e

Amaury de la Vieuville authored Jun 24, 2013

In thumb1, NOP is a pseudo-instruction equivalent to mov r8, r8.
However, the disassembler should not use this alias.

llvm-svn: 184703

f2f00b4e

ARM: fix IT decoding · 2f0ac8d9
Amaury de la Vieuville authored Jun 24, 2013
```
mask == 0 -> UNPRED

llvm-svn: 184702
```
2f0ac8d9
ARM: enable decoding of pc-relative PLD/PLI · 4b6c076d
Amaury de la Vieuville authored Jun 24, 2013
```
llvm-svn: 184701
```
4b6c076d

Add a flag to defer vectorization into a phase after the inliner and its · 08e1b874

Chandler Carruth authored Jun 24, 2013

CGSCC pass manager. This should insulate the inlining decisions from the
vectorization decisions; however, it may have both compile-time and code-size
problems, so it is just an experimental option right now.

I am adding this based on a discussion with Arnold. It seems at least worth
having this flag so that we can both run some experiments to see whether
this strategy is workable. It may solve some of the regressions seen
with the loop vectorizer.

llvm-svn: 184698

08e1b874

Filter out dragonegg when checked out into a projects subdirectory. · 99c46b98

Chandler Carruth authored Jun 24, 2013

There is some hope of eventually supporting a unified build with it, but
until then this lets me (and others) check it out in this location
without things breaking.

llvm-svn: 184697

99c46b98

DebugInfo: enumerator values returned as int64 as they are stored · 62251374
David Blaikie authored Jun 24, 2013
```
llvm-svn: 184694
```
62251374
DebugInfo: add some testing from an overly broad end-to-end test in Clang · 3656123d
David Blaikie authored Jun 24, 2013
```
llvm-svn: 184692
```
3656123d

Revert "LoopVectorize: Use the dependence test utility class" · 58ca945f

Arnold Schwaighofer authored Jun 24, 2013

This reverts commit cbfa1ca993363ca5c4dbf6c913abc957c584cbac.

We are seeing a stage2 and stage3 miscompare on some dragonegg bots.

llvm-svn: 184690

58ca945f

[APFloat] Removed out of date comment from isNormal(). · d851ea06

Michael Gottesman authored Jun 24, 2013

I already finished the isIEEENormal => isNormal transition. So isNormal is now
IEEE-754R compliant.

llvm-svn: 184687

d851ea06

[APFloat] Rename llvm::exponent_t => llvm::APFloat::ExponentType. · 9dc98338

Michael Gottesman authored Jun 24, 2013

exponent_t is only used internally in APFloat and no exponent_t values are
exposed via the APFloat API. In light of such conditions it does not make any
sense to gum up the llvm namespace with said type. Plus it makes it clearer that
exponent_t is associated with APFloat.

llvm-svn: 184686

9dc98338

LoopVectorize: Use the dependence test utility class · b914a7e2

Arnold Schwaighofer authored Jun 24, 2013

We now no longer need alias analysis - the cases that alias analysis would
handle are now handled as accesses with a large dependence distance.

We can now vectorize loops with simple constant dependence distances.

  for (i = 8; i < 256; ++i) {
    a[i] = a[i+4] * a[i+8];
  }

  for (i = 8; i < 256; ++i) {
    a[i] = a[i-4] * a[i-8];
  }

We would be able to vectorize about 200 more loops (in many cases the cost model
instructs us not to) in the test suite now. Results on x86-64 are a wash.

I have seen one degradation in ammp. Interestingly, the function in which we
now vectorize a loop is never executed, so we probably see some instruction
cache effects. There is a 2% improvement in h264ref. One or two TSVC loop
kernels also speed up.

radar://13681598

llvm-svn: 184685

b914a7e2

LoopVectorize: Add utility class for checking dependency among accesses · d5179767

Arnold Schwaighofer authored Jun 24, 2013

This class checks dependences by subtracting two Scalar Evolution access
functions allowing us to catch very simple linear dependences.

The checker assumes source order in determining whether vectorization is safe.
We currently don't reorder accesses.
Positive true dependence distances need to be a multiple of VF; otherwise we
impede store-load forwarding.
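
The idea can be sketched in a much simplified form. The model below is an assumption made for illustration: it works in elements rather than bytes, considers one load followed by one store per iteration, and uses the hypothetical names `AccessPair` and `isSafeForVF`. It is not the checker added by this commit.

```cpp
#include <cassert>
#include <cstdint>

// Simplified model: each iteration i loads a[i + loadOffset] and then
// stores a[i + storeOffset], in that source order.
struct AccessPair {
  int64_t loadOffset;
  int64_t storeOffset;
};

// Hypothetical check: may the loop be vectorized with factor vf?
bool isSafeForVF(const AccessPair &p, int64_t vf) {
  assert(vf > 0 && "vectorization factor must be positive");
  // Subtracting the two linear access functions leaves a constant distance.
  int64_t distance = p.storeOffset - p.loadOffset;
  // Zero or negative: the load never reads a value stored by an earlier
  // iteration, and source order keeps each load ahead of its store.
  if (distance <= 0)
    return true;
  // Positive: a true dependence. Require a multiple of vf, which also
  // guarantees the distance is at least vf.
  return distance % vf == 0;
}
```

Under this model, `a[i] = a[i-4]` has distance 4 and passes for a factor of 4 or 2, while a distance of 6 is rejected for a factor of 4.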

llvm-svn: 184684

d5179767

LoopVectorize: Add utility class for building sets of dependent accesses · d5741969

Arnold Schwaighofer authored Jun 24, 2013

Sets of dependent accesses are built by unioning sets based on underlying
objects. This class will be used by the upcoming dependence checker.
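
A minimal sketch of building such sets with a hand-rolled union-find. The integer access and object ids, and the names `AccessSets` and `buildDependentSets`, are hypothetical stand-ins for illustration and do not mirror the class added by this commit.

```cpp
#include <cassert>
#include <numeric>
#include <unordered_map>
#include <vector>

// Minimal union-find over access indices.
class AccessSets {
  std::vector<int> parent;

public:
  explicit AccessSets(int n) : parent(n) {
    std::iota(parent.begin(), parent.end(), 0); // each access starts alone
  }
  int find(int x) {
    while (parent[x] != x) {
      parent[x] = parent[parent[x]]; // path halving
      x = parent[x];
    }
    return x;
  }
  void unite(int a, int b) { parent[find(a)] = find(b); }
};

// underlyingObjects[i] lists the (hypothetical) object ids that access i
// may be based on. Accesses sharing any object end up in the same set.
AccessSets
buildDependentSets(const std::vector<std::vector<int>> &underlyingObjects) {
  const int n = static_cast<int>(underlyingObjects.size());
  AccessSets sets(n);
  std::unordered_map<int, int> firstAccessForObject;
  for (int access = 0; access < n; ++access) {
    for (int object : underlyingObjects[access]) {
      auto inserted = firstAccessForObject.emplace(object, access);
      if (!inserted.second) // object seen before: merge with its first user
        sets.unite(access, inserted.first->second);
    }
  }
  return sets;
}
```

Only accesses that land in the same set need to be checked against each other, which keeps the later pairwise dependence checks small.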

llvm-svn: 184683

d5741969

SLP Vectorizer: Add support for vectorizing parts of the tree. · 210e86d7

Nadav Rotem authored Jun 24, 2013

Until now we detected the vectorizable tree and evaluated the cost of the
entire tree.  With this patch we can decide to trim out branches of the tree
that are not profitable to vectorize.

Also, increase the max depth from 6 to 12. In the worst possible case, where all
of the code is made of diamond-shaped graphs, this can bring the cost to 2**10,
but diamonds are not very common.

llvm-svn: 184681

210e86d7

Fix tail merging to assign the (more) correct BasicBlock when splitting. · 97a1d7c4

Andrew Trick authored Jun 24, 2013

This makes it possible to write unit tests that are less susceptible
to minor code motion, particularly copy placement. block-placement.ll
covers this case with -pre-RA-sched=source which will soon be
default. One incorrectly named block is already fixed, but without
this fix, enabling new coalescing and scheduling would cause more
failures.

llvm-svn: 184680

97a1d7c4

Jun 23, 2013
- SLP Vectorizer: Fix a bug in the code that does CSE on the generated gather sequences. · 0323925d
  Nadav Rotem authored Jun 23, 2013
```
Make sure that we don't replace and RAUW two sequences if one does not dominate the other.

llvm-svn: 184674
```
  0323925d
- SLP Vectorizer: Erase instructions outside the vectorizeTree method. · 78428401
  Nadav Rotem authored Jun 23, 2013
```
The RAII builder location guard is saving a reference to instructions, so we can't erase instructions during vectorization.

llvm-svn: 184671
```
  78428401
- DebugInfo: PR14404: Avoid truncating 64 bit values into 32 bits for ULEB128/SLEB128 generation · 5acff7e6
  David Blaikie authored Jun 23, 2013
```
llvm-svn: 184669
```
  5acff7e6
- AArch64: fix overzealous NEXTing for Windows testing. · 295f049d
  Tim Northover authored Jun 23, 2013
```
llvm-svn: 184667
```
  295f049d
- Add MI-Sched support for x86 macro fusion. · 47740deb
  Andrew Trick authored Jun 23, 2013
```
This is an awful implementation of the target hook. But we don't have
abstractions yet for common machine ops, and I don't see any quick way
to make it table-driven.

llvm-svn: 184664
```
  47740deb
- SLP Vectorizer: Implement a simple CSE optimization for the gather sequences. · eb65e67e
  Nadav Rotem authored Jun 23, 2013
```
llvm-svn: 184660
```
  eb65e67e
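
The PR14404 entry in the list above (5acff7e6) concerns ULEB128/SLEB128 generation. For readers unfamiliar with the format, here is a minimal sketch of unsigned LEB128 encoding that takes a full 64-bit input. The function name `encodeULEB128` and its vector-returning signature are chosen for this example and are not claimed to match LLVM's own helper.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Encode an unsigned 64-bit value as ULEB128: seven payload bits per byte,
// least significant group first, high bit set on every byte but the last.
std::vector<uint8_t> encodeULEB128(uint64_t value) {
  std::vector<uint8_t> out;
  do {
    uint8_t byte = value & 0x7f;
    value >>= 7;
    if (value != 0)
      byte |= 0x80; // more bytes follow
    out.push_back(byte);
  } while (value != 0);
  return out;
}
```

If a value such as `1ULL << 32` were first narrowed with `static_cast<uint32_t>`, it would become 0 and encode as the single byte 0x00, whereas the 64-bit path produces five bytes. That is the kind of truncation the commit title describes.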
Jun 22, 2013
- SLP Vectorizer: Implement multi-block slp-vectorization. · 80de0a28
  Nadav Rotem authored Jun 22, 2013
```
Rewrote the SLP-vectorization as a whole-function vectorization pass. It is now able to vectorize chains across multiple basic blocks.
It still does not vectorize PHIs, but this should be easy to do now that we scan the entire function.
I removed the support for extracting values from trees.
We are now able to vectorize more programs, but there are some serious regressions in many workloads (such as flops-6 and mandel-2).

llvm-svn: 184647
```
  80de0a28
- Replace with a shorter test case produced by Doug Gillmore. · de085b2a
  Reed Kotler authored Jun 22, 2013
```
llvm-svn: 184645
```
  de085b2a
- DebugInfo: Support (using GNU extensions) for template template parameters and parameter packs · 2b380232
  David Blaikie authored Jun 22, 2013
```
llvm-svn: 184643
```
  2b380232
- The getRegForInlineAsmConstraint function should only accept MVT value types. · 295bd43a
  Chad Rosier authored Jun 22, 2013
```
llvm-svn: 184642
```
  295bd43a
- Revert "FunctionAttrs: Merge attributes once instead of doing it for every argument." · 40d7f354
  Benjamin Kramer authored Jun 22, 2013
```
It doesn't work as I intended it to.  This reverts commit r184638.

llvm-svn: 184641
```
  40d7f354
- FunctionAttrs: Merge attributes once instead of doing it for every argument. · 76b7bd0e
  Benjamin Kramer authored Jun 22, 2013
```
It has become an expensive operation. No functionality change.

llvm-svn: 184638
```
  76b7bd0e
- RelocVisitor: Add another PPC64 relocation that occurs in dwarf output. · b5ab3600
  Benjamin Kramer authored Jun 22, 2013
```
Should bring the ppc64 buildbot back to life.

llvm-svn: 184633
```
  b5ab3600
- Create the file with the right permissions instead of setting it afterwards. · b046eedb
  Rafael Espindola authored Jun 22, 2013
```
Removes the last use of PathV1.h in llvm-ar.

llvm-svn: 184630
```
  b046eedb
- [yaml2obj][ELF] Make symbol table top-level key. · 82177573
  Sean Silva authored Jun 22, 2013
```
Although in reality the symbol table in ELF resides in a section, the
standard requires that there be no more than one SHT_SYMTAB. To enforce
this constraint, it is cleaner to group all the symbols under a
top-level `Symbols` key on the object file.

llvm-svn: 184627
```
  82177573