Commits · dfd8fcbb00336a0577637936a1c9bc296be60ecd · Roger Ferrer / llvm-epi-0.8

Apr 20, 2013
- Fix the header comment. · dfd8fcbb
  Nadav Rotem authored Apr 20, 2013
```
llvm-svn: 179928
```
  dfd8fcbb
Apr 16, 2013

SLPVectorizer: Make it a function pass and add code for hoisting the... · b9116e69

Nadav Rotem authored Apr 15, 2013

SLPVectorizer: Make it a function pass and add code for hoisting the vector-gather sequence out of loops.

llvm-svn: 179562

b9116e69

Apr 14, 2013
- Miscellaneous cleanups for VecUtils.h · 7d62ea86
  Benjamin Kramer authored Apr 14, 2013
```
llvm-svn: 179483
```
  7d62ea86
- SLPVectorizer: Add support for trees that don't start at binary operators, and... · 54b413d1
  Nadav Rotem authored Apr 14, 2013
```
SLPVectorizer: Add support for trees that don't start at binary operators, and add the cost of extracting values from the roots of the tree.

llvm-svn: 179475
```
  54b413d1
- SLPVectorizer: add initial support for reduction variable vectorization. · 0b9cf856
  Nadav Rotem authored Apr 14, 2013
```
llvm-svn: 179470
```
  0b9cf856
Apr 12, 2013

SLPVectorizer: add support for vectorization of diamond shaped trees. We now... · 8543ba3e

Nadav Rotem authored Apr 12, 2013

SLPVectorizer: add support for vectorization of diamond shaped trees. We now perform a preliminary traversal of the graph to collect values with multiple users and check where the users came from. 

llvm-svn: 179414

8543ba3e

Apr 09, 2013

Add support for bottom-up SLP vectorization infrastructure. · 2d9dec32

Nadav Rotem authored Apr 09, 2013

This commit adds the infrastructure for performing bottom-up SLP vectorization (and other optimizations) on parallel computations.
The infrastructure has three potential users:

  1. The loop vectorizer needs to be able to vectorize AOS data structures such as (sum += A[i] + A[i+1]).

  2. The BB-vectorizer needs this infrastructure for bottom-up SLP vectorization, because bottom-up vectorization is faster to compute.

  3. A loop-roller needs to be able to analyze consecutive chains and roll them into a loop, in order to reduce code size. A loop roller does not need to create vector instructions, and this infrastructure separates the chain analysis from the vectorization.

This patch also includes a simple (100 LOC) bottom up SLP vectorizer that uses the infrastructure, and can vectorize this code:

void SAXPY(int *x, int *y, int a, int i) {
  x[i]   = a * x[i]   + y[i];
  x[i+1] = a * x[i+1] + y[i+1];
  x[i+2] = a * x[i+2] + y[i+2];
  x[i+3] = a * x[i+3] + y[i+3];
}

llvm-svn: 179117

2d9dec32