Commits · b9116e69666de1d6da41af6554beb9ceb2835db8 · Roger Ferrer / llvm-epi-0.8

Apr 16, 2013

SLPVectorizer: Make it a function pass and add code for hoisting the... · b9116e69

Nadav Rotem authored Apr 15, 2013

SLPVectorizer: Make it a function pass and add code for hoisting the vector-gather sequence out of loops.

llvm-svn: 179562

b9116e69

Apr 15, 2013
- Fix silly typo that broke big endian hosts. · 6e221097
  Rafael Espindola authored Apr 15, 2013
```
llvm-svn: 179551
```
  6e221097
- Fix endianness on some MSVC versions. · a652799f
  Rafael Espindola authored Apr 15, 2013
```
Looks like it was evaluating undef == undef to true.

llvm-svn: 179549
```
  a652799f
- R600/SI: Emit config values in register value pairs. · cb97e3ac
  Tom Stellard authored Apr 15, 2013
```
Instead of emitting config values in a predefined order, the code
emitter will now emit a 32-bit register index followed by the 32-bit
config value.

llvm-svn: 179546
```
  cb97e3ac
- R600/SI: Emit configuration value in the .AMDGPU.config ELF section · 3a7beafb
  Tom Stellard authored Apr 15, 2013
```
llvm-svn: 179545
```
  3a7beafb
- R600: Emit ELF formatted code rather than raw ISA. · 9991659f
  Tom Stellard authored Apr 15, 2013
```
llvm-svn: 179544
```
  9991659f
- Fix a typo in comment. · 0f38c1e3
  Jim Grosbach authored Apr 15, 2013
```
llvm-svn: 179542
```
  0f38c1e3
- Simplify the MCInst operator iterator declaration. · 2783ca4f
  Jim Grosbach authored Apr 15, 2013
```
llvm-svn: 179541
```
  2783ca4f
- Grammar and punctuation fixes. · 02fc72d4
  John Criswell authored Apr 15, 2013
```
No content changes.

llvm-svn: 179540
```
  02fc72d4
- Try to fix the mingw builds. · 732dc415
  Rafael Espindola authored Apr 15, 2013
```
llvm-svn: 179536
```
  732dc415
- Fix bit size of v64i8 and v32i16 vector types. · 0b06796e
  Arnold Schwaighofer authored Apr 15, 2013
```
Patch by Cameron McInally <cameron.mcinally@nyu.edu>.

llvm-svn: 179535
```
  0b06796e
- Remove getters now that we can specialize structs on the host endianness. · 2fa9f53c
  Rafael Espindola authored Apr 15, 2013
```
llvm-svn: 179534
```
  2fa9f53c
- Avoid outputting temporary test file into source tree. · 943e9293
  Tim Northover authored Apr 15, 2013
```
llvm-svn: 179532
```
  943e9293
- Remove unused function. · 16b0cc39
  Rafael Espindola authored Apr 15, 2013
```
llvm-svn: 179530
```
  16b0cc39
- Make the host endianness check an integer constant expression. · 41cb64f4
  Rafael Espindola authored Apr 15, 2013
```
I will remove the isBigEndianHost function once I update clang.

The ifdef logic is designed to
* not use configure/cmake to avoid breaking -arch i686 -arch ppc.
* default to little endian
* be as small as possible

It looks like sys/endian.h is the preferred header on most modern BSD systems,
but it is better to change this in a followup patch as machine/endian.h is
available on FreeBSD, OpenBSD, NetBSD and OS X.

llvm-svn: 179527
```
  41cb64f4
- Replace uses of the deprecated std::auto_ptr with OwningPtr. · b23ea72e
  Andy Gibbs authored Apr 15, 2013
```
This is a rework of the broken parts in r179373 which were subsequently reverted in r179374 due to incompatibility with C++98 compilers.  This version should be ok under C++98.

llvm-svn: 179520
```
  b23ea72e
- Enable all targets by default on Visual Studio. · 865f4bcb
  Tim Northover authored Apr 15, 2013
```
llvm-svn: 179518
```
  865f4bcb
- Revert "Recommit r179497 after fixing uninitialized variable." until · 13637e90
  Eric Christopher authored Apr 15, 2013
```
I can fix the testcases here:

http://lab.llvm.org:8011/builders/clang-native-arm-cortex-a9/builds/6952

This reverts commit r179512 due to testcases specifying triples
that they didn't actually mean and causing failures on other platforms.

llvm-svn: 179513
```
  13637e90
- Recommit r179497 after fixing uninitialized variable. · fc2beaa1
  Eric Christopher authored Apr 15, 2013
```
llvm-svn: 179512
```
  fc2beaa1
- Document our desire to enable the loop vectorizer on -Os in future releases. · a440a356
  Nadav Rotem authored Apr 15, 2013
```
llvm-svn: 179511
```
  a440a356
- Docs: merge the description of the BB and SLP vectorizers and document the... · 57da1fdd
  Nadav Rotem authored Apr 15, 2013
```
Docs: merge the description of the BB and SLP vectorizers and document the -fslp-vectorize-aggressive flag.

llvm-svn: 179510
```
  57da1fdd
- Add an option -vectorize-slp-aggressive for running the BB vectorizer. Make... · d4dcc003
  Nadav Rotem authored Apr 15, 2013
```
Add an option -vectorize-slp-aggressive for running the BB vectorizer. Make -fslp-vectorize run the slp-vectorizer.

llvm-svn: 179508
```
  d4dcc003
- Rename the slp-vectorizer clang/llvm flags. No functionality change. · a1e5e44e
  Nadav Rotem authored Apr 15, 2013
```
llvm-svn: 179505
```
  a1e5e44e
- SLPVectorizer: Add support for vectorizing trees that start at compare instructions. · 5d393c41
  Nadav Rotem authored Apr 15, 2013
```
llvm-svn: 179504
```
  5d393c41
- fix include path in doc Extending LLVM · f3076492
  Jia Liu authored Apr 15, 2013
```
llvm-svn: 179503
```
  f3076492
- Mark all PPC comparison instructions as not having side effects · 95e6ea69
  Hal Finkel authored Apr 15, 2013
```
Now that the CR spilling issues have been resolved, we can remove the
unmodeled-side-effect attributes from the comparison instructions (and also
mark them as isCompare). By allowing these, by default, to have unmodeled side
effects, we were hiding problems with CR spilling; but everything seems much
happier now.

llvm-svn: 179502
```
  95e6ea69
- Fix PPC64 CR spill location for callee-saved registers · 6736988a
  Hal Finkel authored Apr 15, 2013
```
This fixes an ABI bug for non-Darwin PPC64. For the callee-saved condition
registers, the spill location is specified relative to the stack pointer (SP +
8). However, this is not relative to the SP after the new stack frame is
established, but instead relative to the caller's stack pointer (it is stored
into the linkage area of the parent's stack frame).

So, like with the link register, we don't directly spill the CRs with other
callee-saved registers, but just mark them to be spilled during prologue
generation.

In practice, this reverts r179457 for PPC64 (but leaves it in place for PPC32).

llvm-svn: 179500
```
  6736988a
- Revert "Remove some unused triple and data layout." · 1f140317
  Eric Christopher authored Apr 14, 2013
```
This reverts commit r179497 and the accompanying commit as it broke random platforms that aren't osx.

llvm-svn: 179499
```
  1f140317
- Remove some unused triple and data layout. · 4eebd14a
  Eric Christopher authored Apr 14, 2013
```
llvm-svn: 179498
```
  4eebd14a
- If we've specified a triple on the command line then go ahead · e1876a2b
  Eric Christopher authored Apr 14, 2013
```
and use that as the default triple for the module and target
data layout.

llvm-svn: 179497
```
  e1876a2b
Apr 14, 2013
- Use object file specific section type for initial text section · 334c7bc7
  Nico Rieck authored Apr 14, 2013
```
llvm-svn: 179494
```
  334c7bc7
- Reorders two transforms that collide with each other · 1fae1955
  David Majnemer authored Apr 14, 2013
```
One performs: (X == 13 | X == 14) -> X-13 <u 2
The other: (A == C1 || A == C2) -> (A & ~(C1 ^ C2)) == C1

The problem is that there are certain values of C1 and C2 that
trigger both transforms but the first one blocks out the second,
this generates suboptimal code.

Reordering the transforms should be better in every case and
allows us to do interesting stuff like turn:
  %shr = lshr i32 %X, 4
  %and = and i32 %shr, 15
  %add = add i32 %and, -14
  %tobool = icmp ne i32 %add, 0

into:
  %and = and i32 %X, 240
  %tobool = icmp ne i32 %and, 224

llvm-svn: 179493
```
  1fae1955
- Make the command line triple match the module triple. · 6ebddae1
  Nadav Rotem authored Apr 14, 2013
```
llvm-svn: 179492
```
  6ebddae1
- Miscellaneous cleanups for VecUtils.h · 7d62ea86
  Benjamin Kramer authored Apr 14, 2013
```
llvm-svn: 179483
```
  7d62ea86
- Document the SLP infrastructure. · efa56e18
  Nadav Rotem authored Apr 14, 2013
```
llvm-svn: 179480
```
  efa56e18
- SLP: Document the scalarization cost method. · 3403c115
  Nadav Rotem authored Apr 14, 2013
```
llvm-svn: 179479
```
  3403c115
- Document the decision to assume that the cost of floats is twice as much as integers. · 0db0690a
  Nadav Rotem authored Apr 14, 2013
```
llvm-svn: 179478
```
  0db0690a
- Use i32 for all SPARC shift amounts, even in 64-bit mode. · eed1072f
  Jakob Stoklund Olesen authored Apr 14, 2013
```
Test case by llvm-stress.

llvm-svn: 179477
```
  eed1072f
- Remove unused function attributes. · 029208ce
  Nadav Rotem authored Apr 14, 2013
```
llvm-svn: 179476
```
  029208ce
- SLPVectorizer: Add support for trees that don't start at binary operators, and... · 54b413d1
  Nadav Rotem authored Apr 14, 2013
```
SLPVectorizer: Add support for trees that don't start at binary operators, and add the cost of extracting values from the roots of the tree.

llvm-svn: 179475
```
  54b413d1