Commits · b4ea4b3751c6d48c576816130135118e7ba45769 · Roger Ferrer / llvm-epi-0.8

Dec 14, 2012
- Disable the loop vectorizer. · b4ea4b37
  Nadav Rotem authored Dec 14, 2012
```
llvm-svn: 170162
```
  b4ea4b37
- Enable the Loop Vectorizer by default for O2 and O3. Disable if-conversion by... · e5e28b48
  Nadav Rotem authored Dec 13, 2012
```
Enable the Loop Vectorizer by default for O2 and O3. Disable if-conversion by default. I plan to revert this patch later today.

llvm-svn: 170157
```
  e5e28b48
Dec 13, 2012

Revert r170020, "Simplify negated bit test", for now. · 38d2b244

NAKAMURA Takumi authored Dec 13, 2012

This assumes (1 << n) is never zero. Consider the case where n is greater than the word size.
Although I know that is undefined, this transform hides the undefined behavior.

This led to unexpected behavior in clang, with some failures. I will investigate fixing the undefined shl in clang.

llvm-svn: 170128

38d2b244
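
The concern in the revert message above can be illustrated outside of LLVM. The following is a minimal sketch, not the code from r170020: `isBitClear` is a hypothetical helper, and the choice to treat out-of-range bits as clear is an assumption made for the example. It shows why a bit test built on `1 << n` needs a guard when `n` can reach the word size.

```cpp
#include <cstdint>

// Hypothetical helper: returns true when bit `n` of `x` is clear.
// For n >= 32 the expression (1u << n) is undefined in C++, so the guard
// answers directly instead of shifting: a 32-bit value has no such bit,
// and this sketch chooses to report it as clear.
bool isBitClear(std::uint32_t x, unsigned n) {
  if (n >= 32)
    return true;
  return (x & (std::uint32_t{1} << n)) == 0;
}
```

Without the guard, a simplification that assumes the mask is nonzero can silently change results for such shift amounts.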

Revert "Restore the PHI optimization I accidentally removed" temporarily since · a1bbeeca
Eric Christopher authored Dec 13, 2012
```
it seems to be breaking self-host for a few people and is PR14592.

This reverts commit r170024.

llvm-svn: 170106
```
a1bbeeca
Missed these calls from the previous rename somehow. · a2c107e6
Rafael Espindola authored Dec 13, 2012
```
llvm-svn: 170094
```
a2c107e6

Rename isPowerOfTwo to isKnownToBeAPowerOfTwo. · 319f74cd

Rafael Espindola authored Dec 13, 2012

In a previous thread it was pointed out that isPowerOfTwo is not a very precise
name since it can return false for powers of two if it is unable to show that
they are powers of two.

llvm-svn: 170093

319f74cd
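
The naming point can be shown with a toy model. This is a sketch under stated assumptions: `ToyValue` and `toyIsKnownToBeAPowerOfTwo` are hypothetical and far simpler than the LLVM function. The result `false` means "could not prove it", not "is not a power of two".

```cpp
#include <cstdint>

// Hypothetical toy value: either a compile-time constant or an unknown
// runtime value. Not an LLVM type.
struct ToyValue {
  bool isConstant;     // true when `value` is known at analysis time
  std::uint64_t value; // meaningful only when isConstant is true
};

// Returns true only when the value is provably a power of two.
bool toyIsKnownToBeAPowerOfTwo(const ToyValue &v) {
  if (!v.isConstant)
    return false; // unknown: the runtime value may still be a power of two
  return v.value != 0 && (v.value & (v.value - 1)) == 0;
}
```

An unknown value that happens to be 8 at run time still yields `false`, which is the behavior the old name `isPowerOfTwo` did not convey.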

Pattern matching code for intrinsics. · 536cc32b

Michael Ilseman authored Dec 13, 2012

Provides m_Argument that allows matching against a CallSite's specified argument. Provides m_Intrinsic pattern that can be templatized over the intrinsic id and bind/match arguments similarly to other pattern matchers. Implementations provided for 0 to 4 arguments, though it's very simple to extend for more. Also provides example template specialization for bswap (m_BSwap) and example of code cleanup for its use.

llvm-svn: 170091

536cc32b
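
As a rough illustration of the idea, here is a toy matcher templated over an intrinsic ID that binds one argument on success. Every name below (`ToyCall`, `ToyIntrinsicMatcher`, `toyMatchBSwap`) is hypothetical and does not reproduce the signatures in LLVM's PatternMatch.h; it is a sketch of the shape described in the message, not the patch itself.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-ins for an intrinsic call; not LLVM types.
enum ToyIntrinsicID { ToyBSwap, ToyCtpop };

struct ToyCall {
  ToyIntrinsicID id;
  std::vector<int> args;
};

// Matches a call to intrinsic `ID` and, on success, binds argument `index`.
template <ToyIntrinsicID ID> struct ToyIntrinsicMatcher {
  std::size_t index; // which argument to bind
  int &bound;        // receives the argument when the match succeeds

  bool match(const ToyCall &call) const {
    if (call.id != ID || index >= call.args.size())
      return false;
    bound = call.args[index];
    return true;
  }
};

// Example specialization in the spirit of m_BSwap: bind the single operand.
inline ToyIntrinsicMatcher<ToyBSwap> toyMatchBSwap(int &operand) {
  return ToyIntrinsicMatcher<ToyBSwap>{0, operand};
}
```

A failed match leaves the bound variable untouched, so callers can test the result before reading it.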

Take into account minimize size attribute in the inliner. · c0dba203

Quentin Colombet authored Dec 13, 2012

Improves control over inlining when the caller function has the MinSize attribute.
Basically, when the caller function has this attribute, we do not "force" the inlining
of callee functions carrying the InlineHint attribute (i.e., functions defined with
the inline keyword).

llvm-svn: 170065

c0dba203
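
The rule in this message can be sketched as a threshold choice. The struct, the function names, and both threshold values below are hypothetical and chosen only for illustration; the message does not say how the real inliner weighs these attributes.

```cpp
#include <cassert>

// Hypothetical summary of the attributes the message mentions; not LLVM types.
struct ToyFunctionAttrs {
  bool minSize;    // function is marked MinSize
  bool inlineHint; // function is marked InlineHint (defined with `inline`)
};

// Hypothetical thresholds, chosen only for illustration.
const int kToyBaseThreshold = 100;
const int kToyHintThreshold = 300;

// Picks the cost threshold for one call site. The InlineHint bonus is
// skipped when the caller is MinSize, so the hint no longer pushes inlining.
int toyInlineThreshold(const ToyFunctionAttrs &caller,
                       const ToyFunctionAttrs &callee) {
  if (callee.inlineHint && !caller.minSize)
    return kToyHintThreshold;
  return kToyBaseThreshold;
}

// Inline only when the estimated callee cost fits under the threshold.
bool toyShouldInline(const ToyFunctionAttrs &caller,
                     const ToyFunctionAttrs &callee, int calleeCost) {
  assert(calleeCost >= 0 && "cost is a non-negative size estimate");
  return calleeCost <= toyInlineThreshold(caller, callee);
}
```

With these made-up numbers, a hinted callee of cost 200 is inlined into an ordinary caller but not into a MinSize caller.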

Teach the cost model about the optimization in r169904: Truncation of... · 36510f71

Nadav Rotem authored Dec 13, 2012

Teach the cost model about the optimization in r169904: Truncation of induction variables costs the same as scalar trunc. 

llvm-svn: 170051

36510f71

Typo. · e28ae30a
Chad Rosier authored Dec 13, 2012
```
llvm-svn: 170050
```
e28ae30a

Dec 12, 2012
- Restore the PHI optimization I accidentally removed · 3c814128
  Michael Ilseman authored Dec 12, 2012
```
llvm-svn: 170024
```
  3c814128
- Remove trailing whitespace · 9fc0f258
  Michael Ilseman authored Dec 12, 2012
```
llvm-svn: 170022
```
  9fc0f258
- Simplify negated bit test · 5226aa94
  David Majnemer authored Dec 12, 2012
```
llvm-svn: 170020
```
  5226aa94
- Fix indentation. · 6027bdf8
  Nadav Rotem authored Dec 12, 2012
```
llvm-svn: 170005
```
  6027bdf8
- LoopVectorizer: Use the "optsize" attribute to decide if we are allowed to... · d0bb22bb
  Nadav Rotem authored Dec 12, 2012
```
LoopVectorizer: Use the "optsize" attribute to decide if we are allowed to increase the function size.

llvm-svn: 170004
```
  d0bb22bb
- The TargetData is not used for the isPowerOfTwo determination. It has never · e4023806
  Rafael Espindola authored Dec 12, 2012
```
been used in the first place.  It simply was passed to the function and to the
recursive invocations.  Simply drop the parameter and update the callers for the
new signature.

Patch by Saleem Abdulrasool!

llvm-svn: 169988
```
  e4023806
- Improve debug info generated with enabled AddressSanitizer. · 3d43b63a
  Alexey Samsonov authored Dec 12, 2012
```
When ASan replaces <alloca instruction> with
<offset into a common large alloca>, it should also patch
llvm.dbg.declare calls and replace debug info descriptors to mark
that we've replaced alloca with a value that stores an address
of the user variable, not the user variable itself.

See PR11818 for more context.

llvm-svn: 169984
```
  3d43b63a
- Fix the ascii drawing that was ruined when I split the H and CPP · 6798a04b
  Nadav Rotem authored Dec 12, 2012
```
llvm-svn: 169955
```
  6798a04b
- fix a typo. · 4fa2e3d5
  Nadav Rotem authored Dec 12, 2012
```
llvm-svn: 169953
```
  4fa2e3d5
- LoopVectorizer: When -Os is used, vectorize only loops that don't require a... · aeb17df8
  Nadav Rotem authored Dec 12, 2012
```
LoopVectorizer: When -Os is used, vectorize only loops that don't require a tail loop. There is no testcase because I don't know of a way to initialize the loop vectorizer pass without adding an additional hidden flag.

llvm-svn: 169950
```
  aeb17df8
- - Fix a problematic way of creating an all-ones APInt. · 81b36785
  Shuxin Yang authored Dec 12, 2012
```
- Propagate "exact" bit of [l|a]shr instruction.

llvm-svn: 169942
```
  81b36785
- Remove redundant optimizations from InstCombine, instead call the appropriate... · d5787be5
  Michael Ilseman authored Dec 12, 2012
```
Remove redundant optimizations from InstCombine, instead call the appropriate functions from SimplifyInstruction

llvm-svn: 169941
```
  d5787be5
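
For the -Os entry above (aeb17df8), the "no tail loop" condition can be sketched as a divisibility check. This assumes the trip count is a known constant, and the function names are hypothetical; the vectorizer's actual test is not shown in the message.

```cpp
#include <cassert>

// A tail (scalar remainder) loop is needed when the trip count is not a
// multiple of the vector width. Assumes the trip count is known up front.
bool toyNeedsTailLoop(unsigned tripCount, unsigned vectorWidth) {
  assert(vectorWidth > 0 && "vector width must be positive");
  return tripCount % vectorWidth != 0;
}

// When optimizing for size, only accept loops that need no tail loop,
// since the tail would add a second copy of the loop body.
bool toyMayVectorize(bool optForSize, unsigned tripCount,
                     unsigned vectorWidth) {
  return !optForSize || !toyNeedsTailLoop(tripCount, vectorWidth);
}
```

A loop of 16 iterations at width 4 passes under -Os in this model, while a loop of 18 iterations does not.
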
Dec 11, 2012
- PR14574. Fix a bug in the code that calculates the mask for the converted PHIs in if-conversion. · f707bf4c
  Nadav Rotem authored Dec 11, 2012
```
llvm-svn: 169916
```
  f707bf4c
- Loop Vectorize: optimize the vectorization of trunc(induction_var). The... · e266efb7
  Nadav Rotem authored Dec 11, 2012
```
Loop Vectorize: optimize the vectorization of trunc(induction_var). The truncation is now done on scalars.

llvm-svn: 169904
```
  e266efb7
- Use an ArrayRef instead of a std::vector&. · a92da5b3
  Rafael Espindola authored Dec 11, 2012
```
llvm-svn: 169881
```
  a92da5b3
- [msan] Use explicitly aligned stores and loads with function argument shadow. · d2bd319a
  Evgeniy Stepanov authored Dec 11, 2012
```
Use explicitly aligned store and load instructions to deal with argument and
retval shadow. This matters when an argument's alignment is higher than
__msan_param_tls alignment (which is the case with __m128i).

llvm-svn: 169859
```
  d2bd319a
- Revert EVT->MVT changes, r169836-169851, due to buildbot failures. · e98b7a03
  Patrik Hagglund authored Dec 11, 2012
```
llvm-svn: 169854
```
  e98b7a03
- Change TargetLowering::getLoadExtAction to take an MVT, instead of EVT. · cbc9d4d0
  Patrik Hagglund authored Dec 11, 2012
```
llvm-svn: 169840
```
  cbc9d4d0
- Fix PR14565. Don't if-convert loops that have switch statements in them. · dbb33281
  Nadav Rotem authored Dec 11, 2012
```
llvm-svn: 169813
```
  dbb33281
Dec 10, 2012

Enable the loop vectorizer only on O2 and above. (Still disabled by default) · 36cdd826
Nadav Rotem authored Dec 10, 2012
```
llvm-svn: 169774
```
36cdd826
Split the LoopVectorizer into H and CPP. · 07df5ac1
Nadav Rotem authored Dec 10, 2012
```
llvm-svn: 169771
```
07df5ac1

Don't use a red zone for code coverage if the user specified `-mno-red-zone'. · 74f334e4

Bill Wendling authored Dec 10, 2012

The `-mno-red-zone' flag wasn't being propagated to the functions that code
coverage generates. This allowed some of them to use the red zone when that
wasn't allowed.
<rdar://problem/12843084>

llvm-svn: 169754

74f334e4

Add support for reverse induction variables. For example: · 7b5b55c1
Nadav Rotem authored Dec 10, 2012
```
while (i--)
 sum+=A[i];

llvm-svn: 169752
```
7b5b55c1
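
The fragment in the message can be placed in a complete function for reference. This is a minimal sketch: `sumReverse` and its parameters are names chosen here, and `i` plays the role of the reverse induction variable.

```cpp
#include <cassert>
#include <cstddef>

// Sums A[n-1] down to A[0]. The index `i` counts downward, which is the
// loop shape the commit message calls a reverse induction variable.
int sumReverse(const int *A, std::size_t n) {
  assert((A != nullptr || n == 0) && "non-empty input needs a valid array");
  int sum = 0;
  std::size_t i = n;
  while (i--)
    sum += A[i];
  return sum;
}
```

The loop reads `A[n-1]` first and `A[0]` last, and it does not execute at all when `n` is zero.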

Add a new visitor for walking the uses of a pointer value. · e41e7b79

Chandler Carruth authored Dec 10, 2012

This visitor provides infrastructure for recursively traversing the
use-graph of a pointer-producing instruction like an alloca or a malloc.
It maintains a worklist of uses to visit, so it can handle very deep
recursions. It automatically looks through instructions which simply
translate one pointer to another (bitcasts and GEPs). It tracks the
offset relative to the original pointer as long as that offset remains
constant and exposes it during the visit as an APInt offset. Finally, it
performs conservative escape analysis.

However, currently it has some limitations that should be addressed
going forward:
1) It doesn't handle vectors of pointers.
2) It doesn't provide a cheaper visitor when the constant offset
   tracking isn't needed.
3) It doesn't support non-instruction pointer values.

The current functionality is exactly what is required to implement the
SROA pointer-use visitors in terms of this one, rather than in terms of
their own ad-hoc base visitor, which was always very poorly specified.
SROA has been converted to use this, and the code there deleted which
this utility now provides.

Technically speaking, using this new visitor allows SROA to handle a few
more cases than it previously did. It is now more aggressive in ignoring
chains of instructions which look like they would defeat SROA, but in
fact do not because they never result in a read or write of memory.
While this is "neat", it shouldn't be interesting for real programs as
any such chains should have been removed by other passes long before we
get to SROA. As a consequence, I've not added any tests for these
features -- it shouldn't be part of SROA's contract to perform such
heroics.

The goal is to extend the functionality of this visitor going forward,
and re-use it from passes like ASan that can benefit from doing
a detailed walk of the uses of a pointer.

Thanks to Ben Kramer for the code review rounds and lots of help
reviewing and debugging this patch.

llvm-svn: 169728

e41e7b79
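
The traversal described in this message can be sketched on a toy instruction graph. All types and names below are hypothetical and are not the PtrUseVisitor API. The sketch assumes an acyclic use-graph (no PHI or select nodes), treats `ToyStore` as a store through the pointer, and conservatively marks any other kind of use as an escape.

```cpp
#include <vector>

// Hypothetical instruction kinds; not LLVM types.
enum ToyKind { ToyAlloca, ToyBitCast, ToyGEP, ToyLoad, ToyStore, ToyEscape };

struct ToyInst {
  ToyKind kind;
  long gepOffset;         // constant byte offset, used for ToyGEP only
  std::vector<int> users; // indices of instructions that use this one
};

struct ToyAccess {
  int inst;    // index of the load or store
  long offset; // byte offset from the root pointer
};

struct ToyVisitResult {
  std::vector<ToyAccess> accesses;
  bool escaped;
};

// Walks all transitive uses of insts[root] with an explicit worklist, so
// deep use chains do not consume call stack.
ToyVisitResult toyVisitPtrUses(const std::vector<ToyInst> &insts, int root) {
  ToyVisitResult result{{}, false};
  struct Item { int inst; long offset; };
  std::vector<Item> worklist;
  for (int u : insts[root].users)
    worklist.push_back({u, 0});

  while (!worklist.empty()) {
    Item item = worklist.back(); // pop uses off the back of the queue
    worklist.pop_back();
    const ToyInst &I = insts[item.inst];
    switch (I.kind) {
    case ToyBitCast:
    case ToyGEP: {
      // Look through pointer-to-pointer translations, tracking the offset.
      long off = item.offset + (I.kind == ToyGEP ? I.gepOffset : 0);
      for (int u : I.users)
        worklist.push_back({u, off});
      break;
    }
    case ToyLoad:
    case ToyStore:
      result.accesses.push_back({item.inst, item.offset});
      break;
    default:
      result.escaped = true; // conservative: unknown use may leak the pointer
      break;
    }
  }
  return result;
}
```

Each recorded access carries the accumulated constant offset, which is the kind of information a pass like SROA needs in order to partition an alloca.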

Fix PR14548: SROA was crashing on a mixture of i1 and i8 loads and stores. · e45f4658

Chandler Carruth authored Dec 10, 2012

When SROA was evaluating a mixture of i1 and i8 loads and stores, in
just a particular case, it would tickle a latent bug where we compared
bits to bytes rather than bits to bits. As a consequence of the latent
bug, we would allow integers through which were not byte-size multiples,
a situation the later rewriting code was never intended to handle.

In release builds this could trigger all manner of oddities, but the
reported issue in PR14548 was forming invalid bitcast instructions.

The only downside of this fix is that it makes it more clear that SROA
in its current form is not capable of handling mixed i1 and i8 loads and
stores. Sometimes with the previous code this would work by luck, but
usually it would crash, so I'm not terribly worried. I'll watch the LNT
numbers just to be sure.

llvm-svn: 169719

e45f4658
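
The class of bug described here, comparing a size in bits against a size in bytes, can be shown in isolation. The functions below are hypothetical and are not the SROA code; the message does not show the actual comparison, so this only illustrates how an i1 can slip through a unit mismatch.

```cpp
// Illustrative mistake: compares a width in bits with a size in bytes.
// A 1-bit integer "matches" a 1-byte slice because 1 == 1.
bool toyBuggyCoversSlice(unsigned bitWidth, unsigned sliceBytes) {
  return bitWidth == sliceBytes;
}

// Compares bits to bits, and rejects widths that are not whole bytes.
bool toyCoversSlice(unsigned bitWidth, unsigned sliceBytes) {
  return bitWidth % 8 == 0 && bitWidth == sliceBytes * 8;
}
```

With the units aligned, an i8 covers a 1-byte slice and an i1 does not.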

Dec 09, 2012

LoopVectorize: support vectorizing intrinsic calls · 2adb13c1

Paul Redmond authored Dec 09, 2012

- added function to VectorTargetTransformInfo to query cost of intrinsics
- vectorize trivially vectorizable intrinsic calls such as sin, cos, log, etc.

Reviewed by: Nadav

llvm-svn: 169711

2adb13c1
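
A loop of the following shape is the kind of candidate this change targets: each iteration calls a math function on one element and is independent of the others. `applySin` is a name chosen for this sketch, and whether a given compiler version and target actually vectorizes it is not guaranteed.

```cpp
#include <cmath>
#include <cstddef>

// Applies sin element-wise. There are no cross-iteration dependences, so
// the call is the only obstacle to processing several elements at once.
void applySin(const float *in, float *out, std::size_t n) {
  for (std::size_t i = 0; i < n; ++i)
    out[i] = std::sin(in[i]);
}
```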

test commit. · f7cd6b39
Paul Redmond authored Dec 09, 2012
```
llvm-svn: 169709
```
f7cd6b39
Use m_OneUse pattern instead of hasOneUse() method. · 8432185e
Jakub Staszak authored Dec 09, 2012
```
No functionality change.

llvm-svn: 169703
```
8432185e
Remove trailing spaces. · 538e3861
Jakub Staszak authored Dec 09, 2012
```
llvm-svn: 169701
```
538e3861

Switch SROA to pop Uses off the back of its visitors' queues. · 93ff2447

Chandler Carruth authored Dec 09, 2012

This will more closely match the behavior of the new PtrUseVisitor that
I am adding. Hopefully this will not change the actual behavior in any
way, but making the processing order more similar should help in debugging.

llvm-svn: 169697

93ff2447