Commits · 3b2b1e794242b96121cc696dc72d35d911bc82f6 · Roger Ferrer / llvm-epi-0.8

Oct 01, 2010

Dale Johannesen authored Sep 30, 2010

The x86_mmx type is used for MMX intrinsics, parameters and
return values where these use MMX registers, and is also
supported in load, store, and bitcast.

Only the above operations generate MMX instructions, and optimizations
do not operate on or produce MMX intrinsics. 

MMX-sized vectors <2 x i32> etc. are lowered to XMM or split into
smaller pieces.  Optimizations may occur on these forms and the
result casted back to x86_mmx, provided the result feeds into a
previous existing x86_mmx operation.

The point of all this is prevent optimizations from introducing
MMX operations, which is unsafe due to the EMMS problem.

llvm-svn: 115243

dd224d23

Sep 30, 2010
- We do want to allow LoadPRE to perform LICM-like transformations: we already... · 3170a25a
  Owen Anderson authored Sep 30, 2010
```
We do want to allow LoadPRE to perform LICM-like transformations: we already consider PHI nodes to be negligible for
code size (making this transform code size neutral), and it allows us to hoist values out of loops, which is always
a good thing.

llvm-svn: 115205
```
  3170a25a
- Try again to disable critical edge splitting in CodeGenPrepare. · eb12f49f
  Jakob Stoklund Olesen authored Sep 30, 2010
```
The bug that broke i386 linux has been fixed in r115191.

llvm-svn: 115204
```
  eb12f49f
- Tighten up prototype verification of strchr and strrchr to avoid a crash in... · 5d66e5fe
  Benjamin Kramer authored Sep 30, 2010
```
Tighten up prototype verification of strchr and strrchr to avoid a crash in the very unlikely case that someone passes an integer > i64 to strchr.

llvm-svn: 115144
```
  5d66e5fe
- Add constant folding for strspn and strcspn to SimplifyLibCalls. · 2b76c66f
  Benjamin Kramer authored Sep 30, 2010
```
llvm-svn: 115116
```
  2b76c66f
- Add strpbrk folding to SimplifyLibCalls. · 38d22f69
  Benjamin Kramer authored Sep 29, 2010
```
llvm-svn: 115111
```
  38d22f69
- Simplify the loop in StrChrOptimizer. FileCheckize test. · 8e861d7e
  Benjamin Kramer authored Sep 29, 2010
```
llvm-svn: 115095
```
  8e861d7e
Sep 29, 2010

Teach SimplifyLibCalls how to optimize strrchr. · 824645ab
Benjamin Kramer authored Sep 29, 2010
```
llvm-svn: 115091
```
824645ab

Fix PR8247: JumpThreading can cause a block to become unreachable while still... · 99c985c3

Owen Anderson authored Sep 29, 2010

Fix PR8247: JumpThreading can cause a block to become unreachable while still having predecessor, if it is part of a self-loop.
Because of this, we cannot use the Simplify* APIs, as they can assert-fail on unreachable code. Since it's not easy to determine
if a given threading will cause a block to become unreachable, simply defer simplifying simplification to later InstCombine and/or
DCE passes.

llvm-svn: 115082

99c985c3

Revert r114919, which caused some serious regressions on ARM. · d67ca0ed
Owen Anderson authored Sep 29, 2010
```
llvm-svn: 115053
```
d67ca0ed
Removed a bunch of unnecessary target_link_libraries. · b4b12535
Oscar Fuentes authored Sep 28, 2010
```
llvm-svn: 114999
```
b4b12535

Sep 28, 2010

Weight loop unrolling counts by nesting depth. Unrolling deeply nested loops tends to cause high · 9c93fd55

Owen Anderson authored Sep 27, 2010

register pressure and thus excess spills, which we don't currently recover from well.  This should
be re-evaluated in the future if our ability to generate good spills/splits improves.

Partial fix for <rdar://problem/7635585>.

llvm-svn: 114919

9c93fd55

Sep 27, 2010

Revert "Disable codegen prepare critical edge splitting. Machine instruction passes now" · 415a7a6f

Jakob Stoklund Olesen authored Sep 27, 2010

This reverts revision 114633. It was breaking llvm-gcc-i386-linux-selfhost.

It seems there is a downstream bug that is exposed by
-cgp-critical-edge-splitting=0. When that bug is fixed, this patch can go back
in.

Note that the changes to tailcallfp2.ll are not reverted. They were good are
required.

llvm-svn: 114859

415a7a6f

Delete an unused function. · 16ef4968
Dan Gohman authored Sep 27, 2010
```
llvm-svn: 114841
```
16ef4968

Sep 25, 2010

LoadPRE was not properly checking that the load it was PRE'ing post-dominated... · b590a927

Owen Anderson authored Sep 25, 2010

LoadPRE was not properly checking that the load it was PRE'ing post-dominated the block it was being hoisted to.
Splitting critical edges at the merge point only addressed part of the issue; it is also possible for non-post-domination
to occur when the path from the load to the merge has branches in it. Unfortunately, full anticipation analysis is
time-consuming, so for now approximate it. This is strictly more conservative than real anticipation, so we will miss
some cases that real PRE would allow, but we also no longer insert loads into paths where they didn't exist before. :-)

This is a very slight net positive on SPEC for me (0.5% on average). Most of the benchmarks are largely unaffected, but
when it pays off it pays off decently: 181.mcf improves by 4.5% on my machine.

llvm-svn: 114785

b590a927

If we're changing the source of a memcpy we need to use the alignment · ebacd2b0
Eric Christopher authored Sep 25, 2010
```
of the source, not the original alignment since it may no longer
be valid.

Fixes rdar://8400094

llvm-svn: 114781
```
ebacd2b0

Sep 23, 2010
- Disable codegen prepare critical edge splitting. Machine instruction passes now · 794aaa79
  Evan Cheng authored Sep 23, 2010
```
break critical edges on demand.

llvm-svn: 114633
```
  794aaa79
Sep 22, 2010

When moving zext/sext to be folded with a load, ignore the issue of whether · b6832a43

Bob Wilson authored Sep 22, 2010

truncates are free only in the case where the extended type is legal but the
load type is not.  If both types are illegal, such as when they are too big,
the load may not be legalized into an extended load.

llvm-svn: 114568

b6832a43

Sep 21, 2010
- Move a sign-extend or a zero-extend of a load to the same basic block as the · 4ddcb6a6
  Bob Wilson authored Sep 21, 2010
```
load when the type of the load is not legal, even if truncates are not free.
The load is going to be legalized to an extending load anyway.

llvm-svn: 114488
```
  4ddcb6a6
- Clarify a comment. · ff714f99
  Bob Wilson authored Sep 21, 2010
```
llvm-svn: 114487
```
  ff714f99
Sep 18, 2010
- do not rely on the implicit-dereference semantics of dyn_cast_or_null · a06741b3
  Gabor Greif authored Sep 18, 2010
```
llvm-svn: 114278
```
  a06741b3
- do not rely on the implicit-dereference semantics of dyn_cast_or_null · aaa22cf1
  Gabor Greif authored Sep 18, 2010
```
llvm-svn: 114277
```
  aaa22cf1
Sep 16, 2010

Use a depth-first iteratation in CorrelatedValuePropagation to avoid wasting time trying · d1048065
Owen Anderson authored Sep 16, 2010
```
to optimize unreachable blocks.

llvm-svn: 114105
```
d1048065

When substituting sunkaddrs into indirect arguments an asm, we were · f95f59a0

Dale Johannesen authored Sep 16, 2010

walking the asm arguments once and stashing their Values.  This is
wrong because the same memory location can be in the list twice, and
if the first one has a sunkaddr substituted, the stashed value for the
second one will be wrong (use-after-free).  PR 8154.

llvm-svn: 114104

f95f59a0

Sep 14, 2010
- Remove the option to disable LazyValueInfo in JumpThreading, as it is now · d361aac3
  Owen Anderson authored Sep 14, 2010
```
on by default and has received significant testing.

llvm-svn: 113852
```
  d361aac3
- fix PR8102, a case where we'd copyValue from a value that we already · f1144f09
  Chris Lattner authored Sep 14, 2010
```
deleted.  Fix this by doing the copyValue's before we delete stuff!

The testcase only repros the problem on my system with valgrind.

llvm-svn: 113820
```
  f1144f09
- Revert "CMake: Get rid of LLVMLibDeps.cmake and export the libraries normally." · 93c9b2ea
  Michael J. Spencer authored Sep 13, 2010
```
This reverts commit r113632

Conflicts:

	cmake/modules/AddLLVM.cmake

llvm-svn: 113819
```
  93c9b2ea
Sep 13, 2010
- Remove unused variable. · e3a89f9f
  Eric Christopher authored Sep 13, 2010
```
llvm-svn: 113769
```
  e3a89f9f
- Added skeleton for inline asm multiple alternative constraint support. · 1094c802
  John Thompson authored Sep 13, 2010
```
llvm-svn: 113766
```
  1094c802
Sep 11, 2010
- typoes · 2f5f696b
  Gabor Greif authored Sep 10, 2010
```
llvm-svn: 113647
```
  2f5f696b
Sep 10, 2010
- CMake: Get rid of LLVMLibDeps.cmake and export the libraries normally. · dc38d36c
  Michael J. Spencer authored Sep 10, 2010
```
llvm-svn: 113632
```
  dc38d36c
- Lower the unrolling theshold to 150. Empirical tests indicate that this is a... · d85c9ccd
  Owen Anderson authored Sep 10, 2010
```
Lower the unrolling theshold to 150.  Empirical tests indicate that this is a sweet spot in the performance per
code size increase curve.

llvm-svn: 113595
```
  d85c9ccd
Sep 09, 2010

What the loop unroller cares about, rather than just not unrolling loops with calls, is · 04cf3fd7

Owen Anderson authored Sep 09, 2010

not unrolling loops that contain calls that would be better off getting inlined. This mostly
comes up when an interleaved devirtualization pass has devirtualized a call which the inliner
will inline on a future pass. Thus, rather than blocking all loops containing calls, add
a metric for "inline candidate calls" and block loops containing those instead.

llvm-svn: 113535

04cf3fd7

Revert r113439, which relaxed the requirement that loops containing calls... · 62705159

Owen Anderson authored Sep 09, 2010

Revert r113439, which relaxed the requirement that loops containing calls cannot be unrolled. After some discussion,
there seems to be a better way to achieve the same effect.

llvm-svn: 113528

62705159

r113526 introduced an unintended change to the loop unrolling threshold. Revert it. · 11ab204f
Owen Anderson authored Sep 09, 2010
```
llvm-svn: 113527
```
11ab204f
Fix typo in code to cap the loop code size reduction calculation. · b61b1647
Owen Anderson authored Sep 09, 2010
```
llvm-svn: 113526
```
b61b1647
Use code-size reduction metrics to estimate the amount of savings we'll get when we unroll a loop. · 62ea1b71
Owen Anderson authored Sep 09, 2010
```
Next step is to recalculate the threshold values given this new heuristic.

llvm-svn: 113525
```
62ea1b71

Relax the "don't unroll loops containing calls" rule. Instead, when a loop... · 8084dbaf

Owen Anderson authored Sep 08, 2010

Relax the "don't unroll loops containing calls" rule. Instead, when a loop contains a call, lower the
unrolling threshold to the optimize-for-size threshold. Basically, for loops containing calls, unrolling
can still be profitable as long as the loop is REALLY small.

llvm-svn: 113439

8084dbaf

Sep 08, 2010

Add a separate unrolling threshold when the current function is being optimized for size. · a4d9c78a

Owen Anderson authored Sep 07, 2010

The threshold value of 50 is arbitrary, and I chose it simply by analogy to the inlining thresholds, where
the baseline unrolling threshold is slightly smaller than the baseline inlining threshold. This could
undoubtedly use some tuning.

llvm-svn: 113306

a4d9c78a

Sep 06, 2010
- fix PR8067, an over-aggressive assertion in LICM. · be901909
  Chris Lattner authored Sep 06, 2010
```
llvm-svn: 113146
```
  be901909