Commits · 726bc70c438591ffe73e1e653aac6f9bcabe7d68 · Roger Ferrer / llvm-epi-0.8

Sep 11, 2006
- Fix PR905 and InstCombine/2006-09-11-EmptyStructCrash.ll · d2862700
  Chris Lattner authored Sep 11, 2006
```
llvm-svn: 30266
```
  d2862700
- Skip the linear search if the answer is already known. · e94f42a7
  Nick Lewycky authored Sep 11, 2006
```
llvm-svn: 30251
```
  e94f42a7
Sep 10, 2006
- Allow tail duplication in more cases, relaxing the previous restriction a · d1f8e078
  Chris Lattner authored Sep 10, 2006
```
bit.  This fixes Regression/Transforms/TailDup/MergeTest.ll

llvm-svn: 30237
```
  d1f8e078
- Replace EquivalenceClasses with a custom-built data structure. Many common · 9a22d7b6
  Nick Lewycky authored Sep 10, 2006
```
operations (like findProperties) should be faster, at the expense of
unionSets being slower in cases that are rare in practise.

Don't erase a dead Instruction. This fixes a memory corruption issue.

llvm-svn: 30235
```
  9a22d7b6
- Implement Transforms/InstCombine/hoist_instr.ll · 04689875
  Chris Lattner authored Sep 09, 2006
```
llvm-svn: 30234
```
  04689875
Sep 09, 2006
- Make inlining costs more accurate. · 27ff96d8
  Chris Lattner authored Sep 09, 2006
```
llvm-svn: 30231
```
  27ff96d8
- Turn div X, (Cond ? Y : 0) -> div X, Y · d79dc798
  Chris Lattner authored Sep 09, 2006
```
This implements select.ll::test18.

llvm-svn: 30230
```
  d79dc798
Sep 07, 2006

Throttle back tail duplication to avoid creating really ugly sequences of code. · c465046e

Chris Lattner authored Sep 07, 2006

For Transforms/TailDup/if-tail-dup.ll, f.e., it produces:

_foo:
        movl 8(%esp), %eax
        movl 4(%esp), %ecx
        testl $1, %ecx
        je LBB1_2       #cond_next
LBB1_1: #cond_true
        movl $1, (%eax)
LBB1_2: #cond_next
        testl $2, %ecx
        je LBB1_4       #cond_next10
LBB1_3: #cond_true6
        movl $1, 4(%eax)
LBB1_4: #cond_next10
        testl $4, %ecx
        je LBB1_6       #cond_next18
LBB1_5: #cond_true14
        movl $1, 8(%eax)
LBB1_6: #cond_next18
        testl $8, %ecx
        je LBB1_8       #return
LBB1_7: #cond_true22
        movl $1, 12(%eax)
        ret
LBB1_8: #return
        ret

instead of:

_foo:
        movl 4(%esp), %eax
        testl $2, %eax
        sete %cl
        movl 8(%esp), %edx
        testl $1, %eax
        je LBB1_2       #cond_next
LBB1_1: #cond_true
        movl $1, (%edx)
        testb %cl, %cl
        jne LBB1_4      #cond_next10
        jmp LBB1_3      #cond_true6
LBB1_2: #cond_next
        testb %cl, %cl
        jne LBB1_4      #cond_next10
LBB1_3: #cond_true6
        movl $1, 4(%edx)
        testl $4, %eax
        je LBB1_6       #cond_next18
        jmp LBB1_5      #cond_true14
LBB1_4: #cond_next10
        testl $4, %eax
        je LBB1_6       #cond_next18
LBB1_5: #cond_true14
        movl $1, 8(%edx)
        testl $8, %eax
        je LBB1_8       #return
        jmp LBB1_7      #cond_true22
LBB1_6: #cond_next18
        testl $8, %eax
        je LBB1_8       #return
LBB1_7: #cond_true22
        movl $1, 12(%edx)
        ret
LBB1_8: #return
        ret

llvm-svn: 30158

c465046e

Sep 05, 2006
- Fix Duraid's changes to work when TLI is null. This fixes the failing · 845b223d
  Chris Lattner authored Sep 05, 2006
```
lowerinvoke regtests.

llvm-svn: 30115
```
  845b223d
Sep 04, 2006

add setJumpBufSize() and setJumpBufAlignment() to target-lowering. · cf6749e4

Duraid Madina authored Sep 04, 2006

Call these from your backend to enjoy setjmp/longjmp goodness, see
lib/Target/IA64/IA64ISelLowering.cpp for an example

llvm-svn: 30095

cf6749e4

Sep 02, 2006
- Make ArgumentPromotion handle recursive functions that pass pointers in their recursive calls. · 19b80e76
  Owen Anderson authored Sep 02, 2006
```
llvm-svn: 30057
```
  19b80e76
- Improve handling of SelectInst. · 8e559935
  Nick Lewycky authored Sep 02, 2006
```
Reorder operations to remove duplicated work.
Fix to leave floating-point types out of the optimization.
Add tests to predsimplify.ll for SwitchInst and SelectInst handling.

llvm-svn: 30055
```
  8e559935
Sep 01, 2006
- Don't confuse canonicalize and lookup. Fixes predsimplify.reg4.ll. Also · f6f529d0
  Nick Lewycky authored Sep 01, 2006
```
corrects missing optimization opportunity removing cases from a switch.

llvm-svn: 30009
```
  f6f529d0
Aug 31, 2006

Properties where both Values weren't in the union (as being equal to · 08674ab7

Nick Lewycky authored Aug 31, 2006

another Value) weren't being found by findProperties.

This fixes predsimplify.ll test6, a missed optimization opportunity.

llvm-svn: 29991

08674ab7

Aug 30, 2006

Move to using the EquivalenceClass ADT. Removes SynSets. · 5f8f9af6

Nick Lewycky authored Aug 30, 2006

If a branch's condition has become a ConstantBool, simplify it immediately.
Removing the edge saves work and exposes up more optimization opportunities
in the pass.
Add support for SelectInst.

llvm-svn: 29970

5f8f9af6

Do not rely on std::sort and std::erase to get list of unique · f489d0f8

Devang Patel authored Aug 29, 2006

exit blocks. The output is dependent on addresses of basic block.

Add and use Loop::getUniqueExitBlocks.

llvm-svn: 29966

f489d0f8

Aug 29, 2006
- Clean up a bit. · a8a2e5c6
  Owen Anderson authored Aug 29, 2006
```
llvm-svn: 29950
```
  a8a2e5c6
- Add PredicateSimplifier pass. Collapses equal variables into one form · b2e8ae17
  Nick Lewycky authored Aug 28, 2006
```
and simplifies expressions. This implements the optimization described
in PR807.

llvm-svn: 29947
```
  b2e8ae17
Aug 28, 2006
- Make LoopUnroll fold excessive BasicBlocks. This results in a significant speedup of · 62c84fe3
  Owen Anderson authored Aug 28, 2006
```
gccas on 252.eon

llvm-svn: 29936
```
  62c84fe3
- simplify AnalysisGroup registration, eliminating one typeid call. · 97c9f20c
  Chris Lattner authored Aug 28, 2006
```
llvm-svn: 29932
```
  97c9f20c
- eliminate RegisterOpt. It does the same thing as RegisterPass. · c2d3d311
  Chris Lattner authored Aug 27, 2006
```
llvm-svn: 29925
```
  c2d3d311
Aug 27, 2006
- s|llvm/Support/Visibility.h|llvm/Support/Compiler.h| · 3d27be13
  Chris Lattner authored Aug 27, 2006
```
llvm-svn: 29911
```
  3d27be13
Aug 26, 2006
- Fix a crash related to updating Phi nodes in the original header block. This was · 403b95af
  Owen Anderson authored Aug 25, 2006
```
causing a crash in 175.vpr

llvm-svn: 29887
```
  403b95af
- Add an assertion to check that we're really preserving LCSSA. · 8e4b0295
  Owen Anderson authored Aug 25, 2006
```
llvm-svn: 29886
```
  8e4b0295
Aug 25, 2006
- Reapply the indvars patch, since nothing blew up last night. · 8cca95cf
  Owen Anderson authored Aug 25, 2006
```
llvm-svn: 29874
```
  8cca95cf
- Revert my previous patch. Since there are some major changes that went in today, · 94446a42
  Owen Anderson authored Aug 25, 2006
```
I'm going to wait to put this in HEAD until tomorrow, so as not to clutter the nightly
tester.

llvm-svn: 29868
```
  94446a42
- Specify that indvars actually preserve LCSSA. This has been done for a while, but I · 15a64234
  Owen Anderson authored Aug 25, 2006
```
forgot to put in the analysis usage.

llvm-svn: 29867
```
  15a64234
Aug 24, 2006
- Implement unrolling of multiblock loops. This significantly improves the · e001d811
  Owen Anderson authored Aug 24, 2006
```
utility of the LoopUnroll pass.

Also, add a testcase for multiblock-loop unrolling.

llvm-svn: 29859
```
  e001d811
Aug 18, 2006
- Fix a grammaro in a comment. · 5495fe8d
  Reid Spencer authored Aug 18, 2006
```
llvm-svn: 29765
```
  5495fe8d
Aug 14, 2006
- Handle single-entry PHI nodes correctly. This fixes PR877 and · 6441cf93
  Chris Lattner authored Aug 14, 2006
```
Transforms/CondProp/2006-08-14-SingleEntryPhiCrash.ll

llvm-svn: 29673
```
  6441cf93
Aug 12, 2006

Don't attempt to split subloops out of a loop with a huge number of backedges. · f18b396c

Chris Lattner authored Aug 12, 2006

Not only will this take huge amounts of compile time, the resultant loop nests
won't be useful for optimization.  This reduces loopsimplify time on
Transforms/LoopSimplify/2006-08-11-LoopSimplifyLongTime.ll from ~32s to ~0.4s
with a debug build of llvm on a 2.7Ghz G5.

llvm-svn: 29647

f18b396c

Reimplement the loopsimplify code which deletes edges from unreachable · 85d9944f

Chris Lattner authored Aug 12, 2006

blocks that target loop blocks.

Before, the code was run once per loop, and depended on the number of
predecessors each block in the loop had.  Unfortunately, scanning preds can
be really slow when huge numbers of phis exist or when phis with huge numbers
of inputs exist.

Now, the code is run once per function and scans successors instead of preds,
which is far faster.  In addition, the new code is simpler and is goto free,
woo.

This change speeds up a nasty testcase Duraid provided me from taking hours to
taking ~72s with a debug build.  The functionality this implements is already
tested in the testsuite as Transforms/CodeExtractor/2004-03-13-LoopExtractorCrash.ll.

llvm-svn: 29644

85d9944f

Aug 08, 2006

Make this example pass use some things from lib/Support (EscapeString, · 2b6d18a6

Reid Spencer authored Aug 07, 2006

SlowOperatingInfo, Statistics). Besides providing an example of how to
use these facilities, it also serves to debug problems with runtime linking
when dlopening a loadable module. These three support facilities exercise
different combinations of Text/Weak Weak/Text and Text/Text linking
between the executable and the module.

llvm-svn: 29552

2b6d18a6

For PR780: · e6458c3f

Reid Spencer authored Aug 07, 2006

1. Change the usage of LOADABLE_MODULE so that it implies all the things
   necessary to make a loadable module. This reduces the user's burdern to
   get a loadable module correctly built.
2. Document the usage of LOADABLE_MODULE in the MakefileGuide
3. Adjust the makefile for lib/Transforms/Hello to use the new specification
   for building loadable modules
4. Adjust the sample project to not attempt to build a shared library for
   its little library. This was just wasteful and not instructive at all.

llvm-svn: 29551

e6458c3f

Aug 03, 2006

Fix PR867 (and maybe 868) and testcsae: · c9009d91
Chris Lattner authored Aug 03, 2006
```
Transforms/SimplifyCFG/2006-08-03-Crash.ll

llvm-svn: 29515
```
c9009d91

· 3ff62017

Chris Lattner authored Aug 03, 2006

Changes:
  1. Update an obsolete comment.
  2. Make the sorting by base an explicit (though still N^2) step, so
     that the code is more clear on what it is doing.
  3. Partition uses so that uses inside the loop are handled before uses
     outside the loop.

Note that none of these changes currently changes the code inserted by LSR,
but they are a stepping stone to getting there.

This code is the result of some crazy pair programming with Nate. :)

llvm-svn: 29493

3ff62017

Aug 02, 2006

Add special check to avoid isLoop call. Simple, but doesn't seem to speed · 38b6e838
Chris Lattner authored Aug 02, 2006
```
up lcssa much in practice.

llvm-svn: 29465
```
38b6e838

Replace the SSA update code in LCSSA with a bottom-up approach instead of a top · 5a2bc786

Chris Lattner authored Aug 02, 2006

down approach, inspired by discussions with Tanya.

This approach is significantly faster, because it does not need dominator
frontiers and it does not insert extraneous unused PHI nodes. For example, on
252.eon, in a release-asserts build, this speeds up LCSSA (which is the slowest
pass in gccas) from 9.14s to 0.74s on my G5. This code is also slightly smaller
and significantly simpler than the old code.

Amusingly, in a normal Release build (which includes the
"assert(L->isLCSSAForm());" assertion), asserting that the result of LCSSA
is in LCSSA form is actually slower than the LCSSA transformation pass
itself on 252.eon. I will see if Loop::isLCSSAForm can be sped up next.

llvm-svn: 29463

5a2bc786

Jul 27, 2006
- Add some advice · 85ea83e8
  Chris Lattner authored Jul 27, 2006
```
llvm-svn: 29324
```
  85ea83e8
Jul 20, 2006
- Minor comment tweaks · 1b928478
  Chris Lattner authored Jul 20, 2006
```
llvm-svn: 29226
```
  1b928478