Commits · 339ced4e347ba1ee124afe762e1127a64bf4d64c · Roger Ferrer / llvm-epi-0.8

Dec 19, 2011

Add a if-conversion optimization that allows 'true' side of a diamond to be · 4266a793

Evan Cheng authored Dec 19, 2011

unpredicated. That is, turn
 subeq  r0, r1, #1
 addne  r0, r1, #1                                                                                                                                                                                                     
into
 sub    r0, r1, #1
 addne  r0, r1, #1

For targets where conditional instructions are always executed, this may be
beneficial. It may remove pseudo anti-dependency in out-of-order execution
CPUs. e.g.
 op    r1, ...
 str   r1, [r10]        ; end-of-life of r1 as div result
 cmp   r0, #65
 movne r1, #44  ; raw dependency on previous r1
 moveq r1, #12

If movne is unpredicated, then
 op    r1, ...
 str   r1, [r10]
 cmp   r0, #65
 mov   r1, #44  ; r1 written unconditionally
 moveq r1, #12

Both mov and moveq are no longer depdendent on the first instruction. This gives
the out-of-order execution engine more freedom to reorder them.

This has passed entire LLVM test suite. But it has not been enabled for any ARM
variant pending more performance evaluation.

rdar://8951196

llvm-svn: 146914

4266a793

Dec 07, 2011

Add bundle aware API for querying instruction properties and switch the code · 7f8e563a

Evan Cheng authored Dec 07, 2011

generator to it. For non-bundle instructions, these behave exactly the same
as the MC layer API.

For properties like mayLoad / mayStore, look into the bundle and if any of the
bundled instructions has the property it would return true.
For properties like isPredicable, only return true if *all* of the bundled
instructions have the property.
For properties like canFoldAsLoad, isCompare, conservatively return false for
bundles.

llvm-svn: 146026

7f8e563a

Nov 05, 2011
- Added missing &. Fixes <rdar://problem/10393723> · 77c703f1
  Pete Cooper authored Nov 04, 2011
```
llvm-svn: 143753
```
  77c703f1
Aug 04, 2011
- Fix typo in #include which revealed in the case-sensitive filesystem. · 3ef20e35
  Jakub Staszak authored Aug 03, 2011
```
llvm-svn: 136828
```
  3ef20e35
- Use MachineBranchProbabilityInfo in If-Conversion instead of its own heuristics. · 15e5b742
  Jakub Staszak authored Aug 03, 2011
```
llvm-svn: 136826
```
  15e5b742
Jul 22, 2011
- Revert patch which broke some IfConversion tests. · 7987ea74
  Jakub Staszak authored Jul 22, 2011
```
llvm-svn: 135738
```
  7987ea74
- Fix typo in #include which revealed in the case-sensitive filesystem. · 76d71158
  Jakub Staszak authored Jul 22, 2011
```
llvm-svn: 135734
```
  76d71158
- Use MachineBranchProbabilityInfo instead of MachineLoopInfo in IfConversion. · 44860314
  Jakub Staszak authored Jul 21, 2011
```
llvm-svn: 135724
```
  44860314
Jul 10, 2011
- Use BranchProbability instead of floating points in IfConverter. · 9b07c0ab
  Jakub Staszak authored Jul 10, 2011
```
llvm-svn: 134858
```
  9b07c0ab
- Don't analyze block if it's not considered for ifcvt anymore. · a4a18f09
  Jakub Staszak authored Jul 10, 2011
```
llvm-svn: 134856
```
  a4a18f09
Jun 29, 2011
- Sink SubtargetFeature and TargetInstrItineraries (renamed MCInstrItineraries) into MC. · 8264e272
  Evan Cheng authored Jun 29, 2011
```
llvm-svn: 134049
```
  8264e272
Jun 28, 2011

- Rename TargetInstrDesc, TargetOperandInfo to MCInstrDesc and MCOperandInfo and · 6cc775f9

Evan Cheng authored Jun 28, 2011

sink them into MC layer.
- Added MCInstrInfo, which captures the tablegen generated static data. Chang
TargetInstrInfo so it's based off MCInstrInfo.

llvm-svn: 134021

6cc775f9

May 12, 2011

Re-commit 131172 with fix. MachineInstr identity checks should check dead · cfdf3390

Evan Cheng authored May 12, 2011

markers. In some cases a register def is dead on one path, but not on
another.

This is passing Clang self-hosting.

llvm-svn: 131214

cfdf3390

May 11, 2011

Revert 131172 as it is causing clang to miscompile itself. I will try · 2a09d659
Rafael Espindola authored May 11, 2011
```
to provide a reduced testcase.

llvm-svn: 131176
```
2a09d659

Add a late optimization to BranchFolding that hoist common instruction sequences · 05fc35e2

Evan Cheng authored May 11, 2011

at the start of basic blocks to their common predecessor. It's actually quite
common (e.g. about 50 times in JM/lencod) and has shown to be a nice code size
benefit. e.g.

        pushq   %rax
        testl   %edi, %edi
        jne     LBB0_2
## BB#1:
        xorb    %al, %al
        popq    %rdx
        ret
LBB0_2:
        xorb    %al, %al
        callq   _foo
        popq    %rdx
        ret

=>

        pushq   %rax
        xorb    %al, %al
        testl   %edi, %edi
        je      LBB0_2
## BB#1:
        callq   _foo
LBB0_2:
        popq    %rdx
        ret

rdar://9145558

llvm-svn: 131172

05fc35e2

Apr 27, 2011

If converter was being too cute. It look for root BBs (which don't have · 9808d31b

Evan Cheng authored Apr 27, 2011

successors) and use inverse depth first search to traverse the BBs. However
that doesn't work when the CFG has infinite loops. Simply do a linear
traversal of all BBs work just fine.

rdar://9344645

llvm-svn: 130324

9808d31b

Nov 06, 2010
- Prune includes. · 63abc846
  Benjamin Kramer authored Nov 06, 2010
```
llvm-svn: 118342
```
  63abc846
Nov 03, 2010

Two sets of changes. Sorry they are intermingled. · debf9c50

Evan Cheng authored Nov 03, 2010

1. Fix pre-ra scheduler so it doesn't try to push instructions above calls to
   "optimize for latency". Call instructions don't have the right latency and
   this is more likely to use introduce spills.
2. Fix if-converter cost function. For ARM, it should use instruction latencies,
   not # of micro-ops since multi-latency instructions is completely executed
   even when the predicate is false. Also, some instruction will be "slower"
   when they are predicated due to the register def becoming implicit input.
   rdar://8598427

llvm-svn: 118135

debf9c50

Oct 26, 2010

When the "true" and "false" blocks of a diamond if-conversion are the same, · e1961fe2

Bob Wilson authored Oct 26, 2010

do not double-count the duplicate instructions by counting once from the
beginning and again from the end. Keep track of where the duplicates from
the beginning ended and don't go past that point when counting duplicates
at the end. Radar 8589805.

This change causes one of the MC/ARM/simple-fp-encoding tests to produce
different (better!) code without the vmovne instruction being tested.
I changed the test to produce vmovne and vmoveq instructions but moving
between register files in the opposite direction. That's not quite the same
but predicated versions of those instructions weren't being tested before,
so at least the test coverage is not any worse, just different.

llvm-svn: 117333

e1961fe2

Change if-conversion to keep track of the extra cost due to microcoded · efd360c5

Bob Wilson authored Oct 26, 2010

instructions separately from the count of non-predicated instructions.  The
instruction count is used in places to determine how many instructions to
copy, predicate, etc. and things get confused if that count includes the
extra cost for microcoded ops.

llvm-svn: 117332

efd360c5

Oct 19, 2010

Get rid of static constructors for pass registration. Instead, every pass... · 6c18d1aa

Owen Anderson authored Oct 19, 2010

Get rid of static constructors for pass registration. Instead, every pass exposes an initializeMyPassFunction(), which
must be called in the pass's constructor. This function uses static dependency declarations to recursively initialize
the pass's dependencies.

Clients that only create passes through the createFooPass() APIs will require no changes. Clients that want to use the
CommandLine options for passes will need to manually call the appropriate initialization functions in PassInitialization.h
before parsing commandline arguments.

I have tested this with all standard configurations of clang and llvm-gcc on Darwin. It is possible that there are problems
with the static dependencies that will only be visible with non-standard options. If you encounter any crash in pass
registration/creation, please send the testcase to me directly.

llvm-svn: 116820

6c18d1aa

Oct 12, 2010

Begin adding static dependence information to passes, which will allow us to · 8ac477ff

Owen Anderson authored Oct 12, 2010

perform initialization without static constructors AND without explicit initialization
by the client.  For the moment, passes are required to initialize both their
(potential) dependencies and any passes they preserve.  I hope to be able to relax
the latter requirement in the future.

llvm-svn: 116334

8ac477ff

Oct 08, 2010
- Now with fewer extraneous semicolons! · df7a4f25
  Owen Anderson authored Oct 07, 2010
```
llvm-svn: 115996
```
  df7a4f25
Oct 02, 2010

Thread the determination of branch prediction hit rates back through the... · f31f33ea

Owen Anderson authored Oct 01, 2010

Thread the determination of branch prediction hit rates back through the if-conversion heuristic APIs. For now,
stick with a constant estimate of 90% (branch predictors are good!), but we might find that we want to provide
more nuanced estimates in the future.

llvm-svn: 115364

f31f33ea

Sep 30, 2010
- Silence msvc warnings. · 2016f0ea
  Benjamin Kramer authored Sep 29, 2010
```
llvm-svn: 115097
```
  2016f0ea
Sep 28, 2010
- Give the if-converter access to MachineLoopInfo, and use it to generate plausible branch prediction · 1b35f4cc
  Owen Anderson authored Sep 28, 2010
```
estimates.

llvm-svn: 114981
```
  1b35f4cc
- Part one of switching to using a more sane heuristic for determining if-conversion profitability. · 88af7d00
  Owen Anderson authored Sep 28, 2010
```
Rather than having arbitrary cutoffs, actually try to cost model the conversion.

For now, the constants are tuned to more or less match our existing behavior, but these will be
changed to reflect realistic values as this work proceeds.

llvm-svn: 114973
```
  88af7d00
Sep 10, 2010

Teach if-converter to be more careful with predicating instructions that would · bf407075

Evan Cheng authored Sep 10, 2010

take multiple cycles to decode.
For the current if-converter clients (actually only ARM), the instructions that
are predicated on false are not nops. They would still take machine cycles to
decode. Micro-coded instructions such as LDM / STM can potentially take multiple
cycles to decode. If-converter should take treat them as non-micro-coded
simple instructions.

llvm-svn: 113570

bf407075

Aug 06, 2010
- Reapply r110396, with fixes to appease the Linux buildbot gods. · a7aed186
  Owen Anderson authored Aug 06, 2010
```
llvm-svn: 110460
```
  a7aed186
- Revert r110396 to fix buildbots. · bda59bd2
  Owen Anderson authored Aug 06, 2010
```
llvm-svn: 110410
```
  bda59bd2
- Don't use PassInfo* as a type identifier for passes. Instead, use the address of the static · 755aceb5
  Owen Anderson authored Aug 05, 2010
```
ID member as the sole unique type identifier.  Clean up APIs related to this change.

llvm-svn: 110396
```
  755aceb5
Jul 22, 2010
- Fix batch of converting RegisterPass<> to INTIALIZE_PASS(). · a57b97e7
  Owen Anderson authored Jul 21, 2010
```
llvm-svn: 109045
```
  a57b97e7
Jun 29, 2010

Reapply my if-conversion cleanup from svn r106939 with fixes. · 1e5da550

Bob Wilson authored Jun 29, 2010

There are 2 changes relative to the previous version of the patch:

1) For the "simple" if-conversion case, there's no need to worry about
RemoveExtraEdges not handling an unanalyzable branch.  Predicated terminators
are ignored in this context, so RemoveExtraEdges does the right thing.
This might break someday if we ever treat indirect branches (BRIND) as
predicable, but for now, I just removed this part of the patch, because
in the case where we do not add an unconditional branch, we rely on keeping
the fall-through edge to CvtBBI (which is empty after this transformation).

The change relative to the previous patch is:

@@ -1036,10 +1036,6 @@
     IterIfcvt = false;
   }
 
-  // RemoveExtraEdges won't work if the block has an unanalyzable branch,
-  // which is typically the case for IfConvertSimple, so explicitly remove
-  // CvtBBI as a successor.
-  BBI.BB->removeSuccessor(CvtBBI->BB);
   RemoveExtraEdges(BBI);
 
   // Update block info. BB can be iteratively if-converted.


2) My patch exposed a bug in the code for merging the tail of a "diamond",
which had previously never been exercised.  The code was simply checking that
the tail had a single predecessor, but there was a case in
MultiSource/Benchmarks/VersaBench/dbms where that single predecessor was
neither edge of the diamond.  I added the following change to check for
that:

@@ -1276,7 +1276,18 @@
   // tail, add a unconditional branch to it.
   if (TailBB) {
     BBInfo TailBBI = BBAnalysis[TailBB->getNumber()];
-    if (TailBB->pred_size() == 1 && !TailBBI.HasFallThrough) {
+    bool CanMergeTail = !TailBBI.HasFallThrough;
+    // There may still be a fall-through edge from BBI1 or BBI2 to TailBB;
+    // check if there are any other predecessors besides those.
+    unsigned NumPreds = TailBB->pred_size();
+    if (NumPreds > 1)
+      CanMergeTail = false;
+    else if (NumPreds == 1 && CanMergeTail) {
+      MachineBasicBlock::pred_iterator PI = TailBB->pred_begin();
+      if (*PI != BBI1->BB && *PI != BBI2->BB)
+        CanMergeTail = false;
+    }
+    if (CanMergeTail) {
       MergeBlocks(BBI, TailBBI);
       TailBBI.IsDone = true;
     } else {

With these fixes, I was able to run all the SingleSource and MultiSource
tests successfully.

llvm-svn: 107110

1e5da550

Jun 28, 2010
- new, no longer brain-dead, r106907 · ee6e29aa
  Jim Grosbach authored Jun 28, 2010
```
llvm-svn: 107060
```
  ee6e29aa
- Revert r106907, "make sure to handle dbg_value instructions in the middle of the · b8c058cb
  Daniel Dunbar authored Jun 28, 2010
```
block, not...", it caused a bunch of nightly test regressions.

llvm-svn: 107009
```
  b8c058cb
Jun 26, 2010

Revert my if-conversion cleanup since it caused a bunch of nightly test · 418e64a3

Bob Wilson authored Jun 26, 2010

regressions.

--- Reverse-merging r106939 into '.':
U    test/CodeGen/Thumb2/thumb2-ifcvt3.ll
U    lib/CodeGen/IfConversion.cpp

llvm-svn: 106951

418e64a3

Clean up some problems with extra CFG edges being introduced during · c72da6bb

Bob Wilson authored Jun 26, 2010

if-conversion. The RemoveExtraEdges function doesn't work for blocks that
end with unanalyzable branches, so in those cases, the "extra" edges must
be explicitly removed. The CopyAndPredicateBlock and MergeBlocks methods
can also avoid copying successor edges due to branches that have already
been removed. The latter case is especially helpful when MergeBlocks is
called for handling "diamond" if-conversions, where otherwise you can end
up with some weird intermediate states in the CFG. Unfortunately I've
been unable to find cases where this cleanup actually makes a significant
difference in the code. There is one test where we manage to remove an
empty block at the end of a function. Radar 6911268.

llvm-svn: 106939

c72da6bb

make sure to handle dbg_value instructions in the middle of the block, not · c34befc7
Jim Grosbach authored Jun 25, 2010
```
just at the head, when doing diamond if-conversion. rdar://7797940

llvm-svn: 106907
```
c34befc7
Change if-conversion block size limit checks to add some flexibility. · 02b184de
Evan Cheng authored Jun 25, 2010
```
llvm-svn: 106901
```
02b184de
80 column and typo fix · 8a6deefe
Jim Grosbach authored Jun 25, 2010
```
llvm-svn: 106894
```
8a6deefe