Commits · 3e3cdecf98e5a5a2ef1296f96c52da4b435dd950 · Roger Ferrer / llvm-epi-0.8

Jun 02, 2012

Fix typos found by http://github.com/lyda/misspell-check · bde91766
Benjamin Kramer authored Jun 02, 2012
```
llvm-svn: 157885
```

Switch all register list clients to the new MC*Iterator interface. · 54038d79

Jakob Stoklund Olesen authored Jun 01, 2012

No functional change intended.

Sorry for the churn. The iterator classes are supposed to help avoid
giant commits like this one in the future. The TableGen-produced
register lists are getting quite large, and it may be necessary to
change the table representation.

This makes it possible to do so without changing all clients (again).

llvm-svn: 157854


Jun 01, 2012
- Switch some getAliasSet clients to MCRegAliasIterator. · 92a00839
  Jakob Stoklund Olesen authored Jun 01, 2012
```
MCRegAliasIterator can optionally visit the register itself, allowing
for simpler code.

llvm-svn: 157837
```
May 24, 2012
- Forgot to reverse conditional. · e351e8c5
  Bill Wendling authored May 23, 2012
```
llvm-svn: 157349
```
- Reduce indentation by early detection of 'continue'. No functionality change. · 041793c4
  Bill Wendling authored May 23, 2012
```
llvm-svn: 157348
```
Apr 23, 2012

This patch fixes a problem which arose when using the Post-RA scheduler · 9a091475

Preston Gurd authored Apr 23, 2012

on X86 Atom. Some of our tests failed because the tail merging part of
the BranchFolding pass was creating new basic blocks which did not
contain live-in information. When the anti-dependency code in the Post-RA
scheduler ran, it would sometimes rename the register containing
the function return value because the fact that the return value was
live-in to the subsequent block had been lost. To fix this, it is necessary
to run the RegisterScavenging code in the BranchFolding pass.

This patch makes sure that the register scavenging code is invoked
in the X86 subtarget only when post-RA scheduling is being done.
Post RA scheduling in the X86 subtarget is only done for Atom.

This patch adds a new function to the TargetRegisterClass to control
whether or not live-ins should be preserved during branch folding.
This is necessary in order for the anti-dependency optimizations done
during the PostRASchedulerList pass to work properly when doing
Post-RA scheduling for the X86 in general and for the Intel Atom in particular.

The patch adds and invokes the new function trackLivenessAfterRegAlloc()
instead of using the existing requiresRegisterScavenging().
It changes BranchFolding.cpp to call trackLivenessAfterRegAlloc() instead of
requiresRegisterScavenging(). It changes all the targets that
implemented requiresRegisterScavenging() to also implement
trackLivenessAfterRegAlloc().

It adds an assertion in the Post RA scheduler to make sure that post RA
liveness information is available when it is needed.

It changes the X86 break-anti-dependencies test to use -mcpu=atom, in order
to avoid running into the added assertion.

Finally, this patch restores the use of anti-dependency checking
(which was turned off temporarily for the 3.1 release) for
Intel Atom in the Post RA scheduler.

Patch by Andy Zhang!

Thanks to Jakob and Anton for their reviews.

llvm-svn: 155395


Mar 27, 2012

Branch folding may invalidate liveness. · d1664a15

Jakob Stoklund Olesen authored Mar 27, 2012

Branch folding can use a register scavenger to update liveness
information when required. Don't do that if liveness information is
already invalid.

llvm-svn: 153517


Mar 07, 2012

Where the BranchFolding pass removes a branch then adds another better branch, · 7c5dcb6c

Bill Wendling authored Mar 07, 2012

the DebugLoc information can be maintained throughout by grabbing the DebugLoc
before the RemoveBranch and then passing the result to the InsertBranch.
Patch by Andrew Stanford-Jason!
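
The save-then-reinsert pattern described above can be sketched as follows. This is a minimal illustration under stated assumptions: `DebugLoc`, `Block`, `removeBranch`, `insertBranch`, and `retargetBranch` are hypothetical stand-ins, not LLVM's actual classes or signatures.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-ins for illustration; not LLVM's real classes.
struct DebugLoc {
    unsigned line;  // 0 means "unknown location"
};

struct Block {
    bool hasBranch;
    std::string target;
    DebugLoc loc;   // location attached to the terminating branch
};

// Removes the terminating branch. Its location is lost unless the
// caller saved it beforehand.
void removeBranch(Block &bb) {
    bb.hasBranch = false;
    bb.target.clear();
    bb.loc.line = 0;
}

// Inserts a branch to `target` carrying the given location.
void insertBranch(Block &bb, const std::string &target, DebugLoc dl) {
    bb.hasBranch = true;
    bb.target = target;
    bb.loc = dl;
}

// Replaces the branch while keeping its debug location: grab the
// location first, then remove, then insert with the saved location.
void retargetBranch(Block &bb, const std::string &newTarget) {
    DebugLoc saved = bb.loc;  // capture before removal
    removeBranch(bb);
    insertBranch(bb, newTarget, saved);
}
```

The only point of the sketch is the ordering: the location has to be read before the old branch is destroyed.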

llvm-svn: 152212


Mar 05, 2012
- Convert more GenRegisterInfo tables from unsigned to uint16_t to reduce static data size. · 4b02a29e
  Craig Topper authored Mar 05, 2012
```
llvm-svn: 152016
```
Mar 04, 2012
- Use uint16_t to store register overlaps to reduce static data. · 1d326588
  Craig Topper authored Mar 04, 2012
```
llvm-svn: 152001
```
Feb 22, 2012
- Remove extra semi-colons. · 5dfe6dab
  Chad Rosier authored Feb 22, 2012
```
llvm-svn: 151169
```
Feb 16, 2012

Handle register masks in branch folding. · e9e30d08

Jakob Stoklund Olesen authored Feb 15, 2012

Don't attempt to move instructions with regmask operands. They are most
likely calls anyway.

llvm-svn: 150634


Feb 08, 2012
- Move pass configuration out of pass constructors: BranchFolderPass · 58648e4e
  Andrew Trick authored Feb 08, 2012
```
llvm-svn: 150095
```
- whitespace · 9e761997
  Andrew Trick authored Feb 08, 2012
```
llvm-svn: 150094
```
Jan 20, 2012
- More dead code removal (using -Wunreachable-code) · 46a9f016
  David Blaikie authored Jan 20, 2012
```
llvm-svn: 148578
```
Jan 12, 2012

When hoisting common code, watch out for uses which are marked "kill". If the · 5c03a6b8

Evan Cheng authored Jan 12, 2012

killed registers are needed below the insertion point, then unset the kill
marker.
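
As a rough sketch of the kill-marker fix described above (assuming simplified, hypothetical `Operand` and `Instr` types and a hypothetical `clearStaleKills` helper, none of which are LLVM's real API), the check might look like this:

```cpp
#include <cassert>
#include <set>
#include <vector>

// Hypothetical, simplified operand and instruction types.
struct Operand {
    unsigned reg;
    bool isUse;
    bool isKill;
};

struct Instr {
    std::vector<Operand> operands;
};

// Clears the kill flag on every use in `hoisted` whose register is still
// read by code at or below the insertion point (`neededBelow`).
void clearStaleKills(Instr &hoisted, const std::set<unsigned> &neededBelow) {
    for (Operand &op : hoisted.operands) {
        if (op.isUse && op.isKill && neededBelow.count(op.reg))
            op.isKill = false;
    }
}
```

Here `neededBelow` is assumed to be computed by the caller from the instructions that remain after the insertion point.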

Sorry I'm not able to find a reduced test case.

rdar://10660944

llvm-svn: 148043


Jan 07, 2012

Revert part of r147716. Looks like x87 instructions kill markers are all messed · 6cc8d498

Evan Cheng authored Jan 07, 2012

up so branch folding pass can't use the scavenger. :-(  This doesn't break
anything currently. It just means targets which do not carefully update kill
markers cannot run post-ra scheduler (not new, it has always been the case).

We should fix this at some point since it's really hacky.

llvm-svn: 147719


Added a late machine instruction copy propagation pass. This catches · 00b1a3cd

Evan Cheng authored Jan 07, 2012

opportunities that only present themselves after late optimizations
such as tail duplication, e.g.
## BB#1:
        movl    %eax, %ecx
        movl    %ecx, %eax
        ret

The register allocator also leaves some of them around (due to false
dep between copies from phi-elimination, etc.)

This required some changes in codegen passes. Post-ra scheduler and the
pseudo-instruction expansion passes have been moved after branch folding
and tail merging. They previously ran before branch folding because it did
not always update block live-ins. That's fixed now. The pass change makes
sense independently, since we want to properly schedule instructions after
branch folding / tail duplication.

rdar://10428165
rdar://10640363

llvm-svn: 147716


Dec 14, 2011

- Add MachineInstrBundle.h and MachineInstrBundle.cpp. This includes a function · 7fae11b2

Evan Cheng authored Dec 14, 2011

  to finalize MI bundles (i.e. add the BUNDLE instruction and compute the register def
  and use lists of the BUNDLE instruction) and a pass to unpack bundles.
- Teach more of the MachineBasicBlock and MachineInstr methods to be bundle aware.
- Switch Thumb2 IT block to MI bundles and delete the hazard recognizer hack to
  prevent IT blocks from being broken apart.

llvm-svn: 146542


Dec 07, 2011

Add bundle aware API for querying instruction properties and switch the code · 7f8e563a

Evan Cheng authored Dec 07, 2011

generator to it. For non-bundle instructions, these behave exactly the same
as the MC layer API.

For properties like mayLoad / mayStore, look into the bundle and return true
if any of the bundled instructions has the property.
For properties like isPredicable, only return true if *all* of the bundled
instructions have the property.
For properties like canFoldAsLoad, isCompare, conservatively return false for
bundles.
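
The three query flavors described above (any, all, conservatively false) can be sketched as follows. This is an illustration only: `Instr`, `Bundle`, and the three helper functions are hypothetical names, and a bundle is modeled as a non-empty vector where a single element means "not bundled".

```cpp
#include <cassert>
#include <vector>

// Hypothetical, simplified model of an instruction and its properties.
struct Instr {
    bool mayLoad;
    bool isPredicable;
    bool isCompare;
};

// Assumed non-empty; one element means the instruction is not bundled.
typedef std::vector<Instr> Bundle;

// "Any" semantics: true if at least one bundled instruction may load.
bool bundleMayLoad(const Bundle &b) {
    for (const Instr &mi : b)
        if (mi.mayLoad) return true;
    return false;
}

// "All" semantics: true only if every bundled instruction is predicable.
bool bundleIsPredicable(const Bundle &b) {
    for (const Instr &mi : b)
        if (!mi.isPredicable) return false;
    return true;
}

// Conservative semantics: a real bundle (more than one instruction)
// always answers false; a lone instruction answers for itself.
bool bundleIsCompare(const Bundle &b) {
    return b.size() == 1 && b[0].isCompare;
}
```

For a single, unbundled instruction all three helpers reduce to reading the property directly, which matches the statement that non-bundle instructions behave the same as before.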

llvm-svn: 146026


Oct 26, 2011

Reapply r142920 with fix: · 9b993222

Bill Wendling authored Oct 26, 2011

An MBB which branches to an EH landing pad shouldn't be considered for tail merging.

In SjLj EH, the jump to the landing pad is not done explicitly through a branch
statement. The EH landing pad is added as a successor to the throwing
BB. Because of that however, the branch folding pass could mistakenly think that
it could merge the throwing BB with another BB. This isn't safe to do.
<rdar://problem/10334833>
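
A minimal sketch of the check this fix implies, assuming a hypothetical `Block` type with an `isLandingPad` flag and an explicit successor list (not LLVM's MachineBasicBlock interface):

```cpp
#include <cassert>
#include <vector>

// Hypothetical, simplified basic block.
struct Block {
    bool isLandingPad;
    std::vector<const Block *> successors;
};

// Returns true if any successor of `bb` is an EH landing pad.
bool hasLandingPadSuccessor(const Block &bb) {
    for (const Block *succ : bb.successors)
        if (succ->isLandingPad) return true;
    return false;
}

// A block that can transfer control to a landing pad is excluded from
// tail merging, because that edge is not an explicit branch.
bool isTailMergeCandidate(const Block &bb) {
    return !hasLandingPadSuccessor(bb);
}
```

The real pass applies further criteria before merging; this sketch covers only the landing-pad exclusion.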

llvm-svn: 143001


Oct 25, 2011

Revert commit 142891. Takumi bisected the tablegen miscompiles · 6ca458e4

Duncan Sands authored Oct 25, 2011

down to this commit.  Original commit message:

An MBB which branches to an EH landing pad shouldn't be considered for tail merging.

In SjLj EH, the jump to the landing pad is not done explicitly through a branch
statement. The EH landing pad is added as a successor to the throwing
BB. Because of that however, the branch folding pass could mistakenly think that
it could merge the throwing BB with another BB. This isn't safe to do.
<rdar://problem/10334833>

llvm-svn: 142920


An MBB which branches to an EH landing pad shouldn't be considered for tail merging. · 38ced995

Bill Wendling authored Oct 25, 2011

In SjLj EH, the jump to the landing pad is not done explicitly through a branch
statement. The EH landing pad is added as a successor to the throwing
BB. Because of that however, the branch folding pass could mistakenly think that
it could merge the throwing BB with another BB. This isn't safe to do.
<rdar://problem/10334833>

llvm-svn: 142891


Aug 05, 2011

Fix liveness computations in BranchFolding. · d633abeb

Jakob Stoklund Olesen authored Aug 05, 2011

The old code would look at kills and defs in one pass over the
instruction operands, causing problems with this code:

  %R0<def>, %CPSR<def,dead> = tLSLri %R5<kill>, 2, pred:14, pred:%noreg
  %R0<def>, %CPSR<def,dead> = tADDrr %R4<kill>, %R0<kill>, pred:14, %pred:%noreg

The last instruction kills and redefines %R0, so it is still live after
the instruction.

This caused a register scavenger crash when compiling 483.xalancbmk for
armv6. I am not including a test case because it requires too much bad
luck to expose this old bug.

First you need to convince the register allocator to use %R0 twice on
the tADDrr instruction, then you have to convince BranchFolding to do
something that causes it to run the register scavenger on the bad block.
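
The kill-then-def ordering at the heart of this fix can be sketched as a two-pass update of the live set. The `Operand` and `Instr` types and the `stepForward` function below are hypothetical simplifications, not the code from BranchFolding.

```cpp
#include <cassert>
#include <set>
#include <vector>

// Hypothetical, simplified register operand.
struct Operand {
    unsigned reg;
    bool isDef;
    bool isKill;  // meaningful for uses
    bool isDead;  // meaningful for defs
};

typedef std::vector<Operand> Instr;

// Steps the live register set forward across one instruction.
// Pass 1 removes killed uses, pass 2 adds non-dead defs, so a register
// that is both killed and redefined stays live afterwards.
void stepForward(const Instr &mi, std::set<unsigned> &live) {
    for (const Operand &op : mi)
        if (!op.isDef && op.isKill) live.erase(op.reg);
    for (const Operand &op : mi)
        if (op.isDef && !op.isDead) live.insert(op.reg);
}
```

A single pass in operand order would insert the redefined register at its def and then erase it again at its killed use, which is the failure mode described above.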

<rdar://problem/9898200>

llvm-svn: 136973


Jul 07, 2011

When tail-merging multiple blocks, make sure to correctly update the live-in... · bf007364

Eli Friedman authored Jul 06, 2011

When tail-merging multiple blocks, make sure to correctly update the live-in list on the merged block to correctly account for the live-outs of all the predecessors.  They might not be the same in all cases (the testcase I have involves a PHI node where one of the operands is an IMPLICIT_DEF).

Unfortunately, the testcase I have is large and confidential, so I don't have a test to commit at the moment; I'll see if I can come up with something smaller where this issue reproduces.

<rdar://problem/9716278>

llvm-svn: 134565


Jun 28, 2011

- Rename TargetInstrDesc, TargetOperandInfo to MCInstrDesc and MCOperandInfo and · 6cc775f9

Evan Cheng authored Jun 28, 2011

sink them into MC layer.
- Added MCInstrInfo, which captures the tablegen generated static data. Change
TargetInstrInfo so it's based off MCInstrInfo.

llvm-svn: 134021


Jun 14, 2011
- Add 132986 back, but avoid non-determinism if a bb address gets reused. · 3aeaf9e4
  Rafael Espindola authored Jun 14, 2011
```
llvm-svn: 132995
```
- revert 132986 to see if the bots go green. · 06ba7a68
  Rafael Espindola authored Jun 14, 2011
```
llvm-svn: 132988
```
- Make the threshold used by branch folding softer. Before we would get a · da24f2f8
  Rafael Espindola authored Jun 14, 2011
```
sharp all-or-nothing transition when one extra predecessor was added. Now
we still test the first ones for merging.

llvm-svn: 132974
```
May 26, 2011
- Add comment. · cdec1141
  Devang Patel authored May 26, 2011
```
llvm-svn: 132149
```
- During branch folding avoid inserting redundant DBG_VALUE machine instructions. · 42ddaa10
  Devang Patel authored May 26, 2011
```
llvm-svn: 132148
```
May 13, 2011
- Update comment. · 8a88026a
  Evan Cheng authored May 12, 2011
```
llvm-svn: 131258
```
May 12, 2011
- Re-enable branchfolding common code hoisting optimization. Fixed a liveness... · 43054e61
  Evan Cheng authored May 12, 2011
```
Re-enable branchfolding common code hoisting optimization. Fixed a liveness test bug and also taught it to update liveins.

llvm-svn: 131241
```
- Temporarily disable the transformation. It's breaking 186.crafty in some configuration. · 33469b2c
  Evan Cheng authored May 12, 2011
```
llvm-svn: 131235
```
- Re-commit 131172 with fix. MachineInstr identity checks should check dead · cfdf3390
  Evan Cheng authored May 12, 2011
```
markers. In some cases a register def is dead on one path, but not on
another.

This is passing Clang self-hosting.

llvm-svn: 131214
```
May 11, 2011

Revert 131172 as it is causing clang to miscompile itself. I will try · 2a09d659
Rafael Espindola authored May 11, 2011
```
to provide a reduced testcase.

llvm-svn: 131176
```

Add a late optimization to BranchFolding that hoists common instruction sequences · 05fc35e2

Evan Cheng authored May 11, 2011

at the start of basic blocks to their common predecessor. It's actually quite
common (e.g. about 50 times in JM/lencod) and has been shown to be a nice code size
benefit. e.g.

        pushq   %rax
        testl   %edi, %edi
        jne     LBB0_2
## BB#1:
        xorb    %al, %al
        popq    %rdx
        ret
LBB0_2:
        xorb    %al, %al
        callq   _foo
        popq    %rdx
        ret

=>

        pushq   %rax
        xorb    %al, %al
        testl   %edi, %edi
        je      LBB0_2
## BB#1:
        callq   _foo
LBB0_2:
        popq    %rdx
        ret
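
The hoisting step in the example above can be sketched on a toy representation where each instruction is a string. `Block`, `commonPrefixLength`, and `hoistCommonPrefix` are hypothetical names, the insertion position is supplied by the caller, and the sketch deliberately omits the register and flag dependency checks a real pass would need before moving anything.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical toy block: instructions as text.
typedef std::vector<std::string> Block;

// Returns how many leading instructions two successor blocks share.
std::size_t commonPrefixLength(const Block &a, const Block &b) {
    std::size_t n = 0;
    while (n < a.size() && n < b.size() && a[n] == b[n]) ++n;
    return n;
}

// Moves the shared leading instructions of successors `a` and `b` into
// `pred` at index `insertPos` (which must not exceed pred.size()).
// Returns the number of instructions hoisted. No safety analysis is done.
std::size_t hoistCommonPrefix(Block &pred, std::size_t insertPos,
                              Block &a, Block &b) {
    std::size_t n = commonPrefixLength(a, b);
    pred.insert(pred.begin() + insertPos, a.begin(), a.begin() + n);
    a.erase(a.begin(), a.begin() + n);
    b.erase(b.begin(), b.begin() + n);
    return n;
}
```

Applied to the example, the shared `xorb` is placed before `testl` in the predecessor; the later merging of the common `popq`/`ret` tails is a separate step that this sketch does not cover.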

rdar://9145558

llvm-svn: 131172


Apr 22, 2011

Branch folding is folding a landing pad into a regular BB. · c14d7322

Bill Wendling authored Apr 22, 2011

An exception is thrown via a call to __cxa_throw, which we don't expect to
return. Therefore, the "true" part of the invoke goes to a BB that has
'unreachable' as its only instruction. This is lowered into an empty MachineBB.
The landing pad for this invoke, however, is directly after the "true" MBB.
When the empty MBB is removed, the landing pad is directly below the BB with the
invoke call. The unconditional branch is removed and then the two blocks are
merged together.

The testcase is too big for a regression test.
<rdar://problem/9305728>

llvm-svn: 129965


Feb 22, 2011
- Add more debugging output. · b8ed462c
  Evan Cheng authored Feb 21, 2011
```
llvm-svn: 126158
```
Aug 06, 2010
- Reapply r110396, with fixes to appease the Linux buildbot gods. · a7aed186
  Owen Anderson authored Aug 06, 2010
```
llvm-svn: 110460
```