Commits · 0fcc019d367d38b2e071cc8edf2096f616e88bb4 · Roger Ferrer / llvm-epi-0.8

Apr 20, 2009

Added a linearscan register allocation optimization. When the register... · d67efaa8

Evan Cheng authored Apr 20, 2009

Added a linearscan register allocation optimization. When the register allocator spill an interval with multiple uses in the same basic block, it creates a different virtual register for each of the reloads. e.g.

	%reg1498<def> = MOV32rm %reg1024, 1, %reg0, 12, %reg0, Mem:LD(4,4) [sunkaddr39 + 0]
        %reg1506<def> = MOV32rm %reg1024, 1, %reg0, 8, %reg0, Mem:LD(4,4) [sunkaddr42 + 0]
        %reg1486<def> = MOV32rr %reg1506
        %reg1486<def> = XOR32rr %reg1486, %reg1498, %EFLAGS<imp-def,dead>
        %reg1510<def> = MOV32rm %reg1024, 1, %reg0, 4, %reg0, Mem:LD(4,4) [sunkaddr45 + 0]

=>

        %reg1498<def> = MOV32rm %reg2036, 1, %reg0, 12, %reg0, Mem:LD(4,4) [sunkaddr39 + 0]
        %reg1506<def> = MOV32rm %reg2037, 1, %reg0, 8, %reg0, Mem:LD(4,4) [sunkaddr42 + 0]
        %reg1486<def> = MOV32rr %reg1506
        %reg1486<def> = XOR32rr %reg1486, %reg1498, %EFLAGS<imp-def,dead>
        %reg1510<def> = MOV32rm %reg2038, 1, %reg0, 4, %reg0, Mem:LD(4,4) [sunkaddr45 + 0]

From linearscan's point of view, each of reg2036, 2037, and 2038 are separate registers, each is "killed" after a single use. The reloaded register is available and it's often clobbered right away. e.g. In thise case reg1498 is allocated EAX while reg2036 is allocated RAX. This means we end up with multiple reloads from the same stack slot in the same basic block.

Now linearscan recognize there are other reloads from same SS in the same BB. So it'll "downgrade" RAX (and its aliases) after reg2036 is allocated until the next reload (reg2037) is done. This greatly increase the likihood reloads from SS are reused.

This speeds up sha1 from OpenSSL by 5.8%. It is also an across the board win for SPEC2000 and 2006.

llvm-svn: 69585

d67efaa8

Apr 19, 2009

Now that BUILD_VECTOR operands are allowed to be · f2e7133d

Duncan Sands authored Apr 19, 2009

bigger than the vector element type, turn checking
of the operand type back on again, appropriately
adjusted.

llvm-svn: 69516

f2e7133d

Apr 18, 2009

Fix PR3898, which manifests as failures on are an Xcore, · 7b01e664
Chris Lattner authored Apr 18, 2009
```
patch by Jakob Stoklund Olesen!

llvm-svn: 69472
```
7b01e664

Don't try to make BUILD_VECTOR operands have the same · e4ff21ba

Duncan Sands authored Apr 18, 2009

type as the vector element type: allow them to be of
a wider integer type than the element type all the way
through the system, and not just as far as LegalizeDAG.
This should be safe because it used to be this way
(the old type legalizer would produce such nodes), so
backends should be able to handle it.  In fact only
targets which have legal vector types with an illegal
promoted element type will ever see this (eg: <4 x i16>
on ppc).  This fixes a regression with the new type
legalizer (vec_splat.ll).  Also, treat SCALAR_TO_VECTOR
the same as BUILD_VECTOR.  After all, it is just a
special case of BUILD_VECTOR.

llvm-svn: 69467

e4ff21ba

Add a new LiveInterval::overlaps(). It checks if the live interval overlaps a... · b685be0c
Evan Cheng authored Apr 18, 2009
```
Add a new LiveInterval::overlaps(). It checks if the live interval overlaps a range specified by [Start, End).

llvm-svn: 69434
```
b685be0c
Inline asm's were still introducing bogus dependencies; · ad968ee2
Dale Johannesen authored Apr 18, 2009
```
my earlier patch to this code only fixed half of it.

llvm-svn: 69408
```
ad968ee2

Apr 17, 2009

Teach spiller to unfold instructions which modref spill slot when a scratch · b96a1082

Evan Cheng authored Apr 17, 2009

register is available and when it's profitable.

e.g.
     xorq  %r12<kill>, %r13
     addq  %rax, -184(%rbp)
     addq  %r13, -184(%rbp)
==>
     xorq  %r12<kill>, %r13
     movq  -184(%rbp), %r12
     addq  %rax, %r12
     addq  %r13, %r12
     movq  %r12, -184(%rbp)

Two more instructions, but fewer memory accesses. It can also open up
opportunities for more optimizations.

llvm-svn: 69341

b96a1082

Apr 16, 2009

In the list-burr's pseudo two-addr dependency heuristics, don't · eefba6bb

Dan Gohman authored Apr 16, 2009

add dependencies on nodes with exactly one successor which is a
COPY_TO_REGCLASS node. In the case that the copy is coalesced
away, the dependence should be on the user of the copy, rather
than the copy itself.

llvm-svn: 69309

eefba6bb

Handle SUBREG_TO_REG instructions with the same heuristics · 3027bb69
Dan Gohman authored Apr 16, 2009
```
as INSERT_SUBREG instructions in the list-burr scheduler.

llvm-svn: 69308
```
3027bb69

Do not treat beginning of inlined scope as beginning of normal function scope... · dab01f3f

Devang Patel authored Apr 16, 2009

Do not treat beginning of inlined scope as beginning of normal function scope if the location info is missing.

Insetad of doing ...
if (inlined_subroutine && known_location)
  DW_TAG_inline_subroutine
else
  DW_TAG_subprogram

do

if (inlined_subroutine) {
 if (known_location)
   DW_TAG_inline_subroutine
} else {
 DW_TAG_subprogram
}

llvm-svn: 69300

dab01f3f

Record line number at the beginning of a func.start. · 9ac4390b
Devang Patel authored Apr 16, 2009
```
This line was accidently lost yesterday.

llvm-svn: 69286
```
9ac4390b
In -fast mode do what FastISel does. · 653dee08
Devang Patel authored Apr 16, 2009
```
This code could use some refactoring help!

llvm-svn: 69254
```
653dee08

· 46b04e4d

Devang Patel authored Apr 16, 2009

If FastISel is run and it has known DebugLoc then use it.

llvm-svn: 69253

46b04e4d

If location where the function was inlined is not know then do not emit debug... · 43fc7e48
Devang Patel authored Apr 16, 2009
```
If location where the function was inlined is not know then do not emit debug info describing inlinied region.

llvm-svn: 69252
```
43fc7e48

Apr 15, 2009
- s/RootDbgScope/FunctionDbgScope/g · 31043aa2
  Devang Patel authored Apr 15, 2009
```
llvm-svn: 69216
```
  31043aa2
- Add DISubprogram is not null check. · 2738d731
  Devang Patel authored Apr 15, 2009
```
This fixes test/CodeGen//2009-01-21-invalid-debug-info.m test case.

llvm-svn: 69210
```
  2738d731
- Generalize one of the SelectionDAG::ReplaceAllUsesWith overloads · 8aa28b9c
  Dan Gohman authored Apr 15, 2009
```
to support replacing a node with another that has a superset of
the result types. Use this instead of calling
ReplaceAllUsesOfValueWith for each value.

llvm-svn: 69209
```
  8aa28b9c
- Check isInlinedSubroutine() before creating DW_TAG_inlined_subroutine. · 70307db0
  Devang Patel authored Apr 15, 2009
```
llvm-svn: 69202
```
  70307db0
- Fix MachineInstr::getNumExplicitOperands to count · 37608532
  Dan Gohman authored Apr 15, 2009
```
variadic operands correctly. Patch by Jakob Stoklund Olesen!

llvm-svn: 69190
```
  37608532
- Move MachineRegisterInfo::setRegClass out of line. · 210448c2
  Dan Gohman authored Apr 15, 2009
```
llvm-svn: 69126
```
  210448c2
- Move MachineJumpTableInfo::ReplaceMBBInJumpTables out of line. · 505065cd
  Dan Gohman authored Apr 15, 2009
```
llvm-svn: 69125
```
  505065cd
- Give RemoveRegOperandFromRegInfo a comment and move the · 89892b05
  Dan Gohman authored Apr 15, 2009
```
code out of line.

llvm-svn: 69124
```
  89892b05
- Construct and emit DW_TAG_inlined_subroutine DIEs for inlined subroutine... · 32d17a1a
  Devang Patel authored Apr 15, 2009
```
Construct and emit DW_TAG_inlined_subroutine DIEs for inlined subroutine scopes (only in FastISel mode). 

llvm-svn: 69116
```
  32d17a1a
- When the result of an EXTRACT_SUBREG, INSERT_SUBREG, or SUBREG_TO_REG · e5cd1fcd
  Dan Gohman authored Apr 14, 2009
```
operator is used by a CopyToReg to export the value to a different
block, don't reuse the CopyToReg's register for the subreg operation
result if the register isn't precisely the right class for the
subreg operation.

Also, rename the h-registers.ll test, now that there are more
than one.

llvm-svn: 69087
```
  e5cd1fcd
Apr 14, 2009

Do not force asm's to be chained if they don't touch · 83593f41
Dale Johannesen authored Apr 14, 2009
```
memory and aren't volatile.  This was interfering with
good scheduling.

llvm-svn: 69008
```
83593f41

Fix PR3934 part 2. findOnlyInterestingUse() was not setting IsCopy and... · 9787183b

Evan Cheng authored Apr 14, 2009

Fix PR3934 part 2. findOnlyInterestingUse() was not setting IsCopy and IsDstPhys which are returned by value and used by callee. This happened to work on the earlier test cases because of a logic error in the caller side.

llvm-svn: 69006

9787183b

Make these errors more noticable in build logs. · 097f630d
Daniel Dunbar authored Apr 13, 2009
```
llvm-svn: 68998
```
097f630d

Change SelectionDAG type legalization to allow BUILD_VECTOR operands to be · 59dbbb2b

Bob Wilson authored Apr 13, 2009

promoted to legal types without changing the type of the vector.  This is
following a suggestion from Duncan
(http://lists.cs.uiuc.edu/pipermail/llvmdev/2009-February/019923.html).
The transformation that used to be done during type legalization is now
postponed to DAG legalization.  This allows the BUILD_VECTORs to be optimized
and potentially handled specially by target-specific code.

It turns out that this is also consistent with an optimization done by the
DAG combiner: a BUILD_VECTOR and INSERT_VECTOR_ELT may be combined by
replacing one of the BUILD_VECTOR operands with the newly inserted element;
but INSERT_VECTOR_ELT allows its scalar operand to be larger than the
element type, with any extra high bits being implicitly truncated.  The
result is a BUILD_VECTOR where one of the operands has a type larger the
the vector element type.

Any code that operates on BUILD_VECTORs may now need to be aware of the
potential type discrepancy between the vector element type and the
BUILD_VECTOR operands.  This patch updates all of the places that I could
find to handle that case.

llvm-svn: 68996

59dbbb2b

Apr 13, 2009

Rename COPY_TO_SUBCLASS to COPY_TO_REGCLASS, and generalize · 6c142630
Dan Gohman authored Apr 13, 2009
```
it accordingly. Thanks to Jakob Stoklund Olesen for pointing
out how this might be useful.

llvm-svn: 68986
```
6c142630
Refactor some code in SelectionDAGLegalize::ExpandBUILD_VECTOR. · f6c21953
Bob Wilson authored Apr 13, 2009
```
llvm-svn: 68981
```
f6c21953
PR3934: Fix a bogus two-address pass assertion. · f0843803
Evan Cheng authored Apr 13, 2009
```
llvm-svn: 68979
```
f0843803

Right now, Debugging information to encode scopes (DW_TAG_lexical_block)... · 0431504f

Devang Patel authored Apr 13, 2009

Right now, Debugging information to encode scopes (DW_TAG_lexical_block) relies on DBG_LABEL. Unfortunately this intefers with the quality of optimized code.
This patch updates dwarf writer to encode scoping information in DWARF only in FastISel mode.

llvm-svn: 68973

0431504f

· 80be3511

Devang Patel authored Apr 13, 2009

Reapply 68847.
Now debug_inlined section is covered by TAI->doesDwarfUsesInlineInfoSection(), which is false by default.

llvm-svn: 68964

80be3511

Add a new TargetInstrInfo MachineInstr opcode, COPY_TO_SUBCLASS. · 60a446ab

Dan Gohman authored Apr 13, 2009

This will be used to replace things like X86's MOV32to32_.

Enhance ScheduleDAGSDNodesEmit to be more flexible and robust
in the presense of subregister superclasses and subclasses. It
can now cope with the definition of a virtual register being in
a subclass of a use.

Re-introduce the code for recording register superreg classes and
subreg classes. This is needed because when subreg extracts and
inserts get coalesced away, the virtual registers are left in
the correct subclass.

llvm-svn: 68961

60a446ab

Don't abort on an aliasing physical register that does not have · 1d504079
Dan Gohman authored Apr 13, 2009
```
a live interval. This is needed for some upcoming subreg changes.

llvm-svn: 68956
```
1d504079

When assigning a physical register to a MachineOperand, set · 4d62ff14

Dan Gohman authored Apr 13, 2009

the subreg field to 0, since the subreg field is only used
for virtual register subregs. This doesn't change
current functionality; it just eliminates bogus noise from
debug output.

llvm-svn: 68955

4d62ff14

Add an assertion to verify that a copy was actually emitted. · 85abd983
Dan Gohman authored Apr 13, 2009
```
llvm-svn: 68953
```
85abd983

Add a new "available_externally" linkage type. This is intended · 184f1be4

Chris Lattner authored Apr 13, 2009

to support C99 inline, GNU extern inline, etc.  Related bugzilla's
include PR3517, PR3100, & PR2933.  Nothing uses this yet, but it
appears to work.

llvm-svn: 68940

184f1be4

Apr 12, 2009
- make UpdateValueMap handle the possiblity that we could be · a101f6f8
  Chris Lattner authored Apr 12, 2009
```
copying into the right register, avoiding a copy.

llvm-svn: 68889
```
  a101f6f8
- optimize FastISel::UpdateValueMap to avoid duplicate map lookups, · ada5d6c3
  Chris Lattner authored Apr 12, 2009
```
and make it return the assigned register.

llvm-svn: 68888
```
  ada5d6c3