  1. Jan 09, 2011
  2. Jan 08, 2011
      Fix coding style. · 0939bc37
      Cameron Zwarich authored
      llvm-svn: 123093
      fix a latent bug in memcpyoptimizer that my recent patches exposed: it wasn't · 7d6433ae
      Chris Lattner authored
      updating memdep when fusing stores together.  This fixes the crash optimizing
      the bullet benchmark.
      
      llvm-svn: 123091
      tryMergingIntoMemset can only handle constant length memsets. · ff6ed2ac
      Chris Lattner authored
      llvm-svn: 123090
      Merge memsets followed by neighboring memsets and other stores into · 9a1d63ba
      Chris Lattner authored
      larger memsets.  Among other things, this fixes rdar://8760394 and
      allows us to handle "Example 2" from http://blog.regehr.org/archives/320,
      compiling it into a single 4096-byte memset:
      
      _mad_synth_mute:                        ## @mad_synth_mute
      ## BB#0:                                ## %entry
      	pushq	%rax
      	movl	$4096, %esi             ## imm = 0x1000
      	callq	___bzero
      	popq	%rax
      	ret
      
      llvm-svn: 123089
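The transformation above can be illustrated at the C level (a sketch for illustration, not the pass's actual test case): a run of neighboring memsets and plain stores that together zero one contiguous region is observably equivalent to a single larger memset, which is what entitles memcpyopt to fuse them.

```c
#include <assert.h>
#include <string.h>

/* Before the optimization: neighboring memsets and a plain store,
   together zeroing one contiguous 16-byte region. */
static void zero_piecewise(char *p) {
    memset(p, 0, 7);      /* bytes [0, 7)  */
    p[7] = 0;             /* byte   7      */
    memset(p + 8, 0, 8);  /* bytes [8, 16) */
}

/* After the optimization: the single larger memset the pass forms. */
static void zero_merged(char *p) {
    memset(p, 0, 16);
}
```

Both functions write exactly the same bytes over the same region, so replacing one with the other is a pure strength reduction.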
      fix an issue in IsPointerOffset that prevented us from recognizing that · 5120ebf1
      Chris Lattner authored
      P and P+1 are relative to the same base pointer.
      
      llvm-svn: 123087
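A rough sketch of the property IsPointerOffset establishes (the struct and function names here are hypothetical, not LLVM's API): two addresses are candidates for merging only when they decompose to the same base pointer plus constant byte offsets, as P and P+1 do.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical decomposition of an address into base + constant offset;
   in the real pass this comes from walking GEP chains. */
struct Addr {
    const void *base;
    ptrdiff_t offset;
};

/* Returns 1 and sets *delta when b is a constant offset from a,
   i.e. both addresses are relative to the same base pointer. */
static int pointer_offset(struct Addr a, struct Addr b, ptrdiff_t *delta) {
    if (a.base != b.base)
        return 0;   /* different underlying objects: give up */
    *delta = b.offset - a.offset;
    return 1;
}
```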
      enhance memcpyopt to merge a store and a subsequent · 4dc1fd93
      Chris Lattner authored
      memset into a single larger memset.
      
      llvm-svn: 123086
      constify TargetData references. · c638147e
      Chris Lattner authored
      Split memset formation logic out into its own
      "tryMergingIntoMemset" helper function.
      
      llvm-svn: 123081
      When loop rotation happens, it is *very* common for the duplicated condbr · 59c82f85
      Chris Lattner authored
      to be foldable into an uncond branch.  When this happens, we can make a
      much simpler CFG for the loop, which is important for nested loop cases
      where we want the outer loop to be aggressively optimized.
      
      Handle this case more aggressively.  For example, previously on
      phi-duplicate.ll we would get this:
      
      
      define void @test(i32 %N, double* %G) nounwind ssp {
      entry:
        %cmp1 = icmp slt i64 1, 1000
        br i1 %cmp1, label %bb.nph, label %for.end
      
      bb.nph:                                           ; preds = %entry
        br label %for.body
      
      for.body:                                         ; preds = %bb.nph, %for.cond
        %j.02 = phi i64 [ 1, %bb.nph ], [ %inc, %for.cond ]
        %arrayidx = getelementptr inbounds double* %G, i64 %j.02
        %tmp3 = load double* %arrayidx
        %sub = sub i64 %j.02, 1
        %arrayidx6 = getelementptr inbounds double* %G, i64 %sub
        %tmp7 = load double* %arrayidx6
        %add = fadd double %tmp3, %tmp7
        %arrayidx10 = getelementptr inbounds double* %G, i64 %j.02
        store double %add, double* %arrayidx10
        %inc = add nsw i64 %j.02, 1
        br label %for.cond
      
      for.cond:                                         ; preds = %for.body
        %cmp = icmp slt i64 %inc, 1000
        br i1 %cmp, label %for.body, label %for.cond.for.end_crit_edge
      
      for.cond.for.end_crit_edge:                       ; preds = %for.cond
        br label %for.end
      
      for.end:                                          ; preds = %for.cond.for.end_crit_edge, %entry
        ret void
      }
      
      Now we get the much nicer:
      
      define void @test(i32 %N, double* %G) nounwind ssp {
      entry:
        br label %for.body
      
      for.body:                                         ; preds = %entry, %for.body
        %j.01 = phi i64 [ 1, %entry ], [ %inc, %for.body ]
        %arrayidx = getelementptr inbounds double* %G, i64 %j.01
        %tmp3 = load double* %arrayidx
        %sub = sub i64 %j.01, 1
        %arrayidx6 = getelementptr inbounds double* %G, i64 %sub
        %tmp7 = load double* %arrayidx6
        %add = fadd double %tmp3, %tmp7
        %arrayidx10 = getelementptr inbounds double* %G, i64 %j.01
        store double %add, double* %arrayidx10
        %inc = add nsw i64 %j.01, 1
        %cmp = icmp slt i64 %inc, 1000
        br i1 %cmp, label %for.body, label %for.end
      
      for.end:                                          ; preds = %for.body
        ret void
      }
      
      With all of these recent changes, we are now able to compile:
      
      void foo(char *X) {
       for (int i = 0; i != 100; ++i) 
         for (int j = 0; j != 100; ++j)
           X[j+i*100] = 0;
      }
      
      into a single memset of 10000 bytes.  This series of changes
      should also be helpful for other nested loop scenarios as well.
      
      llvm-svn: 123079
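The final claim is easy to sanity-check at the source level: foo from the message writes every index in [0, 10000) exactly once, so it is observably equivalent to one 10000-byte memset.

```c
#include <assert.h>
#include <string.h>

/* The nested-loop function from the commit message: indices
   j + i*100 for i, j in [0, 100) cover [0, 10000) exactly once. */
static void foo(char *X) {
    for (int i = 0; i != 100; ++i)
        for (int j = 0; j != 100; ++j)
            X[j + i * 100] = 0;
}
```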
      make domtree verification print something useful on failure. · 5f7734c4
      Chris Lattner authored
      llvm-svn: 123078
      split ssa updating code out to its own helper function. Don't bother · 30f318e5
      Chris Lattner authored
      moving the OrigHeader block anymore: we just merge it away anyway so
      its code layout doesn't matter.
      
      llvm-svn: 123077
      Implement a TODO: Enhance loopinfo to merge away the unconditional branch · 2615130e
      Chris Lattner authored
      that it was leaving in loops after rotation (between the original latch
block and the original header).
      
      With this change, it is possible for rotated loops to have just a single
      basic block, which is useful.
      
      llvm-svn: 123075
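In C terms, rotation rewrites a top-tested loop as an entry guard plus a bottom-tested loop, so the body, increment, and exit branch can all sit in one block (a schematic sketch, not the pass's output):

```c
#include <assert.h>

/* Top-tested form, as written by the programmer: the test sits in
   its own header block above the body. */
static int sum_while(int n) {
    int i = 0, s = 0;
    while (i < n) { s += i; ++i; }
    return s;
}

/* Rotated form: an entry guard plus a do-while whose single block
   contains the body and the latch test. */
static int sum_rotated(int n) {
    int i = 0, s = 0;
    if (i < n) {
        do { s += i; ++i; } while (i < n);
    }
    return s;
}
```

The guard runs the test once up front; after that the loop needs only one backwards branch per iteration, which is the single-basic-block shape the commit enables.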
      various code cleanups, enhance MergeBlockIntoPredecessor to preserve · 930b716e
      Chris Lattner authored
      loop info.
      
      llvm-svn: 123074
      inline preserveCanonicalLoopForm now that it is simple. · fee37c5f
      Chris Lattner authored
      llvm-svn: 123073
      Three major changes: · 063dca0f
      Chris Lattner authored
      1. Rip out LoopRotate's domfrontier updating code.  It isn't
         needed now that LICM doesn't use DF and it is super complex
         and gross.
      2. Make DomTree updating code a lot simpler and faster.  The 
         old loop over all the blocks was just to find a block??
      3. Change the code that inserts the new preheader to just use
         SplitCriticalEdge instead of doing an overcomplex 
         reimplementation of it.
      
      No behavior change, except for the name of the inserted preheader.
      
      llvm-svn: 123072
      reduce nesting. · 30d95f9f
      Chris Lattner authored
      llvm-svn: 123071