Commits · d65524da0f9e120f9f4f2daea1298685ed90a64f · Roger Ferrer / llvm-epi-0.8

Jan 09, 2011
- Add a forgotten VireReg2IndexFunctor. · d65524da
  Jakob Stoklund Olesen authored Jan 09, 2011
```
llvm-svn: 123123
```
  d65524da
- Apply -fPIC to C sources too. · 45539eda
  Oscar Fuentes authored Jan 09, 2011
```
llvm-svn: 123122
```
  45539eda
- Instcombine: Fix pattern where the sext did not dominate the icmp using it · cc21c4aa
  Tobias Grosser authored Jan 09, 2011
```
llvm-svn: 123121
```
  cc21c4aa
- DominatorTree->print() now prints the status of the DFSNumbers correctly · bc453f6e
  Tobias Grosser authored Jan 09, 2011
```
llvm-svn: 123120
```
  bc453f6e
- Rewrite handling of LLVM_ENABLE_PIC. It was being processed after · edfc1842
  Oscar Fuentes authored Jan 09, 2011
```
config.h was generated, so it had no effect on it.

Thanks to arrowdodger for pointing out this and a tentative patch.

llvm-svn: 123119
```
  edfc1842
- LoopInstSimplify preserves LoopSimplify. · a42e5915
  Cameron Zwarich authored Jan 09, 2011
```
llvm-svn: 123117
```
  a42e5915
- Another missed memset in std::vector initialization. · 82e6f6a3
  Chandler Carruth authored Jan 09, 2011
```
llvm-svn: 123116
```
  82e6f6a3
- Eliminate some extra hash table lookups. · 8a00d817
  Cameron Zwarich authored Jan 09, 2011
```
llvm-svn: 123115
```
  8a00d817
- Add an informative comment. · a5910d1b
  Cameron Zwarich authored Jan 09, 2011
```
llvm-svn: 123114
```
  a5910d1b
- Fix a cut-paste-o so that the sample code is correct for my last note. · 43f6d1b6
  Chandler Carruth authored Jan 09, 2011
```
Also, switch to a more clear 'sink' function with its declaration to
avoid any confusion about 'g'. Thanks for the suggestion Frits.

llvm-svn: 123113
```
  43f6d1b6
- Another missed optimization of trivial vector code. · ad6e1f05
  Chandler Carruth authored Jan 09, 2011
```
llvm-svn: 123112
```
  ad6e1f05
- Add a note about vector's size-constructor producing dead stores. · f3261930
  Chandler Carruth authored Jan 09, 2011
```
llvm-svn: 123111
```
  f3261930
- Simplify LiveDebugVariables by storing MachineOperand copies locations instead · 9adf5e09
  Jakob Stoklund Olesen authored Jan 09, 2011
```
of using a Location class with the same information.

When making a copy of a MachineOperand that was already stored in a
MachineInstr, it is necessary to clear the parent pointer on the copy. Otherwise
the register use-def lists become inconsistent.

Add MachineOperand::clearParent() to do that. An alternative would be a custom
MachineOperand copy constructor that cleared ParentMI. I didn't want to do that
because of the performance impact.

llvm-svn: 123109
```
  9adf5e09
- Shrink a BitVector that didn't mean to store bits for all physical registers. · 3a9e5c29
  Jakob Stoklund Olesen authored Jan 09, 2011
```
llvm-svn: 123108
```
  3a9e5c29
- Replace TargetRegisterInfo::printReg with a PrintReg class that also works without a TRI instance. · 1331a15b
  Jakob Stoklund Olesen authored Jan 09, 2011
```
Print virtual registers numbered from 0 instead of the arbitrary
FirstVirtualRegister. The first virtual register is printed as %vreg0.
TRI::NoRegister is printed as %noreg.

llvm-svn: 123107
```
  1331a15b
- Use IndexedMap for MachineRegisterInfo as well. No functional change. · 7f93d8d6
  Jakob Stoklund Olesen authored Jan 09, 2011
```
llvm-svn: 123106
```
  7f93d8d6
- teach SCEV analysis of PHI nodes that PHI recurences formed · 10223a3f
  Chris Lattner authored Jan 09, 2011
```
with GEP instructions are always NUW, because PHIs cannot wrap
the end of the address space.

llvm-svn: 123105
```
  10223a3f
- reduce indentation. Print <nuw> and <nsw> when dumping SCEV AddRec's · a337f5ec
  Chris Lattner authored Jan 09, 2011
```
that have the bit set.

llvm-svn: 123104
```
  a337f5ec
- Add a note about a missed memset optimization from std::fill. · 5d684c17
  Chandler Carruth authored Jan 09, 2011
```
llvm-svn: 123103
```
  5d684c17
- Fix the last virtual register enumerations. · 4a7b48d5
  Jakob Stoklund Olesen authored Jan 08, 2011
```
llvm-svn: 123102
```
  4a7b48d5
- Fix VirtRegMap to use TRI::index2VirtReg and TRI::virtReg2Index instead of · cf4d5ced
  Jakob Stoklund Olesen authored Jan 08, 2011
```
depending on TRI::FirstVirtualRegister.

Also use TRI::printReg instead of printing virtual registers directly.

llvm-svn: 123101
```
  cf4d5ced
- Fix a MachineVerifier loop that probably didn't mean to skip the last two · 6ff70ad3
  Jakob Stoklund Olesen authored Jan 08, 2011
```
virtual registers.

llvm-svn: 123100
```
  6ff70ad3
- Don't document exactly how virtual registers are represented as integers. Code · d3438eb2
  Jakob Stoklund Olesen authored Jan 08, 2011
```
shouldn't depend directly on that.

Give an example of how to iterate over all virtual registers in a function
without depending on the representation.

llvm-svn: 123099
```
  d3438eb2
- Use an IndexedMap for LiveVariables::VirtRegInfo. · 28d76692
  Jakob Stoklund Olesen authored Jan 08, 2011
```
Provide MRI::getNumVirtRegs() and TRI::index2VirtReg() functions to allow
iteration over virtual registers without depending on the representation of
virtual register numbers.

llvm-svn: 123098
```
  28d76692
- Do not talk about TargetRegisterInfo::FirstVirtualRegister. · a1e03cfb
  Jakob Stoklund Olesen authored Jan 08, 2011
```
llvm-svn: 123097
```
  a1e03cfb
- Use an IndexedMap for LiveOutRegInfo to hide its dependence on... · 793d7b76
  Jakob Stoklund Olesen authored Jan 08, 2011
```
Use an IndexedMap for LiveOutRegInfo to hide its dependence on TargetRegisterInfo::FirstVirtualRegister.

llvm-svn: 123096
```
  793d7b76
Jan 08, 2011

Fix coding style. · 0939bc37
Cameron Zwarich authored Jan 08, 2011
```
llvm-svn: 123093
```
0939bc37
fix a latent bug in memcpyoptimizer that my recent patches exposed: it wasn't · 7d6433ae
Chris Lattner authored Jan 08, 2011
```
updating memdep when fusing stores together.  This fixes the crash optimizing
the bullet benchmark.

llvm-svn: 123091
```
7d6433ae
tryMergingIntoMemset can only handle constant length memsets. · ff6ed2ac
Chris Lattner authored Jan 08, 2011
```
llvm-svn: 123090
```
ff6ed2ac

Merge memsets followed by neighboring memsets and other stores into · 9a1d63ba

Chris Lattner authored Jan 08, 2011

larger memsets.  Among other things, this fixes rdar://8760394 and
allows us to handle "Example 2" from http://blog.regehr.org/archives/320,
compiling it into a single 4096-byte memset:

_mad_synth_mute:                        ## @mad_synth_mute
## BB#0:                                ## %entry
	pushq	%rax
	movl	$4096, %esi             ## imm = 0x1000
	callq	___bzero
	popq	%rax
	ret

llvm-svn: 123089

9a1d63ba

fix an issue in IsPointerOffset that prevented us from recognizing that · 5120ebf1
Chris Lattner authored Jan 08, 2011
```
P and P+1 are relative to the same base pointer.

llvm-svn: 123087
```
5120ebf1
enhance memcpyopt to merge a store and a subsequent · 4dc1fd93
Chris Lattner authored Jan 08, 2011
```
memset into a single larger memset.

llvm-svn: 123086
```
4dc1fd93
fit in 80 cols · 2f2c3351
Chris Lattner authored Jan 08, 2011
```
llvm-svn: 123085
```
2f2c3351
merge two tests and filecheckify · 9dbbc49f
Chris Lattner authored Jan 08, 2011
```
llvm-svn: 123082
```
9dbbc49f

constify TargetData references. · c638147e

Chris Lattner authored Jan 08, 2011

Split memset formation logic out into its own
"tryMergingIntoMemset" helper function.

llvm-svn: 123081

c638147e

When loop rotation happens, it is *very* common for the duplicated condbr · 59c82f85

Chris Lattner authored Jan 08, 2011

to be foldable into an uncond branch.  When this happens, we can make a
much simpler CFG for the loop, which is important for nested loop cases
where we want the outer loop to be aggressively optimized.

Handle this case more aggressively.  For example, previously on
phi-duplicate.ll we would get this:


define void @test(i32 %N, double* %G) nounwind ssp {
entry:
  %cmp1 = icmp slt i64 1, 1000
  br i1 %cmp1, label %bb.nph, label %for.end

bb.nph:                                           ; preds = %entry
  br label %for.body

for.body:                                         ; preds = %bb.nph, %for.cond
  %j.02 = phi i64 [ 1, %bb.nph ], [ %inc, %for.cond ]
  %arrayidx = getelementptr inbounds double* %G, i64 %j.02
  %tmp3 = load double* %arrayidx
  %sub = sub i64 %j.02, 1
  %arrayidx6 = getelementptr inbounds double* %G, i64 %sub
  %tmp7 = load double* %arrayidx6
  %add = fadd double %tmp3, %tmp7
  %arrayidx10 = getelementptr inbounds double* %G, i64 %j.02
  store double %add, double* %arrayidx10
  %inc = add nsw i64 %j.02, 1
  br label %for.cond

for.cond:                                         ; preds = %for.body
  %cmp = icmp slt i64 %inc, 1000
  br i1 %cmp, label %for.body, label %for.cond.for.end_crit_edge

for.cond.for.end_crit_edge:                       ; preds = %for.cond
  br label %for.end

for.end:                                          ; preds = %for.cond.for.end_crit_edge, %entry
  ret void
}

Now we get the much nicer:

define void @test(i32 %N, double* %G) nounwind ssp {
entry:
  br label %for.body

for.body:                                         ; preds = %entry, %for.body
  %j.01 = phi i64 [ 1, %entry ], [ %inc, %for.body ]
  %arrayidx = getelementptr inbounds double* %G, i64 %j.01
  %tmp3 = load double* %arrayidx
  %sub = sub i64 %j.01, 1
  %arrayidx6 = getelementptr inbounds double* %G, i64 %sub
  %tmp7 = load double* %arrayidx6
  %add = fadd double %tmp3, %tmp7
  %arrayidx10 = getelementptr inbounds double* %G, i64 %j.01
  store double %add, double* %arrayidx10
  %inc = add nsw i64 %j.01, 1
  %cmp = icmp slt i64 %inc, 1000
  br i1 %cmp, label %for.body, label %for.end

for.end:                                          ; preds = %for.body
  ret void
}

With all of these recent changes, we are now able to compile:

void foo(char *X) {
 for (int i = 0; i != 100; ++i) 
   for (int j = 0; j != 100; ++j)
     X[j+i*100] = 0;
}

into a single memset of 10000 bytes.  This series of changes
should also be helpful for other nested loop scenarios as well.

llvm-svn: 123079

59c82f85

make domtree verification print something useful on failure. · 5f7734c4
Chris Lattner authored Jan 08, 2011
```
llvm-svn: 123078
```
5f7734c4
split ssa updating code out to its own helper function. Don't bother · 30f318e5
Chris Lattner authored Jan 08, 2011
```
moving the OrigHeader block anymore: we just merge it away anyway so
its code layout doesn't matter.

llvm-svn: 123077
```
30f318e5

Implement a TODO: Enhance loopinfo to merge away the unconditional branch · 2615130e

Chris Lattner authored Jan 08, 2011

that it was leaving in loops after rotation (between the original latch
block and the original header.

With this change, it is possible for rotated loops to have just a single
basic block, which is useful.

llvm-svn: 123075

2615130e

various code cleanups, enhance MergeBlockIntoPredecessor to preserve · 930b716e
Chris Lattner authored Jan 08, 2011
```
loop info.

llvm-svn: 123074
```
930b716e