Commits · 7f93d8d62c9e1cb7b797fa42c8147cf67158760c · Roger Ferrer / llvm-epi-0.8

Jan 09, 2011
- Use IndexedMap for MachineRegisterInfo as well. No functional change. · 7f93d8d6
  Jakob Stoklund Olesen authored Jan 09, 2011
```
llvm-svn: 123106
```
  7f93d8d6
- teach SCEV analysis of PHI nodes that PHI recurences formed · 10223a3f
  Chris Lattner authored Jan 09, 2011
```
with GEP instructions are always NUW, because PHIs cannot wrap
the end of the address space.

llvm-svn: 123105
```
  10223a3f
- reduce indentation. Print <nuw> and <nsw> when dumping SCEV AddRec's · a337f5ec
  Chris Lattner authored Jan 09, 2011
```
that have the bit set.

llvm-svn: 123104
```
  a337f5ec
- Add a note about a missed memset optimization from std::fill. · 5d684c17
  Chandler Carruth authored Jan 09, 2011
```
llvm-svn: 123103
```
  5d684c17
- Fix the last virtual register enumerations. · 4a7b48d5
  Jakob Stoklund Olesen authored Jan 08, 2011
```
llvm-svn: 123102
```
  4a7b48d5
- Fix VirtRegMap to use TRI::index2VirtReg and TRI::virtReg2Index instead of · cf4d5ced
  Jakob Stoklund Olesen authored Jan 08, 2011
```
depending on TRI::FirstVirtualRegister.

Also use TRI::printReg instead of printing virtual registers directly.

llvm-svn: 123101
```
  cf4d5ced
- Fix a MachineVerifier loop that probably didn't mean to skip the last two · 6ff70ad3
  Jakob Stoklund Olesen authored Jan 08, 2011
```
virtual registers.

llvm-svn: 123100
```
  6ff70ad3
- Don't document exactly how virtual registers are represented as integers. Code · d3438eb2
  Jakob Stoklund Olesen authored Jan 08, 2011
```
shouldn't depend directly on that.

Give an example of how to iterate over all virtual registers in a function
without depending on the representation.

llvm-svn: 123099
```
  d3438eb2
- Use an IndexedMap for LiveVariables::VirtRegInfo. · 28d76692
  Jakob Stoklund Olesen authored Jan 08, 2011
```
Provide MRI::getNumVirtRegs() and TRI::index2VirtReg() functions to allow
iteration over virtual registers without depending on the representation of
virtual register numbers.

llvm-svn: 123098
```
  28d76692
- Do not talk about TargetRegisterInfo::FirstVirtualRegister. · a1e03cfb
  Jakob Stoklund Olesen authored Jan 08, 2011
```
llvm-svn: 123097
```
  a1e03cfb
- Use an IndexedMap for LiveOutRegInfo to hide its dependence on... · 793d7b76
  Jakob Stoklund Olesen authored Jan 08, 2011
```
Use an IndexedMap for LiveOutRegInfo to hide its dependence on TargetRegisterInfo::FirstVirtualRegister.

llvm-svn: 123096
```
  793d7b76
- Rename CXXCtorInitializer::BaseOrMember to Initializee, since it will also be · a50dd46e
  Alexis Hunt authored Jan 08, 2011
```
used to store the CXXConstructorDecl in a delegating constructor.

llvm-svn: 123095
```
  a50dd46e
Jan 08, 2011

Fixed the "-b" option on disassembly to always pad out the bytes with for · c925f028

Greg Clayton authored Jan 08, 2011

i386 and for x86_64 to allow 15 byte opcodes to be displayed. This outputs
clean looking disassembly when the bytes are shown.

llvm-svn: 123094

c925f028

Fix coding style. · 0939bc37
Cameron Zwarich authored Jan 08, 2011
```
llvm-svn: 123093
```
0939bc37
Make sure we don't assert if we have a child with zero byte size. Also · 97a4371c
Greg Clayton authored Jan 08, 2011
```
we now say that "void *" value objects don't have children. 

llvm-svn: 123092
```
97a4371c
fix a latent bug in memcpyoptimizer that my recent patches exposed: it wasn't · 7d6433ae
Chris Lattner authored Jan 08, 2011
```
updating memdep when fusing stores together.  This fixes the crash optimizing
the bullet benchmark.

llvm-svn: 123091
```
7d6433ae
tryMergingIntoMemset can only handle constant length memsets. · ff6ed2ac
Chris Lattner authored Jan 08, 2011
```
llvm-svn: 123090
```
ff6ed2ac

Merge memsets followed by neighboring memsets and other stores into · 9a1d63ba

Chris Lattner authored Jan 08, 2011

larger memsets.  Among other things, this fixes rdar://8760394 and
allows us to handle "Example 2" from http://blog.regehr.org/archives/320,
compiling it into a single 4096-byte memset:

_mad_synth_mute:                        ## @mad_synth_mute
## BB#0:                                ## %entry
	pushq	%rax
	movl	$4096, %esi             ## imm = 0x1000
	callq	___bzero
	popq	%rax
	ret

llvm-svn: 123089

9a1d63ba

Made FuncUnwinders threadsafe. · 877aaa58
Greg Clayton authored Jan 08, 2011
```
Other small cleanups as well.

llvm-svn: 123088
```
877aaa58
fix an issue in IsPointerOffset that prevented us from recognizing that · 5120ebf1
Chris Lattner authored Jan 08, 2011
```
P and P+1 are relative to the same base pointer.

llvm-svn: 123087
```
5120ebf1
enhance memcpyopt to merge a store and a subsequent · 4dc1fd93
Chris Lattner authored Jan 08, 2011
```
memset into a single larger memset.

llvm-svn: 123086
```
4dc1fd93
fit in 80 cols · 2f2c3351
Chris Lattner authored Jan 08, 2011
```
llvm-svn: 123085
```
2f2c3351
Renamed CXXBaseOrMemberInitializer to CXXCtorInitializer. This is both shorter, · 1d792650
Alexis Hunt authored Jan 08, 2011
```
more accurate, and makes it make sense for it to hold a delegating constructor
call.

llvm-svn: 123084
```
1d792650
Spelling changes applied from lldb_spelling.diffs from Bruce Mitchener. · 710dd5ae
Greg Clayton authored Jan 08, 2011
```
Thanks Bruce!

llvm-svn: 123083
```
710dd5ae
merge two tests and filecheckify · 9dbbc49f
Chris Lattner authored Jan 08, 2011
```
llvm-svn: 123082
```
9dbbc49f

constify TargetData references. · c638147e

Chris Lattner authored Jan 08, 2011

Split memset formation logic out into its own
"tryMergingIntoMemset" helper function.

llvm-svn: 123081

c638147e

Two minor fixes: 1. Put integral_constant conversion to integral in even... · b5b2a1e1

Howard Hinnant authored Jan 08, 2011

Two minor fixes:  1.  Put integral_constant conversion to integral in even without constexpr support.  2.  Add ios_base to <iosfwd>.  The latter is being tracked by LWG 2026.

llvm-svn: 123080

b5b2a1e1

When loop rotation happens, it is *very* common for the duplicated condbr · 59c82f85

Chris Lattner authored Jan 08, 2011

to be foldable into an uncond branch.  When this happens, we can make a
much simpler CFG for the loop, which is important for nested loop cases
where we want the outer loop to be aggressively optimized.

Handle this case more aggressively.  For example, previously on
phi-duplicate.ll we would get this:


define void @test(i32 %N, double* %G) nounwind ssp {
entry:
  %cmp1 = icmp slt i64 1, 1000
  br i1 %cmp1, label %bb.nph, label %for.end

bb.nph:                                           ; preds = %entry
  br label %for.body

for.body:                                         ; preds = %bb.nph, %for.cond
  %j.02 = phi i64 [ 1, %bb.nph ], [ %inc, %for.cond ]
  %arrayidx = getelementptr inbounds double* %G, i64 %j.02
  %tmp3 = load double* %arrayidx
  %sub = sub i64 %j.02, 1
  %arrayidx6 = getelementptr inbounds double* %G, i64 %sub
  %tmp7 = load double* %arrayidx6
  %add = fadd double %tmp3, %tmp7
  %arrayidx10 = getelementptr inbounds double* %G, i64 %j.02
  store double %add, double* %arrayidx10
  %inc = add nsw i64 %j.02, 1
  br label %for.cond

for.cond:                                         ; preds = %for.body
  %cmp = icmp slt i64 %inc, 1000
  br i1 %cmp, label %for.body, label %for.cond.for.end_crit_edge

for.cond.for.end_crit_edge:                       ; preds = %for.cond
  br label %for.end

for.end:                                          ; preds = %for.cond.for.end_crit_edge, %entry
  ret void
}

Now we get the much nicer:

define void @test(i32 %N, double* %G) nounwind ssp {
entry:
  br label %for.body

for.body:                                         ; preds = %entry, %for.body
  %j.01 = phi i64 [ 1, %entry ], [ %inc, %for.body ]
  %arrayidx = getelementptr inbounds double* %G, i64 %j.01
  %tmp3 = load double* %arrayidx
  %sub = sub i64 %j.01, 1
  %arrayidx6 = getelementptr inbounds double* %G, i64 %sub
  %tmp7 = load double* %arrayidx6
  %add = fadd double %tmp3, %tmp7
  %arrayidx10 = getelementptr inbounds double* %G, i64 %j.01
  store double %add, double* %arrayidx10
  %inc = add nsw i64 %j.01, 1
  %cmp = icmp slt i64 %inc, 1000
  br i1 %cmp, label %for.body, label %for.end

for.end:                                          ; preds = %for.body
  ret void
}

With all of these recent changes, we are now able to compile:

void foo(char *X) {
 for (int i = 0; i != 100; ++i) 
   for (int j = 0; j != 100; ++j)
     X[j+i*100] = 0;
}

into a single memset of 10000 bytes.  This series of changes
should also be helpful for other nested loop scenarios as well.

llvm-svn: 123079

59c82f85

make domtree verification print something useful on failure. · 5f7734c4
Chris Lattner authored Jan 08, 2011
```
llvm-svn: 123078
```
5f7734c4
split ssa updating code out to its own helper function. Don't bother · 30f318e5
Chris Lattner authored Jan 08, 2011
```
moving the OrigHeader block anymore: we just merge it away anyway so
its code layout doesn't matter.

llvm-svn: 123077
```
30f318e5
Check for delegating constructors and (currently) return an error about them. · 4049b8d4
Alexis Hunt authored Jan 08, 2011
```
llvm-svn: 123076
```
4049b8d4

Implement a TODO: Enhance loopinfo to merge away the unconditional branch · 2615130e

Chris Lattner authored Jan 08, 2011

that it was leaving in loops after rotation (between the original latch
block and the original header.

With this change, it is possible for rotated loops to have just a single
basic block, which is useful.

llvm-svn: 123075

2615130e

various code cleanups, enhance MergeBlockIntoPredecessor to preserve · 930b716e
Chris Lattner authored Jan 08, 2011
```
loop info.

llvm-svn: 123074
```
930b716e
inline preserveCanonicalLoopForm now that it is simple. · fee37c5f
Chris Lattner authored Jan 08, 2011
```
llvm-svn: 123073
```
fee37c5f

Three major changes: · 063dca0f

Chris Lattner authored Jan 08, 2011

1. Rip out LoopRotate's domfrontier updating code.  It isn't
   needed now that LICM doesn't use DF and it is super complex
   and gross.
2. Make DomTree updating code a lot simpler and faster.  The 
   old loop over all the blocks was just to find a block??
3. Change the code that inserts the new preheader to just use
   SplitCriticalEdge instead of doing an overcomplex 
   reimplementation of it.

No behavior change, except for the name of the inserted preheader.

llvm-svn: 123072

063dca0f

reduce nesting. · 30d95f9f
Chris Lattner authored Jan 08, 2011
```
llvm-svn: 123071
```
30d95f9f

On Windows, replace each occurrence of '\' by '\\' on the replacement string.... · 7c9eab8f

Francois Pichet authored Jan 08, 2011

On Windows, replace each occurrence of '\' by '\\' on the replacement string. This is necessary to prevent re.sub from replacing escape sequences occurring in path.

For example:

llvm\tools\clang\test
was replaced by
llvm <tab> ools\clang <tab> est

llvm-svn: 123070

7c9eab8f

LoopRotate requires canonical loop form, so it always has preheaders · 7fab23bc
Chris Lattner authored Jan 08, 2011
```
and latch blocks.  Reorder entry conditions to make hte pass faster
and more logical.

llvm-svn: 123069
```
7fab23bc
use the LI ivar. · d62691f4
Chris Lattner authored Jan 08, 2011
```
llvm-svn: 123068
```
d62691f4
some cleanups: remove dead arguments and eliminate ivars · 385f2ec6
Chris Lattner authored Jan 08, 2011
```
that are just passed to one function.

llvm-svn: 123067
```
385f2ec6