Commits · 654098f4116f6f6018c1e18a3462c2bb20012559 · Roger Ferrer / llvm-epi-0.8

Jan 12, 2011
- revert r123146 which disabled code that wasn't the root cause · 654098f4
  Chris Lattner authored Jan 12, 2011
```
of the bootstrap miscompare issue.

llvm-svn: 123299
```
  654098f4
- 1. Support ELF pcrel relocations for movw/movt: · 9c5b65d2
  Jason W Kim authored Jan 12, 2011
```
  R_ARM_MOVT_PREL and R_ARM_MOVW_PREL_NC.
2. Fix minor bug in ARMAsmPrinter - treat bitfield flag as a bitfield, not an enum.
3. Add support for 3 new elf section types (no-ops)

llvm-svn: 123294
```
  9c5b65d2
- Workaround for bug 8721. · 1f7bc070
  Jason W Kim authored Jan 11, 2011
```
.s Test added.

llvm-svn: 123292
```
  1f7bc070
- The world is not ready for LiveDebugVariables yet. · 43812bfa
  Jakob Stoklund Olesen authored Jan 11, 2011
```
llvm-svn: 123290
```
  43812bfa
Jan 11, 2011
- Enable LiveDebugVariables by default. · 8c98495f
  Jakob Stoklund Olesen authored Jan 11, 2011
```
llvm-svn: 123282
```
  8c98495f
- SPARC backend: correct ICC/FCC uses for ADDX and SELECT_CC · 4d6ade0e
  Venkatraman Govindaraju authored Jan 11, 2011
```
llvm-svn: 123281
```
  4d6ade0e
- Fix PR8946, a missing reg/reg form of movdqu. · abd2dfd3
  Chris Lattner authored Jan 11, 2011
```
llvm-svn: 123242
```
  abd2dfd3
- McARM: Add more hard coded logic to SplitMnemonicAndCC to also split out the · 9d944b3f
  Daniel Dunbar authored Jan 11, 2011
```
carry setting flag from the mnemonic.

Note that this currently involves me disabling a number of working cases in
arm_instructions.s, this is a hopefully short term evil which will be rapidly
fixed (and greatly surpassed), assuming my current approach flies.

llvm-svn: 123238
```
  9d944b3f
- Revert the testcase from the previous reverted commit. · 31bb4c58
  Eric Christopher authored Jan 11, 2011
```
llvm-svn: 123227
```
  31bb4c58
- merge tests into one crash.ll test. · 054d2a85
  Chris Lattner authored Jan 11, 2011
```
llvm-svn: 123220
```
  054d2a85
- remove a bogus assertion: the latch block of a loop is not · 63fe78de
  Chris Lattner authored Jan 11, 2011
```
neccesarily an uncond branch to the header.  This fixes 
PR8955 (the assertion tripping).

llvm-svn: 123219
```
  63fe78de
- Teach constant folding to perform conversions from constant floating · b1e7f557
  Chandler Carruth authored Jan 11, 2011
```
point values to their integer representation through the SSE intrinsic
calls. This is the last part of a README.txt entry for which I have real
world examples.

llvm-svn: 123206
```
  b1e7f557
- FileCheck-ize a test, and move a no-longer calling test case to another · fdf49691
  Chandler Carruth authored Jan 11, 2011
```
file and make it actually test something...

llvm-svn: 123205
```
  fdf49691
- Fix a random missed optimization by making InstCombine more aggressive when... · d490c2d2
  Owen Anderson authored Jan 11, 2011
```
Fix a random missed optimization by making InstCombine more aggressive when determining which bits are demanded by
a comparison against a constant.

llvm-svn: 123203
```
  d490c2d2
- Even if we don't have 7 bytes of stack space we may need to save and · 3904343c
  Eric Christopher authored Jan 11, 2011
```
restore the stack pointer from the frame pointer on thumbv6.

Fixes rdar://8819685

llvm-svn: 123196
```
  3904343c
Jan 10, 2011
- Fix PR 8916 (qv for analysis), at least the immediate problem. · d2b48119
  Dale Johannesen authored Jan 10, 2011
```
There's an inherent tension in DAGCombine between assuming
that things will be put in canonical form, and the Depth
mechanism that disables transformations when recursion gets
too deep.  It would not surprise me if there's a lot of little
bugs like this one waiting to be discovered.  The mechanism
seems fragile and I'd suggest looking at it from a design viewpoint.

llvm-svn: 123191
```
  d2b48119
- McARM: Flush out hard coded known non-predicated mnemonic list. · c0e8756b
  Daniel Dunbar authored Jan 10, 2011
```
llvm-svn: 123189
```
  c0e8756b
- Teach instcombine about the rest of the SSE and SSE2 conversion · cf414cf0
  Chandler Carruth authored Jan 10, 2011
```
intrinsics element dependencies. Reviewed by Nick.

llvm-svn: 123161
```
  cf414cf0
- Fold two related tests into the newly FileCheck-ized test, migrating · 7bb282eb
  Chandler Carruth authored Jan 10, 2011
```
them to FileCheck as well.

llvm-svn: 123154
```
  7bb282eb
- Clean up and FileCheck-ize a test. · ef7aac59
  Chandler Carruth authored Jan 10, 2011
```
llvm-svn: 123153
```
  ef7aac59
- fix typo · ec1387cf
  Chris Lattner authored Jan 10, 2011
```
llvm-svn: 123148
```
  ec1387cf
- another (more) aggressive attempt to bring llvm-gcc-i386-linux-selfhost · 4662bd4b
  Chris Lattner authored Jan 10, 2011
```
back to life.

llvm-svn: 123146
```
  4662bd4b
- temporarily disable memset formation from memsets in an effort to restore buildbot stability. · 1017fa67
  Chris Lattner authored Jan 09, 2011
```
llvm-svn: 123144
```
  1017fa67
- add a testcase I missed in previous commit. · 1032965c
  Chris Lattner authored Jan 09, 2011
```
llvm-svn: 123143
```
  1032965c
Jan 09, 2011
- Instcombine: Fix pattern where the sext did not dominate the icmp using it · cc21c4aa
  Tobias Grosser authored Jan 09, 2011
```
llvm-svn: 123121
```
  cc21c4aa
- teach SCEV analysis of PHI nodes that PHI recurences formed · 10223a3f
  Chris Lattner authored Jan 09, 2011
```
with GEP instructions are always NUW, because PHIs cannot wrap
the end of the address space.

llvm-svn: 123105
```
  10223a3f
- reduce indentation. Print <nuw> and <nsw> when dumping SCEV AddRec's · a337f5ec
  Chris Lattner authored Jan 09, 2011
```
that have the bit set.

llvm-svn: 123104
```
  a337f5ec
Jan 08, 2011

Merge memsets followed by neighboring memsets and other stores into · 9a1d63ba

Chris Lattner authored Jan 08, 2011

larger memsets.  Among other things, this fixes rdar://8760394 and
allows us to handle "Example 2" from http://blog.regehr.org/archives/320,
compiling it into a single 4096-byte memset:

_mad_synth_mute:                        ## @mad_synth_mute
## BB#0:                                ## %entry
	pushq	%rax
	movl	$4096, %esi             ## imm = 0x1000
	callq	___bzero
	popq	%rax
	ret

llvm-svn: 123089

9a1d63ba

fix an issue in IsPointerOffset that prevented us from recognizing that · 5120ebf1
Chris Lattner authored Jan 08, 2011
```
P and P+1 are relative to the same base pointer.

llvm-svn: 123087
```
5120ebf1
enhance memcpyopt to merge a store and a subsequent · 4dc1fd93
Chris Lattner authored Jan 08, 2011
```
memset into a single larger memset.

llvm-svn: 123086
```
4dc1fd93
merge two tests and filecheckify · 9dbbc49f
Chris Lattner authored Jan 08, 2011
```
llvm-svn: 123082
```
9dbbc49f

When loop rotation happens, it is *very* common for the duplicated condbr · 59c82f85

Chris Lattner authored Jan 08, 2011

to be foldable into an uncond branch.  When this happens, we can make a
much simpler CFG for the loop, which is important for nested loop cases
where we want the outer loop to be aggressively optimized.

Handle this case more aggressively.  For example, previously on
phi-duplicate.ll we would get this:


define void @test(i32 %N, double* %G) nounwind ssp {
entry:
  %cmp1 = icmp slt i64 1, 1000
  br i1 %cmp1, label %bb.nph, label %for.end

bb.nph:                                           ; preds = %entry
  br label %for.body

for.body:                                         ; preds = %bb.nph, %for.cond
  %j.02 = phi i64 [ 1, %bb.nph ], [ %inc, %for.cond ]
  %arrayidx = getelementptr inbounds double* %G, i64 %j.02
  %tmp3 = load double* %arrayidx
  %sub = sub i64 %j.02, 1
  %arrayidx6 = getelementptr inbounds double* %G, i64 %sub
  %tmp7 = load double* %arrayidx6
  %add = fadd double %tmp3, %tmp7
  %arrayidx10 = getelementptr inbounds double* %G, i64 %j.02
  store double %add, double* %arrayidx10
  %inc = add nsw i64 %j.02, 1
  br label %for.cond

for.cond:                                         ; preds = %for.body
  %cmp = icmp slt i64 %inc, 1000
  br i1 %cmp, label %for.body, label %for.cond.for.end_crit_edge

for.cond.for.end_crit_edge:                       ; preds = %for.cond
  br label %for.end

for.end:                                          ; preds = %for.cond.for.end_crit_edge, %entry
  ret void
}

Now we get the much nicer:

define void @test(i32 %N, double* %G) nounwind ssp {
entry:
  br label %for.body

for.body:                                         ; preds = %entry, %for.body
  %j.01 = phi i64 [ 1, %entry ], [ %inc, %for.body ]
  %arrayidx = getelementptr inbounds double* %G, i64 %j.01
  %tmp3 = load double* %arrayidx
  %sub = sub i64 %j.01, 1
  %arrayidx6 = getelementptr inbounds double* %G, i64 %sub
  %tmp7 = load double* %arrayidx6
  %add = fadd double %tmp3, %tmp7
  %arrayidx10 = getelementptr inbounds double* %G, i64 %j.01
  store double %add, double* %arrayidx10
  %inc = add nsw i64 %j.01, 1
  %cmp = icmp slt i64 %inc, 1000
  br i1 %cmp, label %for.body, label %for.end

for.end:                                          ; preds = %for.body
  ret void
}

With all of these recent changes, we are now able to compile:

void foo(char *X) {
 for (int i = 0; i != 100; ++i) 
   for (int j = 0; j != 100; ++j)
     X[j+i*100] = 0;
}

into a single memset of 10000 bytes.  This series of changes
should also be helpful for other nested loop scenarios as well.

llvm-svn: 123079

59c82f85

Three major changes: · 063dca0f

Chris Lattner authored Jan 08, 2011

1. Rip out LoopRotate's domfrontier updating code.  It isn't
   needed now that LICM doesn't use DF and it is super complex
   and gross.
2. Make DomTree updating code a lot simpler and faster.  The 
   old loop over all the blocks was just to find a block??
3. Change the code that inserts the new preheader to just use
   SplitCriticalEdge instead of doing an overcomplex 
   reimplementation of it.

No behavior change, except for the name of the inserted preheader.

llvm-svn: 123072

063dca0f

First step in fixing PR8927: · 45e6c195

Rafael Espindola authored Jan 08, 2011

Add a unnamed_addr bit to global variables and functions. This will be used
to indicate that the address is not significant and therefore the constant
or function can be merged with others.

If an optimization pass can show that an address is not used, it can set this.

Examples of things that can have this set by the FE are globals created to
hold string literals and C++ constructors.

Adding unnamed_addr to a non-const global should have no effect unless
an optimization can transform that global into a constant.

Aliases are not allowed to have unnamed_addr since I couldn't figure
out any use for it.

llvm-svn: 123063

45e6c195

Fix a bug in r123034 (trying to sext/zext non-integers) and clean up a little. · 6a1fb8f2
Frits van Bommel authored Jan 08, 2011
```
llvm-svn: 123061
```
6a1fb8f2

Have loop-rotate simplify instructions (yay instsimplify!) as it clones · 8c5defd0

Chris Lattner authored Jan 08, 2011

them into the loop preheader, eliminating silly instructions like
"icmp i32 0, 100" in fixed tripcount loops. This also better exposes the
bigger problem with loop rotate that I'd like to fix: once this has been
folded, the duplicated conditional branch *often* turns into an uncond branch.

Not aggressively handling this is pessimizing later loop optimizations
somethin' fierce by making "dominates all exit blocks" checks fail.

llvm-svn: 123060

8c5defd0

Recognize inline asm 'rev /bin/bash, ' as a bswap intrinsic call. · 078b0b09
Evan Cheng authored Jan 08, 2011
```
llvm-svn: 123048
```
078b0b09

Do not model all INLINEASM instructions as having unmodelled side effects. · 6eb516db

Evan Cheng authored Jan 07, 2011

Instead encode llvm IR level property "HasSideEffects" in an operand (shared
with IsAlignStack). Added MachineInstrs::hasUnmodeledSideEffects() to check
the operand when the instruction is an INLINEASM.

This allows memory instructions to be moved around INLINEASM instructions.

llvm-svn: 123044

6eb516db

Jan 07, 2011
- Speculatively revert r123032. · acbee0b0
  Devang Patel authored Jan 07, 2011
```
llvm-svn: 123039
```
  acbee0b0
- Lower some BUILD_VECTORS using VEXT+shuffle. · 6f2b8966
  Bob Wilson authored Jan 07, 2011
```
Patch by Tim Northover.

llvm-svn: 123035
```
  6f2b8966