Commits · 1b93be501d785c822083f2fe651248c15959d7dc · Roger Ferrer / llvm-epi-0.8

Jan 15, 2011

Now that instruction optzns can update the iterator as they go, we can · 1b93be50

Chris Lattner authored Jan 15, 2011

have objectsize folding recursively simplify away their result when it
folds.  It is important to catch this here, because otherwise we won't
eliminate the cross-block values at isel and other times.

llvm-svn: 123524

1b93be50

make the current instruction iterator an ivar, allowing xforms that · 7a277144

Chris Lattner authored Jan 15, 2011

potentially invalidate it (like inline asm lowering) to be sunk into
their proper place, cleaning up a ton of code.

llvm-svn: 123523

7a277144

implement an instcombine xform that canonicalizes casts outside of and-with-constant operations. · 9c10d587

Chris Lattner authored Jan 15, 2011

This fixes rdar://8808586 which observed that we used to compile:


union xy {
        struct x { _Bool b[15]; } x;
        __attribute__((packed))
        struct y {
                __attribute__((packed)) unsigned long b0to7;
                __attribute__((packed)) unsigned int b8to11;
                __attribute__((packed)) unsigned short b12to13;
                __attribute__((packed)) unsigned char b14;
        } y;
};

struct x
foo(union xy *xy)
{
        return xy->x;
}

into:

_foo:                                   ## @foo
	movq	(%rdi), %rax
	movabsq	$1095216660480, %rcx    ## imm = 0xFF00000000
	andq	%rax, %rcx
	movabsq	$-72057594037927936, %rdx ## imm = 0xFF00000000000000
	andq	%rax, %rdx
	movzbl	%al, %esi
	orq	%rdx, %rsi
	movq	%rax, %rdx
	andq	$65280, %rdx            ## imm = 0xFF00
	orq	%rsi, %rdx
	movq	%rax, %rsi
	andq	$16711680, %rsi         ## imm = 0xFF0000
	orq	%rdx, %rsi
	movl	%eax, %edx
	andl	$-16777216, %edx        ## imm = 0xFFFFFFFFFF000000
	orq	%rsi, %rdx
	orq	%rcx, %rdx
	movabsq	$280375465082880, %rcx  ## imm = 0xFF0000000000
	movq	%rax, %rsi
	andq	%rcx, %rsi
	orq	%rdx, %rsi
	movabsq	$71776119061217280, %r8 ## imm = 0xFF000000000000
	andq	%r8, %rax
	orq	%rsi, %rax
	movzwl	12(%rdi), %edx
	movzbl	14(%rdi), %esi
	shlq	$16, %rsi
	orl	%edx, %esi
	movq	%rsi, %r9
	shlq	$32, %r9
	movl	8(%rdi), %edx
	orq	%r9, %rdx
	andq	%rdx, %rcx
	movzbl	%sil, %esi
	shlq	$32, %rsi
	orq	%rcx, %rsi
	movl	%edx, %ecx
	andl	$-16777216, %ecx        ## imm = 0xFFFFFFFFFF000000
	orq	%rsi, %rcx
	movq	%rdx, %rsi
	andq	$16711680, %rsi         ## imm = 0xFF0000
	orq	%rcx, %rsi
	movq	%rdx, %rcx
	andq	$65280, %rcx            ## imm = 0xFF00
	orq	%rsi, %rcx
	movzbl	%dl, %esi
	orq	%rcx, %rsi
	andq	%r8, %rdx
	orq	%rsi, %rdx
	ret

We now compile this into:

_foo:                                   ## @foo
## BB#0:                                ## %entry
	movzwl	12(%rdi), %eax
	movzbl	14(%rdi), %ecx
	shlq	$16, %rcx
	orl	%eax, %ecx
	shlq	$32, %rcx
	movl	8(%rdi), %edx
	orq	%rcx, %rdx
	movq	(%rdi), %rax
	ret

A small improvement :-)

llvm-svn: 123520

9c10d587

one more instcombine variant that is needed to work with future changes, · e20dd530
Chris Lattner authored Jan 15, 2011
```
no functionality change currently.

llvm-svn: 123517
```
e20dd530
fix typo · 497459d5
Chris Lattner authored Jan 15, 2011
```
llvm-svn: 123516
```
497459d5
Catch ~x < cst just like ~x < ~y, we currently handle this through · f3c4eeff
Chris Lattner authored Jan 15, 2011
```
means that are about to disappear.

llvm-svn: 123515
```
f3c4eeff
reduce indentation · 311aa63c
Chris Lattner authored Jan 15, 2011
```
llvm-svn: 123514
```
311aa63c
Generalize LoadAndStorePromoter a bit and switch LICM · b68ec5c3
Chris Lattner authored Jan 15, 2011
```
to use it.

llvm-svn: 123501
```
b68ec5c3

Jan 14, 2011

Fix a false-positive warning. · 3e2f6cf7
Owen Anderson authored Jan 14, 2011
```
llvm-svn: 123480
```
3e2f6cf7
Enhance GlobalOpt to be able evaluate initializers that involve stores through · 9eb7cb48
Owen Anderson authored Jan 14, 2011
```
bitcasts, at least in simple cases.  This fixes clang's CodeGenCXX/virtual-base-dtor.cpp

llvm-svn: 123477
```
9eb7cb48
switch SRoA to use LoadAndStorePromoter instead of its own copy of the code. · b498f9af
Chris Lattner authored Jan 14, 2011
```
llvm-svn: 123457
```
b498f9af
Add a new LoadAndStorePromoter class, which implements the general · 95294b87
Chris Lattner authored Jan 14, 2011
```
"promote a bunch of load and stores" logic, allowing the code to
be shared and reused.

llvm-svn: 123456
```
95294b87
split SROA into two passes: one that uses DomFrontiers (-scalarrepl) · 9987a6f4
Chris Lattner authored Jan 14, 2011
```
and one that uses SSAUpdater (-scalarrepl-ssa)

llvm-svn: 123436
```
9987a6f4

Implement full support for promoting allocas to registers using SSAUpdater · 543384ef

Chris Lattner authored Jan 14, 2011

instead of DomTree/DomFrontier.  This may be interesting for reducing compile 
time.  This is currently disabled, but seems to work just fine.

When this is enabled, we eliminate two runs of dominator frontier, one in the
"early per-function" optimizations and one in the "interlaced with inliner"
function passes.

llvm-svn: 123434

543384ef

indentation · 90f3a9a1
Chris Lattner authored Jan 14, 2011
```
llvm-svn: 123426
```
90f3a9a1

Move some shift transforms out of instcombine and into InstructionSimplify. · 7f60dc1e

Duncan Sands authored Jan 14, 2011

While there, I noticed that the transform "undef >>a X -> undef" was wrong.
For example if X is 2 then the top two bits must be equal, so the result can
not be anything. I fixed this in the constant folder as well. Also, I made
the transform for "X << undef" stronger: it now folds to undef always, even
though X might be zero. This is in accordance with the LangRef, but I must
admit that it is fairly aggressive. Also, I added "i32 X << 32 -> undef"
following the LangRef and the constant folder, likewise fairly aggressive.

llvm-svn: 123417

7f60dc1e

Jan 13, 2011

Fix whitespace. · 328e91bb
Bob Wilson authored Jan 13, 2011
```
llvm-svn: 123396
```
328e91bb
Check for empty structs, and for consistency, zero-element arrays. · c8056a95
Bob Wilson authored Jan 13, 2011
```
llvm-svn: 123383
```
c8056a95

Extend SROA to handle arrays accessed as homogeneous structs and vice versa. · 08713d3c

Bob Wilson authored Jan 13, 2011

This is a minor extension of SROA to handle a special case that is
important for some ARM NEON operations. Some of the NEON intrinsics
return multiple values, which are handled as struct types containing
multiple elements of the same vector type. The corresponding return
types declared in the arm_neon.h header have equivalent arrays. We
need SROA to recognize that it can split up those arrays and structs
into separate vectors, even though they are not always accessed with
the same type. SROA already handles loads and stores of an entire
alloca by using insertvalue/extractvalue to access the individual
pieces, and that code works the same regardless of whether the type
is a struct or an array. So, all that needs to be done is to check
for compatible arrays and homogeneous structs.

llvm-svn: 123381

08713d3c

Make SROA more aggressive with allocas containing padding. · 12eec40c

Bob Wilson authored Jan 13, 2011

SROA only split up structs and arrays one level at a time, so padding can
only cause trouble if it is located in between the struct or array elements.

llvm-svn: 123380

12eec40c

Jan 12, 2011
- Use SmallVector instead of SmallPtrSet and avoid non-deterministic behavior. · 30f3ebbc
  Devang Patel authored Jan 12, 2011
```
llvm-svn: 123318
```
  30f3ebbc
- revert 123144, reenabling the rest of memset formation. · dd5f60b7
  Chris Lattner authored Jan 12, 2011
```
llvm-svn: 123302
```
  dd5f60b7
- revert r123146 which disabled code that wasn't the root cause · 654098f4
  Chris Lattner authored Jan 12, 2011
```
of the bootstrap miscompare issue.

llvm-svn: 123299
```
  654098f4
- revert r123149, reenabling an improvement to memcpyopt that wasn't · fa7c29d2
  Chris Lattner authored Jan 12, 2011
```
the source of the bootstrap problem.

llvm-svn: 123298
```
  fa7c29d2
Jan 11, 2011
- Remove the PR8954 workaround. · 12cc296b
  Jakob Stoklund Olesen authored Jan 11, 2011
```
llvm-svn: 123288
```
  12cc296b
- Fix a non-deterministic loop in llvm::MergeBlockIntoPredecessor. · f2407aa9
  Jakob Stoklund Olesen authored Jan 11, 2011
```
DT->changeImmediateDominator() trivially ignores identity updates, so there is
really no need for the uniqueing provided by SmallPtrSet.

I expect this to fix PR8954.

llvm-svn: 123286
```
  f2407aa9
- Dial back the speculative fix for PR8954 a bit, so that we only recompute dominators · cb9c4f85
  Cameron Zwarich authored Jan 11, 2011
```
once at the beginning of GVN instead of once per iteration.

llvm-svn: 123278
```
  cb9c4f85
- Attempt to fix the bootstrap buildbot. Rafael says this works for him on x86-64 Linux. · 51eb4039
  Cameron Zwarich authored Jan 11, 2011
```
llvm-svn: 123270
```
  51eb4039
- Remove dead variable, const-ref-ize an APInt. · 0022a4b4
  Owen Anderson authored Jan 11, 2011
```
llvm-svn: 123248
```
  0022a4b4
- this pass claims to preserve scev, make sure to tell it about deletions. · d41db8f9
  Chris Lattner authored Jan 11, 2011
```
llvm-svn: 123247
```
  d41db8f9
- Factor the actual simplification out of SimplifyIndirectBrOnSelect and into a... · 8e158495
  Frits van Bommel authored Jan 11, 2011
```
Factor the actual simplification out of SimplifyIndirectBrOnSelect and into a new helper function so it can be reused in e.g. an upcoming SimplifySwitchOnSelect.
No functional change.

llvm-svn: 123234
```
  8e158495
- update memdep when an instruction is deleted. This code isn't · 193ce7c4
  Chris Lattner authored Jan 11, 2011
```
actually reached in the testcase in PR8954, but it's safe and good
practice.

llvm-svn: 123224
```
  193ce7c4
- when MergeBlockIntoPredecessor merges two blocks, update MemDep if it · e2523b28
  Chris Lattner authored Jan 11, 2011
```
is floating around in the ether.

llvm-svn: 123223
```
  e2523b28
- Fix FoldSingleEntryPHINodes to update memdep and AA when it deletes · f6ae904e
  Chris Lattner authored Jan 11, 2011
```
phi nodes.  It is called from MergeBlockIntoPredecessor which is 
called from GVN, which claims to preserve these.

I'm skeptical that this is the actual problem behind PR8954, but
this is a stab in the right direction.

llvm-svn: 123222
```
  f6ae904e
- random cleanups · dfcfcb49
  Chris Lattner authored Jan 11, 2011
```
llvm-svn: 123221
```
  dfcfcb49
- remove a bogus assertion: the latch block of a loop is not · 63fe78de
  Chris Lattner authored Jan 11, 2011
```
neccesarily an uncond branch to the header.  This fixes 
PR8955 (the assertion tripping).

llvm-svn: 123219
```
  63fe78de
- Fix a random missed optimization by making InstCombine more aggressive when... · d490c2d2
  Owen Anderson authored Jan 11, 2011
```
Fix a random missed optimization by making InstCombine more aggressive when determining which bits are demanded by
a comparison against a constant.

llvm-svn: 123203
```
  d490c2d2
Jan 10, 2011
- Teach instcombine about the rest of the SSE and SSE2 conversion · cf414cf0
  Chandler Carruth authored Jan 10, 2011
```
intrinsics element dependencies. Reviewed by Nick.

llvm-svn: 123161
```
  cf414cf0
- another random stab in the dark trying to fix llvm-gcc-i386-linux-selfhost · 88bc848a
  Chris Lattner authored Jan 10, 2011
```
llvm-svn: 123149
```
  88bc848a
- another (more) aggressive attempt to bring llvm-gcc-i386-linux-selfhost · 4662bd4b
  Chris Lattner authored Jan 10, 2011
```
back to life.

llvm-svn: 123146
```
  4662bd4b