Commits · 030f02021b6359ec5641622cf1aa63d873ecf55a · Roger Ferrer / llvm-epi-0.8

Sep 01, 2010

licm is wasting time hoisting constant foldable operations, · 030f0202

Chris Lattner authored Aug 31, 2010

instead of hoisting them, just fold them away.  This occurs in the
testcase for PR8041, for example.

llvm-svn: 112669

030f0202

We have a chance for an optimization. Consider this code: · 6789f8b6

Bill Wendling authored Aug 31, 2010

int x(int t) {
  if (t & 256)
    return -26;
  return 0;
}

We generate this:

     tst.w   r0, #256
     mvn     r0, #25
     it      eq
     moveq   r0, #0

while gcc generates this:

     ands    r0, r0, #256
     it      ne
     mvnne   r0, #25
     bx      lr

Scandalous really!

During ISel time, we can look for this particular pattern. One where we have a
"MOVCC" that uses the flag off of a CMPZ that itself is comparing an AND
instruction to 0. Something like this (greatly simplified):

  %r0 = ISD::AND ...
  ARMISD::CMPZ %r0, 0         @ sets [CPSR]
  %r0 = ARMISD::MOVCC 0, -26  @ reads [CPSR]

All we have to do is convert the "ISD::AND" into an "ARM::ANDS" that sets [CPSR]
when it's zero. The zero value will all ready be in the %r0 register and we only
need to change it if the AND wasn't zero. Easy!

llvm-svn: 112664

6789f8b6

Reapply r112623. Included additional check for unused byval argument. · 86ec8b3a
Devang Patel authored Aug 31, 2010
```
llvm-svn: 112659
```
86ec8b3a

Aug 31, 2010
- Merge 2010-08-31-InfiniteRecursion.ll into crash.ll. · a5e6b3ec
  Owen Anderson authored Aug 31, 2010
```
llvm-svn: 112635
```
  a5e6b3ec
- Revert r112623. It is causing self host build failures. · 529f248e
  Devang Patel authored Aug 31, 2010
```
llvm-svn: 112631
```
  529f248e
- Remember byval argument's frame index during argument lowering and use this... · 8559932d
  Devang Patel authored Aug 31, 2010
```
Remember byval argument's frame index during argument lowering and use this info to emit debug info.
Fixes Radar 8367011.

llvm-svn: 112623
```
  8559932d
- Add a test for the duplicated-conditional situation illutrated by PR5652. · 799a08ae
  Owen Anderson authored Aug 31, 2010
```
llvm-svn: 112621
```
  799a08ae
- merge two tests. · e2295f1c
  Chris Lattner authored Aug 31, 2010
```
llvm-svn: 112617
```
  e2295f1c
- Manually reduce this testcase. · 3931c859
  Owen Anderson authored Aug 31, 2010
```
llvm-svn: 112615
```
  3931c859
- merge two tests and convert to filecheck. · fbcd165b
  Chris Lattner authored Aug 31, 2010
```
llvm-svn: 112613
```
  fbcd165b
- Add a micro-test for the transforms I added to JumpThreading. · ada06237
  Owen Anderson authored Aug 31, 2010
```
I have not been able to find a way to test each in isolation, for a few reasons:
1) The ability to look-through non-i1 BinaryOperator's requires the ability to look through non-constant
   ICmps in order for it to ever trigger.
2) The ability to do LVI-powered PHI value determination only matters in cases that ProcessBranchOnPHI
   can't handle.  Since it already handles all the cases without other instructions in the def-use chain
   between the PHI and the branch, it requires the ability to look through ICmps and/or BinaryOperators
   as well.

llvm-svn: 112611
```
  ada06237
- Update test for 112609 · ad9b6de3
  Jim Grosbach authored Aug 31, 2010
```
llvm-svn: 112610
```
  ad9b6de3
- Rename test directory to reflect new pass name. · 064b139c
  Owen Anderson authored Aug 31, 2010
```
llvm-svn: 112592
```
  064b139c
- Rename ValuePropagation to a more descriptive CorrelatedValuePropagation. · 48d58ad6
  Owen Anderson authored Aug 31, 2010
```
llvm-svn: 112591
```
  48d58ad6
- More Chris-inspired JumpThreading fixes: use ConstantExpr to correctly... · 3997a07f
  Owen Anderson authored Aug 31, 2010
```
More Chris-inspired JumpThreading fixes: use ConstantExpr to correctly constant-fold undef, and be more careful with its return value.
This actually exposed an infinite recursion bug in ComputeValueKnownInPredecessors which theoretically already existed (in JumpThreading's
handling of and/or of i1's), but never manifested before.  This patch adds a tracking set to prevent this case.

llvm-svn: 112589
```
  3997a07f
- Remove r111665, which implemented store-narrowing in InstCombine. Chris... · 376597c1
  Owen Anderson authored Aug 31, 2010
```
Remove r111665, which implemented store-narrowing in InstCombine.  Chris discovered a miscompilation in it, and it's not easily
fixable at the optimizer level. I'll investigate reimplementing it in DAGCombine.

llvm-svn: 112575
```
  376597c1
- Fix borken test · 3a1d87a7
  Anton Korobeynikov authored Aug 30, 2010
```
llvm-svn: 112555
```
  3a1d87a7
- Combine these two tests, and make sure there's a newline at the end of the file. · 70b17c50
  Owen Anderson authored Aug 30, 2010
```
llvm-svn: 112554
```
  70b17c50
Aug 30, 2010
- Remove NEON vmovn intrinsic, replacing it with vector truncate operations. · 4cd8a126
  Bob Wilson authored Aug 30, 2010
```
Auto-upgrade the old intrinsic and update tests.

llvm-svn: 112507
```
  4cd8a126
- two changes: · 34bfab0a
  Chris Lattner authored Aug 30, 2010
```
1) nuke ConstDataCoalSection, which is dead.
2) revise my previous patch for rdar://8018335,
  which was completely wrong.  Specifically, it doesn't 
  make sense to mark __TEXT,__const_coal as PURE_INSTRUCTIONS,
  because it is for readonly data.  templates (it turns out)
  go to const_coal_nt.  The real fix for rdar://8018335 was
  to give ConstTextCoalSection a section kind of ReadOnly 
  instead of Text.

llvm-svn: 112496
```
  34bfab0a
- Partially revert r112480. Caused test failures. · 2f997cde
  Michael J. Spencer authored Aug 30, 2010
```
llvm-svn: 112486
```
  2f997cde
- coff-dump.py: Fix PR7996. Now it is compatible to Python-2.4. · e53cf6f8
  NAKAMURA Takumi authored Aug 30, 2010
```
llvm-svn: 112485
```
  e53cf6f8
- Fix constant-over-index.ll test on windows. · 79833404
  Michael J. Spencer authored Aug 30, 2010
```
llvm-svn: 112483
```
  79833404
- Test: Fix LLVMC tests on CMake. · 41c18853
  Michael J. Spencer authored Aug 30, 2010
```
The CMake build didn't define TEST_COMPILE_CXX_CMD. The tests assumed gcc.

llvm-svn: 112480
```
  41c18853
- Correct bogus module triple specifications. · 68c30907
  Duncan Sands authored Aug 30, 2010
```
llvm-svn: 112469
```
  68c30907
Aug 29, 2010
- LICM does get dead instructions input to it. Instead of sinking them · 263f8046
  Chris Lattner authored Aug 29, 2010
```
out of loops, just delete them.

llvm-svn: 112451
```
  263f8046
- Make IVUsers iterative instead of recursive. · 3a08ed79
  Dan Gohman authored Aug 29, 2010
```
This has the side effect of reversing the order of most of
IVUser's results.

llvm-svn: 112442
```
  3a08ed79
- Make this test less dependent on register allocation choices. · 6665550b
  Dan Gohman authored Aug 29, 2010
```
llvm-svn: 112426
```
  6665550b
- Use exec. · 883fa863
  Dan Gohman authored Aug 29, 2010
```
llvm-svn: 112425
```
  883fa863
- Fix lowering of INSERT_VECTOR_ELT in SPU. · 1e616572
  Kalle Raiskila authored Aug 29, 2010
```
The IDX was treated as byte index, not element index.

llvm-svn: 112422
```
  1e616572
- Remove NEON vaddl, vaddw, vsubl, and vsubw intrinsics. Instead, use llvm · d0c05488
  Bob Wilson authored Aug 29, 2010
```
IR add/sub operations with one or both operands sign- or zero-extended.
Auto-upgrade the old intrinsics.

llvm-svn: 112416
```
  d0c05488
- merge a bunch of shuffle tests into sse2.ll · c2887bc2
  Chris Lattner authored Aug 29, 2010
```
llvm-svn: 112398
```
  c2887bc2
- add some nounwind's · b1ff9784
  Chris Lattner authored Aug 29, 2010
```
llvm-svn: 112396
```
  b1ff9784
Aug 28, 2010

fixme accomplished · 112b6ee3
Chris Lattner authored Aug 28, 2010
```
llvm-svn: 112386
```
112b6ee3

fix the buildvector->insertp[sd] logic to not always create a redundant · 94656b1c

Chris Lattner authored Aug 28, 2010

insertp[sd] $0, which is a noop.  Before:

_f32:                                   ## @f32
	pshufd	$1, %xmm1, %xmm2
	pshufd	$1, %xmm0, %xmm3
	addss	%xmm2, %xmm3
	addss	%xmm1, %xmm0
                                        ## kill: XMM0<def> XMM0<kill> XMM0<def>
	insertps	$0, %xmm0, %xmm0
	insertps	$16, %xmm3, %xmm0
	ret

after:

_f32:                                   ## @f32
	movdqa	%xmm0, %xmm2
	addss	%xmm1, %xmm2
	pshufd	$1, %xmm1, %xmm1
	pshufd	$1, %xmm0, %xmm3
	addss	%xmm1, %xmm3
	movdqa	%xmm2, %xmm0
	insertps	$16, %xmm3, %xmm0
	ret

The extra movs are due to a random (poor) scheduling decision.

llvm-svn: 112379

94656b1c

fix the BuildVector -> unpcklps logic to not do pointless shuffles · bcb6090a

Chris Lattner authored Aug 28, 2010

when the top elements of a vector are undefined.  This happens all
the time for X86-64 ABI stuff because only the low 2 elements of
a 4 element vector are defined.  For example, on:

_Complex float f32(_Complex float A, _Complex float B) {
  return A+B;
}

We used to produce (with SSE2, SSE4.1+ uses insertps):

_f32:                                   ## @f32
	movdqa	%xmm0, %xmm2
	addss	%xmm1, %xmm2
	pshufd	$16, %xmm2, %xmm2
	pshufd	$1, %xmm1, %xmm1
	pshufd	$1, %xmm0, %xmm0
	addss	%xmm1, %xmm0
	pshufd	$16, %xmm0, %xmm1
	movdqa	%xmm2, %xmm0
	unpcklps	%xmm1, %xmm0
	ret

We now produce:

_f32:                                   ## @f32
	movdqa	%xmm0, %xmm2
	addss	%xmm1, %xmm2
	pshufd	$1, %xmm1, %xmm1
	pshufd	$1, %xmm0, %xmm3
	addss	%xmm1, %xmm3
	movaps	%xmm2, %xmm0
	unpcklps	%xmm3, %xmm0
	ret

This implements rdar://8368414

llvm-svn: 112378

bcb6090a

Update ocaml test. · 2e5c1471
Benjamin Kramer authored Aug 28, 2010
```
llvm-svn: 112364
```
2e5c1471
remove unions from LLVM IR. They are severely buggy and not · 13ee795c
Chris Lattner authored Aug 28, 2010
```
being actively maintained, improved, or extended.

llvm-svn: 112356
```
13ee795c

remove the ABCD and SSI passes. They don't have any clients that · 504e5100

Chris Lattner authored Aug 28, 2010

I'm aware of, aren't maintained, and LVI will be replacing their value.
nlewycky approved this on irc.

llvm-svn: 112355

504e5100

handle the constant case of vector insertion. For something · d0214f3e

Chris Lattner authored Aug 28, 2010

like this:

struct S { float A, B, C, D; };

struct S g;
struct S bar() { 
  struct S A = g;
  ++A.B;
  A.A = 42;
  return A;
}

we now generate:

_bar:                                   ## @bar
## BB#0:                                ## %entry
	movq	_g@GOTPCREL(%rip), %rax
	movss	12(%rax), %xmm0
	pshufd	$16, %xmm0, %xmm0
	movss	4(%rax), %xmm2
	movss	8(%rax), %xmm1
	pshufd	$16, %xmm1, %xmm1
	unpcklps	%xmm0, %xmm1
	addss	LCPI1_0(%rip), %xmm2
	pshufd	$16, %xmm2, %xmm2
	movss	LCPI1_1(%rip), %xmm0
	pshufd	$16, %xmm0, %xmm0
	unpcklps	%xmm2, %xmm0
	ret

instead of:

_bar:                                   ## @bar
## BB#0:                                ## %entry
	movq	_g@GOTPCREL(%rip), %rax
	movss	12(%rax), %xmm0
	pshufd	$16, %xmm0, %xmm0
	movss	4(%rax), %xmm2
	movss	8(%rax), %xmm1
	pshufd	$16, %xmm1, %xmm1
	unpcklps	%xmm0, %xmm1
	addss	LCPI1_0(%rip), %xmm2
	movd	%xmm2, %eax
	shlq	$32, %rax
	addq	$1109917696, %rax       ## imm = 0x42280000
	movd	%rax, %xmm0
	ret

llvm-svn: 112345

d0214f3e