  1. Dec 23, 2010
    • DAGCombine add (sext i1), X into sub X, (zext i1) if sext from i1 is illegal. · 1f4dfbbc
      Benjamin Kramer authored
      The latter usually compiles into smaller code.
      
      example code:
      unsigned foo(unsigned x, unsigned y) {
        if (x != 0) y--;
        return y;
      }
      
      before:
        _foo:                           ## @foo
          cmpl  $1, 4(%esp)             ## encoding: [0x83,0x7c,0x24,0x04,0x01]
          sbbl  %eax, %eax              ## encoding: [0x19,0xc0]
          notl  %eax                    ## encoding: [0xf7,0xd0]
          addl  8(%esp), %eax           ## encoding: [0x03,0x44,0x24,0x08]
          ret                           ## encoding: [0xc3]
      
      after:
        _foo:                           ## @foo
          cmpl  $1, 4(%esp)             ## encoding: [0x83,0x7c,0x24,0x04,0x01]
          movl  8(%esp), %eax           ## encoding: [0x8b,0x44,0x24,0x08]
          adcl  $-1, %eax               ## encoding: [0x83,0xd0,0xff]
          ret                           ## encoding: [0xc3]
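
      For intuition, here is a minimal C check of the identity behind the
      combine (illustrative only, not part of the commit): sign-extending
      an i1 yields 0 or -1, so adding it is the same as subtracting the
      zero-extended bit.

        #include <assert.h>
        #include <stdint.h>

        int main(void) {
          for (uint32_t x = 0; x < 8; ++x) {
            for (int c = 0; c <= 1; ++c) {
              uint32_t sext = c ? UINT32_MAX : 0; /* sext i1 -> 0 or -1 */
              uint32_t zext = (uint32_t)c;        /* zext i1 -> 0 or 1  */
              assert(x + sext == x - zext);       /* add(sext) == sub(zext) */
            }
          }
          return 0;
        }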
      
      llvm-svn: 122455
    • X86: Lower a select directly to a setcc_carry if possible. · 6020ed9d
      Benjamin Kramer authored
        int test(unsigned long a, unsigned long b) { return -(a < b); }
      compiles to
        _test:                              ## @test
          cmpq  %rsi, %rdi                  ## encoding: [0x48,0x39,0xf7]
          sbbl  %eax, %eax                  ## encoding: [0x19,0xc0]
          ret                               ## encoding: [0xc3]
      instead of
        _test:                              ## @test
          xorl  %ecx, %ecx                  ## encoding: [0x31,0xc9]
          cmpq  %rsi, %rdi                  ## encoding: [0x48,0x39,0xf7]
          movl  $-1, %eax                   ## encoding: [0xb8,0xff,0xff,0xff,0xff]
          cmovael %ecx, %eax                ## encoding: [0x0f,0x43,0xc1]
          ret                               ## encoding: [0xc3]
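
      As a sanity check (illustrative C, not from the commit): the cmpq
      sets the carry flag exactly when a < b, and sbbl %eax, %eax computes
      eax - eax - CF = -CF, which is precisely -(a < b).

        #include <assert.h>

        static int test(unsigned long a, unsigned long b) { return -(a < b); }

        int main(void) {
          assert(test(1, 2) == -1); /* a < b: CF set, sbb yields -1 */
          assert(test(2, 1) == 0);  /* a > b: CF clear, sbb yields 0 */
          assert(test(1, 1) == 0);  /* a == b: CF clear as well     */
          return 0;
        }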
      
      llvm-svn: 122451
  2. Dec 14, 2010
    • Fix a minor bug in the two-address pass. It was missing a commute opportunity. · 19dc77ce
      Evan Cheng authored
      regB = move RCX
      regA = op regB, regC
      RAX  = move regA
      where both regB and regC are killed. If regB is constrained to
      incompatible physical registers but regC is not constrained at all,
      then it's better to commute the instruction.
             movl    %edi, %eax
             shlq    $32, %rcx
             leaq    (%rcx,%rax), %rax
      =>
             movl    %edi, %eax
             shlq    $32, %rcx
             orq     %rcx, %rax
      rdar://8762995
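
      A hypothetical sketch of the commute heuristic (names and structure
      invented for illustration; this is not the actual two-address pass
      code):

        /* Commute "regA = op regB, regC" when only regB's physical-register
           constraints clash with the surrounding copies. */
        struct OpInfo {
          int commutable;     /* op regB, regC == op regC, regB      */
          int killsB, killsC; /* both sources die at the instruction */
          int constrainedB;   /* regB tied to incompatible physregs  */
          int constrainedC;   /* regC similarly constrained          */
        };

        int shouldCommute(const struct OpInfo *op) {
          return op->commutable && op->killsB && op->killsC &&
                 op->constrainedB && !op->constrainedC;
        }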
      
      llvm-svn: 121793
  3. Dec 13, 2010
    • rename test · 8e21a02c
      Chris Lattner authored
      llvm-svn: 121697
    • Add a couple dag combines to transform mulhi/mullo into a wider multiply · 10bd29f1
      Chris Lattner authored
      when the wider type is legal.  This allows us to compile:
      
      define zeroext i16 @test1(i16 zeroext %x) nounwind {
      entry:
      	%div = udiv i16 %x, 33
      	ret i16 %div
      }
      
      into:
      
      test1:                                  # @test1
      	movzwl	4(%esp), %eax
      	imull	$63551, %eax, %eax      # imm = 0xF83F
      	shrl	$21, %eax
      	ret
      
      instead of:
      
      test1:                                  # @test1
              movw    $-1985, %ax             # imm = 0xFFFFFFFFFFFFF83F
              mulw    4(%esp)
              andl    $65504, %edx            # imm = 0xFFE0
              movl    %edx, %eax
              shrl    $5, %eax
              ret
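
      The magic constant checks out exhaustively: 63551 is ceil(2^21 / 33),
      and the widened multiply-and-shift agrees with the 16-bit division
      for every input (illustrative C, not part of the commit):

        #include <assert.h>
        #include <stdint.h>

        int main(void) {
          for (uint32_t x = 0; x <= 0xFFFF; ++x)
            assert(((x * 63551u) >> 21) == x / 33); /* imull+shrl == udiv */
          return 0;
        }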
      
      Implementing rdar://8760399 and example #4 from:
      http://blog.regehr.org/archives/320
      
      We should implement the same thing for [su]mul_lohi, but I don't
      have immediate plans to do this.
      
      llvm-svn: 121696
  4. Dec 10, 2010
    • Formalize the notion that AVX and SSE are non-overlapping extensions from the compiler's point of view. · 8b08f523
      Nate Begeman authored
      Per email discussion, we either want to always use VEX-prefixed instructions or never use them, and we take "HasAVX" to mean "always use VEX". Passing -mattr=-avx,+sse42 should restore legacy SSE support when desirable.
      
      llvm-svn: 121439
  5. Dec 05, 2010
    • Teach X86ISelLowering that the second result of X86ISD::UMUL is a flags result. · 68861717
      Chris Lattner authored
      This allows us to compile:
      
      void *test12(long count) {
            return new int[count];
      }
      
      into:
      
      test12:
      	movl	$4, %ecx
      	movq	%rdi, %rax
      	mulq	%rcx
      	movq	$-1, %rdi
      	cmovnoq	%rax, %rdi
      	jmp	__Znam                  ## TAILCALL
      
      instead of:
      
      test12:
      	movl	$4, %ecx
      	movq	%rdi, %rax
      	mulq	%rcx
      	seto	%cl
      	testb	%cl, %cl
      	movq	$-1, %rdi
      	cmoveq	%rax, %rdi
      	jmp	__Znam
      
      Of course it would be even better if the regalloc inverted the cmov to 'cmovoq',
      which would eliminate the need for the 'movq %rdi, %rax'.
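
      An illustrative C equivalent of what the mulq/cmovnoq sequence
      computes (the function name is invented; __builtin_mul_overflow is a
      GCC/Clang builtin used here for clarity):

        #include <stdint.h>

        uint64_t array_bytes(uint64_t count) {
          uint64_t bytes;
          /* mulq sets OF/CF on unsigned overflow; cmovnoq keeps the
             product only when no overflow occurred, else -1 survives. */
          if (__builtin_mul_overflow(count, (uint64_t)4, &bytes))
            return UINT64_MAX;
          return bytes;
        }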
      
      llvm-svn: 120936
    • It turns out that when ".with.overflow" intrinsics were added to the X86 backend, they were all implemented except umul. · 364bb0a0
      Chris Lattner authored
      umul fell back to the default implementation, which did a hi/lo
      multiply and compared the top half. Fix this to check the overflow
      flag that the 'mul' instruction sets, so we can avoid an explicit
      test. Now we compile:
      
      void *func(long count) {
            return new int[count];
      }
      
      into:
      
      __Z4funcl:                              ## @_Z4funcl
      	movl	$4, %ecx                ## encoding: [0xb9,0x04,0x00,0x00,0x00]
      	movq	%rdi, %rax              ## encoding: [0x48,0x89,0xf8]
      	mulq	%rcx                    ## encoding: [0x48,0xf7,0xe1]
      	seto	%cl                     ## encoding: [0x0f,0x90,0xc1]
      	testb	%cl, %cl                ## encoding: [0x84,0xc9]
      	movq	$-1, %rdi               ## encoding: [0x48,0xc7,0xc7,0xff,0xff,0xff,0xff]
      	cmoveq	%rax, %rdi              ## encoding: [0x48,0x0f,0x44,0xf8]
      	jmp	__Znam                  ## TAILCALL
      
      instead of:
      
      __Z4funcl:                              ## @_Z4funcl
      	movl	$4, %ecx                ## encoding: [0xb9,0x04,0x00,0x00,0x00]
      	movq	%rdi, %rax              ## encoding: [0x48,0x89,0xf8]
      	mulq	%rcx                    ## encoding: [0x48,0xf7,0xe1]
      	testq	%rdx, %rdx              ## encoding: [0x48,0x85,0xd2]
      	movq	$-1, %rdi               ## encoding: [0x48,0xc7,0xc7,0xff,0xff,0xff,0xff]
      	cmoveq	%rax, %rdi              ## encoding: [0x48,0x0f,0x44,0xf8]
      	jmp	__Znam                  ## TAILCALL
      
      Other than the silly seto+test, this is using the o bit directly, so it's going in the right
      direction.
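
      For contrast, an illustrative C version of the old generic lowering
      (the __uint128_t widening is my sketch, not the backend's code): it
      tests the high half of a double-width product, which is what the
      testq %rdx, %rdx was doing.

        #include <stdint.h>

        int umul_overflows(uint64_t a, uint64_t b, uint64_t *lo) {
          __uint128_t wide = (__uint128_t)a * b;
          *lo = (uint64_t)wide;
          return (uint64_t)(wide >> 64) != 0; /* nonzero high half */
        }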
      
      llvm-svn: 120935
    • fix the rest of the linux miscompares :) · 183ddd8e
      Chris Lattner authored
      llvm-svn: 120933
    • generalize the previous check to handle -1 on either side of the select, inserting a not to compensate. · 116580a1
      Chris Lattner authored
      Add a missing isZero check that I lost somehow.
      
      This improves codegen of:
      
      void *func(long count) {
            return new int[count];
      }
      
      from:
      
      __Z4funcl:                              ## @_Z4funcl
      	movl	$4, %ecx                ## encoding: [0xb9,0x04,0x00,0x00,0x00]
      	movq	%rdi, %rax              ## encoding: [0x48,0x89,0xf8]
      	mulq	%rcx                    ## encoding: [0x48,0xf7,0xe1]
      	testq	%rdx, %rdx              ## encoding: [0x48,0x85,0xd2]
      	movq	$-1, %rdi               ## encoding: [0x48,0xc7,0xc7,0xff,0xff,0xff,0xff]
      	cmoveq	%rax, %rdi              ## encoding: [0x48,0x0f,0x44,0xf8]
      	jmp	__Znam                  ## TAILCALL
                                              ## encoding: [0xeb,A]
      
      to:
      
      __Z4funcl:                              ## @_Z4funcl
      	movl	$4, %ecx                ## encoding: [0xb9,0x04,0x00,0x00,0x00]
      	movq	%rdi, %rax              ## encoding: [0x48,0x89,0xf8]
      	mulq	%rcx                    ## encoding: [0x48,0xf7,0xe1]
      	cmpq	$1, %rdx                ## encoding: [0x48,0x83,0xfa,0x01]
      	sbbq	%rdi, %rdi              ## encoding: [0x48,0x19,0xff]
      	notq	%rdi                    ## encoding: [0x48,0xf7,0xd7]
      	orq	%rax, %rdi              ## encoding: [0x48,0x09,0xc7]
      	jmp	__Znam                  ## TAILCALL
                                              ## encoding: [0xeb,A]
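
      An illustrative C rendering of the branchless select the new sequence
      computes (function name invented): cmpq $1, %rdx sets CF exactly when
      %rdx is zero, sbb materializes that as a 0/-1 value, not turns it into
      an all-ones mask precisely when the high half is nonzero, and orq
      folds in the product.

        #include <stdint.h>

        uint64_t select_or_minus1(uint64_t product, uint64_t hi) {
          uint64_t mask = (hi != 0) ? UINT64_MAX : 0; /* cmp+sbb+not */
          return product | mask;                      /* orq: product or -1 */
        }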
      
      llvm-svn: 120932