Skip to content
  1. Sep 26, 2006
  2. Sep 25, 2006
  3. Sep 24, 2006
  4. Sep 21, 2006
  5. Sep 20, 2006
    • Chris Lattner's avatar
      Compile: · c8cd62d3
      Chris Lattner authored
      int test3(int a, int b) { return (a < 0) ? a : 0; }
      
      to:
      
      _test3:
              srawi r2, r3, 31
              and r3, r2, r3
              blr
      
      instead of:
      
      _test3:
              cmpwi cr0, r3, 1
              li r2, 0
              blt cr0, LBB2_2 ;entry
      LBB2_1: ;entry
              mr r3, r2
      LBB2_2: ;entry
              blr
      
      
      This implements: PowerPC/select_lt0.ll:seli32_a_a
      
      llvm-svn: 30517
      c8cd62d3
    • Chris Lattner's avatar
      Fold the full generality of (any_extend (truncate x)) · 8746e2cd
      Chris Lattner authored
      llvm-svn: 30514
      8746e2cd
    • Chris Lattner's avatar
      Two things: · 8b68decb
      Chris Lattner authored
      1. teach SimplifySetCC that '(srl (ctlz x), 5) == 0' is really x != 0.
      2. Teach visitSELECT_CC to use SimplifySetCC instead of calling it and
         ignoring the result.  This allows us to compile:
      
      bool %test(ulong %x) {
        %tmp = setlt ulong %x, 4294967296
        ret bool %tmp
      }
      
      to:
      
      _test:
              cntlzw r2, r3
              cmplwi cr0, r3, 1
              srwi r2, r2, 5
              li r3, 0
              beq cr0, LBB1_2 ;
      LBB1_1: ;
              mr r3, r2
      LBB1_2: ;
              blr
      
      instead of:
      
      _test:
              addi r2, r3, -1
              cntlzw r2, r2
              cntlzw r3, r3
              srwi r2, r2, 5
              cmplwi cr0, r2, 0
              srwi r2, r3, 5
              li r3, 0
              bne cr0, LBB1_2 ;
      LBB1_1: ;
              mr r3, r2
      LBB1_2: ;
              blr
      
      This isn't wonderful, but it's an improvement.
      
      llvm-svn: 30513
      8b68decb
    • Chris Lattner's avatar
      Expand 64-bit shifts more optimally if we know that the high bit of the · 875ea0cd
      Chris Lattner authored
      shift amount is one or zero.  For example, for:
      
      long long foo1(long long X, int C) {
        return X << (C|32);
      }
      
      long long foo2(long long X, int C) {
        return X << (C&~32);
      }
      
      we get:
      
      _foo1:
              movb $31, %cl
              movl 4(%esp), %edx
              andb 12(%esp), %cl
              shll %cl, %edx
              xorl %eax, %eax
              ret
      _foo2:
              movb $223, %cl
              movl 4(%esp), %eax
              movl 8(%esp), %edx
              andb 12(%esp), %cl
              shldl %cl, %eax, %edx
              shll %cl, %eax
              ret
      
      instead of:
      
      _foo1:
              subl $4, %esp
              movl %ebx, (%esp)
              movb $32, %bl
              movl 8(%esp), %eax
              movl 12(%esp), %edx
              movb %bl, %cl
              orb 16(%esp), %cl
              shldl %cl, %eax, %edx
              shll %cl, %eax
              xorl %ecx, %ecx
              testb %bl, %bl
              cmovne %eax, %edx
              cmovne %ecx, %eax
              movl (%esp), %ebx
              addl $4, %esp
              ret
      _foo2:
              subl $4, %esp
              movl %ebx, (%esp)
              movb $223, %cl
              movl 8(%esp), %eax
              movl 12(%esp), %edx
              andb 16(%esp), %cl
              shldl %cl, %eax, %edx
              shll %cl, %eax
              xorl %ecx, %ecx
              xorb %bl, %bl
              testb %bl, %bl
              cmovne %eax, %edx
              cmovne %ecx, %eax
              movl (%esp), %ebx
              addl $4, %esp
              ret
      
      llvm-svn: 30506
      875ea0cd
  6. Sep 19, 2006
  7. Sep 18, 2006
  8. Sep 16, 2006
  9. Sep 15, 2006
  10. Sep 14, 2006
  11. Sep 13, 2006
    • Chris Lattner's avatar
      If LSR went through a lot of trouble to put constants (e.g. the addr of a global) · 84cc1f7c
      Chris Lattner authored
      in a specific BB, don't undo this!  This allows us to compile
      CodeGen/X86/loop-hoist.ll into:
      
      _foo:
              xorl %eax, %eax
      ***     movl L_Arr$non_lazy_ptr, %ecx
              movl 4(%esp), %edx
      LBB1_1: #cond_true
              movl %eax, (%ecx,%eax,4)
              incl %eax
              cmpl %edx, %eax
              jne LBB1_1      #cond_true
      LBB1_2: #return
              ret
      
      instead of:
      
      _foo:
              xorl %eax, %eax
              movl 4(%esp), %ecx
      LBB1_1: #cond_true
      ***     movl L_Arr$non_lazy_ptr, %edx
              movl %eax, (%edx,%eax,4)
              incl %eax
              cmpl %ecx, %eax
              jne LBB1_1      #cond_true
      LBB1_2: #return
              ret
      
      This was noticed in 464.h264ref.  This doesn't usually affect PPC,
      but strikes X86 all the time.
      
      llvm-svn: 30290
      84cc1f7c
    • Chris Lattner's avatar
      Compile X << 1 (where X is a long-long) to: · 72b503bc
      Chris Lattner authored
              addl %ecx, %ecx
              adcl %eax, %eax
      
      instead of:
      
              movl %ecx, %edx
              addl %edx, %edx
              shrl $31, %ecx
              addl %eax, %eax
              orl %ecx, %eax
      
      and to:
      
              addc r5, r5, r5
              adde r4, r4, r4
      
      instead of:
      
              slwi r2,r9,1
              srwi r0,r11,31
              slwi r3,r11,1
              or r2,r0,r2
      
      on PPC.
      
      llvm-svn: 30284
      72b503bc
  12. Sep 12, 2006
  13. Sep 11, 2006
  14. Sep 10, 2006
Loading