Commits · acbb456dde4a1561764504b980f9a40cc682e962 · Lorenzo Albano / LLVM bpEVL

Dec 06, 2006
- Remove the 'printname' argument to WriteAsOperand. It is always true, and · edcc8c2f
  Chris Lattner authored Dec 06, 2006
```
passing false would make the asmprinter fail anyway.

llvm-svn: 32264
```
  edcc8c2f
- counter should be unsigned. · ec589036
  Chris Lattner authored Dec 06, 2006
```
llvm-svn: 32252
```
  ec589036
Dec 05, 2006
- add an instcombine xform. This speeds up 462.libquantum from 9.78s to · c209b584
  Chris Lattner authored Dec 05, 2006
```
7.48s.  This regression is due to unforseen consequences of the cast patch.

llvm-svn: 32209
```
  c209b584
- SCCP does not handle Packed Type properly. Disable Packed Type handling · 21efc731
  Devang Patel authored Dec 04, 2006
```
for now.

llvm-svn: 32208
```
  21efc731
Dec 04, 2006
- Update call to CastInst::getCastOpcode for its new signature. · 14fbdd55
  Reid Spencer authored Dec 04, 2006
```
llvm-svn: 32166
```
  14fbdd55
Dec 02, 2006
- Unbreak VC++ build. · cc08c831
  Jeff Cohen authored Dec 02, 2006
```
llvm-svn: 32113
```
  cc08c831
- disable transformations that are invalid for fp vectors. This fixes · 7a002fec
  Chris Lattner authored Dec 02, 2006
```
Transforms/InstCombine/2006-12-01-BadFPVectorXform.ll

llvm-svn: 32112
```
  7a002fec
Dec 01, 2006
- Remove 4 FIXMEs to hack around cast-to-bool problems which no longer exist. · ad05ee9f
  Reid Spencer authored Nov 30, 2006
```
llvm-svn: 32051
```
  ad05ee9f
Nov 30, 2006
- make it clear that this is always a zext · c8978c52
  Chris Lattner authored Nov 30, 2006
```
llvm-svn: 32044
```
  c8978c52
- One more bugfix, 3 cases of making casts explicit. · 3ede00b3
  Chris Lattner authored Nov 30, 2006
```
llvm-svn: 32043
```
  3ede00b3
- Fix a bug in globalopt due to the recent cast patch. · 0390b9e6
  Chris Lattner authored Nov 30, 2006
```
llvm-svn: 32042
```
  0390b9e6
Nov 29, 2006
- implement cast.ll:test35. With this, we recognize: · 960acb00
  Chris Lattner authored Nov 29, 2006
```
unsigned short swp(unsigned short a) {
       return ((a & 0xff00) >> 8 | (a & 0x00ff) << 8);
}

as an idiom for bswap.

llvm-svn: 32011
```
  960acb00
- Teach instcombine to turn trunc(srl x, c) -> srl (trunc(x), c) when safe. · d747f015
  Chris Lattner authored Nov 29, 2006
```
This implements InstCombine/cast.ll:test34.  It fires hundreds of times on
176.gcc.

llvm-svn: 32009
```
  d747f015
- Implement Regression/Transforms/InstCombine/bswap-fold.ll, · a7942b7b
  Chris Lattner authored Nov 29, 2006
```
folding   seteq (bswap(x)), c -> seteq(x,bswap(c))

llvm-svn: 32006
```
  a7942b7b
- Join a split line. · a736fdf2
  Reid Spencer authored Nov 29, 2006
```
llvm-svn: 31996
```
  a736fdf2
Nov 28, 2006
- Undo the last patch until 253.perlbmk passes with these changes. · 116ad83a
  Reid Spencer authored Nov 28, 2006
```
llvm-svn: 31977
```
  116ad83a
- Remove 4 FIXME's from the CAST patch now that the back end is correctly · 59fe2d89
  Reid Spencer authored Nov 28, 2006
```
producing code for "trunc to bool". This passes all tests on Linux.

llvm-svn: 31963
```
  59fe2d89
Nov 27, 2006

Fix PR1014 and InstCombine/2006-11-27-XorBug.ll. · 8e9a7b73
Chris Lattner authored Nov 27, 2006
```
llvm-svn: 31941
```
8e9a7b73

For PR950: · 6c38f0bb

Reid Spencer authored Nov 27, 2006

The long awaited CAST patch. This introduces 12 new instructions into LLVM
to replace the cast instruction. Corresponding changes throughout LLVM are
provided. This passes llvm-test, llvm/test, and SPEC CPUINT2000 with the
exception of 175.vpr which fails only on a slight floating point output
difference.

llvm-svn: 31931

6c38f0bb

Nov 26, 2006
- Remove #include <iostream> and use llvm_* streams instead. · 4ae40107
  Bill Wendling authored Nov 26, 2006
```
llvm-svn: 31925
```
  4ae40107
- Replace #include <iostream> with llvm_* streams. · 8f13b5c4
  Bill Wendling authored Nov 26, 2006
```
llvm-svn: 31924
```
  8f13b5c4
- Removed #include <iostream> and replaced with llvm_* streams. · 5dbf43c9
  Bill Wendling authored Nov 26, 2006
```
llvm-svn: 31923
```
  5dbf43c9
- Removed #include <iostream> and used the llvm_cerr/DOUT streams instead. · a7459ca8
  Bill Wendling authored Nov 26, 2006
```
llvm-svn: 31922
```
  a7459ca8
Nov 23, 2006

Update to new predicate simplifier VRP design. Fixes PR966 and PR967. · 09b7e4d3

Nick Lewycky authored Nov 22, 2006

Remove predicate simplifier from default gcc3 pipeline. New design is too
slow to enable by default.
Add new testcases for problems encountered in development.

llvm-svn: 31895

09b7e4d3

Nov 21, 2006
- This xform is handled by FoldOpIntoPhi in visitCastInst in a more elegant way. · ec45a4c8
  Chris Lattner authored Nov 21, 2006
```
llvm-svn: 31889
```
  ec45a4c8
Nov 18, 2006

Do not convert massive blocks on phi nodes into select statements. Instead · 95adf8f1

Chris Lattner authored Nov 18, 2006

only do these transformations if there are a small number of phi's.
This speeds up Ptrdist/ks from 2.35s to 2.19s on my mac pro.

llvm-svn: 31853

95adf8f1

Nov 17, 2006

If an indvar with a variable stride is used by the exit condition, go ahead · 21eba2da

Chris Lattner authored Nov 17, 2006

and handle it like constant stride vars.  This fixes some bad codegen in
variable stride cases.  For example, it compiles this:

void foo(int k, int i) {
  for (k=i+i; k <= 8192; k+=i)
    flags2[k] = 0;
}

to:

LBB1_1: #bb.preheader
        movl %eax, %ecx
        addl %ecx, %ecx
        movl L_flags2$non_lazy_ptr, %edx
LBB1_2: #bb
        movb $0, (%edx,%ecx)
        addl %eax, %ecx
        cmpl $8192, %ecx
        jle LBB1_2      #bb
LBB1_5: #return
        ret

or (if the array is local and we are in dynamic-nonpic or static mode):

LBB3_2: #bb
        movb $0, _flags2(%ecx)
        addl %eax, %ecx
        cmpl $8192, %ecx
        jle LBB3_2      #bb

and:

        lis r2, ha16(L_flags2$non_lazy_ptr)
        lwz r2, lo16(L_flags2$non_lazy_ptr)(r2)
        slwi r3, r4, 1
LBB1_2: ;bb
        li r5, 0
        add r6, r4, r3
        stbx r5, r2, r3
        cmpwi cr0, r6, 8192
        bgt cr0, LBB1_5 ;return

instead of:

        leal (%eax,%eax,2), %ecx
        movl %eax, %edx
        addl %edx, %edx
        addl L_flags2$non_lazy_ptr, %edx
        xorl %esi, %esi
LBB1_2: #bb
        movb $0, (%edx,%esi)
        movl %eax, %edi
        addl %esi, %edi
        addl %ecx, %esi
        cmpl $8192, %esi
        jg LBB1_5       #return

and:

        lis r2, ha16(L_flags2$non_lazy_ptr)
        lwz r2, lo16(L_flags2$non_lazy_ptr)(r2)
        mulli r3, r4, 3
        slwi r5, r4, 1
        li r6, 0
        add r2, r2, r5
LBB1_2: ;bb
        li r5, 0
        add r7, r3, r6
        stbx r5, r2, r6
        add r6, r4, r6
        cmpwi cr0, r7, 8192
        ble cr0, LBB1_2 ;bb

This speeds up Benchmarks/Shootout/sieve from 8.533s to 6.464s and
implements LoopStrengthReduce/var_stride_used_by_compare.ll

llvm-svn: 31809

21eba2da

Nov 15, 2006
- Fix a gcc 4.2 warning. · e3a63d13
  Chris Lattner authored Nov 15, 2006
```
llvm-svn: 31751
```
  e3a63d13
Nov 14, 2006

implement InstCombine/shift-simplify.ll by transforming: · f05d69ae
Chris Lattner authored Nov 14, 2006
```
(X >> Z) op (Y >> Z)  -> (X op Y) >> Z

for all shifts and all ops={and/or/xor}.

llvm-svn: 31729
```
f05d69ae

implement InstCombine/and-compare.ll:test1. This compiles: · d12a4bf7

Chris Lattner authored Nov 14, 2006

typedef struct { unsigned prefix : 4; unsigned code : 4; unsigned unsigned_p : 4; } tree_common;
int foo(tree_common *a, tree_common *b) { return a->code == b->code; }

into:

_foo:
        movl 4(%esp), %eax
        movl 8(%esp), %ecx
        movl (%eax), %eax
        xorl (%ecx), %eax
        # TRUNCATE movb %al, %al
        shrb $4, %al
        testb %al, %al
        sete %al
        movzbl %al, %eax
        ret

instead of:

_foo:
        movl 8(%esp), %eax
        movb (%eax), %al
        shrb $4, %al
        movl 4(%esp), %ecx
        movb (%ecx), %cl
        shrb $4, %cl
        cmpb %al, %cl
        sete %al
        movzbl %al, %eax
        ret

saving one cycle by eliminating a shift.

llvm-svn: 31727

d12a4bf7

Nov 11, 2006
- Fix InstCombine/2006-11-10-ashr-miscompile.ll a miscompilation introduced · d4dee405
  Chris Lattner authored Nov 10, 2006
```
by the shr -> [al]shr patch.  This was reduced from 176.gcc.

llvm-svn: 31653
```
  d4dee405
Nov 10, 2006
- second patch to fix PR992/993. · 82928ca2
  Chris Lattner authored Nov 09, 2006
```
llvm-svn: 31610
```
  82928ca2
- Minimal patch to fix PR992/PR993 · 924f4fee
  Chris Lattner authored Nov 09, 2006
```
llvm-svn: 31608
```
  924f4fee
Nov 09, 2006
- Teach ShrinkDemandedConstant how to handle X+C. This implements: · 6e2c15c1
  Chris Lattner authored Nov 09, 2006
```
add.ll:test33, add.ll:test34, shift-sra.ll:test2

llvm-svn: 31586
```
  6e2c15c1
Nov 08, 2006
- reenable factoring of GEP expressions, being more precise about the · 4f218d56
  Chris Lattner authored Nov 08, 2006
```
case that it bad to do.

llvm-svn: 31563
```
  4f218d56
- make this code more efficient by not creating a phi node we are just going to · cd62f112
  Chris Lattner authored Nov 08, 2006
```
delete in the first place.  This also makes it simpler.

llvm-svn: 31562
```
  cd62f112
- Remove redundant <cmath>. · 61feeb90
  Jim Laskey authored Nov 08, 2006
```
llvm-svn: 31561
```
  61feeb90
- disable this factoring optzn for GEPs for now, this severely pessimizes some · a3acfca9
  Chris Lattner authored Nov 08, 2006
```
loops.

llvm-svn: 31560
```
  a3acfca9
- For PR950: · fdff938a
  Reid Spencer authored Nov 08, 2006
```
This patch converts the old SHR instruction into two instructions,
AShr (Arithmetic) and LShr (Logical). The Shr instructions now are not
dependent on the sign of their operands.

llvm-svn: 31542
```
  fdff938a
Nov 07, 2006

scalarrepl should not split the two elements of the vsiidx array: · 4967f6dd

Chris Lattner authored Nov 07, 2006

int func(vFloat v0, vFloat v1) {
        int ii;
        vSInt32 vsiidx[2];
        vsiidx[0] = _mm_cvttps_epi32(v0);
        vsiidx[1] = _mm_cvttps_epi32(v1);
        ii = ((int *) vsiidx)[4];
        return ii;
}

This fixes Transforms/ScalarRepl/2006-11-07-InvalidArrayPromote.ll

llvm-svn: 31524

4967f6dd