  1. Dec 12, 2006
  2. Dec 11, 2006
    • trunc to integer, not to FP. · e810140c
      Chris Lattner authored
      llvm-svn: 32426
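      As a minimal sketch (mine, not from the commit), the distinction the
      title refers to: narrowing an integer value is done with the integer
      trunc instruction, while narrowing a floating-point value must use
      fptrunc, never trunc.

      /* assumption: plain C, illustrative only */
      short narrow_int(int x)   { return (short)x; }  /* integer narrowing -> trunc            */
      float narrow_fp(double d) { return (float)d; }  /* FP narrowing -> fptrunc, never trunc  */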
    • implement promotion of unions containing two packed types of the same width. · 23f4b68f
      Chris Lattner authored
      This implements Transforms/ScalarRepl/union-packed.ll
      
      llvm-svn: 32422
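      As a hedged illustration (mine, not the referenced test), the kind of
      union this handles is two packed (vector) types of the same width
      overlaid in one union; scalarrepl can now promote such an alloca to a
      single bitcast instead of keeping it in a stack slot.

      /* assumption: GCC/Clang vector extensions stand in for LLVM packed types */
      typedef float v4sf __attribute__((vector_size(16)));
      typedef int   v4si __attribute__((vector_size(16)));

      union packed_union { v4sf f; v4si i; };

      v4sf roundtrip(v4si v) {
          union packed_union u;
          u.i = v;       /* store through one member...           */
          return u.f;    /* ...load through the other, same width */
      }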
    • * Eliminate calls to CastInst::createInferredCast. · 216c3028
      Chris Lattner authored
       * Add support for promoting unions with fp values in them.  This produces
         our new int<->fp bitcast instructions, implementing
         Transforms/ScalarRepl/union-fp-int.ll
      
      As an example, this allows us to compile this:
      
      union intfloat { int i; float f; };
      float invsqrt(const float arg_x) {
          union intfloat x = { .f = arg_x };
          const float xhalf = arg_x * 0.5f;
          x.i = 0x5f3759df - (x.i >> 1);
          return x.f * (1.5f - xhalf * x.f * x.f);
      }
      
      into:
      
      _invsqrt:
              movss 4(%esp), %xmm0
              movd %xmm0, %eax
              sarl %eax
              movl $1597463007, %ecx
              subl %eax, %ecx
              movd %ecx, %xmm1
              mulss LCPI1_0, %xmm0
              mulss %xmm1, %xmm0
              movss LCPI1_1, %xmm2
              mulss %xmm1, %xmm0
              subss %xmm0, %xmm2
              movl 8(%esp), %eax
              mulss %xmm2, %xmm1
              movss %xmm1, (%eax)
              ret
      
      instead of:
      
      _invsqrt:
              subl $4, %esp
              movss 8(%esp), %xmm0
              movss %xmm0, (%esp)
              movl (%esp), %eax
              movl $1597463007, %ecx
              sarl %eax
              subl %eax, %ecx
              movl %ecx, (%esp)
              mulss LCPI1_0, %xmm0
              movss (%esp), %xmm1
              mulss %xmm1, %xmm0
              mulss %xmm1, %xmm0
              movss LCPI1_1, %xmm2
              subss %xmm0, %xmm2
              mulss %xmm2, %xmm1
              movl 12(%esp), %eax
              movss %xmm1, (%eax)
              addl $4, %esp
              ret
      
      llvm-svn: 32418
  3. Dec 08, 2006
  4. Dec 07, 2006
  5. Dec 06, 2006
  6. Dec 05, 2006
  7. Dec 04, 2006
  8. Dec 02, 2006
  9. Dec 01, 2006
  10. Nov 30, 2006
  11. Nov 29, 2006
  12. Nov 28, 2006
  13. Nov 27, 2006
    • Fix PR1014 and InstCombine/2006-11-27-XorBug.ll. · 8e9a7b73
      Chris Lattner authored
      llvm-svn: 31941
    • For PR950: · 6c38f0bb
      Reid Spencer authored
      The long-awaited CAST patch. This introduces 12 new instructions into LLVM
      to replace the cast instruction. Corresponding changes throughout LLVM are
      provided. This passes llvm-test, llvm/test, and SPEC CPUINT2000, with the
      exception of 175.vpr, which fails only on a slight floating-point output
      difference.
      
      llvm-svn: 31931
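      The twelve cast opcodes introduced here, which are still LLVM's cast
      instruction set today, are trunc, zext, sext, fptrunc, fpext, fptoui,
      fptosi, uitofp, sitofp, ptrtoint, inttoptr, and bitcast. A hedged
      sketch (mine, not part of the patch) of how ordinary C conversions map
      onto them:

      #include <stdint.h>

      int16_t  c_trunc(int32_t x)     { return (int16_t)x; }   /* trunc    */
      uint64_t c_zext(uint32_t x)     { return x; }            /* zext     */
      int64_t  c_sext(int32_t x)      { return x; }            /* sext     */
      float    c_fptrunc(double x)    { return (float)x; }     /* fptrunc  */
      double   c_fpext(float x)       { return x; }            /* fpext    */
      int32_t  c_fptosi(double x)     { return (int32_t)x; }   /* fptosi   */
      uint32_t c_fptoui(double x)     { return (uint32_t)x; }  /* fptoui   */
      double   c_sitofp(int32_t x)    { return x; }            /* sitofp   */
      double   c_uitofp(uint32_t x)   { return x; }            /* uitofp   */
      intptr_t c_ptrtoint(void *p)    { return (intptr_t)p; }  /* ptrtoint */
      void    *c_inttoptr(intptr_t x) { return (void *)x; }    /* inttoptr */
      /* an int<->float reinterpretation becomes a bitcast once scalarrepl
         promotes the union, as in the invsqrt example above */
      float    c_bitcast(int32_t x)   { union { int32_t i; float f; } u;
                                        u.i = x; return u.f; } /* bitcast  */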
  14. Nov 26, 2006
  15. Nov 23, 2006
  16. Nov 21, 2006
  17. Nov 18, 2006
  18. Nov 17, 2006
    • If an indvar with a variable stride is used by the exit condition, go ahead · 21eba2da
      Chris Lattner authored
      and handle it like constant stride vars.  This fixes some bad codegen in
      variable stride cases.  For example, it compiles this:
      
      void foo(int k, int i) {
        for (k=i+i; k <= 8192; k+=i)
          flags2[k] = 0;
      }
      
      to:
      
      LBB1_1: #bb.preheader
              movl %eax, %ecx
              addl %ecx, %ecx
              movl L_flags2$non_lazy_ptr, %edx
      LBB1_2: #bb
              movb $0, (%edx,%ecx)
              addl %eax, %ecx
              cmpl $8192, %ecx
              jle LBB1_2      #bb
      LBB1_5: #return
              ret
      
      or (if the array is local and we are in dynamic-nonpic or static mode):
      
      LBB3_2: #bb
              movb $0, _flags2(%ecx)
              addl %eax, %ecx
              cmpl $8192, %ecx
              jle LBB3_2      #bb
      
      and:
      
              lis r2, ha16(L_flags2$non_lazy_ptr)
              lwz r2, lo16(L_flags2$non_lazy_ptr)(r2)
              slwi r3, r4, 1
      LBB1_2: ;bb
              li r5, 0
              add r6, r4, r3
              stbx r5, r2, r3
              cmpwi cr0, r6, 8192
              bgt cr0, LBB1_5 ;return
      
      instead of:
      
              leal (%eax,%eax,2), %ecx
              movl %eax, %edx
              addl %edx, %edx
              addl L_flags2$non_lazy_ptr, %edx
              xorl %esi, %esi
      LBB1_2: #bb
              movb $0, (%edx,%esi)
              movl %eax, %edi
              addl %esi, %edi
              addl %ecx, %esi
              cmpl $8192, %esi
              jg LBB1_5       #return
      
      and:
      
              lis r2, ha16(L_flags2$non_lazy_ptr)
              lwz r2, lo16(L_flags2$non_lazy_ptr)(r2)
              mulli r3, r4, 3
              slwi r5, r4, 1
              li r6, 0
              add r2, r2, r5
      LBB1_2: ;bb
              li r5, 0
              add r7, r3, r6
              stbx r5, r2, r6
              add r6, r4, r6
              cmpwi cr0, r7, 8192
              ble cr0, LBB1_2 ;bb
      
      This speeds up Benchmarks/Shootout/sieve from 8.533s to 6.464s and
      implements LoopStrengthReduce/var_stride_used_by_compare.ll
      
      llvm-svn: 31809
  19. Nov 15, 2006
  20. Nov 14, 2006