  1. Nov 02, 2006
    • For PR786: · de46e484
      Reid Spencer authored
      Turn on -Wunused and -Wno-unused-parameter. Clean up most of the resulting
      fallout by removing unused variables. Remaining warnings have to do with
      unused functions (I didn't want to delete code without review) and unused
      variables in generated code. Maintainers should clean up the remaining
      issues when they see them. All changes pass DejaGnu tests and Olden.
      
      llvm-svn: 31380
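
      As a reference point, here is a minimal illustration of what these flags report
      (a made-up file, not from the LLVM tree): -Wunused flags the dead local below,
      while -Wno-unused-parameter keeps the unused parameter quiet.

      // unused_demo.cpp -- hypothetical example; compile with:
      //   c++ -Wunused -Wno-unused-parameter -c unused_demo.cpp
      int frobnicate(int value, int unused_param) {  // silent: parameter warnings are off
        int leftover = value * 2;                    // warning: unused variable 'leftover'
        return value + 1;
      }
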
    • For PR950: · 7eb55b39
      Reid Spencer authored
      Replace the REM instruction with UREM, SREM and FREM.
      
      llvm-svn: 31369
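
      Signed, unsigned, and floating-point remainder are three different computations,
      which is what the three new opcodes now spell out. A standalone C++ illustration
      (not LLVM code; assumes 32-bit int) of the behaviours they distinguish:

      // rem_demo.cpp -- the same bit pattern yields different remainders depending on
      // whether it is treated as signed, unsigned, or floating point.
      #include <cmath>
      #include <cstdio>

      int main() {
        int      sa = -7, sb = 3;
        unsigned ua = 0xFFFFFFF9u, ub = 3;  // same 32-bit pattern as -7
        double   fa = -7.0, fb = 3.0;

        std::printf("srem-like: %d\n", sa % sb);            // -1
        std::printf("urem-like: %u\n", ua % ub);            //  0 (differs on the same bits)
        std::printf("frem-like: %f\n", std::fmod(fa, fb));  // -1.000000
        return 0;
      }
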
    • There can be more than one PHINode at the start of the block. · 2cb4f83b
      Devang Patel authored
      llvm-svn: 31362
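
      For reference, the usual way to visit every PHI node at the top of a block, sketched
      against the present-day LLVM C++ API rather than the 2006 tree (the function name is
      made up):

      #include "llvm/IR/BasicBlock.h"
      #include "llvm/IR/Instructions.h"

      using namespace llvm;

      // Process every PHI node at the start of BB, not just the first one.
      // PHI nodes are always grouped at the top of a block, so the walk can
      // stop at the first non-PHI instruction.
      static void forEachLeadingPHI(BasicBlock &BB) {
        for (Instruction &I : BB) {
          auto *PN = dyn_cast<PHINode>(&I);
          if (!PN)
            break;               // first non-PHI ends the group
          (void)PN;              // ... real handling of *PN would go here ...
        }
      }
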
  2. Nov 01, 2006
  3. Oct 28, 2006
  4. Oct 26, 2006
  5. Oct 25, 2006
  6. Oct 24, 2006
  7. Oct 23, 2006
  8. Oct 22, 2006
  9. Oct 20, 2006
  10. Oct 19, 2006
  11. Oct 17, 2006
  12. Oct 16, 2006
  13. Oct 12, 2006
  14. Oct 09, 2006
  15. Oct 05, 2006
    • add a new SimplifyDemandedVectorElts method, which works similarly to · 2deeaeac
      Chris Lattner authored
      SimplifyDemandedBits.  The idea is that some operations can be simplified if
      not all of the computed elements are needed.  Some targets (like x86) have a
      large number of intrinsics that operate on a single element but pass the other
      elements through unmodified.  If those other elements are not needed, the
      intrinsics can be simplified to scalar operations, and insertelement ops can
      be removed (a standalone sketch of the demanded-elements idea appears at the
      end of this entry).
      
      This turns, for example:
      
      ushort %Convert_sse(float %f) {
              %tmp = insertelement <4 x float> undef, float %f, uint 0                ; <<4 x float>> [#uses=1]
              %tmp10 = insertelement <4 x float> %tmp, float 0.000000e+00, uint 1             ; <<4 x float>> [#uses=1]
              %tmp11 = insertelement <4 x float> %tmp10, float 0.000000e+00, uint 2           ; <<4 x float>> [#uses=1]
              %tmp12 = insertelement <4 x float> %tmp11, float 0.000000e+00, uint 3           ; <<4 x float>> [#uses=1]
              %tmp28 = tail call <4 x float> %llvm.x86.sse.sub.ss( <4 x float> %tmp12, <4 x float> < float 1.000000e+00, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > )               ; <<4 x float>> [#uses=1]
              %tmp37 = tail call <4 x float> %llvm.x86.sse.mul.ss( <4 x float> %tmp28, <4 x float> < float 5.000000e-01, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > )               ; <<4 x float>> [#uses=1]
              %tmp48 = tail call <4 x float> %llvm.x86.sse.min.ss( <4 x float> %tmp37, <4 x float> < float 6.553500e+04, float 0.000000e+00, float 0.000000e+00, float 0.000000e+00 > )               ; <<4 x float>> [#uses=1]
              %tmp59 = tail call <4 x float> %llvm.x86.sse.max.ss( <4 x float> %tmp48, <4 x float> zeroinitializer )          ; <<4 x float>> [#uses=1]
              %tmp = tail call int %llvm.x86.sse.cvttss2si( <4 x float> %tmp59 )              ; <int> [#uses=1]
              %tmp69 = cast int %tmp to ushort                ; <ushort> [#uses=1]
              ret ushort %tmp69
      }
      
      into:
      
      ushort %Convert_sse(float %f) {
      entry:
              %tmp28 = sub float %f, 1.000000e+00             ; <float> [#uses=1]
              %tmp37 = mul float %tmp28, 5.000000e-01         ; <float> [#uses=1]
              %tmp375 = insertelement <4 x float> undef, float %tmp37, uint 0         ; <<4 x float>> [#uses=1]
              %tmp48 = tail call <4 x float> %llvm.x86.sse.min.ss( <4 x float> %tmp375, <4 x float> < float 6.553500e+04, float undef, float undef, float undef > )           ; <<4 x float>> [#uses=1]
              %tmp59 = tail call <4 x float> %llvm.x86.sse.max.ss( <4 x float> %tmp48, <4 x float> < float 0.000000e+00, float undef, float undef, float undef > )            ; <<4 x float>> [#uses=1]
              %tmp = tail call int %llvm.x86.sse.cvttss2si( <4 x float> %tmp59 )              ; <int> [#uses=1]
              %tmp69 = cast int %tmp to ushort                ; <ushort> [#uses=1]
              ret ushort %tmp69
      }
      
      which improves codegen from:
      
      _Convert_sse:
              movss LCPI1_0, %xmm0
              movss 4(%esp), %xmm1
              subss %xmm0, %xmm1
              movss LCPI1_1, %xmm0
              mulss %xmm0, %xmm1
              movss LCPI1_2, %xmm0
              minss %xmm0, %xmm1
              xorps %xmm0, %xmm0
              maxss %xmm0, %xmm1
              cvttss2si %xmm1, %eax
              andl $65535, %eax
              ret
      
      to:
      
      _Convert_sse:
              movss 4(%esp), %xmm0
              subss LCPI1_0, %xmm0
              mulss LCPI1_1, %xmm0
              movss LCPI1_2, %xmm1
              minss %xmm1, %xmm0
              xorps %xmm1, %xmm1
              maxss %xmm1, %xmm0
              cvttss2si %xmm0, %eax
              andl $65535, %eax
              ret
      
      
      This is just a first step; it can be extended in many ways.  Testcase here:
      Transforms/InstCombine/vec_demanded_elts.ll
      
      llvm-svn: 30752
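
      A standalone sketch of the demanded-elements idea (plain C++, not the actual
      InstCombine code; every name below is made up). Each vector lane is one bit in a
      mask; an insertelement whose lane is never demanded is dropped, and a lane that is
      overwritten is cleared from the mask handed to the chain's input:

      // demanded_elts_sketch.cpp -- toy model of demanded-element propagation
      // through a chain of insertelement operations.
      #include <bitset>
      #include <cstdio>
      #include <memory>
      #include <string>

      struct VecValue {
        std::string scalar;               // name of the inserted scalar
        unsigned lane;                    // lane the scalar goes into
        std::shared_ptr<VecValue> input;  // value inserted into (null = undef base)
      };

      // Walk the insertelement chain, keeping only inserts into demanded lanes.
      std::shared_ptr<VecValue> simplifyDemanded(std::shared_ptr<VecValue> V,
                                                 std::bitset<4> demanded) {
        if (!V)
          return V;                                 // undef base: nothing to simplify
        if (!demanded.test(V->lane))                // lane never read: drop this insert
          return simplifyDemanded(V->input, demanded);
        demanded.reset(V->lane);                    // the input no longer feeds this lane
        auto kept = std::make_shared<VecValue>(*V);
        kept->input = simplifyDemanded(V->input, demanded);
        return kept;
      }

      int main() {
        // Mirrors the testcase above: %f into lane 0, then zeros into lanes 1-3.
        std::shared_ptr<VecValue> undef;
        auto t   = std::make_shared<VecValue>(VecValue{"%f",   0, undef});
        auto t10 = std::make_shared<VecValue>(VecValue{"zero", 1, t});
        auto t11 = std::make_shared<VecValue>(VecValue{"zero", 2, t10});
        auto t12 = std::make_shared<VecValue>(VecValue{"zero", 3, t11});

        // The scalar SSE intrinsics only read lane 0, so only bit 0 is demanded.
        auto simplified = simplifyDemanded(t12, std::bitset<4>("0001"));
        for (auto V = simplified; V; V = V->input)
          std::printf("insert %s into lane %u\n", V->scalar.c_str(), V->lane);
        // Prints a single line: the inserts of zero into lanes 1-3 have been dropped.
        return 0;
      }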