  1. Sep 28, 2004
    • Fold (and (setcc X, C1), (setcc X, C2)) · 623826c8
      Chris Lattner authored
      This is important for several reasons:
      
      1. Benchmarks have lots of code that looks like this (perlbmk in particular):
      
        %tmp.2.i = setne int %tmp.0.i, 128              ; <bool> [#uses=1]
        %tmp.6343 = seteq int %tmp.0.i, 1               ; <bool> [#uses=1]
        %tmp.63 = and bool %tmp.2.i, %tmp.6343          ; <bool> [#uses=1]
      
         we now fold away the setne, a clear improvement.
      
      2. In the more important cases, such as (X >= 10) & (X < 20), we now produce
         smaller code: (X-10) < 10.
      
      3. Perhaps the nicest effect of this patch is that it really helps out the
         code generators.  In particular, for a 'range test' like the above,
         instead of generating this on X86 (the difference on PPC is even more
         pronounced):
      
              cmp %EAX, 50
              setge %CL
              cmp %EAX, 100
              setl %AL
              and %CL, %AL
              cmp %CL, 0
      
         we now generate this:
      
              add %EAX, -50
              cmp %EAX, 50
      
         Furthermore, this causes setcc's to be folded into branches more often.
      
      These combinations trigger dozens of times in the spec benchmarks, particularly
      in 176.gcc, 186.crafty, 253.perlbmk, 254.gap, & 099.go.
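      As an illustration, here is a minimal C++ sketch of both folds; the
      function names and constants are illustrative, not from the patch:

        #include <cassert>
        #include <cstdint>

        // Point 1: (X != 128) & (X == 1). The equality test implies the
        // inequality, so the setne folds away entirely.
        bool case1(int32_t X)       { return X != 128 && X == 1; } // before
        bool case1Folded(int32_t X) { return X == 1; }             // after

        // Points 2/3: the range test (X >= 50) & (X < 100) becomes one
        // subtract plus one unsigned compare. Values below 50 wrap around
        // to huge unsigned numbers, so they fail the single compare.
        bool range(int32_t X)       { return X >= 50 && X < 100; } // before
        bool rangeFolded(int32_t X) {
          return (uint32_t)X - 50u < 50u;                          // after
        }

        int main() {
          for (int32_t X = -200; X <= 200; ++X) {
            assert(case1(X) == case1Folded(X));
            assert(range(X) == rangeFolded(X));
          }
        }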
      
      llvm-svn: 16559
    • Implement X / C1 / C2 folding · 272d5ca9
      Chris Lattner authored
      Implement (setcc (shl X, C1), C2) folding.
      
      The second one occurs several dozen times in spec.  The first was added
      just in case.  :)
      
      These are tested by shift.ll:test2[12] and div.ll:test5.
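      Both are plain arithmetic identities; a hedged C++ sketch (constants
      chosen for illustration, not taken from those tests):

        #include <cassert>
        #include <cstdint>

        // (X / C1) / C2  ==>  X / (C1 * C2), valid whenever C1 * C2 does
        // not overflow; truncating division keeps it exact for negative X.
        int32_t divTwice(int32_t X) { return (X / 3) / 5; }
        int32_t divOnce(int32_t X)  { return X / 15; }

        // (setcc (shl X, C1), C2): compare X directly, e.g. (X << 2) == 12
        // becomes X == 3, provided no set bits can be shifted out (which
        // uint8_t promoted to int guarantees here).
        bool cmpShifted(uint8_t X) { return (X << 2) == 12; }
        bool cmpDirect(uint8_t X)  { return X == 3; }

        int main() {
          for (int32_t X = -1000; X <= 1000; ++X)
            assert(divTwice(X) == divOnce(X));
          for (int X = 0; X < 256; ++X)
            assert(cmpShifted((uint8_t)X) == cmpDirect((uint8_t)X));
        }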
      
      llvm-svn: 16549
    • shl is always zero-extending, so always use a zero-extending shift right. · 6afc02f8
      Chris Lattner authored
      This latent bug was exposed by recent changes and is tested by:
      llvm/test/Regression/Transforms/InstCombine/2004-09-28-BadShiftAndSetCC.llx
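      The point, in C++ terms (my illustration, not the regression test):
      an arithmetic right shift would smear a set sign bit back in, while
      shl always fills the vacated bits with zeros:

        #include <cassert>
        #include <cstdint>

        int main() {
          uint32_t X = 0x90000000u;   // top bit set, as shl can produce
          // A logical (zero-extending) shift right matches what shl filled in:
          assert((X >> 4) == 0x09000000u);
          // An arithmetic shift replicates the sign bit instead (signed >>
          // is arithmetic on essentially all targets):
          assert((uint32_t)((int32_t)X >> 4) == 0xF9000000u);
        }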
      
      llvm-svn: 16546
    • Pull assignment out of for loop conditional · 3ce42ec7
      Alkis Evlogimenos authored
      Pull the assignment out of the for loop conditional so that this
      compiles under Windows. Patch contributed by Paolo Invernizzi!
      
      llvm-svn: 16534
  2. Sep 21, 2004
    • Do not fold (X + C1 != C2) if there are other users of the add · b121ae1c
      Chris Lattner authored
      Doing this transformation used to take a loop like this:
      
      int Array[1000];
      void test(int X) {
        int i;
        for (i = 0; i < 1000; ++i)
          Array[i] += X;
      }
      
      Compiled to LLVM, this is:
      
      no_exit:                ; preds = %entry, %no_exit
              %indvar = phi uint [ 0, %entry ], [ %indvar.next, %no_exit ]            ; <uint> [#uses=2]
              %tmp.4 = getelementptr [1000 x int]* %Array, int 0, uint %indvar                ; <int*> [#uses=2]
              %tmp.7 = load int* %tmp.4               ; <int> [#uses=1]
              %tmp.9 = add int %tmp.7, %X             ; <int> [#uses=1]
              store int %tmp.9, int* %tmp.4
      ***     %indvar.next = add uint %indvar, 1              ; <uint> [#uses=2]
      ***     %exitcond = seteq uint %indvar.next, 1000               ; <bool> [#uses=1]
              br bool %exitcond, label %return, label %no_exit
      
      and turn it into a loop like this:
      
      no_exit:                ; preds = %entry, %no_exit
              %indvar = phi uint [ 0, %entry ], [ %indvar.next, %no_exit ]            ; <uint> [#uses=3]
              %tmp.4 = getelementptr [1000 x int]* %Array, int 0, uint %indvar                ; <int*> [#uses=2]
              %tmp.7 = load int* %tmp.4               ; <int> [#uses=1]
              %tmp.9 = add int %tmp.7, %X             ; <int> [#uses=1]
              store int %tmp.9, int* %tmp.4
      ***     %indvar.next = add uint %indvar, 1              ; <uint> [#uses=1]
      ***     %exitcond = seteq uint %indvar, 999             ; <bool> [#uses=1]
              br bool %exitcond, label %return, label %no_exit
      
      Note that indvar.next and indvar can no longer be coalesced. In machine
      code terms, this patch changes this code:
      
      .LBBtest_1:     # no_exit
              mov %EDX, OFFSET Array
              mov %ESI, %EAX
              add %ESI, DWORD PTR [%EDX + 4*%ECX]
              mov %EDX, OFFSET Array
              mov DWORD PTR [%EDX + 4*%ECX], %ESI
              mov %EDX, %ECX
              inc %EDX
              cmp %ECX, 999
              mov %ECX, %EDX
              jne .LBBtest_1  # no_exit
      
      into this:
      
      .LBBtest_1:     # no_exit
              mov %EDX, OFFSET Array
              mov %ESI, %EAX
              add %ESI, DWORD PTR [%EDX + 4*%ECX]
              mov %EDX, OFFSET Array
              mov DWORD PTR [%EDX + 4*%ECX], %ESI
              inc %ECX
              cmp %ECX, 1000
              jne .LBBtest_1  # no_exit
      
      We need better instruction selection to get this:
      
      .LBBtest_1:     # no_exit
        add DWORD PTR [Array + 4*%ECX], %EAX
              inc %ECX
              cmp %ECX, 1000
              jne .LBBtest_1  # no_exit
      
      ... but at least there is less register juggling.
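      For reference, the fold being restricted is the identity
      (X + C1) == C2  <=>  X == (C2 - C1); a small C++ sketch (mine, not
      from the patch) of the exit-test rewrite shown above:

        #include <cassert>
        #include <cstdint>

        // seteq (add %indvar, 1), 1000 becomes seteq %indvar, 999.
        // Unsigned wraparound makes the identity exact -- but when the
        // add is still needed elsewhere (the phi above), the fold keeps
        // both %indvar and %indvar.next live and costs a register.
        int main() {
          for (uint32_t X = 0; X < 2000; ++X)
            assert(((X + 1) == 1000) == (X == 999));
        }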
      
      llvm-svn: 16473