- Nov 18, 2006
-
Chris Lattner authored
This speeds up yacr2 by 7% on a core2. llvm-svn: 31856
-
Chris Lattner authored
    if (cond) goto BB2
    BB1:
        ...
        return;
    BB2:
        ...

Move BB1 to the end of the function so that the code falls through in the non-return case. This has the effect of moving assert (and other no-return call) bodies and return blocks out of loops. llvm-svn: 31855
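A rough sketch of that reordering idea, using made-up types rather than LLVM's actual MachineFunction API: blocks whose only job is to reach a return (for example an assert-failure path) are moved to the end of the block list so the common path falls straight through.

    #include <string>
    #include <vector>

    // Hypothetical stand-in for a machine basic block; not LLVM's data structure.
    struct Block {
      std::string name;
      bool coldReturnPath;  // e.g. a no-return call body or a bare return block
    };

    // Move cold return-only blocks to the end so the hot path falls through.
    std::vector<Block> layoutBlocks(const std::vector<Block> &blocks) {
      std::vector<Block> hot, cold;
      for (const Block &b : blocks)
        (b.coldReturnPath ? cold : hot).push_back(b);
      hot.insert(hot.end(), cold.begin(), cold.end());
      return hot;
    }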
-
Chris Lattner authored
only do these transformations if there are a small number of PHIs. This speeds up Ptrdist/ks from 2.35s to 2.19s on my Mac Pro. llvm-svn: 31853
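A minimal sketch of the kind of guard described above; the cutoff value here is an assumption of mine, not the one the pass actually uses.

    // Hypothetical guard: skip the simplification when the target block carries
    // too many PHI nodes, since each PHI adds rewriting work.
    constexpr unsigned MaxPHIsToRewrite = 4;  // assumed threshold, not LLVM's

    bool worthSimplifying(unsigned numPHIsInTarget) {
      return numPHIsInTarget <= MaxPHIsToRewrite;
    }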
-
Reid Spencer authored
llvm-svn: 31841
-
Chris Lattner authored
Also, valist should create a pointer RC reg class value, not a GPRC value. llvm-svn: 31840
-
Chris Lattner authored
llvm-svn: 31839
-
Chris Lattner authored
llvm-svn: 31838
-
Chris Lattner authored
The algorithm it used before wasn't 100% correct; we now use an iterative expansion model. This fixes assembler errors when compiling 403.gcc with tail merging enabled.

Change the way the branch selector works overall: the isel now generates PPC::BCC instructions directly (as it used to), and these BCC instructions are emitted to the output or jitted directly if branches don't need expansion. Only if branches need expansion are instructions rewritten and created. This should make branch selection faster, and it eliminates the Bxx instructions from the .td file. llvm-svn: 31837
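A sketch of the expansion step, under the assumption that a PPC conditional branch encodes a signed 16-bit byte displacement while an unconditional branch reaches much farther; the function and operand names are illustrative, not taken from the patch.

    #include <cstdint>

    // Can a conditional branch reach the target directly?
    bool fitsInConditionalBranch(int64_t byteOffset) {
      return byteOffset >= -32768 && byteOffset <= 32767;
    }

    // When it cannot, a selector along these lines rewrites
    //   bcc  <cond>, target
    // into
    //   bcc  <inverted cond>, next   ; short hop over the long branch
    //   b    target                  ; unconditional branch with far greater reach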
-
Chris Lattner authored
issues to the ground. llvm-svn: 31836
-
- Nov 17, 2006
-
Chris Lattner authored
value and CR reg #. This requires swapping the order of these everywhere that touches BCC and requires us to write custom matching logic for PPCcondbranch :( llvm-svn: 31835
-
Chris Lattner authored
llvm-svn: 31834
-
Chris Lattner authored
llvm-svn: 31833
-
Evan Cheng authored
clearing the upper 8 bits instead of issuing two instructions. This also eliminates the need to target the AH register, which can be problematic on x86-64. llvm-svn: 31832
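My reading of this entry (an assumption, since its first line is cut off above) is the usual trick of pulling the high byte out of a 16-bit value with a shift instead of naming AH, because AH/BH/CH/DH cannot be encoded in instructions that carry a REX prefix on x86-64.

    #include <cstdint>

    // Extract the high byte of a 16-bit value (e.g. the remainder left in bits
    // 15..8 after an 8-bit divide) with a shift, avoiding any reference to AH.
    uint8_t highByte(uint16_t ax) {
      return static_cast<uint8_t>(ax >> 8);
    }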
-
Jim Laskey authored
llvm-svn: 31830
-
Jim Laskey authored
llvm-svn: 31828
-
Jim Laskey authored
2. Offsets on 64-bit stores are still in bytes. llvm-svn: 31824
-
Jim Laskey authored
llvm-svn: 31823
-
Jim Laskey authored
llvm-svn: 31822
-
Bill Wendling authored
llvm-svn: 31819
-
Bill Wendling authored
soon replace all uses of those objects. llvm-svn: 31817
-
Bill Wendling authored
llvm-svn: 31816
-
Bill Wendling authored
llvm-svn: 31815
-
Bill Wendling authored
llvm-svn: 31814
-
Bill Wendling authored
llvm-svn: 31813
-
Bill Wendling authored
llvm-svn: 31812
-
Bill Wendling authored
llvm-svn: 31811
-
Bill Wendling authored
llvm-svn: 31810
-
Chris Lattner authored
and handle it like constant stride vars. This fixes some bad codegen in variable stride cases. For example, it compiles this:

    void foo(int k, int i) {
      for (k=i+i; k <= 8192; k+=i)
        flags2[k] = 0;
    }

to:

    LBB1_1: #bb.preheader
            movl %eax, %ecx
            addl %ecx, %ecx
            movl L_flags2$non_lazy_ptr, %edx
    LBB1_2: #bb
            movb $0, (%edx,%ecx)
            addl %eax, %ecx
            cmpl $8192, %ecx
            jle LBB1_2 #bb
    LBB1_5: #return
            ret

or (if the array is local and we are in dynamic-nonpic or static mode):

    LBB3_2: #bb
            movb $0, _flags2(%ecx)
            addl %eax, %ecx
            cmpl $8192, %ecx
            jle LBB3_2 #bb

and:

            lis r2, ha16(L_flags2$non_lazy_ptr)
            lwz r2, lo16(L_flags2$non_lazy_ptr)(r2)
            slwi r3, r4, 1
    LBB1_2: ;bb
            li r5, 0
            add r6, r4, r3
            stbx r5, r2, r3
            cmpwi cr0, r6, 8192
            bgt cr0, LBB1_5 ;return

instead of:

            leal (%eax,%eax,2), %ecx
            movl %eax, %edx
            addl %edx, %edx
            addl L_flags2$non_lazy_ptr, %edx
            xorl %esi, %esi
    LBB1_2: #bb
            movb $0, (%edx,%esi)
            movl %eax, %edi
            addl %esi, %edi
            addl %ecx, %esi
            cmpl $8192, %esi
            jg LBB1_5 #return

and:

            lis r2, ha16(L_flags2$non_lazy_ptr)
            lwz r2, lo16(L_flags2$non_lazy_ptr)(r2)
            mulli r3, r4, 3
            slwi r5, r4, 1
            li r6, 0
            add r2, r2, r5
    LBB1_2: ;bb
            li r5, 0
            add r7, r3, r6
            stbx r5, r2, r6
            add r6, r4, r6
            cmpwi cr0, r7, 8192
            ble cr0, LBB1_2 ;bb

This speeds up Benchmarks/Shootout/sieve from 8.533s to 6.464s and implements LoopStrengthReduce/var_stride_used_by_compare.ll llvm-svn: 31809
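As a source-level picture of what the strength reduction above achieves (the C++ rendering is mine; only the loop itself comes from the message): the address flags2 + k is carried in a single pointer that is bumped by the variable stride i, instead of being recomputed from k on every iteration.

    // Assumes flags2 has at least 8193 bytes, as the k <= 8192 bound requires.
    extern char flags2[8193];

    void foo_reduced(int i) {
      // p plays the role of &flags2[k] and advances by the variable stride i.
      for (char *p = flags2 + 2 * i; p <= flags2 + 8192; p += i)
        *p = 0;
    }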
-
Bill Wendling authored
llvm-svn: 31806
-
Chris Lattner authored
llvm-svn: 31805
-
Bill Wendling authored
a #include of iostream. llvm-svn: 31800
-
Chris Lattner authored
llvm-svn: 31799
-
Bill Wendling authored
stream. It centralizes the use of std::cerr so that static c'tors/d'tors aren't scattered all over the place. The way to use it is like this:

    DOUT << "This is a status line: " << Var << "\n";

If "-debug" is specified, it will print; otherwise it will not. If NDEBUG is defined, DOUT does nothing. llvm-svn: 31798
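A minimal sketch of how a macro with these semantics could be put together; this is not claimed to be LLVM's actual definition, and DebugFlag is assumed to be a global set by the "-debug" option.

    #include <iostream>

    extern bool DebugFlag;  // assumed: true when "-debug" was given

    #ifdef NDEBUG
    #define DOUT if (false) std::cerr              // whole statement is dead code
    #else
    #define DOUT if (!DebugFlag) {} else std::cerr
    #endif

With either definition, a line such as DOUT << "x = " << x << "\n"; compiles to nothing in NDEBUG builds (the dead branch is removed) and to a single flag test otherwise.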
-
Chris Lattner authored
llvm-svn: 31797
-
Evan Cheng authored
Correct instructions for moving data between GR64 and SSE registers; also correct load i64 / store i64 from v2i64. llvm-svn: 31795
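For a C++-level view of the same data movement, the SSE2 intrinsics below compile to MOVQ-style transfers between a 64-bit GPR and an XMM register on x86-64; this is only an illustration of the operation the message names, not the backend change itself.

    #include <cstdint>
    #include <emmintrin.h>  // SSE2; the 64-bit forms require an x86-64 target

    // GR64 -> low 64 bits of an XMM register (upper 64 bits zeroed).
    __m128i gprToXmm(int64_t v) { return _mm_cvtsi64_si128(v); }

    // Low 64 bits of an XMM register -> GR64.
    int64_t xmmToGpr(__m128i x) { return _mm_cvtsi128_si64(x); }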
-
Evan Cheng authored
llvm-svn: 31794
-
- Nov 16, 2006
-
Jim Laskey authored
This is a general cleanup of the PowerPC ABI. It addresses several problems and bugs, including making sure that the TOS links back to the previous frame, that the maximum call frame size is not included twice when using frame pointers, and that the frame is no longer grown on calls; it also eliminates the double storing of SP and provides a cleaner/faster dynamic alloca. llvm-svn: 31792
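To make the "included twice" point concrete, here is illustrative-only arithmetic (the names are mine): the outgoing-argument area is folded into the frame exactly once, in the prologue, rather than again at each call site.

    #include <cstdint>

    uint64_t alignTo(uint64_t value, uint64_t alignment) {
      return (value + alignment - 1) & ~(alignment - 1);
    }

    // One allocation covering locals plus the largest outgoing-call area.
    uint64_t totalFrameSize(uint64_t localArea, uint64_t maxCallFrame,
                            uint64_t alignment) {
      return alignTo(localArea + maxCallFrame, alignment);
    }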
-
Chris Lattner authored
before printing it. llvm-svn: 31791
-
Evan Cheng authored
llvm-svn: 31790
-
Bill Wendling authored
llvm-svn: 31789
-