- Nov 20, 2006
  - Jim Laskey authored (llvm-svn: 31883)
  - Chris Lattner authored (llvm-svn: 31882)
  - Chris Lattner authored: features autodetected. This fixes PR1010 and Regression/CodeGen/X86/xmm-r64.ll on non-x86-64 hosts. (llvm-svn: 31879)
  - Chris Lattner authored (llvm-svn: 31878)
  - Chris Lattner authored: instruction. Doing so breaks the FP stackifier, the alpha branch selector, and the sparc fpmover. This fixes PR1012 and CodeGen/X86/fp-stack-compare.ll (llvm-svn: 31876)
  - Anton Korobeynikov authored (llvm-svn: 31874)
  - Reid Spencer authored: First in a series of patches to convert SetCondInst into ICmpInst and FCmpInst, using only two opcodes and having the instructions contain their predicate value. Nothing uses these classes yet; more patches to follow. (A sketch of the predicate-carrying compare idea appears after this day's entries.) (llvm-svn: 31867)
  - Reid Spencer authored: Retain the signedness of the old integer types in a new TypeInfo structure so that it can be used in the grammar to implement auto-upgrade of things that depended on the signedness of types. This doesn't implement any new functionality in the AsmParser; it's just plumbing for future changes. (A sketch of what such a structure could look like appears after this day's entries.) (llvm-svn: 31866)
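For context on the SetCondInst conversion above (llvm-svn: 31867): instead of a separate opcode per comparison (setlt, setgt, setle, ...), the two new instruction classes carry a predicate operand that selects the comparison, including whether it is signed or unsigned. Below is a minimal sketch of that predicate-carrying representation; the enum values, field names, and class shape are illustrative stand-ins, not the actual LLVM ICmpInst/FCmpInst headers.

```cpp
#include <cstdio>

// Illustrative only: one compare "instruction" identified by a single opcode
// plus a predicate field, instead of a separate opcode per comparison
// (setlt, setgt, setle, ...), which is the shape ICmpInst/FCmpInst take.
enum class IntPredicate { EQ, NE, SLT, SLE, SGT, SGE, ULT, ULE, UGT, UGE };

struct ICmpLike {
  IntPredicate Pred; // which comparison to perform
  long LHS, RHS;     // stand-ins for the instruction's two operands

  bool evaluate() const {
    switch (Pred) {
    case IntPredicate::EQ:  return LHS == RHS;
    case IntPredicate::NE:  return LHS != RHS;
    case IntPredicate::SLT: return LHS <  RHS;
    case IntPredicate::SLE: return LHS <= RHS;
    case IntPredicate::SGT: return LHS >  RHS;
    case IntPredicate::SGE: return LHS >= RHS;
    // Unsigned predicates reinterpret the same bits as unsigned values; the
    // predicate now carries the information the old signed/unsigned types did.
    case IntPredicate::ULT: return (unsigned long)LHS <  (unsigned long)RHS;
    case IntPredicate::ULE: return (unsigned long)LHS <= (unsigned long)RHS;
    case IntPredicate::UGT: return (unsigned long)LHS >  (unsigned long)RHS;
    case IntPredicate::UGE: return (unsigned long)LHS >= (unsigned long)RHS;
    }
    return false;
  }
};

int main() {
  // -1 < 1 as a signed comparison, but not when the same bits compare unsigned.
  ICmpLike S{IntPredicate::SLT, -1, 1};
  ICmpLike U{IntPredicate::ULT, -1, 1};
  std::printf("slt: %d  ult: %d\n", S.evaluate(), U.evaluate());
  return 0;
}
```

Because the predicate, rather than the operand types, now says whether a comparison is signed, the signedness that used to live in the integer types can eventually be dropped; preserving that same information during parsing is what the TypeInfo plumbing below is for.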
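And for the AsmParser TypeInfo plumbing (llvm-svn: 31866): the old assembly syntax spelled signedness in the type names themselves (sbyte/ubyte, int/uint, ...), so an auto-upgrade path needs to remember that signedness alongside the parsed type. A hedged sketch of what such a record might look like follows; the struct, enum, and helper names are hypothetical, not the actual parser code.

```cpp
#include <cstdio>
#include <string>

// Illustrative stand-in for an LLVM type; the real parser tracks Type*.
struct Type;

// Whether the original (pre-upgrade) assembly spelled a type as signed,
// unsigned, or signless.
enum class Signedness { Signless, Signed, Unsigned };

// The parsed type plus the signedness the old syntax implied. Nothing would
// consume this yet; it is plumbing so later auto-upgrade rules can know,
// for example, that "ubyte" meant an unsigned 8-bit integer.
struct TypeInfo {
  const Type *Ty = nullptr;
  Signedness Sign = Signedness::Signless;
};

// The kind of mapping a grammar might perform for an old type keyword
// (hypothetical helper, for illustration only).
Signedness signednessOfOldKeyword(const std::string &Kw) {
  if (Kw == "sbyte" || Kw == "short" || Kw == "int" || Kw == "long")
    return Signedness::Signed;
  if (Kw == "ubyte" || Kw == "ushort" || Kw == "uint" || Kw == "ulong")
    return Signedness::Unsigned;
  return Signedness::Signless;
}

int main() {
  TypeInfo TI;                               // Ty omitted for the sketch
  TI.Sign = signednessOfOldKeyword("ubyte");
  std::printf("ubyte unsigned: %d\n", TI.Sign == Signedness::Unsigned);
  return 0;
}
```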
- Nov 18, 2006
  - Chris Lattner authored: simplified before we do this xform so that our cost model is accurate. (llvm-svn: 31864)
  - Chris Lattner authored (llvm-svn: 31861)
  - Chris Lattner authored (llvm-svn: 31857)
  - Chris Lattner authored: This speeds up yacr2 by 7% on a core2. (llvm-svn: 31856)
  - Chris Lattner authored:

        if (cond) goto BB2
        BB1:
          ...
          return;
        BB2:
          ...

    Move BB1 to the end of the function so that the code falls through in the non-return case. This has the effect of moving assert (and other no-return call) bodies and return blocks out of loops. (A small example of the kind of loop this helps appears after this day's entries.) (llvm-svn: 31855)
  - Chris Lattner authored: only do these transformations if there are a small number of phis. This speeds up Ptrdist/ks from 2.35s to 2.19s on my Mac Pro. (llvm-svn: 31853)
  - Reid Spencer authored (llvm-svn: 31841)
  - Chris Lattner authored: Also, valist should create a pointer RC reg class value, not a GPRC value. (llvm-svn: 31840)
  - Chris Lattner authored (llvm-svn: 31839)
  - Chris Lattner authored (llvm-svn: 31838)
  - Chris Lattner authored: The algorithm it used before wasn't 100% correct; we now use an iterative expansion model. This fixes assembler errors when compiling 403.gcc with tail merging enabled. Change the way the branch selector works overall: now the isel generates PPC::BCC instructions directly (as it used to), and these BCC instructions are emitted to the output or JITed directly if branches don't need expansion. Only if branches need expansion are instructions rewritten and created. This should make branch selection faster, and it eliminates the Bxx instructions from the .td file. (A sketch of the iterative expansion idea appears after this day's entries.) (llvm-svn: 31837)
  - Chris Lattner authored: issues to the ground. (llvm-svn: 31836)
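To make the block-placement change above (llvm-svn: 31855) concrete: the code it helps is a hot loop containing a call that never returns, such as the failure path of an assert. Placing the no-return block and the return block at the end of the function lets the common case fall through inside the loop. A minimal C++ illustration of that shape of code follows; the function and its names are made up for the example, and the transformation itself happens on the machine-level CFG, not in source.

```cpp
#include <cassert>
#include <cstddef>

// A hot loop with a cold, no-return path: if the assert fires, a no-return
// call (the assert-failure handler) is reached. With the layout change above,
// the block containing that call is placed at the end of the function, so the
// common "assert passed" path falls straight through inside the loop.
void doubleAll(int *a, std::size_t n) {
  for (std::size_t i = 0; i != n; ++i) {
    assert(a[i] >= 0 && "unexpected negative element");
    a[i] *= 2;
  }
}

int main() {
  int data[4] = {1, 2, 3, 4};
  doubleAll(data, 4);
  return data[0] == 2 ? 0 : 1;
}
```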
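And for the branch selector rewrite (llvm-svn: 31837): "iterative expansion" means repeatedly measuring block offsets and widening any conditional branch whose target lies outside its reach, because widening one branch can push another branch out of range. The sketch below is a generic, simplified model of that fixed-point loop, not the actual PPCBranchSelector; the block structure, instruction sizes, and branch range are made-up numbers for illustration.

```cpp
#include <cstddef>
#include <cstdio>
#include <vector>

// Simplified model: each block has a body size in bytes plus an optional
// conditional branch to another block. A short branch only reaches +/- RANGE
// bytes; an out-of-range branch must be expanded to a longer form, which
// grows the block and can push other branches out of range in turn, hence
// the iteration until nothing changes.
struct Block {
  int BodyBytes;    // bytes of non-branch instructions
  int BranchTarget; // index of the target block, or -1 if none
  bool Expanded;    // has the branch been rewritten to the long form?
};

const int SHORT_BR = 4, LONG_BR = 8, RANGE = 32; // made-up sizes and range

int blockSize(const Block &B) {
  if (B.BranchTarget < 0) return B.BodyBytes;
  return B.BodyBytes + (B.Expanded ? LONG_BR : SHORT_BR);
}

void relaxBranches(std::vector<Block> &Blocks) {
  bool Changed = true;
  while (Changed) { // iterate to a fixed point
    Changed = false;
    // Recompute the byte offset of every block with the current sizes.
    std::vector<int> Offset(Blocks.size() + 1, 0);
    for (std::size_t i = 0; i < Blocks.size(); ++i)
      Offset[i + 1] = Offset[i] + blockSize(Blocks[i]);
    // Expand any still-short branch whose displacement is out of range.
    for (std::size_t i = 0; i < Blocks.size(); ++i) {
      Block &B = Blocks[i];
      if (B.BranchTarget < 0 || B.Expanded) continue;
      int Dist = Offset[B.BranchTarget] - Offset[i + 1];
      if (Dist > RANGE || Dist < -RANGE) {
        B.Expanded = true; // widen this branch
        Changed = true;    // sizes moved; measure everything again
      }
    }
  }
}

int main() {
  // Block 0 branches over two fat blocks to block 3, too far for a short form.
  std::vector<Block> Blocks = {
      {4, 3, false}, {40, -1, false}, {40, -1, false}, {4, -1, false}};
  relaxBranches(Blocks);
  std::printf("block 0 branch expanded: %d\n", Blocks[0].Expanded);
  return 0;
}
```

On a target like PPC, the widened form is typically an inverted conditional branch over an unconditional branch; the model above only tracks the size effect of that rewrite, which is the instruction rewriting the commit message refers to.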
- Nov 17, 2006
  - Chris Lattner authored: value and CR reg #. This requires swapping the order of these everywhere that touches BCC and requires us to write custom matching logic for PPCcondbranch :( (llvm-svn: 31835)
  - Chris Lattner authored (llvm-svn: 31834)
  - Chris Lattner authored (llvm-svn: 31833)
  - Evan Cheng authored: clearing the upper 8 bits instead of issuing two instructions. This also eliminates the need to target the AH register, which can be problematic on x86-64. (llvm-svn: 31832)
  - Jim Laskey authored (llvm-svn: 31830)
  - Jim Laskey authored (llvm-svn: 31828)
  - Jim Laskey authored: 2. Offsets on 64-bit stores are still in bytes. (llvm-svn: 31824)
  - Jim Laskey authored (llvm-svn: 31823)
  - Jim Laskey authored (llvm-svn: 31822)
  - Bill Wendling authored (llvm-svn: 31819)
  - Bill Wendling authored: soon replace all uses of those objects. (llvm-svn: 31817)
  - Bill Wendling authored (llvm-svn: 31816)
  - Bill Wendling authored (llvm-svn: 31815)
  - Bill Wendling authored (llvm-svn: 31814)
  - Bill Wendling authored (llvm-svn: 31813)
  - Bill Wendling authored (llvm-svn: 31812)
  - Bill Wendling authored (llvm-svn: 31811)
  - Bill Wendling authored (llvm-svn: 31810)
  - Chris Lattner authored: and handle it like constant stride vars. This fixes some bad codegen in variable stride cases. For example, it compiles this:

        void foo(int k, int i) {
          for (k=i+i; k <= 8192; k+=i)
            flags2[k] = 0;
        }

    to:

        LBB1_1: #bb.preheader
          movl %eax, %ecx
          addl %ecx, %ecx
          movl L_flags2$non_lazy_ptr, %edx
        LBB1_2: #bb
          movb $0, (%edx,%ecx)
          addl %eax, %ecx
          cmpl $8192, %ecx
          jle LBB1_2 #bb
        LBB1_5: #return
          ret

    or (if the array is local and we are in dynamic-nonpic or static mode):

        LBB3_2: #bb
          movb $0, _flags2(%ecx)
          addl %eax, %ecx
          cmpl $8192, %ecx
          jle LBB3_2 #bb

    and:

        lis r2, ha16(L_flags2$non_lazy_ptr)
        lwz r2, lo16(L_flags2$non_lazy_ptr)(r2)
        slwi r3, r4, 1
        LBB1_2: ;bb
          li r5, 0
          add r6, r4, r3
          stbx r5, r2, r3
          cmpwi cr0, r6, 8192
          bgt cr0, LBB1_5 ;return

    instead of:

        leal (%eax,%eax,2), %ecx
        movl %eax, %edx
        addl %edx, %edx
        addl L_flags2$non_lazy_ptr, %edx
        xorl %esi, %esi
        LBB1_2: #bb
          movb $0, (%edx,%esi)
          movl %eax, %edi
          addl %esi, %edi
          addl %ecx, %esi
          cmpl $8192, %esi
          jg LBB1_5 #return

    and:

        lis r2, ha16(L_flags2$non_lazy_ptr)
        lwz r2, lo16(L_flags2$non_lazy_ptr)(r2)
        mulli r3, r4, 3
        slwi r5, r4, 1
        li r6, 0
        add r2, r2, r5
        LBB1_2: ;bb
          li r5, 0
          add r7, r3, r6
          stbx r5, r2, r6
          add r6, r4, r6
          cmpwi cr0, r7, 8192
          ble cr0, LBB1_2 ;bb

    This speeds up Benchmarks/Shootout/sieve from 8.533s to 6.464s and implements LoopStrengthReduce/var_stride_used_by_compare.ll (llvm-svn: 31809)
  - Bill Wendling authored (llvm-svn: 31806)