- Oct 18, 2004
-
Nate Begeman authored
Pass -single_module option to gcc when linking dynamic libraries for use with bugpoint, so that we can bugpoint multiple .cp files llvm-svn: 17102
-
Nate Begeman authored
llvm-svn: 17101
-
- Oct 17, 2004
-
Chris Lattner authored
This comes up many times in perlbmk and probably others. llvm-svn: 17100
-
Chris Lattner authored
other blocks. llvm-svn: 17099
-
Chris Lattner authored
unnecessary. This allows us to delete several hundred phi nodes of the form PHI(x,x,x,undef) from 253.perlbmk and probably other programs as well. This implements Mem2Reg/UndefValuesMerge.ll llvm-svn: 17098
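The rule this change exploits can be sketched outside of LLVM: if every defined incoming value of a phi node is the same value V, the undef entries may take on V as well, so the whole phi collapses to V. A minimal C model of just that rule (the `UNDEF` marker and function name are illustrative, not LLVM's API):

```c
#include <stddef.h>

/* Stand-in marker for an undef incoming value in this sketch. */
#define UNDEF (-1)

/* If every defined incoming value of a phi is the same value v, the
 * undef entries can merge with it, so the phi collapses to v.
 * Returns v, or UNDEF when two different defined values flow in
 * (a real merge point, so the phi must be kept). */
int simplify_phi(const int *incoming, size_t n) {
    int v = UNDEF;
    for (size_t i = 0; i < n; ++i) {
        if (incoming[i] == UNDEF)
            continue;                 /* undef merges with anything */
        if (v != UNDEF && incoming[i] != v)
            return UNDEF;             /* two distinct defined values */
        v = incoming[i];
    }
    return v;
}
```

A real implementation works on Value pointers and must also handle a phi that references itself; this sketch shows only the collapsing rule applied to PHI(x,x,x,undef)-style nodes.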
-
Chris Lattner authored
llvm-svn: 17097
-
Chris Lattner authored
to think that PHI[4, undef] == 4. llvm-svn: 17096
-
Chris Lattner authored
so prepare for this. llvm-svn: 17095
-
Chris Lattner authored
0->field, which is illegal. Now we print ((foo*)0)->field. The second hunk is an optimization to not print undefined phi values. llvm-svn: 17094
-
Chris Lattner authored
last night. :) bork! llvm-svn: 17093
-
Reid Spencer authored
llvm-svn: 17092
-
Reid Spencer authored
llvm-svn: 17091
-
Reid Spencer authored
llvm-svn: 17090
-
Reid Spencer authored
llvm-svn: 17089
-
Reid Spencer authored
llvm-svn: 17088
-
Reid Spencer authored
llvm-svn: 17087
-
Reid Spencer authored
llvm-svn: 17086
-
Reid Spencer authored
llvm-svn: 17085
-
Reid Spencer authored
llvm-svn: 17084
-
Chris Lattner authored
double %test(uint %X) {
        %tmp.1 = cast uint %X to double    ; <double> [#uses=1]
        ret double %tmp.1
}

into:

test:
        sub %ESP, 8
        mov %EAX, DWORD PTR [%ESP + 12]
        mov %ECX, 0
        mov DWORD PTR [%ESP], %EAX
        mov DWORD PTR [%ESP + 4], %ECX
        fild QWORD PTR [%ESP]
        add %ESP, 8
        ret

... which basically zero extends to 8 bytes, then does an fild for an 8-byte signed int.

Now we generate this:

test:
        sub %ESP, 4
        mov %EAX, DWORD PTR [%ESP + 8]
        mov DWORD PTR [%ESP], %EAX
        fild DWORD PTR [%ESP]
        shr %EAX, 31
        fadd DWORD PTR [.CPItest_0 + 4*%EAX]
        add %ESP, 4
        ret

        .section .rodata
        .align 4
.CPItest_0:
        .quad 5728578726015270912

This does a 32-bit signed integer load, then adds in an offset if the sign bit of the integer was set. It turns out that this is substantially faster than the preceding sequence.

Consider this testcase:

unsigned a[2] = {1, 2};
volatile double G;
void main() {
    int i;
    for (i = 0; i < 100000000; ++i)
        G += a[i & 1];
}

On zion (a P4 Xeon, 3GHz), this patch speeds up the testcase from 2.140s to 0.94s. On apoc, an Athlon MP 2100+, this patch speeds up the testcase from 1.72s to 1.34s. Note that the program takes 2.5s/1.97s on zion/apoc with GCC 3.3 -O3 -fomit-frame-pointer. llvm-svn: 17083
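The trick in the new sequence can be expressed in C: do a *signed* 32-bit conversion (which fild handles natively), then add 2^32 back in when the sign bit was set, using the sign bit as a table index just as the generated `fadd DWORD PTR [.CPItest_0 + 4*%EAX]` does. A sketch under that reading (the function and table names are invented for illustration):

```c
#include <stdint.h>

/* The constant-pool quad encodes a two-entry table {0, 2^32};
 * a double table plays the same role here. */
static const double kSignFix[2] = { 0.0, 4294967296.0 /* 2^32 */ };

double uint32_to_double(uint32_t x) {
    double d = (double)(int32_t)x;   /* signed conversion, like fild DWORD */
    return d + kSignFix[x >> 31];    /* add 2^32 iff the sign bit was set */
}
```

For inputs below 2^31 the correction term is 0.0 and the signed conversion is already exact; for inputs at or above 2^31 the signed conversion yields the value minus 2^32, which the table entry restores.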
-
Chris Lattner authored
us to use index registers for CPI's llvm-svn: 17082
-
Chris Lattner authored
index reg and scale llvm-svn: 17081
-
Chris Lattner authored
%X = and Y, constantint
%Z = setcc %X, 0

instead of emitting:

        and %EAX, 3
        test %EAX, %EAX
        je .LBBfoo2_2   # UnifiedReturnBlock

We now emit:

        test %EAX, 3
        je .LBBfoo2_2   # UnifiedReturnBlock

This triggers 581 times on 176.gcc for example. llvm-svn: 17080
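At the source level, the pattern that now folds is a masked value consumed only by a comparison against zero. A small C illustration of such a triggering function (the name is invented):

```c
/* (y & 3) is used only by the == 0 comparison, so a backend can fold
 * the and+compare pair into a single `test %reg, 3` instruction
 * instead of clobbering a register with `and` first. */
int low_two_bits_clear(unsigned y) {
    return (y & 3) == 0;
}
```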
-
Chris Lattner authored
llvm-svn: 17079
-
Nate Begeman authored
1. optional shift left
2. and x, immX
3. and y, immY
4. or z, x, y
==> rlwimi z, x, y, shift, mask begin, mask end

where immX == ~immY and immX is a run of set bits. This transformation fires 32 times on voronoi, once on espresso, and probably several dozen times on external benchmarks such as gcc.

To put this in terms of actual code generated for

struct B { unsigned a : 3; unsigned b : 2; };
void storeA (struct B *b, int v) { b->a = v; }
void storeB (struct B *b, int v) { b->b = v; }

Old:

_storeA:
        rlwinm r2, r4, 0, 29, 31
        lwz r4, 0(r3)
        rlwinm r4, r4, 0, 0, 28
        or r2, r4, r2
        stw r2, 0(r3)
        blr
_storeB:
        rlwinm r2, r4, 3, 0, 28
        rlwinm r2, r2, 0, 27, 28
        lwz r4, 0(r3)
        rlwinm r4, r4, 0, 29, 26
        or r2, r2, r4
        stw r2, 0(r3)
        blr

New:

_storeA:
        lwz r2, 0(r3)
        rlwimi r2, r4, 0, 29, 31
        stw r2, 0(r3)
        blr
_storeB:
        lwz r2, 0(r3)
        rlwimi r2, r4, 3, 27, 28
        stw r2, 0(r3)
        blr

llvm-svn: 17078
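The rlwimi semantics that the and/and/or pattern maps onto can be modeled in C. This sketch assumes the usual PowerPC convention that bits are numbered from the MSB, so `mb`/`me` give a big-endian mask range; the helper names are invented:

```c
#include <stdint.h>

static uint32_t rotl32(uint32_t v, unsigned sh) {
    /* guard sh == 0 to avoid the undefined shift by 32 */
    return sh ? (v << sh) | (v >> (32 - sh)) : v;
}

/* Mask of set bits from big-endian bit mb through me (bit 0 = MSB). */
static uint32_t ppc_mask(unsigned mb, unsigned me) {
    uint32_t m = 0;
    for (unsigned i = mb; i <= me; ++i)
        m |= 0x80000000u >> i;
    return m;
}

/* rlwimi ra, rs, sh, mb, me: rotate rs left by sh, then insert the
 * bits selected by the mask into ra, keeping ra's other bits. */
uint32_t rlwimi(uint32_t ra, uint32_t rs, unsigned sh,
                unsigned mb, unsigned me) {
    uint32_t mask = ppc_mask(mb, me);
    return (rotl32(rs, sh) & mask) | (ra & ~mask);
}
```

With this model, `rlwimi r2, r4, 0, 29, 31` from _storeA inserts the low three bits of r4 into r2, and `rlwimi r2, r4, 3, 27, 28` from _storeB shifts the value into place and inserts the two-bit field, matching the one-instruction replacement of the old and/and/or sequence.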
-
Chris Lattner authored
llvm-svn: 17077
-
Chris Lattner authored
llvm-svn: 17076
-
Chris Lattner authored
for undef. llvm-svn: 17075
-
Chris Lattner authored
llvm-svn: 17074
-
Reid Spencer authored
llvm-svn: 17073
-
Reid Spencer authored
llvm-svn: 17072
-
Reid Spencer authored
that we undefine the macro before using its name in the definition. This can happen on Linux if _GNU_SOURCE is defined. llvm-svn: 17071
-
Chris Lattner authored
llvm-svn: 17070
-
Chris Lattner authored
llvm-svn: 17069
-
- Oct 16, 2004
-
Nate Begeman authored
Flag rotate left word immediate then mask insert (rlwimi) as a two-address instruction, and update the ISel usage of the instruction accordingly. This will allow us to properly schedule rlwimi, and use it to efficiently codegen bitfield operations. llvm-svn: 17068
-
Chris Lattner authored
llvm-svn: 17067
-
Chris Lattner authored
ugly and giant constant exprs in some programs. llvm-svn: 17066
-
Chris Lattner authored
involving %B instead of allowing any geps except %A's. llvm-svn: 17065
-
Misha Brukman authored
* Wrap at 80 cols llvm-svn: 17064
-
Chris Lattner authored
been defined yet! llvm-svn: 17063
-