Commits · 50a425a56d2aa6fd5fb633e26a6f232919701736 · Roger Ferrer / llvm-epi-0.8

Oct 07, 2004
- Make these scripts work on SunOS too. · 50a425a5
  Reid Spencer authored Oct 07, 2004
```
llvm-svn: 16805
```
  50a425a5
- Fix a bug in the safety analysis routine · 02b6c918
  Chris Lattner authored Oct 07, 2004
```
llvm-svn: 16804
```
  02b6c918
- Comment cleanups · f6479968
  Chris Lattner authored Oct 07, 2004
```
llvm-svn: 16803
```
  f6479968
- * Rename pass to globalopt, since we do more than just constify · 25db5803
  Chris Lattner authored Oct 07, 2004
```
* Instead of handling dead functions specially, just nuke them.
* Be more aggressive about cleaning up after constification, in
  particular, handle getelementptr instructions and constantexprs.
* Be a little bit more structured about how we process globals.

*** Delete globals that are only stored to, and never read.  These are
    clearly not useful, so they should go.  This implements deadglobal.llx

This last one triggers quite a few times.  In particular, 2208 in the
external tests, 1865 of which are in 252.eon.  This shrinks eon from
1995094 to 1732341 bytes of bytecode.

llvm-svn: 16802
```
  25db5803
- Rename pass · fa3cfd39
  Chris Lattner authored Oct 07, 2004
```
llvm-svn: 16801
```
  fa3cfd39
- This pass is not needed, as there is only ever one global: the stack · b0c8aab0
  Chris Lattner authored Oct 07, 2004
```
llvm-svn: 16800
```
  b0c8aab0
- Add new testcase, rename pass · 381fbf16
  Chris Lattner authored Oct 07, 2004
```
llvm-svn: 16799
```
  381fbf16
- Don't add libz or libbz2 to the USEDLIBS lists, those are for LLVM libraries. · fc303099
  Chris Lattner authored Oct 07, 2004
```
llvm-svn: 16798
```
  fc303099
- Don't call memset if malloc returns a null pointer · 76319a83
  Chris Lattner authored Oct 06, 2004
```
llvm-svn: 16797
```
  76319a83
Oct 06, 2004

Implement GlobalConstifier/trivialstore.llx, and also do some · 1f849a08

Chris Lattner authored Oct 06, 2004

simplifications of the resultant program to avoid making later passes
do it all.

This allows us to constify globals that just have the same constant that
they are initialized stored into them.

Suprisingly this comes up ALL of the freaking time, dozens of times in
SPEC, 30 times in vortex alone.

For example, on 256.bzip2, it allows us to constify these two globals:

%smallMode = internal global ubyte 0             ; <ubyte*> [#uses=8]
%verbosity = internal global int 0               ; <int*> [#uses=49]

Which (with later optimizations) results in the bytecode file shrinking
from 82286 to 69686 bytes!  Lets hear it for IPO :)

For the record, it's nuking lots of "if (verbosity > 2) { do lots of stuff }"
code.

llvm-svn: 16793

1f849a08

New testcase · 645bcf6c
Chris Lattner authored Oct 06, 2004
```
llvm-svn: 16791
```
645bcf6c
Dont' let null nodes sneak past cast instructions · af88fcd4
Chris Lattner authored Oct 06, 2004
```
llvm-svn: 16779
```
af88fcd4
Undoxyfy internal method. · fe643e31
Misha Brukman authored Oct 06, 2004
```
llvm-svn: 16774
```
fe643e31
Doxygen-ify comments · 74a1195b
Misha Brukman authored Oct 06, 2004
```
llvm-svn: 16773
```
74a1195b

Change Type::isAbstract to have better comments, a more correct name · 43e03c9c

Chris Lattner authored Oct 06, 2004

(PromoteAbstractToConcrete), and to use a set to avoid recomputation.
In particular, this set eliminates the potentially exponential cases
from this little recursive algorithm.

On a particularly nasty testcase, llvm-dis on the .bc file went from 34
minutes (which is when I killed it, it still hadn't finished) to 0.57s.
Remember kids, exponential algorithms are bad.

llvm-svn: 16772

43e03c9c

Rename method, change comment, add argument · f2956078
Chris Lattner authored Oct 06, 2004
```
llvm-svn: 16771
```
f2956078
Correct some typeos · f94f985b
Chris Lattner authored Oct 06, 2004
```
llvm-svn: 16770
```
f94f985b
Instcombine: -(X sdiv C) -> (X sdiv -C), tested by sub.ll:test16 · 0aee4b79
Chris Lattner authored Oct 06, 2004
```
llvm-svn: 16769
```
0aee4b79
New testcase · 52783ab1
Chris Lattner authored Oct 06, 2004
```
llvm-svn: 16768
```
52783ab1
Remove debugging code, fix encoding problem. This fixes the problems · 93867e51
Chris Lattner authored Oct 06, 2004
```
the JIT had last night.

llvm-svn: 16766
```
93867e51
Turning on fsel code gen now that we can do so would be good. · 9a1fbaf1
Nate Begeman authored Oct 06, 2004
```
llvm-svn: 16765
```
9a1fbaf1

Implement floating point select for lt, gt, le, ge using the powerpc fsel · fac8529d

Nate Begeman authored Oct 06, 2004

instruction.

Now, rather than emitting the following loop out of bisect:
.LBB_main_19:	; no_exit.0.i
	rlwinm r3, r2, 3, 0, 28
	lfdx f1, r3, r27
	addis r3, r30, ha16(.CPI_main_1-"L00000$pb")
	lfd f2, lo16(.CPI_main_1-"L00000$pb")(r3)
	fsub f2, f2, f1
	addis r3, r30, ha16(.CPI_main_1-"L00000$pb")
	lfd f4, lo16(.CPI_main_1-"L00000$pb")(r3)
	fcmpu cr0, f1, f4
	bge .LBB_main_64	; no_exit.0.i
.LBB_main_63:	; no_exit.0.i
	b .LBB_main_65	; no_exit.0.i
.LBB_main_64:	; no_exit.0.i
	fmr f2, f1
.LBB_main_65:	; no_exit.0.i
	addi r3, r2, 1
	rlwinm r3, r3, 3, 0, 28
	lfdx f1, r3, r27
	addis r3, r30, ha16(.CPI_main_1-"L00000$pb")
	lfd f4, lo16(.CPI_main_1-"L00000$pb")(r3)
	fsub f4, f4, f1
	addis r3, r30, ha16(.CPI_main_1-"L00000$pb")
	lfd f5, lo16(.CPI_main_1-"L00000$pb")(r3)
	fcmpu cr0, f1, f5
	bge .LBB_main_67	; no_exit.0.i
.LBB_main_66:	; no_exit.0.i
	b .LBB_main_68	; no_exit.0.i
.LBB_main_67:	; no_exit.0.i
	fmr f4, f1
.LBB_main_68:	; no_exit.0.i
	fadd f1, f2, f4
	addis r3, r30, ha16(.CPI_main_2-"L00000$pb")
	lfd f2, lo16(.CPI_main_2-"L00000$pb")(r3)
	fmul f1, f1, f2
	rlwinm r3, r2, 3, 0, 28
	lfdx f2, r3, r28
	fadd f4, f2, f1
	fcmpu cr0, f4, f0
	bgt .LBB_main_70	; no_exit.0.i
.LBB_main_69:	; no_exit.0.i
	b .LBB_main_71	; no_exit.0.i
.LBB_main_70:	; no_exit.0.i
	fmr f0, f4
.LBB_main_71:	; no_exit.0.i
	fsub f1, f2, f1
	addi r2, r2, -1
	fcmpu cr0, f1, f3
	blt .LBB_main_73	; no_exit.0.i
.LBB_main_72:	; no_exit.0.i
	b .LBB_main_74	; no_exit.0.i
.LBB_main_73:	; no_exit.0.i
	fmr f3, f1
.LBB_main_74:	; no_exit.0.i
	cmpwi cr0, r2, -1
	fmr f16, f0
	fmr f17, f3
	bgt .LBB_main_19	; no_exit.0.i

We emit this instead:
.LBB_main_19:	; no_exit.0.i
	rlwinm r3, r2, 3, 0, 28
	lfdx f1, r3, r27
	addis r3, r30, ha16(.CPI_main_1-"L00000$pb")
	lfd f2, lo16(.CPI_main_1-"L00000$pb")(r3)
	fsub f2, f2, f1
	fsel f1, f1, f1, f2
	addi r3, r2, 1
	rlwinm r3, r3, 3, 0, 28
	lfdx f2, r3, r27
	addis r3, r30, ha16(.CPI_main_1-"L00000$pb")
	lfd f4, lo16(.CPI_main_1-"L00000$pb")(r3)
	fsub f4, f4, f2
	fsel f2, f2, f2, f4
	fadd f1, f1, f2
	addis r3, r30, ha16(.CPI_main_2-"L00000$pb")
	lfd f2, lo16(.CPI_main_2-"L00000$pb")(r3)
	fmul f1, f1, f2
	rlwinm r3, r2, 3, 0, 28
	lfdx f2, r3, r28
	fadd f4, f2, f1
	fsub f5, f0, f4
	fsel f0, f5, f0, f4
	fsub f1, f2, f1
	addi r2, r2, -1
	fsub f2, f1, f3
	fsel f3, f2, f3, f1
	cmpwi cr0, r2, -1
	fmr f16, f0
	fmr f17, f3
	bgt .LBB_main_19	; no_exit.0.i

llvm-svn: 16764

fac8529d

Codegen signed mod by 2 or -2 more efficiently. Instead of generating: · 6835dedb

Chris Lattner authored Oct 06, 2004

t:
        mov %EDX, DWORD PTR [%ESP + 4]
        mov %ECX, 2
        mov %EAX, %EDX
        sar %EDX, 31
        idiv %ECX
        mov %EAX, %EDX
        ret

Generate:
t:
        mov %ECX, DWORD PTR [%ESP + 4]
***     mov %EAX, %ECX
        cdq
        and %ECX, 1
        xor %ECX, %EDX
        sub %ECX, %EDX
***     mov %EAX, %ECX
        ret

Note that the two marked moves are redundant, and should be eliminated by the
register allocator, but aren't.

Compare this to GCC, which generates:

t:
        mov     %eax, DWORD PTR [%esp+4]
        mov     %edx, %eax
        shr     %edx, 31
        lea     %ecx, [%edx+%eax]
        and     %ecx, -2
        sub     %eax, %ecx
        ret

or ICC 8.0, which generates:

t:
        movl      4(%esp), %ecx                                 #3.5
        movl      $-2147483647, %eax                            #3.25
        imull     %ecx                                          #3.25
        movl      %ecx, %eax                                    #3.25
        sarl      $31, %eax                                     #3.25
        addl      %ecx, %edx                                    #3.25
        subl      %edx, %eax                                    #3.25
        addl      %eax, %eax                                    #3.25
        negl      %eax                                          #3.25
        subl      %eax, %ecx                                    #3.25
        movl      %ecx, %eax                                    #3.25
        ret                                                     #3.25

We would be in great shape if not for the moves.

llvm-svn: 16763

6835dedb

Really fix FreeBSD, which apparently doesn't tolerate the extern. · e4c60eb7
Chris Lattner authored Oct 06, 2004
```
Thanks to Jeff Cohen for pointing out my goof.

llvm-svn: 16762
```
e4c60eb7

Fix a scary bug with signed division by a power of two. We used to generate: · 7bd8f133

Chris Lattner authored Oct 06, 2004

s:   ;; X / 4
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %ECX, %EAX
        sar %ECX, 1
        shr %ECX, 30
        mov %EDX, %EAX
        add %EDX, %ECX
        sar %EAX, 2
        ret

When we really meant:

s:
        mov %EAX, DWORD PTR [%ESP + 4]
        mov %ECX, %EAX
        sar %ECX, 1
        shr %ECX, 30
        add %EAX, %ECX
        sar %EAX, 2
        ret

Hey, this also reduces register pressure too :)

llvm-svn: 16761

7bd8f133

Codegen signed divides by 2 and -2 more efficiently. In particular · 147edd2f

Chris Lattner authored Oct 06, 2004

instead of:

s:   ;; X / 2
        movl 4(%esp), %eax
        movl %eax, %ecx
        shrl $31, %ecx
        movl %eax, %edx
        addl %ecx, %edx
        sarl $1, %eax
        ret

t:   ;; X / -2
        movl 4(%esp), %eax
        movl %eax, %ecx
        shrl $31, %ecx
        movl %eax, %edx
        addl %ecx, %edx
        sarl $1, %eax
        negl %eax
        ret

Emit:

s:
        movl 4(%esp), %eax
        cmpl $-2147483648, %eax
        sbbl $-1, %eax
        sarl $1, %eax
        ret

t:
        movl 4(%esp), %eax
        cmpl $-2147483648, %eax
        sbbl $-1, %eax
        sarl $1, %eax
        negl %eax
        ret

llvm-svn: 16760

147edd2f

Add some new instructions. Fix the asm string for sbb32rr · e9bfa5a2
Chris Lattner authored Oct 06, 2004
```
llvm-svn: 16759
```
e9bfa5a2

Reduce code growth implied by the tail duplication pass by not duplicating · 2ce32df8

Chris Lattner authored Oct 06, 2004

an instruction if it can be hoisted to a common dominator of the block.
This implements: test/Regression/Transforms/TailDup/MergeTest.ll

llvm-svn: 16758

2ce32df8

When tail duplicating these functions, the add instruction should not be · 7d83efbc
Chris Lattner authored Oct 06, 2004
```
duplicated, even though the block it is in is duplicated.

llvm-svn: 16757
```
7d83efbc
FreeBSD uses GCC. Patch contributed by Jeff Cohen! · 32ed828f
Chris Lattner authored Oct 06, 2004
```
llvm-svn: 16756
```
32ed828f
Fix the path to the fixinc'd headers. Patch contributed by Jeff Cohen! · 18b88f71
Chris Lattner authored Oct 06, 2004
```
llvm-svn: 16755
```
18b88f71

Oct 05, 2004
- Must include sys/stat.h before declaring a 'struct stat' · c5a630bd
  Brian Gaeke authored Oct 05, 2004
```
llvm-svn: 16728
```
  c5a630bd
- Build BFtoLLVM example front-end by default · a3d1b776
  Brian Gaeke authored Oct 05, 2004
```
llvm-svn: 16719
```
  a3d1b776
- Add BFtoLLVM example front end · ca70a78b
  Brian Gaeke authored Oct 05, 2004
```
llvm-svn: 16714
```
  ca70a78b
- Make sure the const bit gets inherited correctly when linking declarations · 9b38ead8
  Chris Lattner authored Oct 05, 2004
```
of disagreeing constness.  This fixes
test/Regression/Linker/ConstantGlobals[123].ll

llvm-svn: 16692
```
  9b38ead8
- Another testcase for constness linkage · 07d1d7ed
  Chris Lattner authored Oct 05, 2004
```
llvm-svn: 16691
```
  07d1d7ed
- Testcase to ensure that the 'constant' flag follows the definition when there · e0d464bd
  Chris Lattner authored Oct 05, 2004
```
is a question.

llvm-svn: 16690
```
  e0d464bd
- Adjust sys/stat.h inclusion so its only for SunOS. · abb04cfc
  Reid Spencer authored Oct 05, 2004
```
llvm-svn: 16686
```
  abb04cfc
- Added a couple of includes to get this to compile on Sparc. · c3ef3cc7
  Tanya Lattner authored Oct 05, 2004
```
llvm-svn: 16685
```
  c3ef3cc7
- Solaris doesn't have MAP_FILE. · 98959376
  Chris Lattner authored Oct 05, 2004
```
llvm-svn: 16682
```
  98959376