Commits · 9c0710781dd0a3b1b092e8ab8d606abb5f5a4855 · Roger Ferrer / llvm-epi-0.8

Oct 09, 2004
- Implement getModuleMatchQuality and getJITMatchQuality so that v8 will be the · 9c071078
  Brian Gaeke authored Oct 09, 2004
```
default 32/BE target on sparc hosts, and ppc will continue to be the default
on other hosts.

llvm-svn: 16865
```
  9c071078
- Fix infinite loop due to iteration · f369b38d
  Chris Lattner authored Oct 09, 2004
```
llvm-svn: 16864
```
  f369b38d
- Implement sub.ll:test17, -X/C -> X/-C · 4ad08352
  Chris Lattner authored Oct 09, 2004
```
llvm-svn: 16863
```
  4ad08352
- Add a check to avoid an assertion on malformed input · 1f4739cd
  Chris Lattner authored Oct 09, 2004
```
llvm-svn: 16861
```
  1f4739cd
- The person who was planning to add SSE support isn't anymore, so disable · 23c8d0b6
  Chris Lattner authored Oct 08, 2004
```
the -sse* options (to avoid misleading people).

Also, the stack alignment of the target doesn't depend on whether SSE is
eventually implemented, so remove a comment.

llvm-svn: 16860
```
  23c8d0b6
- Fix a major regression from the bugfix for 2004-10-08-SelectSetCCFold.llx, · 97ea4206
  Chris Lattner authored Oct 08, 2004
```
which prevented setcc's from being folded into branches.  It appears that
conditional branchinst's CC operand is actually operand(2), not operand(0)
as we might expect. :(

llvm-svn: 16859
```
  97ea4206
- If we found a dead global, we should at least delete it... · 1b8d2957
  Chris Lattner authored Oct 08, 2004
```
llvm-svn: 16858
```
  1b8d2957
Oct 08, 2004

* Pull out the meat of runOnModule into another function for clarity. · 1c4bddc5

Chris Lattner authored Oct 08, 2004

* Do not lead dangling dead constants prevent optimization
* Iterate global optimization while we're making progress.

These changes allow us to be more aggressive, handling cases like
GlobalOpt/iterate.llx without a problem (turning it into 'ret int 0').

llvm-svn: 16857

1c4bddc5

We might as well delete the known-dead global sooner rather than later since · 73ad73e2
Chris Lattner authored Oct 08, 2004
```
we know it is dead.

llvm-svn: 16855
```
73ad73e2
Hyphenate target-(in)dependent for more tasty grammar goodness (tm) · 84e5ff76
Misha Brukman authored Oct 08, 2004
```
llvm-svn: 16854
```
84e5ff76
Temporarily disable a buggy transformation until it can be fixed. This fixes · 0b41e861
Chris Lattner authored Oct 08, 2004
```
254.gap.

llvm-svn: 16853
```
0b41e861
Adjust paths due to moving InstrSched to lib/Target/SparcV9 · e4e1360e
Misha Brukman authored Oct 08, 2004
```
llvm-svn: 16852
```
e4e1360e
InstrSched has been moved to lib/Target/SparcV9 · cb54d5df
Misha Brukman authored Oct 08, 2004
```
llvm-svn: 16850
```
cb54d5df
InstrSched is SparcV9-specific and so has been moved to lib/Target/SparcV9/ · 24eb38af
Misha Brukman authored Oct 08, 2004
```
llvm-svn: 16849
```
24eb38af
Single-space instead of double-spacing in the Makefile · 5a9976ac
Misha Brukman authored Oct 08, 2004
```
llvm-svn: 16848
```
5a9976ac
Build InstrSched as well, and all three subdirs can be built independently · e75c2668
Misha Brukman authored Oct 08, 2004
```
llvm-svn: 16847
```
e75c2668
Single-space instead of double-spacing in the Makefile · 73dce3a6
Misha Brukman authored Oct 08, 2004
```
llvm-svn: 16845
```
73dce3a6

Implement SRA for global variables. This allows the other global variable · abab0719

Chris Lattner authored Oct 08, 2004

optimizations to trigger much more often.  This allows the elimination of
several dozen more global variables in Programs/External.  Note that we only
do this for non-constant globals: constant globals will already be optimized
out if the accesses to them permit it.

This implements Transforms/GlobalOpt/globalsra.llx

llvm-svn: 16842

abab0719

Fix bug: 2004-10-08-SelectSetCCFold.llx. Normally this is hidden by the · 0be2f504
Chris Lattner authored Oct 08, 2004
```
instcombine xform, which is why we didn't notice it before.

llvm-svn: 16840
```
0be2f504
Instcombine (X & FF00) + xx00 -> (X+xx00) & FF00, implementing and.ll:test27 · bff91d9a
Chris Lattner authored Oct 08, 2004
```
This comes up when doing adds to bitfield elements.

llvm-svn: 16836
```
bff91d9a
Little patch to turn (shl (add X, 123), 4) -> (add (shl X, 4), 123 << 4) · 44bd392c
Chris Lattner authored Oct 08, 2004
```
This triggers in cases of bitfield additions, opening opportunities for
future improvements.

llvm-svn: 16834
```
44bd392c

Implement logical and with an immediate that consists of a contiguous block · b58dd679

Nate Begeman authored Oct 08, 2004

of one or more 1 bits (may wrap from least significant bit to most
significant bit) as the rlwinm rather than andi., andis., or some longer
instructons sequence.

int andn4(int z) { return z & -4; }
int clearhi(int z) { return z & 0x0000FFFF; }
int clearlo(int z) { return z & 0xFFFF0000; }
int clearmid(int z) { return z & 0x00FFFF00; }
int clearwrap(int z) { return z & 0xFF0000FF; }

_andn4:
        rlwinm r3, r3, 0, 0, 29
        blr

_clearhi:
        rlwinm r3, r3, 0, 16, 31
        blr

_clearlo:
        rlwinm r3, r3, 0, 0, 15
        blr

_clearmid:
        rlwinm r3, r3, 0, 8, 23
        blr

_clearwrap:
        rlwinm r3, r3, 0, 24, 7
        blr

llvm-svn: 16832

b58dd679

Several fixes and enhancements to the PPC32 backend. · 6e6514c4

Nate Begeman authored Oct 07, 2004

1. Fix an illegal argument to getClassB when deciding whether or not to
   sign extend a byte load.

2. Initial addition of isLoad and isStore flags to the instruction .td file
   for eventual use in a scheduler.

3. Rewrite of how constants are handled in emitSimpleBinaryOperation so
   that we can emit the PowerPC shifted immediate instructions far more
   often.  This allows us to emit the following code:

int foo(int x) { return x | 0x00F0000; }

_foo:
.LBB_foo_0:     ; entry
        ; IMPLICIT_DEF
        oris r3, r3, 15
        blr

llvm-svn: 16826

6e6514c4

Add ori reg, reg, 0 as a move instruction. This can be generated from · c6b63cd2

Nate Begeman authored Oct 07, 2004

loading a 32bit constant into a register whose low halfword is all zeroes.

We now omit the ori after the lis for the following C code:

int bar(int y) { return y * 0x00F0000; }

_bar:
.LBB_bar_0:     ; entry
        ; IMPLICIT_DEF
        lis r2, 15
        mullw r3, r3, r2
        blr

llvm-svn: 16825

c6b63cd2

Remove unnecessary header include · 70a9d9c0
Nate Begeman authored Oct 07, 2004
```
llvm-svn: 16824
```
70a9d9c0

Oct 07, 2004

Improve comments, no functionality changes · 617f1a34
Chris Lattner authored Oct 07, 2004
```
llvm-svn: 16814
```
617f1a34
Fix a nasty dangling pointer problem, due to a free'd pointer being left in · 3ae7bb6b
Chris Lattner authored Oct 07, 2004
```
a map.  This caused problems if a later object happened to be allocated at
the free'd object's address.

llvm-svn: 16813
```
3ae7bb6b

Unfortunately the fix for the previous bug introduced the previous · 251093ca

Chris Lattner authored Oct 07, 2004

exponential behavior (bork!).  This patch processes stuff with an
explicit SCC finder, allowing the algorithm to be more clear,
efficient, and also (as a bonus) correct!  This gets us back to taking
0.6s to disassemble my horrible .bc file that previously took something
> 30 mins.

llvm-svn: 16811

251093ca

Fix a bug in my previous change. Unfortunately this reverts most of the · cef3c060
Chris Lattner authored Oct 07, 2004
```
speedup, but has the advantage of not breaking a bunch of programs!

llvm-svn: 16806
```
cef3c060
Fix a bug in the safety analysis routine · 02b6c918
Chris Lattner authored Oct 07, 2004
```
llvm-svn: 16804
```
02b6c918
Comment cleanups · f6479968
Chris Lattner authored Oct 07, 2004
```
llvm-svn: 16803
```
f6479968

* Rename pass to globalopt, since we do more than just constify · 25db5803

Chris Lattner authored Oct 07, 2004

* Instead of handling dead functions specially, just nuke them.
* Be more aggressive about cleaning up after constification, in
  particular, handle getelementptr instructions and constantexprs.
* Be a little bit more structured about how we process globals.

*** Delete globals that are only stored to, and never read.  These are
    clearly not useful, so they should go.  This implements deadglobal.llx

This last one triggers quite a few times.  In particular, 2208 in the
external tests, 1865 of which are in 252.eon.  This shrinks eon from
1995094 to 1732341 bytes of bytecode.

llvm-svn: 16802

25db5803

Oct 06, 2004

Implement GlobalConstifier/trivialstore.llx, and also do some · 1f849a08

Chris Lattner authored Oct 06, 2004

simplifications of the resultant program to avoid making later passes
do it all.

This allows us to constify globals that just have the same constant that
they are initialized stored into them.

Suprisingly this comes up ALL of the freaking time, dozens of times in
SPEC, 30 times in vortex alone.

For example, on 256.bzip2, it allows us to constify these two globals:

%smallMode = internal global ubyte 0             ; <ubyte*> [#uses=8]
%verbosity = internal global int 0               ; <int*> [#uses=49]

Which (with later optimizations) results in the bytecode file shrinking
from 82286 to 69686 bytes!  Lets hear it for IPO :)

For the record, it's nuking lots of "if (verbosity > 2) { do lots of stuff }"
code.

llvm-svn: 16793

1f849a08

Dont' let null nodes sneak past cast instructions · af88fcd4
Chris Lattner authored Oct 06, 2004
```
llvm-svn: 16779
```
af88fcd4

Change Type::isAbstract to have better comments, a more correct name · 43e03c9c

Chris Lattner authored Oct 06, 2004

(PromoteAbstractToConcrete), and to use a set to avoid recomputation.
In particular, this set eliminates the potentially exponential cases
from this little recursive algorithm.

On a particularly nasty testcase, llvm-dis on the .bc file went from 34
minutes (which is when I killed it, it still hadn't finished) to 0.57s.
Remember kids, exponential algorithms are bad.

llvm-svn: 16772

43e03c9c

Correct some typeos · f94f985b
Chris Lattner authored Oct 06, 2004
```
llvm-svn: 16770
```
f94f985b
Instcombine: -(X sdiv C) -> (X sdiv -C), tested by sub.ll:test16 · 0aee4b79
Chris Lattner authored Oct 06, 2004
```
llvm-svn: 16769
```
0aee4b79
Remove debugging code, fix encoding problem. This fixes the problems · 93867e51
Chris Lattner authored Oct 06, 2004
```
the JIT had last night.

llvm-svn: 16766
```
93867e51
Turning on fsel code gen now that we can do so would be good. · 9a1fbaf1
Nate Begeman authored Oct 06, 2004
```
llvm-svn: 16765
```
9a1fbaf1

Implement floating point select for lt, gt, le, ge using the powerpc fsel · fac8529d

Nate Begeman authored Oct 06, 2004

instruction.

Now, rather than emitting the following loop out of bisect:
.LBB_main_19:	; no_exit.0.i
	rlwinm r3, r2, 3, 0, 28
	lfdx f1, r3, r27
	addis r3, r30, ha16(.CPI_main_1-"L00000$pb")
	lfd f2, lo16(.CPI_main_1-"L00000$pb")(r3)
	fsub f2, f2, f1
	addis r3, r30, ha16(.CPI_main_1-"L00000$pb")
	lfd f4, lo16(.CPI_main_1-"L00000$pb")(r3)
	fcmpu cr0, f1, f4
	bge .LBB_main_64	; no_exit.0.i
.LBB_main_63:	; no_exit.0.i
	b .LBB_main_65	; no_exit.0.i
.LBB_main_64:	; no_exit.0.i
	fmr f2, f1
.LBB_main_65:	; no_exit.0.i
	addi r3, r2, 1
	rlwinm r3, r3, 3, 0, 28
	lfdx f1, r3, r27
	addis r3, r30, ha16(.CPI_main_1-"L00000$pb")
	lfd f4, lo16(.CPI_main_1-"L00000$pb")(r3)
	fsub f4, f4, f1
	addis r3, r30, ha16(.CPI_main_1-"L00000$pb")
	lfd f5, lo16(.CPI_main_1-"L00000$pb")(r3)
	fcmpu cr0, f1, f5
	bge .LBB_main_67	; no_exit.0.i
.LBB_main_66:	; no_exit.0.i
	b .LBB_main_68	; no_exit.0.i
.LBB_main_67:	; no_exit.0.i
	fmr f4, f1
.LBB_main_68:	; no_exit.0.i
	fadd f1, f2, f4
	addis r3, r30, ha16(.CPI_main_2-"L00000$pb")
	lfd f2, lo16(.CPI_main_2-"L00000$pb")(r3)
	fmul f1, f1, f2
	rlwinm r3, r2, 3, 0, 28
	lfdx f2, r3, r28
	fadd f4, f2, f1
	fcmpu cr0, f4, f0
	bgt .LBB_main_70	; no_exit.0.i
.LBB_main_69:	; no_exit.0.i
	b .LBB_main_71	; no_exit.0.i
.LBB_main_70:	; no_exit.0.i
	fmr f0, f4
.LBB_main_71:	; no_exit.0.i
	fsub f1, f2, f1
	addi r2, r2, -1
	fcmpu cr0, f1, f3
	blt .LBB_main_73	; no_exit.0.i
.LBB_main_72:	; no_exit.0.i
	b .LBB_main_74	; no_exit.0.i
.LBB_main_73:	; no_exit.0.i
	fmr f3, f1
.LBB_main_74:	; no_exit.0.i
	cmpwi cr0, r2, -1
	fmr f16, f0
	fmr f17, f3
	bgt .LBB_main_19	; no_exit.0.i

We emit this instead:
.LBB_main_19:	; no_exit.0.i
	rlwinm r3, r2, 3, 0, 28
	lfdx f1, r3, r27
	addis r3, r30, ha16(.CPI_main_1-"L00000$pb")
	lfd f2, lo16(.CPI_main_1-"L00000$pb")(r3)
	fsub f2, f2, f1
	fsel f1, f1, f1, f2
	addi r3, r2, 1
	rlwinm r3, r3, 3, 0, 28
	lfdx f2, r3, r27
	addis r3, r30, ha16(.CPI_main_1-"L00000$pb")
	lfd f4, lo16(.CPI_main_1-"L00000$pb")(r3)
	fsub f4, f4, f2
	fsel f2, f2, f2, f4
	fadd f1, f1, f2
	addis r3, r30, ha16(.CPI_main_2-"L00000$pb")
	lfd f2, lo16(.CPI_main_2-"L00000$pb")(r3)
	fmul f1, f1, f2
	rlwinm r3, r2, 3, 0, 28
	lfdx f2, r3, r28
	fadd f4, f2, f1
	fsub f5, f0, f4
	fsel f0, f5, f0, f4
	fsub f1, f2, f1
	addi r2, r2, -1
	fsub f2, f1, f3
	fsel f3, f2, f3, f1
	cmpwi cr0, r2, -1
	fmr f16, f0
	fmr f17, f3
	bgt .LBB_main_19	; no_exit.0.i

llvm-svn: 16764

fac8529d