Commits · 049d33a7175882fcf2fdc56f4465af5629a7e353 · Roger Ferrer / llvm-epi-0.8

Nov 13, 2004

· 049d33a7

Chris Lattner authored Nov 13, 2004

shld is a very high latency operation. Instead of emitting it for shifts of
two or three, open code the equivalent operation, which is faster on Athlon
and P4 (by a substantial margin).

For example, instead of compiling this:

long long X2(long long Y) { return Y << 2; }

to:

X2:
        movl 4(%esp), %eax
        movl 8(%esp), %edx
        shldl $2, %eax, %edx
        shll $2, %eax
        ret

Compile it to:

X2:
        movl 4(%esp), %eax
        movl 8(%esp), %ecx
        movl %eax, %edx
        shrl $30, %edx
        leal (%edx,%ecx,4), %edx
        shll $2, %eax
        ret

Likewise, for << 3, compile to:

X3:
        movl 4(%esp), %eax
        movl 8(%esp), %ecx
        movl %eax, %edx
        shrl $29, %edx
        leal (%edx,%ecx,8), %edx
        shll $3, %eax
        ret

This matches icc, except that icc open codes the shifts as adds on the P4.

llvm-svn: 17707
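
As a rough illustration of why the open-coded sequence is equivalent, the sketch below models the two 32-bit halves in C. The helper name `shl64_open_coded` is hypothetical and not part of the commit. It assumes the low word is in `lo` (`%eax`) and the high word in `hi` (`%ecx`), as in the listings above. The `leal` adds the carried-out bits to the scaled high word; since the low `n` bits of `hi << n` are zero, that addition behaves like a bitwise OR.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: 64-bit left shift by n using only 32-bit
   operations, mirroring the shrl / leal / shll sequence. */
static uint64_t shl64_open_coded(uint32_t lo, uint32_t hi, unsigned n) {
    assert(n >= 1 && n <= 31);            /* the commit only covers n = 2 and n = 3 */
    uint32_t carry  = lo >> (32 - n);     /* shrl $(32-n), %edx: bits leaving the low word */
    uint32_t new_hi = carry + (hi << n);  /* leal (%edx,%ecx,2^n), %edx */
    uint32_t new_lo = lo << n;            /* shll $n, %eax */
    return ((uint64_t)new_hi << 32) | new_lo;
}
```

For any `n` from 1 to 31 the result should match `(((uint64_t)hi << 32) | lo) << n`; the scale factor of `leal` limits the real instruction sequence to shifts of 1, 2, or 3.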

049d33a7

Add missing check · ef6bd92a
Chris Lattner authored Nov 13, 2004
```
llvm-svn: 17706
```
ef6bd92a

Compile: · 8d521bb1

Chris Lattner authored Nov 13, 2004

long long X3_2(long long Y) { return Y+Y; }
int X(int Y) { return Y+Y; }

into:

X3_2:
        movl 4(%esp), %eax
        movl 8(%esp), %edx
        addl %eax, %eax
        adcl %edx, %edx
        ret
X:
        movl 4(%esp), %eax
        addl %eax, %eax
        ret

instead of:

X3_2:
        movl 4(%esp), %eax
        movl 8(%esp), %edx
        shldl $1, %eax, %edx
        shll $1, %eax
        ret

X:
        movl 4(%esp), %eax
        shll $1, %eax
        ret

llvm-svn: 17705
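
The `addl`/`adcl` pair can be modelled in C as follows. This is only a sketch: the function name `double64_add_adc` is made up for illustration, and the carry flag is emulated with an unsigned comparison rather than read from the CPU.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: compute Y+Y for a 64-bit Y held as two 32-bit
   halves, the way the addl / adcl pair does. */
static uint64_t double64_add_adc(uint32_t lo, uint32_t hi) {
    uint32_t new_lo = lo + lo;          /* addl %eax, %eax */
    uint32_t carry  = new_lo < lo;      /* carry flag: set when the add wrapped */
    uint32_t new_hi = hi + hi + carry;  /* adcl %edx, %edx */
    uint64_t result = ((uint64_t)new_hi << 32) | new_lo;
    /* Cross-check against the native 64-bit shift by one. */
    assert(result == ((((uint64_t)hi << 32) | lo) << 1));
    return result;
}
```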

8d521bb1

Simplify handling of shifts to be the same as we do for adds. Add support · 8c3e7b92

Chris Lattner authored Nov 13, 2004

for (X * C1) + (X * C2) (where * can be mul or shl), allowing us to fold:

   Y+Y+Y+Y+Y+Y+Y+Y

into

        %tmp.8 = shl long %Y, ubyte 3           ; <long> [#uses=1]

instead of

        %tmp.4 = shl long %Y, ubyte 2           ; <long> [#uses=1]
        %tmp.12 = shl long %Y, ubyte 2          ; <long> [#uses=1]
        %tmp.8 = add long %tmp.4, %tmp.12               ; <long> [#uses=1]

This implements add.ll:test25

Also add support for (X*C1)-(X*C2) -> X*(C1-C2), implementing sub.ll:test18

llvm-svn: 17704
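
The arithmetic behind this fold can be sketched in C by treating a shift `X << s` as a multiply by `2^s`. None of the names below come from the LLVM sources; they are hypothetical helpers that only demonstrate the identities, assuming wrap-around (modulo 2^64) arithmetic.

```c
#include <assert.h>
#include <stdint.h>

/* A shift X << s is modelled as the multiplier 2^s, so mul and shl
   terms can be combined uniformly. */
static uint64_t shl_multiplier(unsigned s) {
    assert(s < 64);
    return (uint64_t)1 << s;
}

/* (X * c1) + (X * c2) -> X * (c1 + c2), modulo 2^64. */
static uint64_t fold_add(uint64_t c1, uint64_t c2) { return c1 + c2; }

/* (X * c1) - (X * c2) -> X * (c1 - c2), modulo 2^64. */
static uint64_t fold_sub(uint64_t c1, uint64_t c2) { return c1 - c2; }

/* If c is a power of two, return its log2 so the product can be
   written as a shift again; otherwise return -1. */
static int shift_amount(uint64_t c) {
    if (c == 0 || (c & (c - 1)) != 0) return -1;
    int s = 0;
    while ((c >>= 1) != 0) ++s;
    return s;
}
```

With these helpers, the two `shl ... ubyte 2` terms from the example combine as `fold_add(shl_multiplier(2), shl_multiplier(2))`, and `shift_amount` of that constant gives the shift count of the single folded `shl`.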

8c3e7b92

New testcase · f6392b46
Chris Lattner authored Nov 13, 2004
```
llvm-svn: 17703
```
f6392b46
Add support for shifts · 6912370a
Chris Lattner authored Nov 13, 2004
```
llvm-svn: 17702
```
6912370a

Fold: · 4efe20a1

Chris Lattner authored Nov 13, 2004

   (X + (X << C2)) --> X * ((1 << C2) + 1)
   ((X << C2) + X) --> X * ((1 << C2) + 1)

This means that we now canonicalize "Y+Y+Y" into:

        %tmp.2 = mul long %Y, 3         ; <long> [#uses=1]

instead of:

        %tmp.10 = shl long %Y, ubyte 1          ; <long> [#uses=1]
        %tmp.6 = add long %Y, %tmp.10               ; <long> [#uses=1]

llvm-svn: 17701
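
A small C sketch of the identity used here, under the assumption of wrap-around 64-bit arithmetic. The function names are hypothetical and chosen only for this illustration.

```c
#include <assert.h>
#include <stdint.h>

/* Multiplier for the fold X + (X << c2) -> X * ((1 << c2) + 1). */
static uint64_t add_shl_multiplier(unsigned c2) {
    assert(c2 < 64);
    return ((uint64_t)1 << c2) + 1;
}

/* The original form: X + (X << c2). */
static uint64_t add_shl_unfolded(uint64_t x, unsigned c2) {
    return x + (x << c2);
}

/* The canonical form: a single multiply by the folded constant. */
static uint64_t add_shl_folded(uint64_t x, unsigned c2) {
    return x * add_shl_multiplier(c2);
}
```

For `c2 = 1` the multiplier is 3, which corresponds to the `mul long %Y, 3` in the example above.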

4efe20a1

Lazily create the abort message, so only translation units that use unwind · 2858e175
Chris Lattner authored Nov 13, 2004
```
will actually get it.

llvm-svn: 17700
```
2858e175
Fix: CodeExtractor/2004-11-12-InvokeExtract.ll · 9b0291b1
Chris Lattner authored Nov 13, 2004
```
llvm-svn: 17699
```
9b0291b1
New testcase · 8cc98850
Chris Lattner authored Nov 13, 2004
```
llvm-svn: 17698
```
8cc98850
Fix a bug where the code extractor would get a bit confused handling invoke · 5bcca605
Chris Lattner authored Nov 12, 2004
```
instructions, setting DefBlock to a block it did not have dom info for.

llvm-svn: 17697
```
5bcca605

Nov 12, 2004
- Simplify handling of constant initializers · 5c1d84c7
  Chris Lattner authored Nov 12, 2004
```
llvm-svn: 17696
```
  5c1d84c7
- Makefile for lib/Linker · a81f8197
  Reid Spencer authored Nov 12, 2004
```
llvm-svn: 17695
```
  a81f8197
- This file originated in lib/VMCore/Linker.cpp but now lives in · 361e513d
  Reid Spencer authored Nov 12, 2004
```
lib/Linker/LinkModules.cpp

llvm-svn: 17694
```
  361e513d
- This file originated in tools/gccld/Linker.cpp but now lives in · 1cfa8d60
  Reid Spencer authored Nov 12, 2004
```
lib/Linker/LinkArchives.cpp

llvm-svn: 17693
```
  1cfa8d60
- * Clean up all the shared library output on uninstall · aee67e65
  Reid Spencer authored Nov 12, 2004
```
* Provide the correct set of input directories to the TAGS target
* Provide a CTAGS target for building Vi style ctags files.

llvm-svn: 17688
```
  aee67e65
- Document the new llvm-ranlib command. · 2ea5ae61
  Reid Spencer authored Nov 12, 2004
```
llvm-svn: 17687
```
  2ea5ae61
- Correctly terminate a list. · ffb9f061
  Reid Spencer authored Nov 12, 2004
```
llvm-svn: 17686
```
  ffb9f061
- Document the modifiers and the file format. · 2e34034d
  Reid Spencer authored Nov 12, 2004
```
llvm-svn: 17685
```
  2e34034d
Nov 11, 2004
- Make this build in release mode · 738c89ec
  Chris Lattner authored Nov 11, 2004
```
llvm-svn: 17684
```
  738c89ec
- Add llvm-ar to the index. · e448500e
  Reid Spencer authored Nov 11, 2004
```
llvm-svn: 17682
```
  e448500e
- First attempt at llvm-ar documentation. Modifiers need a little more · 8beeb495
  Reid Spencer authored Nov 11, 2004
```
explanation.

llvm-svn: 17681
```
  8beeb495
- Actually, leave the check in. This prevents us from counting dead arguments · 9621dfab
  Chris Lattner authored Nov 11, 2004
```
as IPCP opportunities.

llvm-svn: 17680
```
  9621dfab
- Fix bug: IPConstantProp/deadarg.ll · 5fa696f8
  Chris Lattner authored Nov 11, 2004
```
llvm-svn: 17679
```
  5fa696f8
- new testcase · ba582f0d
  Chris Lattner authored Nov 11, 2004
```
llvm-svn: 17678
```
  ba582f0d
- Fix documentation for Makefile target name change. install-bytecode is now · 45dc1394
  Reid Spencer authored Nov 11, 2004
```
just "install" in the runtime directory.

llvm-svn: 17677
```
  45dc1394
Nov 10, 2004
- Make IP Constant prop more aggressive about handling self recursive calls. · c1d24cd8
  Chris Lattner authored Nov 10, 2004
```
This implements IPConstantProp/recursion.ll

llvm-svn: 17666
```
  c1d24cd8
- New testcase · 59e54625
  Chris Lattner authored Nov 10, 2004
```
llvm-svn: 17665
```
  59e54625
- Correct the name of stosd for the AT&T syntax: · 04570265
  John Criswell authored Nov 10, 2004
```
It's stosl (l for long == 32 bit).

llvm-svn: 17658
```
  04570265
Nov 09, 2004
- Do not let dead constant expressions hanging off of functions prevent IPCP. · 0d3773d8
  Chris Lattner authored Nov 09, 2004
```
This allows the elimination of a bunch of global pool descriptor args from
programs being pool allocated (and is also generally useful!)

llvm-svn: 17657
```
  0d3773d8
- Provide conversion from posix time. · e5142be4
  Reid Spencer authored Nov 09, 2004
```
llvm-svn: 17656
```
  e5142be4
- Fix isBytecodeFile to correctly recognize compressed bytecode too. · 202eaeb2
  Reid Spencer authored Nov 09, 2004
```
llvm-svn: 17655
```
  202eaeb2
- * Implement getStatusInfo for getting stat(2) like information · fb1f7357
  Reid Spencer authored Nov 09, 2004
```
* Implement createTemporaryFile for mkstemp(3) functionality
* Fix isBytecodeFile to accept llvc magic # (compressed) as bytecode.

llvm-svn: 17654
```
  fb1f7357
- Make sure llee can deal with compressed bytecode too. · abbefecf
  Reid Spencer authored Nov 09, 2004
```
llvm-svn: 17652
```
  abbefecf
- Recognize compressed LLVM bytecode files. · 623dc9c5
  John Criswell authored Nov 09, 2004
```
This should fix the problem of not being able to link compressed LLVM
bytecode files from LLVM libraries.

llvm-svn: 17648
```
  623dc9c5
- Tune compression: · 6a1a10aa
  Reid Spencer authored Nov 09, 2004
```
bzip2: block size 9 -> 5, reduces memory by 400Kbytes, doesn't affect speed
       or compression ratio on all but the largest bytecode files (>1MB)
zip:   level 9 -> 6, this speeds up compression time by ~30% but only
       degrades the compressed size by a few bytes per megabyte. Those few
       bytes aren't worth the effort.

llvm-svn: 17647
```
  6a1a10aa
- Change this back so that I get stable numbers to reflect the change from the · 436285e7
  Chris Lattner authored Nov 09, 2004
```
nightly testers

llvm-svn: 17646
```
  436285e7
- Document quick-test target. · 981afd7c
  Reid Spencer authored Nov 09, 2004
```
llvm-svn: 17644
```
  981afd7c
- Add a quick-test target that uses QUICKTEST variable to quickly run a · 30060816
  Reid Spencer authored Nov 09, 2004
```
portion of the test suite. e.g.:

make quick-test QUICKTEST=Regression/Bytecode

llvm-svn: 17643
```
  30060816
- Fix bug: 2004-11-08-FreeUseCrash.ll · 1f0a97c6
  Chris Lattner authored Nov 09, 2004
```
llvm-svn: 17642
```
  1f0a97c6
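
Several of the Nov 09 commits above concern recognizing compressed bytecode by its magic number. A minimal C sketch of such a check might look like the following. The function name `looks_like_bytecode` is hypothetical and is not the LLVM `isBytecodeFile` implementation. The `llvc` magic comes from the commit message; the `llvm` magic for uncompressed files is an assumption made for this example.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for a bytecode magic-number check: accept
   buffers that start with "llvm" (assumed uncompressed magic) or
   "llvc" (compressed magic named in the commit message). */
static int looks_like_bytecode(const unsigned char *buf, size_t len) {
    assert(buf != NULL);
    if (len < 4) return 0;              /* too short to hold a magic number */
    return memcmp(buf, "llvm", 4) == 0 ||
           memcmp(buf, "llvc", 4) == 0;
}
```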