Commits · 026e5d7667c473c67452731e2d5c19335b468534 · Roger Ferrer / llvm-epi-0.8

Apr 30, 2009
- Instead of passing in an unsigned value for the optimization level, use an enum, · 026e5d76
  Bill Wendling authored Apr 29, 2009
```
which better identifies what the optimization is doing. And is more flexible for
future uses.

llvm-svn: 70440
```
  026e5d76
Apr 29, 2009

Bill Wendling authored Apr 29, 2009

Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to
use the old behavior, the flag is -O0. This change allows for finer-grained
control over which optimizations are run at different -O levels.

Most of this work was pretty mechanical. The majority of the fixes came from
verifying that a "fast" variable wasn't used anymore. The JIT still uses a
"Fast" flag. I'll change the JIT with a follow-up patch.

llvm-svn: 70343

084669a1

Apr 28, 2009

r70270 isn't ready yet. Back this out. Sorry for the noise. · 56f2987a
Bill Wendling authored Apr 28, 2009
```
llvm-svn: 70275
```
56f2987a

Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to · d0ae1594

Bill Wendling authored Apr 28, 2009

use the old behavior, the flag is -O0. This change allows for finer-grained
control over which optimizations are run at different -O levels.

Most of this work was pretty mechanical. The majority of the fixes came from
verifying that a "fast" variable wasn't used anymore. The JIT still uses a
"Fast" flag. I'm not 100% sure if it's necessary to change it there...

llvm-svn: 70270

d0ae1594

Apr 16, 2009
- fix PR3995. A scale must be 1, 2, 4 or 8. · 5e42177a
  Rafael Espindola authored Apr 16, 2009
```
llvm-svn: 69284
```
  5e42177a
Apr 15, 2009
- For the h-register addressing-mode trick, use the correct value for · 62f44986
  Dan Gohman authored Apr 14, 2009
```
any non-address uses of the address value. This fixes 186.crafty.

llvm-svn: 69094
```
  62f44986
Apr 13, 2009

Implement x86 h-register extract support. · 57d6bd36

Dan Gohman authored Apr 13, 2009

 - Add patterns for h-register extract, which avoids a shift and mask,
   and in some cases a temporary register.
 - Add address-mode matching for turning (X>>(8-n))&(255<<n), where
   n is a valid address-mode scale value, into an h-register extract
   and a scaled-offset address.
 - Replace X86's MOV32to32_ and related instructions with the new
   target-independent COPY_TO_SUBREG instruction.

On x86-64 there are complicated constraints on h registers, and
CodeGen doesn't currently provide a high-level way to express all of them,
so they are handled with a bunch of special code. This code currently only
supports extracts where the result is used by a zero-extend or a store,
though these are fairly common.

These transformations are not always beneficial; since there are only
4 h registers, they sometimes require extra move instructions, and
this sometimes increases register pressure because it can force out
values that would otherwise be in one of those registers. However,
this appears to be relatively uncommon.

llvm-svn: 68962

57d6bd36

Remove x86's special-case handling for ISD::TRUNCATE and · f20462c2

Dan Gohman authored Apr 13, 2009

ISD::SIGN_EXTEND_INREG. Tablegen-generated code can handle
these cases, and the scheduling issues observed earlier
appear to be resolved now.

llvm-svn: 68959

f20462c2

Use X86::SUBREG_8BIT instead of hard-coding the equivalent constant. · 092b8b6f
Dan Gohman authored Apr 13, 2009
```
llvm-svn: 68951
```
092b8b6f
X86-64 TLS support for local exec and initial exec. · 6d6c6043
Rafael Espindola authored Apr 13, 2009
```
llvm-svn: 68947
```
6d6c6043
In X86DAGToDAGISel::MatchWrapper, if base or index are set, avoid matching · 7186f20a
Rafael Espindola authored Apr 12, 2009
```
only if symbolic addresses are RIP relatives.

llvm-svn: 68924
```
7186f20a

Apr 12, 2009
- refactor some code into X86DAGToDAGISel::MatchWrapper · 6688b0a5
  Rafael Espindola authored Apr 12, 2009
```
llvm-svn: 68915
```
  6688b0a5
Apr 10, 2009

Don't fold a load if the other operand is a TLS address. · bb834f09

Rafael Espindola authored Apr 10, 2009

With this we generate

movl    %gs:0, %eax
leal    i@NTPOFF(%eax), %eax

instead of

movl    $i@NTPOFF, %eax
addl    %gs:0, %eax

llvm-svn: 68778

bb834f09

Apr 08, 2009

Re-apply 68552. · 3b2df10c

Rafael Espindola authored Apr 08, 2009

Tested by bootstrapping llvm-gcc and using that to build llvm.

llvm-svn: 68645

3b2df10c

Temporarily revert r68552. This was causing a failure in the self-hosting LLVM · 4aa25b79

Bill Wendling authored Apr 07, 2009

builds.

--- Reverse-merging (from foreign repository) r68552 into '.':
U    test/CodeGen/X86/tls8.ll
U    test/CodeGen/X86/tls10.ll
U    test/CodeGen/X86/tls2.ll
U    test/CodeGen/X86/tls6.ll
U    lib/Target/X86/X86Instr64bit.td
U    lib/Target/X86/X86InstrSSE.td
U    lib/Target/X86/X86InstrInfo.td
U    lib/Target/X86/X86RegisterInfo.cpp
U    lib/Target/X86/X86ISelLowering.cpp
U    lib/Target/X86/X86CodeEmitter.cpp
U    lib/Target/X86/X86FastISel.cpp
U    lib/Target/X86/X86InstrInfo.h
U    lib/Target/X86/X86ISelDAGToDAG.cpp
U    lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.cpp
U    lib/Target/X86/AsmPrinter/X86IntelAsmPrinter.cpp
U    lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.h
U    lib/Target/X86/AsmPrinter/X86IntelAsmPrinter.h
U    lib/Target/X86/X86ISelLowering.h
U    lib/Target/X86/X86InstrInfo.cpp
U    lib/Target/X86/X86InstrBuilder.h
U    lib/Target/X86/X86RegisterInfo.td

llvm-svn: 68560

4aa25b79

Apr 07, 2009

Reduce code duplication on the TLS implementation. · 1edda067

Rafael Espindola authored Apr 07, 2009

This introduces a small regression on the generated code
quality in the case we are just computing addresses, not
loading values.

Will work on it and on X86-64 support.

llvm-svn: 68552

1edda067

Mar 31, 2009

remove unused arguments. · 9277379f
Rafael Espindola authored Mar 31, 2009
```
llvm-svn: 68109
```
9277379f

X86 address mode isel tweak. If the base of the address is also used by a... · 885bc6de

Evan Cheng authored Mar 31, 2009

X86 address mode isel tweak. If the base of the address is also used by a CopyToReg (i.e. it's likely live-out), do not fold the sub-expressions into the addressing mode to avoid computing the address twice. The CopyToReg use will be isel'ed to a LEA, re-use it for address instead.

This is not yet enabled.

llvm-svn: 68082

885bc6de

Mar 30, 2009

When optimzing a mul by immediate into two, the resulting mul's should get a... · a84a3188

Evan Cheng authored Mar 30, 2009

When optimzing a mul by immediate into two, the resulting mul's should get a x86 specific node to avoid dag combiner from hacking on them further.

llvm-svn: 68066

a84a3188

Mar 28, 2009
- Use array_lengthof · 1f11c3c3
  Rafael Espindola authored Mar 28, 2009
```
llvm-svn: 67950
```
  1f11c3c3
Mar 27, 2009
- Use less hard coded constants to make the code less brittle. · 22781543
  Rafael Espindola authored Mar 27, 2009
```
llvm-svn: 67846
```
  22781543
Mar 14, 2009

Don't forego folding of loads into 64-bit adds when the other · 2293eb60

Dan Gohman authored Mar 14, 2009

operand is a signed 32-bit immediate. Unlike with the 8-bit
signed immediate case, it isn't actually smaller to fold a
32-bit signed immediate instead of a load. In fact, it's
larger in the case of 32-bit unsigned immediates, because
they can be materialized with movl instead of movq.

llvm-svn: 67001

2293eb60

Mar 13, 2009

Enhance address-mode folding of ISD::ADD to handle cases where the · a1d92423

Dan Gohman authored Mar 13, 2009

operands can't both be fully folded at the same time. For example,
in the included testcase, a global variable is being added with
an add of two values. The global variable wants RIP-relative
addressing, so it can't share the address with another base
register, but it's still possible to fold the initial add.

llvm-svn: 66865

a1d92423

Feb 13, 2009
- Remove non-DebugLoc versions of BuildMI from X86. · 9bba902c
  Dale Johannesen authored Feb 13, 2009
```
There were some that might even matter in X86FastISel.

llvm-svn: 64437
```
  9bba902c
Feb 12, 2009
- fix the X86 backend to just drop llvm.declare nodes for VLAs instead of · aed3a421
  Chris Lattner authored Feb 12, 2009
```
leaving them in the DAG and then getting selection errors.  This is a 
fix for PR3538.

llvm-svn: 64382
```
  aed3a421
Feb 07, 2009
- Use getDebugLoc forwarder instead of getNode()->getDebugLoc. · 9c310711
  Dale Johannesen authored Feb 07, 2009
```
No functional change.

llvm-svn: 64026
```
  9c310711
- Refactor some repeated logic into a separate function. · 4e3e3dee
  Dan Gohman authored Feb 07, 2009
```
llvm-svn: 63989
```
  4e3e3dee
Feb 06, 2009
- Get rid of one more non-DebugLoc getNode and · 9f3f72f1
  Dale Johannesen authored Feb 06, 2009
```
its corresponding getTargetNode.  Lots of
caller changes.

llvm-svn: 63904
```
  9f3f72f1
Feb 04, 2009
- Patch up omissions in DebugLoc propagation. · bbf13f54
  Dale Johannesen authored Feb 04, 2009
```
llvm-svn: 63693
```
  bbf13f54
Feb 03, 2009
- DebugLoc propgation · 14f2d9dc
  Dale Johannesen authored Feb 03, 2009
```
llvm-svn: 63664
```
  14f2d9dc
Jan 27, 2009
- Simplify findNonImmUse; return the result using the return value · f77f0ce2
  Dan Gohman authored Jan 27, 2009
```
instead of via a by-reference argument. No functionality change.

llvm-svn: 63118
```
  f77f0ce2
- Eliminate unnecessary operands-list traversals. · 7740523a
  Dan Gohman authored Jan 27, 2009
```
llvm-svn: 63088
```
  7740523a
Jan 26, 2009

Enhance logic in X86DAGToDAGISel::PreprocessForRMW which move load inside... · 6c7e8514

Evan Cheng authored Jan 26, 2009

Enhance logic in X86DAGToDAGISel::PreprocessForRMW which move load inside callseq_start to allow it to be folded into a call. It was not considering the cases where a token factor is between the load and the callseq_start.

llvm-svn: 63022

6c7e8514

Jan 21, 2009

Fix a recent regression. ClrOpcode is not set for i8; for i8, if · b43c8996

Dan Gohman authored Jan 21, 2009

we want to clear %ah to zero before a division, just use a
zero-extending mov to %al. This fixes PR3366.

llvm-svn: 62691

b43c8996

Jan 19, 2009

DIVREM isel deficiency: If sign bit is known zero, zero out DX/EDX/RDX instead... · 44cc5543

Evan Cheng authored Jan 19, 2009

DIVREM isel deficiency: If sign bit is known zero, zero out DX/EDX/RDX instead of sign extending the low part (in AX/EAX/RAX) into it.

llvm-svn: 62519

44cc5543

Jan 17, 2009
- Fix MatchAddress bug that's preventing negative displacement from being folded in 64-bit mode. · bf38a5e5
  Evan Cheng authored Jan 17, 2009
```
llvm-svn: 62413
```
  bf38a5e5
Jan 15, 2009

Move a few containers out of ScheduleDAGInstrs::BuildSchedGraph · 619ef48a

Dan Gohman authored Jan 15, 2009

and into the ScheduleDAGInstrs class, so that they don't get
destructed and re-constructed for each block. This fixes a
compile-time hot spot in the post-pass scheduler.

To help facilitate this, tidy and do some minor reorganization
in the scheduler constructor functions.

llvm-svn: 62275

619ef48a

Jan 10, 2009
- 80 col violation. · 5a272e79
  Evan Cheng authored Jan 10, 2009
```
llvm-svn: 62024
```
  5a272e79
Dec 10, 2008
- Some code clean up. · 01fa50ca
  Evan Cheng authored Dec 10, 2008
```
llvm-svn: 60850
```
  01fa50ca
Nov 27, 2008

On x86 favors folding short immediate into some arithmetic operations (e.g.... · 83bdb389

Evan Cheng authored Nov 27, 2008

On x86 favors folding short immediate into some arithmetic operations (e.g. add, and, xor, etc.) because materializing an immediate in a register is expensive in turns of code size.
e.g.
movl 4(%esp), %eax
addl $4, %eax

is 2 bytes shorter than

movl $4, %eax
addl 4(%esp), %eax

llvm-svn: 60139

83bdb389