Commits · afa12db8a631e9b7a58c1baa8ce5fd2c711971ad · Roger Ferrer / llvm-epi-0.8

Aug 25, 2009
- Use X86II::MO_NO_FLAG. · 0f6bf2db
  Dan Gohman authored Aug 25, 2009
```
llvm-svn: 80012
```
  0f6bf2db
Aug 23, 2009
- Remove Streams.h from the targets. · 940fbb0e
  Benjamin Kramer authored Aug 23, 2009
```
llvm-svn: 79853
```
  940fbb0e
Aug 22, 2009
- Record variable debug info at ISel time directly. · 09395957
  Devang Patel authored Aug 22, 2009
```
llvm-svn: 79742
```
  09395957
Aug 21, 2009
- Fix a typo · 7950510b
  Anton Korobeynikov authored Aug 21, 2009
```
llvm-svn: 79634
```
  7950510b
Aug 20, 2009

Fix an x86 code size regression: prefer RIP-relative addressing · 05046085

Dan Gohman authored Aug 20, 2009

over absolute addressing even in non-PIC mode (unless the address
has an index or something else incompatible), because it has a
smaller encoding.

llvm-svn: 79553

05046085

Aug 19, 2009
- Remove temporary testing code. · de255fc8
  Dan Gohman authored Aug 19, 2009
```
llvm-svn: 79443
```
  de255fc8
- Add an x86 peep that narrows TEST instructions to forms that use · ac33a906
  Dan Gohman authored Aug 19, 2009
```
a smaller encoding. These kinds of patterns are very frequent in
sqlite3, for example.

llvm-svn: 79439
```
  ac33a906
Aug 11, 2009
- Split EVT into MVT and EVT, the former representing _just_ a primitive type, while · 9f94459d
  Owen Anderson authored Aug 11, 2009
```
the latter is capable of representing either a primitive or an extended type.

llvm-svn: 78713
```
  9f94459d
- Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. · 53aa7a96
  Owen Anderson authored Aug 10, 2009
```
llvm-svn: 78610
```
  53aa7a96
Aug 07, 2009
- Reformatting of lines. Put multiple DEBUG statements under one DEBUG statement. · fe3bdb4b
  Bill Wendling authored Aug 07, 2009
```
llvm-svn: 78411
```
  fe3bdb4b
Aug 06, 2009
- Fix a bug in x86's PreprocessForRMW logic that was exposed · 130e2c7a
  Dan Gohman authored Aug 06, 2009
```
by aggressive chain operand optimization. UpdateNodeOperands
does not modify the node in place if it would result in
a node identical to an existing node.

llvm-svn: 78297
```
  130e2c7a
- Better handle kernel code model. Also, generalize the things and fix one · 741ea0d7
  Anton Korobeynikov authored Aug 05, 2009
```
subtle bug with small code model.

llvm-svn: 78255
```
  741ea0d7
Aug 03, 2009
- - s/DOUT/DEBUG(errs()/g · 6eecd56e
  Bill Wendling authored Aug 03, 2009
```
- Tidy up some headers.

llvm-svn: 77929
```
  6eecd56e
Aug 02, 2009
- Fix indentation. · 757eee8a
  Dan Gohman authored Aug 02, 2009
```
llvm-svn: 77895
```
  757eee8a
Aug 01, 2009
- Minor code simplifications. · edfad17d
  Dan Gohman authored Aug 01, 2009
```
llvm-svn: 77768
```
  edfad17d
Jul 30, 2009

Optimize some common usage patterns of atomic built-ins __sync_add_and_fetch()... · e62288fd

Evan Cheng authored Jul 30, 2009

Optimize some common usage patterns of atomic built-ins __sync_add_and_fetch() and __sync_sub_and_fetch.

When the return value is not used (i.e. only care about the value in the memory), x86 does not have to use add to implement these. Instead, it can use add, sub, inc, dec instructions with the "lock" prefix.

This is currently implemented using a bit of instruction selection trick. The issue is the target independent pattern produces one output and a chain and we want to map it into one that just output a chain. The current trick is to select it into a merge_values with the first definition being an implicit_def. The proper solution is to add new ISD opcodes for the no-output variant. DAG combiner can then transform the node before it gets to target node selection.

Problem #2 is we are adding a whole bunch of x86 atomic instructions when in fact these instructions are identical to the non-lock versions. We need a way to add target specific information to target nodes and have this information carried over to machine instructions. Asm printer (or JIT) can use this information to add the "lock" prefix.

llvm-svn: 77582

e62288fd

Jul 23, 2009
- x86 isel tweak: use lea (%reg,%reg) instead of lea (,%reg,2). · 824ab403
  Dan Gohman authored Jul 22, 2009
```
llvm-svn: 76817
```
  824ab403
Jul 14, 2009

reapply r75408, which eliminates MOV64r0 in favor of using · 79c136d4

Chris Lattner authored Jul 14, 2009

MOV32r0 + subregs to do the same thing.  This should work now
that PR4544 is fixed.  Thanks Evan!

llvm-svn: 75671

79c136d4

llvm_unreachable->llvm_unreachable(0), LLVM_UNREACHABLE->llvm_unreachable. · fbcc663c

Torok Edwin authored Jul 14, 2009

This adds location info for all llvm_unreachable calls (which is a macro now) in
!NDEBUG builds.
In NDEBUG builds location info and the message is off (it only prints
"UREACHABLE executed").

llvm-svn: 75640

fbcc663c

Jul 12, 2009

Temporarily revert r75408. It appears to break the Apple-style builds: · 5b76fc03

Bill Wendling authored Jul 12, 2009

x86_64-apple-darwin10-gcc -c   -g -O2  -DIN_GCC   -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -pedantic -Wno-long-long -Wno-variadic-macros -Wno-overlength-strings -Wold-style-definition -Wmissing-format-attribute   -mdynamic-no-pic -DHAVE_CONFIG_H -I. -I. -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc/. -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc/../include -I./../intl -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc/../libcpp/include  -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmCore.roots/llvmCore~dst/Developer/usr/local/include -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmCore.roots/llvmCore~obj/src/include -DENABLE_LLVM -I/Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmCore.roots/llvmCore~dst/Developer/usr/local/include  -D_DEBUG  -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -D__STDC_CONSTANT_MACROS -DLLVM_VERSION_INFO='"9999"' -DBUILD_LLVM_APPLE_STYLE   /Volumes/Sandbox/Buildbot/llvm/build.llvm-gcc-x86_64-darwin10-selfhost/build/llvmgcc42.roots/llvmgcc42~obj/src/gcc/tree-ssa-alias.c -o tree-ssa-alias.o
/var/tmp//ccJQ2JBT.s:4134:Incorrect register `%rcx' used with `l' suffix
make[2]: *** [tree-ssa-live.o] Error 1
make[2]: *** Waiting for unfinished jobs....

llvm-svn: 75412

5b76fc03

eliminate MOV64r0 in favor of a Pat<> pattern. This is only nontrivial because · 02c4339b
Chris Lattner authored Jul 12, 2009
```
the div lowering code explicitly references it.

llvm-svn: 75408
```
02c4339b
fix a bug in my cleanup patch · 48cee9b4
Chris Lattner authored Jul 11, 2009
```
llvm-svn: 75402
```
48cee9b4
comment cleanup, reduce nesting. · 4d10f1a6
Chris Lattner authored Jul 11, 2009
```
llvm-svn: 75398
```
4d10f1a6

Jul 11, 2009

assert(0) -> LLVM_UNREACHABLE. · 56d06597

Torok Edwin authored Jul 11, 2009

Make llvm_unreachable take an optional string, thus moving the cerr<< out of
line.
LLVM_UNREACHABLE is now a simple wrapper that makes the message go away for
NDEBUG builds.

llvm-svn: 75379

56d06597

Jul 08, 2009
- Implement changes from Chris's feedback. · fb8d6d5b
  Torok Edwin authored Jul 08, 2009
```
Finish converting lib/Target.

llvm-svn: 75043
```
  fb8d6d5b
Jun 27, 2009

Reimplement rip-relative addressing in the X86-64 backend. The new · fea81da4

Chris Lattner authored Jun 27, 2009

implementation primarily differs from the former in that the asmprinter
doesn't make a zillion decisions about whether or not something will be
RIP relative or not.  Instead, those decisions are made by isel lowering
and propagated through to the asm printer.  To achieve this, we:

1. Represent RIP relative addresses by setting the base of the X86 addr
   mode to X86::RIP.
2. When ISel Lowering decides that it is safe to use RIP, it lowers to
   X86ISD::WrapperRIP.  When it is unsafe to use RIP, it lowers to
   X86ISD::Wrapper as before.
3. This removes isRIPRel from X86ISelAddressMode, representing it with
   a basereg of RIP instead.
4. The addressing mode matching logic in isel is greatly simplified.
5. The asmprinter is greatly simplified, notably the "NotRIPRel" predicate
   passed through various printoperand routines is gone now.
6. The various symbol printing routines in asmprinter now no longer infer
   when to emit (%rip), they just print the symbol.

I think this is a big improvement over the previous situation.  It does have
two small caveats though: 1. I implemented a horrible "no-rip" modifier for
the inline asm "P" constraint modifier.  This is a short term hack, there is
a much better, but more involved, solution.  2. I had to xfail an 
-aggressive-remat testcase because it isn't handling the use of RIP in the
constant-pool reading instruction.  This specific test is easy to fix without
-aggressive-remat, which I intend to do next.

llvm-svn: 74372

fea81da4

Jun 26, 2009
- make sure to propagate operand flags in SelectTLSADDRAddr properly. · 899abc46
  Chris Lattner authored Jun 26, 2009
```
llvm-svn: 74326
```
  899abc46
- fix a pasto. · 1d3b65a6
  Chris Lattner authored Jun 26, 2009
```
llvm-svn: 74275
```
  1d3b65a6
- propagate target operand flags through addressing mode selection. · bd7e26db
  Chris Lattner authored Jun 26, 2009
```
llvm-svn: 74272
```
  bd7e26db
Jun 20, 2009

change TLS_ADDR lowering to lower to a real mem operand, instead of matching as · 7d2b0494

Chris Lattner authored Jun 20, 2009

a global with that gets printed with the :mem modifier.  All operands to lea's 
should be handled with the lea32mem operand kind, and this allows the TLS stuff
to do this.  There are several better ways to do this, but I went for the minimal
change since I can't really test this (beyond make check).

This also makes the use of EBX explicit in the operand list in the 32-bit, 
instead of implicit in the instruction.

llvm-svn: 73834

7d2b0494

Jun 03, 2009

Remove the redundant TM member from X86DAGToDAGISel; replace it · 4751bb9e

Dan Gohman authored Jun 03, 2009

with an accessor method which simply casts the parent class
SelectionDAGISel's TM to the target-specific type.

llvm-svn: 72801

4751bb9e

May 11, 2009
- Convert a subtract into a negate and an add when it helps x86 · faf75c8c
  Dan Gohman authored May 11, 2009
```
address folding.

llvm-svn: 71446
```
  faf75c8c
May 08, 2009
- Factor out cycle-finder code and make it generic. · 65a58168
  Anton Korobeynikov authored May 08, 2009
```
llvm-svn: 71241
```
  65a58168
Apr 30, 2009
- Instead of passing in an unsigned value for the optimization level, use an enum, · 026e5d76
  Bill Wendling authored Apr 29, 2009
```
which better identifies what the optimization is doing. And is more flexible for
future uses.

llvm-svn: 70440
```
  026e5d76
Apr 29, 2009

Second attempt: · 084669a1

Bill Wendling authored Apr 29, 2009

Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to
use the old behavior, the flag is -O0. This change allows for finer-grained
control over which optimizations are run at different -O levels.

Most of this work was pretty mechanical. The majority of the fixes came from
verifying that a "fast" variable wasn't used anymore. The JIT still uses a
"Fast" flag. I'll change the JIT with a follow-up patch.

llvm-svn: 70343

084669a1

Apr 28, 2009

r70270 isn't ready yet. Back this out. Sorry for the noise. · 56f2987a
Bill Wendling authored Apr 28, 2009
```
llvm-svn: 70275
```
56f2987a

Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to · d0ae1594

Bill Wendling authored Apr 28, 2009

use the old behavior, the flag is -O0. This change allows for finer-grained
control over which optimizations are run at different -O levels.

Most of this work was pretty mechanical. The majority of the fixes came from
verifying that a "fast" variable wasn't used anymore. The JIT still uses a
"Fast" flag. I'm not 100% sure if it's necessary to change it there...

llvm-svn: 70270

d0ae1594

Apr 16, 2009
- fix PR3995. A scale must be 1, 2, 4 or 8. · 5e42177a
  Rafael Espindola authored Apr 16, 2009
```
llvm-svn: 69284
```
  5e42177a
Apr 15, 2009
- For the h-register addressing-mode trick, use the correct value for · 62f44986
  Dan Gohman authored Apr 14, 2009
```
any non-address uses of the address value. This fixes 186.crafty.

llvm-svn: 69094
```
  62f44986
Apr 13, 2009

Implement x86 h-register extract support. · 57d6bd36

Dan Gohman authored Apr 13, 2009

 - Add patterns for h-register extract, which avoids a shift and mask,
   and in some cases a temporary register.
 - Add address-mode matching for turning (X>>(8-n))&(255<<n), where
   n is a valid address-mode scale value, into an h-register extract
   and a scaled-offset address.
 - Replace X86's MOV32to32_ and related instructions with the new
   target-independent COPY_TO_SUBREG instruction.

On x86-64 there are complicated constraints on h registers, and
CodeGen doesn't currently provide a high-level way to express all of them,
so they are handled with a bunch of special code. This code currently only
supports extracts where the result is used by a zero-extend or a store,
though these are fairly common.

These transformations are not always beneficial; since there are only
4 h registers, they sometimes require extra move instructions, and
this sometimes increases register pressure because it can force out
values that would otherwise be in one of those registers. However,
this appears to be relatively uncommon.

llvm-svn: 68962

57d6bd36