Commits · 43f33dd550a2d165c41cbf01cae97d513dd5002c · Roger Ferrer / llvm-epi-0.8

Jul 02, 2009
- Fix a bunch of other places that used operator[] to test whether · 43f33dd5
  Dan Gohman authored Jul 02, 2009
```
a key is present in a std::map or DenseMap to use find instead.

llvm-svn: 74676
```
  43f33dd5
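
The pitfall this commit addresses can be reproduced with a plain `std::map`. The sketch below is illustrative only: `hasKeyViaSubscript` and `hasKeyViaFind` are hypothetical helper names, not code from the commit.

```cpp
#include <cassert>
#include <map>
#include <string>

// Buggy presence test: operator[] default-constructs and inserts a value
// for a missing key, so the "test" itself grows the map.
inline bool hasKeyViaSubscript(std::map<std::string, int> &M,
                               const std::string &Key) {
  return M[Key] != 0;
}

// Side-effect-free presence test: find() never inserts, so it also works
// on a const map.
inline bool hasKeyViaFind(const std::map<std::string, int> &M,
                          const std::string &Key) {
  return M.find(Key) != M.end();
}
```

Besides the unwanted insertion, the subscript form cannot distinguish a missing key from a key whose stored value is zero.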
Jul 01, 2009
- Fix some fast-isel problems selecting global variable addressing in · f95fa1b7
  Chris Lattner authored Jul 01, 2009
```
pic mode.

llvm-svn: 74582
```
  f95fa1b7
Jun 27, 2009

simplify some code and eliminate the symbolicAddressesAreRIPRel() predicate. · cce1589e
Chris Lattner authored Jun 27, 2009
```
llvm-svn: 74377
```
cce1589e
fix clang/test/CodeGenObjC/try.m, a basereg doesn't mean no global anymore. · d17366ae
Chris Lattner authored Jun 27, 2009
```
llvm-svn: 74375
```
d17366ae

Reimplement rip-relative addressing in the X86-64 backend. The new · fea81da4

Chris Lattner authored Jun 27, 2009

implementation primarily differs from the former in that the asmprinter
doesn't make a zillion decisions about whether or not something will be
RIP relative or not.  Instead, those decisions are made by isel lowering
and propagated through to the asm printer.  To achieve this, we:

1. Represent RIP relative addresses by setting the base of the X86 addr
   mode to X86::RIP.
2. When ISel Lowering decides that it is safe to use RIP, it lowers to
   X86ISD::WrapperRIP.  When it is unsafe to use RIP, it lowers to
   X86ISD::Wrapper as before.
3. This removes isRIPRel from X86ISelAddressMode, representing it with
   a basereg of RIP instead.
4. The addressing mode matching logic in isel is greatly simplified.
5. The asmprinter is greatly simplified, notably the "NotRIPRel" predicate
   passed through various printoperand routines is gone now.
6. The various symbol printing routines in asmprinter now no longer infer
   when to emit (%rip), they just print the symbol.

I think this is a big improvement over the previous situation.  It does have
two small caveats though: 1. I implemented a horrible "no-rip" modifier for
the inline asm "P" constraint modifier.  This is a short term hack, there is
a much better, but more involved, solution.  2. I had to xfail an 
-aggressive-remat testcase because it isn't handling the use of RIP in the
constant-pool reading instruction.  This specific test is easy to fix without
-aggressive-remat, which I intend to do next.

llvm-svn: 74372

fea81da4
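
The division of labor described in this commit (lowering decides, the printer only prints) can be sketched roughly as follows. Everything here is a simplified stand-in: `Reg`, `AddrMode`, `lowerGlobal`, and `printAddr` are hypothetical names, not the real X86 backend types or functions.

```cpp
#include <cassert>
#include <string>

// Hypothetical register numbers; not the real X86:: enum.
enum Reg { NoReg, RIP, RAX };

// Simplified address mode: just a base register and a symbol name.
struct AddrMode {
  Reg Base = NoReg;
  std::string Symbol;
};

// "Lowering" makes the RIP decision exactly once and records it by
// setting the base register to RIP.
inline AddrMode lowerGlobal(const std::string &Sym, bool SafeToUseRIP) {
  AddrMode AM;
  AM.Symbol = Sym;
  if (SafeToUseRIP)
    AM.Base = RIP;
  return AM;
}

// The "printer" infers nothing: it prints the symbol, then whatever
// base register the address mode already carries.
inline std::string printAddr(const AddrMode &AM) {
  std::string Out = AM.Symbol;
  if (AM.Base == RIP)
    Out += "(%rip)";
  else if (AM.Base == RAX)
    Out += "(%rax)";
  return Out;
}
```

With this shape, `printAddr(lowerGlobal("foo", true))` yields `foo(%rip)`, and the printer needs no separate "is this RIP relative" predicate.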

Fix PR4466 by making fastisel set operand flags correctly. · a3260c0b
Chris Lattner authored Jun 27, 2009
```
llvm-svn: 74366
```
a3260c0b

Jun 12, 2009

Fix Bug 4278: X86-64 with -tailcallopt calling convention · e3a018d7

Arnold Schwaighofer authored Jun 12, 2009

out of sync with regular cc.

The only difference between the tail call cc and the normal
cc was that one parameter register - R9 - was reserved for
calling functions through a function pointer. Over time, the
tail call cc has gotten out of sync with the regular cc.

We can use R11, which is also caller-saved but not used as a
parameter register, for potential function pointers and
remove the special tail call cc on x86-64.

llvm-svn: 73233

e3a018d7

Jun 03, 2009
- Avoid a warning "'U' might be used uninitialized in · c66ad73e
  Duncan Sands authored Jun 03, 2009
```
this function" when using a not-too-smart compiler.

llvm-svn: 72768
```
  c66ad73e
May 09, 2009

Rename PaddedSize to AllocSize, in the hope that this · af9eaa83

Duncan Sands authored May 09, 2009

will make it more obvious what it represents, and stop
it being confused with the StoreSize.

llvm-svn: 71349

af9eaa83
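
The distinction between the two sizes can be sketched as below. This is a minimal illustration assuming the usual definitions (store size is the bit width rounded up to whole bytes, alloc size is the store size rounded up to the ABI alignment); `storeSize` and `allocSize` are hypothetical free functions, not the actual TargetData methods.

```cpp
#include <cassert>
#include <cstdint>

// Bytes touched by a store of a value that is SizeInBits wide.
inline uint64_t storeSize(uint64_t SizeInBits) {
  return (SizeInBits + 7) / 8;
}

// Distance between consecutive elements of this type in an array:
// the store size rounded up to the ABI alignment (in bytes, nonzero).
inline uint64_t allocSize(uint64_t SizeInBits, uint64_t ABIAlign) {
  uint64_t S = storeSize(SizeInBits);
  return (S + ABIAlign - 1) / ABIAlign * ABIAlign;
}
```

For an 80-bit value, `storeSize(80)` is 10 bytes, while `allocSize(80, 4)` is 12 and `allocSize(80, 16)` is 16; the alignments used here are illustrative inputs.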

May 04, 2009
- X86FastISel doesn't support the -tailcallopt ABI. · bb525f7e
  Dan Gohman authored May 04, 2009
```
llvm-svn: 70902
```
  bb525f7e
Apr 27, 2009
- Rename GR8_, GR16_, GR32_, and GR64_ to GR8_ABCD, GR16_ABCD, · ec542ca6
  Dan Gohman authored Apr 27, 2009
```
GR32_ABCD, and GR64_ABCD, respectively, to help describe them.

llvm-svn: 70210
```
  ec542ca6
Apr 13, 2009

Implement x86 h-register extract support. · 57d6bd36

Dan Gohman authored Apr 13, 2009

 - Add patterns for h-register extract, which avoids a shift and mask,
   and in some cases a temporary register.
 - Add address-mode matching for turning (X>>(8-n))&(255<<n), where
   n is a valid address-mode scale value, into an h-register extract
   and a scaled-offset address.
 - Replace X86's MOV32to32_ and related instructions with the new
   target-independent COPY_TO_SUBREG instruction.

On x86-64 there are complicated constraints on h registers, and
CodeGen doesn't currently provide a high-level way to express all of them,
so they are handled with a bunch of special code. This code currently only
supports extracts where the result is used by a zero-extend or a store,
though these are fairly common.

These transformations are not always beneficial; since there are only
4 h registers, they sometimes require extra move instructions, and
this sometimes increases register pressure because it can force out
values that would otherwise be in one of those registers. However,
this appears to be relatively uncommon.

llvm-svn: 68962

57d6bd36
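
The address-mode pattern in the second bullet rests on a bit identity that can be checked directly: shifting right by `8-n` and masking with `255<<n` gives the same value as extracting bits 8..15 (what an h-register read provides) and then scaling by `1<<n`. The helper names below are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// The original shift-and-mask form: (X >> (8 - N)) & (255 << N).
inline uint32_t shiftAndMask(uint32_t X, unsigned N) {
  return (X >> (8 - N)) & (255u << N);
}

// The rewritten form: extract bits 8..15, then scale by 1 << N as an
// addressing-mode scale would.
inline uint32_t extractThenScale(uint32_t X, unsigned N) {
  uint32_t H = (X >> 8) & 255u;
  return H << N;
}

// True if both forms agree for every valid scale exponent
// (N = 0..3, i.e. scales 1, 2, 4, 8).
inline bool formsAgree(uint32_t X) {
  for (unsigned N = 0; N <= 3; ++N)
    if (shiftAndMask(X, N) != extractThenScale(X, N))
      return false;
  return true;
}
```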

Apr 12, 2009
- fix a cross-block fastisel crash handling overflow intrinsics. · ce6bcf08
  Chris Lattner authored Apr 12, 2009
```
See comment for details.  This fixes rdar://6772169

llvm-svn: 68890
```
  ce6bcf08
- simplify code by using IntrinsicInst. · 99a8cb62
  Chris Lattner authored Apr 12, 2009
```
llvm-svn: 68887
```
  99a8cb62
- Add new TargetInstrDesc::hasImplicitUseOfPhysReg and · 24ac95ab
  Chris Lattner authored Apr 12, 2009
```
hasImplicitDefOfPhysReg methods.  Use them to remove a
loop in X86 fast isel.

llvm-svn: 68886
```
  24ac95ab
Apr 08, 2009

Re-apply 68552. · 3b2df10c

Rafael Espindola authored Apr 08, 2009

Tested by bootstrapping llvm-gcc and using that to build llvm.

llvm-svn: 68645

3b2df10c

Temporarily revert r68552. This was causing a failure in the self-hosting LLVM · 4aa25b79

Bill Wendling authored Apr 07, 2009

builds.

--- Reverse-merging (from foreign repository) r68552 into '.':
U    test/CodeGen/X86/tls8.ll
U    test/CodeGen/X86/tls10.ll
U    test/CodeGen/X86/tls2.ll
U    test/CodeGen/X86/tls6.ll
U    lib/Target/X86/X86Instr64bit.td
U    lib/Target/X86/X86InstrSSE.td
U    lib/Target/X86/X86InstrInfo.td
U    lib/Target/X86/X86RegisterInfo.cpp
U    lib/Target/X86/X86ISelLowering.cpp
U    lib/Target/X86/X86CodeEmitter.cpp
U    lib/Target/X86/X86FastISel.cpp
U    lib/Target/X86/X86InstrInfo.h
U    lib/Target/X86/X86ISelDAGToDAG.cpp
U    lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.cpp
U    lib/Target/X86/AsmPrinter/X86IntelAsmPrinter.cpp
U    lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.h
U    lib/Target/X86/AsmPrinter/X86IntelAsmPrinter.h
U    lib/Target/X86/X86ISelLowering.h
U    lib/Target/X86/X86InstrInfo.cpp
U    lib/Target/X86/X86InstrBuilder.h
U    lib/Target/X86/X86RegisterInfo.td

llvm-svn: 68560

4aa25b79

Apr 07, 2009

Reduce code duplication on the TLS implementation. · 1edda067

Rafael Espindola authored Apr 07, 2009

This introduces a small regression on the generated code
quality in the case we are just computing addresses, not
loading values.

Will work on it and on X86-64 support.

llvm-svn: 68552

1edda067

Mar 14, 2009

Improve FastISel's handling of truncates to i1, and implement · a62e4ab6

Dan Gohman authored Mar 13, 2009

ptrtoint and inttoptr in X86FastISel. These casts aren't always
handled in the generic FastISel code because X86 sometimes needs
custom code to do truncation and zero-extension.

llvm-svn: 66988

a62e4ab6

Mar 13, 2009

Fix FastISel's assumption that i1 values are always zero-extended · c0bb9595

Dan Gohman authored Mar 13, 2009

by inserting explicit zero extensions where necessary. Included
is a testcase where SelectionDAG produces a virtual register
holding an i1 value which FastISel previously mistakenly assumed
to be zero-extended.

llvm-svn: 66941

c0bb9595
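
Why the explicit zero extension matters can be shown with an 8-bit register model. This is a sketch under the assumption that only bit 0 of the register is defined for an i1 value; the function names are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Wrong if the upper bits are undefined: garbage in bits 1..7 makes a
// false i1 look true.
inline bool naiveIsTrue(uint8_t Reg) { return Reg != 0; }

// Explicit zero extension of an i1 held in an 8-bit register:
// keep bit 0, clear everything else.
inline uint8_t zeroExtendI1(uint8_t Reg) { return Reg & 1u; }

// Correct test: zero-extend first, then compare.
inline bool isTrue(uint8_t Reg) { return zeroExtendI1(Reg) != 0; }
```

For a register holding `0xFE` (bit 0 clear, upper bits garbage), `naiveIsTrue` reports true while `isTrue` reports false.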

generalize this code so that fast isel handles integer truncates to i1, which · 3fb71c8f

Chris Lattner authored Mar 13, 2009

codegen to the same thing as integer truncates to i8 (the top bits are 
just undefined).  This implements rdar://6667338

llvm-svn: 66902

3fb71c8f

Fix some significant problems with constant pools that resulted in unnecessary... · 1fb8aedd

Evan Cheng authored Mar 13, 2009

Fix some significant problems with constant pools that resulted in unnecessary paddings between constant pool entries, larger than necessary alignments (e.g. 8 byte alignment for .literal4 sections), and potentially other issues.

1. The ConstantPoolSDNode alignment field is the log2 value of the alignment requirement. This is not consistent with other SDNode variants.
2. The MachineConstantPool alignment field is also a log2 value.
3. However, some places create ConstantPoolSDNodes with the alignment value itself rather than its log2. This creates entries with artificially large alignments, e.g. 256 for SSE vector values.
4. Constant pool entry offsets are computed when the entries are created. However, the asm printer groups entries by section, so those offsets are no longer valid, yet the asm printer still uses them to determine the size of the padding between entries.
5. The asm printer uses an expensive data structure, a multimap, to track constant pool entries by section.
6. The asm printer iterates over a SmallPtrSet when it is emitting constant pool entries. This is non-deterministic.

Solutions:
1. The ConstantPoolSDNode alignment field is changed to keep the non-log2 value.
2. The MachineConstantPool alignment field is also changed to keep the non-log2 value.
3. Functions that create ConstantPool nodes now pass in non-log2 alignments.
4. MachineConstantPoolEntry no longer keeps an offset field. It is replaced with an alignment field. Offsets are not computed when constant pool entries are created. They are computed on the fly in the asm printer and the JIT.
5. The asm printer uses a cheaper data structure to group constant pool entries.
6. The asm printer computes entry offsets after grouping is done.
7. The JIT code is changed to compute entry offsets on the fly.

llvm-svn: 66875

1fb8aedd
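
Computing offsets on the fly from per-entry alignments, as the solutions above describe, might look roughly like this. It is a sketch, not the asm printer or JIT code: `PoolEntry` and `computeOffsets` are hypothetical names, and alignments are assumed to be byte counts that are powers of two (not log2 values).

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// One constant pool entry: its size and its alignment, both in bytes.
// Align is the plain byte alignment (a power of two), not a log2 value.
struct PoolEntry {
  uint64_t Size;
  uint64_t Align;
};

// Walk the entries in their final (already grouped) order and assign
// each one the next offset that satisfies its alignment.
inline std::vector<uint64_t>
computeOffsets(const std::vector<PoolEntry> &Entries) {
  std::vector<uint64_t> Offsets;
  uint64_t Offset = 0;
  for (const PoolEntry &E : Entries) {
    Offset = (Offset + E.Align - 1) & ~(E.Align - 1); // round up
    Offsets.push_back(Offset);
    Offset += E.Size;
  }
  return Offsets;
}
```

Because the offsets depend on the order of the entries, computing them only after grouping avoids the stale-offset problem from item 4 of the problem list.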

Mar 08, 2009
- do not export all the X86FastISel symbols, ever. · d5ac9d87
  Chris Lattner authored Mar 08, 2009
```
llvm-svn: 66382
```
  d5ac9d87
Feb 23, 2009
- Fast-isel can't do TLS yet, so it should fall back to SDISel · 318d7376
  Dan Gohman authored Feb 23, 2009
```
if it sees TLS addresses.

llvm-svn: 65341
```
  318d7376
Feb 13, 2009
- Remove non-DebugLoc versions of BuildMI from X86. · 9bba902c
  Dale Johannesen authored Feb 13, 2009
```
There were some that might even matter in X86FastISel.

llvm-svn: 64437
```
  9bba902c
Jan 22, 2009

Eliminate a couple of fields from TargetRegisterClass: SubRegClasses and... · 4a0bf66e

Evan Cheng authored Jan 22, 2009

Eliminate a couple of fields from TargetRegisterClass: SubRegClasses and SuperRegClasses. These are not necessary. Also eliminate getSubRegisterRegClass and getSuperRegisterRegClass. These are slow and their results can change if register file names change. Just use TargetLowering::getRegClassFor() to get the right TargetRegisterClass instead.

llvm-svn: 62762

4a0bf66e

Jan 20, 2009
- Change TargetInstrInfo::isMoveInstr to return source and destination sub-register indices as well. · c544cb0e
  Evan Cheng authored Jan 20, 2009
```
llvm-svn: 62600
```
  c544cb0e
Jan 13, 2009

Use DebugInfo interface to lower dbg_* intrinsics. · 5c6e1e3b

Devang Patel authored Jan 13, 2009

llvm-svn: 62127

5c6e1e3b

Jan 12, 2009
- Rename getABITypeSize to getTypePaddedSize, as · dc020f9c
  Duncan Sands authored Jan 12, 2009
```
suggested by Chris.

llvm-svn: 62099
```
  dc020f9c
Jan 07, 2009
- X86_COND_C and X86_COND_NC are alternate mnemonics for · 33e6fcd5
  Dan Gohman authored Jan 07, 2009
```
X86_COND_B and X86_COND_AE, respectively.

llvm-svn: 61835
```
  33e6fcd5
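
The equivalence behind these aliases is that, after an unsigned compare or subtract, the carry flag is set exactly when the first operand is below the second. The sketch below models the borrow with 64-bit arithmetic; `carryAfterSub` and `below` are hypothetical helpers, not backend code.

```cpp
#include <cassert>
#include <cstdint>

// Models the carry (borrow) flag after computing A - B on 32-bit
// unsigned operands: a borrow shows up in the bits above bit 31.
inline bool carryAfterSub(uint32_t A, uint32_t B) {
  uint64_t Wide = static_cast<uint64_t>(A) - static_cast<uint64_t>(B);
  return (Wide >> 32) != 0;
}

// The "below" condition: unsigned less-than.
inline bool below(uint32_t A, uint32_t B) { return A < B; }
```

For any pair of inputs, `carryAfterSub(A, B) == below(A, B)`, and the negations correspond in the same way to "no carry" and "above or equal".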
Dec 23, 2008
- Silence unused variable warnings. · 3d188347
  Devang Patel authored Dec 23, 2008
```
llvm-svn: 61392
```
  3d188347
Dec 20, 2008

Fix fast-isel to not emit invalid assembly when presented with a · ab316350

Dan Gohman authored Dec 20, 2008

constant shift count that doesn't fit in the shift instruction's
immediate field. This fixes PR3242.

llvm-svn: 61281

ab316350

Dec 19, 2008
- Fix some release-assert warnings · 9c148c8f
  Chris Lattner authored Dec 19, 2008
```
llvm-svn: 61244
```
  9c148c8f
Dec 10, 2008

Only perform SETO/SETC to JO/JC conversion if extractvalue is coming from an... · 517d05fd

Bill Wendling authored Dec 10, 2008

Only perform SETO/SETC to JO/JC conversion if extractvalue is coming from an arithmetic with overflow instruction.

llvm-svn: 60844

517d05fd

Implement fast-isel conversion of a branch instruction that's branching on an · 8008cb9a

Bill Wendling authored Dec 09, 2008

overflow/carry from the "arithmetic with overflow" intrinsics. It searches the
machine basic block from bottom to top to find the SETO/SETC instruction that is
its conditional. If an instruction modifies EFLAGS before it reaches the
SETO/SETC instruction, then it defaults to the normal instruction emission.

llvm-svn: 60807

8008cb9a
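
The bottom-to-top search described in this commit can be sketched over a toy instruction list. The `Opcode`, `Inst`, and `findFlagSetter` names are hypothetical and far simpler than real machine instructions; the sketch only shows the scan-and-bail-out logic.

```cpp
#include <cassert>
#include <vector>

// Toy opcodes; not the real X86 opcode enum.
enum Opcode { ADD, SETO, SETC, MOV, CMP };

// A toy instruction: its opcode and whether it writes EFLAGS.
struct Inst {
  Opcode Op;
  bool ModifiesEFLAGS;
};

// Scan the block from the last instruction upward. Return the index of
// the first SETO/SETC found, or -1 if an EFLAGS-modifying instruction
// is encountered first (or no SETO/SETC exists), in which case the
// caller would fall back to normal instruction emission.
inline int findFlagSetter(const std::vector<Inst> &Block) {
  for (int I = static_cast<int>(Block.size()) - 1; I >= 0; --I) {
    if (Block[I].Op == SETO || Block[I].Op == SETC)
      return I;
    if (Block[I].ModifiesEFLAGS)
      return -1;
  }
  return -1;
}
```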

Dec 09, 2008

Correct my English. · e25d3417
Bill Wendling authored Dec 09, 2008
```
llvm-svn: 60753
```
e25d3417

Add initial support for fast-isel of the [SU]ADDO intrinsics. It isn't · 80b34b3f

Bill Wendling authored Dec 09, 2008

complete. For instance, it lowers the common case into this less-than-optimal
code:

        addl    %ecx, %eax
        seto    %cl
        testb   %cl, %cl
        jne     LBB1_2  ## overflow

instead of:

        addl    %ecx, %eax
        jo      LBB1_2  ## overflow

That will come in a future commit.

llvm-svn: 60737

80b34b3f
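
For reference, the value-plus-overflow-bit result that the [SU]ADDO intrinsics produce can be modeled in portable C++ as follows. This is a sketch of the semantics only, with hypothetical names (`AddResult32`, `uaddo`, `saddo`); it says nothing about how fast-isel emits the code.

```cpp
#include <cassert>
#include <cstdint>

// Result pair: the wrapped 32-bit sum (as a bit pattern) and a flag
// saying whether the mathematically exact sum did not fit.
struct AddResult32 {
  uint32_t Value;
  bool Overflow;
};

// Unsigned add with overflow: the sum wrapped if it is smaller than
// an operand.
inline AddResult32 uaddo(uint32_t A, uint32_t B) {
  uint32_t Sum = A + B;
  return {Sum, Sum < A};
}

// Signed add with overflow: compute exactly in 64 bits and check the
// 32-bit signed range.
inline AddResult32 saddo(int32_t A, int32_t B) {
  int64_t Wide = static_cast<int64_t>(A) + static_cast<int64_t>(B);
  bool Ov = Wide < INT32_MIN || Wide > INT32_MAX;
  return {static_cast<uint32_t>(Wide), Ov};
}
```

The overflow bit is what `seto`/`setc` materialize in the longer sequence above, and what `jo` branches on directly in the shorter one.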

Fix a couple of mistaken switch case fall-throughs. Thanks to Bill · bc55c2a1
Dan Gohman authored Dec 08, 2008
```
for spotting these!

llvm-svn: 60728
```
bc55c2a1

Dec 08, 2008

Factor out the code for sign-extending/truncating gep indices · 4c31524b

Dan Gohman authored Dec 08, 2008

and use it in x86 address mode folding. Also, make
getRegForValue return 0 for illegal types even if it has a
ValueMap for them, because Argument values are put in the
ValueMap. This fixes PR3181.

llvm-svn: 60696

4c31524b

Oct 21, 2008
- Implement the optimized FCMP_OEQ/FCMP_UNE code for x86 fast-isel. · 4ddf7a4c
  Dan Gohman authored Oct 21, 2008
```
llvm-svn: 57915
```
  4ddf7a4c
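
The two predicates named in this commit differ from a plain comparison only in how they treat NaN operands: ordered-equal is false if either operand is NaN, and unordered-or-not-equal is true if either operand is NaN. A minimal model of those semantics, with hypothetical function names:

```cpp
#include <cassert>
#include <cmath>

// FCMP_OEQ: both operands are ordered (not NaN) and equal.
inline bool fcmpOEQ(double A, double B) {
  return !std::isnan(A) && !std::isnan(B) && A == B;
}

// FCMP_UNE: either operand is NaN, or the operands are not equal.
inline bool fcmpUNE(double A, double B) {
  return std::isnan(A) || std::isnan(B) || A != B;
}
```

Each predicate is the exact negation of the other, which is why they are natural to handle as a pair.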