Commits · 9d763cc3f8bc22f6dc37c00198ee5ec3f9500666 · Roger Ferrer / llvm-epi-0.8

Oct 18, 2009

-Revert parts of 84326 and 84411. Distinquishing between fixed and non-fixed · 0e9d9ca8

Evan Cheng authored Oct 18, 2009

stack slots and giving them different PseudoSourceValue's did not fix the
problem of post-alloc scheduling miscompiling llvm itself.
- Apply Dan's conservative workaround by assuming any non fixed stack slots can
alias other memory locations. This means a load from spill slot #1 cannot 
move above a store of spill slot #2. 
- Enable post-alloc scheduling for x86 at optimization leverl Default and above.

llvm-svn: 84424

0e9d9ca8

Oct 17, 2009
- Distinquish stack slots from other stack objects. They (and fixed objects) get... · 4729191b
  Evan Cheng authored Oct 17, 2009
```
Distinquish stack slots from other stack objects. They (and fixed objects) get FixedStack PseudoSourceValues.

llvm-svn: 84326
```
  4729191b
- Revert 84315 for now. Re-thinking the patch. · 8759585a
  Evan Cheng authored Oct 17, 2009
```
llvm-svn: 84321
```
  8759585a
- Rename getFixedStack to getStackObject. The stack objects represented are not · 0818d87e
  Evan Cheng authored Oct 17, 2009
```
necessarily fixed. Only those will negative frame indices are "fixed."

llvm-svn: 84315
```
  0818d87e
Oct 07, 2009
- Add PseudoSourceValues for constpool stuff on ELF (Darwin should use something similar) · 75b59fb0
  Anton Korobeynikov authored Oct 07, 2009
```
and register spills.

llvm-svn: 83435
```
  75b59fb0
Sep 28, 2009

Introduce the TargetInstrInfo::KILL machine instruction and get rid of the · dc9efe80

Jakob Stoklund Olesen authored Sep 28, 2009

unused DECLARE instruction.

KILL is not yet used anywhere, it will replace TargetInstrInfo::IMPLICIT_DEF
in the places where IMPLICIT_DEF is just used to alter liveness of physical
registers.

llvm-svn: 83006

dc9efe80

Make ARM and Thumb2 32-bit immediate materialization into a single 32-bit pseudo · 83e0d481

Evan Cheng authored Sep 28, 2009

instruction. This makes it re-materializable.

Thumb2 will split it back out into two instructions so IT pass will generate the
right mask. Also, this expose opportunies to optimize the movw to a 16-bit move.

llvm-svn: 82982

83e0d481

Sep 13, 2009
- Add QPR_VFP2 regclass and add copy_to_regclass nodes, where needed to · 8d0fbebb
  Anton Korobeynikov authored Sep 12, 2009
```
constraint the register usage.

llvm-svn: 81635
```
  8d0fbebb
Sep 08, 2009
- Add NEON 'laned' operations. This fixes another bunch of gcc testsuite fails and · 59e2b8e8
  Anton Korobeynikov authored Sep 08, 2009
```
makes the code faster.

llvm-svn: 81220
```
  59e2b8e8
Aug 27, 2009
- Fix PR4789. Teach eliminateFrameIndex how to handle VLDRQ and VSTRQ which... · 7a37b1a2
  Evan Cheng authored Aug 27, 2009
```
Fix PR4789. Teach eliminateFrameIndex how to handle VLDRQ and VSTRQ which cannot fold any immediate offset.

llvm-svn: 80191
```
  7a37b1a2
Aug 22, 2009
- rename TAI -> MAI, being careful not to make MAILJMP instructions :) · e9a75a66
  Chris Lattner authored Aug 22, 2009
```
llvm-svn: 79777
```
  e9a75a66
- Rename TargetAsmInfo (and its subclasses) to MCAsmInfo. · 7b26fce2
  Chris Lattner authored Aug 22, 2009
```
llvm-svn: 79763
```
  7b26fce2
- Record variable debug info at ISel time directly. · 09395957
  Devang Patel authored Aug 22, 2009
```
llvm-svn: 79742
```
  09395957
Aug 11, 2009
- Add Thumb2 eh_sjlj_setjmp implementation · 841850ed
  Jim Grosbach authored Aug 11, 2009
```
llvm-svn: 78701
```
  841850ed
- fix GetInstSizeInBytes for eh_sjlj_setjmp · 1d5350c0
  Jim Grosbach authored Aug 11, 2009
```
llvm-svn: 78683
```
  1d5350c0
- Whitespace cleanup. Remove trailing whitespace. · f24f9d9c
  Jim Grosbach authored Aug 11, 2009
```
llvm-svn: 78666
```
  f24f9d9c
Aug 10, 2009
- Add support for folding loads / stores into 16-bit moves used by Thumb2. · 092b701a
  Evan Cheng authored Aug 10, 2009
```
llvm-svn: 78558
```
  092b701a
- 80 col violation. · 55c014a9
  Evan Cheng authored Aug 10, 2009
```
llvm-svn: 78557
```
  55c014a9
Aug 08, 2009
- Use VLDM / VSTM to spill/reload 128-bit Neon registers · 887d05ce
  Anton Korobeynikov authored Aug 08, 2009
```
llvm-svn: 78468
```
  887d05ce
- Code refactoring. No functionality change. · 2aa91cc2
  Evan Cheng authored Aug 08, 2009
```
llvm-svn: 78455
```
  2aa91cc2
Aug 07, 2009

Fix support to use NEON for single precision fp math. · 4c3b1ca5
Evan Cheng authored Aug 07, 2009
```
llvm-svn: 78397
```
4c3b1ca5

It turns out most of the thumb2 instructions are not allowed to touch SP. The... · b972e563

Evan Cheng authored Aug 07, 2009

It turns out most of the thumb2 instructions are not allowed to touch SP. The semantics of such instructions are unpredictable. We have just been lucky that tests have been passing.

This patch takes pain to ensure all the PEI lowering code does the right thing when lowering frame indices, insert code to manipulate stack pointers, etc. It's also custom lowering dynamic stack alloc into pseudo instructions so we can insert the right instructions at scheduling time.

This fixes PR4659 and PR4682.

llvm-svn: 78361

b972e563

Aug 05, 2009

When using NEON for single-precision FP, the NEON result must be placed in... · e5b5d8fb

David Goodwin authored Aug 05, 2009

When using NEON for single-precision FP, the NEON result must be placed in D0-D15 as these are the only D registers with S subregs. Introduce a new regclass to represent D0-D15 and use it in the NEON single-precision FP patterns.

llvm-svn: 78244

e5b5d8fb

Aug 02, 2009

Move the getInlineAsmLength virtual method from TAI to TII, where · e98a3c3c

Chris Lattner authored Aug 02, 2009

the only real caller (GetFunctionSizeInBytes) uses it.

The custom ARM implementation of this is basically reimplementing
an assembler poorly for negligible gain.  It should be removed 
IMNSHO, but I'll leave that to ARMish folks to decide.

llvm-svn: 77877

e98a3c3c

Aug 01, 2009
- Workaround a couple of Darwin assembler bugs. · e64f48ba
  Evan Cheng authored Aug 01, 2009
```
llvm-svn: 77781
```
  e64f48ba
- t2BR_JT is mov pc, it's 2 byte long, not 4. · 95d63258
  Evan Cheng authored Jul 31, 2009
```
llvm-svn: 77744
```
  95d63258
Jul 31, 2009
- - Teach TBB / TBH offset limits are 510 and 131070 respectively since the offset · f6d0fa3d
  Evan Cheng authored Jul 31, 2009
```
  is scaled by two.
- Teach GetInstSizeInBytes about TBB and TBH.

llvm-svn: 77701
```
  f6d0fa3d
Jul 28, 2009

- More refactoring. This gets rid of all of the getOpcode calls. · 780748d5

Evan Cheng authored Jul 28, 2009

- This change also makes it possible to switch between ARM / Thumb on a
  per-function basis.
- Fixed thumb2 routine which expand reg + arbitrary immediate. It was using
  using ARM so_imm logic.
- Use movw and movt to do reg + imm when profitable.
- Other code clean ups and minor optimizations.

llvm-svn: 77300

780748d5

Jul 27, 2009
- convertToThreeAddress can't handle Thumb2 instructions (which don't have same... · 0e075e24
  Evan Cheng authored Jul 27, 2009
```
convertToThreeAddress can't handle Thumb2 instructions (which don't have same address mode as ARM instructions).

llvm-svn: 77230
```
  0e075e24
- Clean up. · 8f2ed1bc
  Evan Cheng authored Jul 27, 2009
```
llvm-svn: 77221
```
  8f2ed1bc
- Get rid of some more getOpcode calls. · 056c669e
  Evan Cheng authored Jul 27, 2009
```
This also fixes potential problems in ARMBaseInstrInfo routines not recognizing thumb1 instructions when 32-bit and 16-bit instructions mix.

llvm-svn: 77218
```
  056c669e
- If CPSR is modified but the def is dead, then it's ok to fold the load / store. · 371ec9e8
  Evan Cheng authored Jul 27, 2009
```
llvm-svn: 77182
```
  371ec9e8
- Use t2LDRi12 and t2STRi12 to load / store to / from stack frames. Eliminate more getOpcode calls. · c47e1093
  Evan Cheng authored Jul 27, 2009
```
llvm-svn: 77181
```
  c47e1093
- Use the right instructions to copy between GPR and the more strictive tGPR... · 186332f8
  Evan Cheng authored Jul 27, 2009
```
Use the right instructions to copy between GPR and the more strictive tGPR classes. t2MOV does not match the RC requirements.

llvm-svn: 77175
```
  186332f8
- Merge isLoadFromStackSlot into one since it behaves the same regardless of sub-target. · 0e5b1499
  Evan Cheng authored Jul 27, 2009
```
llvm-svn: 77174
```
  0e5b1499
- Just use a single isMoveInstr to catch all the cases. · 26b51b15
  Evan Cheng authored Jul 27, 2009
```
llvm-svn: 77173
```
  26b51b15
Jul 25, 2009

Change Thumb2 jumptable codegen to one that uses two level jumps: · f3a1fce8

Evan Cheng authored Jul 25, 2009

Before:
      adr r12, #LJTI3_0_0
      ldr pc, [r12, +r0, lsl #2]
LJTI3_0_0:
      .long    LBB3_24
      .long    LBB3_30
      .long    LBB3_31
      .long    LBB3_32

After:
      adr r12, #LJTI3_0_0
      add pc, r12, +r0, lsl #2
LJTI3_0_0:
      b.w    LBB3_24
      b.w    LBB3_30
      b.w    LBB3_31
      b.w    LBB3_32

This has several advantages.
1. This will make it easier to optimize this to a TBB / TBH instruction +
   (smaller) table.
2. This eliminate the need for ugly asm printer hack to force the address
   into thumb addresses (bit 0 is one).
3. Same codegen for pic and non-pic.
4. This eliminate the need to align the table so constantpool island pass
   won't have to over-estimate the size.

Based on my calculation, the later is probably slightly faster as well since
ldr pc with shifter address is very slow. That is, it should be a win as long
as the HW implementation can do a reasonable job of branch predict the second
branch.

llvm-svn: 77024

f3a1fce8

Jul 24, 2009
- Make sure thumb2 jumptable entries are aligned. · 666c912c
  Evan Cheng authored Jul 24, 2009
```
llvm-svn: 76986
```
  666c912c
- Remove unused member functions. · 95fc6ee5
  Eli Friedman authored Jul 24, 2009
```
llvm-svn: 76960
```
  95fc6ee5
- FLDD, FLDS, FCPYD, FCPYS, FSTD, FSTS, VMOVD, VMOVQ maps to the same... · 6cfbe613
  Evan Cheng authored Jul 24, 2009
```
FLDD, FLDS, FCPYD, FCPYS, FSTD, FSTS, VMOVD, VMOVQ maps to the same instructions on all sub-targets.

llvm-svn: 76925
```
  6cfbe613