Commits · 835ef201acd6b4eeef28b49605456296667181bd · Roger Ferrer / llvm-epi-0.8

Sep 21, 2012
- Mips16FrameLowering.cpp: Remove unused TII introduced in r164349. [-Wunused-variable] · f51004bc
  NAKAMURA Takumi authored Sep 21, 2012
```
llvm-svn: 164354
```
  f51004bc
- Properly save and restore RA and Mips16 callee save registers S0,S1 · cd04e2b8
  Akira Hatanaka authored Sep 21, 2012
```
Patch by Reed Kotler.

llvm-svn: 164349
```
  cd04e2b8
- [fast-isel] Fallback to SelectionDAG isel if we require strict alignment for · 2364f583
  Chad Rosier authored Sep 21, 2012
```
non-halfword-aligned i16 loads/stores.
rdar://12304911

llvm-svn: 164345
```
  2364f583
- Tidy up. Whitespace. · e2baa97d
  Jim Grosbach authored Sep 21, 2012
```
llvm-svn: 164344
```
  e2baa97d
- Tidy up. Formatting. · 9659ed98
  Jim Grosbach authored Sep 21, 2012
```
llvm-svn: 164343
```
  9659ed98
- ARM: Use a dedicated intrinsic for vector bitwise select. · 74b61c39
  Jim Grosbach authored Sep 21, 2012
```
The expression based expansion too often results in IR level optimizations
splitting the intermediate values into separate basic blocks, preventing
the formation of the VBSL instruction as the code author intended. In
particular, LICM would often hoist part of the computation out of a loop.

rdar://11011471

llvm-svn: 164340
```
  74b61c39
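For readers unfamiliar with the operation, a bitwise select can be modelled on scalars as below. This is only a sketch of the expression form that the commit says was being split across basic blocks; the function name `bitSelect` is hypothetical and is not taken from the LLVM sources.

```cpp
#include <cstdint>

// Scalar model of a bitwise select: each result bit comes from `a` where the
// corresponding `mask` bit is 1, and from `b` where it is 0.
inline std::uint32_t bitSelect(std::uint32_t mask, std::uint32_t a, std::uint32_t b) {
    return (mask & a) | (~mask & b);
}
```

Written this way, the two halves `mask & a` and `~mask & b` are independent values, which is what gives an optimizer such as LICM the freedom to hoist one of them out of a loop.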
Sep 20, 2012

- Revert r164308 to fix buildbots. · c727bacb
  Bill Wendling authored Sep 20, 2012
```
llvm-svn: 164309
```
  c727bacb
- Make the 'get*AlignmentFromAttr' functions into member functions within the Attributes class. · abac6615
  Bill Wendling authored Sep 20, 2012
```
llvm-svn: 164308
```
  abac6615

Change enum type in a static table to uint8_t instead. Saves about 700 hundred... · 980739af

Craig Topper authored Sep 20, 2012

Change enum type in a static table to uint8_t instead. Saves about 700 hundred bytes of static data. Change unsigned char in same table to uint8_t for explicitness.

llvm-svn: 164285

980739af
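A minimal sketch of the general technique, assuming a made-up table: storing enum values in a `uint8_t` field instead of a field of the enum type shrinks each table entry. The names `OperandKind`, `WideEntry`, `NarrowEntry`, and `kTable` are hypothetical and do not come from the commit.

```cpp
#include <cstdint>

// Hypothetical operand kind. With no fixed underlying type, compilers
// commonly give this enum the size of int.
enum OperandKind { OK_None, OK_Reg, OK_Imm, OK_Mem };

// Table entry that stores the enum directly.
struct WideEntry {
    OperandKind kind;
    unsigned char encoding;
};

// Table entry that stores the same values in single bytes.
struct NarrowEntry {
    std::uint8_t kind;      // holds an OperandKind value
    std::uint8_t encoding;
};

static const NarrowEntry kTable[] = {
    {OK_Reg, 0x01},
    {OK_Imm, 0x02},
    {OK_Mem, 0x03},
};

// Convert back to the enum type at the point of use.
inline OperandKind kindAt(unsigned i) {
    return static_cast<OperandKind>(kTable[i].kind);
}
```

The saving per entry is `sizeof(WideEntry) - sizeof(NarrowEntry)`, multiplied by the number of rows in the table.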

Re-work X86 code generation of atomic ops with spin-loop · 3237662b

Michael Liao authored Sep 20, 2012

- Rewrite/merge pseudo-atomic instruction emitters to address the
  following issue:
  * Reduce one unnecessary load in spin-loop

    Previously the spin-loop looked like

        thisMBB:
        newMBB:
          ld  t1 = [bitinstr.addr]
          op  t2 = t1, [bitinstr.val]
          not t3 = t2  (if Invert)
          mov EAX = t1
          lcs dest = [bitinstr.addr], t3  [EAX is implicit]
          bz  newMBB
          fallthrough -->nextMBB

    the 'ld' at the beginning of newMBB should be lifted out of the loop
    as lcs (or CMPXCHG on x86) will load the current memory value into
    EAX. This loop is refined as:

        thisMBB:
          EAX = LOAD [MI.addr]
        mainMBB:
          t1 = OP [MI.val], EAX
          LCMPXCHG [MI.addr], t1, [EAX is implicitly used & defined]
          JNE mainMBB
        sinkMBB:

  * Remove immopc because, so far, all pseudo-atomic instructions have an
    all-register form only; there is no immediate operand.

  * Remove unnecessary attributes/modifiers in pseudo-atomic instruction
    td

  * Fix issues in PR13458

- Add comprehensive tests on atomic ops on various data types.
  NOTE: Some of them are turned off due to missing functionality.

- Revise tests due to the new spin-loop generated.

llvm-svn: 164281

3237662b
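The refined loop shape has a direct source-level analogue in C++. The sketch below is not the code LLVM emits. It uses `std::atomic` and a hypothetical helper `atomicFetchNand` to show why the load can sit outside the loop: a failed compare-exchange already hands back the current memory value.

```cpp
#include <atomic>
#include <cstdint>

// Atomically replaces the stored value with ~(old & val) and returns the old
// value. One load before the loop, then retry on compare-exchange failure
// without reloading.
inline std::uint32_t atomicFetchNand(std::atomic<std::uint32_t>& addr,
                                     std::uint32_t val) {
    std::uint32_t expected = addr.load();   // like thisMBB: single initial load
    std::uint32_t desired;
    do {
        desired = ~(expected & val);        // like mainMBB: compute new value
        // On failure, compare_exchange_weak writes the current memory value
        // into `expected`, so no explicit reload is needed.
    } while (!addr.compare_exchange_weak(expected, desired));
    return expected;                        // like sinkMBB: loop exit
}
```

On x86 the `expected` variable corresponds to EAX in the commit's pseudo-code, which CMPXCHG both reads and updates.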

Sep 19, 2012

Unify the logic in SelectAtomicLoadAdd and SelectAtomicLoadArith · 83725395

Michael Liao authored Sep 19, 2012

- Merge the processing of LOAD_ADD with other atomic load-arith
  operations
- Separate the logic getting target constant for atomic-load-op and add
  an optimization for atomic-load-add on i16 with negative value
- Optimize a minor case for atomic-fetch-add i16 with negative operand. Test
  case is revised.

llvm-svn: 164243

83725395

Small structs for PPC64 SVR4 must be passed right-justified in registers. · 019cc6fe

Bill Schmidt authored Sep 19, 2012

lib/Target/PowerPC/PPCISelLowering.{h,cpp}
 Rename LowerFormalArguments_Darwin to LowerFormalArguments_Darwin_Or_64SVR4.
 Rename LowerFormalArguments_SVR4 to LowerFormalArguments_32SVR4.
 Receive small structs right-justified in LowerFormalArguments_Darwin_Or_64SVR4.
 Rename LowerCall_Darwin to LowerCall_Darwin_Or_64SVR4.
 Rename LowerCall_SVR4 to LowerCall_32SVR4.
 Pass small structs right-justified in LowerCall_Darwin_Or_64SVR4.

test/CodeGen/PowerPC/structsinregs.ll
 New test.

llvm-svn: 164228

019cc6fe
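To illustrate what "right-justified" means here, the sketch below builds a 64-bit register image from a small struct's bytes in big-endian memory order. It is an assumption-laden model, not code from PPCISelLowering: the helper name `rightJustify` is hypothetical, and the unused high-order bytes are simply zeroed.

```cpp
#include <cstddef>
#include <cstdint>

// Models a 64-bit register holding a struct of `size` bytes (1..8), given in
// memory order on a big-endian target. The struct occupies the low-order
// `size` bytes of the register; the high-order bytes are zero in this model.
inline std::uint64_t rightJustify(const std::uint8_t* bytes, std::size_t size) {
    std::uint64_t reg = 0;
    for (std::size_t i = 0; i < size && i < 8; ++i)
        reg = (reg << 8) | bytes[i];
    return reg;
}
```

For a 3-byte struct `{0x12, 0x34, 0x56}` this yields `0x0000000000123456`, whereas a left-justified layout would place the same bytes at the top of the register.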

Remove code for setting the VEX L-bit as a function of operand size from the... · 3f23c1a8

Craig Topper authored Sep 19, 2012

Remove code for setting the VEX L-bit as a function of operand size from the code emitters and the disassembler table builder. Fix a couple instructions that were still missing VEX_L.

llvm-svn: 164204

3f23c1a8

Add explicit VEX_L tags to all 256-bit instructions. This will allow us to... · a73be890

Craig Topper authored Sep 19, 2012

Add explicit VEX_L tags to all 256-bit instructions. This will allow us to remove code from the code emitters that examined operands to set the L-bit.

llvm-svn: 164202

a73be890
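The two VEX_L commits above move the L-bit decision from "inspect the operands" to "read a flag on the instruction". A rough sketch of the resulting emitter logic for the second byte of a two-byte (C5) VEX prefix follows; `InstrDesc`, `hasVexL`, and `vex2SecondByte` are hypothetical names, not LLVM's.

```cpp
#include <cstdint>

// Hypothetical per-instruction description; `hasVexL` plays the role of an
// explicit VEX_L tag.
struct InstrDesc {
    bool hasVexL;        // true for 256-bit forms
    std::uint8_t pp;     // implied prefix: 0 = none, 1 = 66, 2 = F3, 3 = F2
};

// Builds the second byte of a two-byte (C5) VEX prefix.
// Layout: bit 7 = inverted R, bits 6..3 = inverted vvvv, bit 2 = L,
// bits 1..0 = pp.
inline std::uint8_t vex2SecondByte(const InstrDesc& desc, bool rexR,
                                   std::uint8_t vvvv) {
    std::uint8_t byte = 0;
    byte |= static_cast<std::uint8_t>((rexR ? 0u : 1u) << 7);
    byte |= static_cast<std::uint8_t>((~vvvv & 0xFu) << 3);
    byte |= static_cast<std::uint8_t>((desc.hasVexL ? 1u : 0u) << 2);
    byte |= static_cast<std::uint8_t>(desc.pp & 0x3u);
    return byte;
}
```

With the flag in the description, the emitter never has to look at register classes or memory operand widths to choose between the 128-bit and 256-bit encodings.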

Sep 18, 2012
- MOVi16 (movw) is only legal on cpus with V6T2 support. rdar://12300648 · 1de7ec8c
  Evan Cheng authored Sep 18, 2012
```
llvm-svn: 164169
```
  1de7ec8c
- Fix the isLocalCall() by checking for linker weakness as well. · 09adf3de
  Roman Divacky authored Sep 18, 2012
```
llvm-svn: 164155
```
  09adf3de
- Revert r164051. · 40cf08dd
  Akira Hatanaka authored Sep 18, 2012
```
llvm-svn: 164150
```
  40cf08dd
- Avoid symbol name clash when filling TOC. · 0be33598
  Roman Divacky authored Sep 18, 2012
```
Patch by Adhemerval Zanella.

llvm-svn: 164141
```
  0be33598
- On PPC64 emit the environment pointer. Patch by Adhemerval Zanella. · d4f6f421
  Roman Divacky authored Sep 18, 2012
```
llvm-svn: 164139
```
  d4f6f421
- Optimize local func calls to not emit nop for TOC restoration. · 76293063
  Roman Divacky authored Sep 18, 2012
```
Patch by Adhemerval Zanella.

llvm-svn: 164138
```
  76293063
- When creating MCAsmBackend pass the CPU string as well. In X86AsmBackend · 5dd4ccb4
  Roman Divacky authored Sep 18, 2012
```
store this and use it to not emit long nops when the CPU is geode which
doesn't support them.

Fixes PR11212.

llvm-svn: 164132
```
  5dd4ccb4
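A simplified sketch of the idea, under stated assumptions: the backend keeps the CPU name it was constructed with and consults it when padding. `ExampleAsmBackend` and `nopPadding` are hypothetical, and only one long-nop form (the 3-byte `0F 1F 00`) is used here for brevity; the real X86AsmBackend may choose sequences differently.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <utility>
#include <vector>

// Hypothetical backend that remembers the CPU name it was created with.
class ExampleAsmBackend {
public:
    explicit ExampleAsmBackend(std::string cpu) : cpu_(std::move(cpu)) {}

    // Returns `count` bytes of nop padding.
    std::vector<std::uint8_t> nopPadding(std::size_t count) const {
        std::vector<std::uint8_t> out;
        if (cpu_ == "geode") {
            // No long-nop support: fall back to single-byte 0x90 nops.
            out.assign(count, 0x90);
            return out;
        }
        // Otherwise use the 3-byte long nop (0F 1F 00) while it fits,
        // then finish with single-byte nops.
        while (count - out.size() >= 3) {
            out.insert(out.end(), {0x0F, 0x1F, 0x00});
        }
        out.resize(count, 0x90);
        return out;
    }

private:
    std::string cpu_;
};
```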
- More domain conversion; convert VFP VMOVS to NEON instructions in more cases -... · ea05256b
  James Molloy authored Sep 18, 2012
```
More domain conversion; convert VFP VMOVS to NEON instructions in more cases - when we may clobber the other S-lane by converting an S to a D instruction, make an effort to work out if the S lane is clobberable or not.

llvm-svn: 164114
```
  ea05256b
- TableGen subtarget emitter. Initialize MCSubtargetInfo with the new machine model. · ab722bdd
  Andrew Trick authored Sep 18, 2012
```
llvm-svn: 164092
```
  ab722bdd
- Use vld1 / vst1 for unaligned v2f64 load / store. e.g. Use vld1.16 for 2-byte · 90ae8f84
  Evan Cheng authored Sep 18, 2012
```
aligned address. Based on patch by David Peixotto.

Also use vld1.64 / vst1.64 with 128-bit alignment to take advantage of alignment
hints. rdar://12090772, rdar://12238782

llvm-svn: 164089
```
  90ae8f84
- Revert r164061-r164067. Most of the new subtarget emitter. · 8e7f202e
  Andrew Trick authored Sep 17, 2012
```
I have to work out the Target/CodeGen header dependencies
before putting this back.

llvm-svn: 164072
```
  8e7f202e
- TableGen subtarget emitter. Initialize MCSubtargetInfo with the new machine model. · 0923f818
  Andrew Trick authored Sep 17, 2012
```
llvm-svn: 164061
```
  0923f818
- Add some cases to x86 OptimizeCompare to handle DEC and INC, too. · 4ce1d7b4
  Jan Wen Voung authored Sep 17, 2012
```
While we are setting the earlier def to true, also make it live.

llvm-svn: 164056
```
  4ce1d7b4
Sep 17, 2012
- Make sure there is enough room for RA. getStackSize needs to be cleaned up but · 9068706b
  Akira Hatanaka authored Sep 17, 2012
```
we will do that when we implement the full save/restore.

Patch by Reed Kotler.

llvm-svn: 164051
```
  9068706b
- LLVM_ATTRIBUTE_USED forces emission of a function. To silence unused function... · 0d874f77
  Benjamin Kramer authored Sep 17, 2012
```
LLVM_ATTRIBUTE_USED forces emission of a function. To silence unused function warnings use LLVM_ATTRIBUTE_UNUSED.

llvm-svn: 164036
```
  0d874f77
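The distinction can be shown with the underlying compiler attributes. This sketch assumes a GCC- or Clang-compatible compiler; the `EXAMPLE_*` macros are hypothetical stand-ins, not the definitions in LLVM's headers.

```cpp
// Hypothetical stand-ins for the two macros, assuming a GCC- or
// Clang-compatible compiler; other compilers get empty definitions.
#if defined(__GNUC__)
#define EXAMPLE_ATTRIBUTE_USED   __attribute__((__used__))
#define EXAMPLE_ATTRIBUTE_UNUSED __attribute__((__unused__))
#else
#define EXAMPLE_ATTRIBUTE_USED
#define EXAMPLE_ATTRIBUTE_UNUSED
#endif

// `used`: the function is emitted even though nothing references it.
EXAMPLE_ATTRIBUTE_USED static int keptInObjectFile() { return 1; }

// `unused`: only suppresses the unused-function warning; the compiler may
// still drop the function if nothing calls it.
EXAMPLE_ATTRIBUTE_UNUSED static int debugHelper() { return 2; }
```

Using the `used` form merely to silence a warning therefore costs code size, which is the mistake the commit corrects.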
- Removed the VMLxForwarding feature for the Cortex-A15 target. · 7bd29146
  Silviu Baranga authored Sep 17, 2012
```
llvm-svn: 164030
```
  7bd29146
Sep 16, 2012
- Change unsigned to uint32_t to match base class declaration and other targets. · 462c31b3
  Craig Topper authored Sep 16, 2012
```
llvm-svn: 164001
```
  462c31b3
- The PMOVZXWD family of functions had patterns that extend narrow vector types to wide vector types. · 37521aa8
  Nadav Rotem authored Sep 16, 2012
```
It had patterns for zext-loading and extending. This commit adds patterns for loading a wide type, performing a bitcast,
and extending. This is an odd pattern, but it is commonly used when writing code with intrinsics.

rdar://11897677

llvm-svn: 163995
```
  37521aa8
Sep 15, 2012
- Use LLVM_DELETED_FUNCTION in place of 'DO NOT IMPLEMENT' comments. · a60c0f11
  Craig Topper authored Sep 15, 2012
```
llvm-svn: 163974
```
  a60c0f11
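As a sketch of what this kind of change buys, assuming a C++11 compiler: a deleted function turns misuse into a compile-time error instead of relying on a comment and a link failure. `EXAMPLE_DELETED_FUNCTION` and `Registry` are hypothetical; the real macro's definition is not shown in this log.

```cpp
#include <type_traits>

// Hypothetical stand-in for the macro, assuming a C++11 compiler.
#define EXAMPLE_DELETED_FUNCTION = delete

class Registry {
public:
    Registry() {}

private:
    // Before: declared private and left undefined, marked only by a comment.
    //   Registry(const Registry&);            // DO NOT IMPLEMENT
    // After: the compiler rejects any use at compile time.
    Registry(const Registry&) EXAMPLE_DELETED_FUNCTION;
    void operator=(const Registry&) EXAMPLE_DELETED_FUNCTION;
};

static_assert(!std::is_copy_constructible<Registry>::value,
              "copying Registry is disabled");
```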
- Remove unused private fields to silence -Wunused-private-field. · 2ed23ce7
  Craig Topper authored Sep 15, 2012
```
llvm-svn: 163973
```
  2ed23ce7
- X86: Emitting x87 fsin/fcos for sinf/cosf is not safe without unsafe fp math. · ece43425
  Benjamin Kramer authored Sep 15, 2012
```
This was only an issue if sse is disabled.

llvm-svn: 163967
```
  ece43425
- Remove aligned/unaligned load/store fragments defined in MipsInstrInfo.td and · 3e7ba761
  Akira Hatanaka authored Sep 15, 2012
```
use load/store fragments defined in TargetSelectionDAG.td in place of them.
Unaligned loads/stores are either expanded or lowered to target-specific nodes,
so instruction selection should see only aligned load/store nodes.

No changes in functionality.

llvm-svn: 163960
```
  3e7ba761
- Handled unaligned load/stores properly in Mips16 · 189d0add
  Akira Hatanaka authored Sep 15, 2012
```
Patch by Reed Kotler.

llvm-svn: 163956
```
  189d0add
Sep 14, 2012

- Implement getNumLDMAddresses and expose through ARMBaseInstrInfo. · 2ac6f7d6
  Andrew Trick authored Sep 14, 2012
```
llvm-svn: 163922
```
  2ac6f7d6

Cortex-A9 instruction-level scheduling machine model. · 985dc0dd

Andrew Trick authored Sep 14, 2012

This models the A9 processor at the level of instruction operands, as
opposed to the itinerary, which models each operation at the level of
pipeline stages.

The two primary motivations are:

1) Allow MachineScheduler to model A9 as an out-of-order processor. It
can now distinguish between hazards that force interlocking vs.
buffered resources.

2) Reduce long-term maintenance by allowing the itinerary and target
hooks to eventually be removed. Note that almost all of the complexity
in the new model exists to model instruction variants, which the
itinerary cannot handle. Instead the scheduler previously relied on
processor-specific target hooks which are incomplete and buggy.

llvm-svn: 163921

985dc0dd

DAG post-process for Hexagon MI scheduler · 2db64a70

Sergei Larin authored Sep 14, 2012

This patch introduces a possibility for Hexagon MI scheduler
to perform some target-specific post-processing on the scheduling
DAG prior to scheduling.

llvm-svn: 163903

2db64a70