Commits · b7ff9b1599fa81a67297786301678fed660f11e2 · Roger Ferrer / llvm-epi-0.8

Apr 27, 2012
- Refactor IT handling not to store the bottom bit of the condition code in the... · f435b09e
  Richard Barton authored Apr 27, 2012
```
Refactor IT handling not to store the bottom bit of the condition code in the mask operand in the MCInst.

llvm-svn: 155700
```
  f435b09e
- Implement a bastardized ABI. · 1ec87ee0
  Evan Cheng authored Apr 27, 2012
```
llvm-svn: 155686
```
  1ec87ee0
- thumbv6 shouldn't imply +thumb2. Cortex-M0 doesn't support 32-bit Thumb2 · f52003de
  Evan Cheng authored Apr 27, 2012
```
instructions. However, it does support dmb, dsb, isb, mrs, and msr.

rdar://11331541

llvm-svn: 155685
```
  f52003de
Apr 26, 2012

ARM: Thumb ldr(literal) base address alignment is 32-bits. · 3d6c629e

Jim Grosbach authored Apr 26, 2012

The base address for the PC-relative load is Align(PC,4), so it's the
address of the word containing the 16-bit instruction, not the address
of the instruction itself. Ugh.

rdar://11314619

llvm-svn: 155659

3d6c629e
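
The alignment rule described in 3d6c629e can be illustrated with a small standalone sketch. This is not the LLVM code touched by the commit; `thumbLdrLiteralTarget` and its parameters are hypothetical names, and the sketch assumes the usual Thumb convention that PC reads as the instruction address plus 4.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not part of LLVM): address loaded by a 16-bit Thumb
// "ldr Rt, [pc, #Imm]".
//   InstrAddr: address of the ldr instruction itself.
//   Imm:       byte offset from the instruction (a multiple of 4).
inline uint32_t thumbLdrLiteralTarget(uint32_t InstrAddr, uint32_t Imm) {
  assert(Imm % 4 == 0 && "literal offset is expected to be word-sized");
  uint32_t PC = InstrAddr + 4;        // Thumb reads PC as instruction address + 4
  uint32_t Base = PC & ~UINT32_C(3);  // Align(PC, 4): round down to a word boundary
  return Base + Imm;
}
```

With this rule, an instruction at 0x1000 and one at 0x1002 that both encode an offset of 8 load from the same word, 0x100C, because both PC values round down to 0x1004.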

· 81290f4b

Preston Gurd authored Apr 26, 2012

Trivial change to set UseLeaForSP flag in addition to toggling
the FeatureLeaForSP feature bit when llvm auto detects Intel Atom.

Patch by Andy Zhang

llvm-svn: 155655

81290f4b

Use VLD1 in NEON extending-load patterns instead of VLDR. · 3de97b7a

Tim Northover authored Apr 26, 2012

On some cores it's a bad idea for performance to mix VFP and NEON instructions
and since these patterns are NEON anyway, the NEON load should be used.

llvm-svn: 155630

3de97b7a

Test commit. · 6699a60b
Tim Northover authored Apr 26, 2012
```
llvm-svn: 155626
```
6699a60b

Enable detection of AVX and AVX2 support through CPUID. Add AVX/AVX2 to... · 08ccfbe5

Craig Topper authored Apr 26, 2012

Enable detection of AVX and AVX2 support through CPUID. Add AVX/AVX2 to corei7-avx, core-avx-i, and core-avx2 cpu names.

llvm-svn: 155618

08ccfbe5

If triple is armv7 / thumbv7 and a CPU is specified, do not automatically assume · 9f7ad310

Evan Cheng authored Apr 26, 2012

the feature set of v7a. This comes about if the user specifies something like
-arch armv7 -mcpu=cortex-m3. We shouldn't be generating instructions such as
uxtab in this case.

rdar://11318438

llvm-svn: 155601

9f7ad310

Apr 25, 2012

Unify internal representation of ARM instructions with a register... · ba5b0cc8

Richard Barton authored Apr 25, 2012

Unify internal representation of ARM instructions with a register right-shifted by #32. These are stored as shifts by #0 in the MCInst and correctly marshalled when transforming from or to assembly representation.

llvm-svn: 155565

ba5b0cc8
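
The marshalling described in ba5b0cc8 can be sketched as a pair of conversions. This is an illustration, not the code from the commit: the enum and function names are hypothetical, and it assumes the "right shifts" in question are LSR and ASR, whose ARM immediate encoding uses 0 to mean a shift by 32.

```cpp
#include <cassert>

// Hypothetical shift kinds for the sketch; not LLVM's own enumeration.
enum class ShiftKind { LSL, LSR, ASR, ROR };

inline bool isRightShiftBy32Capable(ShiftKind Kind) {
  return Kind == ShiftKind::LSR || Kind == ShiftKind::ASR;
}

// Assembly amount -> amount stored internally: "#32" becomes 0 for LSR/ASR.
inline unsigned toStoredShiftAmount(ShiftKind Kind, unsigned AsmAmount) {
  assert(AsmAmount <= 32 && "shift amount out of range");
  return (isRightShiftBy32Capable(Kind) && AsmAmount == 32) ? 0 : AsmAmount;
}

// Stored amount -> amount printed in assembly: 0 becomes "#32" for LSR/ASR.
inline unsigned toAsmShiftAmount(ShiftKind Kind, unsigned StoredAmount) {
  return (isRightShiftBy32Capable(Kind) && StoredAmount == 0) ? 32
                                                              : StoredAmount;
}
```

Keeping a single stored form means the two directions are inverses of each other for every amount that can actually appear on an LSR or ASR, so the parser and printer cannot disagree about which representation is canonical.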

Add ifdef around getSubtargetFeatureName in tablegen output file so that only... · 3ec7c2aa

Craig Topper authored Apr 25, 2012

Add ifdef around getSubtargetFeatureName in tablegen output file so that only targets that want the function get it. This prevents other targets from getting an unused function warning.

llvm-svn: 155538

3ec7c2aa

Use vector_shuffles instead of target specific unpack nodes for AVX... · 5ff6dc34

Craig Topper authored Apr 25, 2012

Use vector_shuffles instead of target specific unpack nodes for AVX ZERO_EXTEND/ANY_EXTEND combine. These will be converted to target specific nodes during lowering. This is more consistent with other code.

llvm-svn: 155537

5ff6dc34

Do not use $gp as a dedicated global register if the target ABI is not O32. · 2020e27d
Akira Hatanaka authored Apr 25, 2012
```
llvm-svn: 155522
```
2020e27d

ARM: improved assembler diagnostics for missing CPU features. · 5117ef74

Jim Grosbach authored Apr 24, 2012

When an instruction match is found, but the subtarget features it
requires are not available (missing floating point unit, or thumb vs arm
mode, for example), issue a diagnostic that identifies what the feature
mismatch is.

rdar://11257547

llvm-svn: 155499

5117ef74

Apr 24, 2012
- ARM: Nuke remnant bogus code. · 1e75fc1f
  Jim Grosbach authored Apr 24, 2012
```
r154362 was supposed to delete this bit, but obviously didn't.

rdar://11305594

llvm-svn: 155465
```
  1e75fc1f
- AVX: Add additional vbroadcast replacement sequences for integers. · 810734b7
  Nadav Rotem authored Apr 24, 2012
```
Remove the v2f64 patterns because they do not match any vbroadcast
instruction.

llvm-svn: 155461
```
  810734b7
- AVX2: The BLENDPW instruction selects between vectors of v16i16 using an i8 · 7b7b99c7
  Nadav Rotem authored Apr 24, 2012
```
immediate. We can't use it here because the shuffle code does not check that
the lower part of the word is identical to the upper part.

llvm-svn: 155440
```
  7b7b99c7
- Refactor Thumb ITState handling in ARM Disassembler to more efficiently use its vector · e9600009
  Richard Barton authored Apr 24, 2012
```
llvm-svn: 155439
```
  e9600009
- AVX: We lower VECTOR_SHUFFLE and BUILD_VECTOR nodes into vbroadcast instructions · aa3ff8da
  Nadav Rotem authored Apr 24, 2012
```
using the pattern (vbroadcast (i32load src)). In some cases, after we generate
this pattern new users are added to the load node, which prevent the selection
of the blend pattern. This commit provides fallback patterns which perform
in-vector broadcast (using in-vector vbroadcast in AVX2 and pshufd on AVX1).

llvm-svn: 155437
```
  aa3ff8da
- Remove dangling spaces. Fix some other formatting. · 0b65c408
  Craig Topper authored Apr 24, 2012
```
llvm-svn: 155429
```
  0b65c408
- Simplify code a bit and make it compile better. Remove unused parameters. · 6f2a535d
  Craig Topper authored Apr 24, 2012
```
llvm-svn: 155428
```
  6f2a535d
- Tidy up. 80 columns, whitespace, et al. · 671ad2a5
  Jim Grosbach authored Apr 23, 2012
```
llvm-svn: 155399
```
  671ad2a5
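
The restriction noted in 7b7b99c7 above (an i8 immediate selecting between v16i16 vectors) can be made concrete with a check like the following. It is a sketch under stated assumptions, not the shuffle-lowering code: `blendImmForV16I16` is a hypothetical name, the mask convention (index `i` for the first vector, `i + 16` for the second) is chosen for the example, and undefined mask entries (-1) are simply rejected.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical helper (not LLVM code). Returns the 8-bit blend immediate for
// a 16 x i16 shuffle mask, or -1 if the mask cannot be expressed that way.
// The 8 immediate bits are applied to both 128-bit halves, so the selection
// pattern of elements 0..7 must equal that of elements 8..15.
inline int blendImmForV16I16(const std::array<int, 16> &Mask) {
  uint16_t Select = 0;
  for (int I = 0; I < 16; ++I) {
    assert(Mask[I] >= -1 && Mask[I] < 32 && "mask index out of range");
    if (Mask[I] == I + 16)
      Select = static_cast<uint16_t>(Select | (1u << I)); // take from 2nd vector
    else if (Mask[I] != I)
      return -1; // not a blend: element moved, or undefined
  }
  unsigned Lo = Select & 0xFFu;
  unsigned Hi = Select >> 8;
  return Lo == Hi ? static_cast<int>(Lo) : -1;
}
```

A mask that takes elements 0 and 8 from the second vector yields immediate 1, while a mask that takes only element 0 from the second vector is rejected, because the upper half would need a different bit pattern.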
Apr 23, 2012

Optimize the vector UINT_TO_FP, SINT_TO_FP and FP_TO_SINT operations where the... · 3f8acfc3

Nadav Rotem authored Apr 23, 2012

Optimize the vector UINT_TO_FP, SINT_TO_FP and FP_TO_SINT operations where the integer type is i8 (commonly used in graphics).

llvm-svn: 155397

3f8acfc3

This patch fixes a problem which arose when using the Post-RA scheduler · 9a091475

Preston Gurd authored Apr 23, 2012

on X86 Atom. Some of our tests failed because the tail merging part of
the BranchFolding pass was creating new basic blocks which did not
contain live-in information. When the anti-dependency code in the Post-RA
scheduler ran, it would sometimes rename the register containing
the function return value because the fact that the return value was
live-in to the subsequent block had been lost. To fix this, it is necessary
to run the RegisterScavenging code in the BranchFolding pass.

This patch makes sure that the register scavenging code is invoked
in the X86 subtarget only when post-RA scheduling is being done.
Post RA scheduling in the X86 subtarget is only done for Atom.

This patch adds a new function to the TargetRegisterClass to control
whether or not live-ins should be preserved during branch folding.
This is necessary in order for the anti-dependency optimizations done
during the PostRASchedulerList pass to work properly when doing
Post-RA scheduling for the X86 in general and for the Intel Atom in particular.

The patch adds and invokes the new function trackLivenessAfterRegAlloc()
instead of using the existing requiresRegisterScavenging().
It changes BranchFolding.cpp to call trackLivenessAfterRegAlloc() instead of
requiresRegisterScavenging(). It changes all the targets that
implemented requiresRegisterScavenging() to also implement
trackLivenessAfterRegAlloc().

It adds an assertion in the Post RA scheduler to make sure that post RA
liveness information is available when it is needed.

It changes the X86 break-anti-dependencies test to use -mcpu=atom, in order
to avoid running into the added assertion.

Finally, this patch restores the use of anti-dependency checking
(which was turned off temporarily for the 3.1 release) for
Intel Atom in the Post RA scheduler.

Patch by Andy Zhang!

Thanks to Jakob and Anton for their reviews.

llvm-svn: 155395

9a091475

ARM: VSLI two-operand assembly aliases are tblgen'erated. · 41e94d79
Jim Grosbach authored Apr 23, 2012
```
llvm-svn: 155393
```
41e94d79
ARM: tblgen'erate VSRA/VRSRA/VSRI assembly two-operand aliases. · 3dada484
Jim Grosbach authored Apr 23, 2012
```
llvm-svn: 155392
```
3dada484
ARM: vqdmulh two-operand aliases are tblgen'erated now. · e5012fba
Jim Grosbach authored Apr 23, 2012
```
llvm-svn: 155387
```
e5012fba

Revert r155365, r155366, and r155367. All three of these have regression · 3c3bb55a

Chandler Carruth authored Apr 23, 2012

test suite failures. The failures occur at each stage, and only get
worse, so I'm reverting all of them.

Please resubmit these patches, one at a time, after verifying that the
regression test suite passes. Never submit a patch without running the
regression test suite.

llvm-svn: 155372

3c3bb55a

Hexagon V5 (floating point) support. · a3f8ba24
Sirish Pande authored Apr 23, 2012
```
llvm-svn: 155367
```
a3f8ba24
Support for Hexagon architectural feature, new value jump. · 2c7bf00f
Sirish Pande authored Apr 23, 2012
```
llvm-svn: 155366
```
2c7bf00f
Support for Hexagon VLIW Packetizer. · 6cd22515
Sirish Pande authored Apr 23, 2012
```
llvm-svn: 155365
```
6cd22515

Use MVT instead of EVT through all of LowerVECTOR_SHUFFLEtoBlend and not just... · 153bb34a

Craig Topper authored Apr 23, 2012

Use MVT instead of EVT through all of LowerVECTOR_SHUFFLEtoBlend and not just the switch. Saves a little bit of binary size.

llvm-svn: 155339

153bb34a

Make getZeroVector and getOnesVector more alike as far as how they detect... · 0a2c809d

Craig Topper authored Apr 23, 2012

Make getZeroVector and getOnesVector more alike as far as how they detect 128-bit versus 256-bit vectors. Be explicit about both sizes and use llvm_unreachable. Similar changes to getLegalSplat.

llvm-svn: 155337

0a2c809d

Tidy up by removing some 'else' after 'return' · 2bbe8bcf
Craig Topper authored Apr 23, 2012
```
llvm-svn: 155336
```
2bbe8bcf

Tidy up spacing in LowerVECTOR_SHUFFLEtoBlend. Remove code that checks if... · 5c51eeec

Craig Topper authored Apr 23, 2012

Tidy up spacing in LowerVECTOR_SHUFFLEtoBlend. Remove code that checks if shuffle operand has a different type than the shuffle result since it can never happen.

llvm-svn: 155333

5c51eeec

Add a couple llvm_unreachables. · a52f0d09
Craig Topper authored Apr 23, 2012
```
llvm-svn: 155332
```
a52f0d09
Remove some tab characters. · 984dc015
Craig Topper authored Apr 23, 2012
```
llvm-svn: 155331
```
984dc015
Remove some 'else' after 'return'. No functional change. · ea428fd7
Craig Topper authored Apr 23, 2012
```
llvm-svn: 155330
```
ea428fd7

Apr 22, 2012

Make Extract128BitVector and Insert128BitVector take an unsigned instead of a... · bf7d5666

Craig Topper authored Apr 22, 2012

Make Extract128BitVector and Insert128BitVector take an unsigned instead of a ConstantNode SDValue. getConstant was almost always called just before only to have the functions take it apart and build a new ConstantSDNode.

llvm-svn: 155325

bf7d5666

Convert getNode(UNDEF) to getUNDEF. · 2d474d6d
Craig Topper authored Apr 22, 2012
```
llvm-svn: 155321
```
2d474d6d