Commits · 8fe56e00ed9530438a2cd2b714e9938f0fea3eec · Roger Ferrer / llvm-epi-0.8

Nov 15, 2012
- PowerPC: Lowering floor intrinsic for Altivec · bdface56
  Adhemerval Zanella authored Nov 15, 2012
```
 
This patch lowers llvm.floor, llvm.ceil, llvm.trunc, and
llvm.nearbyint to Altivec instructions when operating on vectors of
four single-precision floats.

llvm-svn: 168086
```
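The rounding behaviour behind this lowering can be sketched lane by lane. This is a minimal model, not the patch itself: the mnemonics (`vrfim`, `vrfip`, `vrfiz`, `vrfin`) come from the AltiVec ISA rather than from the commit message, the helper name `lower_v4f32` is hypothetical, and `llvm.nearbyint` is modelled under the default round-to-nearest-even mode only.

```python
import math

# Assumed intrinsic -> AltiVec mnemonic mapping (taken from the AltiVec ISA,
# not from the commit message), paired with a scalar model of each rounding.
ALTIVEC_ROUNDING = {
    "llvm.floor": ("vrfim", math.floor),   # round toward -infinity
    "llvm.ceil": ("vrfip", math.ceil),     # round toward +infinity
    "llvm.trunc": ("vrfiz", math.trunc),   # round toward zero
    "llvm.nearbyint": ("vrfin", round),    # round to nearest, ties to even
}


def lower_v4f32(intrinsic, lanes):
    """Return (mnemonic, rounded lanes) for a vector of four finite floats."""
    if len(lanes) != 4:
        raise ValueError("only 4 x float vectors are modelled here")
    mnemonic, op = ALTIVEC_ROUNDING[intrinsic]
    # Apply the scalar rounding to each lane independently.
    return mnemonic, [float(op(x)) for x in lanes]
```

The model ignores NaN, infinities, and the sign of zero, which real hardware handles per lane.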
- Add assertions in MipsLongBranch which check the size of basic blocks. · 5fdeac32
  Akira Hatanaka authored Nov 15, 2012
```
llvm-svn: 168078
```
- Return 0 instead of false. · f34e4fa7
  Jakub Staszak authored Nov 15, 2012
```
llvm-svn: 168076
```
- Simplify code. · 11d1aee6
  Jakub Staszak authored Nov 15, 2012
```
llvm-svn: 168064
```
- Use empty parens for empty function parameter list instead of '(void)'. · 0011bbf9
  Dmitri Gribenko authored Nov 15, 2012
```
llvm-svn: 168049
```
- Revert changing FNEG of v4f32 to Expand. It's legal. · 323f614c
  Craig Topper authored Nov 15, 2012
```
llvm-svn: 168030
```
- Make FNEG and FABS of v4f32 Expand. · bb706058
  Craig Topper authored Nov 15, 2012
```
llvm-svn: 168029
```
- Make a bunch of floating point operations on vectors Expand so that... · c8a2adf1
  Craig Topper authored Nov 15, 2012
```
Make a bunch of floating point operations on vectors Expand so that instruction selection won't fail.

llvm-svn: 168028
```
- Add llvm.ceil, llvm.trunc, llvm.rint, llvm.nearbyint intrinsics. · 61d04578
  Craig Topper authored Nov 15, 2012
```
llvm-svn: 168025
```
- Remove unneeded #includes. · f33e0f95
  Jakub Staszak authored Nov 14, 2012
```
llvm-svn: 168006
```
- NVPTXISelLowering.cpp: Fix warnings. [-Wunused-variable] · 5bbe0e18
  NAKAMURA Takumi authored Nov 14, 2012
```
llvm-svn: 168001
```
Nov 14, 2012

Remove the CellSPU port. · 950d8703
Eric Christopher authored Nov 14, 2012
```
Approved by Chris Lattner.

llvm-svn: 167984
```
Fix invalid asserts, use llvm_unreachable instead. · d17df318
Jakub Staszak authored Nov 14, 2012
```
llvm-svn: 167976
```
Added multiclass for post-increment load instructions. · 66493608
Jyotsna Verma authored Nov 14, 2012
```
llvm-svn: 167974
```

X86: Enable SSE memory intrinsics even when stack alignment is less than 16 bytes. · 6293429b

Benjamin Kramer authored Nov 14, 2012

The stack realignment code was fixed to work when there is stack realignment and
a dynamic alloca is present so this shouldn't cause correctness issues anymore.

Note that this also enables generation of AVX instructions for memset
under the assumptions:
- Unaligned loads/stores are always fast on CPUs supporting AVX
- AVX is not slower than SSE
We may need some tweaked heuristics if one of those assumptions turns out not to
be true.

Effectively reverts r58317. Part of PR2962.

llvm-svn: 167967

The code pattern "imm0_255_neg" is used for checking if an immediate value is... · 9f567c62

Nadav Rotem authored Nov 14, 2012

The code pattern "imm0_255_neg" is used for checking if an immediate value is a small negative number.
This patch changes the definition of negative from -0..-255 to -1..-255. I am changing this because of
a bug that we had in some of the patterns that assumed that "subs" of zero does not set the carry flag.

rdar://12028498

llvm-svn: 167963
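
Why zero has to be excluded can be sketched with the ARM carry-flag rules. This is a small model under stated assumptions, not the patterns touched by the patch: the helper names are hypothetical, and the flag formulas follow the usual ARM definition, where `SUBS` computes `rn + NOT(imm) + 1` and the carry is the carry-out of that addition.

```python
MASK32 = 0xFFFFFFFF


def is_imm0_255_neg(value):
    """Range after the patch: -1..-255, so zero no longer matches."""
    return -255 <= value <= -1


def adds_carry(rn, imm):
    """Carry flag after ADDS rn, #imm on a 32-bit register."""
    return ((rn & MASK32) + (imm & MASK32)) >> 32


def subs_carry(rn, imm):
    """Carry flag after SUBS rn, #imm: carry-out of rn + NOT(imm) + 1."""
    return ((rn & MASK32) + (~imm & MASK32) + 1) >> 32
```

For any `k` in 1..255, `adds_carry(rn, -k)` and `subs_carry(rn, k)` agree, so rewriting an add of `-k` as a subtract of `k` preserves the flag. For zero they differ: `adds_carry(rn, 0)` is 0 while `subs_carry(rn, 0)` is 1.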

[NVPTX] Implement custom lowering of loads/stores for i1 · c6462aac

Justin Holewinski authored Nov 14, 2012

Loads from i1 become loads from i8 followed by trunc
Stores to i1 become zext to i8 followed by store to i8

Fixes PR13291

llvm-svn: 167948
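
The two rewrites described above can be modelled on a plain byte array. This is an illustrative sketch only: `store_i1` and `load_i1` are hypothetical names, and the `bytearray` stands in for device memory.

```python
def store_i1(memory, address, bit):
    """Store an i1: zero-extend to i8, then store the byte."""
    widened = bit & 1          # zext i1 -> i8 yields 0 or 1
    memory[address] = widened  # i8 store


def load_i1(memory, address):
    """Load an i1: load the full byte, then truncate to the low bit."""
    byte = memory[address]     # i8 load
    return byte & 1            # trunc i8 -> i1 keeps only bit 0
```

Because the truncation keeps only bit 0, a byte such as `0xFE` reads back as 0 in this model.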

X86: Better diagnostics for 32-bit vs. 64-bit mode mismatches. · 6f1f41b1

Jim Grosbach authored Nov 14, 2012

When an instruction as written requires 32-bit mode and we're assembling
in 64-bit mode, or vice-versa, issue a more specific diagnostic about
what's wrong.

rdar://12700702

llvm-svn: 167937

Set FFLOOR of vectors to expand to keep instruction selection from failing. · c4343f2c
Craig Topper authored Nov 14, 2012
```
llvm-svn: 167922
```
Factor out an overly replicated typecast. No functional change. · a7f489d1
Craig Topper authored Nov 14, 2012
```
llvm-svn: 167916
```
Set FFLOOR for vectors to expand on CellSPU to keep instruction selection from... · 54c45ab5
Craig Topper authored Nov 14, 2012
```
Set FFLOOR for vectors to expand on CellSPU to keep instruction selection from failing on llvm.floor of a vector.

llvm-svn: 167914
```
Use TARGET2 relocation for TType references on ARM. · e42af369
Anton Korobeynikov authored Nov 14, 2012
```
Do some cleanup of the code while here.

Inspired by patch by Logan Chien!

llvm-svn: 167904
```

Nov 13, 2012

Add (some) PowerPC TLS relocation types to ELF.h and · 85578500
Ulrich Weigand authored Nov 13, 2012
```
generate them from PPCELFObjectWriter::getRelocTypeInner
as appropriate.

llvm-svn: 167864
```
Fix wrong PowerPC instruction opcodes for: · 0f79500a
Ulrich Weigand authored Nov 13, 2012
```
 - lwaux
 - lhzux
 - stbu

llvm-svn: 167863
```

Fix wrong PowerPC instruction encodings due to · a82389b3

Ulrich Weigand authored Nov 13, 2012

operand field name mismatches in:
 - AForm_3  (fmul, fmuls)
 - XFXForm_5 (mtcrf)
 - XFLForm (mtfsf)

llvm-svn: 167862

Fix instruction encoding for "bd(n)z" on PowerPC, · 01177185
Ulrich Weigand authored Nov 13, 2012
```
by using a new instruction format BForm_1.

llvm-svn: 167861
```
Fix instruction encoding for "isel" on PowerPC, · 84ee76ac
Ulrich Weigand authored Nov 13, 2012
```
using a new instruction format AForm_4.

llvm-svn: 167860
```

X86: when constructing VZEXT_LOAD from other loads, make sure its output · 0f3240d3

Manman Ren authored Nov 13, 2012

chain is correctly set up.

As an example, if the original load must happen before later stores, we need
to make sure the constructed VZEXT_LOAD is constrained to be before the stores.

rdar://12684358

llvm-svn: 167859

misched: Allow subtargets to enable misched and dependent options. · 108c88c5

Andrew Trick authored Nov 13, 2012

This allows me to begin enabling (or backing out) misched by default
for one subtarget at a time. To run misched we typically want to:
- Disable SelectionDAG scheduling (use the source order scheduler)
- Enable more aggressive coalescing (until we decide to always run the coalescer this way)
- Enable MachineScheduler pass itself.

Disabling PostRA sched may follow for some subtargets.

llvm-svn: 167826

Test commit. · ccfd77ef
Jyotsna Verma authored Nov 13, 2012
```
Add a blank line.

llvm-svn: 167819
```

Nov 12, 2012

misched: Target-independent support for load/store clustering. · a7714a0f

Andrew Trick authored Nov 12, 2012

This infrastructure is generally useful for any target that wants to
strongly prefer two instructions to be adjacent after scheduling.

A following checkin will add target-specific hooks with unit
tests. Then this feature will be enabled by default with misched.

llvm-svn: 167742
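
One way to picture which instructions a scheduler might want to keep adjacent is sketched below. It is a toy model and not the infrastructure added by this commit: the `(name, base, offset)` tuples and the function `cluster_pairs` are hypothetical, and the real decision is left to target-specific hooks that the message says will follow.

```python
from collections import defaultdict


def cluster_pairs(mem_ops):
    """Pair loads that share a base register, in increasing offset order.

    mem_ops is a list of (name, base, offset) tuples; all three fields are
    hypothetical stand-ins for what a target hook would report.
    """
    by_base = defaultdict(list)
    for name, base, offset in mem_ops:
        by_base[base].append((offset, name))

    pairs = []
    for base in sorted(by_base):
        ordered = sorted(by_base[base])
        # Walk the sorted loads two at a time; each pair is a candidate
        # for being kept adjacent after scheduling.
        for i in range(0, len(ordered) - 1, 2):
            pairs.append((ordered[i][1], ordered[i + 1][1]))
    return pairs
```

A load without a partner on the same base is simply left unpaired in this sketch.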

Make TOC order deterministic by using MapVector instead of DenseMap. · 2c93acdf
Ulrich Weigand authored Nov 12, 2012
```
llvm-svn: 167737
```
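The idea behind a MapVector-style container can be sketched as a lookup table plus an insertion-ordered list. This is a simplified model and not LLVM's `MapVector` API: the class and method names below are hypothetical, and the `dict` is used only for lookup, never for iteration.

```python
class MapVector:
    """Insertion-ordered map: a key -> index table plus a list of entries."""

    def __init__(self):
        self._index = {}    # key -> position in self._entries
        self._entries = []  # [key, value] pairs in insertion order

    def lookup_or_insert(self, key, default):
        """Return the value for key, inserting default on first use."""
        pos = self._index.get(key)
        if pos is None:
            self._index[key] = len(self._entries)
            self._entries.append([key, default])
            return default
        return self._entries[pos][1]

    def keys(self):
        """Keys in first-insertion order, regardless of how they hash."""
        return [key for key, _ in self._entries]
```

Iterating over the entry list gives the same order on every run, which is the property a table emitted into an object file needs.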
Remove unused field. · 16631130
Eric Christopher authored Nov 12, 2012
```
llvm-svn: 167719
```

Fix PR14314 · d39c0fb1

Michael Liao authored Nov 12, 2012

- Fix operand order for atomic sub, where the minuend is the value
  loaded from memory and the subtrahend is the parameter specified.

llvm-svn: 167718
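
The operand order described above can be illustrated with a lock-based stand-in for an atomic cell. This is a sketch only: `AtomicCell` and its methods are hypothetical names, and the lock merely models atomicity rather than any real instruction sequence.

```python
import threading


class AtomicCell:
    """Lock-protected integer used to model an atomic read-modify-write."""

    def __init__(self, value):
        self._value = value
        self._lock = threading.Lock()

    def fetch_sub(self, operand):
        """Atomically store (loaded - operand) and return the loaded value."""
        with self._lock:
            loaded = self._value            # minuend: value read from memory
            self._value = loaded - operand  # subtrahend: the given parameter
            return loaded

    def load(self):
        """Return the current value."""
        with self._lock:
            return self._value
```

With the operands swapped, a cell holding 10 would end up at -7 after subtracting 3 instead of 7.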

[NVPTX] Add more precise PTX/SM target attributes · 1812ee9a

Justin Holewinski authored Nov 12, 2012

Each SM and PTX version is modeled as a subtarget feature/CPU. Additionally,
PTX 3.1 is added as the default PTX version to be out-of-the-box compatible
with CUDA 5.0.

Available CPUs for this target:

  sm_10 - Select the sm_10 processor.
  sm_11 - Select the sm_11 processor.
  sm_12 - Select the sm_12 processor.
  sm_13 - Select the sm_13 processor.
  sm_20 - Select the sm_20 processor.
  sm_21 - Select the sm_21 processor.
  sm_30 - Select the sm_30 processor.
  sm_35 - Select the sm_35 processor.

Available features for this target:

  ptx30 - Use PTX version 3.0.
  ptx31 - Use PTX version 3.1.
  sm_10 - Target SM 1.0.
  sm_11 - Target SM 1.1.
  sm_12 - Target SM 1.2.
  sm_13 - Target SM 1.3.
  sm_20 - Target SM 2.0.
  sm_21 - Target SM 2.1.
  sm_30 - Target SM 3.0.
  sm_35 - Target SM 3.5.

llvm-svn: 167699
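
A small helper can show how the CPU and feature names above might be combined on an `llc` command line. Treat it as a sketch under assumptions: the function `llc_args` is hypothetical, the `-march`, `-mcpu`, and `-mattr` options are standard `llc` flags, and the `nvptx` target name and the input file are placeholders.

```python
NVPTX_CPUS = ("sm_10", "sm_11", "sm_12", "sm_13",
              "sm_20", "sm_21", "sm_30", "sm_35")
PTX_FEATURES = ("ptx30", "ptx31")


def llc_args(input_file, cpu, ptx="ptx31", march="nvptx"):
    """Build an llc argument list for one SM/PTX combination.

    ptx defaults to ptx31, matching the default named in the commit message.
    """
    if cpu not in NVPTX_CPUS:
        raise ValueError(f"unknown NVPTX CPU: {cpu}")
    if ptx not in PTX_FEATURES:
        raise ValueError(f"unknown PTX feature: {ptx}")
    return ["llc", f"-march={march}", f"-mcpu={cpu}",
            f"-mattr=+{ptx}", input_file]
```

The returned list can be passed to `subprocess.run` on a machine that has a matching `llc` build.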

Nov 11, 2012
- Move some helper methods to being static functions in the implementation file. · dd13d3fd
  Craig Topper authored Nov 11, 2012
```
llvm-svn: 167696
```
- Use the isTruncFree and isZExtFree API to figure out if these operations are free. Thanks Andy! · 3b99dc62
  Nadav Rotem authored Nov 11, 2012
```
llvm-svn: 167685
```
Nov 10, 2012
- Remove unnecessary subtraction and addition by 1 around a couple of for loops. · a43e2fd3
  Craig Topper authored Nov 10, 2012
```
llvm-svn: 167673
```
- Tidy up spacing. No functional change. · 84afbf2b
  Craig Topper authored Nov 10, 2012
```
llvm-svn: 167671
```
- Removed unimplemented method declaration. · 2dfc1a4d
  Craig Topper authored Nov 10, 2012
```
llvm-svn: 167670
```