Commits · e133ed88b56c7960f2424171987729e8f794361f · Roger Ferrer / llvm-epi-0.8

Oct 29, 2013
- Move getSymbol to TargetLoweringObjectFile. · e133ed88
  Rafael Espindola authored Oct 29, 2013
```
This allows constructing a Mangler with just a TargetMachine.

llvm-svn: 193630
```
  e133ed88
- Add a helper getSymbol to AsmPrinter. · 79858aa3
  Rafael Espindola authored Oct 29, 2013
```
llvm-svn: 193627
```
  79858aa3
- [AArch64] Implement FrameAddr and ReturnAddr · ffade617
  Weiming Zhao authored Oct 29, 2013
```
Fixes PR17690

llvm-svn: 193625
```
  ffade617
- [ARM] Make sure HasCRC is initialized to false in Subtarget. · f9a67fce
  Amara Emerson authored Oct 29, 2013
```
llvm-svn: 193624
```
  f9a67fce
- Support for microMIPS jump instructions · 507e084a
  Zoran Jovanovic authored Oct 29, 2013
```
llvm-svn: 193623
```
  507e084a
- R600/SI: Add compute support for CI v2 · 6e1ee476
  Tom Stellard authored Oct 29, 2013
```
v2:
  - Fix LDS size calculation

Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 193621
```
  6e1ee476
- R600: Expand vector FSQRT ops · e118b8be
  Tom Stellard authored Oct 29, 2013
```
llvm-svn: 193620
```
  e118b8be
- The asm printer has a mangler. Use it. · 7d78b2ae
  Rafael Espindola authored Oct 29, 2013
```
llvm-svn: 193618
```
  7d78b2ae
- The AsmPrinter has a Mangler. Use it. · 69c1d631
  Rafael Espindola authored Oct 29, 2013
```
llvm-svn: 193617
```
  69c1d631
- The asm printer has a mangler. Don't keep a second pointer to it. · 38c2e65e
  Rafael Espindola authored Oct 29, 2013
```
llvm-svn: 193616
```
  38c2e65e
- ARM: Add subtarget feature for CRC · ee87e855
  Bernard Ogden authored Oct 29, 2013
```
Adds a subtarget feature for the CRC instructions (optional in v8-A) to the ARM (32-bit) backend.

Differential Revision: http://llvm-reviews.chandlerc.com/D2036

llvm-svn: 193599
```
  ee87e855
- AArch64: add 'a' inline asm operand modifier · d29ddf67
  Tim Northover authored Oct 29, 2013
```
This is used in the Linux kernel, and effectively just means "print an
address".

llvm-svn: 193593
```
  d29ddf67
- ARM cost model: Unaligned vectorized double stores are expensive · 89ae2174
  Arnold Schwaighofer authored Oct 29, 2013
```
Updated a test case that assumed that <2 x double> would vectorize to use
<4 x float>.

radar://15338229

llvm-svn: 193574
```
  89ae2174
- ARM cost model: Account for zero cost scalar SROA instructions · 77af0f6e
  Arnold Schwaighofer authored Oct 29, 2013
```
By vectorizing a series of srl, or, ... instructions we have obfuscated the
intention so much that the backend does not know how to fold this code away.

radar://15336950

llvm-svn: 193573
```
  77af0f6e
Oct 28, 2013

[mips] Simplify LowerFormalArguments using getRegClassFor. · 7d82252d
Akira Hatanaka authored Oct 28, 2013
```
No functionality change.

llvm-svn: 193540
```
7d82252d

Return early from getUnconditionalBranchTargetOpValue if the branch target is · b5281661

Lang Hames authored Oct 28, 2013

an MCExpr, in order to avoid writing an encoded zero value in the immediate
field.

When getUnconditionalBranchTargetOpValue is called with an MCExpr target, we
don't know what the final immediate field value should be. We shouldn't
explicitly set the immediate field to an encoded zero value as zero is encoded
with a non-zero bit pattern. This leads to bits being set that pollute the
final immediate value. The nature of the encoding is such that the polluted
bits only affect very large immediate values, explaining why this hasn't
caused problems earlier.

Fixes <rdar://problem/15155975>.

llvm-svn: 193535

b5281661

[arm] Implement eabi_attribute, cpu, and fpu directives. · 8cbb80d1

Logan Chien authored Oct 28, 2013

This commit allows the ARM integrated assembler to parse
and assemble the code with .eabi_attribute, .cpu, and
.fpu directives.

To implement the feature, this commit moves the code from
AttrEmitter to ARMTargetStreamers, and several new test
cases related to cortex-m4, cortex-r5, and cortex-a15 are
added.

Besides, this commit also change the Subtarget->isFPOnlySP()
to Subtarget->hasD16() to match the usage of .fpu directive.

This commit changes the test cases:

* Several .eabi_attribute directives in
  2010-09-29-mc-asm-header-test.ll are removed because the .fpu
  directive already cover the functionality.

* In the Cortex-A15 test case, the value for
  Tag_Advanced_SIMD_arch has be changed from 1 to 2,
  which is more precise.

llvm-svn: 193524

8cbb80d1

[SystemZ] Set usaAA to true · 094e6097

Richard Sandiford authored Oct 28, 2013

useAA significantly improves the handling of vector code that has TBAA
information attached. It also helps other cases, as shown by the testsuite
changes here. The only real downside I've seen is that it interferes with
MergeConsecutiveStores. The problem is that that optimization works top
down, starting at the first store in the chain, and looks for cases where
the chain result is only used by a single related store. These related
stores don't alias, so useAA will have rewritten all the later stores to
use a different chain input (typically the same one as the first store).

I think the advantages outweigh the disadvantages though, so for now I've
just disabled alias analysis for the unaligned-01.ll test.

llvm-svn: 193521

094e6097

Prune utf8 chars in comments. · 8a046439
NAKAMURA Takumi authored Oct 28, 2013
```
llvm-svn: 193512
```
8a046439
Prune trailing linefeeds. · 0b865d44
NAKAMURA Takumi authored Oct 28, 2013
```
llvm-svn: 193511
```
0b865d44
Target/R600: Un-tab-ify. · 4bb85f90
NAKAMURA Takumi authored Oct 28, 2013
```
llvm-svn: 193510
```
4bb85f90

Oct 27, 2013

Make first substantial checkin of my port of ARM constant islands code to Mips. · 91ae9829

Reed Kotler authored Oct 27, 2013

Before I just ported the shell of the pass. I've tried to keep everything
nearly identical to the ARM version. I think it will be very easy to eventually
merge these two and create a new more general pass that other targets can
use. I have some improvements I would like to make to allow pools to
be shared across functions and some other things. When I'm all done we
can think about making a more general pass. More to be ported but the
basic mechanism works now almost as good as gcc mips16.

llvm-svn: 193509

91ae9829

NVPTX: Remove unused globals. · 7ad4100f
Benjamin Kramer authored Oct 27, 2013
```
llvm-svn: 193500
```
7ad4100f
Hexagon: Remove global state. · 602bb4ad
Benjamin Kramer authored Oct 27, 2013
```
llvm-svn: 193499
```
602bb4ad
AVX-512: PMIN/PMAX intrinsics and patterns · 199c8235
Elena Demikhovsky authored Oct 27, 2013
```
Patch by Cameron McInally <cameron.mcinally@nyu.edu>

llvm-svn: 193497
```
199c8235

Oct 25, 2013

[X86][AVX512] Add patterns that match the AVX512 floating point register vbroadcast intrinsics. · 8761a8f5
Quentin Colombet authored Oct 25, 2013
```
Patch by Cameron McInally <cameron.mcinally@nyu.edu>

llvm-svn: 193422
```
8761a8f5
[X86][AVX512] Add patterns that match the AVX512 floating point vbroadcast intrinsics. · 4bf1c282
Quentin Colombet authored Oct 25, 2013
```
Patch by Cameron McInally <cameron.mcinally@nyu.edu>

llvm-svn: 193421
```
4bf1c282

ARM: allow .thumb_func to be separated from symbol definition · 1744d0ad

Tim Northover authored Oct 25, 2013

When assembling, a .thumb_func directive is supposed to be applicable to the
next symbol definition, even if there are intervening directives. We were
racing ahead to try and find it, and this commit should fix the issue.

Patch by Gabor Ballabas

llvm-svn: 193403

1744d0ad

ARM: don't expand atomicrmw inline on Cortex-M0 · c7ea8048

Tim Northover authored Oct 25, 2013

There's a barrier instruction so that should still be used, but most actual
atomic operations are going to need a platform decision on the correct
behaviour (either nop if single-threaded or OS-support otherwise).

rdar://problem/15287210

llvm-svn: 193399

c7ea8048

Optimize concat_vectors(X, undef) -> scalar_to_vector(X). · d369d4bd

Nadav Rotem authored Oct 25, 2013

This optimization is not SSE specific so I am moving it to DAGco.
The new scalar_to_vector dag node exposed a missing pattern in the AArch64 target that I needed to add.

llvm-svn: 193393

d369d4bd

ARM: Tweak usage of '*vfp' compiler_rt functions. · 1d1d6d46

Jim Grosbach authored Oct 24, 2013

Only use them if the subtarget has ARM mode, as these routines are implemented
as ARM code.

rdar://15302004

llvm-svn: 193381

1d1d6d46

Oct 24, 2013

Remove class abstraction from ARM struct byval lowering · b0653e53

David Peixotto authored Oct 24, 2013

This commit changes the struct byval lowering for arm to use inline
checks for the subtarget instead of a class abstraction to represent
the differences. The class abstraction was judged to be too much
code for this task.

No intended functionality change.

llvm-svn: 193357

b0653e53

ARM: Mark double-precision instructions as such · 5620faf7

Tim Northover authored Oct 24, 2013

This prevents us from silently accepting invalid instructions on (for example)
Cortex-M4 with just single-precision VFP support.

No tests for the extra Pat Requires because they're essentially assertions: the
affected code should have been lowered to libcalls before ISel.

rdar://problem/15302004

llvm-svn: 193354

5620faf7

ARM: add a couple more NEON predicates. · 225bcbbe

Tim Northover authored Oct 24, 2013

The fused multiply instructions were added in VFPv4 but are still NEON
instructions, in particular they shouldn't be available on a Cortex-M4 not
matter how floaty it is.

llvm-svn: 193342

225bcbbe

ARM: mark various aliases with their architecture requirements. · 64dacb2b

Tim Northover authored Oct 24, 2013

If an alias inherits directly from InstAlias then it doesn't get any default
"Requires" values, so llvm-mc will allow it even on architectures that don't
support the underlying instruction.

This tidies up the obvious VFP and NEON cases I found.

llvm-svn: 193340

64dacb2b

ARM: Use non-VFP softcalls on embedded Darwinish targets · 94ecbd2e

Tim Northover authored Oct 24, 2013

The compiler-rt functions __adddf3vfp and so on exist purely to allow Thumb1
code to make use of VFP instructions by switching back to ARM mode, they make
no sense for M-class processors which don't even have an ARM mode.

Given that justification, in practice this is a platform ABI decision so the
actual check is based on that rather than CPU features.

rdar://problem/15302004

llvm-svn: 193327

94ecbd2e

ARM: fix assert on unpredictable POP instruction. · 741e6ef4

Tim Northover authored Oct 24, 2013

POP instructions are aliased to the ARM LDM variants but have different syntax.
This caused two problems: we tried to access a non-existent operand to annotate
the '!', and the error message didn't make much sense.

With some vigorous hand-waving in the error message both problems can be
fixed.

llvm-svn: 193322

741e6ef4

Make sure SP is always aligned on a 2 byte boundary · a8d35c98
Job Noorman authored Oct 24, 2013
```
llvm-svn: 193320
```
a8d35c98

[AArch64] Fix NZCV reg live-in bug in F128CSEL codegen. · c5cae0f2

Amara Emerson authored Oct 24, 2013

When generating the IfTrue basic block during the F128CSEL pseudo-instruction
handling, the NZCV live-in for the newly created BB wasn't being added. This
caused a fault during MI-sched/live range calculation when the predecessor
for the fall-through BB didn't have a live-in for phys-reg as expected.

llvm-svn: 193316

c5cae0f2

AVX-512: added VCVTPH2PS, VCVTPS2PH with intrinsics · dd0794e5
Elena Demikhovsky authored Oct 24, 2013
```
llvm-svn: 193312
```
dd0794e5