- Apr 21, 2011
-
Evan Cheng authored
llvm-svn: 129884
-
Jakob Stoklund Olesen authored
llvm-svn: 129883
-
Jakob Stoklund Olesen authored
TII::isTriviallyReMaterializable() shouldn't depend on any properties of the register being defined by the instruction. Rematerialization is going to create a new virtual register anyway. llvm-svn: 129882
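To make the invariant concrete, here is a minimal sketch using invented stand-in types (this is not LLVM's actual TargetInstrInfo interface): the predicate consults only the instruction itself, never the register it happens to define.

```cpp
// Invented stand-in for a machine instruction; not LLVM's MachineInstr.
struct Instr {
  bool HasSideEffects;  // stores, calls, volatile accesses, ...
  bool ReadsMemory;     // loads that could observe changing memory
  unsigned NumRegUses;  // register operands the instruction reads
};

// Rematerialization clones the instruction and defines a brand-new
// virtual register, so nothing about the old destination register
// (its class, its uses, its live range) belongs in this predicate.
bool isTriviallyRematerializable(const Instr &MI) {
  return !MI.HasSideEffects && !MI.ReadsMemory && MI.NumRegUses == 0;
}
```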
-
- Apr 20, 2011
-
rdar://problem/9184212
Cameron Zwarich authored
generated by llvm-gcc, since llvm-gcc uses 2 i64s for passing a 4 x float vector on ARM rather than an i64 array like Clang. llvm-svn: 129878
-
Cameron Zwarich authored
delete it. llvm-svn: 129877
-
Cameron Zwarich authored
more cases. llvm-svn: 129876
-
Jakob Stoklund Olesen authored
On the x86-64 and thumb2 targets, some registers are more expensive to encode than others in the same register class. Add a CostPerUse field to the TableGen register description, and make it available from TRI->getCostPerUse. This represents the cost of a REX prefix or a 32-bit instruction encoding required by choosing a high register. Teach the greedy register allocator to prefer cheap registers for busy live ranges (as indicated by spill weight). llvm-svn: 129864
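As a rough illustration of how a per-register use cost can steer allocation (invented types and names, not the actual TableGen or TargetRegisterInfo interfaces):

```cpp
#include <cstdint>
#include <vector>

// Invented stand-in for an allocatable physical register.
struct PhysReg {
  unsigned Id;
  uint8_t CostPerUse;  // e.g. 1 if using it forces a REX prefix or a
                       // 32-bit Thumb2 encoding, 0 otherwise
};

// For a busy live range, prefer the cheapest-to-encode register among
// the candidates. 'Order' stands in for the target's allocation order
// and is assumed non-empty.
unsigned pickCheapRegister(const std::vector<PhysReg> &Order) {
  unsigned Best = Order.front().Id;
  uint8_t BestCost = Order.front().CostPerUse;
  for (const PhysReg &R : Order) {
    if (R.CostPerUse < BestCost) {
      Best = R.Id;
      BestCost = R.CostPerUse;
    }
  }
  return Best;
}
```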
-
Daniel Dunbar authored
llvm-svn: 129852
-
Justin Holewinski authored
used by Clang. To help Clang integration, the PTX target has been split into two targets, ptx32 and ptx64, depending on the desired pointer size.
- Add GCCBuiltin class to all intrinsics
- Split PTX target into ptx32 and ptx64
llvm-svn: 129851
-
Rafael Espindola authored
llvm-svn: 129850
-
Che-Liang Chiou authored
Patched by Dan Bailey llvm-svn: 129848
-
Che-Liang Chiou authored
Patched by Dan Bailey llvm-svn: 129847
-
Che-Liang Chiou authored
Patched by Dan Bailey llvm-svn: 129846
-
Nick Lewycky authored
llvm is built with unsigned chars, where an immediate such as 0xff would be zero-extended to 64 bits, turning "cmp $0xff,%eax" into "cmp $0xffffffffffffffff,%eax". llvm-svn: 129845
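For context, a small standalone C++ program (not part of the change) showing how the host's plain-char signedness decides whether a 0xff byte widens to 0xff or to 0xffffffffffffffff:

```cpp
#include <cstdio>

int main() {
  char C = (char)0xff;
  // Implementation-defined: -1 where plain char is signed, 255 where unsigned.
  long long AsPlain = C;
  long long AsSigned = (signed char)C;      // always sign-extends: ffffffffffffffff
  long long AsUnsigned = (unsigned char)C;  // always zero-extends: ff
  std::printf("%llx %llx %llx\n", (unsigned long long)AsPlain,
              (unsigned long long)AsSigned, (unsigned long long)AsUnsigned);
  return 0;
}
```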
-
Rafael Espindola authored
llvm-svn: 129844
-
Eric Christopher authored
manually and pass all (now) 4 arguments to the mul libcall. Add a new ExpandLibCall for just this (copied gratuitously from type legalization). Fixes rdar://9292577 llvm-svn: 129842
-
Sean Callanan authored
MCInst operands for ARM. This allows it to be more tolerant of malformed MCInsts or incorrect instruction metadata. llvm-svn: 129840
-
Daniel Dunbar authored
triple component. llvm-svn: 129838
-
Johnny Chen authored
llvm-svn: 129837
-
Daniel Dunbar authored
instead. llvm-svn: 129836
-
Daniel Dunbar authored
Triple::OSX once Clang has moved. llvm-svn: 129833
-
- Apr 19, 2011
-
-
Daniel Dunbar authored
predicates. llvm-svn: 129816
-
Daniel Dunbar authored
llvm-svn: 129815
-
Daniel Dunbar authored
enumeration values. llvm-svn: 129814
-
Daniel Dunbar authored
llvm-svn: 129813
-
Daniel Dunbar authored
llvm-svn: 129812
-
Daniel Dunbar authored
llvm-svn: 129811
-
Daniel Dunbar authored
llvm-svn: 129810
-
Daniel Dunbar authored
llvm-svn: 129809
-
Daniel Dunbar authored
llvm-svn: 129803
-
Daniel Dunbar authored
- There is a minor semantic change here (evidenced by the test change) for Darwin triples that have no version component. I debated changing the default behavior of isOSVersionLT, but decided it made more sense for triples to be explicit.
llvm-svn: 129802
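A brief usage sketch (the helper and its version cutoff are mine, and the "no version component compares as older" reading follows the note above rather than documented behavior):

```cpp
#include "llvm/ADT/Triple.h"
using namespace llvm;

// True for Darwin triples whose version is below darwin10. A triple
// with no version component presumably also compares as older, which
// is why the commit asks clients to spell versions out explicitly.
static bool deploysBeforeDarwin10(const Triple &T) {
  return T.isOSDarwin() && T.isOSVersionLT(10);
}
```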
-
Daniel Dunbar authored
llvm-svn: 129799
-
Daniel Dunbar authored
llvm-svn: 129798
-
Eric Christopher authored
llvm-svn: 129781
-
rdar://8659675
Bob Wilson authored
Making use of VFP / NEON floating point multiply-accumulate / subtraction is difficult on current ARM implementations for a few reasons:
1. Even though a single vmla has a latency that is one cycle shorter than a pair of vmul + vadd, a RAW hazard during the first few cycles (4? on Cortex-A8) can cause an additional pipeline stall, so it's frequently better to simply codegen vmul + vadd.
2. A vmla followed by a vmul, vmadd, or vsub causes the second fp instruction to stall for 4 cycles; we need to schedule them apart.
3. A vmla followed by a vmla is a special case. Obviously, issuing back-to-back RAW vmla + vmla is very bad, but this isn't ideal either:
vmul
vadd
vmla
Instead, we want to expand the second vmla:
vmla
vmul
vadd
Even with the 4-cycle vmul stall, the second sequence is still 2 cycles faster.
Up to now, isel simply avoided codegen'ing fp vmla / vmls. This works well enough, but it isn't the optimal solution. This patch attempts to make it possible to use vmla / vmls in cases where it is profitable:
A. Add the missing isel predicates that cause vmla to be codegen'ed.
B. Make sure the fmul in (fadd (fmul)) has a single use; we don't want to compute both an fmul and an fmla.
C. Add additional isel checks for vmla; avoid cases where vmla feeds into fp instructions (except for the exceptional case #3).
D. Add an ARM hazard recognizer to model the vmla / vmls hazards.
E. Add a special pre-regalloc case to expand vmla / vmls when it's likely that the vmla / vmls will trigger one of the special hazards.
Enable these fp vmlx codegen changes for Cortex-A9.
llvm-svn: 129775
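A toy model of hazard D (invented types, not LLVM's ScheduleHazardRecognizer API): track how recently a vmla / vmls was issued and flag fp multiply / accumulate instructions that would land inside its 4-cycle stall window.

```cpp
#include <string>

// Toy recognizer for the "vmla followed by vmul/vmadd/vsub stalls
// 4 cycles" hazard described above. Real code would inspect opcodes
// and scheduling itineraries, not strings.
struct ToyVMLxHazardRecognizer {
  int CyclesSinceVMLx = 1000;  // large value: no recent vmla/vmls

  static bool isFpMulOrMac(const std::string &Op) {
    return Op == "vmul" || Op == "vmadd" || Op == "vsub" ||
           Op == "vmla" || Op == "vmls";
  }

  // Would issuing Op this cycle hit the post-vmla stall window?
  bool hasHazard(const std::string &Op) const {
    return isFpMulOrMac(Op) && CyclesSinceVMLx < 4;
  }

  void emitInstruction(const std::string &Op) {
    if (Op == "vmla" || Op == "vmls")
      CyclesSinceVMLx = 0;
    else
      ++CyclesSinceVMLx;
  }

  void advanceCycle() { ++CyclesSinceVMLx; }
};
```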