Commits · 213fced704f57e2b4a5cd41097ac0cf2c4a987ce · Roger Ferrer / llvm-epi-0.8

Feb 07, 2013

ARM cost model: Add costs for vector selects · 213fced7

Arnold Schwaighofer authored Feb 07, 2013

Vector selects are cheap on NEON. They get lowered to a vbsl instruction.
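
As a rough illustration of why such a select is cheap, the sketch below models a bitwise select in plain C++: each result bit comes from one input where the mask bit is set and from the other where it is clear. The names `bitwiseSelect`, `vectorSelect`, and `Vec4` are hypothetical and are not part of LLVM or the patch.

```cpp
#include <cassert>
#include <cstdint>

// Bitwise select: take bits of `a` where `mask` is 1 and bits of `b` where it is 0.
// This models the semantics of a bit-select operation; it is not the backend code.
constexpr std::uint32_t bitwiseSelect(std::uint32_t mask, std::uint32_t a,
                                      std::uint32_t b) {
    return (mask & a) | (~mask & b);
}

// Hypothetical 4 x i32 vector, used only for this illustration.
struct Vec4 {
    std::uint32_t lane[4];
};

// A vector select with an all-ones / all-zeros mask per lane reduces to one
// bitwise select per lane.
inline Vec4 vectorSelect(const Vec4 &mask, const Vec4 &a, const Vec4 &b) {
    Vec4 r{};
    for (int i = 0; i < 4; ++i)
        r.lane[i] = bitwiseSelect(mask.lane[i], a.lane[i], b.lane[i]);
    return r;
}
```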

radar://13158753

llvm-svn: 174631

213fced7

R600/SI: Add pattern for flog2 · 349cabed

Michel Danzer authored Feb 07, 2013



22 more little piglits with radeonsi.

Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 174615

349cabed

R600: Consolidate sub register indices. · 9355b221

Tom Stellard authored Feb 07, 2013



Use sub0-15 everywhere.

Patch by: Michel Dänzer

Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 174610

9355b221

R600: Add support for SET*_DX10 instructions · e06163a9

Tom Stellard authored Feb 07, 2013

These instructions compare two floating point values and return an
integer true (-1) or false (0) value.

When compiling code generated by the Mesa GLSL frontend, the SET*_DX10
instructions save us four instructions for most branch decisions that
use floating-point comparisons.
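
The result convention described above (integer -1 for true, 0 for false) can be modelled in a few lines of C++. This is only a sketch of the semantics; `setGtDx10`, `setEDx10`, and `selectByMask` are hypothetical names, and the code says nothing about how the backend actually selects these instructions.

```cpp
#include <cassert>
#include <cstdint>

// Model of a DX10-style float compare: all-ones (-1) for true, 0 for false.
constexpr std::int32_t setGtDx10(float a, float b) { return a > b ? -1 : 0; }
constexpr std::int32_t setEDx10(float a, float b) { return a == b ? -1 : 0; }

// Because the result is an all-ones or all-zeros integer, it can be used
// directly as a mask without first converting a float 1.0/0.0 to an integer.
constexpr std::int32_t selectByMask(std::int32_t mask, std::int32_t ifTrue,
                                    std::int32_t ifFalse) {
    return (ifTrue & mask) | (ifFalse & ~mask);
}
```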

llvm-svn: 174609

e06163a9

R600: Fix assembly name for SETGT_INT · b40ada9b
Tom Stellard authored Feb 07, 2013
```
llvm-svn: 174607
```
b40ada9b
Make sure we call externals from libraries properly when -static. · 4a230ffa
Reed Kotler authored Feb 07, 2013
```
For example, when we are doing mips16 hard float or soft float.

llvm-svn: 174583
```
4a230ffa
Enable jumps when in -static mode. · ec60f7d3
Reed Kotler authored Feb 07, 2013
```
llvm-svn: 174580
```
ec60f7d3

Feb 06, 2013

[mips] Make NOP a pseudo instruction and expand it to "sll $zero, $zero, 0". · 556135d8
Akira Hatanaka authored Feb 06, 2013
```
llvm-svn: 174546
```
556135d8
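
The expansion named in the commit title above, `sll $zero, $zero, 0`, encodes to an all-zero instruction word under the standard MIPS32 R-type layout. The sketch below checks that arithmetic; `encodeRType` and `encodeSll` are hypothetical helpers written for this illustration, not functions from the MIPS backend.

```cpp
#include <cassert>
#include <cstdint>

// MIPS32 R-type layout:
//   opcode[31:26] rs[25:21] rt[20:16] rd[15:11] shamt[10:6] funct[5:0]
constexpr std::uint32_t encodeRType(unsigned opcode, unsigned rs, unsigned rt,
                                    unsigned rd, unsigned shamt, unsigned funct) {
    return ((opcode & 0x3Fu) << 26) | ((rs & 0x1Fu) << 21) |
           ((rt & 0x1Fu) << 16) | ((rd & 0x1Fu) << 11) |
           ((shamt & 0x1Fu) << 6) | (funct & 0x3Fu);
}

// SLL uses the SPECIAL opcode (0) with funct 0; the rs field is unused (0).
constexpr std::uint32_t encodeSll(unsigned rd, unsigned rt, unsigned shamt) {
    return encodeRType(0, 0, rt, rd, shamt, 0);
}

// $zero is register 0, so "sll $zero, $zero, 0" is the all-zero word.
static_assert(encodeSll(0, 0, 0) == 0x00000000u, "NOP encodes as zero");
```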

This is a follow-up on r174446, now taking Atom processors into · ef4558ab

Eli Bendersky authored Feb 06, 2013

account. Atoms use LEA for updating SP in prologs/epilogs, and the
exact LEA opcode depends on the data model.

Also reapplying the test case which was added and then reverted
(because of Atom failures), this time specifying explicitly the CPU in
addition to the triple. The test case now checks all variations (data
model, CPU Atom vs. Core).
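
To make the distinction between data model and architecture bit-ness concrete, here is a minimal sketch under stated assumptions. `DataModel`, `TargetInfo`, and both helper functions are hypothetical and do not correspond to the actual X86 frame-lowering code. The point is only that a 64-bit architecture running an ILP32 data model should pick the 32-bit stack adjustment.

```cpp
#include <cassert>

// Hypothetical description of a target, for illustration only.
enum class DataModel { ILP32, LP64 };

struct TargetInfo {
    bool is64BitArch;  // architecture bit-ness
    DataModel model;   // width of long and pointers
};

// Width chosen from the data model, as the commit message describes.
constexpr unsigned stackAdjustWidthBits(const TargetInfo &t) {
    return t.model == DataModel::LP64 ? 64u : 32u;
}

// Width chosen from architecture bit-ness alone, shown for contrast.
constexpr unsigned widthFromBitness(const TargetInfo &t) {
    return t.is64BitArch ? 64u : 32u;
}
```

The two functions agree for ordinary 32-bit and LP64 targets and differ only when a 64-bit architecture is paired with ILP32.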

llvm-svn: 174542

ef4558ab

PPC calling convention cleanup. · ef17c142

Bill Schmidt authored Feb 06, 2013

Most of PPCCallingConv.td is used only by the 32-bit SVR4 ABI.  Rename
things to clarify this.  Also delete some code that's been commented out
for a long time.

llvm-svn: 174526

ef17c142

R600: Support for indirect addressing v4 · f3b2a1e8

Tom Stellard authored Feb 06, 2013

Only implemented for R600 so far.  SI is missing implementations of a
few callbacks used by the Indirect Addressing pass and needs code to
handle frame indices.

At the moment R600 only supports array sizes of 16 dwords or less.
Register packing of vector types is currently disabled, which means that a
vec4 is stored in T0_X, T1_X, T2_X, T3_X, rather than T0_XYZW. In order
to correctly pack registers in all cases, we will need to implement an
analysis pass for R600 that determines the correct vector width for each
array.
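
The cost of the unpacked layout described above can be sketched as a mapping from array element to register name. The helpers below are hypothetical: the unpacked naming follows the T0_X, T1_X, ... example in the message, while the per-channel packed naming (T0_X, T0_Y, T0_Z, T0_W) is an assumption about what a packed layout would look like.

```cpp
#include <cassert>
#include <string>

// Current (unpacked) layout: element i lives in the X channel of register Ti.
inline std::string unpackedRegister(unsigned elem) {
    return "T" + std::to_string(elem) + "_X";
}

// Assumed packed layout: four consecutive elements share one register,
// one per channel.
inline std::string packedRegister(unsigned elem) {
    static const char channels[] = "XYZW";
    return "T" + std::to_string(elem / 4) + "_" + channels[elem % 4];
}

// Number of registers an array of `dwords` elements would occupy.
inline unsigned registersNeeded(unsigned dwords, bool packed) {
    return packed ? (dwords + 3) / 4 : dwords;
}
```

Under these assumptions, a 16-dword array occupies 16 registers unpacked and 4 if packed.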

v2:
  - Add support for i8 zext load from stack.
  - Coding style fixes

v3:
  - Don't reserve registers for indirect addressing when it isn't
    being used.
  - Fix bug caused by LLVM limiting the number of SubRegIndex
    declarations.

v4:
  - Fix 64-bit defines

llvm-svn: 174525

f3b2a1e8

Implement external weak (ELF) symbols on AArch64 · 228d9d3a

Tim Northover authored Feb 06, 2013

Weakly defined symbols should evaluate to 0 if they're undefined at
link-time. This is impossible to do with the usual address generation
patterns, so we should use a literal pool entry to materialise the
address.

llvm-svn: 174518

228d9d3a

Add AArch64 CRC32 instructions · a80c4c1a

Tim Northover authored Feb 06, 2013

These instructions are a late addition to the architecture, and may
yet end up behind an optional attribute, but for now they're available
at all times.

llvm-svn: 174496

a80c4c1a

Add icache prefetch operations to AArch64 · 91a51c5a

Tim Northover authored Feb 06, 2013

This adds hints to the various "prfm" instructions so that they can
affect the instruction cache as well as the data cache.

llvm-svn: 174495

91a51c5a

ARM: Use MCTargetAsmParser::validateTargetOperandClass(). · 231e7aa4

Jim Grosbach authored Feb 06, 2013

Use the validateTargetOperandClass() hook to match literal '#0' operands in
InstAlias definitions. Previously this required per-instruction C++ munging of the
operand list, but now it is handled as a natural part of the matcher. Much better.

No additional tests are required, as the pre-existing tests for these instructions
exercise the new behaviour as being functionally equivalent to the old.

llvm-svn: 174488

231e7aa4

Feb 05, 2013
- Make sure the correct opcodes are used to SUB and ADD the stack · 44a40ca1
  Eli Bendersky authored Feb 05, 2013
  
  pointer in function prologs/epilogs. The opcodes should depend on the data model (LP64 vs. ILP32) rather than the architecture bit-ness.
  
  llvm-svn: 174446
  44a40ca1
- [mips] Do not use function CC_MipsN_VarArg unless the function being analyzed · dec25266
  Akira Hatanaka authored Feb 05, 2013
  
  is a vararg function. The original code was examining flag OutputArg::IsFixed to determine whether CC_MipsN_VarArg or CC_MipsN should be called. This is not correct, since this flag is often set to false when the function being analyzed is a non-variadic function.
  
  llvm-svn: 174442
  dec25266
- Hexagon: Use TFR_cond with cmpb.[eq,gt,gtu] to handle · 6031625b
  Jyotsna Verma authored Feb 05, 2013
  
  zext( set[ne,eq,gt,ugt] (...) ) type of dag patterns. llvm-svn: 174429
  6031625b
- Move MRI liveouts to AArch64 return instructions. · dbc8c51a
  Jakob Stoklund Olesen authored Feb 05, 2013
  
  llvm-svn: 174415
  dbc8c51a
- Move MRI liveouts to XCore return instructions. · 4af19d00
  Jakob Stoklund Olesen authored Feb 05, 2013
  
  llvm-svn: 174414
  4af19d00
- Move MRI liveouts to Sparc return instructions. · ef8bf3cd
  Jakob Stoklund Olesen authored Feb 05, 2013
  
  llvm-svn: 174413
  ef8bf3cd
- Hexagon: Use multiclass for absolute addressing mode stores. · 50ca6dd8
  Jyotsna Verma authored Feb 05, 2013
  
  llvm-svn: 174412
  50ca6dd8
- Move MRI liveouts to MSP430 return instructions. · b52a3ec1
  Jakob Stoklund Olesen authored Feb 05, 2013
  
  llvm-svn: 174411
  b52a3ec1
- Move MRI liveouts to Mips return instructions. · a206050c
  Jakob Stoklund Olesen authored Feb 05, 2013
  
  llvm-svn: 174410
  a206050c
- Move MRI liveouts to PowerPC return instructions. · 8660a8c0
  Jakob Stoklund Olesen authored Feb 05, 2013
  
  llvm-svn: 174409
  8660a8c0
- Move MRI liveouts to MBlaze return instructions. · 242546c9
  Jakob Stoklund Olesen authored Feb 05, 2013
  
  llvm-svn: 174408
  242546c9
- Move MRI liveouts to Hexagon return instructions. · 0af477c3
  Jakob Stoklund Olesen authored Feb 05, 2013
  
  llvm-svn: 174407
  0af477c3
- Move MRI liveouts to ARM return instructions. · f90fb6e1
  Jakob Stoklund Olesen authored Feb 05, 2013
  
  llvm-svn: 174406
  f90fb6e1
- Move MRI liveouts to X86 return instructions. · dc69f6fb
  Jakob Stoklund Olesen authored Feb 05, 2013
  
  llvm-svn: 174402
  dc69f6fb
- Don't use MRI liveouts in R600. · fdc37670
  Jakob Stoklund Olesen authored Feb 05, 2013
  
  Something very strange is going on with the output registers in this target. Its ISelLowering code is inserting dangling CopyToReg nodes, hoping that those physregs won't get clobbered before the RETURN.
  
  This patch adds the output registers as implicit uses on RETURN instructions in the custom emission pass. I'd much prefer to have those CopyToReg nodes glued to the RETURNs, but I don't see how.
  
  llvm-svn: 174400
  fdc37670
- Avoid using MRI::liveout_iterator for computing VRSAVEs. · bf034dbd
  Jakob Stoklund Olesen authored Feb 05, 2013
  
  The liveout lists are about to be removed from MRI; this is the only place they were used after register allocation. Get the live-out V registers directly from the return instructions instead.
  
  llvm-svn: 174399
  bf034dbd
- R600: Fold remaining CONST_COPY after expand pseudo inst · df063e61
  Tom Stellard authored Feb 05, 2013
  
  Patch by: Vincent Lejeune
  Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
  llvm-svn: 174395
  df063e61
- R600: improve inputs/interpolation handling · 41afe6a6
  Tom Stellard authored Feb 05, 2013
  
  Use one intrinsic for all sorts of interpolation.
  Use two separate unexpanded instructions to represent INTERP_XY and _ZW, so that one part can be eliminated if it is not used.
  Track liveness of special interpolation regs instead of reserving them, so that those regs can be reused, lowering reg pressure.
  
  Patch By: Vadim Girlin
  
  v2[Vincent Lejeune]: Rebased against current llvm master
  
  Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>
  Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
  llvm-svn: 174394
  41afe6a6
- R600: Emit function name in the AsmPrinter · 2e5e7a5b
  Tom Stellard authored Feb 05, 2013
  
  Emitting the function name allows us to check for it in the FileCheck tests so we can make sure FileCheck is checking the output of the correct function.
  
  llvm-svn: 174392
  2e5e7a5b
- R600/SI: Add patterns for fcos and fsin. · 836cdd97
  Tom Stellard authored Feb 05, 2013
  
  Fixes 37 piglit tests and allows e.g. FlightGear to run with radeonsi.
  
  Patch by: Michel Dänzer
  Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>
  Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
  llvm-svn: 174391
  836cdd97
- Fix comments · 530a3bc5
  Eli Bendersky authored Feb 05, 2013
  
  llvm-svn: 174390
  530a3bc5
- Hexagon: Add V4 compare instructions. Enable relationship mapping · 6f635b54
  Jyotsna Verma authored Feb 05, 2013
  
  for the existing instructions. llvm-svn: 174389
  6f635b54
- Fix signed-unsigned comparison warning. · d03ef4b5
  Tim Northover authored Feb 05, 2013
  
  llvm-svn: 174387
  d03ef4b5
- Fix remaining StringRef abuse. · 96e4946a
  Tim Northover authored Feb 05, 2013
  
  This should fix the valgrind buildbot failure. llvm-svn: 174375
  96e4946a
- ARM cost model: Cost for scalar integer casts and floating point conversions · a804bbee
  Arnold Schwaighofer authored Feb 05, 2013
  
  Also adds some costs for vector integer float conversions. llvm-svn: 174371
  a804bbee