Commits · 2ef36b633b131f5d0199285512301fafc371bb5d · Roger Ferrer / llvm-epi-0.8

Mar 08, 2013

R600: Optimize another selectcc case · 5e524897

Tom Stellard authored Mar 08, 2013



fold selectcc (selectcc x, y, a, b, cc), b, a, b, setne ->
     selectcc x, y, a, b, cc

Reviewed-by: Christian König <christian.koenig@amd.com>
llvm-svn: 176700
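
The fold above can be sanity-checked with a small model. This is only a sketch in plain Python, not the R600 code: `selectcc`, `nested`, `fold_holds` and the `CONDS` table are hypothetical names, and integers stand in for DAG values.

```python
from itertools import product
import operator

# Toy model of SELECT_CC: compare x and y with cc, yield a if true, else b.
# The condition codes listed here are a small illustrative subset.
CONDS = {
    "seteq": operator.eq,
    "setne": operator.ne,
    "setlt": operator.lt,
    "setgt": operator.gt,
}


def selectcc(x, y, a, b, cc):
    return a if CONDS[cc](x, y) else b


def nested(x, y, a, b, cc):
    # selectcc (selectcc x, y, a, b, cc), b, a, b, setne
    return selectcc(selectcc(x, y, a, b, cc), b, a, b, "setne")


def fold_holds(values=range(-2, 3)):
    # Exhaustively compare the nested form with the folded form.
    return all(
        nested(x, y, a, b, cc) == selectcc(x, y, a, b, cc)
        for cc in CONDS
        for x, y, a, b in product(values, repeat=4)
    )
```

The inner node can only produce `a` or `b`, so comparing its result against `b` with setne selects `a` exactly when the inner condition held (and when `a == b` both forms yield the same value anyway).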


R600: Improve custom lowering of select_cc · 2add82de

Tom Stellard authored Mar 08, 2013



Two changes:
1. Prefer SET* instructions when possible
2. Handle the CND*_INT case with floating-point args

Reviewed-by: Christian König <christian.koenig@amd.com>
llvm-svn: 176699


R600: Change operation action from Custom to Expand for BR_CC · 492ebeab
Tom Stellard authored Mar 08, 2013
```
Reviewed-by: Christian König <christian.koenig@amd.com>
llvm-svn: 176698
```
R600: Change operation action from Custom to Expand for SETCC · e8f9f287
Tom Stellard authored Mar 08, 2013
```
Reviewed-by: Christian König <christian.koenig@amd.com>
llvm-svn: 176697
```
R600: Set BooleanContents to ZeroOrNegativeOneBooleanContent · b852af5d
Tom Stellard authored Mar 08, 2013
```
Reviewed-by: Christian König <christian.koenig@amd.com>
llvm-svn: 176696
```

DAGCombiner: Use correct value type for checking legality of BR_CC v3 · b1588fc0

Tom Stellard authored Mar 08, 2013

LegalizeDAG.cpp uses the value type of the comparison operands when checking
the legality of BR_CC, so DAGCombiner should do the same.

v2:
  - Expand more BR_CC value types for NVPTX

v3:
  - Expand correct BR_CC value types for Hexagon, Mips, and XCore.

llvm-svn: 176694
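
Why the queried type matters can be shown with a toy legality table. This is a sketch in plain Python, not LLVM's TargetLowering API: `ACTIONS`, `operation_action`, `is_legal_or_custom` and `can_form_br_cc` are hypothetical names, and the target that expands BR_CC on f32 is made up for illustration.

```python
# Hypothetical target: BR_CC on f32 comparison operands must be expanded.
# Anything not listed falls back to "Legal", as an assumed default.
ACTIONS = {("BR_CC", "f32"): "Expand"}


def operation_action(op, vt):
    return ACTIONS.get((op, vt), "Legal")


def is_legal_or_custom(op, vt):
    return operation_action(op, vt) in ("Legal", "Custom")


def can_form_br_cc(operand_vt):
    # Key the query on the type of the comparison operands, which is the
    # type the legalizer is described as using in the message above.
    return is_legal_or_custom("BR_CC", operand_vt)
```

In this model, a query keyed on any other type (for example `is_legal_or_custom("BR_CC", "Other")`) reports the node as legal even though the f32 form would later be expanded, which is the kind of mismatch the message describes.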


Hexagon: Add patterns for zero extended loads from i1->i64. · 7825e064
Jyotsna Verma authored Mar 08, 2013
```
llvm-svn: 176689
```
AArch64: expand sincos operations, we don't support them. · 95f4892d
Tim Northover authored Mar 08, 2013
```
Patch based on Mans Rullgard's.

llvm-svn: 176688
```

R600/SI: Use source scheduler · f52a672b

Michel Danzer authored Mar 08, 2013



This is certainly not the last word on scheduling for this target, but
right now this allows a few apps to run / finish with radeonsi, most
notably UT2004 / Lightsmark. They fail to compile some shaders with the
default scheduler because it ends up trying to spill registers, which
we don't support yet (and which is probably a bad idea in general for
performance if it can be avoided).

NOTE: This is a candidate for the Mesa stable branch.

Reviewed-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 176687


Mar 07, 2013

ArrayRefize some code. No functionality change. · fdf362bd
Benjamin Kramer authored Mar 07, 2013
```
llvm-svn: 176648
```
Hexagon: Handle i8, i16 and i1 Var Args. · c7dcc2fb
Jyotsna Verma authored Mar 07, 2013
```
llvm-svn: 176647
```
Hexagon: Add support to lower block address. · 2ba0c0b9
Jyotsna Verma authored Mar 07, 2013
```
llvm-svn: 176637
```

X86: Fold EXTRACT_SUBVECTORs of a BUILD_VECTOR into a smaller BUILD_VECTOR. · 2c3d0df8

Benjamin Kramer authored Mar 07, 2013

That can usually be lowered efficiently and is common in Sandy Bridge code.
It would be nice to do this in DAGCombiner but we can't insert arbitrary
BUILD_VECTORs this late.

Fixes PR15462.

llvm-svn: 176634
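
The shape of this fold can be sketched on a toy node representation. This is plain Python with tuples standing in for DAG nodes, not the X86 lowering code; `build_vector`, `extract_subvector` and `fold_extract_of_build` are hypothetical helpers.

```python
def build_vector(*elts):
    # Toy BUILD_VECTOR node holding its scalar operands.
    return ("BUILD_VECTOR", tuple(elts))


def extract_subvector(node, index, width):
    # Toy EXTRACT_SUBVECTOR node: take `width` lanes starting at `index`.
    return ("EXTRACT_SUBVECTOR", node, index, width)


def fold_extract_of_build(node):
    # Rewrite EXTRACT_SUBVECTOR(BUILD_VECTOR(...)) into a smaller
    # BUILD_VECTOR of just the selected operands; leave other nodes alone.
    if node[0] != "EXTRACT_SUBVECTOR":
        return node
    _, src, index, width = node
    if src[0] != "BUILD_VECTOR":
        return node
    return build_vector(*src[1][index:index + width])
```

For example, extracting the upper four lanes of an eight-element BUILD_VECTOR becomes a four-element BUILD_VECTOR of those same operands.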


R600/SI: rework input interpolation v2 · 99ee0f47

Christian Konig authored Mar 07, 2013



v2: update CMakeLists.txt as well

Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 176626


R600/SI: remove SI_vs_load_buffer_index · aa9f4e6d

Christian Konig authored Mar 07, 2013



Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 176625


R600/SI: remove SGPR address space v2 · 189357c6

Christian Konig authored Mar 07, 2013



v2: fix R600 regressions

Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 176624


R600/SI: add proper formal parameter handling for SI · 2c8f6d53

Christian Konig authored Mar 07, 2013



Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 176623


R600/SI: remove shader type intrinsic · 3625055b

Christian Konig authored Mar 07, 2013



Just encode the type as target specific attribute.

Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 176622


R600/SI: switch types of SGPRs to v*i8 · 2214f14a

Christian Konig authored Mar 07, 2013



Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 176621


R600/SI: fix unused variable warning · a0ed6572

Christian Konig authored Mar 07, 2013



Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 176620


Fix two remaining issues after fixing PR15355 when CMOV is not available · d5cac37d

Michael Liao authored Mar 07, 2013

- Phi nodes should be replaced/updated after lowering CMOV into a branch,
  because the 'mainMBB' updating operand in the Phi node is changed.
- Add EFLAGS as a live-in before lowering the 2nd CMOV. This is necessary
  because we reuse the EFLAGS generated before the 1st lowered CMOV, which
  won't clobber EFLAGS. However, we need to specify that explicitly.
- '-attr=-cmov' test cases are added.

llvm-svn: 176598


Mar 06, 2013

[mips] Custom-legalize BR_JT. · 0f693a8a

Akira Hatanaka authored Mar 06, 2013

In N64-static, GOT address is needed to compute the branch address.

llvm-svn: 176580


Fix PR15355 · da22b30b

Michael Liao authored Mar 06, 2013

- Clear the 'mayStore' flag when loading from the atomic variable before the
  spin loop
- Clear the kill flag on the registers forming the address of that atomic
  variable, since they go from one use to multiple uses
- Don't use a physical register as a live-in register in a BB (neither entry
  nor landing pad); copy it into a virtual register instead

(patch by Cameron Zwarich)

llvm-svn: 176538


[mips] Remove android calling convention. · 1454ed8a

Akira Hatanaka authored Mar 05, 2013

This calling convention was added just to handle functions that return vectors
of floats. The fix committed in r165585 solves the problem.

llvm-svn: 176530


Mar 05, 2013

[mips] Fix MipsCC::analyzeReturn so that, in soft-float mode, fp128 gets · e092f729
Akira Hatanaka authored Mar 05, 2013
```
returned in registers $2 and $4.

llvm-svn: 176527
```
[mips] Fix MipsTargetLowering::LowerCallResult and LowerReturn to correctly · 5f3ba9e5
Akira Hatanaka authored Mar 05, 2013
```
handle fp128 returns.

llvm-svn: 176523
```
[mips] Fix MipsTargetLowering::LowerCall to pass fp128 arguments in floating · 3b7391d1
Akira Hatanaka authored Mar 05, 2013
```
point registers.

llvm-svn: 176521
```
[mips] Correct handling of fp128 (long double) formals and read long double · 4b634fa3
Akira Hatanaka authored Mar 05, 2013
```
parameters from floating point registers if target is mips64 hard float.

llvm-svn: 176520
```

Add more functions to the TLI. · b904e6e4

Meador Inge authored Mar 05, 2013



This patch adds many more functions to the target library information.
All of the functions being added were discovered while doing the migration
of the simplify-libcalls attribute annotation functionality to the
functionattrs pass.  As a part of that work the attribute annotation logic
will query TLI to determine if a function should be annotated or not.

Signed-off-by: Meador Inge <meadori@codesourcery.com>
llvm-svn: 176514


reverting patch 176508. · 457801f7
Jyotsna Verma authored Mar 05, 2013
```
llvm-svn: 176513
```
Hexagon: Add support for lowering block address. · 7179e712
Jyotsna Verma authored Mar 05, 2013
```
llvm-svn: 176508
```
R600: Do not predicate vector op · fe32bd87
Vincent Lejeune authored Mar 05, 2013
```
llvm-svn: 176507
```
Hexagon: Expand addc, adde, subc and sube. · 0eeea14e
Jyotsna Verma authored Mar 05, 2013
```
llvm-svn: 176505
```
Update cmake build. · 5dc83180
Benjamin Kramer authored Mar 05, 2013
```
llvm-svn: 176501
```
Hexagon: Use MO operand flags to mark constant extended instructions. · f1214a8a
Jyotsna Verma authored Mar 05, 2013
```
llvm-svn: 176500
```
Hexagon: Add encoding bits to the TFR64 instructions. · f4e324f4
Jyotsna Verma authored Mar 05, 2013
```
Set isMoveImm, isAsCheapAsAMove flags for TFRI instructions.

llvm-svn: 176499
```

R600: initial scheduler code · 68b6b6dd

Vincent Lejeune authored Mar 05, 2013

This is a skeleton for a pre-RA MachineInstr scheduler strategy. Currently
it only tries to expose more parallelism for ALU instructions (this also
makes the distribution of GPR channels more uniform and increases the
chances of ALU instructions being packed together in a single VLIW group).
It also tries to reduce clause switching by grouping instructions of the
same kind (ALU/FETCH/CF) together.

Vincent Lejeune:
 - Support for VLIW4 Slot assignment
 - Recomputation of ScheduleDAG to get more parallelism opportunities

Tom Stellard:
 - Fix assertion failure when trying to determine an instruction's slot
   based on its destination register's class
 - Fix some compiler warnings

Vincent Lejeune: [v2]
 - Remove recomputation of ScheduleDAG (will be provided in a later patch)
 - Improve estimation of an ALU clause size so that the heuristic does not
   emit CF instructions at the wrong position.
 - Make schedule heuristic smarter using SUnit Depth
 - Take constant read limitations into account

Vincent Lejeune: [v3]
 - Fix some uninitialized values in ConstPair
 - Add asserts to ensure an ALU slot is always populated

llvm-svn: 176498
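
The clause-grouping idea can be illustrated with a toy greedy scheduler. This is only a sketch in plain Python, not the R600 strategy itself: `schedule`, `clause_switches`, `INSTRS` and `DEPS` are hypothetical, the dependency graph is assumed acyclic, and slot assignment, SUnit depth and constant read limits are ignored.

```python
def clause_switches(kinds):
    # Number of positions where the instruction kind changes.
    return sum(1 for prev, cur in zip(kinds, kinds[1:]) if prev != cur)


def schedule(instrs, deps):
    # instrs: name -> kind ("ALU", "FETCH", "CF"), in source order.
    # deps:   name -> set of names that must be scheduled first (acyclic).
    done, order, current = set(), [], None
    while len(order) < len(instrs):
        ready = [n for n in instrs
                 if n not in done and deps.get(n, set()) <= done]
        # Prefer a ready instruction of the current clause kind;
        # otherwise fall back to the first ready one in source order.
        pick = next((n for n in ready if instrs[n] == current), ready[0])
        order.append(pick)
        done.add(pick)
        current = instrs[pick]
    return order


# Made-up input: ALU and FETCH instructions interleaved in source order.
INSTRS = {"a0": "ALU", "f0": "FETCH", "a1": "ALU", "f1": "FETCH", "a2": "ALU"}
DEPS = {"a2": {"f0", "f1"}}
```

On this input the source order switches clause kind four times, while `schedule(INSTRS, DEPS)` groups the independent ALU and FETCH instructions and switches only twice.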


R600: Remove LowerConstCopyPass and lower CONST_COPY right after ISel. · 0b72f102

Vincent Lejeune authored Mar 05, 2013

Maintaining CONST_COPY instructions until pre-emit may prevent some ifcvt cases,
and taking them into account for scheduling is difficult for no real benefit.

llvm-svn: 176488


R600: Turn BUILD_VECTOR into Reg_Sequence · 3b6f20e9
Vincent Lejeune authored Mar 05, 2013
```
Reviewed-by: Tom Stellard <thomas.stellard at amd.com>
llvm-svn: 176487
```

R600: CONST_ADDRESS node is not marked as mayLoad anymore · 10a5e477

Vincent Lejeune authored Mar 05, 2013

Reviewed-by: Tom Stellard <thomas.stellard at amd.com>

mayLoad complicates scheduling and does not provide any useful information,
as the location is not writable at all.

llvm-svn: 176486
