Commits · b9bf8dca47611cf1f79713c4cddcce13113e13ca · Roger Ferrer / llvm-epi-0.8

Feb 08, 2013

Add the 16 bit version of addiu. To the assembler, the 16 and 32 bit are the · b9bf8dca

Reed Kotler authored Feb 08, 2013

same so we put in the comment field an indicator when we think we are
emitting the 16 bit version. For the direct object emitter, the difference is 
important as well as for other passes which need an accurate count of 
program size. There will be other similar putbacks to this for various
instructions.

llvm-svn: 174747

b9bf8dca

DAGCombiner: Constant folding around pre-increment loads/stores · 2581905f

Hal Finkel authored Feb 08, 2013

Previously, even when a pre-increment load or store was generated,
we often needed to keep a copy of the original base register for use
with other offsets. If all of these offsets are constants (including
the offset which was combined into the addressing mode), then this is
clearly unnecessary. This change adjusts these other offsets to use the
new incremented address.

llvm-svn: 174746

2581905f

BBVectorize: Use TTI->getAddressComputationCost · dd272184

Hal Finkel authored Feb 08, 2013

This is a follow-up to the cost-model change in r174713 which splits
the cost of a memory operation between the address computation and the
actual memory access. In r174713, this cost is always added to the
memory operation cost, and so BBVectorize will do the same.

Currently, this new cost function is used only by ARM, and I don't
have any ARM test cases for BBVectorize. Assistance in generating some
good ARM test cases for BBVectorize would be greatly appreciated!

llvm-svn: 174743

dd272184

Revert 172027 and 174336. Remove diagnostics about over-aligned stack objects. · 67bbf3aa

Bob Wilson authored Feb 08, 2013

Aside from the question of whether we report a warning or an error when we
can't satisfy a requested stack object alignment, the current implementation
of this is not good. We're not providing any source location in the diagnostics
and the current warning is not connected to any warning group so you can't
control it. We could improve the source location somewhat, but we can do a
much better job if this check is implemented in the front-end, so let's do that
instead. <rdar://problem/13127907>

llvm-svn: 174741

67bbf3aa

Refine fix to bug 15041. · 62fe7a5b

Bill Schmidt authored Feb 08, 2013

Thanks to help from Nadav and Hal, I have a more reasonable (and even
correct!) approach.  This specifically penalizes the insertelement
and extractelement operations for the performance hit that will occur
on PowerPC processors.

llvm-svn: 174725

62fe7a5b

[SimplifyLibCalls] Library call simplification doen't work if the call site · 22d275f7

Chad Rosier authored Feb 08, 2013

isn't using the default calling convention.  However, if the transformation is
from a call to inline IR, then the calling convention doesn't matter.
rdar://13157990

llvm-svn: 174724

22d275f7

Typos. · 479e5a93
Jakob Stoklund Olesen authored Feb 08, 2013
```
llvm-svn: 174723
```
479e5a93

The patch to fix some issues in r174543 fixed the lines failing the test, but missed a couple · a6016e43

David Tweed authored Feb 08, 2013

of lines which weren't being explicitly looked at and were printing incorrect results. These
values clearly must lie within 32 bits, so the casts are definitely safe.

llvm-svn: 174717

a6016e43

ARM cost model: Address computation in vector mem ops not free · 594fa2dc

Arnold Schwaighofer authored Feb 08, 2013

Adds a function to target transform info to query for the cost of address
computation. The cost model analysis pass now also queries this interface.
The code in LoopVectorize adds the cost of address computation as part of the
memory instruction cost calculation. Only there, we know whether the instruction
will be scalarized or not.
Increase the penality for inserting in to D registers on swift. This becomes
necessary because we now always assume that address computation has a cost and
three is a closer value to the architecture.

radar://13097204

llvm-svn: 174713

594fa2dc

Test Commit · f63b77be
Michael Kuperstein authored Feb 08, 2013
```
llvm-svn: 174709
```
f63b77be

Parse the attribute group reference on a function. · b32b0411

Bill Wendling authored Feb 08, 2013

Attribute references are of this form:

  define void @foo() #0 #1 #2 { ... }

Parse them for function attributes. If there's more than one reference, then
they are merged together.

llvm-svn: 174697

b32b0411

When Mips16 frames grow large, the immediate field may exceed the maximum · 66165c8f

Reed Kotler authored Feb 08, 2013

allowed size for the instruction. This code uses RegScavenger to fix this.
We sometimes need 2 registers for Mips16 so we must handle things
differently than how register scavenger is normally used.

llvm-svn: 174696

66165c8f

Revert "Have InstCombine call SipmlifyCall when handling calls. Test case included." · 1bd53c36

Andrew Trick authored Feb 08, 2013

This reverts commit 3854a5d90fee52af1065edbed34521fff6cdc18d.

This causes a clang unit test to hang: vtable-available-externally.cpp.

llvm-svn: 174692

1bd53c36

Use ParseFnAttributeValuePairs instead of ParseOptionalFuncAttrs · 8b0321da

Bill Wendling authored Feb 08, 2013

The functionality of ParseOptionalFuncAttrs was there in
ParseFnAttributeValuePairs. So just use that instead.

llvm-svn: 174686

8b0321da

Have InstCombine call SipmlifyCall when handling calls. Test case included. · 6092dc54
Michael Ilseman authored Feb 07, 2013
```
llvm-svn: 174675
```
6092dc54

Feb 07, 2013

fix 80-col violation and fix the docs. · a9100f36
Nadav Rotem authored Feb 07, 2013
```
llvm-svn: 174671
```
a9100f36
[mips] Make Filler a class and reduce indentation. · a0612815
Akira Hatanaka authored Feb 07, 2013
```
llvm-svn: 174666
```
a0612815
Formatting. · 9006156a
Eric Christopher authored Feb 07, 2013
```
llvm-svn: 174664
```
9006156a

"Clean up" line section symbol emission by emitting the section · 7480433d

Eric Christopher authored Feb 07, 2013

syms before constructing the compile units so we're not emitting
section references to sections not there already.

llvm-svn: 174663

7480433d

[patch] bug 15055 Add Unistd.h to OProfileWrapper.cpp · 02cb6f9e

Will Schmidt authored Feb 07, 2013

Add #include <unistd.h> to OProfileWrapper.cpp. This provides the declarations for 'read' and 'close' that are otherwise missing, and result in 'error: <foo> was not declared in this scope'.

This matches the issue as reported in bug 15055 "Can no longer compile LLVM with --with-oprofile"

llvm-svn: 174661

02cb6f9e

Constrain PowerPC autovectorization to fix bug 15041. · b3cece13

Bill Schmidt authored Feb 07, 2013

Certain vector operations don't vectorize well with the current
PowerPC implementation.  Element insert/extract performs poorly
without VSX support because Altivec requires going through memory.
SREM, UREM, and VSELECT all produce bad scalar code.

There's a lot of work to do for the cost model before
autovectorization will be tuned well, and this is not an attempt to
address the larger problem.

llvm-svn: 174660

b3cece13

[mips] Add definition of JALR instruction which has two register operands. Change the · 061d1ea5
Akira Hatanaka authored Feb 07, 2013
```
original JALR instruction with one register operand to be a pseudo-instruction.

llvm-svn: 174657
```
061d1ea5

R600/SI: cleanup VGPR encoding · 1c822a89

Tom Stellard authored Feb 07, 2013



Remove all the unused code.

Patch by: Christian König

Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 174656

1c822a89

R600/SI: Handle VGPR64 destination in copyPhysReg(). · aac1889a

Tom Stellard authored Feb 07, 2013



Allows nexuiz to run with radeonsi.

Patch by: Michel Dänzer

Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 174655

aac1889a

R600/SI: Add pattern for mul. · ecacb801

Tom Stellard authored Feb 07, 2013



20 more little piglits with radeonsi.

Patch by: Michel Dänzer

Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 174654

ecacb801

R600/SI: simplify and fix SMRD encoding · 8909380e

Tom Stellard authored Feb 07, 2013



The _SGPR variants where wrong.

Patch by: Christian König

Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 174653

8909380e

R600/SI: add proper 64bit immediate support v2 · 26075d58

Tom Stellard authored Feb 07, 2013



v2: rebased on current upstream

Patch by: Christian König

Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 174652

26075d58

R600: Add an explicit default processor · 4ded0c1c

Tom Stellard authored Feb 07, 2013

This is for the case when no processor is passed to the backend.  This
prevents the

'' is not a recognized processor for this target (ignoring processor)

warning from being generated by clang.

llvm-svn: 174651

4ded0c1c

Identify and simplify idempotent intrinsics. Test case included. · 5485729b
Michael Ilseman authored Feb 07, 2013
```
llvm-svn: 174650
```
5485729b

Loop Vectorizer: Refactor Memory Cost Computation · 3476fc8c

Arnold Schwaighofer authored Feb 07, 2013

We don't want too many classes in a pass and the classes obscure the details. I
was going a little overboard with object modeling here. Replace classes by
generic code that handles both loads and stores.

No functionality change intended.

llvm-svn: 174646

3476fc8c

R600/SI: Use proper instructions for array/shadow samplers. · 462516b7

Tom Stellard authored Feb 07, 2013



Patch by: Michel Dänzer

Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 174634

462516b7

R600/SI: Make sample intrinsic address parameter type overloaded. · ae6c06e5

Tom Stellard authored Feb 07, 2013



Handle vectors of 1 to 16 integers.

Change the intrinsic names to prevent the wrong one from being selected at
runtime due to the overloading.

Patch By: Michel Dänzer

Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 174633

ae6c06e5

R600/SI: Add basic support for more integer vector types. · 538ceeb6

Tom Stellard authored Feb 07, 2013



v1i32, v2i32, v8i32 and v16i32.

Only add VGPR register classes for integer vector types, to avoid attempts
copying from VGPR to SGPR registers, which is not possible.

Patch By: Michel Dänzer

Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 174632

538ceeb6

ARM cost model: Add costs for vector selects · 213fced7

Arnold Schwaighofer authored Feb 07, 2013

Vector selects are cheap on NEON. They get lowered to a vbsl instruction.

radar://13158753

llvm-svn: 174631

213fced7

R600/SI: Add pattern for flog2 · 349cabed

Michel Danzer authored Feb 07, 2013



22 more little piglits with radeonsi.

Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 174615

349cabed

FDE::dumpHeader(): Forgot to fix one more formatting, ... take two! · 14727d7c
NAKAMURA Takumi authored Feb 07, 2013
```
Excuse me, I could not test it locally.

llvm-svn: 174614
```
14727d7c

R600: Consolidate sub register indices. · 9355b221

Tom Stellard authored Feb 07, 2013



Use sub0-15 everywhere.

Patch by: Michel Dänzerr

Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 174610

9355b221

R600: Add support for SET*_DX10 instructions · e06163a9

Tom Stellard authored Feb 07, 2013

These instructions compare two floating point values and return an
integer true (-1) or false (0) value.

When compiling code generated by the Mesa GLSL frontend, the SET*_DX10
instructions save us four instructions for most branch decisions that
use floating-point comparisons.

llvm-svn: 174609

e06163a9

R600: Fix assembly name for SETGT_INT · b40ada9b
Tom Stellard authored Feb 07, 2013
```
llvm-svn: 174607
```
b40ada9b
FDE::dumpHeader(): Forgot to fix one more formatting. It affected bigendian hosts. · 94651f9d
NAKAMURA Takumi authored Feb 07, 2013
```
llvm-svn: 174602
```
94651f9d