Commits · 13e6ccf341145080ae7cc94c9148368ed3f11402 · Roger Ferrer / llvm-epi-0.8

Aug 07, 2013

[mips] Rename register classes CPURegs and CPU64Regs. · 13e6ccf3
Akira Hatanaka authored Aug 06, 2013
```
llvm-svn: 187832
```
13e6ccf3

R600/SI: Use VSrc_* register classes as the default classes for types · 2f7cdda5

Tom Stellard authored Aug 06, 2013

Since the VSrc_* register classes contain both VGPRs and SGPRs, copies
that used be emitted by isel like this:

SGPR = COPY VGPR

Will now be emitted like this:

VSrC = COPY VGPR

This patch also adds a pass that tries to identify and fix situations where
a VGPR to SGPR copy may occur.  Hopefully, these changes will make it
impossible for the compiler to generate illegal VGPR to SGPR copies.

llvm-svn: 187831

2f7cdda5

R600/SI: Add more special cases for opcodes to ensureSRegLimit() · 4c0ffccb
Tom Stellard authored Aug 06, 2013
```
Also factor out the register class lookup to its own function.

llvm-svn: 187830
```
4c0ffccb

[NVPTX] We dont have any target specific flags yet for generating symbol... · 8b24e1e4

Justin Holewinski authored Aug 06, 2013

[NVPTX] We dont have any target specific flags yet for generating symbol references, so get rid of the default-only switch statement.  Fixes an MSVC warning.

llvm-svn: 187829

8b24e1e4

[mips] Mark instructions defined in Mips64InstrInfo.td that are duplicates of · c7e3998e
Akira Hatanaka authored Aug 06, 2013
```
instructions defined in MipsInstrInfo.td as codegen-only instructions.

llvm-svn: 187828
```
c7e3998e
[mips] Delete unnecessary InstAliases. Also, clear some of the InstAlias' · e2a39e75
Akira Hatanaka authored Aug 06, 2013
```
EmitAlias flag and have MipsInstPrinter::printAlias print the aliases.

llvm-svn: 187824
```
e2a39e75

[mips] Replace usages of register classes with register operands. Also, remove · 34a32c0b

Akira Hatanaka authored Aug 06, 2013

unnecessary jalr InstAliases in Mips64InstrInfo.td and add the code to print
jalr InstAliases in MipsInstPrinter::printAlias.

llvm-svn: 187821

34a32c0b

Aug 06, 2013

Add PPC64 mulli pattern · 11b9e452

Hal Finkel authored Aug 06, 2013

The PPC backend had been missing a pattern to generate mulli for 64-bit
multiples. We had been generating it only for 32-bit multiplies. Unfortunately,
generating li + mulld unnecessarily increases register pressure.

llvm-svn: 187807

11b9e452

This corrects creation of operands for t2PLDW. It also removes the definition of t2PLDWpci, · c34bf73e
Mihai Popa authored Aug 06, 2013
```
as pldw does not have a literal variant (i.e. pc relative version)

llvm-svn: 187804
```
c34bf73e
Support APSR_nzcv as operand for Thumb2 mrc. Deprecate pre-UAL syntax (pc instead of apsr_nzcv) · 8f49a45c
Mihai Popa authored Aug 06, 2013
```
llvm-svn: 187803
```
8f49a45c
[NVPTX] Add missing patterns for i1 [s,u]int_to_fp · debe686f
Justin Holewinski authored Aug 06, 2013
```
llvm-svn: 187800
```
debe686f

[NVPTX] Fix bug in stack code generation causes by MC conversion · 871ec939

Justin Holewinski authored Aug 06, 2013

We do use a very small set of physical registers, so account for
them in the virtual register encoding between MachineInstr and MC

llvm-svn: 187799

871ec939

[NVPTX] Start conversion to MC infrastructure · a2a63d28

Justin Holewinski authored Aug 06, 2013

This change converts the NVPTX target to use the MC infrastructure
instead of directly emitting MachineInstr instances. This brings
the target more up-to-date with LLVM TOT, and should fix PR15175
and PR15958 (libNVPTXInstPrinter is empty) as a side-effect.

llvm-svn: 187798

a2a63d28

ARM: implement allowTruncateForTailCall · cc2e903b

Tim Northover authored Aug 06, 2013

Now that it's in place, it seems silly not to let ARM make use of the extra
tail call opportunities.

llvm-svn: 187795

cc2e903b

Refactor isInTailCallPosition handling · a4415854

Tim Northover authored Aug 06, 2013

This change came about primarily because of two issues in the existing code.
Niether of:

define i64 @test1(i64 %val) {
  %in = trunc i64 %val to i32
  tail call i32 @ret32(i32 returned %in)
  ret i64 %val
}

define i64 @test2(i64 %val) {
  tail call i32 @ret32(i32 returned undef)
  ret i32 42
}

should be tail calls, and the function sameNoopInput is responsible. The main
problem is that it is completely symmetric in the "tail call" and "ret" value,
but in reality different things are allowed on each side.

For these cases:
1. Any truncation should lead to a larger value being generated by "tail call"
   than needed by "ret".
2. Undef should only be allowed as a source for ret, not as a result of the
   call.

Along the way I noticed that a mismatch between what this function treats as a
valid truncation and what the backends see can lead to invalid calls as well
(see x86-32 test case).

This patch refactors the code so that instead of being based primarily on
values which it recurses into when necessary, it starts by inspecting the type
and considers each fundamental slot that the backend will see in turn. For
example, given a pathological function that returned {{}, {{}, i32, {}}, i32}
we would consider each "real" i32 in turn, and ask if it passes through
unchanged. This is much closer to what the backend sees as a result of
ComputeValueVTs.

Aside from the bug fixes, this eliminates the recursion that's going on and, I
believe, makes the bulk of the code significantly easier to understand. The
trade-off is the nasty iterators needed to find the real types inside a
returned value.

llvm-svn: 187787

a4415854

Simplify vector lane handling math a bit. No functional change intended. · cf969ead
Craig Topper authored Aug 06, 2013
```
llvm-svn: 187783
```
cf969ead
Simplify math a little bit. · 7418ff46
Craig Topper authored Aug 06, 2013
```
llvm-svn: 187781
```
7418ff46

Target/*/CMakeLists.txt: Add the dependency to CommonTableGen explicitly for... · aaf66c73

NAKAMURA Takumi authored Aug 06, 2013

Target/*/CMakeLists.txt: Add the dependency to CommonTableGen explicitly for each corresponding CodeGen.

Without explicit dependencies, both per-file action and in-CommonTableGen action could run in parallel.
It races to emit *.inc files simultaneously.

llvm-svn: 187780

aaf66c73

Replace EVT with MVT in isHorizontalBinOp as it is only called with legal types. · 9bc00b65
Craig Topper authored Aug 06, 2013
```
llvm-svn: 187779
```
9bc00b65
Simplify code slightly. No functional change. · 47d7c5c8
Craig Topper authored Aug 06, 2013
```
llvm-svn: 187771
```
47d7c5c8
Factor FlattenCFG out from SimplifyCFG · aa664d9b
Tom Stellard authored Aug 06, 2013
```
Patch by: Mei Ye

llvm-svn: 187764
```
aa664d9b

R600: Implement TargetLowering::getVectorIdxTy() · 28d06de6

Tom Stellard authored Aug 05, 2013

We use MVT::i32 for the vector index type, because we use 32-bit
operations to caculate offsets when dynamically indexing vectors.

llvm-svn: 187749

28d06de6

Aug 05, 2013

Silencing an MSVC11 type conversion warning. · 5b463457
Aaron Ballman authored Aug 05, 2013
```
llvm-svn: 187727
```
5b463457

[SystemZ] Use BRCT and BRCTG to eliminate add-&-compare sequences · c212125d

Richard Sandiford authored Aug 05, 2013

This patch just uses a peephole test for "add; compare; branch" sequences
within a single block. The IR optimizers already convert loops to
decrement-and-branch-on-nonzero form in some cases, so even this
simplistic test triggers many times during a clang bootstrap and
projects/test-suite run. It looks like there are still cases where we
need to more strongly prefer branches on nonzero though. E.g. I saw a
case where a loop that started out with a check for 0 ended up with a
check for -1. I'll try to look at that sometime.

I ended up adding the Reference class because MachineInstr::readsRegister()
doesn't check for subregisters (by design, as far as I could tell).

llvm-svn: 187723

c212125d

[SystemZ] Add definitions for BRCT and BRCTG · 9795d8e6
Richard Sandiford authored Aug 05, 2013
```
llvm-svn: 187721
```
9795d8e6
[SystemZ] Use LOAD AND TEST to eliminate comparisons against zero · b49a3ab2
Richard Sandiford authored Aug 05, 2013
```
llvm-svn: 187720
```
b49a3ab2
[SystemZ] Add LOAD AND TEST instructions · c62c64a0
Richard Sandiford authored Aug 05, 2013
```
Just the definitions and MC support.  The next patch uses them for codegen.

llvm-svn: 187719
```
c62c64a0

[SystemZ] Split out comparison elimination into a separate pass · bdbb8af7

Richard Sandiford authored Aug 05, 2013

Perhaps predictably, doing comparison elimination on the fly during
SystemZLongBranch turned out to be a bad idea.  The next patches make
use of LOAD AND TEST and BRANCH ON COUNT, both of which require
changes to earlier instructions.

No functionality change intended.

llvm-svn: 187718

bdbb8af7

AVX-512 set: added mask operations, lowering BUILD_VECTOR for i1 vector types. · 40864b69
Elena Demikhovsky authored Aug 05, 2013
```
Added intrinsics and tests.

llvm-svn: 187717
```
40864b69

Add the saving of S2. This is needed for some of the floating point · 9c285b30

Reed Kotler authored Aug 04, 2013

helper functions. This can be optimized out later when the remaining
parts of the helper function work is moved into the Mips16HardFloat pass.
For now it forces us to use the 32 bit save/restore instructions instead
of the 16 bit ones.

llvm-svn: 187712

9c285b30

Aug 04, 2013

X86: Turn fp selects into mask operations. · 5bc180c1

Benjamin Kramer authored Aug 04, 2013

double test(double a, double b, double c, double d) { return a<b ? c : d; }

before:
_test:
	ucomisd	%xmm0, %xmm1
	ja	LBB0_2
	movaps	%xmm3, %xmm2
LBB0_2:
	movaps	%xmm2, %xmm0

after:
_test:
	cmpltsd	%xmm1, %xmm0
	andpd	%xmm0, %xmm2
	andnpd	%xmm3, %xmm0
	orpd	%xmm2, %xmm0

Small speedup on Benchmarks/SmallPT

llvm-svn: 187706

5bc180c1

AVX-512 set: added VEXTRACTPS instruction · cd466917
Elena Demikhovsky authored Aug 04, 2013
```
llvm-svn: 187705
```
cd466917

X86: correct tail return address calculation · ecc018c7

Tim Northover authored Aug 04, 2013

Due to the weird and wondeful usual arithmetic conversions, some
calculations involving negative values were getting performed in
uint32_t and then promoted to int64_t, which is really not a good
idea.

Patch by Katsuhiro Ueno.

llvm-svn: 187703

ecc018c7

Clean up code for Mips16 large frame handling. · 30cedf65
Reed Kotler authored Aug 04, 2013
```
llvm-svn: 187701
```
30cedf65
PPCAsmParser: Stop leaking names. · 72d45cc8
Benjamin Kramer authored Aug 03, 2013
```
Store them in a place that gets cleaned up properly.

llvm-svn: 187700
```
72d45cc8

ARMAsmParser: Plug a leak. · 23632bd4

Benjamin Kramer authored Aug 03, 2013

Using an object to do the cleanup may look like overkill, but it's safer and nicer than putting deletes everywhere.

llvm-svn: 187696

23632bd4

Stop leaking register infos in the disassemblers. · dcfd5b52
Benjamin Kramer authored Aug 03, 2013
```
llvm-svn: 187695
```
dcfd5b52

Aug 03, 2013

Fix PPC64 64-bit GPR inline asm constraint matching · b176acb6

Hal Finkel authored Aug 03, 2013

Internally, the PowerPC backend names the 32-bit GPRs R[0-9]+, and names the
64-bit parent GPRs X[0-9]+. When matching inline assembly constraints with
explicit register names, on PPC64 when an i64 MVT has been requested, we need
to follow gcc's convention of using r[0-9]+ to refer to the 64-bit (parent)
registers.

At some point, we'll probably want to arrange things so that the generic code
in TargetLowering uses the AsmName fields declared in *RegisterInfo.td in order
to match these inline asm register constraints. If we do that, this change can
be reverted.

llvm-svn: 187693

b176acb6

Aug 02, 2013
- Add a missing 'return' statement. · fcf67781
  Joey Gouly authored Aug 02, 2013
```
llvm-svn: 187671
```
  fcf67781
- [mips] Expand vector truncating stores and extending loads. · 7be35cb1
  Akira Hatanaka authored Aug 02, 2013
```
llvm-svn: 187667
```
  7be35cb1