Commits · 917e07f095449e981699569559bf6ed5346d7e64 · Roger Ferrer / llvm-epi-0.8

Apr 02, 2013

Jakob Stoklund Olesen authored Apr 02, 2013

SPARC v9 extends all ALU instructions to 64 bits, so we simply need to
add patterns to use them for both i32 and i64 values.

llvm-svn: 178527

917e07f0

Materialize 64-bit immediates. · bddb20ee

Jakob Stoklund Olesen authored Apr 02, 2013

The last resort pattern produces 6 instructions, and there are still
opportunities for materializing some immediates in fewer instructions.

llvm-svn: 178526

bddb20ee
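
The log above does not show the last-resort pattern itself. As a rough illustration of why a general 64-bit immediate takes six instructions, the sketch below assumes the conventional SPARC v9 split of a constant into 22-bit and 10-bit fields (the `%hh`, `%hm`, `%hi`, `%lo` assembler operators). The names `Imm64Parts`, `splitImm64`, and `materializeImm64` are hypothetical and are not taken from the commit.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper type: the four fields of a 64-bit constant, assuming
// the conventional SPARC v9 22/10/22/10-bit split.
struct Imm64Parts {
  uint32_t hh22; // bits 63..42
  uint32_t hm10; // bits 41..32
  uint32_t hi22; // bits 31..10
  uint32_t lo10; // bits 9..0
};

inline Imm64Parts splitImm64(uint64_t v) {
  return {static_cast<uint32_t>(v >> 42),
          static_cast<uint32_t>((v >> 32) & 0x3ff),
          static_cast<uint32_t>((v >> 10) & 0x3fffff),
          static_cast<uint32_t>(v & 0x3ff)};
}

// Each statement stands for one instruction of an assumed six-instruction
// sequence: sethi, or, sllx, sethi, or, or.
inline uint64_t materializeImm64(const Imm64Parts &p) {
  assert(p.hh22 < (1u << 22) && p.hi22 < (1u << 22));
  assert(p.hm10 < (1u << 10) && p.lo10 < (1u << 10));
  uint64_t hi = static_cast<uint64_t>(p.hh22) << 10; // sethi: upper 22 bits
  hi |= p.hm10;                                      // or:    next 10 bits
  hi <<= 32;                                         // sllx:  move to the high word
  uint64_t lo = static_cast<uint64_t>(p.hi22) << 10; // sethi: bits 31..10
  lo |= p.lo10;                                      // or:    bits 9..0
  return hi | lo;                                    // or:    combine both halves
}
```

Under this model, a constant whose upper fields are all zero needs only the last two field-setting steps, which is the kind of shorter sequence the message refers to.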

Add 64-bit shift instructions. · c1d1a481

Jakob Stoklund Olesen authored Apr 02, 2013

SPARC v9 defines new 64-bit shift instructions. The 32-bit shift right
instructions are still usable as zero and sign extensions.

This adds new F3_Sr and F3_Si instruction formats that probably should
be used for the 32-bit shifts as well. They don't really encode an
simm13 field.

llvm-svn: 178525

c1d1a481
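
To illustrate the remark that the 32-bit shift right instructions remain usable as zero and sign extensions, here is a small C++ model of their effect on a 64-bit register. The function names `srl32` and `sra32` are made up for this sketch, which only models the arithmetic and is not code from the backend. A shift amount of 0 gives the pure extension.

```cpp
#include <cassert>
#include <cstdint>

// Model of a 32-bit logical shift right on a 64-bit register: only the low
// word takes part, and the upper 32 bits of the result are zero.
inline uint64_t srl32(uint64_t reg, unsigned amt) {
  assert(amt < 32);
  return static_cast<uint32_t>(reg) >> amt;
}

// Model of a 32-bit arithmetic shift right: bit 31 of the low word is
// replicated into every bit above the shifted value.
inline uint64_t sra32(uint64_t reg, unsigned amt) {
  assert(amt < 32);
  uint64_t low = static_cast<uint32_t>(reg);
  uint64_t result = low >> amt;
  if (low & 0x80000000u)
    result |= ~uint64_t{0} << (32 - amt); // fill with copies of the sign bit
  return result;
}
```

With `amt == 0`, `srl32` behaves as a zero extension of the low word and `sra32` as a sign extension, regardless of what the upper 32 bits held before.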

Add predicates for distinguishing 32-bit and 64-bit modes. · 739d722e

Jakob Stoklund Olesen authored Apr 02, 2013

The 'sparc' architecture produces 32-bit code while 'sparcv9' produces
64-bit code.

It is also possible to run 32-bit code using SPARC v9 instructions with:

  llc -march=sparc -mattr=+v9

llvm-svn: 178524

739d722e

Add support for 64-bit calling convention. · 0b21f35a

Jakob Stoklund Olesen authored Apr 02, 2013

This is far from complete, but it is enough to make it possible to write
test cases using i64 arguments.

Missing features:
- Floating point arguments.
- Receiving arguments on the stack.
- Calls.

llvm-svn: 178523

0b21f35a

Add an I64Regs register class for 64-bit registers. · 5ad3b353

Jakob Stoklund Olesen authored Apr 02, 2013

We are going to use the same registers for 32-bit and 64-bit values, but
in two different register classes. The I64Regs register class has a
larger spill size and alignment.

The addition of an i64 register class confuses TableGen's type
inference, so it is necessary to clarify the type of some immediates and
the G0 register.

In 64-bit mode, pointers are i64 and should use the I64Regs register
class. Implement getPointerRegClass() to dynamically provide the pointer
register class depending on the subtarget. Use ptr_rc and iPTR for
memory operands.

Finally, add the i64 type to the IntRegs register class. This register
class is not used to hold i64 values; I64Regs is for that. The type is
required to appease TableGen's type checking in output patterns like this:

  def : Pat<(add i64:$a, i64:$b), (ADDrr $a, $b)>;

SPARC v9 uses the same ADDrr instruction for i32 and i64 additions, and
TableGen doesn't know to check the type of register sub-classes.

llvm-svn: 178522

5ad3b353

Fix typo in PPCISelLowering · 93d75ea0

Hal Finkel authored Apr 02, 2013

Thanks to Bill Schmidt for finding this in review of r178480.

llvm-svn: 178521

93d75ea0

The divide unit is not pipelined, but it is still buffered. · e1d88cfb

Andrew Trick authored Apr 02, 2013

Buffered means a later divide may be executed out-of-order while a
prior divide is sitting (buffered) in a reservation station.

You can tell it's not pipelined, because operations that use it
reserve it for more than one cycle:

  def : WriteRes<WriteIDiv, [HWPort0, HWDivider]> {
    let Latency = 25;
    let ResourceCycles = [1, 10];
  }

We don't currently distinguish between an unpipelined operation and one
that is split into multiple micro-ops requiring the same unit, except
that the latter may have NumMicroOps > 1 if they also consume
issue/dispatch resources.

llvm-svn: 178519

e1d88cfb
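
The effect of reserving the divider for several cycles can be seen in a toy model. The sketch below is not the scheduler's algorithm. It assumes a single divider, independent divides that are all ready at cycle 0, and it ignores the issue port. The function name `divideCompletionCycles` is hypothetical.

```cpp
#include <cassert>
#include <vector>

// Toy model: returns the cycle at which each of `count` independent divides
// completes when the divider stays reserved for `reservedCycles` per divide.
inline std::vector<unsigned> divideCompletionCycles(unsigned count,
                                                    unsigned latency,
                                                    unsigned reservedCycles) {
  assert(reservedCycles >= 1);
  std::vector<unsigned> done;
  unsigned issue = 0; // earliest cycle at which the divider is free again
  for (unsigned i = 0; i < count; ++i) {
    done.push_back(issue + latency);
    issue += reservedCycles; // the next divide waits for the reservation to end
  }
  return done;
}
```

With the values from the definition above (latency 25, divider reserved for 10 cycles), three divides complete at cycles 25, 35, and 45 in this model. A fully pipelined unit (`reservedCycles == 1`) would give 25, 26, and 27.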

Target/R600: Fix CMake build to add missing files. · fd98f7f2
NAKAMURA Takumi authored Apr 01, 2013
```
llvm-svn: 178508
```
fd98f7f2

Apr 01, 2013

R600: Add support for native control flow · bfaa63a6
Vincent Lejeune authored Apr 01, 2013
```
llvm-svn: 178505
```
bfaa63a6
R600/SI: Share code recording ShaderTypeAttribute between generations · ace6f735
Vincent Lejeune authored Apr 01, 2013
```
llvm-svn: 178504
```
ace6f735
R600: Emit CF_ALU and use true kcache register. · f43bc57b
Vincent Lejeune authored Apr 01, 2013
```
llvm-svn: 178503
```
f43bc57b
Fix a bad assert in PPCTargetLowering · 3f88d089
Hal Finkel authored Apr 01, 2013
```
llvm-svn: 178489
```
3f88d089

Add more PPC floating-point conversion instructions · f6d45f23

Hal Finkel authored Apr 01, 2013

The P7 and A2 have additional floating-point conversion instructions which
allow a direct two-instruction sequence (plus load/store) to convert from all
combinations (signed/unsigned i32/i64) <--> (float/double) (on previous cores,
only some combinations were directly available).

llvm-svn: 178480

f6d45f23

Use ImmToIdxMap.count in PPCRegisterInfo · 39caf9f5

Hal Finkel authored Apr 01, 2013

Code improvement suggested by Jakob (in review of r178450). No functionality
change intended.

llvm-svn: 178473

39caf9f5

Add the PPC popcntw instruction · 290376dd

Hal Finkel authored Apr 01, 2013

The popcntw instruction is available whenever the popcntd instruction is
available, and performs a separate popcnt on the lower and upper 32 bits.
Ignoring the high-order count, this can be used for the 32-bit input case
(saving on the explicit zero extension otherwise required to use popcntd).

llvm-svn: 178470

290376dd
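
A small C++ model may help show why the high-order count can simply be ignored. It assumes, as the description above states, that popcntw counts each 32-bit word separately and leaves each count in the corresponding half of the result. The names `popcount32`, `popcntwModel`, and `ctpop32` are hypothetical and are not part of the backend.

```cpp
#include <cassert>
#include <cstdint>

// Portable population count for one 32-bit word.
inline uint32_t popcount32(uint32_t x) {
  uint32_t n = 0;
  for (; x != 0; x &= x - 1) // clear the lowest set bit each iteration
    ++n;
  return n;
}

// Model of popcntw: one count per word, each stored in its own half.
inline uint64_t popcntwModel(uint64_t reg) {
  uint64_t lowCount = popcount32(static_cast<uint32_t>(reg));
  uint64_t highCount = popcount32(static_cast<uint32_t>(reg >> 32));
  return (highCount << 32) | lowCount;
}

// 32-bit popcount of a value whose upper register bits are unknown: the
// upper bits only affect the high-order count, which is discarded.
inline uint32_t ctpop32(uint64_t regWithUnknownHighBits) {
  uint32_t count = static_cast<uint32_t>(popcntwModel(regWithUnknownHighBits));
  assert(count <= 32);
  return count;
}
```

In this model no zero extension is needed before the count, because garbage in the upper 32 bits never reaches the low-order result.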

Treat PPCISD::STFIWX like the memory opcode that it is · 60c75107

Hal Finkel authored Apr 01, 2013

PPCISD::STFIWX is really a memory opcode, and so it should come after
FIRST_TARGET_MEMORY_OPCODE, and we should use DAG.getMemIntrinsicNode to create
nodes using it.

No functionality change intended (although there could be optimization benefits
from preserving the MMO information).

llvm-svn: 178468

60c75107

Remove unused typedef. · fee96f83
Duncan Sands authored Apr 01, 2013
```
llvm-svn: 178462
```
fee96f83

ARM Scheduler Model: Add resources instructions, map resources in subtargets · 6793aebb

Arnold Schwaighofer authored Apr 01, 2013

Reapply r177968:
After commit 178074 we can now have undefined scheduler variants.

Move the CortexA9 resources into the CortexA9 SchedModel namespace. Define
resource mappings under the CortexA9 SchedModel. Define resources and mappings
for the SwiftModel.

Incorporate Andrew's feedback.

llvm-svn: 178460

6793aebb

X86TTI: Add accurate costs for itofp operations, based on the actual instruction counts. · 52ceb443
Benjamin Kramer authored Apr 01, 2013
```
llvm-svn: 178459
```
52ceb443

Mar 31, 2013

R600: Emit native instructions for tex · 53f3525d
Vincent Lejeune authored Mar 31, 2013
```
llvm-svn: 178452
```
53f3525d
There is no longer any need to silence this compiler warning as the warning has · e1aa194a
Duncan Sands authored Mar 31, 2013
```
been turned off globally.

llvm-svn: 178451
```
e1aa194a

Cleanup ImmToIdxMap and noImmForm in PPCRegisterInfo · 8540f777

Hal Finkel authored Mar 31, 2013

ImmToIdxMap should be a DenseMap (not a std::map) because there
is no ordering requirement. Also, we don't need a separate list
of instructions for noImmForm in eliminateFrameIndex, because this
list is essentially the complement of the keys in ImmToIdxMap.

No functionality change intended.

llvm-svn: 178450

8540f777
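
The idea that the noImmForm list is the complement of the map's keys can be sketched as follows. This is an illustration only: `std::unordered_map` stands in for LLVM's `DenseMap`, the opcode enum is a hypothetical stand-in for the real PPC opcodes, and the helper names are made up.

```cpp
#include <cassert>
#include <unordered_map>

// Hypothetical opcodes: immediate-form instructions and their indexed forms.
enum Opcode { LBZ, LBZX, LWZ, LWZX, STW, STWX };

// Maps each immediate-form opcode to its indexed-form counterpart. No
// ordering is needed, so a hash map is sufficient.
inline const std::unordered_map<int, int> &immToIdxMap() {
  static const std::unordered_map<int, int> map = {
      {LBZ, LBZX}, {LWZ, LWZX}, {STW, STWX}};
  return map;
}

// An opcode has no immediate form exactly when it is not a key in the map,
// so no separate list of such opcodes has to be maintained.
inline bool noImmForm(int opcode) { return immToIdxMap().count(opcode) == 0; }

// Looks up the indexed form; only valid for opcodes that are keys.
inline int indexedForm(int opcode) {
  assert(!noImmForm(opcode));
  return immToIdxMap().at(opcode);
}
```

A membership test with `count` also avoids inserting a default entry, which `operator[]` would do on a miss.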

X86: Promote sitofp <8 x i16> to <8 x i32> when AVX is available. · b60633fb
Benjamin Kramer authored Mar 31, 2013
```
A vector sext + sitofp is a lot cheaper than 8 scalar conversions.

llvm-svn: 178448
```
b60633fb

Add the PPC lfiwax instruction · beb296be

Hal Finkel authored Mar 31, 2013

This instruction is available on modern PPC64 CPUs, and is now used
to improve the SINT_TO_FP lowering (by eliminating the need for the
separate sign extension instruction and decreasing the amount of
needed stack space).

llvm-svn: 178446

beb296be

Cleanup PPC(64) i32 -> float/double conversion · e53429a1

Hal Finkel authored Mar 31, 2013

The existing SINT_TO_FP code for i32 -> float/double conversion was disabled
because it relied on broken EXTSW_32/STD_32 instruction definitions. The
original intent had been to enable these 64-bit instructions to be used on CPUs
that support them even in 32-bit mode.  Unfortunately, this form of lying to
the infrastructure was buggy (as explained in the FIXME comment) and had
therefore been disabled.

This re-enables this functionality, using regular DAG nodes, but only when
compiling in 64-bit mode. The old STD_32/EXTSW_32 definitions (which were dead)
are removed.

llvm-svn: 178438

e53429a1

Mar 30, 2013
- Change '@SECREL' suffix to GAS-compatible '@SECREL32'. · 9c9e0a2c
  Benjamin Kramer authored Mar 30, 2013
```
'@SECREL' is what is used by the Microsoft assembler, but GNU as expects '@SECREL32'.
With the patch, the MC-generated code works fine in combination with a recent GNU as (2.23.51.20120920 here).

Patch by David Nadlinger!
Differential Revision: http://llvm-reviews.chandlerc.com/D429

llvm-svn: 178427
```
  9c9e0a2c
- [NVPTX] Remove support for SM < 2.0. This was never fully supported anyway. · 59fd8ba5
  Justin Holewinski authored Mar 30, 2013
```
llvm-svn: 178417
```
  59fd8ba5
- [NVPTX] Add NVVMReflect pass to allow compile-time selection of · b94bd05b
  Justin Holewinski authored Mar 30, 2013
```
specific code paths.

This allows us to write code like:

  if (__nvvm_reflect("FOO"))
    // Do something
  else
    // Do something else

and compile into a library, then give "FOO" a value at kernel
compile-time so the check becomes a no-op.

llvm-svn: 178416
```
  b94bd05b
- [NVPTX] Run clang-format on all NVPTX sources. · 0497ab14
  Justin Holewinski authored Mar 30, 2013
```
Hopefully this resolves any outstanding style issues and gives us
an automated way of ensuring we conform to the style guidelines.

llvm-svn: 178415
```
  0497ab14
- [mips] Add patterns for DSP indexed load instructions. · b3c1847b
  Akira Hatanaka authored Mar 30, 2013
```
llvm-svn: 178408
```
  b3c1847b
- [mips] Define reg+imm load/store pattern templates. · b1457304
  Akira Hatanaka authored Mar 30, 2013
```
llvm-svn: 178407
```
  b1457304
- [mips] Fix DSP instructions to have explicit accumulator register operands. · fb221c19
  Akira Hatanaka authored Mar 30, 2013
```
Check that instruction selection can select multiply-add/sub DSP instructions
from a pattern that doesn't have intrinsics.

llvm-svn: 178406
```
  fb221c19
- Remove unused variables. · 33c06048
  Akira Hatanaka authored Mar 30, 2013
```
llvm-svn: 178405
```
  33c06048
- [mips] Move the code which does dag-combine for multiply-add/sub nodes to · 9efcd76c
  Akira Hatanaka authored Mar 30, 2013
```
derived class MipsSETargetLowering.

We shouldn't be generating madd/msub nodes if target is Mips16, since Mips16
doesn't have support for multiply-add/sub instructions.

llvm-svn: 178404
```
  9efcd76c
- [mips] Fix definitions of multiply, multiply-add/sub and divide instructions. · be8612f6
  Akira Hatanaka authored Mar 30, 2013
```
The new instructions have explicit register output operands and use table-gen
patterns instead of C++ code to do instruction selection.

Mips16's instructions are unaffected by this change.

llvm-svn: 178403
```
  be8612f6
- [mips] Remove function getFPBranchCodeFromCond. Rename invertFPCondCodeAdd. · f0ea500c
  Akira Hatanaka authored Mar 30, 2013
```
llvm-svn: 178396
```
  f0ea500c
- Fix indentation. · d5a0e096
  Akira Hatanaka authored Mar 30, 2013
```
llvm-svn: 178395
```
  d5a0e096
- [mips] Add mips-specific nodes which will be used to select multiply and divide · 28721bd7
  Akira Hatanaka authored Mar 30, 2013
```
instructions.

llvm-svn: 178394
```
  28721bd7
- [mips] Implement getRepRegClassFor in MipsSETargetLowering. This function is · 3a34d147
  Akira Hatanaka authored Mar 30, 2013
```
called in several places in ScheduleDAGRRList.cpp.

llvm-svn: 178393
```
  3a34d147