Commits · d0ed730f924ecd1a39b9d979741dc627229d0cf9 · Roger Ferrer / llvm-epi-0.8

Nov 27, 2013
Remove dead argument. · d0ed730f
Rafael Espindola authored Nov 27, 2013
```
llvm-svn: 195806
```
d0ed730f
[AArch64] Add support for NEON scalar floating-point absolute difference. · 75290c63
Chad Rosier authored Nov 27, 2013
```
llvm-svn: 195803
```
75290c63
Nov 26, 2013

[AArch64] Add support for NEON scalar floating-point to integer convert instructions. · 9653d5c9
Chad Rosier authored Nov 26, 2013
```
llvm-svn: 195788
```
9653d5c9
Fix a bug related to constant islands for Mips16 and mips16/32 dual mode. · 3aeb1d08
Reed Kotler authored Nov 26, 2013
```
The determination of when we are doing constant pools was being made too
early in the asm printer.

llvm-svn: 195781
```
3aeb1d08

Fix PR18054 · d617a301

Michael Liao authored Nov 26, 2013

- Fix bug in (vsext (vzext x)) -> (vsext x) in SIGN_EXTEND_IN_REG
  lowering where we need to check whether x is a vector type (in-reg
  type) of i8, i16 or i32; otherwise, that optimization is not valid.

llvm-svn: 195779

d617a301

Darwin-ARM: use movw/movt for static relocations · fa36dfee
Tim Northover authored Nov 26, 2013
```
llvm-svn: 195759
```
fa36dfee

[SystemZ] Fix incorrect use of RISBG for a zero-extended right shift · dd7dd930

Richard Sandiford authored Nov 26, 2013

We would wrongly transform the testcase into the equivalent of an AND with 1.
The problem was that, when testing whether the shifted-in bits of the right
shift were significant, we used the width of the final zero-extended result
rather than the width of the shifted value.

llvm-svn: 195731

dd7dd930
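
To illustrate why the choice of width matters, here is a minimal Python sketch. It is not the SystemZ backend code and not the testcase from the commit. It models a 32-bit logical right shift whose result is zero-extended to 64 bits, implemented as a 64-bit rotate followed by a mask, roughly what a rotate-and-select instruction such as RISBG does. The function names and the sample register value are assumptions made for the example.

```python
MASK64 = (1 << 64) - 1

def reference(reg64, shift):
    """zext to 64 bits of (low 32 bits of reg64, logically shifted right)."""
    return (reg64 & 0xFFFFFFFF) >> shift

def rotate_and_mask(reg64, shift, width):
    """Rotate right by `shift` in a 64-bit register, then keep only the
    bits that a right shift of a `width`-bit value would not shift in."""
    rotated = ((reg64 >> shift) | (reg64 << (64 - shift))) & MASK64
    keep = (1 << (width - shift)) - 1
    return rotated & keep

# The upper half holds unrelated bits; the 32-bit value is 0x80000000.
reg = 0xDEADBEEF80000000
good = rotate_and_mask(reg, 31, 32)  # width of the shifted value
bad = rotate_and_mask(reg, 31, 64)   # width of the zero-extended result
print(hex(reference(reg, 31)), hex(good), hex(bad))
```

With the mask derived from the 64-bit result width, bits from the upper half of the register leak into the answer; with the 32-bit width of the shifted value, the result matches the reference.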

Refactored the implementation of AArch64 NEON instruction ZIP, UZP and TRN. · 599c47d0
Kevin Qin authored Nov 26, 2013
```
Fix a bug when mixing use of vget_high_u8() and vuzp_u8().

llvm-svn: 195716
```
599c47d0
[AArch64]Implement 128 bit register copy with NEON. · 33ca18fd
Kevin Qin authored Nov 26, 2013
```
llvm-svn: 195713
```
33ca18fd

StackMap: Implement support for DirectMemRefOp. · 391dbadb

Andrew Trick authored Nov 26, 2013

A Direct stack map location records the address of a frame index. This
address is itself the value that the runtime requested. This differs
from IndirectMemRefOp locations, which refer to stack locations from
which the requested values must be loaded. Direct locations can
directly communicate the address of an alloca, while IndirectMemRefOp
locations handle register spills.

For example:

entry:
  %a = alloca i64...
  llvm.experimental.stackmap(i32 <ID>, i32 <shadowBytes>, i64* %a)

Since both the alloca and stackmap intrinsic are in the entry block,
and the intrinsic takes the address of the alloca, the runtime can
assume that LLVM will not substitute alloca with any intervening
value. This must be verified by the runtime by checking that the stack
map's location is a Direct location type. The runtime can then
determine the alloca's relative location on the stack immediately after
compilation, or at any time thereafter. This differs from Register and
Indirect locations, because the runtime can only read the values in
those locations when execution reaches the instruction address of the
stack map.

llvm-svn: 195712

391dbadb
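
The difference between the two location kinds can be sketched from the runtime's point of view. This is a simplified Python model, not the stack map binary format: the `Location` class, the `resolve` helper, and the frame base and offsets are all hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Location:
    kind: str    # "Direct" or "Indirect" (hypothetical encoding)
    offset: int  # byte offset from the frame base

def resolve(loc, frame_base, memory):
    """Return the value the runtime asked for at this location."""
    address = frame_base + loc.offset
    if loc.kind == "Direct":
        # The address of the frame slot is itself the requested value,
        # so no memory access is needed.
        return address
    if loc.kind == "Indirect":
        # The requested value was spilled; load it from the slot.
        return memory[address]
    raise ValueError(f"unsupported location kind: {loc.kind}")

frame_base = 0x8000
memory = {0x7FF0: 42}  # pretend stack contents
alloca_addr = resolve(Location("Direct", -16), frame_base, memory)
spilled = resolve(Location("Indirect", -16), frame_base, memory)
print(hex(alloca_addr), spilled)
```

Because the Direct case never reads memory, the offset can be interpreted right after compilation, whereas the Indirect case only yields a meaningful value while the frame is live at the stack map's instruction address.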

whitespace · d3ab37cf
Andrew Trick authored Nov 26, 2013
```
llvm-svn: 195711
```
d3ab37cf
Add an intrinsic for the SSE2 PAUSE instruction. · c592e525
Cameron McInally authored Nov 26, 2013
```
llvm-svn: 195697
```
c592e525

Nov 25, 2013

Do the string comparison in the constructor instead of once per nop. · a834e301
Rafael Espindola authored Nov 25, 2013
```
Thanks to Roman Divacky for the suggestion.

llvm-svn: 195684
```
a834e301

Don't use nopl in cpus that don't support it. · 1b8bfdaa

Rafael Espindola authored Nov 25, 2013

Patch by Mikulas Patocka. I added the test. I checked that for cpu names that
gas knows about, it also doesn't generate nopl.

The modified cpus:
i686 - there are i686-class CPUs that don't have nopl: Via c3, Transmeta
        Crusoe, Microsoft VirtualBox - see
        https://bbs.archlinux.org/viewtopic.php?pid=775414
k6, k6-2, k6-3, winchip-c6, winchip2 - these are 586-class CPUs
via c3 c3-2 - see https://bugs.archlinux.org/task/19733 as a proof that
        Via c3 and c3-Nehemiah don't have nopl

llvm-svn: 195679

1b8bfdaa
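
The two nop-related commits above can be sketched together: decide once, at construction time, whether the target CPU supports nopl, then use that flag for every nop emitted. The Python below is an illustration only. The `NopWriter` class is hypothetical, the CPU set contains just the names listed in the commit message (the real list in LLVM may differ), and the padding strategy is deliberately simplified.

```python
# CPU names taken from the commit message; treat the set as illustrative.
CPUS_WITHOUT_NOPL = frozenset(
    {"i686", "k6", "k6-2", "k6-3", "winchip-c6", "winchip2", "c3", "c3-2"}
)

class NopWriter:
    def __init__(self, cpu):
        # The string comparison happens once here, not once per nop.
        self.has_nopl = cpu not in CPUS_WITHOUT_NOPL

    def write_nops(self, count):
        """Return `count` bytes of nop padding."""
        if not self.has_nopl:
            # Fall back to single-byte nops (0x90) only.
            return bytes([0x90]) * count
        out = bytearray()
        while count >= 3:
            out += bytes([0x0F, 0x1F, 0x00])  # 3-byte nopl encoding
            count -= 3
        out += bytes([0x90]) * count  # pad the remainder
        return bytes(out)

print(NopWriter("k6").write_nops(5).hex())
print(NopWriter("core2").write_nops(5).hex())
```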

Fix indentation typo · d34094e5
Tim Northover authored Nov 25, 2013
```
llvm-svn: 195660
```
d34094e5

ARM: remove special cases for Darwin dynamic-no-pic mode. · db962e2c

Tim Northover authored Nov 25, 2013

These are handled almost identically to static mode (and ELF's global address
materialisation), except that a symbol may have "$non_lazy_ptr" appended. This
can be handled by passing appropriate flags along with the instruction instead
of using entirely separate pseudo-instructions.

llvm-svn: 195655

db962e2c

ARM: remove unused patterns. · dfe2156c

Tim Northover authored Nov 25, 2013

There is no sane way for an LEApcrel (= single ADR) instruction to generate a
global address on any ARM target I know of. Fortunately, no-one was trying to
any more, but there were vestigial patterns.

llvm-svn: 195644

dfe2156c

[ARM] Enable FeatureMP for Cortex-A5 by default. · 34df448f
Amara Emerson authored Nov 25, 2013
```
Patch by Oliver Stannard.

llvm-svn: 195640
```
34df448f
X86: enable AVX2 under Haswell native compilation · 89ccb616
Tim Northover authored Nov 25, 2013
```
Patch by Adam Strzelecki

llvm-svn: 195632
```
89ccb616

Fixed a bug about disassembling AArch64 post-index load/store single element instructions. · fbd2b448

Hao Liu authored Nov 25, 2013

For example, the following two commands were disassembled into the same instruction, st1 {v0b}[0], [x0], x0:
echo "0x00 0x04 0x80 0x0d" | ../bin/llvm-mc -triple=aarch64 -mattr=+neon -disassemble
echo "0x00 0x00 0x80 0x0d" | ../bin/llvm-mc -triple=aarch64 -mattr=+neon -disassemble

llvm-svn: 195591

fbd2b448

SparcFrameLowering.cpp: Prune 'DL' [-Wunused-variable] · edbeaee8
NAKAMURA Takumi authored Nov 25, 2013
```
llvm-svn: 195590
```
edbeaee8

Nov 24, 2013

[Sparc] Emit large negative adjustments to SP/FP with sethi+xor instead of... · 1116868a

Venkatraman Govindaraju authored Nov 24, 2013

[Sparc] Emit large negative adjustments to SP/FP with sethi+xor instead of sethi+or. This generates correct code for both sparc32 and sparc64.

llvm-svn: 195576

1116868a
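
A small Python model shows why sethi+xor works on both register widths while sethi+or does not. It assumes the usual SPARC conventions: sethi writes a 22-bit immediate into bits 31:10 and clears all other bits, and the second instruction takes a sign-extended 13-bit immediate. The high and low parts follow the %hix/%lox relocation operators. The function names are hypothetical and this is not the code from the patch.

```python
MASK64 = (1 << 64) - 1

def to_signed(value, bits):
    """Interpret the low `bits` bits of value as a two's-complement number."""
    value &= (1 << bits) - 1
    return value - (1 << bits) if value >> (bits - 1) else value

def sethi(imm22):
    # Writes imm22 into bits 31:10; every other bit of the register is 0.
    return (imm22 & 0x3FFFFF) << 10

def simm13(imm):
    # Sign-extend a 13-bit immediate to the full 64-bit register width.
    return to_signed(imm, 13) & MASK64

def sethi_or(n):
    # sethi %hi(n) ; or with %lo(n)
    return sethi(n >> 10) | simm13(n & 0x3FF)

def sethi_xor(n):
    # sethi %hix(n) ; xor with %lox(n)
    hix = ~n >> 10
    lox = (n & 0x3FF) | 0x1C00  # bits 12:10 set, so the immediate is negative
    return sethi(hix) ^ simm13(lox)

n = -100000
print(to_signed(sethi_or(n), 32), to_signed(sethi_or(n), 64))
print(to_signed(sethi_xor(n), 32), to_signed(sethi_xor(n), 64))
```

In this model the xor with a negative immediate flips bits 63:10, which both restores bits 31:10 of the constant and sets the upper 32 bits to ones, so the 64-bit register holds the correctly sign-extended negative value.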

[Sparc]: Implement LEA pattern for sparcv9. · 9c338504
Venkatraman Govindaraju authored Nov 24, 2013
```
llvm-svn: 195575
```
9c338504

[SparcV9]: Do not emit .register directives for global registers that are... · f79528c1

Venkatraman Govindaraju authored Nov 24, 2013

[SparcV9]: Do not emit .register directives for global registers that are clobbered by calls but not used in the function itself.

llvm-svn: 195574

f79528c1

[SparcV9] Enable custom lowering of DYNAMIC_STACKALLOC in sparc64. · 0510db05
Venkatraman Govindaraju authored Nov 24, 2013
```
llvm-svn: 195573
```
0510db05

Make sure that for C++ emitting LwConstant32 pseudos, that it corresponds to what is needed for constant islands. · a787aa2b

Reed Kotler authored Nov 24, 2013

The prescan method for Mips16 constant islands will eventually go away.
It is only temporary and should be done earlier, when the instructions
are first created or from the DAG. If we keep it here, we need to better
handle the situation where constant islands is called multiple times,
since we don't want to prescan more than once.

llvm-svn: 195569

a787aa2b

Fix a funny bug I introduced during conversion of ARM constant islands to Mips. · d3b28ebe

Reed Kotler authored Nov 24, 2013

I had to move some code, and I moved a declaration forward past its first use
in the function. By nutty coincidence, there was another variable of the same
name and type, with a completely unrelated function, declared globally
in the class, so no compilation error ensued.
It required some unusual conditions for it to even matter. It caused test
case casts.c in test-suite to fail during compilation with a duplicate
symbol error. I would have noticed it during final code review for this port.

llvm-svn: 195565

d3b28ebe

Nov 23, 2013

R600/SI: Fixing handling of condition codes · c0845334

Tom Stellard authored Nov 22, 2013

We were ignoring the ordered/unordered bits and also the signed/unsigned
bits of condition codes when lowering the DAG to MachineInstrs.

NOTE: This is a candidate for the 3.4 branch.
llvm-svn: 195514

c0845334
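
The two distinctions named in this commit message change the result of a comparison, which is why ignoring them miscompiles code. The Python sketch below models "less than" in its four flavours; the function names mirror LLVM IR predicate spellings but are otherwise hypothetical, and nothing here reflects the R600 backend's actual lowering.

```python
import math

def fcmp_olt(a, b):
    """Ordered less-than: false if either operand is NaN."""
    return not (math.isnan(a) or math.isnan(b)) and a < b

def fcmp_ult(a, b):
    """Unordered less-than: true if either operand is NaN."""
    return math.isnan(a) or math.isnan(b) or a < b

def icmp_ult(a, b, bits=32):
    """Unsigned less-than on `bits`-wide bit patterns."""
    mask = (1 << bits) - 1
    return (a & mask) < (b & mask)

def icmp_slt(a, b, bits=32):
    """Signed less-than on `bits`-wide bit patterns."""
    def signed(v):
        v &= (1 << bits) - 1
        return v - (1 << bits) if v >> (bits - 1) else v
    return signed(a) < signed(b)

nan = float("nan")
print(fcmp_olt(nan, 1.0), fcmp_ult(nan, 1.0))
print(icmp_slt(0xFFFFFFFF, 1), icmp_ult(0xFFFFFFFF, 1))
```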

Nov 22, 2013

X86: Perform integer comparisons at i32 or larger. · 860934a9

Jim Grosbach authored Nov 22, 2013

Utilizing the 8 and 16 bit comparison instructions, even when an input can
be folded into the comparison instruction itself, is typically not worth it.
There are too many partial register stalls as a result, leading to significant
slowdowns. By always performing comparisons on at least 32-bit
registers, performance of the calculation chain leading to the
comparison improves. Continue to use the smaller comparisons when
minimizing size, as that allows better folding of loads into the
comparison instructions.

rdar://15386341

llvm-svn: 195496

860934a9

Teach ISel not to optimize 'optnone' functions (revised). · d89125a5

Paul Robinson authored Nov 22, 2013

Improvements over r195317:
- Set/restore EnableFastISel flag instead of just running FastISel within
  SelectAllBasicBlocks; the flag is checked in various places, and
  FastISel won't run properly if those places don't do the right thing.
- Test looks for normal ISel versus FastISel behavior, and not
  something more subtle that doesn't work everywhere.

Based on work by Andrea Di Biagio.

llvm-svn: 195491

d89125a5

Fix PR18014 · 02160d58

Michael Liao authored Nov 22, 2013

- When simplifying the mask generation for BLEND, check whether that mask is
  also consumed by other non-BLEND insns. If true, skip that simplification.

llvm-svn: 195476

02160d58

[SystemZ] Fix TMHH and TMHL usage for z10 with -O0 · f03789ca

Richard Sandiford authored Nov 22, 2013

I've no idea why I decided to handle TMxx differently from all the other
high/low logic operations, but it was a stupid thing to do.  The high
registers aren't available as separate 32-bit registers on z10,
so subreg_h32 can't be used on a GR64 there.

I've normally been testing with z196 and with -O3 and so hadn't noticed
this until now.

llvm-svn: 195473

f03789ca

Don't produce tail calls when the caller is x86_thiscallcc. · 5a8e985a
Rafael Espindola authored Nov 22, 2013
```
The callee will not pop the stack for us.

llvm-svn: 195467
```
5a8e985a
Fix typo in a comment added in r195455. · d40aea87
Daniel Sanders authored Nov 22, 2013
```
Credit to Matheus Almeida for spotting it.

llvm-svn: 195456
```
d40aea87

[mips][msa] Fix corner case for integer constant splats with undef values. · 630dbe0a

Daniel Sanders authored Nov 22, 2013

lowerBUILD_VECTOR() was treating integer constant splats as being legal
regardless of whether they had undef values. This caused instruction
selection failures when the undefs were legalized to zero, making the
constant non-splat.

Fixed this by requiring HasAnyUndef to be false for an integer constant
splat to be legal. If it is true, a new node is generated with the undefs
replaced with the necessary values to remain a splat.

llvm-svn: 195455

630dbe0a
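
The failure mode and the fix can be modelled on plain lists. In this Python sketch `None` stands in for an undef element, and all helper names are hypothetical; it only mirrors the logic described in the commit message, not lowerBUILD_VECTOR() itself.

```python
UNDEF = None  # stand-in for an undef vector element

def splat_value(elements):
    """Return the common defined value if the vector is a splat when
    undefs are ignored, otherwise None."""
    defined = {e for e in elements if e is not UNDEF}
    return defined.pop() if len(defined) == 1 else None

def is_strict_splat(elements):
    """True only if there are no undefs and every element is equal."""
    return UNDEF not in elements and len(set(elements)) == 1

def legalize_undefs_to_zero(elements):
    # What went wrong: undefs become 0, so the vector stops being a splat.
    return [0 if e is UNDEF else e for e in elements]

def fill_undefs_with_splat(elements):
    # The fix, in spirit: replace undefs with the splat value itself.
    value = splat_value(elements)
    if value is None:
        return None
    return [value if e is UNDEF else e for e in elements]

vec = [7, UNDEF, 7, UNDEF]
print(legalize_undefs_to_zero(vec), fill_undefs_with_splat(vec))
```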

Add support for Cortex-A12. · c31078cd
Richard Barton authored Nov 22, 2013
```
Patch by Oliver Stannard!

llvm-svn: 195448
```
c31078cd

[mips][msa] Float vector constants cannot use ldi.[wd] directly. Bitcast from... · fd8e4168

Daniel Sanders authored Nov 22, 2013

[mips][msa] Float vector constants cannot use ldi.[wd] directly. Bitcast from the appropriate integer vector type.

Fixes an instruction selection failure detected by llvm-stress.

llvm-svn: 195444

fd8e4168

Revert r195318 as it causes miscompilation (PR18029) · 40070098
Kostya Serebryany authored Nov 22, 2013
```
llvm-svn: 195439
```
40070098

Fix a Cygwin build failure caused by enum values starting with '_', which is... · e8bdc8c8

Hao Liu authored Nov 22, 2013

Fix a Cygwin build failure caused by enum values starting with '_', which conflict with some platform macros.
This patch only renames variables, no functional change.

llvm-svn: 195432

e8bdc8c8

Fix the bugs about AArch64 Load/Store vector types and bitcast between i64 and vector types. · 25aed9bb
Hao Liu authored Nov 22, 2013
```
e.g. "%tmp = load <2 x i64>* %ptr" can't be selected. 
     "%tmp = bitcast i64 %in to <2 x i32>" can't be selected.

llvm-svn: 195424
```
25aed9bb