Commits · 7dee697faa5361fc953909b8d4547f2d08af5cab · Roger Ferrer / llvm-epi-0.8

Aug 13, 2013

Evgeniy Stepanov authored Aug 13, 2013

../lib/Target/X86/X86ISelLowering.cpp:9715:7: error: unused variable 'OpVT' [-Werror,-Wunused-variable]
  EVT OpVT = Op0.getValueType();
      ^
../lib/Target/X86/X86ISelLowering.cpp:9763:14: error: unused variable 'NumElems' [-Werror,-Wunused-variable]
    unsigned NumElems = VT.getVectorNumElements();

llvm-svn: 188269

7dee697f

AVX-512: Added CMP and BLEND instructions. · 60b1f289
Elena Demikhovsky authored Aug 13, 2013
```
Lowering for SETCC.

llvm-svn: 188265
```
60b1f289

Aug 11, 2013
- AVX-512: Added more tests for BROADCAST · 5fed3b95
  Elena Demikhovsky authored Aug 11, 2013
```
llvm-svn: 188148
```
  5fed3b95
- AVX-512: Added VPERM* instructions and MOV* zmm-to-zmm instructions. · cf5b1458
  Elena Demikhovsky authored Aug 11, 2013
```
Added a test for shuffles using VPERM.

llvm-svn: 188147
```
  cf5b1458
Aug 07, 2013
- AVX-512 set: Added BROADCAST instructions · 45c54ad8
  Elena Demikhovsky authored Aug 07, 2013
```
with lowering logic and a test.

llvm-svn: 187884
```
  45c54ad8
- Simplify code. No functional change intended. · c5b0ad27
  Craig Topper authored Aug 07, 2013
```
llvm-svn: 187870
```
  c5b0ad27
Aug 06, 2013

Refactor isInTailCallPosition handling · a4415854

Tim Northover authored Aug 06, 2013

This change came about primarily because of two issues in the existing code.
Neither of:

define i64 @test1(i64 %val) {
  %in = trunc i64 %val to i32
  tail call i32 @ret32(i32 returned %in)
  ret i64 %val
}

define i64 @test2(i64 %val) {
  tail call i32 @ret32(i32 returned undef)
  ret i32 42
}

should be tail calls, and the function sameNoopInput is responsible. The main
problem is that it is completely symmetric in the "tail call" and "ret" value,
but in reality different things are allowed on each side.

For these cases:
1. Any truncation should lead to a larger value being generated by "tail call"
   than needed by "ret".
2. Undef should only be allowed as a source for ret, not as a result of the
   call.

Along the way I noticed that a mismatch between what this function treats as a
valid truncation and what the backends see can lead to invalid calls as well
(see x86-32 test case).

This patch refactors the code so that instead of being based primarily on
values which it recurses into when necessary, it starts by inspecting the type
and considers each fundamental slot that the backend will see in turn. For
example, given a pathological function that returned {{}, {{}, i32, {}}, i32}
we would consider each "real" i32 in turn, and ask if it passes through
unchanged. This is much closer to what the backend sees as a result of
ComputeValueVTs.
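
The slot-by-slot walk described above can be sketched outside LLVM as follows. This is only an illustration under stated assumptions: `Type`, `Path`, and `leafSlots` are hypothetical names invented here, not LLVM classes or the code in this patch. It shows how an explicit stack can visit each "real" scalar inside a nested aggregate such as {{}, {{}, i32, {}}, i32} without recursion.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for an IR type (not an LLVM class): a scalar leaf
// such as "i32", or an aggregate with zero or more element types.
struct Type {
  std::string scalar;          // non-empty only for a leaf
  std::vector<Type> elements;  // children when this is an aggregate
  bool isLeaf() const { return !scalar.empty(); }
};

using Path = std::vector<std::size_t>;

// Returns the index path of every scalar slot in declaration order.
// Empty aggregates contribute nothing. An explicit stack replaces recursion.
std::vector<Path> leafSlots(const Type &root) {
  struct Frame {
    const Type *ty;
    Path path;
  };
  std::vector<Frame> stack;
  stack.push_back(Frame{&root, Path{}});
  std::vector<Path> slots;
  while (!stack.empty()) {
    Frame f = std::move(stack.back());
    stack.pop_back();
    assert(f.ty != nullptr);
    if (f.ty->isLeaf()) {
      slots.push_back(std::move(f.path));
      continue;
    }
    // Push children in reverse so the first element is visited first.
    for (std::size_t i = f.ty->elements.size(); i-- > 0;) {
      Path p = f.path;
      p.push_back(i);
      stack.push_back(Frame{&f.ty->elements[i], std::move(p)});
    }
  }
  return slots;
}
```

For the pathological type in the text, this sketch yields two slots: the i32 at path (1, 1) and the i32 at path (2). A tail-call check would then ask, for each such slot, whether the call result reaches the ret unchanged.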

Aside from the bug fixes, this eliminates the recursion that's going on and, I
believe, makes the bulk of the code significantly easier to understand. The
trade-off is the nasty iterators needed to find the real types inside a
returned value.

llvm-svn: 187787

a4415854

Simplify vector lane handling math a bit. No functional change intended. · cf969ead
Craig Topper authored Aug 06, 2013
```
llvm-svn: 187783
```
cf969ead
Simplify math a little bit. · 7418ff46
Craig Topper authored Aug 06, 2013
```
llvm-svn: 187781
```
7418ff46
Replace EVT with MVT in isHorizontalBinOp as it is only called with legal types. · 9bc00b65
Craig Topper authored Aug 06, 2013
```
llvm-svn: 187779
```
9bc00b65
Simplify code slightly. No functional change. · 47d7c5c8
Craig Topper authored Aug 06, 2013
```
llvm-svn: 187771
```
47d7c5c8

Aug 05, 2013
- Silencing an MSVC11 type conversion warning. · 5b463457
  Aaron Ballman authored Aug 05, 2013
```
llvm-svn: 187727
```
  5b463457
- AVX-512 set: added mask operations, lowering BUILD_VECTOR for i1 vector types. · 40864b69
  Elena Demikhovsky authored Aug 05, 2013
```
Added intrinsics and tests.

llvm-svn: 187717
```
  40864b69
Aug 04, 2013

X86: Turn fp selects into mask operations. · 5bc180c1

Benjamin Kramer authored Aug 04, 2013

double test(double a, double b, double c, double d) { return a<b ? c : d; }

before:
_test:
	ucomisd	%xmm0, %xmm1
	ja	LBB0_2
	movaps	%xmm3, %xmm2
LBB0_2:
	movaps	%xmm2, %xmm0

after:
_test:
	cmpltsd	%xmm1, %xmm0
	andpd	%xmm0, %xmm2
	andnpd	%xmm3, %xmm0
	orpd	%xmm2, %xmm0

Small speedup on Benchmarks/SmallPT
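
The "after" sequence corresponds to a compare that produces an all-ones or all-zeros mask, followed by and/andn/or to pick one operand. A minimal scalar sketch of that idea in portable C++ is given below; `selectLT` is a hypothetical name, and the compiler is free to emit something other than the instructions shown above for it.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Branch-free select for doubles: returns c when a < b, otherwise d.
double selectLT(double a, double b, double c, double d) {
  static_assert(sizeof(double) == sizeof(std::uint64_t),
                "sketch assumes 64-bit double");
  // All-ones when a < b, all-zeros otherwise (the role of cmpltsd).
  std::uint64_t mask = (a < b) ? ~std::uint64_t{0} : std::uint64_t{0};
  std::uint64_t cBits, dBits;
  std::memcpy(&cBits, &c, sizeof cBits);
  std::memcpy(&dBits, &d, sizeof dBits);
  // (mask & c) | (~mask & d): the roles of andpd, andnpd, and orpd.
  std::uint64_t rBits = (mask & cBits) | (~mask & dBits);
  double r;
  std::memcpy(&r, &rBits, sizeof r);
  return r;
}
```

Because the result is assembled from bit patterns, no branch depends on the outcome of the comparison.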

llvm-svn: 187706

5bc180c1

X86: correct tail return address calculation · ecc018c7

Tim Northover authored Aug 04, 2013

Due to the weird and wonderful usual arithmetic conversions, some
calculations involving negative values were getting performed in
uint32_t and then promoted to int64_t, which is really not a good
idea.

Patch by Katsuhiro Ueno.
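
The pitfall described in this message can be reproduced in isolation. The sketch below is not the code from the patch; `scaledWrong` and `scaledRight` are hypothetical names, and it assumes a platform where `int` is 32 bits wide.

```cpp
#include <cassert>
#include <cstdint>

static_assert(sizeof(int) == 4, "sketch assumes 32-bit int");

// The multiplication mixes int32_t and uint32_t, so the usual arithmetic
// conversions carry it out in uint32_t. A negative delta wraps around, and
// the wrapped value is then widened to int64_t as a large positive number.
std::int64_t scaledWrong(std::int32_t delta, std::uint32_t size) {
  return delta * size;
}

// Widening to int64_t first keeps the whole computation signed.
std::int64_t scaledRight(std::int32_t delta, std::uint32_t size) {
  return static_cast<std::int64_t>(delta) * size;
}
```

With `delta = -2` and `size = 8`, the first function returns 4294967280 and the second returns -16.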

llvm-svn: 187703

ecc018c7

Aug 01, 2013
- EVEX and compressed displacement encoding for AVX512 · b1266b54
  Elena Demikhovsky authored Aug 01, 2013
```
llvm-svn: 187576
```
  b1266b54
Jul 31, 2013

Fixed assertion in Extract128BitVector() · b0a75431
Elena Demikhovsky authored Jul 31, 2013
```
llvm-svn: 187493
```
b0a75431

Added INSERT and EXTRACT instructions from AVX-512 ISA. · 67b05fc0

Elena Demikhovsky authored Jul 31, 2013

All insertf*/extractf* functions replaced with insert/extract since we have insertf and inserti forms.
Added lowering for INSERT_VECTOR_ELT / EXTRACT_VECTOR_ELT for 512-bit vectors.
Added lowering for EXTRACT/INSERT subvector for 512-bit vectors.
Added a test.

llvm-svn: 187491

67b05fc0

Jul 29, 2013

Proper va_arg/va_copy lowering on win64 · 06d17c80

Nico Rieck authored Jul 29, 2013

Win64 uses CharPtrBuiltinVaList instead of X86_64ABIBuiltinVaList like
other 64-bit targets.

llvm-svn: 187355

06d17c80

Jul 26, 2013

Add a target legalize hook for SplitVectorOperand (again) · d3f2035a

Justin Holewinski authored Jul 26, 2013

CustomLowerNode was not being called during SplitVectorOperand,
meaning custom legalization could not be used by targets.

This also adds a test case for NVPTX that depends on this custom
legalization.

Differential Revision: http://llvm-reviews.chandlerc.com/D1195

Attempt to fix the buildbots by making the X86 test I just added platform independent

llvm-svn: 187202

d3f2035a

Revert "Add a target legalize hook for SplitVectorOperand" · 1d812728

Rafael Espindola authored Jul 26, 2013

This reverts commit 187198. It broke the bots.

The soft float test probably needs a -triple because of name differences.
On the hard float test I am getting a "roundss $1, %xmm0, %xmm0", instead of
"vroundss $1, %xmm0, %xmm0, %xmm0".

llvm-svn: 187201

1d812728

Add a target legalize hook for SplitVectorOperand · f848a24e

Justin Holewinski authored Jul 26, 2013

CustomLowerNode was not being called during SplitVectorOperand,
meaning custom legalization could not be used by targets.

This also adds a test case for NVPTX that depends on this custom
legalization.

Differential Revision: http://llvm-reviews.chandlerc.com/D1195

llvm-svn: 187198

f848a24e

Jul 24, 2013

I'm starting to commit KNL backend. I'll push patches one-by-one. This patch... · 8cfb43f7

Elena Demikhovsky authored Jul 24, 2013

I'm starting to commit KNL backend. I'll push patches one-by-one. This patch includes support for the extended register set XMM16-31, YMM16-31, ZMM0-31.
The full ISA is documented here: http://software.intel.com/en-us/intel-isa-extensions

llvm-svn: 187030

8cfb43f7

Jul 16, 2013

[X86] Use min/max to optimize unsigned vector comparison on X86 · 3d527d80

Juergen Ributzka authored Jul 16, 2013

Use PMIN/PMAX for UGE/ULE vector comparisons to reduce the number of required
instructions. This trick also works for UGT/ULT, but there is no advantage in
doing so. It wouldn't reduce the number of instructions and it would actually
reduce performance.
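
The identity behind this trick is that, for unsigned values, a >= b holds exactly when max(a, b) == a, and a <= b holds exactly when min(a, b) == a. A per-lane model in plain C++ might look like the sketch below; `Lanes`, `cmpUGE`, and `cmpULE` are hypothetical names, and the loops only stand in for one unsigned min/max instruction followed by one equality compare.

```cpp
#include <algorithm>
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

using Lanes = std::array<std::uint8_t, 16>;

// Per-lane unsigned a >= b: all-ones when max(a, b) == a, else all-zeros.
Lanes cmpUGE(const Lanes &a, const Lanes &b) {
  Lanes r{};
  for (std::size_t i = 0; i < a.size(); ++i)
    r[i] = (std::max(a[i], b[i]) == a[i]) ? 0xFF : 0x00;
  return r;
}

// Per-lane unsigned a <= b: all-ones when min(a, b) == a, else all-zeros.
Lanes cmpULE(const Lanes &a, const Lanes &b) {
  Lanes r{};
  for (std::size_t i = 0; i < a.size(); ++i)
    r[i] = (std::min(a[i], b[i]) == a[i]) ? 0xFF : 0x00;
  return r;
}
```

The strict forms UGT/ULT would need an extra inversion of the mask on top of this, which is consistent with the message's remark that the trick gains nothing there.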

Reviewer: Ben

radar:5972691

llvm-svn: 186432

3d527d80

Jul 15, 2013
- Add 'static' keyword to some const arrays for consistency. · 202fbc2c
  Craig Topper authored Jul 15, 2013
```
llvm-svn: 186308
```
  202fbc2c
Jul 14, 2013
- Use SmallVectorImpl& instead of SmallVector to avoid repeating small vector size. · b94011fd
  Craig Topper authored Jul 14, 2013
```
llvm-svn: 186274
```
  b94011fd
Jul 12, 2013

X86: fold SSE2/AVX2 logical shift by immediate amount into zero vector when possible · fda967fd
Stephen Lin authored Jul 12, 2013
```
Patch by Andrea Di Biagio

llvm-svn: 186165
```
fda967fd
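
The fold named in the title above relies on a logical shift by an immediate at least as large as the element width clearing every lane. A minimal lane-level sketch, assuming 32-bit elements, is shown below; `V4` and `shlImm` are hypothetical names and not part of the patch.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

using V4 = std::array<std::uint32_t, 4>;

// Logical left shift of every 32-bit lane by the same immediate.
V4 shlImm(const V4 &v, unsigned amount) {
  // A shift by 32 or more clears each lane, so the result is the zero
  // vector regardless of v. The guard also avoids undefined behavior in
  // C++, where shifting by the full width is not allowed.
  if (amount >= 32)
    return V4{};
  V4 r{};
  for (std::size_t i = 0; i < v.size(); ++i)
    r[i] = v[i] << amount;
  return r;
}
```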

Target/X86: Add explicit Win64 and System V/x86-64 calling conventions. · e8f297ca

Charles Davis authored Jul 12, 2013

Summary:
This patch adds explicit calling convention types for the Win64 and
System V/x86-64 ABIs. This allows code to override the default, and use
the Win64 convention on a target that wants to use SysV (and
vice-versa). This is needed to implement the `ms_abi` and `sysv_abi` GNU
attributes.

llvm-svn: 186144

e8f297ca

Jul 09, 2013

AArch64/PowerPC/SystemZ/X86: This patch fixes the interface, usage, and all · 73de7bf5

Stephen Lin authored Jul 09, 2013

in-tree implementations of TargetLoweringBase::isFMAFasterThanMulAndAdd in
order to resolve the following issues with fmuladd (i.e. optional FMA)
intrinsics:

1. On X86(-64) targets, ISD::FMA nodes are formed when lowering fmuladd
intrinsics even if the subtarget does not support FMA instructions, leading
to laughably bad code generation in some situations.

2. On AArch64 targets, ISD::FMA nodes are formed for operations on fp128,
resulting in a call to a software fp128 FMA implementation.

3. On PowerPC targets, FMAs are not generated from fmuladd intrinsics on types
like v2f32, v8f32, v4f64, etc., even though they promote, split, scalarize,
etc. to types that support hardware FMAs.

The function has also been slightly renamed for consistency and to force a
merge/build conflict for any out-of-tree target implementing it. To resolve,
see comments and fixed in-tree examples.

llvm-svn: 185956

73de7bf5

Jul 08, 2013
- Reuse %rax after calling __chkstk on win64 · 51969be7
  Nico Rieck authored Jul 08, 2013
```
Reapply this as I reverted the wrong commit.

llvm-svn: 185807
```
  51969be7
- Revert "Proper va_arg/va_copy lowering on win64" · 4801303c
  Nico Rieck authored Jul 08, 2013
```
This reverts commit 2b52880592a525cfe04d8f9008a35da8c2ea94c3.

Needs review.

llvm-svn: 185806
```
  4801303c
- Revert "Reuse %rax after calling __chkstk on win64" · 43b51056
  Nico Rieck authored Jul 08, 2013
```
This reverts commit 01f8d579f7672872324208ac5bc4ac311e81b22e.

llvm-svn: 185781
```
  43b51056
Jul 07, 2013
- Reuse %rax after calling __chkstk on win64 · 7adf6111
  Nico Rieck authored Jul 07, 2013
```
llvm-svn: 185778
```
  7adf6111
Jul 06, 2013
- Proper va_arg/va_copy lowering on win64 · 99ef2890
  Nico Rieck authored Jul 06, 2013
```
llvm-svn: 185763
```
  99ef2890
Jul 04, 2013
- Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. · db429d94
  Jakob Stoklund Olesen authored Jul 04, 2013
```
These exception-related opcodes are not used any longer.

llvm-svn: 185625
```
  db429d94
- Revert r185595-185596 which broke buildbots. · a1f5b901
  Jakob Stoklund Olesen authored Jul 04, 2013
```
Revert "Simplify landing pad lowering."
Revert "Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes."

llvm-svn: 185600
```
  a1f5b901
- Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. · f33ec531
  Jakob Stoklund Olesen authored Jul 03, 2013
```
These exception-related opcodes are not used any longer.

llvm-svn: 185596
```
  f33ec531
Jul 03, 2013
- Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid... · 31ee5866
  Craig Topper authored Jul 03, 2013
```
Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid specifying the vector size.

llvm-svn: 185540
```
  31ee5866
Jun 26, 2013

Optimized integer vector multiplication operation by replacing it with... · 6769c50d

Elena Demikhovsky authored Jun 26, 2013

Optimized integer vector multiplication operation by replacing it with shift/xor/sub when it is possible. Fixed a bug in SDIV, where the const operand is not a splat constant vector.
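
One plausible reading of "shift/xor/sub", stated here as an assumption rather than a description of the patch, is that multiplying by a power of two becomes a shift, multiplying by zero becomes x ^ x, and multiplying by -1 becomes 0 - x. The sketch below models that for a single 32-bit lane; `tryCheapMul` is a hypothetical name.

```cpp
#include <cassert>
#include <cstdint>

// Tries to compute x * c without a multiply. Returns true and stores the
// result in out when c is 0, -1, or a positive power of two.
bool tryCheapMul(std::uint32_t x, std::int32_t c, std::uint32_t &out) {
  if (c == 0) {
    out = x ^ x;  // always zero
    return true;
  }
  if (c == -1) {
    out = 0u - x;  // two's complement negation
    return true;
  }
  std::uint32_t uc = static_cast<std::uint32_t>(c);
  if (c > 0 && (uc & (uc - 1)) == 0) {
    unsigned shift = 0;
    while ((uc >> shift) != 1u)
      ++shift;
    out = x << shift;  // x * 2^shift
    return true;
  }
  return false;  // no cheaper form in this sketch
}
```

In the vector setting, such a rewrite would apply when every lane of the constant operand holds the same value.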

llvm-svn: 184931

6769c50d

Jun 22, 2013
- The getRegForInlineAsmConstraint function should only accept MVT value types. · 295bd43a
  Chad Rosier authored Jun 22, 2013
```
llvm-svn: 184642
```
  295bd43a