Commits · 7418ff460c6bbdb5e861ee0a0eedb039156e9bd4 · Roger Ferrer / llvm-epi-0.8

Aug 06, 2013
- Simplify math a little bit. · 7418ff46
  Craig Topper authored Aug 06, 2013
```
llvm-svn: 187781
```
  7418ff46
- Replace EVT with MVT in isHorizontalBinOp as it is only called with legal types. · 9bc00b65
  Craig Topper authored Aug 06, 2013
```
llvm-svn: 187779
```
  9bc00b65
- Simplify code slightly. No functional change. · 47d7c5c8
  Craig Topper authored Aug 06, 2013
```
llvm-svn: 187771
```
  47d7c5c8
Aug 05, 2013
- Silencing an MSVC11 type conversion warning. · 5b463457
  Aaron Ballman authored Aug 05, 2013
```
llvm-svn: 187727
```
  5b463457
- AVX-512 set: added mask operations, lowering BUILD_VECTOR for i1 vector types. · 40864b69
  Elena Demikhovsky authored Aug 05, 2013
```
Added intrinsics and tests.

llvm-svn: 187717
```
  40864b69
Aug 04, 2013

X86: Turn fp selects into mask operations. · 5bc180c1

Benjamin Kramer authored Aug 04, 2013

double test(double a, double b, double c, double d) { return a<b ? c : d; }

before:
_test:
	ucomisd	%xmm0, %xmm1
	ja	LBB0_2
	movaps	%xmm3, %xmm2
LBB0_2:
	movaps	%xmm2, %xmm0

after:
_test:
	cmpltsd	%xmm1, %xmm0
	andpd	%xmm0, %xmm2
	andnpd	%xmm3, %xmm0
	orpd	%xmm2, %xmm0

Small speedup on Benchmarks/SmallPT

llvm-svn: 187706

5bc180c1

X86: correct tail return address calculation · ecc018c7

Tim Northover authored Aug 04, 2013

Due to the weird and wondeful usual arithmetic conversions, some
calculations involving negative values were getting performed in
uint32_t and then promoted to int64_t, which is really not a good
idea.

Patch by Katsuhiro Ueno.

llvm-svn: 187703

ecc018c7

Aug 01, 2013
- EVEX and compressed displacement encoding for AVX512 · b1266b54
  Elena Demikhovsky authored Aug 01, 2013
```
llvm-svn: 187576
```
  b1266b54
Jul 31, 2013

Fixed assertion in Extract128BitVector() · b0a75431
Elena Demikhovsky authored Jul 31, 2013
```
llvm-svn: 187493
```
b0a75431

Added INSERT and EXTRACT intructions from AVX-512 ISA. · 67b05fc0

Elena Demikhovsky authored Jul 31, 2013

All insertf*/extractf* functions replaced with insert/extract since we have insertf and inserti forms.
Added lowering for INSERT_VECTOR_ELT / EXTRACT_VECTOR_ELT for 512-bit vectors.
Added lowering for EXTRACT/INSERT subvector for 512-bit vectors.
Added a test.

llvm-svn: 187491

67b05fc0

Jul 29, 2013

Proper va_arg/va_copy lowering on win64 · 06d17c80

Nico Rieck authored Jul 29, 2013

Win64 uses CharPtrBuiltinVaList instead of X86_64ABIBuiltinVaList like
other 64-bit targets.

llvm-svn: 187355

06d17c80

Jul 26, 2013

Add a target legalize hook for SplitVectorOperand (again) · d3f2035a

Justin Holewinski authored Jul 26, 2013

CustomLowerNode was not being called during SplitVectorOperand,
meaning custom legalization could not be used by targets.

This also adds a test case for NVPTX that depends on this custom
legalization.

Differential Revision: http://llvm-reviews.chandlerc.com/D1195

Attempt to fix the buildbots by making the X86 test I just added platform independent

llvm-svn: 187202

d3f2035a

Revert "Add a target legalize hook for SplitVectorOperand" · 1d812728

Rafael Espindola authored Jul 26, 2013

This reverts commit 187198. It broke the bots.

The soft float test probably needs a -triple because of name differences.
On the hard float test I am getting a "roundss $1, %xmm0, %xmm0", instead of
"vroundss $1, %xmm0, %xmm0, %xmm0".

llvm-svn: 187201

1d812728

Add a target legalize hook for SplitVectorOperand · f848a24e

Justin Holewinski authored Jul 26, 2013

CustomLowerNode was not being called during SplitVectorOperand,
meaning custom legalization could not be used by targets.

This also adds a test case for NVPTX that depends on this custom
legalization.

Differential Revision: http://llvm-reviews.chandlerc.com/D1195

llvm-svn: 187198

f848a24e

Jul 24, 2013

I'm starting to commit KNL backend. I'll push patches one-by-one. This patch... · 8cfb43f7

Elena Demikhovsky authored Jul 24, 2013

I'm starting to commit KNL backend. I'll push patches one-by-one. This patch includes support for the extended register set XMM16-31, YMM16-31, ZMM0-31.
The full ISA you can see here: http://software.intel.com/en-us/intel-isa-extensions

llvm-svn: 187030

8cfb43f7

Jul 16, 2013

[X86] Use min/max to optimze unsigend vector comparison on X86 · 3d527d80

Juergen Ributzka authored Jul 16, 2013

Use PMIN/PMAX for UGE/ULE vector comparions to reduce the number of required
instructions. This trick also works for UGT/ULT, but there is no advantage in
doing so. It wouldn't reduce the number of instructions and it would actually
reduce performance.

Reviewer: Ben

radar:5972691

llvm-svn: 186432

3d527d80

Jul 15, 2013
- Add 'static' keyword to some const arrays for consistency. · 202fbc2c
  Craig Topper authored Jul 15, 2013
```
llvm-svn: 186308
```
  202fbc2c
Jul 14, 2013
- Use SmallVectorImpl& instead of SmallVector to avoid repeating small vector size. · b94011fd
  Craig Topper authored Jul 14, 2013
```
llvm-svn: 186274
```
  b94011fd
Jul 12, 2013

X86: fold SSE2/AVX2 logical shift by immediate amount into zero vector when possible · fda967fd
Stephen Lin authored Jul 12, 2013
```
Patch by Andrea Di Biagio

llvm-svn: 186165
```
fda967fd

Target/X86: Add explicit Win64 and System V/x86-64 calling conventions. · e8f297ca

Charles Davis authored Jul 12, 2013

Summary:
This patch adds explicit calling convention types for the Win64 and
System V/x86-64 ABIs. This allows code to override the default, and use
the Win64 convention on a target that wants to use SysV (and
vice-versa). This is needed to implement the `ms_abi` and `sysv_abi` GNU
attributes.

Reviewers:

CC:

llvm-svn: 186144

e8f297ca

Jul 09, 2013

AArch64/PowerPC/SystemZ/X86: This patch fixes the interface, usage, and all · 73de7bf5

Stephen Lin authored Jul 09, 2013

in-tree implementations of TargetLoweringBase::isFMAFasterThanMulAndAdd in
order to resolve the following issues with fmuladd (i.e. optional FMA)
intrinsics:

1. On X86(-64) targets, ISD::FMA nodes are formed when lowering fmuladd
intrinsics even if the subtarget does not support FMA instructions, leading
to laughably bad code generation in some situations.

2. On AArch64 targets, ISD::FMA nodes are formed for operations on fp128,
resulting in a call to a software fp128 FMA implementation.

3. On PowerPC targets, FMAs are not generated from fmuladd intrinsics on types
like v2f32, v8f32, v4f64, etc., even though they promote, split, scalarize,
etc. to types that support hardware FMAs.

The function has also been slightly renamed for consistency and to force a
merge/build conflict for any out-of-tree target implementing it. To resolve,
see comments and fixed in-tree examples.

llvm-svn: 185956

73de7bf5

Jul 08, 2013
- Reuse %rax after calling __chkstk on win64 · 51969be7
  Nico Rieck authored Jul 08, 2013
```
Reapply this as I reverted the wrong commit.

llvm-svn: 185807
```
  51969be7
- Revert "Proper va_arg/va_copy lowering on win64" · 4801303c
  Nico Rieck authored Jul 08, 2013
```
This reverts commit 2b52880592a525cfe04d8f9008a35da8c2ea94c3.

Needs review.

llvm-svn: 185806
```
  4801303c
- Revert "Reuse %rax after calling __chkstk on win64" · 43b51056
  Nico Rieck authored Jul 08, 2013
```
This reverts commit 01f8d579f7672872324208ac5bc4ac311e81b22e.

llvm-svn: 185781
```
  43b51056
Jul 07, 2013
- Reuse %rax after calling __chkstk on win64 · 7adf6111
  Nico Rieck authored Jul 07, 2013
```
llvm-svn: 185778
```
  7adf6111
Jul 06, 2013
- Proper va_arg/va_copy lowering on win64 · 99ef2890
  Nico Rieck authored Jul 06, 2013
```
llvm-svn: 185763
```
  99ef2890
Jul 04, 2013
- Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. · db429d94
  Jakob Stoklund Olesen authored Jul 04, 2013
```
These exception-related opcodes are not used any longer.

llvm-svn: 185625
```
  db429d94
- Revert r185595-185596 which broke buildbots. · a1f5b901
  Jakob Stoklund Olesen authored Jul 04, 2013
```
Revert "Simplify landing pad lowering."
Revert "Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes."

llvm-svn: 185600
```
  a1f5b901
- Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. · f33ec531
  Jakob Stoklund Olesen authored Jul 03, 2013
```
These exception-related opcodes are not used any longer.

llvm-svn: 185596
```
  f33ec531
Jul 03, 2013
- Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid... · 31ee5866
  Craig Topper authored Jul 03, 2013
```
Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid specifying the vector size.

llvm-svn: 185540
```
  31ee5866
Jun 26, 2013

Optimized integer vector multiplication operation by replacing it with... · 6769c50d

Elena Demikhovsky authored Jun 26, 2013

Optimized integer vector multiplication operation by replacing it with shift/xor/sub when it is possible. Fixed a bug in SDIV, where the const operand is not a splat constant vector.

llvm-svn: 184931

6769c50d

Jun 22, 2013
- The getRegForInlineAsmConstraint function should only accept MVT value types. · 295bd43a
  Chad Rosier authored Jun 22, 2013
```
llvm-svn: 184642
```
  295bd43a
Jun 07, 2013
- Don't cache the instruction and register info from the TargetMachine, because · 8f26840c
  Bill Wendling authored Jun 07, 2013
```
the internals of TargetMachine could change.

No functionality change intended.

llvm-svn: 183571
```
  8f26840c
May 30, 2013

Order CALLSEQ_START and CALLSEQ_END nodes. · ad6d08ac

Andrew Trick authored May 29, 2013

Fixes PR16146: gdb.base__call-ar-st.exp fails after
pre-RA-sched=source fixes.

Patch by Xiaoyi Guo!

This also fixes an unsupported dbg.value test case. Codegen was
previously incorrect but the test was passing by luck.

llvm-svn: 182885

ad6d08ac

May 25, 2013
- Track IR ordering of SelectionDAG nodes 2/4. · ef9de2a7
  Andrew Trick authored May 25, 2013
```
Change SelectionDAG::getXXXNode() interfaces as well as call sites of
these functions to pass in SDLoc instead of DebugLoc.

llvm-svn: 182703
```
  ef9de2a7
- Replace Count{Leading,Trailing}Zeros_{32,64} with count{Leading,Trailing}Zeros. · df1ecbd7
  Michael J. Spencer authored May 24, 2013
```
llvm-svn: 182680
```
  df1ecbd7
May 22, 2013
- X86: Fix a bug in EltsFromConsecutiveLoads. We can't generate new loads without chains. · 7b66c470
  Nadav Rotem authored May 22, 2013
```
llvm-svn: 182507
```
  7b66c470
- X86: When expanding PCMPGTQ to PCMPGTD we always want to compare the lower halves as unsigned. · d76cc186
  Benjamin Kramer authored May 22, 2013
```
Take #2 on fixing PR15977.

llvm-svn: 182486
```
  d76cc186
May 21, 2013
- X86: When emulating unsigned PCMPGTQ with PCMPGTD, fix the sign bit for the smaller type. · 18ef6b22
  Benjamin Kramer authored May 21, 2013
```
Otherwise we'll get a mix of signed and unsigned compares.
Fixes PR15977.

llvm-svn: 182364
```
  18ef6b22
May 18, 2013
- Add LLVMContext argument to getSetCCResultType · 75865923
  Matt Arsenault authored May 18, 2013
```
llvm-svn: 182180
```
  75865923