Commits · f848a24e504c87a7a21e0cbf97030fe507ae9e45 · Roger Ferrer / llvm-epi-0.8

Jul 26, 2013

Add a target legalize hook for SplitVectorOperand · f848a24e

Justin Holewinski authored Jul 26, 2013

CustomLowerNode was not being called during SplitVectorOperand,
meaning custom legalization could not be used by targets.

This also adds a test case for NVPTX that depends on this custom
legalization.

Differential Revision: http://llvm-reviews.chandlerc.com/D1195

llvm-svn: 187198

f848a24e

Jul 24, 2013

I'm starting to commit KNL backend. I'll push patches one-by-one. This patch... · 8cfb43f7

Elena Demikhovsky authored Jul 24, 2013

I'm starting to commit KNL backend. I'll push patches one-by-one. This patch includes support for the extended register set XMM16-31, YMM16-31, ZMM0-31.
The full ISA you can see here: http://software.intel.com/en-us/intel-isa-extensions

llvm-svn: 187030

8cfb43f7

Jul 16, 2013

[X86] Use min/max to optimze unsigend vector comparison on X86 · 3d527d80

Juergen Ributzka authored Jul 16, 2013

Use PMIN/PMAX for UGE/ULE vector comparions to reduce the number of required
instructions. This trick also works for UGT/ULT, but there is no advantage in
doing so. It wouldn't reduce the number of instructions and it would actually
reduce performance.

Reviewer: Ben

radar:5972691

llvm-svn: 186432

3d527d80

Jul 15, 2013
- Add 'static' keyword to some const arrays for consistency. · 202fbc2c
  Craig Topper authored Jul 15, 2013
```
llvm-svn: 186308
```
  202fbc2c
Jul 14, 2013
- Use SmallVectorImpl& instead of SmallVector to avoid repeating small vector size. · b94011fd
  Craig Topper authored Jul 14, 2013
```
llvm-svn: 186274
```
  b94011fd
Jul 12, 2013

X86: fold SSE2/AVX2 logical shift by immediate amount into zero vector when possible · fda967fd
Stephen Lin authored Jul 12, 2013
```
Patch by Andrea Di Biagio

llvm-svn: 186165
```
fda967fd

Target/X86: Add explicit Win64 and System V/x86-64 calling conventions. · e8f297ca

Charles Davis authored Jul 12, 2013

Summary:
This patch adds explicit calling convention types for the Win64 and
System V/x86-64 ABIs. This allows code to override the default, and use
the Win64 convention on a target that wants to use SysV (and
vice-versa). This is needed to implement the `ms_abi` and `sysv_abi` GNU
attributes.

Reviewers:

CC:

llvm-svn: 186144

e8f297ca

Jul 09, 2013

AArch64/PowerPC/SystemZ/X86: This patch fixes the interface, usage, and all · 73de7bf5

Stephen Lin authored Jul 09, 2013

in-tree implementations of TargetLoweringBase::isFMAFasterThanMulAndAdd in
order to resolve the following issues with fmuladd (i.e. optional FMA)
intrinsics:

1. On X86(-64) targets, ISD::FMA nodes are formed when lowering fmuladd
intrinsics even if the subtarget does not support FMA instructions, leading
to laughably bad code generation in some situations.

2. On AArch64 targets, ISD::FMA nodes are formed for operations on fp128,
resulting in a call to a software fp128 FMA implementation.

3. On PowerPC targets, FMAs are not generated from fmuladd intrinsics on types
like v2f32, v8f32, v4f64, etc., even though they promote, split, scalarize,
etc. to types that support hardware FMAs.

The function has also been slightly renamed for consistency and to force a
merge/build conflict for any out-of-tree target implementing it. To resolve,
see comments and fixed in-tree examples.

llvm-svn: 185956

73de7bf5

Jul 08, 2013
- Reuse %rax after calling __chkstk on win64 · 51969be7
  Nico Rieck authored Jul 08, 2013
```
Reapply this as I reverted the wrong commit.

llvm-svn: 185807
```
  51969be7
- Revert "Proper va_arg/va_copy lowering on win64" · 4801303c
  Nico Rieck authored Jul 08, 2013
```
This reverts commit 2b52880592a525cfe04d8f9008a35da8c2ea94c3.

Needs review.

llvm-svn: 185806
```
  4801303c
- Revert "Reuse %rax after calling __chkstk on win64" · 43b51056
  Nico Rieck authored Jul 08, 2013
```
This reverts commit 01f8d579f7672872324208ac5bc4ac311e81b22e.

llvm-svn: 185781
```
  43b51056
Jul 07, 2013
- Reuse %rax after calling __chkstk on win64 · 7adf6111
  Nico Rieck authored Jul 07, 2013
```
llvm-svn: 185778
```
  7adf6111
Jul 06, 2013
- Proper va_arg/va_copy lowering on win64 · 99ef2890
  Nico Rieck authored Jul 06, 2013
```
llvm-svn: 185763
```
  99ef2890
Jul 04, 2013
- Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. · db429d94
  Jakob Stoklund Olesen authored Jul 04, 2013
```
These exception-related opcodes are not used any longer.

llvm-svn: 185625
```
  db429d94
- Revert r185595-185596 which broke buildbots. · a1f5b901
  Jakob Stoklund Olesen authored Jul 04, 2013
```
Revert "Simplify landing pad lowering."
Revert "Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes."

llvm-svn: 185600
```
  a1f5b901
- Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. · f33ec531
  Jakob Stoklund Olesen authored Jul 03, 2013
```
These exception-related opcodes are not used any longer.

llvm-svn: 185596
```
  f33ec531
Jul 03, 2013
- Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid... · 31ee5866
  Craig Topper authored Jul 03, 2013
```
Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid specifying the vector size.

llvm-svn: 185540
```
  31ee5866
Jun 26, 2013

Optimized integer vector multiplication operation by replacing it with... · 6769c50d

Elena Demikhovsky authored Jun 26, 2013

Optimized integer vector multiplication operation by replacing it with shift/xor/sub when it is possible. Fixed a bug in SDIV, where the const operand is not a splat constant vector.

llvm-svn: 184931

6769c50d

Jun 22, 2013
- The getRegForInlineAsmConstraint function should only accept MVT value types. · 295bd43a
  Chad Rosier authored Jun 22, 2013
```
llvm-svn: 184642
```
  295bd43a
Jun 07, 2013
- Don't cache the instruction and register info from the TargetMachine, because · 8f26840c
  Bill Wendling authored Jun 07, 2013
```
the internals of TargetMachine could change.

No functionality change intended.

llvm-svn: 183571
```
  8f26840c
May 30, 2013

Order CALLSEQ_START and CALLSEQ_END nodes. · ad6d08ac

Andrew Trick authored May 29, 2013

Fixes PR16146: gdb.base__call-ar-st.exp fails after
pre-RA-sched=source fixes.

Patch by Xiaoyi Guo!

This also fixes an unsupported dbg.value test case. Codegen was
previously incorrect but the test was passing by luck.

llvm-svn: 182885

ad6d08ac

May 25, 2013
- Track IR ordering of SelectionDAG nodes 2/4. · ef9de2a7
  Andrew Trick authored May 25, 2013
```
Change SelectionDAG::getXXXNode() interfaces as well as call sites of
these functions to pass in SDLoc instead of DebugLoc.

llvm-svn: 182703
```
  ef9de2a7
- Replace Count{Leading,Trailing}Zeros_{32,64} with count{Leading,Trailing}Zeros. · df1ecbd7
  Michael J. Spencer authored May 24, 2013
```
llvm-svn: 182680
```
  df1ecbd7
May 22, 2013
- X86: Fix a bug in EltsFromConsecutiveLoads. We can't generate new loads without chains. · 7b66c470
  Nadav Rotem authored May 22, 2013
```
llvm-svn: 182507
```
  7b66c470
- X86: When expanding PCMPGTQ to PCMPGTD we always want to compare the lower halves as unsigned. · d76cc186
  Benjamin Kramer authored May 22, 2013
```
Take #2 on fixing PR15977.

llvm-svn: 182486
```
  d76cc186
May 21, 2013
- X86: When emulating unsigned PCMPGTQ with PCMPGTD, fix the sign bit for the smaller type. · 18ef6b22
  Benjamin Kramer authored May 21, 2013
```
Otherwise we'll get a mix of signed and unsigned compares.
Fixes PR15977.

llvm-svn: 182364
```
  18ef6b22
May 18, 2013
- Add LLVMContext argument to getSetCCResultType · 75865923
  Matt Arsenault authored May 18, 2013
```
llvm-svn: 182180
```
  75865923
May 17, 2013

X86: Make shuffle -> shift conversion more aggressive about undefs. · fc33e1d9

Benjamin Kramer authored May 17, 2013

Shuffles that only move an element into position 0 of the vector are common in
the output of the loop vectorizer and often generate suboptimal code when SSSE3
is not available. Lower them to vector shifts if possible.

We still prefer palignr over psrldq because it has higher throughput on
sandybridge.

llvm-svn: 182102

fc33e1d9

May 05, 2013

Remove a recently redundant transform from X86ISelLowering. · 66fb70de

David Majnemer authored May 05, 2013

X86ISelLowering has support to treat:
(icmp ne (and (xor %flags, -1), (shl 1, flag)), 0)

as if it were actually:
(icmp eq (and %flags, (shl 1, flag)), 0)

However, r179386 has code at the InstCombine level to handle this.

llvm-svn: 181145

66fb70de

Fix an odd comment. · 42932bdc
Nadav Rotem authored May 04, 2013
```
llvm-svn: 181136
```
42932bdc

May 02, 2013
- 80-col fixup. · 06badde1
  Michael Liao authored May 02, 2013
```
llvm-svn: 180915
```
  06badde1
- Avoid duplicating logic on frame register selecting when lowering eh_return · afafa98f
  Michael Liao authored May 02, 2013
```
No functionality change

llvm-svn: 180914
```
  afafa98f
- Avoid duplicating logic on frame register selecting when lowering frameaddr · 31d39a4a
  Michael Liao authored May 02, 2013
```
No functionality change

llvm-svn: 180912
```
  31d39a4a
Apr 20, 2013
- Remove unused ShouldFoldAtomicFences flag. · 16aba170
  Tim Northover authored Apr 20, 2013
```
I think it's almost impossible to fold atomic fences profitably under
LLVM/C++11 semantics. As a result, this is now unused and just
cluttering up the target interface.

llvm-svn: 179940
```
  16aba170
- Remove unused MEMBARRIER DAG node; it's been replaced by ATOMIC_FENCE. · a2b53390
  Tim Northover authored Apr 20, 2013
```
llvm-svn: 179939
```
  a2b53390
- ArrayRefize getMachineNode(). No functionality change. · b53d8963
  Michael Liao authored Apr 19, 2013
```
llvm-svn: 179901
```
  b53d8963
Apr 19, 2013
- Use 'array_lengthof' as possible to avoid magic numbers · e28fab22
  Michael Liao authored Apr 19, 2013
```
llvm-svn: 179833
```
  e28fab22
Apr 18, 2013
- X86: Add an SSE2 lowering for 64 bit compares when pcmpgtq (SSE4.2) isn't available. · c5578288
  Benjamin Kramer authored Apr 18, 2013
```
This pattern started popping up in vectorized min/max reductions.

llvm-svn: 179797
```
  c5578288
Apr 11, 2013

Optimize vector select from all 0s or all 1s · 55658d42

Michael Liao authored Apr 11, 2013

As packed comparisons in AVX/SSE produce all 0s or all 1s in each SIMD lane,
vector select could be simplified to AND/OR or removed if one or both values
being selected is all 0s or all 1s.

llvm-svn: 179267

55658d42

Enhance bool simplifcation in X86 to handle more cases · f7bf8705

Michael Liao authored Apr 11, 2013

This patch is revised based on patch from Victor Umansky
<victor.umansky@intel.com>. More cases are handled in X86's bool
simplification, i.e.
- SETCC_CARRY
- value is truncated to i1 with AND

As a by-product, PR5443 is also fixed.

llvm-svn: 179265

f7bf8705