Commits · 7221b76c328de03c918f8c8193c85c39b013a0af · Roger Ferrer / llvm-epi-0.8

Aug 09, 2010
- Add VCVTPD2PS, VCVTPS2DQ, VCVTPS2PDY, VCVTTPD2DQY, VCVTTPS2DQ and VCVTPD2DQ... · 685cb32d
  Bruno Cardoso Lopes authored Aug 09, 2010
```
Add VCVTPD2PS, VCVTPS2DQ, VCVTPS2PDY, VCVTTPD2DQY, VCVTTPS2DQ and VCVTPD2DQ 256-bit conversion intrinsics

llvm-svn: 110608
```
  685cb32d
- Add patterns to AVX conversions instructions. Do that instead of declaring... · 3e9b5676
  Bruno Cardoso Lopes authored Aug 09, 2010
```
Add patterns to AVX conversions instructions. Do that instead of declaring more intructions whenever is possible, more coming

llvm-svn: 110605
```
  3e9b5676
- CMake: eliminated unnecessary target_link_libraries. · 212cfde6
  Oscar Fuentes authored Aug 09, 2010
```
Next time the build is broken due to wrong library dependencies, just
try building again (if you are on some Unix and are building all LLVM
targets) or ask someone to commit the regenerated LLVMLibDeps.cmake.

llvm-svn: 110593
```
  212cfde6
- Memory version of vcvtdq2pd intrinsic · c33940b3
  Bruno Cardoso Lopes authored Aug 09, 2010
```
llvm-svn: 110582
```
  c33940b3
- Patterns to match vinsert, vbroadcast, vmovmask and vcvtdq2pd AVX intrinsics · 828f6aec
  Bruno Cardoso Lopes authored Aug 09, 2010
```
llvm-svn: 110580
```
  828f6aec
Aug 07, 2010
- Use sdmem and sse_load_f64 (etc.) for the vector · a3bd31a9
  Dale Johannesen authored Aug 07, 2010
```
form of CMPSD (etc.)  Matching a 128-bit memory
operand is wrong, the instruction uses only 64 bits
(same as ADDSD etc.)  8193553.

llvm-svn: 110491
```
  a3bd31a9
- Patterns to match AVX 256-bit vzero intrinsics · 93cc666a
  Bruno Cardoso Lopes authored Aug 06, 2010
```
llvm-svn: 110480
```
  93cc666a
Aug 06, 2010
- Patterns to match AVX 256-bit permutation intrinsics · 3d6a3a0e
  Bruno Cardoso Lopes authored Aug 06, 2010
```
llvm-svn: 110468
```
  3d6a3a0e
- Reapply r110396, with fixes to appease the Linux buildbot gods. · a7aed186
  Owen Anderson authored Aug 06, 2010
```
llvm-svn: 110460
```
  a7aed186
- Patterns to match AVX 256-bit horizontal arithmetic intrinsics · 1cf067cb
  Bruno Cardoso Lopes authored Aug 06, 2010
```
llvm-svn: 110427
```
  1cf067cb
- Patterns to match AVX 256-bit arithmetic intrinsics · b9ad94fb
  Bruno Cardoso Lopes authored Aug 06, 2010
```
llvm-svn: 110425
```
  b9ad94fb
- Revert r110396 to fix buildbots. · bda59bd2
  Owen Anderson authored Aug 06, 2010
```
llvm-svn: 110410
```
  bda59bd2
- Add an option to always emit realignment code for a particular module. · e1fb772a
  Eric Christopher authored Aug 05, 2010
```
llvm-svn: 110404
```
  e1fb772a
- Don't use PassInfo* as a type identifier for passes. Instead, use the address of the static · 755aceb5
  Owen Anderson authored Aug 05, 2010
```
ID member as the sole unique type identifier.  Clean up APIs related to this change.

llvm-svn: 110396
```
  755aceb5
- Support very basic (doesn't include ABI support in the front-end, varags, ...)... · 77954bdf
  Bruno Cardoso Lopes authored Aug 05, 2010
```
Support very basic (doesn't include ABI support in the front-end, varags, ...) 256-bit argument passing and return for AVX

llvm-svn: 110394
```
  77954bdf
Aug 05, 2010
- Handle the memory barrier pseudo that goes to nothing for the JIT. · 4d9c3400
  Eric Christopher authored Aug 05, 2010
```
llvm-svn: 110371
```
  4d9c3400
- Set hasSideEffects on the 64-bit no-sse memory barrier. · 7fd06eb8
  Eric Christopher authored Aug 05, 2010
```
llvm-svn: 110369
```
  7fd06eb8
- Be a little bit more specific about target for the memory barrier · 32f5d6b9
  Eric Christopher authored Aug 05, 2010
```
instructions.

llvm-svn: 110360
```
  32f5d6b9
- Handle the pseudo in MCInstLower. · 4abffad1
  Eric Christopher authored Aug 05, 2010
```
llvm-svn: 110359
```
  4abffad1
- Make x86-64 membarriers work without sse and clean up some of the · 2db84642
  Eric Christopher authored Aug 04, 2010
```
uses.

llvm-svn: 110274
```
  2db84642
- PR7814: Truncates cannot be ignored for signed comparisons. · 39d0f57c
  Eli Friedman authored Aug 04, 2010
```
llvm-svn: 110268
```
  39d0f57c
Aug 04, 2010
- Add DEBUG message. · 2bf0f3ce
  Devang Patel authored Aug 04, 2010
```
llvm-svn: 110224
```
  2bf0f3ce
- Enable COFF writer on mingw32 and cygwin. · a53a4eef
  Benjamin Kramer authored Aug 04, 2010
```
llvm-svn: 110200
```
  a53a4eef
- Print an error message when someone tries -integrated-as on an unsupported target. · 61c8e6dc
  Benjamin Kramer authored Aug 04, 2010
```
- The COFF backend doesn't support MingW/Cygwin at the moment, it'll report an
  error, but it's still much better than random assertions from the MachO backend.
- We want to make ELF the default eventually, it's what the majority of targets use.

llvm-svn: 110197
```
  61c8e6dc
- fix a win64 encoding problem, patch by Cameron Esfahani! · 53befe7b
  Chris Lattner authored Aug 03, 2010
```
llvm-svn: 110164
```
  53befe7b
Jul 31, 2010
- MC: Remove HasAbsolutizedSet from WindowsX86AsmBackend. · ed80f361
  Michael J. Spencer authored Jul 31, 2010
```
llvm-svn: 109949
```
  ed80f361
- Add relax all support to the COFF object streamer. · 6b4925e2
  Michael J. Spencer authored Jul 31, 2010
```
llvm-svn: 109947
```
  6b4925e2
Jul 30, 2010

Support all 128-bit AVX vector intrinsics. Most part of them I already · 349165b4

Bruno Cardoso Lopes authored Jul 30, 2010

declared during the addition of the assembler support, the additional
changes are:
- Add missing intrinsics
- Move all SSE conversion instructions in X86InstInfo64.td to the SSE.td file.
- Duplicate some patterns to AVX mode.
- Step into PCMPEST/PCMPIST custom inserter and add AVX versions.

llvm-svn: 109878

349165b4

Fix typo! · 405405bb
Bruno Cardoso Lopes authored Jul 30, 2010
```
llvm-svn: 109877
```
405405bb

Jul 29, 2010

Revert r109652, and remove the offending assert in loadRegFromStackSlot instead. · ba0e124a

Jakob Stoklund Olesen authored Jul 29, 2010

We do sometimes load from a too small stack slot when dealing with x86 arguments
(varargs and smaller-than-32-bit args). It looks like we know what we are doing
in those cases, so I am going to remove the assert instead of artifically
enlarging stack slot sizes.

The assert in storeRegToStackSlot stays in. We don't want to write beyond the
bounds of a stack slot.

llvm-svn: 109764

ba0e124a

Jul 28, 2010

Create a fixed stack object for varargs that is as large as any register. · f2234fbe

Jakob Stoklund Olesen authored Jul 28, 2010

The size of this object isn't used for anything - technically it is of variable
size.

This avoids a false positive from the assert in
X86InstrInfo::loadRegFromStackSlot, and fixes PR7735.

llvm-svn: 109652

f2234fbe

Implement a vectorized algorithm for <16 x i8> << <16 x i8> · 53afc8f0
Nate Begeman authored Jul 28, 2010
```
This is about 4x faster and smaller than the existing scalarization.

llvm-svn: 109566
```
53afc8f0

~40% faster vector shl <4 x i32> on SSE 4.1 Larger improvements for smaller... · 269a6da0

Nate Begeman authored Jul 27, 2010

~40% faster vector shl <4 x i32> on SSE 4.1  Larger improvements for smaller types coming in future patches.

For:

define <2 x i64> @shl(<4 x i32> %r, <4 x i32> %a) nounwind readnone ssp {
entry:
  %shl = shl <4 x i32> %r, %a                     ; <<4 x i32>> [#uses=1]
  %tmp2 = bitcast <4 x i32> %shl to <2 x i64>     ; <<2 x i64>> [#uses=1]
  ret <2 x i64> %tmp2
}

We get:

_shl:                                   ## @shl
	pslld	$23, %xmm1
	paddd	LCPI0_0, %xmm1
	cvttps2dq	%xmm1, %xmm1
	pmulld	%xmm1, %xmm0
	ret

Instead of:

_shl:                                   ## @shl
	pshufd	$3, %xmm0, %xmm2
	movd	%xmm2, %eax
	pshufd	$3, %xmm1, %xmm2
	movd	%xmm2, %ecx
	shll	%cl, %eax
	movd	%eax, %xmm2
	pshufd	$1, %xmm0, %xmm3
	movd	%xmm3, %eax
	pshufd	$1, %xmm1, %xmm3
	movd	%xmm3, %ecx
	shll	%cl, %eax
	movd	%eax, %xmm3
	punpckldq	%xmm2, %xmm3
	movd	%xmm0, %eax
	movd	%xmm1, %ecx
	shll	%cl, %eax
	movd	%eax, %xmm2
	movhlps	%xmm0, %xmm0
	movd	%xmm0, %eax
	movhlps	%xmm1, %xmm1
	movd	%xmm1, %ecx
	shll	%cl, %eax
	movd	%eax, %xmm0
	punpckldq	%xmm0, %xmm2
	movdqa	%xmm2, %xmm0
	punpckldq	%xmm3, %xmm0
	ret

llvm-svn: 109549

269a6da0

Jul 27, 2010
- Make MC use Windows COFF on Windows and add tests. · f8270bdb
  Michael J. Spencer authored Jul 27, 2010
```
llvm-svn: 109494
```
  f8270bdb
- The isLoadFromStackSlot and isStoreToStackSlot have no way of reporting · 96a890a7
  Jakob Stoklund Olesen authored Jul 27, 2010
```
subregister operands like this:

%reg1040:sub_32bit<def> = MOV32rm <fi#-2>, 1, %reg0, 0, %reg0, %reg1040<imp-def>; mem:LD4[FixedStack-2](align=8)

Make them return false when subreg operands are present. VirtRegRewriter is
making bad assumptions otherwise.

This fixes PR7713.

llvm-svn: 109489
```
  96a890a7
- Add assertions that expose the PR7713 miscompilation: Accessing a stack slot · c3c05ed0
  Jakob Stoklund Olesen authored Jul 27, 2010
```
with a too-big register class.

llvm-svn: 109488
```
  c3c05ed0
Jul 26, 2010

On x86, f32 / f64 nodes share the same registers as 128-bit vector values. · d4218b87
Evan Cheng authored Jul 26, 2010
```
llvm-svn: 109450
```
d4218b87

Temporary hack to let codegen assert or generate poor code in case · 36c2ea6c

Bruno Cardoso Lopes authored Jul 26, 2010

we are using AVX and no AVX version of the desired intruction is present,
this is better for incremental dev (without fallbacks it's easier to spot
what's missing). Not sure this is the best hack thought (we can also disable
all HasSSE* predicates by dinamically marking them 'false' if AVX is present)

llvm-svn: 109434

36c2ea6c

Jul 24, 2010

Add an ILP scheduler. This is a register pressure aware scheduler that's · 37b740c4

Evan Cheng authored Jul 24, 2010

appropriate for targets without detailed instruction iterineries.
The scheduler schedules for increased instruction level parallelism in
low register pressure situation; it schedules to reduce register pressure
when the register pressure becomes high.

On x86_64, this is a win for all tests in CFP2000. It also sped up 256.bzip2
by 16%.

llvm-svn: 109300

37b740c4

Support x86 "eiz" and "riz" pseudo index registers in the assembler. · 306a1f97
Bruno Cardoso Lopes authored Jul 24, 2010
```
llvm-svn: 109295
```
306a1f97