Commits · 32f5d6b9be7bffbbca6b8841d2913589c9b00b27 · Roger Ferrer / llvm-epi-0.8

Aug 05, 2010
- Be a little bit more specific about target for the memory barrier · 32f5d6b9
  Eric Christopher authored Aug 05, 2010
```
instructions.

llvm-svn: 110360
```
  32f5d6b9
- Handle the pseudo in MCInstLower. · 4abffad1
  Eric Christopher authored Aug 05, 2010
```
llvm-svn: 110359
```
  4abffad1
- Make x86-64 membarriers work without sse and clean up some of the · 2db84642
  Eric Christopher authored Aug 04, 2010
```
uses.

llvm-svn: 110274
```
  2db84642
- PR7814: Truncates cannot be ignored for signed comparisons. · 39d0f57c
  Eli Friedman authored Aug 04, 2010
```
llvm-svn: 110268
```
  39d0f57c
Aug 04, 2010
- Add DEBUG message. · 2bf0f3ce
  Devang Patel authored Aug 04, 2010
```
llvm-svn: 110224
```
  2bf0f3ce
- Enable COFF writer on mingw32 and cygwin. · a53a4eef
  Benjamin Kramer authored Aug 04, 2010
```
llvm-svn: 110200
```
  a53a4eef
- Print an error message when someone tries -integrated-as on an unsupported target. · 61c8e6dc
  Benjamin Kramer authored Aug 04, 2010
```
- The COFF backend doesn't support MingW/Cygwin at the moment, it'll report an
  error, but it's still much better than random assertions from the MachO backend.
- We want to make ELF the default eventually, it's what the majority of targets use.

llvm-svn: 110197
```
  61c8e6dc
- fix a win64 encoding problem, patch by Cameron Esfahani! · 53befe7b
  Chris Lattner authored Aug 03, 2010
```
llvm-svn: 110164
```
  53befe7b
Jul 31, 2010
- MC: Remove HasAbsolutizedSet from WindowsX86AsmBackend. · ed80f361
  Michael J. Spencer authored Jul 31, 2010
```
llvm-svn: 109949
```
  ed80f361
- Add relax all support to the COFF object streamer. · 6b4925e2
  Michael J. Spencer authored Jul 31, 2010
```
llvm-svn: 109947
```
  6b4925e2
Jul 30, 2010

Support all 128-bit AVX vector intrinsics. Most part of them I already · 349165b4

Bruno Cardoso Lopes authored Jul 30, 2010

declared during the addition of the assembler support, the additional
changes are:
- Add missing intrinsics
- Move all SSE conversion instructions in X86InstInfo64.td to the SSE.td file.
- Duplicate some patterns to AVX mode.
- Step into PCMPEST/PCMPIST custom inserter and add AVX versions.

llvm-svn: 109878

349165b4

Fix typo! · 405405bb
Bruno Cardoso Lopes authored Jul 30, 2010
```
llvm-svn: 109877
```
405405bb

Jul 29, 2010

Revert r109652, and remove the offending assert in loadRegFromStackSlot instead. · ba0e124a

Jakob Stoklund Olesen authored Jul 29, 2010

We do sometimes load from a too small stack slot when dealing with x86 arguments
(varargs and smaller-than-32-bit args). It looks like we know what we are doing
in those cases, so I am going to remove the assert instead of artifically
enlarging stack slot sizes.

The assert in storeRegToStackSlot stays in. We don't want to write beyond the
bounds of a stack slot.

llvm-svn: 109764

ba0e124a

Jul 28, 2010

Create a fixed stack object for varargs that is as large as any register. · f2234fbe

Jakob Stoklund Olesen authored Jul 28, 2010

The size of this object isn't used for anything - technically it is of variable
size.

This avoids a false positive from the assert in
X86InstrInfo::loadRegFromStackSlot, and fixes PR7735.

llvm-svn: 109652

f2234fbe

Implement a vectorized algorithm for <16 x i8> << <16 x i8> · 53afc8f0
Nate Begeman authored Jul 28, 2010
```
This is about 4x faster and smaller than the existing scalarization.

llvm-svn: 109566
```
53afc8f0

~40% faster vector shl <4 x i32> on SSE 4.1 Larger improvements for smaller... · 269a6da0

Nate Begeman authored Jul 27, 2010

~40% faster vector shl <4 x i32> on SSE 4.1  Larger improvements for smaller types coming in future patches.

For:

define <2 x i64> @shl(<4 x i32> %r, <4 x i32> %a) nounwind readnone ssp {
entry:
  %shl = shl <4 x i32> %r, %a                     ; <<4 x i32>> [#uses=1]
  %tmp2 = bitcast <4 x i32> %shl to <2 x i64>     ; <<2 x i64>> [#uses=1]
  ret <2 x i64> %tmp2
}

We get:

_shl:                                   ## @shl
	pslld	$23, %xmm1
	paddd	LCPI0_0, %xmm1
	cvttps2dq	%xmm1, %xmm1
	pmulld	%xmm1, %xmm0
	ret

Instead of:

_shl:                                   ## @shl
	pshufd	$3, %xmm0, %xmm2
	movd	%xmm2, %eax
	pshufd	$3, %xmm1, %xmm2
	movd	%xmm2, %ecx
	shll	%cl, %eax
	movd	%eax, %xmm2
	pshufd	$1, %xmm0, %xmm3
	movd	%xmm3, %eax
	pshufd	$1, %xmm1, %xmm3
	movd	%xmm3, %ecx
	shll	%cl, %eax
	movd	%eax, %xmm3
	punpckldq	%xmm2, %xmm3
	movd	%xmm0, %eax
	movd	%xmm1, %ecx
	shll	%cl, %eax
	movd	%eax, %xmm2
	movhlps	%xmm0, %xmm0
	movd	%xmm0, %eax
	movhlps	%xmm1, %xmm1
	movd	%xmm1, %ecx
	shll	%cl, %eax
	movd	%eax, %xmm0
	punpckldq	%xmm0, %xmm2
	movdqa	%xmm2, %xmm0
	punpckldq	%xmm3, %xmm0
	ret

llvm-svn: 109549

269a6da0

Jul 27, 2010
- Make MC use Windows COFF on Windows and add tests. · f8270bdb
  Michael J. Spencer authored Jul 27, 2010
```
llvm-svn: 109494
```
  f8270bdb
- The isLoadFromStackSlot and isStoreToStackSlot have no way of reporting · 96a890a7
  Jakob Stoklund Olesen authored Jul 27, 2010
```
subregister operands like this:

%reg1040:sub_32bit<def> = MOV32rm <fi#-2>, 1, %reg0, 0, %reg0, %reg1040<imp-def>; mem:LD4[FixedStack-2](align=8)

Make them return false when subreg operands are present. VirtRegRewriter is
making bad assumptions otherwise.

This fixes PR7713.

llvm-svn: 109489
```
  96a890a7
- Add assertions that expose the PR7713 miscompilation: Accessing a stack slot · c3c05ed0
  Jakob Stoklund Olesen authored Jul 27, 2010
```
with a too-big register class.

llvm-svn: 109488
```
  c3c05ed0
Jul 26, 2010

On x86, f32 / f64 nodes share the same registers as 128-bit vector values. · d4218b87
Evan Cheng authored Jul 26, 2010
```
llvm-svn: 109450
```
d4218b87

Temporary hack to let codegen assert or generate poor code in case · 36c2ea6c

Bruno Cardoso Lopes authored Jul 26, 2010

we are using AVX and no AVX version of the desired intruction is present,
this is better for incremental dev (without fallbacks it's easier to spot
what's missing). Not sure this is the best hack thought (we can also disable
all HasSSE* predicates by dinamically marking them 'false' if AVX is present)

llvm-svn: 109434

36c2ea6c

Jul 24, 2010

Add an ILP scheduler. This is a register pressure aware scheduler that's · 37b740c4

Evan Cheng authored Jul 24, 2010

appropriate for targets without detailed instruction iterineries.
The scheduler schedules for increased instruction level parallelism in
low register pressure situation; it schedules to reduce register pressure
when the register pressure becomes high.

On x86_64, this is a win for all tests in CFP2000. It also sped up 256.bzip2
by 16%.

llvm-svn: 109300

37b740c4

Support x86 "eiz" and "riz" pseudo index registers in the assembler. · 306a1f97
Bruno Cardoso Lopes authored Jul 24, 2010
```
llvm-svn: 109295
```
306a1f97
Remove trailing whitespace · d65cd1d5
Bruno Cardoso Lopes authored Jul 23, 2010
```
llvm-svn: 109276
```
d65cd1d5

Jul 23, 2010

Add AVX version of CLMUL instructions · ea0e05a3
Bruno Cardoso Lopes authored Jul 23, 2010
```
llvm-svn: 109248
```
ea0e05a3
Declare CLMUL as a subtarget feature · d618c8ac
Bruno Cardoso Lopes authored Jul 23, 2010
```
llvm-svn: 109207
```
d618c8ac
Add x86 CLMUL (Carry-less multiplication) cpu feature · 09dc24be
Bruno Cardoso Lopes authored Jul 23, 2010
```
llvm-svn: 109206
```
09dc24be
Add complete assembler support for FMA3 instructions, with descriptions and... · acd9230b
Bruno Cardoso Lopes authored Jul 23, 2010
```
Add complete assembler support for FMA3 instructions, with descriptions and encodings taken from the AVX manual

llvm-svn: 109204
```
acd9230b

The only supported calling convention for X86-64 uses · f2d75670

Dale Johannesen authored Jul 23, 2010

SSE, so we can't return floating point values if this
is disabled.  Detect this error for clang.

With SSE1 only, f64 is a problem; it can be done, but
neither llvm-gcc nor clang has ever generated correct
code for it.  Since nobody noticed this I think it's
OK to treat it as an error for now.

This also handles SSE-sized vectors of floating point.
8207686, 8204109.

llvm-svn: 109201

f2d75670

Fix some AVX instructions which didnt had HasAVX prefix. And also a problem... · e29e3896

Bruno Cardoso Lopes authored Jul 23, 2010

Fix some AVX instructions which didnt had HasAVX prefix. And also a problem with PINSRW, which was totally wrong because of a typo I introduced previously

llvm-svn: 109198

e29e3896

Jul 22, 2010

Add remaining AVX instructions (most of them dealing with GR64 destinations.... · 0710c74f

Bruno Cardoso Lopes authored Jul 22, 2010

Add remaining AVX instructions (most of them dealing with GR64 destinations. This complete the assembler support for the general AVX ISA. But we still miss instructions from FMA3 and CLMUL specific feature flags, which are now the next step

llvm-svn: 109168

0710c74f

remove the JIT "NeedsExactSize" feature and supporting logic. · 8f3adc90
Chris Lattner authored Jul 22, 2010
```
llvm-svn: 109167
```
8f3adc90
X86MCInstLower now depends on AsmPrinter being around. · b3f608bb
Chris Lattner authored Jul 22, 2010
```
llvm-svn: 109154
```
b3f608bb

instead of migrating it to the MC instruction encoder, just · 083be4d3

Chris Lattner authored Jul 22, 2010

rip out the implementation of X86InstrInfo::GetInstSizeInBytes.
The code being ripped out just implemented a copy and hacked up
version of the (old) instruction encoder, and is buggy and 
terrible in other ways.  Since "GetInstSizeInBytes" is really 
only there to support the JIT's "NeedsExactSize" hook (which
noone is using), just rip out the code.  I will rip out the
NeedsExactSize hook next.

This resolves rdar://7617809 - switch X86InstrInfo::GetInstSizeInBytes to use X86MCCodeEmitter

llvm-svn: 109149

083be4d3

Attempt to fix linking issues with CMake. Please review other CMake users, · 3180f9f5
Chandler Carruth authored Jul 22, 2010
```
especially on other platforms. Is there a better way to fix this.

llvm-svn: 109084
```
3180f9f5

Custom lower the memory barrier instructions and add support · 9a773826

Eric Christopher authored Jul 22, 2010

for lowering without sse2.  Add a couple of new testcases.

Fixes a few libgomp tests and latent bugs.  Remove a few todos.

llvm-svn: 109078

9a773826

80-columns. · a4c435f1
Eric Christopher authored Jul 22, 2010
```
llvm-svn: 109070
```
a4c435f1
Make fast isel win64-aware w.r.t. call-clobbered regs · 68a069a1
Nate Begeman authored Jul 22, 2010
```
llvm-svn: 109069
```
68a069a1

Add more 256-bit forms for a bunch of regular AVX instructions · e3acfd4d

Bruno Cardoso Lopes authored Jul 21, 2010

Add 64-bit (GR64) versions of some instructions (which are not
described in their SSE forms, but are described in AVX)

llvm-svn: 109063

e3acfd4d

Fixes win64. It was broken by a previous patch where I missed the !isWin64 · 350b1a44
Rafael Espindola authored Jul 21, 2010
```
and then forced every register to be a vr128 on win64.

llvm-svn: 109060
```
350b1a44