- Sep 05, 2010
Chris Lattner authored
llvm-svn: 113116
- Aug 26, 2010
Chris Lattner authored
llvm-svn: 112109
- Aug 23, 2010
Chris Lattner authored
llvm-svn: 111813
- Jul 11, 2010
Jakob Stoklund Olesen authored
We are generating movaps for all XMM register copies, including scalar floating-point values. This is known to be at least as good as movss and movsd for all known architectures up to and including Nehalem because it avoids a partial register stall. The SSEDomainFix pass will switch movaps to movdqa when appropriate (i.e., when operands come from the integer unit). We don't know that switching movaps to movapd has any benefit. The same applies to andps -> pand. llvm-svn: 108096
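For illustration only (the function below is an assumption, not part of the commit): a C sketch of the kind of scalar floating-point code whose XMM register-to-register copies the policy above covers.

/* Hypothetical example: the branchy select below typically needs an
   XMM-to-XMM copy of a scalar double on x86; per the note above, such
   copies are emitted as movaps rather than movsd to avoid a partial
   register stall, and SSEDomainFix may later rewrite the opcode. */
double select_larger(double a, double b)
{
    double r = a;   /* scalar FP value held in an XMM register */
    if (b > a)
        r = b;      /* register-to-register copy on this path  */
    return r;
}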
- Jul 05, 2010
Chris Lattner authored
llvm-svn: 107613
- Jun 03, 2010
Eli Friedman authored
llvm-svn: 105377
Eli Friedman authored
llvm-svn: 105376
- Mar 02, 2010
Dan Gohman authored
respectively. llvm-svn: 97531
- Feb 10, 2010
Dan Gohman authored
llvm-svn: 95781
- Feb 09, 2010
Chris Lattner authored
llvm-svn: 95649
- Feb 04, 2010
Chris Lattner authored
llvm-svn: 95299
- Jan 14, 2010
Chris Lattner authored
llvm-svn: 93373
- Feb 04, 2009
Chris Lattner authored
llvm-svn: 63752
Chris Lattner authored
SSE disabled. llvm-svn: 63751
- Jan 28, 2009
Evan Cheng authored
The memory alignment requirement on some of the mov{h|l}p{d|s} patterns is 16 bytes. That is overly strict: these instructions read/write f64 memory locations without any alignment requirement. llvm-svn: 63195
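A hedged C sketch of why 8-byte alignment is enough here (the intrinsics and function name are illustrative, not taken from the commit): each access touches a single f64.

#include <emmintrin.h>

/* Illustrative only: _mm_loadl_pd/_mm_loadh_pd lower to movlpd/movhpd,
   which read one double apiece, so the pointers only need the natural
   8-byte alignment of a double, not 16-byte vector alignment. */
__m128d load_two_halves(const double *lo, const double *hi)
{
    __m128d v = _mm_setzero_pd();
    v = _mm_loadl_pd(v, lo);   /* low 64 bits  <- *lo (movlpd) */
    v = _mm_loadh_pd(v, hi);   /* high 64 bits <- *hi (movhpd) */
    return v;
}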
- Sep 20, 2008
Chris Lattner authored
llvm-svn: 56391
- Aug 19, 2008
Chris Lattner authored
llvm-svn: 54964
- Jun 25, 2008
Evan Cheng authored
shift.
- Add a readme entry for a missing vector_shuffle optimization that results in awful codegen.
llvm-svn: 52740
- May 24, 2008
Evan Cheng authored
llvm-svn: 51526
- May 23, 2008
Evan Cheng authored
llvm-svn: 51501
Dan Gohman authored
llvm-svn: 51491
Evan Cheng authored
llvm-svn: 51487
Chris Lattner authored
instruction for doing this? llvm-svn: 51473
- May 13, 2008
Chris Lattner authored
llvm-svn: 51062
Chris Lattner authored
llvm-svn: 51060
Evan Cheng authored
Instead of a vector load, a shuffle, and then an extract of an element, load the element from the address with an offset:

	pshufd	$1, (%rdi), %xmm0
	movd	%xmm0, %eax
=>
	movl	4(%rdi), %eax

llvm-svn: 51026
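A hedged C rendering of the before/after pattern (function and intrinsics are assumptions for illustration): lane 1 of a freshly loaded vector is extracted, which the transformation folds into one scalar load at offset 4.

#include <emmintrin.h>

/* Illustrative only: vector load, pshufd to move element 1 into lane 0,
   then movd to a GPR -- the sequence that can become movl 4(%rdi), %eax. */
int second_element(const __m128i *p)
{
    __m128i v = _mm_loadu_si128(p);        /* vector load       */
    __m128i s = _mm_shuffle_epi32(v, 1);   /* pshufd $1         */
    return _mm_cvtsi128_si32(s);           /* movd %xmm0, %eax  */
}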
Evan Cheng authored
llvm-svn: 51019
Evan Cheng authored
Xform bitconvert(build_pair(load a, load b)) to a single load if the load locations are at the right offset from each other. llvm-svn: 51008
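A rough C analogue of that DAG pattern (names assumed, not from the commit): two adjacent 32-bit loads paired into an i64 and reinterpreted as f64, which is equivalent to one 64-bit load when the addresses are offset by 4 from the same base.

#include <stdint.h>
#include <string.h>

/* Illustrative sketch of bitconvert(build_pair(load a, load b)). */
double pair_to_double(const uint32_t *p)
{
    uint64_t pair = (uint64_t)p[0] | ((uint64_t)p[1] << 32); /* build_pair */
    double d;
    memcpy(&d, &pair, sizeof d);                             /* bitconvert */
    return d;
}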
- May 11, 2008
Anton Korobeynikov authored
llvm-svn: 50959
- Apr 10, 2008
Chris Lattner authored
llvm-svn: 49466
Chris Lattner authored
llvm-svn: 49465
- Mar 09, 2008
Chris Lattner authored
into a vector of zeros or undef, and when the top part is obviously zero, we can just use movd + shuffle. This allows us to compile vec_set-B.ll into:

_test3:
	movl	$1234567, %eax
	andl	4(%esp), %eax
	movd	%eax, %xmm0
	ret

instead of:

_test3:
	subl	$28, %esp
	movl	$1234567, %eax
	andl	32(%esp), %eax
	movl	%eax, (%esp)
	movl	$0, 4(%esp)
	movq	(%esp), %xmm0
	addl	$28, %esp
	ret

llvm-svn: 48090
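A hedged C analogue of _test3 (the intrinsic form is an assumption, not the actual vec_set-B.ll source): the masked value obviously has a zero upper half, so it can be widened into the vector with movd + shuffle rather than a stack round trip.

#include <emmintrin.h>

/* Illustrative only: rough equivalent of the test compiled above. */
__m128i test3_analogue(unsigned x)
{
    unsigned y = x & 1234567;               /* andl $1234567, ...        */
    return _mm_set_epi64x(0, (long long)y); /* movd + implicit zero high */
}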
Chris Lattner authored
llvm-svn: 48064
Chris Lattner authored
#include <xmmintrin.h>
__m128i doload64(short x) {return _mm_set_epi16(0,0,0,0,0,0,0,1);}

into:

	movl	$1, %eax
	movd	%eax, %xmm0
	ret

instead of a constant pool load. llvm-svn: 48063
- Mar 08, 2008
Chris Lattner authored
llvm-svn: 48055
Chris Lattner authored
llvm-svn: 48054
- Mar 05, 2008
Chris Lattner authored
llvm-svn: 47948
Chris Lattner authored
llvm-svn: 47939
- Mar 02, 2008
Chris Lattner authored
llvm-svn: 47828
- Feb 14, 2008
Chris Lattner authored
llvm-svn: 47109