Commits · 11cc8b3c143ebda2b7a009fd96ddb5e05dad983e · Roger Ferrer / llvm-epi-0.8

Dec 20, 2007
- Fix JIT encoding for CMPSD as well. · 01c7c198
  Evan Cheng authored Dec 20, 2007
```
llvm-svn: 45268
```
  01c7c198
Dec 18, 2007

Add "mayHaveSideEffects" and "neverHasSideEffects" flags to some instructions. I · b3d85a5d

Bill Wendling authored Dec 17, 2007

based what flag to set on whether it was already marked as
"isRematerializable". If there was a further check to determine if it's "really"
rematerializable, then I marked it as "mayHaveSideEffects" and created a check
in the X86 back-end similar to the remat one.

llvm-svn: 45132

b3d85a5d

Dec 16, 2007

Fix the JIT encoding of cmp*ss, which aborts with this assertion currently: · dab6bd90

Chris Lattner authored Dec 16, 2007

X86CodeEmitter.cpp:378: failed assertion `0 && "Immediate size not set!"'

I *think* this is right, but Evan, please verify.  It also looks like
CMPSDrr and maybe others are missing this info.  Evan, plz investigate.

llvm-svn: 45074

dab6bd90

Dec 15, 2007
- Make better use of instructions that clear high bits; fix various 2-wide shuffle bugs. · 23d2d4dc
  Evan Cheng authored Dec 15, 2007
```
llvm-svn: 45058
```
  23d2d4dc
Dec 13, 2007

Implicit def instructions, e.g. X86::IMPLICIT_DEF_GR32, are always re-materializable and they should not be spilled. · 6e68381e

Evan Cheng authored Dec 12, 2007

llvm-svn: 44960

6e68381e

Dec 06, 2007
- Remove a bogus optimization. It's not possible to do a move to low element to a <8 x i16> or <16 x i8> vector. · c829e5cd
  Evan Cheng authored Dec 06, 2007
```
llvm-svn: 44669
```
  c829e5cd
Nov 25, 2007

Fix a long standing deficiency in the X86 backend: we would · 5728bdd4

Chris Lattner authored Nov 25, 2007

sometimes emit "zero" and "all one" vectors multiple times,
for example:

_test2:
	pcmpeqd	%mm0, %mm0
	movq	%mm0, _M1
	pcmpeqd	%mm0, %mm0
	movq	%mm0, _M2
	ret

instead of:

_test2:
	pcmpeqd	%mm0, %mm0
	movq	%mm0, _M1
	movq	%mm0, _M2
	ret

This patch fixes this by always arranging for zero/one vectors
to be defined as v4i32 or v2i32 (SSE/MMX) instead of letting them be
any random type.  This ensures they get trivially CSE'd on the dag.
This fix is also important for LegalizeDAGTypes, as it gets unhappy
when the x86 backend wants BUILD_VECTOR(i64 0) to be legal even when
'i64' isn't legal.

This patch makes the following changes:

1) X86TargetLowering::LowerBUILD_VECTOR now lowers 0/1 vectors into
   their canonical types.
2) The now-dead patterns are removed from the SSE/MMX .td files.
3) All the patterns in the .td file that referred to immAllOnesV or
   immAllZerosV in the wrong form now use *_bc to match them with a
   bitcast wrapped around them.
4) X86DAGToDAGISel::SelectScalarSSELoad is generalized to handle 
   bitcast'd zero vectors, which simplifies the code actually.
5) getShuffleVectorZeroOrUndef is updated to generate a shuffle that
   is legal, instead of generating one that is illegal and expecting
   a later legalize pass to clean it up.
6) isZeroShuffle is generalized to handle bitcast of zeros.
7) several other minor tweaks.

This patch is definite goodness, but has the potential to cause random
code quality regressions.  Please be on the lookout for these and let 
me know if they happen.

llvm-svn: 44310

5728bdd4
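
The CSE argument in the commit above can be illustrated with a toy model. This is only a sketch, assuming a DAG whose nodes are uniqued on (opcode, type, operands); the `ToyDAG` class, its methods, and the type strings are hypothetical and are not LLVM's SelectionDAG API.

```python
# Toy model of a selection DAG with CSE keyed on (opcode, type, operands).
# Node layout and names are illustrative assumptions, not LLVM's classes.

class ToyDAG:
    def __init__(self):
        self._nodes = {}  # CSE map: structural key -> node

    def get_node(self, opcode, vtype, operands=()):
        key = (opcode, vtype, tuple(operands))
        # Structurally identical nodes are shared rather than recreated.
        return self._nodes.setdefault(key, key)

    def all_ones(self, wanted_type, canonical=True):
        if not canonical:
            # Old behaviour: build the constant directly in the requested type.
            return self.get_node("all_ones", wanted_type)
        # New behaviour: build it as v4i32, then wrap it in a bitcast if needed.
        base = self.get_node("all_ones", "v4i32")
        if wanted_type == "v4i32":
            return base
        return self.get_node("bitcast", wanted_type, [base])

    def count(self, opcode):
        return sum(1 for key in self._nodes if key[0] == opcode)


def count_all_ones_nodes(canonical):
    """Request an all-ones vector in three types; count distinct constants."""
    dag = ToyDAG()
    for vtype in ("v2i64", "v4i32", "v8i16"):
        dag.all_ones(vtype, canonical=canonical)
    return dag.count("all_ones")
```

With a constant built per requested type, each type yields its own node, so nothing is shared. With the canonical type, every request reuses the single v4i32 node, and only the cheap bitcast wrappers differ.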

Nov 17, 2007
- Add support for vectors to int <-> float casts. · d4d45c26
  Nate Begeman authored Nov 17, 2007
```
llvm-svn: 44204
```
  d4d45c26
Oct 30, 2007
- Add missing SSE builtins: CVTPD2PI, CVTPS2PI, · d50c8bce
  Dale Johannesen authored Oct 30, 2007
```
CVTTPD2PI, CVTTPS2PI, CVTPI2PD, CVTPI2PS.

llvm-svn: 43523
```
  d50c8bce
Oct 12, 2007

Corrected many typing errors. And removed 'nest' parameter handling · 1f0da1fe

Arnold Schwaighofer authored Oct 12, 2007

for fastcc from X86CallingConv.td.  This means that nested functions
are not supported for calling convention 'fastcc'.

llvm-svn: 42934

1f0da1fe

Oct 11, 2007
- Add missing argument to PALIGNR · 62f65edc
  Dale Johannesen authored Oct 11, 2007
```
llvm-svn: 42874
```
  62f65edc
Oct 06, 2007

Added DAG xforms. e.g. · f4b5d491

Evan Cheng authored Oct 06, 2007

(vextract (v4f32 s2v (f32 load $addr)), 0) -> (f32 load $addr) 
(vextract (v4i32 bc (v4f32 s2v (f32 load $addr))), 0) -> (i32 load $addr)
Remove x86 specific patterns.

llvm-svn: 42677

f4b5d491
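
The two rewrites in the commit above can be sketched as a small pattern match over nested tuples. This is an illustration only: the tuple encoding and the `fold_vextract` helper are hypothetical, and the real transforms operate on SelectionDAG nodes inside LLVM.

```python
# Nodes are tuples (hypothetical encoding):
#   ("vextract", vector, index)
#   ("s2v", "v4f32", scalar)      scalar_to_vector
#   ("bc", "v4i32", vector)       bitcast
#   ("load", scalar_type, address)

def fold_vextract(node):
    """Fold element-0 extraction of a scalar-to-vector load into a scalar load."""
    if not (isinstance(node, tuple) and node[0] == "vextract"):
        return node
    _, vec, index = node
    if index != 0:
        return node  # Only element 0 holds the loaded scalar.

    result_type = "f32"
    if vec[0] == "bc" and vec[1] == "v4i32":
        # Look through the bitcast; the folded load then produces an i32.
        vec = vec[2]
        result_type = "i32"

    if vec[0] == "s2v" and vec[1] == "v4f32":
        scalar = vec[2]
        if scalar[0] == "load" and scalar[1] == "f32":
            return ("load", result_type, scalar[2])
    return node
```

Both forms collapse to a single scalar load of the same address, and any other shape is returned unchanged.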

Oct 01, 2007
- Typo. X86comi doesn't read / write chains. · a1b7e950
  Evan Cheng authored Oct 01, 2007
```
llvm-svn: 42492
```
  a1b7e950
Sep 29, 2007
- Enabling new condition code modeling scheme. · 5fb5a1f3
  Evan Cheng authored Sep 29, 2007
```
llvm-svn: 42459
```
  5fb5a1f3
Sep 25, 2007

Added support for new condition code modeling scheme (i.e. physical register... · e95f391e

Evan Cheng authored Sep 25, 2007

Added support for new condition code modeling scheme (i.e. physical register dependency). These are a bunch of instructions that are duplicated so the x86 backend can support both the old and new schemes at the same time. They will be deleted after all the kinks are worked out.

llvm-svn: 42285

e95f391e

Sep 23, 2007

Fix PR 1681. When X86 target uses +sse -sse2, · e36c4002

Dale Johannesen authored Sep 23, 2007

keep f32 in SSE registers and f64 in x87.  This
is effectively a new codegen mode.
Change addLegalFPImmediate to permit float and
double variants to do different things.
Adjust callers.

llvm-svn: 42246

e36c4002
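
The codegen mode described in the commit above amounts to choosing a register file per floating-point type. The following is a rough sketch under that reading; the `fp_register_file` function and its string results are hypothetical and do not correspond to names in the X86 backend.

```python
def fp_register_file(value_type, has_sse1, has_sse2):
    """Pick where a scalar FP value lives, given the enabled SSE levels.

    Assumption: f32 needs only SSE1 to live in an SSE register,
    while f64 needs SSE2; anything else falls back to the x87 stack.
    """
    if value_type == "f32":
        return "sse" if has_sse1 else "x87"
    if value_type == "f64":
        return "sse" if has_sse2 else "x87"
    raise ValueError("unsupported type: " + value_type)
```

With `+sse -sse2`, this gives the mixed mode from the commit message: f32 goes to SSE registers and f64 stays in x87.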

Sep 14, 2007
- Add implicit def of EFLAGS on those instructions that may modify flags. · 483e1ce1
  Evan Cheng authored Sep 14, 2007
```
llvm-svn: 41962
```
  483e1ce1
Sep 11, 2007
- Remove (somewhat confusing) Imp<> helper, use let Defs = [], Uses = [] instead. · 3e18e504
  Evan Cheng authored Sep 11, 2007
```
llvm-svn: 41863
```
  3e18e504
Sep 07, 2007
- Avoid storing and reloading zeros and other constants from stack slots · a95cbb00
  Dan Gohman authored Sep 07, 2007
```
by flagging the associated instructions as being trivially rematerializable.

llvm-svn: 41775
```
  a95cbb00
Aug 30, 2007
- Mark load instructions with isLoad = 1. · c2081fe5
  Evan Cheng authored Aug 30, 2007
```
llvm-svn: 41595
```
  c2081fe5
Aug 11, 2007
- 64-bit SSSE3 ops that use MMX registers don't require 16-byte alignment. · cdbd82ee
  Bill Wendling authored Aug 11, 2007
```
Make a 'memop' pattern just for them.

llvm-svn: 41017
```
  cdbd82ee
Aug 10, 2007
- For kicks, I thought it would be fun to use the correct opcode. · 70146150
  Bill Wendling authored Aug 10, 2007
```
llvm-svn: 40985
```
  70146150
- Adding SSSE3 intrinsics. · 23772069
  Bill Wendling authored Aug 10, 2007
```
llvm-svn: 40982
```
  23772069
Aug 02, 2007

Fix the alignment requirements of several unpck and shuf instructions. · 8932bff7

Dan Gohman authored Aug 02, 2007

Generalize isPSHUFDMask and add a unary SHUFPD pattern so that SHUFPD's
memory operand alignment can be tested as well, with a fix to avoid
breaking MMX's use of isPSHUFDMask.

llvm-svn: 40756

8932bff7

Fix pastos in vector arithmetic intrinsics. · 4d436e2b
Dan Gohman authored Aug 02, 2007
```
llvm-svn: 40754
```
4d436e2b

Mark the SSE and MMX load instructions that · fa3eeeed

Dan Gohman authored Aug 02, 2007

X86InstrInfo::isReallyTriviallyReMaterializable knows how to handle
with the isReMaterializable flag so that it is given a chance to handle
them. Without hoisting constant-pool loads from loops this isn't very
visible, though it does keep CodeGen/X86/constant-pool-remat-0.ll from
making a copy of the constant pool on the stack.

llvm-svn: 40736

fa3eeeed

Aug 01, 2007
- Missing Requires. · da549ece
  Evan Cheng authored Aug 01, 2007
```
llvm-svn: 40691
```
  da549ece
Jul 31, 2007

Change the x86 assembly output to use tab characters to separate the · 54ec4bfa

Dan Gohman authored Jul 31, 2007

mnemonics from their operands instead of single spaces. This makes the
assembly output a little more consistent with various other compilers
(e.g. GCC), and slightly easier to read. Also, update the regression
tests accordingly.

llvm-svn: 40648

54ec4bfa

Redo and generalize previously removed opt for pinsrw: (vextract (v4i32 bc (v4f32 s2v (f32 load ))), 0) -> (i32 load ) · 12c6be84

Evan Cheng authored Jul 31, 2007

llvm-svn: 40628

12c6be84

Jul 27, 2007

Re-apply 40504, but with a fix for the segfault it caused in oggenc: · 4788552d

Dan Gohman authored Jul 27, 2007

Make the alignedload and alignedstore patterns always require 16-byte
alignment. This way when they are used in the "Fs" instructions, in which
a vector instruction is used for a scalar purpose, they can still require
the full vector alignment. And add a regression test for this.

llvm-svn: 40555

4788552d

Reverting 40504 for now. It's breaking oggenc. · 931de40a
Evan Cheng authored Jul 27, 2007
```
llvm-svn: 40547
```
931de40a

Jul 26, 2007
- Fix a whitespace difference between CMPSSrr and CMPSDrr. · cecd4b37
  Dan Gohman authored Jul 26, 2007
```
llvm-svn: 40528
```
  cecd4b37
- Remove X86ISD::LOAD_PACK and X86ISD::LOAD_UA and associated code from the · 8455bd3f
  Dan Gohman authored Jul 26, 2007
```
x86 target, replacing them with the new alignment attributes on memory
references.

llvm-svn: 40504
```
  8455bd3f
Jul 20, 2007

Because we promote SSE logical ops and loads to v2i64, we often end up generating · 8fefeffb

Evan Cheng authored Jul 20, 2007

code that crosses integer / floating point domains (e.g. generate pxor / pand for
logical ops on floating point value, movdqa to load / store floating point SSE
values). Given that, it's better to use movaps instead of movdqa and movups
instead of movdqu. They have the same latency but the "aps" variants are one
byte shorter.
If the domain crossing problem is a real performance issue, then we will have to
fix it with dynamic programming based isel.

llvm-svn: 40076

8fefeffb
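
The "one byte shorter" claim in the commit above can be checked against the register-to-register encodings. The byte sequences below are the standard x86 encodings for `xmm0, xmm1` operands: the integer-domain forms carry a mandatory `66` or `F3` prefix that the "ps" forms lack. The `ENCODINGS` table and `bytes_saved` helper are illustrative names only.

```python
# Encodings of "<op> %xmm1, %xmm0" (AT&T syntax), ModRM byte 0xC1.
ENCODINGS = {
    "movaps": bytes([0x0F, 0x28, 0xC1]),
    "movdqa": bytes([0x66, 0x0F, 0x6F, 0xC1]),  # 0x66 mandatory prefix
    "movups": bytes([0x0F, 0x10, 0xC1]),
    "movdqu": bytes([0xF3, 0x0F, 0x6F, 0xC1]),  # 0xF3 mandatory prefix
}


def bytes_saved(preferred, replaced):
    """Bytes saved per instruction by emitting `preferred` instead of `replaced`."""
    return len(ENCODINGS[replaced]) - len(ENCODINGS[preferred])
```

Both substitutions named in the commit save exactly the prefix byte.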

Fix patterns so we isel the xorps, etc. for floating pt logical SSE ops. DAG combiner may fold away the (bit_convert (load)). · 7ca3555b

Evan Cheng authored Jul 19, 2007

llvm-svn: 40070

7ca3555b

Jul 19, 2007

Change instruction description to split OperandList into OutOperandList and · 94b5a80b

Evan Cheng authored Jul 19, 2007

InOperandList. This gives one piece of important information: # of results
produced by an instruction.
An example of the change:
def ADD32rr  : I<0x01, MRMDestReg, (ops GR32:$dst, GR32:$src1, GR32:$src2),
                 "add{l} {$src2, $dst|$dst, $src2}",
                 [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>;
=>
def ADD32rr  : I<0x01, MRMDestReg, (outs GR32:$dst), (ins GR32:$src1, GR32:$src2),
                 "add{l} {$src2, $dst|$dst, $src2}",
                 [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>;

llvm-svn: 40033

94b5a80b

Jul 18, 2007

Implement initial memory alignment awareness for SSE instructions. Vector loads · 776962a9

Dan Gohman authored Jul 18, 2007

and stores that have a specified alignment of less than 16 bytes now use
instructions that support misaligned memory references.

llvm-svn: 40015

776962a9
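
The selection rule in the commit above can be sketched as a single comparison against the 16-byte vector alignment. The helper name and the specific mnemonics returned are assumptions for illustration; the commit does not say which instructions it selects.

```python
VECTOR_ALIGNMENT = 16  # bytes required by the aligned SSE memory forms


def select_vector_load(alignment_bytes):
    """Pick an SSE load mnemonic from the known alignment of the address.

    Assumption: movaps stands in for the aligned form and movups for the
    form that tolerates misaligned addresses.
    """
    if alignment_bytes >= VECTOR_ALIGNMENT:
        return "movaps"
    return "movups"
```

A reference known only to be 4- or 8-byte aligned takes the misaligned-tolerant form, while a 16-byte-aligned one keeps the aligned form.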

Jul 10, 2007

Define non-intrinsic instructions for vector min, max, sqrt, rsqrt, and rcp, · 57111e7a

Dan Gohman authored Jul 10, 2007

in addition to the intrinsic forms. Add spill-folding entries for these new
instructions, and for the scalar min and max intrinsic instructions which
were missing. And add some preliminary ISelLowering code for using the new
non-intrinsic vector sqrt instruction, and fneg and fabs.

llvm-svn: 38478

57111e7a

Jul 03, 2007
- Fix for PR 1505 (and 1489). Rewrite X87 register · a2b3c175
  Dale Johannesen authored Jul 03, 2007
```
model to include f32 variants.  Some factoring
improvements forthcoming.

llvm-svn: 37847
```
  a2b3c175
Jun 26, 2007

Revert the earlier change that removed the M_REMATERIALIZABLE machine · e8c1e428

Dan Gohman authored Jun 26, 2007

instruction flag, and use the flag along with a virtual member function
hook for targets to override if there are instructions that are only
trivially rematerializable with specific operands (i.e. constant pool
loads).

llvm-svn: 37728

e8c1e428
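
The flag-plus-hook design in the commit above can be sketched as a base class with an overridable check. This is a minimal illustration assuming instructions are plain dictionaries. The class names, the `address_kind` field, and the method names are hypothetical, loosely modelled on the `isReallyTriviallyReMaterializable` hook mentioned elsewhere in this log.

```python
class ToyTargetInstrInfo:
    """Generic side: a static per-opcode flag plus a target hook."""

    def __init__(self, flagged_opcodes):
        self._flagged = set(flagged_opcodes)  # opcodes marked rematerializable

    def is_really_trivially_rematerializable(self, instr):
        # Default hook: the flag alone is sufficient.
        return True

    def is_trivially_rematerializable(self, instr):
        # The flag is a cheap filter; the hook then inspects the operands.
        return (instr["opcode"] in self._flagged
                and self.is_really_trivially_rematerializable(instr))


class ToyX86InstrInfo(ToyTargetInstrInfo):
    def is_really_trivially_rematerializable(self, instr):
        # A load is only safe to redo when it reads from the constant pool.
        if instr["opcode"] == "load":
            return instr.get("address_kind") == "constant_pool"
        return True
```

The flag keeps the common query cheap, and the virtual hook lets a target reject specific operand combinations, such as loads from arbitrary memory.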