Commits · bbd10792c2deefec6625981d0fc08d2560abcb04 · Roger Ferrer / llvm-epi-0.8

Aug 30, 2012

Introduce 'UseSSEx' to force SSE legacy encoding · bbd10792

Michael Liao authored Aug 30, 2012

- Add 'UseSSEx' to force SSE legacy insn not being selected when AVX is
  enabled.

  As the penalty of inter-mixing SSE and AVX instructions, we need
  prevent SSE legacy insn from being generated except explicitly
  specified through some intrinsics. For patterns supported by both
  SSE and AVX, so far, we force AVX insn will be tried first relying on
  AddedComplexity or position in td file. It's error-prone and
  introduces bugs accidentally.

  'UseSSEx' is disabled when AVX is turned on. For SSE insns inherited
  by AVX, we need this predicate to force VEX encoding or SSE legacy
  encoding only.

  For insns not inherited by AVX, we still use the previous predicates,
  i.e. 'HasSSEx'. So far, these insns fall into the following
  categories:
  * SSE insns with MMX operands
  * SSE insns with GPR/MEM operands only (xFENCE, PREFETCH, CLFLUSH,
    CRC, and etc.)
  * SSE4A insns.
  * MMX insns.
  * x87 insns added by SSE.

2 test cases are modified:

 - test/CodeGen/X86/fast-isel-x86-64.ll
   AVX code generation is different from SSE one. 'vcvtsi2sdq' cannot be
   selected by fast-isel due to complicated pattern and fast-isel
   fallback to materialize it from constant pool.

 - test/CodeGen/X86/widen_load-1.ll
   AVX code generation is different from SSE one after fixing SSE/AVX
   inter-mixing. Exec-domain fixing prefers 'vmovapd' instead of
   'vmovaps'.

llvm-svn: 162919

bbd10792

Only perform DAG combine on FMAs of legal types. · e39ad7b5
Craig Topper authored Aug 30, 2012
```
llvm-svn: 162892
```
e39ad7b5

Fix PR13727 · 3c898064

Michael Liao authored Aug 30, 2012

- The root cause is that target constant materialization in X86 fast-isel
  creates a PC-rel addressing which may overflow 32-bit range in non-Small code
  model if .rodata section is allocated too far away from code segment in
  MCJIT, which uses Large code model so far.
- Follow the similar logic to fix non-Small code model in fast-isel by skipping
  non-Small code model.

llvm-svn: 162881

3c898064

Aug 29, 2012
- Make helper function static. · 8f5c5ded
  Benjamin Kramer authored Aug 29, 2012
```
llvm-svn: 162843
```
  8f5c5ded
- Convert FMA4 patterns to use target specific nodes instead of intrinsics to align with FMA3. · a999c662
  Craig Topper authored Aug 29, 2012
```
llvm-svn: 162829
```
  a999c662
- Typo. · 3b1336ce
  Chad Rosier authored Aug 28, 2012
```
llvm-svn: 162807
```
  3b1336ce
- Add comments on the literal value used. · 407d659f
  Michael Liao authored Aug 28, 2012
```
llvm-svn: 162805
```
  407d659f
Aug 28, 2012
- Explicitly update the number of nodes to be traversed · 710e1a59
  Michael Liao authored Aug 28, 2012
```
llvm-svn: 162780
```
  710e1a59
- The commutative flag is already correctly set within the multiclass. If we set · cc567180
  Bill Wendling authored Aug 28, 2012
```
it here, then a 'register-memory' version would wrongly get the commutative
flag.
<rdar://problem/12180135>

llvm-svn: 162741
```
  cc567180
- Convert V_SETALLONES/AVX_SETALLONES/AVX2_SETALLONES to Post-RA pseudos. · 72f51c39
  Craig Topper authored Aug 28, 2012
```
llvm-svn: 162740
```
  72f51c39
- Merge AVX_SET0PSY/AVX_SET0PDY/AVX2_SET0 into a single post-RA pseudo. · bd509eea
  Craig Topper authored Aug 28, 2012
```
llvm-svn: 162738
```
  bd509eea
- Fix PR12312 · b7d85b63
  Michael Liao authored Aug 28, 2012
```
- Add a target-specific DAG optimization to recognize a pattern PTEST-able.
  Such a pattern is a OR'd tree with X86ISD::OR as the root node. When
  X86ISD::OR node has only its flag result being used as a boolean value and
  all its leaves are extracted from the same vector, it could be folded into an
  X86ISD::PTEST node.

llvm-svn: 162735
```
  b7d85b63
- More missing mayLoad flags on AVX multiclasses. · 89d6b29d
  Jakob Stoklund Olesen authored Aug 28, 2012
```
llvm-svn: 162714
```
  89d6b29d
Aug 27, 2012

Remove MMX shift intrinsic handling code that also exists in SelectionDAGBuilder. · a737ef89
Craig Topper authored Aug 27, 2012
```
llvm-svn: 162661
```
a737ef89

Don't allow vextractf128 to be folded with unaligned stores. We don't fold... · 5af2fed5

Craig Topper authored Aug 27, 2012

Don't allow vextractf128 to be folded with unaligned stores. We don't fold unaligned loads so shouldn't fold unaligned stores as it can cause an alignment fault to occur.

llvm-svn: 162658

5af2fed5

Fold some patterns into instruction definitons so tablegen can infer flags... · 6d44554c

Craig Topper authored Aug 27, 2012

Fold some patterns into instruction definitons so tablegen can infer flags removing the need for an explicit 'neverHasSideEffects = 1'

llvm-svn: 162656

6d44554c

Add HasAVX1Only predicate and use it for patterns that have an AVX1... · f7828f91

Craig Topper authored Aug 27, 2012

Add HasAVX1Only predicate and use it for patterns that have an AVX1 instruction and an AVX2 instruction rather than relying on AddedComplexity.

llvm-svn: 162654

f7828f91

Aug 25, 2012
- Fix integer undefined behavior due to signed left shift overflow in LLVM. · 228e6d4c
  Richard Smith authored Aug 24, 2012
```
Reviewed offline by chandlerc.

llvm-svn: 162623
```
  228e6d4c
- Add missing mayLoad flags to a large class of AVX *_Int instructions. · 3d91b43a
  Jakob Stoklund Olesen authored Aug 24, 2012
```
llvm-svn: 162622
```
  3d91b43a
Aug 24, 2012
- Mark X86::RET and RETI instructions as variadic. · b50cf8b3
  Jakob Stoklund Olesen authored Aug 24, 2012
```
There is special magic happening when returning floating point values on
the x87 stack. The RET instructions get extra f80 operands.

llvm-svn: 162592
```
  b50cf8b3
- Remove more mayLoad workarounds. · 8ff666fc
  Jakob Stoklund Olesen authored Aug 24, 2012
```
llvm-svn: 162556
```
  8ff666fc
- Custom lower FMA intrinsics to target specific nodes and remove the patterns. · 663d160a
  Craig Topper authored Aug 24, 2012
```
llvm-svn: 162534
```
  663d160a
- Remove some spurious mayLoad = 0 flags. · d3511235
  Jakob Stoklund Olesen authored Aug 24, 2012
```
They were inserted to silence TableGen's warning about
redundant properties. That warning is now gone.

llvm-svn: 162517
```
  d3511235
- X86MemBarrier has unmodeled side effects. · df1faa05
  Jakob Stoklund Olesen authored Aug 24, 2012
```
llvm-svn: 162514
```
  df1faa05
- Preserve operand flags in convertToThreeAddress() by copying operands. · 70304276
  Jakob Stoklund Olesen authored Aug 23, 2012
```
No test case, this is a generalization of r160260.

llvm-svn: 162485
```
  70304276
Aug 23, 2012
- Favor FMA3 over FMA4 if both are enabled. · 4a4634d6
  Craig Topper authored Aug 23, 2012
```
llvm-svn: 162454
```
  4a4634d6
- Use a switch statement instead of a bunch of if-else checks and pull out the common function call. · f9115974
  Craig Topper authored Aug 23, 2012
```
llvm-svn: 162428
```
  f9115974
Aug 22, 2012

[ms-inline asm] Avoid a false positive assertion · cf172e5e

Chad Rosier authored Aug 22, 2012

Assertion failed: (Start.isValid() == End.isValid() && "Start and end should 
either both be valid or both be invalid!")

when parsing inline asm.  SMLoc assumes that the first char * in the source is
invalid.  However, when parsing an inline asm the mnemonic is at this location.
I don't want to change SMLoc, so use a trivial workaround.

llvm-svn: 162381

cf172e5e

Add a getName function to MachineFunction. Use it in places that previously... · a538d831

Craig Topper authored Aug 22, 2012

Add a getName function to MachineFunction. Use it in places that previously did getFunction()->getName(). Remove includes of Function.h that are no longer needed.

llvm-svn: 162347

a538d831

Don't cache the MBB in the class. Its only used by one function. Change a for... · 056dfccc

Craig Topper authored Aug 22, 2012

Don't cache the MBB in the class. Its only used by one function. Change a for loop over operands to use unsigned instead of int.

llvm-svn: 162344

056dfccc

Mark a function as static since it doesn't use anything in the class. · 455bcafa
Craig Topper authored Aug 22, 2012
```
llvm-svn: 162342
```
455bcafa

Aug 21, 2012
- Fix unaligned memory accesses when performing relocations in X86 JIT. There's · 13473857
  Richard Smith authored Aug 21, 2012
```
no cost to using memcpy here: the fixed code is optimized by LLVM to perfect
machine code.

llvm-svn: 162311
```
  13473857
- [ms-inline asm] Do not report a Parser error when matching inline assembly. · 3d4bc62a
  Chad Rosier authored Aug 21, 2012
```
llvm-svn: 162306
```
  3d4bc62a
- [ms-inline asm] Expose the ErrorInfo from the MatchInstructionImpl. In general, · 79e766c3
  Chad Rosier authored Aug 21, 2012
```
this is the index of the operand that failed to match.

Note: This may cause a buildbot failure due to an API mismatch in clang.  Should
recover with my next commit to clang.

llvm-svn: 162295
```
  79e766c3
- Fix up indentation and remove a couple else's after returns. · bab0c766
  Craig Topper authored Aug 21, 2012
```
llvm-svn: 162270
```
  bab0c766
- Use uint16_t for tables of opcodes. · bfcfdeb5
  Craig Topper authored Aug 21, 2012
```
llvm-svn: 162267
```
  bfcfdeb5
- Fix up indentation. No functional change. · a0cabf19
  Craig Topper authored Aug 21, 2012
```
llvm-svn: 162264
```
  a0cabf19
- Add a couple llvm_unreachables. Add a message to several others. · 4bc3e5a1
  Craig Topper authored Aug 21, 2012
```
llvm-svn: 162263
```
  4bc3e5a1
- Replace a break with llvm_unreachable in the default case of a nested switch.... · 653e7590
  Craig Topper authored Aug 21, 2012
```
Replace a break with llvm_unreachable in the default case of a nested switch. Condense code a bit. No functional change.

llvm-svn: 162261
```
  653e7590
- Cleanup the scalar FMA3 definitions. Add patterns to fold loads with scalar forms. · 384fae2f
  Craig Topper authored Aug 21, 2012
```
llvm-svn: 162260
```
  384fae2f