Commits · 86b4dfac016231d728f060c1a97c664f87582768 · Roger Ferrer / llvm-epi-0.8

Jan 09, 2012
- Enable FISTTP* instructions when AVX is enabled. · c1ab7afe
  Craig Topper authored Jan 08, 2012
```
llvm-svn: 147758
```
  c1ab7afe
Jan 08, 2012
- Don't forget to transfer implicit uses of return instruction. · 4882e488
  Evan Cheng authored Jan 08, 2012
```
llvm-svn: 147752
```
  4882e488
- Reverted commit #147601 upon Evan's request. · 540651cf
  Victor Umansky authored Jan 08, 2012
```
llvm-svn: 147748
```
  540651cf
Jan 07, 2012
- Match SelectionDAG logic for enabling movt. · 083dbdca
  Jakob Stoklund Olesen authored Jan 07, 2012
```
Darwin doesn't do static, and ELF targets only support static.

llvm-svn: 147740
```
  083dbdca
- Fix typo in the X86 backend readme. Patch from Jaeden Amero. · f210619d
  Craig Topper authored Jan 07, 2012
```
llvm-svn: 147739
```
  f210619d
- Remove VectorExtras. This unused helper was written for a type of API that is discouraged now. · 6898db62
  Benjamin Kramer authored Jan 07, 2012
```
llvm-svn: 147738
```
  6898db62
- Remove unnecessary check of hasAVX(). It's already included in hasXMM(). · ca66bba4
  Craig Topper authored Jan 07, 2012
```
llvm-svn: 147734
```
  ca66bba4
- Use getRegForValue() to materialize the address of ARM globals. · 8cdce7e6
  Jakob Stoklund Olesen authored Jan 07, 2012
```
This enables basic local CSE, giving us 20% smaller code for
consumer-typeset in -O0 builds.

<rdar://problem/10658692>

llvm-svn: 147720
```
  8cdce7e6
- Split Finish into Finish and FinishImpl to have a common place to do end of · 07082096
  Rafael Espindola authored Jan 07, 2012
```
file error checking. Use that to error on an unfinished cfi_startproc.

The error is not nice, but is already better than a segmentation fault.

llvm-svn: 147717
```
  07082096
- Copy implicit defs (e.g. r0) when changing tBX_RET to tPOP_RET. This bug is · 501e3095
  Evan Cheng authored Jan 07, 2012
```
exposed with an upcoming change will would delete the copy to return register
because there is no use! It's amazing anything works.

llvm-svn: 147715
```
  501e3095
- Use movw+movt in ARMFastISel::ARMMaterializeGV. · 68f034ee
  Jakob Stoklund Olesen authored Jan 07, 2012
```
This eliminates a lot of constant pool entries for -O0 builds of code
with many global variable accesses.

This speeds up -O0 codegen of consumer-typeset by 2x because the
constant island pass no longer has to look at thousands of constant pool
entries.

<rdar://problem/10629774>

llvm-svn: 147712
```
  68f034ee
- Make the 'x' constraint work for AVX registers as well. · c206d467
  Eric Christopher authored Jan 07, 2012
```
Fixes rdar://10614894

llvm-svn: 147704
```
  c206d467
Jan 06, 2012
- Enable aligned NEON spilling by default. · 68a922c0
  Jakob Stoklund Olesen authored Jan 06, 2012
```
Experiments show this to be a small speedup for modern ARM cores.

llvm-svn: 147689
```
  68a922c0
- Abort AdjustBBOffsetsAfter early when possible. · 69051113
  Jakob Stoklund Olesen authored Jan 06, 2012
```
llvm-svn: 147685
```
  69051113
- Initializing to false makes better sense. Thanks, David. · 64dc8aa4
  Chad Rosier authored Jan 06, 2012
```
llvm-svn: 147679
```
  64dc8aa4
- Fix uninitialized variable warning. · a3d90a94
  Chad Rosier authored Jan 06, 2012
```
llvm-svn: 147676
```
  a3d90a94
- Fix uninitialized variable warning. · 6b64c3c6
  Chad Rosier authored Jan 06, 2012
```
llvm-svn: 147675
```
  6b64c3c6
Jan 05, 2012

Mark scalar FMA4 instructions as ignoring the VEX.L bit. · 29b07374
Craig Topper authored Jan 05, 2012
```
llvm-svn: 147602
```
29b07374

Peephole optimization of ptest-conditioned branch in X86 arch. Performs... · 9255b6d9

Victor Umansky authored Jan 05, 2012

Peephole optimization of ptest-conditioned branch in X86 arch. Performs instruction combining of sequences generated by ptestz/ptestc intrinsics to ptest+jcc pair for SSE and AVX.

Testing: passed 'make check' including LIT tests for all sequences being handled (both SSE and AVX)

Reviewers: Evan Cheng, David Blaikie, Bruno Lopes, Elena Demikhovsky, Chad Rosier, Anton Korobeynikov
llvm-svn: 147601

9255b6d9

Replace the uint64_t -> double convertion algorithm with one that's more efficient. · ac27f0c8

Bill Wendling authored Jan 05, 2012

This small bit of ASM code is sufficient to do what the old algorithm did:

     movq       %rax,  %xmm0
     punpckldq  (c0),  %xmm0  // c0: (uint4){ 0x43300000U, 0x45300000U, 0U, 0U }
     subpd      (c1),  %xmm0  // c1: (double2){ 0x1.0p52, 0x1.0p52 * 0x1.0p32 }
   #ifdef __SSE3__
     haddpd   %xmm0, %xmm0          
   #else
     pshufd   $0x4e, %xmm0, %xmm1 
     addpd    %xmm1, %xmm0
   #endif

It's arguably faster. One caveat, the 'haddpd' instruction isn't very fast on
all processors.
<rdar://problem/7719814>

llvm-svn: 147593

ac27f0c8

Reapply r146997, "Heed spill slot alignment on ARM." · d110e2a8

Jakob Stoklund Olesen authored Jan 05, 2012

Now that canRealignStack() understands frozen reserved registers, it is
safe to use it for aligned spill instructions.

It will only return true if the registers reserved at the beginning of
register allocation allow for dynamic stack realignment.

<rdar://problem/10625436>

llvm-svn: 147579

d110e2a8

Avoid reserving an ARM base pointer during register allocation. · 9cb477db

Jakob Stoklund Olesen authored Jan 05, 2012

Once register allocation has started the reserved registers are frozen.

Fix the ARM canRealignStack() hook to respect the frozen register state.
Now the hook returns false if register allocation was started with frame
pointer elimination enabled.

It also returns false if register allocation started without a reserved
base pointer, and stack realignment would require a base pointer.  This
bug was breaking oggenc on armv6.

No test case, an upcoming patch will use this functionality to realign
the stack for spill slots when possible.

llvm-svn: 147578

9cb477db

Jan 04, 2012
- Silence warnings of a mysterious compiler that still defaults to C89. · 9c48f263
  Benjamin Kramer authored Jan 04, 2012
```
llvm-svn: 147553
```
  9c48f263
- Enable -soft-float for MIPS. · aac3e06b
  Akira Hatanaka authored Jan 04, 2012
```
llvm-svn: 147541
```
  aac3e06b
- Rename immLUiOpnd. · 3b775b8c
  Akira Hatanaka authored Jan 04, 2012
```
llvm-svn: 147519
```
  3b775b8c
- - Define base classes for Jump-and-link instructions and make 32-bit and 64-bit · b89a4bfe
  Akira Hatanaka authored Jan 04, 2012
```
  versions derive from them.
- JALR64 is not needed since N64 does not emit jal. 
- Add template parameter to BranchLink that sets the rt field. 
- Fix the set of temporary registers for O32 and N64.

llvm-svn: 147518
```
  b89a4bfe
- Have getRegForInlineAsmConstraint return the correct register class when target · c669d7a6
  Akira Hatanaka authored Jan 04, 2012
```
is Mips64.

llvm-svn: 147516
```
  c669d7a6
- Fix more places which should be checking for iOS, not darwin. · 801d98b3
  Evan Cheng authored Jan 04, 2012
```
llvm-svn: 147513
```
  801d98b3
- For x86, canonicalize max · 104dbb0f
  Evan Cheng authored Jan 04, 2012
```
(x > y) ? x : y
=>
(x >= y) ? x : y

So for something like
(x - y) > 0 : (x - y) ? 0
It will be
(x - y) >= 0 : (x - y) ? 0

This makes is possible to test sign-bit and eliminate a comparison against
zero. e.g.
subl   %esi, %edi
testl  %edi, %edi
movl   $0, %eax
cmovgl %edi, %eax
=>
xorl   %eax, %eax
subl   %esi, $edi
cmovsl %eax, %edi

rdar://10633221

llvm-svn: 147512
```
  104dbb0f
- Fix 80-column violations. · 6ca97df9
  Chad Rosier authored Jan 03, 2012
```
llvm-svn: 147495
```
  6ca97df9
Jan 03, 2012
- Revert r146997, "Heed spill slot alignment on ARM." · 1b7f2a76
  Jakob Stoklund Olesen authored Jan 03, 2012
```
This patch caused a miscompilation of oggenc because a frame pointer was
suddenly needed halfway through register allocation.

<rdar://problem/10625436>

llvm-svn: 147487
```
  1b7f2a76
- Revert 147426 because it caused pr11696. · 6d31bac8
  Nadav Rotem authored Jan 03, 2012
```
llvm-svn: 147485
```
  6d31bac8
- Enhance DAGCombine for transforming 128->256 casts into a vmovaps, rather · 493c1b31
  Chad Rosier authored Jan 03, 2012
```
then a vxorps + vinsertf128 pair if the original vector came from a load.
rdar://10594409

llvm-svn: 147481
```
  493c1b31
- Fix malformed assert. · b982d8eb
  Matt Beaumont-Gay authored Jan 03, 2012
```
If anybody has strong feelings about 'default: assert(0 && "blah")' vs
'default: llvm_unreachable("blah")', feel free to regularize the instances of
each in this file.

llvm-svn: 147459
```
  b982d8eb
- Intel style asm variant does not need '%' prefix. · c1215324
  Devang Patel authored Jan 03, 2012
```
llvm-svn: 147453
```
  c1215324
Jan 02, 2012

Miscellaneous shuffle lowering cleanup. No functional changes. Primarily... · 5bacb7e9

Craig Topper authored Jan 02, 2012

Miscellaneous shuffle lowering cleanup. No functional changes. Primarily converting the indexing loops to unsigned to be consistent across functions.

llvm-svn: 147430

5bacb7e9

Make CanXFormVExtractWithShuffleIntoLoad reject loads with multiple uses. Also... · 53d55964

Craig Topper authored Jan 02, 2012

Make CanXFormVExtractWithShuffleIntoLoad reject loads with multiple uses. Also make it return false if there's not even a load at all. This makes the code better match the code in DAGCombiner that it tries to match. These two changes prevent some cases where vector_shuffles were making it to instruction selection and causing the older shuffle selection code to be triggered. Also needed to fix a bad pattern that this change exposed. This is the first step towards getting rid of the old shuffle selection support. No test cases yet because there's no way to tell whether a shuffle was handled in the legalize stage or at instruction selection.

llvm-svn: 147428

53d55964

· 6c7a0e6c

Nadav Rotem authored Jan 02, 2012

Optimize the sequence blend(sign_extend(x)) to blend(shl(x)) since SSE blend instructions only look at the highest bit.

llvm-svn: 147426

6c7a0e6c

Jan 01, 2012

Allow CRC32 instructions to be selected when AVX is enabled. · b9109844
Craig Topper authored Jan 01, 2012
```
llvm-svn: 147411
```
b9109844

Fix sfence, lfence, mfence, and clflush to be able to be selected when AVX is... · 1c064e0a

Craig Topper authored Jan 01, 2012

Fix sfence, lfence, mfence, and clflush to be able to be selected when AVX is enabled. Fix monitor and mwait to require SSE3 or AVX, previously they worked even if SSE3 was disabled. Make prefetch instructions not set the execution domain since they don't use XMM registers.

llvm-svn: 147409

1c064e0a