- Dec 16, 2013
Elena Demikhovsky authored
Added scalar compares VCMPSS and VCMPSD. Implemented LowerSELECT for scalar FP operations. Replaced FSETCCss and FSETCCsd with a single node type, FSETCCs. The node extract_vector_elt(v16i1/v8i1, idx) returns an element of type i1. llvm-svn: 197384
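A minimal sketch of the kind of scalar FP select this lowering covers (an illustration only, not the exact DAG the commit produces): under AVX-512 the compare can be emitted as a VCMPSD into a mask register, with the select keyed off the resulting i1.

    // Hypothetical example: select(setcc(a, b), a, b) on scalar doubles.
    // With AVX-512, the setcc can lower to VCMPSD producing a mask bit.
    double min_like(double a, double b) {
      return a < b ? a : b;
    }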
- Dec 14, 2013
Juergen Ributzka authored
llvm-svn: 197316
- Dec 13, 2013
Rafael Espindola authored
llvm-svn: 197249
Benjamin Kramer authored
While it's safe for the X86-specific shift nodes, DAG combining will kill generic nodes. Insert an AND to make it safe; isel will nuke it, as x86's shift instructions have an implicit AND. Fixes PR16108, which contains a contraption to hit this case in between constant folders. llvm-svn: 197228
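For context, a C-level sketch of the implicit masking this relies on (an illustration, not the commit's code): x86's variable shifts only look at the low bits of the count, so the inserted AND is free after instruction selection.

    // x86's 32-bit variable shifts compute x << (c & 31) in hardware, so an
    // explicit AND of the shift amount folds into the shl during isel.
    unsigned shl32(unsigned x, unsigned c) {
      return x << (c & 31); // well-defined for any c; matches what shl %cl does
    }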
Kai Nacke authored
Since gcc 4.6, the compiler uses ___chkstk_ms, which has the same semantics as the MS CRT function __chkstk. This simplifies the prologue generation a bit. Reviewed by Rafael Espíndola. llvm-svn: 197205
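A hypothetical function that exercises this path: any frame larger than a page forces the MinGW prologue to probe the stack through the chkstk helper.

    void touch(char *p);

    void big_frame() {
      char buf[8192]; // frame > 4 KiB: the prologue must probe each stack page,
      touch(buf);     // which on MinGW (gcc >= 4.6) goes through ___chkstk_ms
    }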
- Dec 12, 2013
Rafael Espindola authored
GCC 4.7 changed the MinGW ABI. On the LLVM side it means that sret functions don't pop the stack. llvm-svn: 197163
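A sketch of the affected shape (hypothetical code): a struct too large for registers is returned through a hidden sret pointer, and under the new ABI the callee no longer pops that pointer, so it returns with a plain ret rather than ret $4.

    struct Big { int a, b, c, d; };

    // Returned via a hidden sret pointer argument; with GCC >= 4.7 MinGW
    // semantics, the callee leaves that pointer on the stack for the caller.
    Big make_big() { return Big{1, 2, 3, 4}; }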
Andrea Di Biagio authored
a vector packed single/double fp operation followed by a vector insert. The effect is that the backend converts the packed fp instruction followed by a vector insert into a SSE or AVX scalar fp instruction. For example, given the following code:

    __m128 foo(__m128 A, __m128 B) {
      __m128 C = A + B;
      return (__m128) {C[0], A[1], A[2], A[3]};
    }

previously we generated:

    addps %xmm0, %xmm1
    movss %xmm1, %xmm0

we now generate:

    addss %xmm1, %xmm0

llvm-svn: 197145
- Dec 11, 2013
Elena Demikhovsky authored
I moved a test from avx512-vbroadcast-crash.ll to avx512-vbroadcast.ll. I defined the HasAVX512 predicate as an AssemblerPredicate. It means that you should invoke llvm-mc with "-mcpu=knl" to get encodings for AVX-512 instructions. I need this to let the AsmMatcher set different encodings for AVX and AVX-512 instructions that have the same mnemonic and operands (all scalar instructions). llvm-svn: 197041
NAKAMURA Takumi authored
llvm-svn: 196988
Reid Kleckner authored
The combination of inline asm, stack realignment, and dynamic allocas turns out to be too common to reject out of hand. ASan inserts empty inline asm fragments and uses aligned allocas. Compiling any trivial function containing a dynamic alloca with ASan is enough to trigger the check. XFAIL the test cases that would be miscompiled and add one that uses the relevant functionality. llvm-svn: 196986
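A hypothetical reduction of the pattern described here: an empty inline-asm fragment plus a dynamic alloca in one function, roughly what ASan instrumentation produces for any trivial function with a variable-size stack allocation.

    void use(char *p);

    void f(unsigned n) {
      char *p = (char *)__builtin_alloca(n); // dynamic alloca: variable-size frame
      asm volatile("");                      // empty inline asm, as ASan inserts
      use(p);
    }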
- Dec 10, 2013
Rafael Espindola authored
llvm-svn: 196976
David Fang authored
.weak_def_can_be_hidden was not yet supported by the system assembler. llvm-svn: 196970
Reid Kleckner authored
This re-lands commit r196876, which was reverted in r196879. The tests have been fixed to pass on platforms with a stack alignment larger than 4. An update to the clang-side tests will land shortly. llvm-svn: 196939
Tim Northover authored
Most users would be surprised if "isCOFF" and "isMachO" were simultaneously true, unless they'd put the compiler in a box with a gun attached to a photon detector. This makes sure precisely one of the three formats is true for any triple and simplifies some target logic based on that. llvm-svn: 196934
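A minimal sketch of the simplification this invariant enables, using llvm::Triple's isOSBinFormat* predicates: since exactly one of the three formats holds for any triple, a fallthrough chain is exhaustive.

    #include "llvm/ADT/Triple.h"

    // After this change, exactly one object-format predicate is true for any
    // triple, so the final fallthrough really does mean ELF.
    const char *objectFormat(const llvm::Triple &T) {
      if (T.isOSBinFormatMachO()) return "MachO";
      if (T.isOSBinFormatCOFF())  return "COFF";
      return "ELF";
    }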
Andrea Di Biagio authored
immediately after SSE scalar fp instructions like addss or mulss. Added patterns to select SSE scalar fp arithmetic instructions from a scalar fp operation followed by a blend. For example, given the following code:

    __m128 foo(__m128 A, __m128 B) {
      A[0] += B[0];
      return A;
    }

previously we generated:

    addss %xmm0, %xmm1
    movss %xmm1, %xmm0

now we generate:

    addss %xmm1, %xmm0

llvm-svn: 196925
Elena Demikhovsky authored
llvm-svn: 196918
Elena Demikhovsky authored
llvm-svn: 196914
Reid Kleckner authored
This reverts commit r196876. Its tests failed on the bots, so I'll figure it out tomorrow. llvm-svn: 196879
Reid Kleckner authored
For stack frames requiring realignment, three pointers may be needed:

- ebp to address incoming arguments
- esi (could be any callee-saved register) to address locals
- esp to address outgoing arguments

We would use esi unconditionally, without verifying that it did not conflict with inline assembly. This change doesn't do the verification; it simply emits a fatal error on functions that use stack realignment, dynamic SP adjustments, and inline assembly. Because stack realignment is common on Windows, we also no longer assume that MS inline assembly clobbers esp. Instead, we analyze the inline instructions for implicit definitions and check if esp is there. If so, we require the use of a base pointer and consider it in the condition above. Mostly fixes PR16830, but we could try harder to find a non-conflicting base pointer.

Reviewers: sunfish

Differential Revision: http://llvm-reviews.chandlerc.com/D1317

llvm-svn: 196876
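A hypothetical function hitting all three conditions at once (realignment from an over-aligned local, a dynamic SP adjustment, and inline assembly touching the would-be base pointer), the shape this change now rejects with a fatal error:

    void use(char *a, char *b);

    void f(unsigned n) {
      __attribute__((aligned(32))) char big[64]; // forces stack realignment
      char *dyn = (char *)__builtin_alloca(n);   // dynamic esp adjustment
      asm volatile("" ::: "esi");                // inline asm clobbering esi
      use(big, dyn);
    }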
- Dec 09, 2013
Rafael Espindola authored
This matches the behavior of both msvc and mingw. llvm-svn: 196814
- Dec 06, 2013
Cameron McInally authored
llvm-svn: 196581
- Dec 05, 2013
Rafael Espindola authored
getSymbolWithGlobalValueBase is used to create the name of a new symbol based on the name of an existing GV. Assert that, and then remove the last call that passes true to isImplicitlyPrivate. This gives the mangler API a 1:1 mapping from GV to names, which is what we need to drop the mangler's dependency on the target (and use an extended datalayout instead). llvm-svn: 196472
Alp Toker authored
This patch tries to avoid unrelated changes other than fixing a few hyphen-related ambiguities and contractions in nearby lines. llvm-svn: 196471
Rafael Espindola authored
given

    declare void @llvm.memset.p0i8.i32(i8* nocapture, i8, i32, i32, i1)
    declare void @foo()
    define void @bar() {
      call void @foo()
      call void @llvm.memset.p0i8.i32(i8* null, i8 0, i32 188, i32 1, i1 false)
      ret void
    }

we used to produce

    L_foo$stub:
        .indirect_symbol _foo
        .ascii "\364\364\364\364\364"
    _memset$stub:
        .indirect_symbol _memset
        .ascii "\364\364\364\364\364"

We now produce a private stub for memset too. Stubs are not needed with recent linkers, but we still produce them for darwin8. Thanks to David Fang for confirming that gcc used to do this too. llvm-svn: 196468
Cameron McInally authored
Patch by Aleksey Bader. llvm-svn: 196435
Kevin Enderby authored
Where it would use a scattered relocation entry but falls back to a normal relocation entry because the FixupOffset is more than 24 bits. The bug is in X86MachObjectWriter::RecordScatteredRelocation(), where it changes the reference parameter FixedValue but then returns false to indicate it did not create a scattered relocation entry. The fix is simply to save the original value of the parameter FixedValue at the start of the method and restore it if we are returning false in that case. rdar://15526046 llvm-svn: 196432
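A condensed sketch of the save/restore pattern the fix describes (hypothetical signature; not the actual method body):

    // Preserve the caller-visible FixedValue when bailing out to a normal
    // relocation, so earlier modifications to it don't leak out.
    bool recordScattered(uint32_t FixupOffset, uint64_t &FixedValue) {
      uint64_t OriginalFixedValue = FixedValue; // save on entry
      // ... FixedValue may be adjusted while building the scattered entry ...
      if (FixupOffset > 0xffffff) {       // offset needs more than 24 bits
        FixedValue = OriginalFixedValue;  // restore before giving up
        return false;                     // caller emits a normal relocation
      }
      return true;
    }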
- Dec 04, 2013
Cameron McInally authored
llvm-svn: 196393
Michael Liao authored
- No test case, as there's no calling convention that preserves only YMM31/ZMM31. llvm-svn: 196391
Cameron McInally authored
Suppress the '(x < y) ? a : 0 -> (x < y) & a' transform on X86 architectures with dedicated mask registers. Patch by Aleksey Bader. llvm-svn: 196386
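An illustration of the pattern in question (scalar C for clarity; the transform itself operates on the SelectionDAG, where the compare result is sign-extended to all-ones before the AND): with AVX-512's dedicated mask registers the compare already yields a k-register bit, so keeping the select form maps directly onto a masked operation.

    // (x < y) ? a : 0  vs.  sext(x < y) & a
    // With mask registers, the select form is preferable, so the
    // rewrite into an AND is suppressed on such targets.
    int pick(int x, int y, int a) {
      return (x < y) ? a : 0;
    }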
Juergen Ributzka authored
llvm-svn: 196334
- Dec 03, 2013
Rafael Espindola authored
Unlike msvc, when handling thiscall + sret, gcc will

* put the sret pointer in %ecx
* put the this pointer at (%esp)

This fixes, for example, calling stringstream::str. llvm-svn: 196312
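A hypothetical C++ shape that takes this path: a method returning a struct by value under thiscall, where the hidden sret pointer and the this pointer swap places relative to the MSVC convention.

    struct Str { char buf[32]; };

    struct Stream {
      Str str(); // thiscall + sret: with gcc, %ecx carries the sret pointer
    };           // and this is passed at (%esp), the reverse of msvc

    Str take(Stream &s) { return s.str(); }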
Michael Liao authored
- The fix to PR17631 fixes part of the cases where 'vzeroupper' should not be issued before a 'call' insn. There are other cases where helper calls will be inserted, not limited to the epilog. These helper calls do not follow the standard calling convention and won't clobber any YMM registers. (So far, all standard calling conventions clobber some or all YMM registers.) This patch enhances the previous fix to cover more cases where 'vzeroupper' should not be inserted, by checking whether the function call will clobber any YMM registers and skipping the insertion if it won't. llvm-svn: 196261
Rafael Espindola authored
No functionality change. llvm-svn: 196170
- Dec 02, 2013
Rafael Espindola authored
This allows it to be used in TargetLoweringObjectFileImpl.cpp. llvm-svn: 196117
Alp Toker authored
This makes the code a little more idiomatic. No change in behaviour. llvm-svn: 196113
Rafael Espindola authored
llvm-svn: 196065
- Dec 01, 2013
Benjamin Kramer authored
- Actually abort when an error occurred.
- Check that the frontend lookup worked when parsing length/size/type operators.

Tested by a clang test. PR18096. llvm-svn: 196044
- Nov 29, 2013
Lang Hames authored
target independent. Most of the x86-specific stackmap/patchpoint handling was necessitated by the use of the native address-mode format for frame index operands. PEI has now been modified to treat stackmap/patchpoint similarly to DEBUG_INFO, allowing us to use a simple, platform-independent register/offset pair for frame indexes on stackmaps/patchpoints.

Notes:
- Folding is now platform independent and automatically supported.
- Emitting patchpoints with direct memory references now just involves calling the TargetLoweringBase::emitPatchPoint utility method from the target's XXXTargetLowering::EmitInstrWithCustomInserter method. (See X86TargetLowering for an example.)
- No more ugly platform-specific operand parsers.

This patch shouldn't change the generated output for X86. llvm-svn: 195944
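A condensed sketch of the hook-up described in the second note (a fragment; the surrounding class and includes are elided): the target's custom inserter simply forwards stackmap/patchpoint pseudo-instructions to the shared utility.

    MachineBasicBlock *
    X86TargetLowering::EmitInstrWithCustomInserter(MachineInstr *MI,
                                                   MachineBasicBlock *BB) const {
      switch (MI->getOpcode()) {
      case TargetOpcode::STACKMAP:
      case TargetOpcode::PATCHPOINT:
        return emitPatchPoint(MI, BB); // shared TargetLoweringBase utility
      // ... other custom-inserted opcodes handled as before ...
      }
      return BB;
    }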
- Nov 28, 2013
Rafael Espindola authored
llvm-svn: 195933
NAKAMURA Takumi authored
I think, in principle, intrinsics_gen may be added explicitly. That said, it can be added incidentally, since each target already has dependencies on llvm-tblgen. Almost all source files depend on both CommonTableGen and intrinsics_gen. Explicit add_dependencies() calls have been pruned under lib/Target. llvm-svn: 195929