Commits · 9adfd8aabbdbe7ce06663c98d3c5e8eda29e6ed1 · Roger Ferrer / llvm-epi-0.8

Apr 18, 2013

X86: Add an SSE2 lowering for 64 bit compares when pcmpgtq (SSE4.2) isn't available. · c5578288
Benjamin Kramer authored Apr 18, 2013
```
This pattern started popping up in vectorized min/max reductions.

llvm-svn: 179797
```
c5578288

Allow misaligned stores in x86 fast-isel. · a403d243

Derek Schuff authored Apr 18, 2013

In X86FastISel::X86SelectStore(), improperly aligned stores are rejected and
handled by the DAG-based ISel.  However, X86FastISel::X86SelectLoad() makes
no such requirement.  There doesn't appear to be an x86 architectural
correctness issue with allowing potentially unaligned store instructions.
This patch removes this restriction.

Patch by Jim Stichnot.

llvm-svn: 179774

a403d243

[ms-inline asm] Simplify some logic and add a FIXME for unhandled unary minus. · db003998
Chad Rosier authored Apr 18, 2013
```
llvm-svn: 179765
```
db003998
Make this private method. · c2f055d1
Chad Rosier authored Apr 18, 2013
```
llvm-svn: 179764
```
c2f055d1

Apr 17, 2013

[ms-inline asm] These should be int64_t, not uint64_t. · 6241c1a6
Chad Rosier authored Apr 17, 2013
```
llvm-svn: 179724
```
6241c1a6

[ms-inline asm] Add support for the minus unary operator. Previously, we were · 3124627a

Chad Rosier authored Apr 17, 2013

unable to handle cases such as __asm mov eax, 8*-8.

This patch also attempts to simplify the state machine.  Further, the error
reporting has been improved.  Test cases included, but more will be added to
the clang side shortly.
rdar://13668445

llvm-svn: 179719

3124627a

This patch teaches x86 fast-isel to generate the native div/idiv instructions · 24a36eb3

Eli Bendersky authored Apr 17, 2013

for the sdiv/srem/udiv/urem bitcode instructions.  This is done for the i8,
i16, and i32 types, as well as i64 for the x86_64 target.

Patch by Jim Stichnoth

llvm-svn: 179715

24a36eb3

X86 cost model: Exit before calling getSimpleVT on non-simple VTs · c0c7ff4a
Arnold Schwaighofer authored Apr 17, 2013
```
getSimpleVT can only handle simple value types.

radar://13676022

llvm-svn: 179714
```
c0c7ff4a
[ms-inline asm] Add support for parsing complex immediate expressions. Test · bfb7099e
Chad Rosier authored Apr 17, 2013
```
cases to be submitted on clang side shortly.
rdar://13663768 and PR15760

llvm-svn: 179655
```
bfb7099e

Apr 16, 2013
- Remove unused variable from previous refactor. · 0932a1ff
  Chad Rosier authored Apr 16, 2013
```
llvm-svn: 179611
```
  0932a1ff
- [ms-inline asm] Refactor. No functional change intended. · 5362af90
  Chad Rosier authored Apr 16, 2013
```
llvm-svn: 179610
```
  5362af90
- [ms-inline asm] Remove some dead code. · e10b7b35
  Chad Rosier authored Apr 16, 2013
```
llvm-svn: 179607
```
  e10b7b35
Apr 13, 2013

X86 machine model: reduce SandyBridge and Haswell ILPWindow. · f7fd6b9e

Andrew Trick authored Apr 13, 2013

The initial values were arbitrary. I want them to be more
conservative. This represents the number of latency cycles hidden by
OOO execution. In practice, I think it should be within a small factor
of the complex floating point operation latency so the scheduler can
make some attempt to hide latency even for smallish blocks.

These are by no means the best values, just a starting point for
tuning heuristics. Some benchmarks such as TSVC run faster with this
lower value for SandyBridge. I haven't run anything on Haswell, but
it's shouldn't be 2x SB.

llvm-svn: 179450

f7fd6b9e

Catch another case where SD fails to propagate node order. · 52b8387f

Andrew Trick authored Apr 13, 2013

I need to handle this for the test case in my following scheduler
commit.

Work is already under way to redesign the mechanism for node order
propagation because this case by case approach is unmaintainable.

llvm-svn: 179448

52b8387f

[ms-inline asm] Simplify the logic by using parsePrimaryExpr. No functional · 43554eed
Chad Rosier authored Apr 12, 2013
```
change intended.  Test case previously added in r178568.
Part of rdar://13611297

llvm-svn: 179425
```
43554eed

Apr 12, 2013

[ms-inline asm] Move this logic into a static function as it's only applicable · d383db51
Chad Rosier authored Apr 12, 2013
```
when parsing MS-style inline assembly.  No functional change intended.

llvm-svn: 179407
```
d383db51

[ms-inline asm] Address the FIXME for ImmDisp before brackets. This · e9902d83

Chad Rosier authored Apr 12, 2013

is a follow on to r179393 and r179399.  Test case to be added on
the clang side.
Part of rdar://13453209

llvm-svn: 179403

e9902d83

[ms-inline asm] Have the [ Symbol ] case fall into the more general logic. This · 152749ce
Chad Rosier authored Apr 12, 2013
```
is a follow on to r179393.  Test case to be added on the clang side.
Part of rdar://13453209

llvm-svn: 179399
```
152749ce

[ms-inline asm] Add support for operands that include both a symbol and an · 175d0aee

Chad Rosier authored Apr 12, 2013

immediate displacement.  Specifically, add support for generating the proper IR.
We've been able to parse this for some time now.  Test case to be added on the
clang side.
Part of rdar://13453209

llvm-svn: 179393

175d0aee

[ms-inline asm] Add support for using the LENGTH, TYPE, and SIZE operators with · b67f8057

Chad Rosier authored Apr 11, 2013

variables that use namespace alias qualifiers.  Test case coming on clang side
shortly.
Part of rdar://13499009

llvm-svn: 179343

b67f8057

[ms-inline asm] Add support for using offsetof operator with variables that use · ae7ecd6d
Chad Rosier authored Apr 11, 2013
```
namespace alias qualifiers.  Test case coming on clang side shortly.
Part of rdar://13499009

llvm-svn: 179339
```
ae7ecd6d

[ms-inline asm] Pass a StringRef reference to ParseIntelVarWithQualifier so we · ce03189b

Chad Rosier authored Apr 11, 2013

can build up the identifier string.  No test case as support for looking up
these type of identifiers hasn't been implemented on the clang side.
Part of rdar://13499009

llvm-svn: 179336

ce03189b

Apr 11, 2013

[ms-inline asm] Remove brackets from around a symbol reference in the target · 8fb83300

Chad Rosier authored Apr 11, 2013

specific logic.  This makes the code much less fragile.  Test case coming on the
clang side in a moment.
rdar://13634327

llvm-svn: 179323

8fb83300

Optimize vector select from all 0s or all 1s · 55658d42

Michael Liao authored Apr 11, 2013

As packed comparisons in AVX/SSE produce all 0s or all 1s in each SIMD lane,
vector select could be simplified to AND/OR or removed if one or both values
being selected is all 0s or all 1s.

llvm-svn: 179267

55658d42

Add CLAC/STAC instruction encoding/decoding support · 95d94403

Michael Liao authored Apr 11, 2013

As these two instructions in AVX extension are privileged instructions for
special purpose, it's only expected to be used in inlined assembly.

llvm-svn: 179266

95d94403

Enhance bool simplifcation in X86 to handle more cases · f7bf8705

Michael Liao authored Apr 11, 2013

This patch is revised based on patch from Victor Umansky
<victor.umansky@intel.com>. More cases are handled in X86's bool
simplification, i.e.
- SETCC_CARRY
- value is truncated to i1 with AND

As a by-product, PR5443 is also fixed.

llvm-svn: 179265

f7bf8705

MC: Support COFF image-relative MCSymbolRefs · 1da4529b

Nico Rieck authored Apr 10, 2013

Add support for the COFF relocation types IMAGE_REL_I386_DIR32NB and
IMAGE_REL_AMD64_ADDR32NB for 32- and 64-bit respectively. These are
similar to normal 4-byte relocations except that they do not include
the base address of the image.

Image-relative relocations are used for debug information (32-bit) and
SEH unwind tables (64-bit).

A new MCSymbolRef variant called 'VK_COFF_IMGREL32' is introduced to
specify such relocations. For AT&T assembly, this variant can be accessed
using the symbol suffix '@imgrel'.

llvm-svn: 179240

1da4529b

Apr 10, 2013

fixed xsave, xsaveopt, xrstor mnemonics with intel syntax; added test cases · 394bf148
Kay Tiong Khoo authored Apr 10, 2013
```
llvm-svn: 179223
```
394bf148
fixed to disassemble with tab after mnemonic rather than space · 6f76c210
Kay Tiong Khoo authored Apr 10, 2013
```
llvm-svn: 179215
```
6f76c210

· ddf96b50

Preston Gurd authored Apr 10, 2013

In the X86 back end, getMemoryOperandNo() returns the offset
into the operand array of the start of the memory reference descriptor.

Additional code in EncodeInstruction provides an additional adjustment.

This patch places that additional code in a separate function,
called getOperandBias, so that any caller of getMemoryOperandNo
can also call getOperandBias.

llvm-svn: 179211

ddf96b50

Tidy up, fix and simplify a few of the SMLocs. Prior to r179109 the Start SMLoc · 70f47596

Chad Rosier authored Apr 10, 2013

wasn't always the start of the operand.  If there was a symbol reference, then
Start pointed to that token.  It's very likely there are other places that need
to be updated.

llvm-svn: 179210

70f47596

Remove unused variable. · 53eb7d79
Chad Rosier authored Apr 10, 2013
```
llvm-svn: 179205
```
53eb7d79

Reapply r179115, but use parsePrimaryExpression a little more judiciously. · 1863f4f4

Chad Rosier authored Apr 10, 2013

Test cases that regressed due to r179115, plus a few more, were added in
r179182.  Original commit message below:

[ms-inline asm] Use parsePrimaryExpr in lieu of parseExpression if we need to
parse an identifier.  Otherwise, parseExpression may parse multiple tokens,
which makes it impossible to properly compute an immediate displacement.
An example of such a case is the source operand (i.e., [Symbol + ImmDisp]) in
the below example:

 __asm mov eax, [Symbol + ImmDisp]

Part of rdar://13611297

llvm-svn: 179187

1863f4f4

__sincosf_stret returns sinf / cosf in bits 0:31 and 32:63 of xmm0, not in · ac0469c5
Evan Cheng authored Apr 10, 2013
```
xmm0 / xmm1.

rdar://13599493

llvm-svn: 179141
```
ac0469c5

Apr 09, 2013

Cleanup. No functional change intended. · 18785857
Chad Rosier authored Apr 09, 2013
```
llvm-svn: 179129
```
18785857
Cleanup. No functional change intended. · 10d1d1cc
Chad Rosier authored Apr 09, 2013
```
llvm-svn: 179125
```
10d1d1cc
Revert r179115 as it looks to have killed the ASan tests. · e8d8288d
Chad Rosier authored Apr 09, 2013
```
llvm-svn: 179120
```
e8d8288d

[ms-inline asm] Use parsePrimaryExpr in lieu of parseExpression if we need to · a08f30f0

Chad Rosier authored Apr 09, 2013

parse an identifier.  Otherwise, parseExpression may parse multiple tokens,
which makes it impossible to properly compute an immediate displacement.
An example of such a case is the source operand (i.e., [Symbol + ImmDisp]) in
the below example:

 __asm mov eax, [Symbol + ImmDisp]

The existing test cases exercise this patch.
rdar://13611297

llvm-svn: 179115

a08f30f0

[ms-inline asm] Maintain a StringRef to reference a symbol in a parsed operand, · e81309b3

Chad Rosier authored Apr 09, 2013

rather than deriving the StringRef from the Start and End SMLocs.

Using the Start and End SMLocs works fine for operands such as [Symbol], but
not for operands such as [Symbol + ImmDisp].  All existing test cases that
reference a variable exercise this patch.
rdar://13602265

llvm-svn: 179109

e81309b3

Apr 08, 2013

X86 cost model: Model cost for uitofp and sitofp on SSE2 · f47d2d7f

Arnold Schwaighofer authored Apr 08, 2013

The costs are overfitted so that I can still use the legalization factor.

For example the following kernel has about half the throughput vectorized than
unvectorized when compiled with SSE2. Before this patch we would vectorize it.

unsigned short A[1024];
double B[1024];
void f() {
  int i;
  for (i = 0; i < 1024; ++i) {
    B[i] = (double) A[i];
  }
}

radar://13599001

llvm-svn: 179033

f47d2d7f