Commits · afcc62024ed3ce8f30284ab3858e3cb4d141961e · Roger Ferrer / llvm-epi-0.8

Aug 01, 2013

Bugfix for making the DWARF debug strings and labels to code emitted as... · afcc6202

Carlo Kok authored Aug 01, 2013

Bugfix for making the DWARF debug strings and labels to code emitted as secrel32 instead of long opcodes (only for coff). This makes them debuggable with GDB.

fixes Bug 16249 - LLVM generates broken debug info on Windows

llvm-svn: 187597

afcc6202

R600: Add 64-bit float load/store support · 0344cdfe

Tom Stellard authored Aug 01, 2013



* Added R600_Reg64 class
* Added T#Index#.XY registers definition
* Added v2i32 register reads from parameter and global space
* Added f32 and i32 elements extraction from v2f32 and v2i32
* Added v2i32 -> v2f32 conversions

Tom Stellard:
  - Mark vec2 operations as expand.  The addition of a vec2 register
    class made them all legal.

Patch by: Dmitry Cherkassov

Signed-off-by: Dmitry Cherkassov <dcherkassov@gmail.com>
llvm-svn: 187582

0344cdfe

R600: Use 64-bit alignment for 64-bit kernel arguments · 53698938
Tom Stellard authored Aug 01, 2013
```
llvm-svn: 187581
```
53698938
R600/SI: Custom lower i64 ZERO_EXTEND · 98f675a9
Tom Stellard authored Aug 01, 2013
```
llvm-svn: 187580
```
98f675a9
EVEX and compressed displacement encoding for AVX512 · b1266b54
Elena Demikhovsky authored Aug 01, 2013
```
llvm-svn: 187576
```
b1266b54

[SystemZ] Reuse CC results for integer comparisons with zero · fd7f4ae6

Richard Sandiford authored Aug 01, 2013

This also fixes a bug in the predication of LR to LOCR: I'd forgotten
that with these in-place instruction builds, the implicit operands need
to be added manually.  I think this was latent until now, but is tested
by int-cmp-45.c.  It also adds a CC valid mask to STOC, again tested by
int-cmp-45.c.

llvm-svn: 187573

fd7f4ae6

[SystemZ] Prefer comparisons with zero · a075708a

Richard Sandiford authored Aug 01, 2013

Convert >= 1 to > 0, etc.  Using comparison with zero isn't a win on its own,
but it exposes more opportunities for CC reuse (the next patch).

llvm-svn: 187571

a075708a

Add tests for Mips DSP instructions. · deaa618c
Vladimir Medic authored Aug 01, 2013
```
llvm-svn: 187570
```
deaa618c

AArch64: add initial NEON support · 40e9efd7

Tim Northover authored Aug 01, 2013

Patch by Ana Pazos.

- Completed implementation of instruction formats:
AdvSIMD three same
AdvSIMD modified immediate
AdvSIMD scalar pairwise

- Completed implementation of instruction classes
(some of the instructions in these classes
belong to yet unfinished instruction formats):
Vector Arithmetic
Vector Immediate
Vector Pairwise Arithmetic

- Initial implementation of instruction formats:
AdvSIMD scalar two-reg misc
AdvSIMD scalar three same

- Intial implementation of instruction class:
Scalar Arithmetic

- Initial clang changes to support arm v8 intrinsics.
Note: no clang changes for scalar intrinsics function name mangling yet.

- Comprehensive test cases for added instructions
To verify auto codegen, encoding, decoding, diagnosis, intrinsics.

llvm-svn: 187567

40e9efd7

XCore target: Fix Vararg handling · 4be00f8a
Robert Lytton authored Aug 01, 2013
```
llvm-svn: 187565
```
4be00f8a
XCore target: Add byval handling · 4e60a3f4
Robert Lytton authored Aug 01, 2013
```
llvm-svn: 187563
```
4e60a3f4

Xcore target · b4787a15

Robert Lytton authored Aug 01, 2013

Fix emitArrayBound() calling OutStreamer.Emit*() multiple times when trying to print a single line

llvm-svn: 187562

b4787a15

Fix some misc. issues with Mips16 fp stubs. · 302ae6b0

Reed Kotler authored Aug 01, 2013

1) They should never be inlined.
2) A naming inconsistency with gcc mips16
3) Stubs should not have the global attribute

llvm-svn: 187555

302ae6b0

Jul 31, 2013

Added the B9.3.19 SUBS PC, LR, #imm (Thumb2) system instruction. · 78f9572f

Kevin Enderby authored Jul 31, 2013

While the .td entry is nice and all, it takes a pretty gross hack in
ARMAsmParser::ParseInstruction() because of handling of other "subs"
instructions to get it to match.  Ran it by Jim Grosbach and he said it was
about what he expected to make this work given the existing code.

rdar://14214063

llvm-svn: 187530

78f9572f

Revert "R600: Non vector only instruction can be scheduled on trans unit" · ca69a53b
Tom Stellard authored Jul 31, 2013
```
This reverts commit 98ce62780ea7185ba710868bf83c8077e8d7f6d6.

llvm-svn: 187526
```
ca69a53b
R600: Avoid more than 4 literals in the same instruction group at scheduling · bb3f9311
Vincent Lejeune authored Jul 31, 2013
```
llvm-svn: 187515
```
bb3f9311
R600: Non vector only instruction can be scheduled on trans unit · df18804e
Vincent Lejeune authored Jul 31, 2013
```
llvm-svn: 187514
```
df18804e
Reject bitcasts between address spaces with different sizes · 24b49c41
Matt Arsenault authored Jul 31, 2013
```
llvm-svn: 187506
```
24b49c41

[SystemZ] Implement isLegalAddressingMode() · 791bea41

Richard Sandiford authored Jul 31, 2013

The loop optimizers were assuming that scales > 1 were OK. I think this
is actually a bug in TargetLoweringBase::isLegalAddressingMode(),
since it seems to be trying to reject anything that isn't r+i or r+r,
but it has no default case for scales other than 0, 1 or 2. Implementing
the hook for z means that z can no longer test any change there though.

llvm-svn: 187497

791bea41

[SystemZ] Be more careful about inverting CC masks (conditional loads) · ee834382

Richard Sandiford authored Jul 31, 2013

Extend r187495 to conditional loads.  I split this out because the
easiest way seemed to be to force a particular operand order in
SystemZISelDAGToDAG.cpp.

llvm-svn: 187496

ee834382

[SystemZ] Be more careful about inverting CC masks · 3d768e33

Richard Sandiford authored Jul 31, 2013

System z branches have a mask to select which of the 4 CC values should
cause the branch to be taken.  We can invert a branch by inverting the mask.
However, not all instructions can produce all 4 CC values, so inverting
the branch like this can lead to some oddities.  For example, integer
comparisons only produce a CC of 0 (equal), 1 (less) or 2 (greater).
If an integer EQ is reversed to NE before instruction selection,
the branch will test for 1 or 2.  If instead the branch is reversed
after instruction selection (by inverting the mask), it will test for
1, 2 or 3.  Both are correct, but the second isn't really canonical.
This patch therefore keeps track of which CC values are possible
and uses this when inverting a mask.

Although this is mostly cosmestic, it fixes undefined behavior
for the CIJNLH in branch-08.ll.  Another fix would have been
to mask out bit 0 when generating the fused compare and branch,
but the point of this patch is that we shouldn't need to do that
in the first place.

The patch also makes it easier to reuse CC results from other instructions.

llvm-svn: 187495

3d768e33

[SystemZ] Move compare-and-branch generation even later · 8a757bba

Richard Sandiford authored Jul 31, 2013

r187116 moved compare-and-branch generation from the instruction-selection
pass to the peephole optimizer (via optimizeCompare).  It turns out that even
this is a bit too early.  Fused compare-and-branch instructions don't
interact well with predication, where a CC result is needed.  They also
make it harder to reuse the CC side-effects of earlier instructions
(not yet implemented, but the subject of a later patch).

Another problem was that the AnalyzeBranch family of routines weren't
handling compares and branches, so we weren't able to reverse the fused
form in cases where we would reverse a separate branch.  This could have
been fixed by extending AnalyzeBranch, but given the other problems,
I've instead moved the fusing to the long-branch pass, which is also
responsible for the opposite transformation: splitting out-of-range
compares and branches into separate compares and long branches.

I've added a test for the AnalyzeBranch problem.  A test for the
predication problem is included in the next patch, which fixes a bug
in the choice of CC mask.

llvm-svn: 187494

8a757bba

[SystemZ] Postpone NI->RISBG conversion to convertToThreeAddress() · 6a06ba36

Richard Sandiford authored Jul 31, 2013

r186399 aggressively used the RISBG instruction for immediate ANDs,
both because it can handle some values that AND IMMEDIATE can't,
and because it allows the destination register to be different from
the source.  I realized later while implementing the distinct-ops
support that it would be better to leave the choice up to
convertToThreeAddress() instead.  The AND IMMEDIATE form is shorter
and is less likely to be cracked.

This is a problem for 32-bit ANDs because we assume that all 32-bit
operations will leave the high word untouched, whereas RISBG used in
this way will either clear the high word or copy it from the source
register.  The patch uses the z196 instruction RISBLG for this instead.

This means that z10 will be restricted to NILL, NILH and NILF for
32-bit ANDs, but I think that should be OK for now.  Although we're
using z10 as the base architecture, the optimization work is going
to be focused more on z196 and zEC12.

llvm-svn: 187492

6a06ba36

Added INSERT and EXTRACT intructions from AVX-512 ISA. · 67b05fc0

Elena Demikhovsky authored Jul 31, 2013

All insertf*/extractf* functions replaced with insert/extract since we have insertf and inserti forms.
Added lowering for INSERT_VECTOR_ELT / EXTRACT_VECTOR_ELT for 512-bit vectors.
Added lowering for EXTRACT/INSERT subvector for 512-bit vectors.
Added a test.

llvm-svn: 187491

67b05fc0

[SystemZ] Add RISBLG and RISBHG instruction definitions · 6cf80b3e
Richard Sandiford authored Jul 31, 2013
```
The next patch will make use of RISBLG for codegen.

llvm-svn: 187490
```
6cf80b3e

Changed register names (and pointer keywords) to be lower case when using... · efd67d46

Craig Topper authored Jul 31, 2013

Changed register names (and pointer keywords) to be lower case when using Intel X86 assembler syntax.

Patch by Richard Mitton.

llvm-svn: 187476

efd67d46

Preserve fast-math flags when folding (fsub x, (fneg y)) to (fadd x, y). · c7be519d
Owen Anderson authored Jul 30, 2013
```
llvm-svn: 187462
```
c7be519d

Jul 30, 2013

isKnownToBeAPowerOfTwo: Strengthen isKnownToBeAPowerOfTwo's analysis on add instructions · b7d5409a

David Majnemer authored Jul 30, 2013

Call into ComputeMaskedBits to figure out which bits are set on both add
operands and determine if the value is a power-of-two-or-zero or not.

llvm-svn: 187445

b7d5409a

Change behavior of calling bitcasted alias functions. · cacbb237

Matt Arsenault authored Jul 30, 2013

It will now only convert the arguments / return value and call
the underlying function if the types are able to be bitcasted.
This avoids using fp<->int conversions that would occur before.

llvm-svn: 187444

cacbb237

This test may have been sensitive to the ARM ABI... · 3f423dec
Andrew Trick authored Jul 30, 2013
```
llvm-svn: 187442
```
3f423dec
MI Sched fix: assert "Disconnected LRG within the scheduling region." · d9761776
Andrew Trick authored Jul 30, 2013
```
llvm-svn: 187435
```
d9761776
R600/SI: Expand vector fp <-> int conversions · aa313d0a
Tom Stellard authored Jul 30, 2013
```
llvm-svn: 187421
```
aa313d0a

This patch implements parsing of mips FCC register operands. The example... · 643b3987

Vladimir Medic authored Jul 30, 2013

This patch implements parsing of mips FCC register operands. The example instructions have been added to test files.

llvm-svn: 187410

643b3987

[ARM] check bitwidth in PerformORCombine · 0c2ee5a2

Saleem Abdulrasool authored Jul 30, 2013



When simplifying a (or (and B A) (and C ~A)) to a (VBSL A B C) ensure that the
bitwidth of the second operands to both ands match before comparing the negation
of the values.

Split the check of the value of the second operands to the ands.  Move the cast
and variable declaration slightly higher to make it slightly easier to follow.

Bug-Id: 16700
Signed-off-by: Saleem Abdulrasool <compnerd@compnerd.org>
llvm-svn: 187404

0c2ee5a2

[R600] Replicate old DAGCombiner behavior in target specific DAG combine. · e2e0548d

Quentin Colombet authored Jul 30, 2013

build_vector is lowered to REG_SEQUENCE, which is something the register
allocator does a good job at optimizing.

llvm-svn: 187397

e2e0548d

[DAGCombiner] insert_vector_elt: Avoid building a vector twice. · 6bf4baa4

Quentin Colombet authored Jul 30, 2013

This patch prevents the following combine when the input vector is used more
than once.
insert_vector_elt (build_vector elt0, ..., eltN), NewEltIdx, idx
=>
build_vector elt0, ..., NewEltIdx, ..., eltN 

The reasons are:
- Building a vector may be expensive, so try to reuse the existing part of a
  vector instead of creating a new one (think big vectors).
- elt0 to eltN now have two users instead of one. This may prevent some other
  optimizations.

llvm-svn: 187396

6bf4baa4

Move file to X86 and add a triple to fix darwin bots for now. · 4ed04e2e

Eric Christopher authored Jul 30, 2013

The problem is due to the section name being explicitly mentioned in
the IR and differing between the two platforms.

llvm-svn: 187394

4ed04e2e

Fix a truly egregious thinko in anonymous namespace check, · e414ece7

Eric Christopher authored Jul 29, 2013

update testcase to make sure we generate debug info for walrus
by adding a non-trivial constructor and verify that we don't
emit an ODR signature for the type.

llvm-svn: 187393

e414ece7

Make sure we don't emit an ODR hash for types with no name and make · d853ea31
Eric Christopher authored Jul 29, 2013
```
sure the comments for each testcase are a bit easier to distinguish.

llvm-svn: 187392
```
d853ea31
Clarify comments for types contained in anonymous namespaces and · 32d1531a
Eric Christopher authored Jul 29, 2013
```
odr hashes.

llvm-svn: 187391
```
32d1531a