- Jul 31, 2013
-
-
Vincent Lejeune authored
We were using two instructions for a similar purpose: break and predicated break. Only predicated_break was emitted, and it was lowered in R600ControlFlowFinalizer to JUMP;CF_BREAK;POP. This commit simplifies the situation by making AMDILCFGStructurizer emit IF_PREDICATE;BREAK;ENDIF; instead of predicated_break (which is now removed). There is no functionality change. llvm-svn: 187510
-
Matt Arsenault authored
llvm-svn: 187506
-
Richard Sandiford authored
The loop optimizers were assuming that scales > 1 were OK. I think this is actually a bug in TargetLoweringBase::isLegalAddressingMode(), since it seems to be trying to reject anything that isn't r+i or r+r, but it has no default case for scales other than 0, 1 or 2. Implementing the hook for z means that z can no longer test any change there though. llvm-svn: 187497
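A minimal sketch of the shape of this check, assuming a simplified stand-in for TargetLowering::AddrMode (the struct and field names here are illustrative, not LLVM's actual interface): only reg, reg+imm and reg+reg forms are accepted, and any scale other than 0 or 1 is rejected explicitly rather than falling through.

```cpp
#include <cstdint>

// Loose stand-in for TargetLowering::AddrMode: base + displacement
// plus an optional index register with a scale factor.
struct AddrModeSketch {
  int64_t BaseOffs = 0; // displacement
  int64_t Scale = 0;    // 0 = no index register, 1 = plain index register
};

// The shape of the fix described above: anything other than reg,
// reg+imm or reg+reg (i.e. any scale > 1) is rejected explicitly
// instead of falling out of a switch with no default case.
static bool isLegalAddressingModeSketch(const AddrModeSketch &AM) {
  // SystemZ-style 20-bit signed displacement (an assumption for the sketch).
  if (AM.BaseOffs < -(1 << 19) || AM.BaseOffs >= (1 << 19))
    return false;
  // Indexing is fine, but no scale factor can be applied.
  return AM.Scale == 0 || AM.Scale == 1;
}
```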
-
Richard Sandiford authored
Extend r187495 to conditional loads. I split this out because the easiest way seemed to be to force a particular operand order in SystemZISelDAGToDAG.cpp. llvm-svn: 187496
-
Richard Sandiford authored
System z branches have a mask to select which of the 4 CC values should cause the branch to be taken. We can invert a branch by inverting the mask. However, not all instructions can produce all 4 CC values, so inverting the branch like this can lead to some oddities. For example, integer comparisons only produce a CC of 0 (equal), 1 (less) or 2 (greater). If an integer EQ is reversed to NE before instruction selection, the branch will test for 1 or 2. If instead the branch is reversed after instruction selection (by inverting the mask), it will test for 1, 2 or 3. Both are correct, but the second isn't really canonical. This patch therefore keeps track of which CC values are possible and uses this when inverting a mask. Although this is mostly cosmetic, it fixes undefined behavior for the CIJNLH in branch-08.ll. Another fix would have been to mask out bit 0 when generating the fused compare and branch, but the point of this patch is that we shouldn't need to do that in the first place. The patch also makes it easier to reuse CC results from other instructions. llvm-svn: 187495
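A standalone sketch of the mask-inversion idea described above (the names and bit layout are assumptions for illustration, not the SystemZ backend's code): the inverse is taken only over the CC values the producing instruction can actually generate.

```cpp
#include <cassert>

// 4-bit branch mask: one bit per possible CC value 0..3. The exact bit
// ordering doesn't matter for the sketch, only that masks and the set of
// producible CC values use the same layout.
using CCMask = unsigned;

// Invert a branch mask, but only over the CC values the compare can
// actually produce. E.g. an integer compare yields CC 0, 1 or 2, so
// ValidMask has three bits set and the inverse of "equal" is
// "less or greater", not "less, greater or 3".
CCMask invertCCMask(CCMask Mask, CCMask ValidMask) {
  assert((Mask & ~ValidMask) == 0 && "mask tests an impossible CC value");
  return ValidMask & ~Mask;
}
```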
-
Richard Sandiford authored
r187116 moved compare-and-branch generation from the instruction-selection pass to the peephole optimizer (via optimizeCompare). It turns out that even this is a bit too early. Fused compare-and-branch instructions don't interact well with predication, where a CC result is needed. They also make it harder to reuse the CC side-effects of earlier instructions (not yet implemented, but the subject of a later patch). Another problem was that the AnalyzeBranch family of routines weren't handling compares and branches, so we weren't able to reverse the fused form in cases where we would reverse a separate branch. This could have been fixed by extending AnalyzeBranch, but given the other problems, I've instead moved the fusing to the long-branch pass, which is also responsible for the opposite transformation: splitting out-of-range compares and branches into separate compares and long branches. I've added a test for the AnalyzeBranch problem. A test for the predication problem is included in the next patch, which fixes a bug in the choice of CC mask. llvm-svn: 187494
-
Elena Demikhovsky authored
llvm-svn: 187493
-
Richard Sandiford authored
r186399 aggressively used the RISBG instruction for immediate ANDs, both because it can handle some values that AND IMMEDIATE can't, and because it allows the destination register to be different from the source. I realized later while implementing the distinct-ops support that it would be better to leave the choice up to convertToThreeAddress() instead. The AND IMMEDIATE form is shorter and is less likely to be cracked. This is a problem for 32-bit ANDs because we assume that all 32-bit operations will leave the high word untouched, whereas RISBG used in this way will either clear the high word or copy it from the source register. The patch uses the z196 instruction RISBLG for this instead. This means that z10 will be restricted to NILL, NILH and NILF for 32-bit ANDs, but I think that should be OK for now. Although we're using z10 as the base architecture, the optimization work is going to be focused more on z196 and zEC12. llvm-svn: 187492
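RISBG-style instructions select a contiguous run of bits, so a common precondition for using them as an AND is that the mask be a shifted run of ones. A minimal sketch of that test, assuming the simple non-wrapping case (the real backend also allows the run to wrap around the register; `__builtin_ctzll` is the GCC/Clang count-trailing-zeros builtin):

```cpp
#include <cstdint>

// True if Mask is a single contiguous run of ones, possibly shifted,
// e.g. 0x00ff0000. Strip the trailing zeros; what remains must be of
// the form 2^n - 1.
static bool isShiftedRunOfOnes(uint64_t Mask) {
  if (Mask == 0)
    return false;
  uint64_t Shifted = Mask >> __builtin_ctzll(Mask);
  return (Shifted & (Shifted + 1)) == 0;
}
```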
-
Elena Demikhovsky authored
All insertf*/extractf* functions replaced with insert/extract since we have insertf and inserti forms. Added lowering for INSERT_VECTOR_ELT / EXTRACT_VECTOR_ELT for 512-bit vectors. Added lowering for EXTRACT/INSERT subvector for 512-bit vectors. Added a test. llvm-svn: 187491
-
Richard Sandiford authored
The next patch will make use of RISBLG for codegen. llvm-svn: 187490
-
Richard Trieu authored
llvm-svn: 187482
-
Craig Topper authored
llvm-svn: 187477
-
Craig Topper authored
Changed register names (and pointer keywords) to be lower case when using Intel X86 assembler syntax. Patch by Richard Mitton. llvm-svn: 187476
-
Andrew Trick authored
This fix is very lightweight. The same fix already existed for AddRec but was missing for NAry expressions. This is obviously an improvement and I'm unsure how to test compile time problems. Patch by Xiaoyi Guo! llvm-svn: 187475
-
Craig Topper authored
llvm-svn: 187472
-
Craig Topper authored
Patch by Richard Mitton. llvm-svn: 187471
-
Eric Christopher authored
For a testcase like the following: typedef unsigned long uint64_t; typedef struct { uint64_t lo; uint64_t hi; } blob128_t; void add_128_to_128(const blob128_t *in, blob128_t *res) { asm ("PAND %1, %0" : "+Q"(*res) : "Q"(*in)); } where we'll fail to allocate the register for the output constraint, our matching input constraint will not find a register to match, and could try to search past the end of the current operands array. On the idea that we'd like to attempt to keep compilation going to find more errors in the module, change the error cases when we're visiting inline asm IR to return immediately and avoid trying to create a node in the DAG. This leaves us with only a single error message per inline asm instruction, but allows us to safely keep going in the general case. llvm-svn: 187470
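For readability, the testcase quoted above as it would appear in a source file (verbatim from the message; the point is that the output constraint cannot be allocated, and the patch makes that error non-fatal instead of letting the matching-input search run off the end of the operands array):

```c
typedef unsigned long uint64_t;
typedef struct {
  uint64_t lo;
  uint64_t hi;
} blob128_t;

void add_128_to_128(const blob128_t *in, blob128_t *res) {
  asm ("PAND %1, %0" : "+Q"(*res) : "Q"(*in));
}
```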
-
Akira Hatanaka authored
No functionality change. llvm-svn: 187469
-
Akira Hatanaka authored
No functionality change. llvm-svn: 187468
-
Matt Arsenault authored
One form would accept a vector of pointers, and the other did not. Make both accept vectors of pointers, and add an assertion for the number of elements. llvm-svn: 187464
-
Rafael Espindola authored
The Unix one was returning no_such_file_or_directory, but the Windows one was returning success. Update the one caller that was depending on the old behavior. llvm-svn: 187463
-
Owen Anderson authored
llvm-svn: 187462
-
Eric Christopher authored
llvm-svn: 187459
-
Matt Arsenault authored
This avoids constant folding bitcast/ptrtoint/inttoptr combinations that have illegal bitcasts between differently sized address spaces. llvm-svn: 187455
-
Matt Arsenault authored
Apparently dragonegg uses it. llvm-svn: 187454
-
- Jul 30, 2013
-
-
Matt Arsenault authored
llvm-svn: 187448
-
David Majnemer authored
Call into ComputeMaskedBits to figure out which bits are set on both add operands and determine if the value is a power-of-two-or-zero or not. llvm-svn: 187445
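The property being proven is the usual one: a value is a power of two or zero exactly when clearing its lowest set bit leaves zero. A tiny illustration of that predicate (this is the fact the known-bits reasoning establishes for the add, not the LLVM code itself):

```cpp
#include <cstdint>

// True iff V has at most one bit set, i.e. V is zero or a power of two.
// r187445 uses known-bits information on the two add operands to prove
// this property for the sum without knowing its exact value.
static bool isPowerOfTwoOrZero(uint64_t V) {
  return (V & (V - 1)) == 0;
}
```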
-
Matt Arsenault authored
It will now only convert the arguments / return value and call the underlying function if the types are able to be bitcasted. This avoids using fp<->int conversions that would occur before. llvm-svn: 187444
-
Akira Hatanaka authored
llvm-svn: 187443
-
Rafael Espindola authored
llvm-svn: 187441
-
Akira Hatanaka authored
turns "bal" into "bgezal". llvm-svn: 187440
-
Rafael Espindola authored
llvm-svn: 187439
-
Andrew Trick authored
llvm-svn: 187438
-
Andrew Trick authored
When registers must be live throughout the scheduling region, increase the limit for the register class. Once we exceed the original limit, they will be spilled, and there's no point further reducing pressure. This isn't a perfect heuristic, but it avoids a situation where the scheduler could become trapped by trying to achieve the impossible. llvm-svn: 187436
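A sketch of the heuristic as described, with assumed names and a deliberately minimal shape (the scheduler's real interface is more involved): once the live-through registers already exceed the class limit, spilling is unavoidable, so the effective limit is raised to match.

```cpp
#include <algorithm>

// Effective pressure limit for one register class. If LiveThroughRegs
// already exceeds the class's nominal limit, those values will be spilled
// regardless of scheduling decisions, so there is no point holding the
// scheduler to the original, unreachable limit.
static unsigned effectivePressureLimit(unsigned ClassLimit,
                                       unsigned LiveThroughRegs) {
  return std::max(ClassLimit, LiveThroughRegs);
}
```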
-
Andrew Trick authored
llvm-svn: 187435
-
Venkatraman Govindaraju authored
register i7 as a live-in if the current function's return address is taken. This revision fixes PR16269. llvm-svn: 187433
-
Rui Ueyama authored
This is a follow up patch for r187390 to implement the parser for the Windows-style command line. This should follow the rule as described at http://msdn.microsoft.com/en-us/library/windows/desktop/17w5ykft(v=vs.85).aspx Differential Revision: http://llvm-reviews.chandlerc.com/D1235 llvm-svn: 187430
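A self-contained sketch of those rules, assuming the simplified case (whitespace splitting, quote toggling, and the 2n / 2n+1 backslash-before-quote behavior; the "" escape inside a quoted argument is deliberately left out):

```cpp
#include <string>
#include <vector>

// Split a Windows-style command line into arguments following the rules
// documented on MSDN: arguments are separated by whitespace outside quotes;
// a double quote toggles "in quotes" mode; 2n backslashes before a quote
// produce n backslashes and the quote acts as a delimiter; 2n+1 backslashes
// before a quote produce n backslashes and a literal quote.
std::vector<std::string> tokenizeWindowsCommandLine(const std::string &Src) {
  std::vector<std::string> Args;
  std::string Cur;
  bool InQuotes = false, InArg = false;

  for (size_t I = 0, E = Src.size(); I != E;) {
    char C = Src[I];
    if (!InQuotes && (C == ' ' || C == '\t')) {
      if (InArg) {
        Args.push_back(Cur);
        Cur.clear();
        InArg = false;
      }
      ++I;
      continue;
    }
    InArg = true;
    if (C == '\\') {
      size_t N = 0;
      while (I != E && Src[I] == '\\') {
        ++N;
        ++I;
      }
      if (I != E && Src[I] == '"') {
        Cur.append(N / 2, '\\');
        if (N % 2)
          Cur.push_back('"');     // odd count: escaped literal quote
        else
          InQuotes = !InQuotes;   // even count: the quote is a delimiter
        ++I;
      } else {
        Cur.append(N, '\\');      // backslashes not before a quote are literal
      }
      continue;
    }
    if (C == '"') {
      InQuotes = !InQuotes;
      ++I;
      continue;
    }
    Cur.push_back(C);
    ++I;
  }
  if (InArg)
    Args.push_back(Cur);
  return Args;
}
```

For example, under these rules `he said "hello there"` splits into three arguments: `he`, `said`, and `hello there`.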
-
Tom Stellard authored
llvm-svn: 187421
-
Vladimir Medic authored
This patch implements parsing of mips FCC register operands. The example instructions have been added to test files. llvm-svn: 187410
-
Saleem Abdulrasool authored
When simplifying a (or (and B A) (and C ~A)) to a (VBSL A B C), ensure that the bitwidths of the second operands to both ands match before comparing the negation of the values. Split the check of the value of the second operands to the ands. Move the cast and variable declaration slightly higher to make it slightly easier to follow. Bug-Id: 16700 Signed-off-by: Saleem Abdulrasool <compnerd@compnerd.org> llvm-svn: 187404
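The rewrite relies on the bitwise-select identity that VBSL implements: each result bit comes from B where the corresponding bit of A is set, and from C where it is clear. A scalar illustration (plain integers standing in for the NEON vectors):

```cpp
#include <cstdint>

// Bitwise select, the scalar analogue of NEON's VBSL A, B, C:
// result bit i comes from B where A's bit i is 1, from C where it is 0.
// The patch's point is that A and ~A must be compared at the same bit
// width for the (or (and B A) (and C ~A)) pattern to be recognized.
static uint32_t bitwiseSelect(uint32_t A, uint32_t B, uint32_t C) {
  return (B & A) | (C & ~A);
}
```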
-