Commits · 2ca0ae2a24e4b3ad9a50b7192d854ed0c0ab7fc1 · Roger Ferrer / llvm-epi

Feb 24, 2015

Revert "Raising minimum required CMake version to 2.8.12.2." · 2ca0ae2a

Tobias Grosser authored Feb 24, 2015

This reverts commit r230062.

Debian stable (wheezy) ships still with cmake 2.8.9.

The commit broke my LLVM/Polly buildbot, to my knowledge our only Linux+cmake
buildbot.

llvm-svn: 230343

2ca0ae2a

simplify control flow; NFC · a709f3a5
Sanjay Patel authored Feb 24, 2015
```
llvm-svn: 230342
```
a709f3a5

Revert r230280: "Bugfix: SCEVExpander incorrectly marks increment operations as no-wrap" · 953d6fb8

Hans Wennborg authored Feb 24, 2015

This caused PR22674, failing this assert:

Instructions.h:2281: llvm::Value* llvm::PHINode::getOperand(unsigned int) const: Assertion `i_nocapture < OperandTraits<PHINode>::operands(this) && "getOperand() out of range!"' failed.

llvm-svn: 230341

953d6fb8

[x32] Mark RBX as reserved when EBX is the base pointer. · d2f3b878
Michael Kuperstein authored Feb 24, 2015
```
This should have gone into r230334.

llvm-svn: 230339
```
d2f3b878
fix typo in comment; NFC · 28985485
Sanjay Patel authored Feb 24, 2015
```
llvm-svn: 230338
```
28985485
[x32] x32 should use ebx as the base pointer. · 8ffb4091
Michael Kuperstein authored Feb 24, 2015
```
This fixes the original issue in PR22655, but not the secondary one.

llvm-svn: 230334
```
8ffb4091

[SDAG] Handle LowerOperation returning its input consistently · cec70130

Hal Finkel authored Feb 24, 2015

For almost all node types, if the target requested custom lowering, and
LowerOperation returned its input, we'd treat the original node as legal. This
did not work, however, for many loads and stores, because they follow
slightly different code paths, and we did not account for the possibility of
LowerOperation returning its input at those call sites.

I think that we now handle this consistently everywhere. At the call sites in
LegalizeDAG, we used to assert in this case, so there's no functional change
for any existing code there. For the call sites in LegalizeVectorOps, this
really only affects whether or not we set Changed = true, but I think makes the
semantics clearer.

No test case here, but it will be covered by an upcoming PowerPC commit adding
QPX support.

llvm-svn: 230332

cec70130

[mips] Reformat some TableGen definitions. NFC. · a90f144a

Toma Tabacu authored Feb 24, 2015

Summary: Separated some instruction and pseudo-instruction definitions from InstAlias definitions, added banner for pseudo-instructions and removed a redundant whitespace from a pseudo-instruction definition. No functional change.

Reviewers: dsanders

Reviewed By: dsanders

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D7552

llvm-svn: 230327

a90f144a

Fix alloca_instruments_all_paddings.cc test to work under higher -O levels (llvm part) · f5875d30

Kuba Brecka authored Feb 24, 2015

When AddressSanitizer only a single dynamic alloca and no static allocas, due to an early exit from FunctionStackPoisoner::poisonStack we forget to unpoison the dynamic alloca. This patch fixes that.

Reviewed at http://reviews.llvm.org/D7810

llvm-svn: 230316

f5875d30

[X86] Remove the AbsMem32 type from the assembly parser. Only really need the... · cf51397c

Craig Topper authored Feb 24, 2015

[X86] Remove the AbsMem32 type from the assembly parser. Only really need the 16-bit version which will automatically get prioritized over AbsMem.

llvm-svn: 230313

cf51397c

Beginning of alloca implementation for Mips fast-isel · 5fb7d8b5

Reed Kotler authored Feb 24, 2015

Summary: Begin to add various address modes; including alloca.

Test Plan: Make sure there are no regressions in test-suite at O0/02 in mips32r1/r2

Reviewers: dsanders

Reviewed By: dsanders

Subscribers: echristo, rfuhler, llvm-commits

Differential Revision: http://reviews.llvm.org/D6426

llvm-svn: 230300

5fb7d8b5

Fix handling of negative offsets for AddrModeT2_i8s4 in rewriteT2FrameIndex. · 8e29dec9

Bob Wilson authored Feb 24, 2015

This is a follow up to r230233 to fix something that I noticed by
inspection. The AddrModeT2_i8s4 addressing mode does not support
negative offsets. I spent a good chunk of the day trying to come up with
a testcase for this but was not successful. This addressing mode is used
to spill and restore GPRPair registers in Thumb2 code and that does not
happen often. We also make very limited used of negative offsets when
lowering frame indexes. I am going ahead with the change anyway, because
I am pretty confident that it is correct. I also added a missing assertion
to check that the low bits of the scaled offset are zero.

llvm-svn: 230297

8e29dec9

Fix bug 22641 · b14010d2

Sanjoy Das authored Feb 24, 2015

The bug was a result of getPreStartForExtend interpreting nsw/nuw
flags on an add recurrence more strongly than is legal.  {S,+,X}<nsw>
implies S+X is nsw only if the backedge of the loop is taken at least
once.

NOTE: I had accidentally committed an unrelated change with the commit
message of this change in r230275 (r230275 was reverted in r230279).
This is the correct change for this commit message.

Differential Revision: http://reviews.llvm.org/D7808

llvm-svn: 230291

b14010d2

[LTO API] add lto_codegen_set_module to set the destination module. · 6487ce95

Manman Ren authored Feb 24, 2015

When debugging LTO issues with ld64, we use -save-temps to save the merged
optimized bitcode file, then invoke ld64 again on the single bitcode file to
speed up debugging code generation passes and ld64 stuff after code generation.

llvm linking a single bitcode file via lto_codegen_add_module will generate a
different bitcode file from the single input. With the newly-added
lto_codegen_set_module, we can make sure the destination module is the same as
the input.

lto_codegen_set_module will transfer the ownship of the module to code
generator.

rdar://19024554

llvm-svn: 230290

6487ce95

[LoopAccesses] LAA::getInfo to use const reference for stride parameter · 8bc61df9
Adam Nemet authored Feb 24, 2015
```
And other required const-correctness fixes to make this work.

llvm-svn: 230289
```
8bc61df9

X86: Only use 'lea' in Win64 epilogues if a frame pointer exists · 3aa0bd81

David Majnemer authored Feb 24, 2015

We can only use 'add' in epilogues, 'lea' is not permitted unless we've
established a frame pointer in the prologue.

llvm-svn: 230286

3aa0bd81

New instcombine rule: max(~a,~b) -> ~min(a, b) · 82ea3d45

Sanjoy Das authored Feb 24, 2015

This case is interesting because ScalarEvolutionExpander lowers min(a,
b) as ~max(~a,~b).  I think the profitability heuristics can be made
more clever/aggressive, but this is a start.

Differential Revision: http://reviews.llvm.org/D7821

llvm-svn: 230285

82ea3d45

Bugfix: SCEVExpander incorrectly marks increment operations as no-wrap · 18c243b9

Sanjoy Das authored Feb 23, 2015

When emitting the increment operation, SCEVExpander marks the
operation as nuw or nsw based on the flags on the preincrement SCEV.
This is incorrect because, for instance, it is possible that {-6,+,1}
is <nuw> while {-6,+,1}+1 = {-5,+,1} is not.

This change teaches SCEV to mark the increment as nuw/nsw only if it
can explicitly prove that the increment operation won't overflow.

Apart from the attached test case, another (more realistic) manifestation
of the bug can be seen in Transforms/IndVarSimplify/pr20680.ll.

NOTE: this change was landed with an incorrect commit message in
rL230275 and was reverted for that reason in rL230279.  This commit
message is the correct one.

Differential Revision: http://reviews.llvm.org/D7778

llvm-svn: 230280

18c243b9

Revert 230275. · c9cf0151

Sanjoy Das authored Feb 23, 2015

230275 got committed with an incorrect commit message due to a mixup
on my side.  Will re-land in a few moments with the correct commit
message.

llvm-svn: 230279

c9cf0151

Fix based on post-commit comment on D7816 & rL230177 - BUILD_VECTOR operand... · 662c1d27

Simon Pilgrim authored Feb 23, 2015

Fix based on post-commit comment on D7816 & rL230177 - BUILD_VECTOR operand truncation was using the the BV's output scalar type instead of the input type.

llvm-svn: 230278

662c1d27

Feb 23, 2015

[X86] Teach how to custom lower double-to-half conversions under fast-math. · af3f397b

Andrea Di Biagio authored Feb 23, 2015

This patch teaches the backend how to expand a double-half conversion into
a double-float conversion immediately followed by a float-half conversion.
We do this only under fast-math, and if float-half conversions are legal
for the target.

Added test CodeGen/X86/fastmath-float-half-conversion.ll

Differential Revision: http://reviews.llvm.org/D7832

llvm-svn: 230276

af3f397b

Fix bug 22641 · 913dfd8f

Sanjoy Das authored Feb 23, 2015

The bug was a result of getPreStartForExtend interpreting nsw/nuw
flags on an add recurrence more strongly than is legal.  {S,+,X}<nsw>
implies S+X is nsw only if the backedge of the loop is taken at least
once.

Differential Revision: http://reviews.llvm.org/D7808

llvm-svn: 230275

913dfd8f

Fix invalid cast. · 993502ea

Rafael Espindola authored Feb 23, 2015

Fixes PR22525.

Patch by Ben Longbons with testcase by me.

llvm-svn: 230271

993502ea

X86: Use a smaller 'mov' instruction for stack probe calls · 006c490b

David Majnemer authored Feb 23, 2015

Prologue emission, in some cases, requires calls to a stack probe helper
function.  The amount of stack to probe is passed as a register
argument in the Win64 ABI but the instruction sequence used is
pessimistic: it assumes that the number of bytes to probe is greater
than 4 GB.

Instead, select a more appropriate opcode depending on the number of
bytes we are going to probe.

llvm-svn: 230270

006c490b

X86: Use 'mov' instead of 'lea' in Win64 SEH prologues when possible · 31d868b6

David Majnemer authored Feb 23, 2015

'mov' and 'lea' are equivalent when the displacement applied with 'lea'
is zero.  However, 'mov' should encode smaller.

llvm-svn: 230269

31d868b6

X86: Explain why we cannot use a 'mov' in a Win64 epilogue · b85e023b
David Majnemer authored Feb 23, 2015
```
llvm-svn: 230268
```
b85e023b
X86: Consistently use 'epilogue' instead of 'epilog' · 086f6a7e
David Majnemer authored Feb 23, 2015
```
llvm-svn: 230267
```
086f6a7e
add newline for easier reading; NFC · 27aa1423
Sanjay Patel authored Feb 23, 2015
```
llvm-svn: 230265
```
27aa1423

[AsmPrinter] Access pointers to globals via pcrel GOT entries · 24492b05

Bruno Cardoso Lopes authored Feb 23, 2015

Front-ends could use global unnamed_addr to hold pointers to other
symbols, like @gotequivalent below:

@foo = global i32 42
@gotequivalent = private unnamed_addr constant i32* @foo

@delta = global i32 trunc (i64 sub (i64 ptrtoint (i32** @gotequivalent to i64),
                                    i64 ptrtoint (i32* @delta to i64))
                           to i32)

The global @delta holds a data "PC"-relative offset to @gotequivalent,
an unnamed pointer to @foo. The darwin/x86-64 assembly output for this follows:

 .globl  _foo
_foo:
 .long   42

 .globl  _gotequivalent
_gotequivalent:
 .quad   _foo

 .globl  _delta
_delta:
 .long   _gotequivalent-_delta

Since unnamed_addr indicates that the address is not significant, only
the content, we can optimize the case above by replacing pc-relative
accesses to "GOT equivalent" globals, by a PC relative access to the GOT
entry of the final symbol instead. Therefore, "delta" can contain a pc
relative relocation to foo's GOT entry and we avoid the emission of
"gotequivalent", yielding the assembly code below:

 .globl  _foo
_foo:
 .long   42

 .globl  _delta
_delta:
 .long   _foo@GOTPCREL+4

There are a couple of advantages of doing this: (1) Front-ends that need
to emit a great deal of data to store pointers to external symbols could
save space by not emitting such "got equivalent" globals and (2) IR
constructs combined with this opt opens a way to represent GOT pcrel
relocations by using the LLVM IR, which is something we previously had
no way to express.

Differential Revision: http://reviews.llvm.org/D6922

rdar://problem/18534217

llvm-svn: 230264

24492b05

InstrProf: Teach llvm-cov to show the max count instead of the last · 4d7aae93

Justin Bogner authored Feb 23, 2015

When multiple regions start on the same line, llvm-cov was just
showing the count of the last one as the line count. This can be
confusing and misleading for things like one-liner loops, where the
count at the end isn't very interesting, or even "if" statements with
an opening brace at the end of the line.

Instead, use the maximum of all of the region start counts.

llvm-svn: 230263

4d7aae93

Removing unused private field. · 982ea13c
Andrew Kaylor authored Feb 23, 2015
```
llvm-svn: 230259
```
982ea13c

[X86][MMX] Fix test to reflect current codegen · 1eb8376c

Bruno Cardoso Lopes authored Feb 23, 2015

This test failed in several buildbots, a bit unclear how that happen
since this was the previous behavior before r230248.

llvm-svn: 230258

1eb8376c

Second attempt to fix WinEHCatchDirector build failures. · 322236ee
Andrew Kaylor authored Feb 23, 2015
```
llvm-svn: 230257
```
322236ee
Attempting to fix WinEHCatchDirector destructor related build failures. · 2e30b459
Andrew Kaylor authored Feb 23, 2015
```
llvm-svn: 230252
```
2e30b459
Adding test for Windows EH frame variable remapping. · 1cc6db07
Andrew Kaylor authored Feb 23, 2015
```
llvm-svn: 230250
```
1cc6db07
Remap frame variables for native Windows exception handling. · f22fe4ae
Andrew Kaylor authored Feb 23, 2015
```
Differential Revision: http://reviews.llvm.org/D7770

llvm-svn: 230249
```
f22fe4ae
Revert "[X86][MMX] Add MMX instructions to foldable tables" · 32173cdf
Bruno Cardoso Lopes authored Feb 23, 2015
```
This reverts commit r230226 since it breaks win buildbots.

llvm-svn: 230248
```
32173cdf
Revert "Revert "Raising minimum required CMake version to 2.8.12.2."" · 1df91242
Chad Rosier authored Feb 23, 2015
```
This reverts commit r230240, which was an accidental commit.

llvm-svn: 230246
```
1df91242

Rewrite the global merge pass to be subprogram agnostic for now. · ed47b229

Eric Christopher authored Feb 23, 2015

It was previously using the subtarget to get values for the global
offset without actually checking each function as it was generating
code. Go ahead and solidify the current behavior and make the
existing FIXMEs more prominent.

As a note the ARM backend previously had a thumb1 and non-thumb1
set of defaults. Only the former was tested so I've changed the
behavior to only use that for now.

llvm-svn: 230245

ed47b229

Prevent hoisting fmul from THEN/ELSE to IF if there is fmsub/fmadd opportunity. · 54390053

Chad Rosier authored Feb 23, 2015

This patch adds the isProfitableToHoist API.  For AArch64, we want to prevent a
fmul from being hoisted in cases where it is more profitable to form a
fmsub/fmadd.

Phabricator Review: http://reviews.llvm.org/D7299
Patch by Lawrence Hu <lawrence@codeaurora.org>

llvm-svn: 230241

54390053