Commits · 3994be87de7aef0ccd1e939963e6c366668f9551 · Roger Ferrer / llvm-epi

Aug 10, 2015

[Sparc] Implement i64 load/store support for 32-bit sparc. · 3994be87

James Y Knight authored Aug 10, 2015

The LDD/STD instructions can load/store a 64bit quantity from/to
memory to/from a consecutive even/odd pair of (32-bit) registers. They
are part of SparcV8, and also present in SparcV9. (Although deprecated
there, as you can store 64bits in one register).

As recommended on llvmdev in the thread "How to enable use of 64bit
load/store for 32bit architecture" from Apr 2015, I've modeled the
64-bit load/store operations as working on a v2i32 type, rather than
making i64 a legal type, but with few legal operations. The latter
does not (currently) work, as there is much code in llvm which assumes
that if i64 is legal, operations like "add" will actually work on it.

The same assumption does not hold for v2i32 -- for vector types, it is
workable to support only load/store, and expand everything else.

This patch:
- Adds a new register class, IntPair, for even/odd pairs of registers.

- Modifies the list of reserved registers, the stack spilling code,
  and register copying code to support the IntPair register class.

- Adds support in AsmParser. (note that in asm text, you write the
  name of the first register of the pair only. So the parser has to
  morph the single register into the equivalent paired register).

- Adds the new instructions themselves (LDD/STD/LDDA/STDA).

- Hooks up the instructions and registers as a vector type v2i32. Adds
  custom legalizer to transform i64 load/stores into v2i32 load/stores
  and bitcasts, so that the new instructions can actually be
  generated, and marks all operations other than load/store on v2i32
  as needing to be expanded.

- Copies the unfortunate SelectInlineAsm hack from ARMISelDAGToDAG.
  This hack undoes the transformation of i64 operands into two
  arbitrarily-allocated separate i32 registers in
  SelectionDAGBuilder. and instead passes them in a single
  IntPair. (Arbitrarily allocated registers are not useful, asm code
  expects to be receiving a pair, which can be passed to ldd/std.)

Also adds a bunch of test cases covering all the bugs I've added along
the way.

Differential Revision: http://reviews.llvm.org/D8713

llvm-svn: 244484

3994be87

rename toELFShdrIter to getSection and move it closer to getSymbol. NFC. · fe0e4e4c
Rafael Espindola authored Aug 10, 2015
```
llvm-svn: 244483
```
fe0e4e4c
toELFSymIter and getSymbol are now the same thing. Merge them. · 19046678
Rafael Espindola authored Aug 10, 2015
```
llvm-svn: 244482
```
19046678

Fix a bunch of trivial cases of 'CHECK[^:]*$' in the tests. NFCI · 49e46ce8

Jon Roelofs authored Aug 10, 2015

I looked into adding a warning / error for this to FileCheck, but there doesn't
seem to be a good way to avoid it triggering on the instances of it in RUN lines.

llvm-svn: 244481

49e46ce8

Use continue to reduce indentation. NFC. · fc2b6fa3
Rafael Espindola authored Aug 10, 2015
```
llvm-svn: 244480
```
fc2b6fa3
[AArch64] Convert a conditional check that will always be true to an assert. NFC. · c56a9132
Chad Rosier authored Aug 10, 2015
```
llvm-svn: 244479
```
c56a9132
Correct non-existing past participle of split in filename · 874b5c21
Michael Kruse authored Aug 10, 2015
```
llvm-svn: 244478
```
874b5c21
Add a test for our handling of shndx. · 904c81dc
Rafael Espindola authored Aug 10, 2015
```
It was already working, but missing a test.

llvm-svn: 244477
```
904c81dc
Recommit r244470+ r244471 together, the bot failed between them. · 2ad3b336
Yaron Keren authored Aug 10, 2015
```
llvm-svn: 244476
```
2ad3b336
Fix typo. · 2bbdbcb8
Filipe Cabecinhas authored Aug 10, 2015
```
llvm-svn: 244475
```
2bbdbcb8
[IndVarSimplify] Make cost estimation in RewriteLoopExitValues smarter · 4709c037
Igor Laevsky authored Aug 10, 2015
```
Differential Revision: http://reviews.llvm.org/D11687

llvm-svn: 244474
```
4709c037

[clang-cl] Add support for CL and _CL_ environment variables · 3a4f9586

David Majnemer authored Aug 10, 2015

cl uses 'CL' and '_CL_' to prepend and append command line options to
the given argument vector.  There is an additional quirk whereby '#' is
transformed into '='.

Differential Revision: http://reviews.llvm.org/D11896

llvm-svn: 244473

3a4f9586

Revert r244470 and 244471 while looking into it. · 1a1e1ca9
Yaron Keren authored Aug 10, 2015
```
llvm-svn: 244472
```
1a1e1ca9
Second part of r244470 (source file was unsaved in editor). · b27259b2
Yaron Keren authored Aug 10, 2015
```
llvm-svn: 244471
```
b27259b2

Really implement David Blaikie suggestion in full of seperating · f850d984

Yaron Keren authored Aug 10, 2015

variable initialization from its usage in the push_back making
collapse of the two statements unlikely even without a comment.

llvm-svn: 244470

f850d984

Allow dosep.py to print dotest.py output on success. · 38e64175

Zachary Turner authored Aug 10, 2015

Previously all test output was reported by each individual
instance of dotest.py.  After a recent patch, dosep gets dotest
outptu via a pipe, and selectively decides which output to
print.

This breaks certain scripts which rely on having full output
of each dotest instance to do various parsing and/or log-scraping.

While we make no promises about the format of dotest output, it's
easy to restore this to the old behavior for now, although it is
behind a flag.  To re-enable full output, run dosep.py with the -s
option.

Differential Revision: http://reviews.llvm.org/D11816
Reviewed By: Chaoren Lin

llvm-svn: 244469

38e64175

Correct x86_64 fp128 calling convention · 241a890b

Chih-Hung Hsieh authored Aug 10, 2015

These changes are for Android x86_64 targets to be compatible
with current Android g++ and conform to AMD64 ABI.

https://llvm.org/bugs/show_bug.cgi?id=23897
  * Return type of long double (fp128) should be fp128, not x86_fp80.
  * Vararg of long double (fp128) could be in register and overflowed to memory.

https://llvm.org/bugs/show_bug.cgi?id=24111
  * Return value of long double (fp128) _Complex should be in memory like a structure of {fp128,fp128}.

Differential Revision: http://reviews.llvm.org/D11437

llvm-svn: 244468

241a890b

Add new llvm.loop.unroll.enable metadata for use with "#pragma unroll". · 397a98d8

Mark Heffernan authored Aug 10, 2015

This change adds the new unroll metadata "llvm.loop.unroll.enable" which directs
the optimizer to unroll a loop fully if the trip count is known at compile time, and
unroll partially if the trip count is not known at compile time. This differs from
"llvm.loop.unroll.full" which explicitly does not unroll a loop if the trip count is not
known at compile time

With this change "#pragma unroll" generates "llvm.loop.unroll.enable" rather than
"llvm.loop.unroll.full" metadata. This changes the semantics of "#pragma unroll" slightly
to mean "unroll aggressively (fully or partially)" rather than "unroll fully or not at all".

The motivating example for this change was some internal code with a loop marked
with "#pragma unroll" which only sometimes had a compile-time trip count depending
on template magic. When the trip count was a compile-time constant, everything works
as expected and the loop is fully unrolled. However, when the trip count was not a
compile-time constant the "#pragma unroll" explicitly disabled unrolling of the loop(!).
Removing "#pragma unroll" caused the loop to be unrolled partially which was desirable
from a performance perspective.

llvm-svn: 244467

397a98d8

Add new llvm.loop.unroll.enable metadata. · 8939154a

Mark Heffernan authored Aug 10, 2015

This change adds the unroll metadata "llvm.loop.unroll.enable" which directs
the optimizer to unroll a loop fully if the trip count is known at compile time, and
unroll partially if the trip count is not known at compile time. This differs from
"llvm.loop.unroll.full" which explicitly does not unroll a loop if the trip count is not
known at compile time.

The "llvm.loop.unroll.enable" is intended to be added for loops annotated with
"#pragma unroll".

llvm-svn: 244466

8939154a

Typo. Move comment closer to relevant code. NFC. · caed6db5
Chad Rosier authored Aug 10, 2015
```
llvm-svn: 244465
```
caed6db5
fix minsize detection: minsize attribute implies optimizing for size · 10294b59
Sanjay Patel authored Aug 10, 2015
```
llvm-svn: 244464
```
10294b59
fix minsize detection: minsize attribute implies optimizing for size · 0f12d71b
Sanjay Patel authored Aug 10, 2015
```
llvm-svn: 244463
```
0f12d71b
Protect template argument from user interference. · 08142fa6
Joerg Sonnenberger authored Aug 10, 2015
```
llvm-svn: 244462
```
08142fa6
Fully apply David Blaikie suggestion and add comment explaining why. · 0b4c9693
Yaron Keren authored Aug 10, 2015
```
llvm-svn: 244461
```
0b4c9693
fix minsize detection: minsize attribute implies optimizing for size · 68b0325a
Sanjay Patel authored Aug 10, 2015
```
llvm-svn: 244460
```
68b0325a

Make StmtSet a list. · d6c30160

Johannes Doerfert authored Aug 10, 2015

  With a deque (or any other sequential container) it is not sound to
  take the address of the elements when the container is still under
  construction. With a pointer based container this is save.

llvm-svn: 244459

d6c30160

fix minsize detection: minsize attribute implies optimizing for size · 9a9003d9
Sanjay Patel authored Aug 10, 2015
```
llvm-svn: 244458
```
9a9003d9
Add missing include guard to FuzzerInternal.h, NFC. · 347663b2
Yaron Keren authored Aug 10, 2015
```
llvm-svn: 244457
```
347663b2

Add test case with PHI node in exit block · 4adb0279

Michael Kruse authored Aug 10, 2015

The PHI node with multiple incoming edges from inside the region.

Thanks Tobias for coming up with the example. 

llvm-svn: 244456

4adb0279

Modify r244405 to clearer code, per David Blaikie suggestion. · e3c07067
Yaron Keren authored Aug 10, 2015
```
llvm-svn: 244455
```
e3c07067
misc-unused-parameters: Don't touch K&R style functions. · b3a74c65
Daniel Jasper authored Aug 10, 2015
```
We couldn't calculate the removal ranges properly at this point.

llvm-svn: 244454
```
b3a74c65

-Wdeprecated: Use noexcept rather than throw() where supported · 57add8dd

David Blaikie authored Aug 10, 2015

Summary: I've copy/pasted the LLVM_NOEXCEPT definition macro goo from LLVM's Compiler.h. Is there somewhere I should put this in Compiler RT? Is there a useful header to define/share things like this?

Reviewers: samsonov

Differential Revision: http://reviews.llvm.org/D11780

llvm-svn: 244453

57add8dd

Silence a sign mismatch warning; NFC. · d8ac7de7
Aaron Ballman authored Aug 10, 2015
```
llvm-svn: 244452
```
d8ac7de7
Don't depend on getDotSymtabSec. It is going away. · d8340dae
Rafael Espindola authored Aug 10, 2015
```
llvm-svn: 244451
```
d8340dae

Remove leftover comment · 1d3c9b54

Michael Kruse authored Aug 10, 2015

The function to which this commit applies has been removed in a
previous commit.

llvm-svn: 244450

1d3c9b54

[TTI] Add a hook for specifying per-target defaults for Interleaved Accesses · 61bdc513

Silviu Baranga authored Aug 10, 2015

Summary:
This adds a hook to TTI which enables us to selectively turn on by default
interleaved access vectorization for targets on which we have have performed
the required benchmarking.

Reviewers: rengolin

Subscribers: rengolin, llvm-commits

Differential Revision: http://reviews.llvm.org/D11901

llvm-svn: 244449

61bdc513

Prevent the scalarizer from caching incorrect entries · e29ab2bf

Fraser Cormack authored Aug 10, 2015

The scalarizer can cache incorrect entries when walking up a chain of
insertelement instructions. This occurs when it encounters more than one
instruction that it is not actively searching for, as it unconditionally caches
every element it finds. The fix is to only cache the first element that it
isn't searching for so we don't overwrite correct entries.

Reviewers: hfinkel

Differential Revision: http://reviews.llvm.org/D11559

llvm-svn: 244448

e29ab2bf

elf2yaml: Use existing section walk to find the symbol table. NFC. · 94515abf
Rafael Espindola authored Aug 10, 2015
```
llvm-svn: 244447
```
94515abf

Add WebKit brace style configuration option. · 291f64fd

Roman Kashitsyn authored Aug 10, 2015

Summary:
Add brace style `BS_WebKit` as described on https://www.webkit.org/coding/coding-style.html:

* Function definitions: place each brace on its own line.
* Other braces: place the open brace on the line preceding the code block; place the close brace on its own line.

Set brace style used in `getWebKitStyle()` to the newly added `BS_WebKit`.

Reviewers: djasper, klimek

Subscribers: klimek, cfe-commits

Differential Revision: http://reviews.llvm.org/D11837

llvm-svn: 244446

291f64fd

[RegionInfo] Fix typo · 7d21eb35
Michael Kruse authored Aug 10, 2015
```
llvm-svn: 244445
```
7d21eb35