Commits · 965e963d6d48ec7814d99dc51c0a4d3f996878a2 · Roger Ferrer / llvm-epi

Aug 10, 2015

cmake: Make CMAKE_BUILD_TYPE check case-insensitive · 965e963d
Justin Bogner authored Aug 10, 2015
```
Juergen pointed out that this variable is treated in a case
insensitive way.

llvm-svn: 244516
```
965e963d

MachineVerifier: Handle the optional def operand in a PATCHPOINT instruction. · e5101e20

Alex Lorenz authored Aug 10, 2015

The PATCHPOINT instructions have a single optional defined register operand,
but the machine verifier can't verify the optional defined register operands.
This commit makes sure that the machine verifier won't report an error when a
PATCHPOINT instruction doesn't have its optional defined register operand.
This change will allow us to enable the machine verifier for the code
generation tests for the patchpoint intrinsics.

Reviewers: Juergen Ributzka
llvm-svn: 244513

e5101e20

[llvm-symbolizer] Remove underscores and other C mangling on Windows · c25c7944

Reid Kleckner authored Aug 10, 2015

Summary:
This makes it so that reports symbolized after the fact with
llvm-symbolizer are more similar to the ones we generate at runtime with
in-process dbghelp.

Reviewers: samsonov

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D11785

llvm-svn: 244512

c25c7944

Don't iterate over all sections in the ELFFile constructor. · aae55414

Rafael Espindola authored Aug 10, 2015

With this we finally have an ELFFile that is O(1) to construct. This is helpful
for programs like lld which have to do their own section walk.

llvm-svn: 244510

aae55414

remove function names from comments; NFC · cc655436
Sanjay Patel authored Aug 10, 2015
```
llvm-svn: 244509
```
cc655436

StackMap: FastISel: Add an appropriate number of immediate operands to the · 2f43dd5a

Alex Lorenz authored Aug 10, 2015

frame setup instruction.

This commit ensures that the stack map lowering code in FastISel adds an
appropriate number of immediate operands to the frame setup instruction.

The previous code added just one immediate operand, which was fine for a target
like AArch64, but on X86 the ADJCALLSTACKDOWN64 instruction needs two explicit
operands. This caused the machine verifier to report an error when the old code
added just one.

Reviewers: Juergen Ributzka

Differential Revision: http://reviews.llvm.org/D11853

llvm-svn: 244508

2f43dd5a

Rename improperly named variable. NFC. · 0f251731
Rafael Espindola authored Aug 10, 2015
```
llvm-svn: 244507
```
0f251731
Make fp vectorization test X86 specified to avoid cost-model related problems... · 655e573d
Tyler Nowicki authored Aug 10, 2015
```
Make fp vectorization test X86 specified to avoid cost-model related problems on arm-thumb and hexagon.

llvm-svn: 244505
```
655e573d
Add a test showing that objdump (and so ObjectFIle) can handle shndx. · 3db22738
Rafael Espindola authored Aug 10, 2015
```
It was already passing, we were just not testing the code.

llvm-svn: 244504
```
3db22738

x86: Emit LAHF/SAHF instead of PUSHF/POPF · fa9746dc

JF Bastien authored Aug 10, 2015

NaCl's sandbox doesn't allow PUSHF/POPF out of security concerns (priviledged emulators have forgotten to mask system bits in the past, and EFLAGS's DF bit is a constant source of hilarity). Commit r220529 fixed PR20376 by saving cmpxchg's flags result using EFLAGS, this commit now generated LAHF/SAHF instead, for all of x86 (not just NaCl) because it leads to an overall performance gain over PUSHF/POPF.

As with the previous patch this code generation is pretty bad because it occurs very later, after register allocation, and in many cases it rematerializes flags which were already available (e.g. already in a register through SETE). Fortunately it's somewhat rare that this code needs to fire.

I did [[ https://github.com/jfbastien/benchmark-x86-flags | a bit of benchmarking ]], the results on an Intel Haswell E5-2690 CPU at 2.9GHz are:

| Time per call (ms)  | Runtime (ms) | Benchmark                      |
| 0.000012514         |      6257    | sete.i386                      |
| 0.000012810         |      6405    | sete.i386-fast                 |
| 0.000010456         |      5228    | sete.x86-64                    |
| 0.000010496         |      5248    | sete.x86-64-fast               |
| 0.000012906         |      6453    | lahf-sahf.i386                 |
| 0.000013236         |      6618    | lahf-sahf.i386-fast            |
| 0.000010580         |      5290    | lahf-sahf.x86-64               |
| 0.000010304         |      5152    | lahf-sahf.x86-64-fast          |
| 0.000028056         |     14028    | pushf-popf.i386                |
| 0.000027160         |     13580    | pushf-popf.i386-fast           |
| 0.000023810         |     11905    | pushf-popf.x86-64              |
| 0.000026468         |     13234    | pushf-popf.x86-64-fast         |

Clearly `PUSHF`/`POPF` are suboptimal. It doesn't really seems to be worth teaching LLVM about individual flags, at least not for this purpose.

Reviewers: rnk, jvoung, t.p.northover

Subscribers: llvm-commits

Differential revision: http://reviews.llvm.org/D6629

llvm-svn: 244503

fa9746dc

Use higher level functions in llvm-objdump. · a01ff22b

Rafael Espindola authored Aug 10, 2015

This matches the rest of llvm-objdump better and isolates it from upcoming
changes to ELFFile.

llvm-svn: 244500

a01ff22b

fix minsize detection: minsize attribute implies optimizing for size · d09391c8
Sanjay Patel authored Aug 10, 2015
```
llvm-svn: 244499
```
d09391c8
[x86, SSE]]add missing tests for load folding with partial register update · 178f8cba
Sanjay Patel authored Aug 10, 2015
```
The minsize case is wrong; that will be fixed in the next commit.

llvm-svn: 244498
```
178f8cba

Delete getDotSymtabSec. · 821a64c7

Rafael Espindola authored Aug 10, 2015

Another step in avoiding iterating over all sections in the ELFFile constructor.

llvm-svn: 244496

821a64c7

[InstCombine] Move SSE2/AVX2 arithmetic vector shift folding to instcombiner · a3a72b41

Simon Pilgrim authored Aug 10, 2015

As discussed in D11760, this patch moves the (V)PSRA(WD) arithmetic shift-by-constant folding to InstCombine to match the logical shift implementations.

Differential Revision: http://reviews.llvm.org/D11886

llvm-svn: 244495

a3a72b41

Removed unused and incorrectly implemented classof() on Optimization Remark base class. · 8e7661ec
Tyler Nowicki authored Aug 10, 2015
```
llvm-svn: 244494
```
8e7661ec
[TableGen] NFC improving comments about what the tokenized identifiers will contain. · 3d905747
Colin LeMahieu authored Aug 10, 2015
```
llvm-svn: 244493
```
3d905747
Fix a few more cases of 'CHECK[^:]*$'. NFCI · f45295c3
Jon Roelofs authored Aug 10, 2015
```
llvm-svn: 244491
```
f45295c3

Late evaluation of the fast-math vectorization requirement. · c1a86f58

Tyler Nowicki authored Aug 10, 2015

This patch moves the verification of fast-math to just before vectorization is done. This way we can tell clang to append the command line options would that allow floating-point commutativity. Specifically those are enableing fast-math or specifying a loop hint. 

llvm-svn: 244489

c1a86f58

Fix another case of 'CHECK[^:]*$'. NFCI · 5dcf1574
Jon Roelofs authored Aug 10, 2015
```
llvm-svn: 244486
```
5dcf1574

Modify diagnostic messages to clearly indicate the why interleaving wasn't done. · 4d62f2e0

Tyler Nowicki authored Aug 10, 2015

Sometimes interleaving is not beneficial, as determined by the cost-model and sometimes it is disabled by a loop hint (by the user). This patch modifies the diagnostic messages to make it clear why interleaving wasn't done.

llvm-svn: 244485

4d62f2e0

[Sparc] Implement i64 load/store support for 32-bit sparc. · 3994be87

James Y Knight authored Aug 10, 2015

The LDD/STD instructions can load/store a 64bit quantity from/to
memory to/from a consecutive even/odd pair of (32-bit) registers. They
are part of SparcV8, and also present in SparcV9. (Although deprecated
there, as you can store 64bits in one register).

As recommended on llvmdev in the thread "How to enable use of 64bit
load/store for 32bit architecture" from Apr 2015, I've modeled the
64-bit load/store operations as working on a v2i32 type, rather than
making i64 a legal type, but with few legal operations. The latter
does not (currently) work, as there is much code in llvm which assumes
that if i64 is legal, operations like "add" will actually work on it.

The same assumption does not hold for v2i32 -- for vector types, it is
workable to support only load/store, and expand everything else.

This patch:
- Adds a new register class, IntPair, for even/odd pairs of registers.

- Modifies the list of reserved registers, the stack spilling code,
  and register copying code to support the IntPair register class.

- Adds support in AsmParser. (note that in asm text, you write the
  name of the first register of the pair only. So the parser has to
  morph the single register into the equivalent paired register).

- Adds the new instructions themselves (LDD/STD/LDDA/STDA).

- Hooks up the instructions and registers as a vector type v2i32. Adds
  custom legalizer to transform i64 load/stores into v2i32 load/stores
  and bitcasts, so that the new instructions can actually be
  generated, and marks all operations other than load/store on v2i32
  as needing to be expanded.

- Copies the unfortunate SelectInlineAsm hack from ARMISelDAGToDAG.
  This hack undoes the transformation of i64 operands into two
  arbitrarily-allocated separate i32 registers in
  SelectionDAGBuilder. and instead passes them in a single
  IntPair. (Arbitrarily allocated registers are not useful, asm code
  expects to be receiving a pair, which can be passed to ldd/std.)

Also adds a bunch of test cases covering all the bugs I've added along
the way.

Differential Revision: http://reviews.llvm.org/D8713

llvm-svn: 244484

3994be87

rename toELFShdrIter to getSection and move it closer to getSymbol. NFC. · fe0e4e4c
Rafael Espindola authored Aug 10, 2015
```
llvm-svn: 244483
```
fe0e4e4c
toELFSymIter and getSymbol are now the same thing. Merge them. · 19046678
Rafael Espindola authored Aug 10, 2015
```
llvm-svn: 244482
```
19046678

Fix a bunch of trivial cases of 'CHECK[^:]*$' in the tests. NFCI · 49e46ce8

Jon Roelofs authored Aug 10, 2015

I looked into adding a warning / error for this to FileCheck, but there doesn't
seem to be a good way to avoid it triggering on the instances of it in RUN lines.

llvm-svn: 244481

49e46ce8

Use continue to reduce indentation. NFC. · fc2b6fa3
Rafael Espindola authored Aug 10, 2015
```
llvm-svn: 244480
```
fc2b6fa3
[AArch64] Convert a conditional check that will always be true to an assert. NFC. · c56a9132
Chad Rosier authored Aug 10, 2015
```
llvm-svn: 244479
```
c56a9132
Recommit r244470+ r244471 together, the bot failed between them. · 2ad3b336
Yaron Keren authored Aug 10, 2015
```
llvm-svn: 244476
```
2ad3b336
[IndVarSimplify] Make cost estimation in RewriteLoopExitValues smarter · 4709c037
Igor Laevsky authored Aug 10, 2015
```
Differential Revision: http://reviews.llvm.org/D11687

llvm-svn: 244474
```
4709c037
Revert r244470 and 244471 while looking into it. · 1a1e1ca9
Yaron Keren authored Aug 10, 2015
```
llvm-svn: 244472
```
1a1e1ca9
Second part of r244470 (source file was unsaved in editor). · b27259b2
Yaron Keren authored Aug 10, 2015
```
llvm-svn: 244471
```
b27259b2

Really implement David Blaikie suggestion in full of seperating · f850d984

Yaron Keren authored Aug 10, 2015

variable initialization from its usage in the push_back making
collapse of the two statements unlikely even without a comment.

llvm-svn: 244470

f850d984

Add new llvm.loop.unroll.enable metadata. · 8939154a

Mark Heffernan authored Aug 10, 2015

This change adds the unroll metadata "llvm.loop.unroll.enable" which directs
the optimizer to unroll a loop fully if the trip count is known at compile time, and
unroll partially if the trip count is not known at compile time. This differs from
"llvm.loop.unroll.full" which explicitly does not unroll a loop if the trip count is not
known at compile time.

The "llvm.loop.unroll.enable" is intended to be added for loops annotated with
"#pragma unroll".

llvm-svn: 244466

8939154a

Typo. Move comment closer to relevant code. NFC. · caed6db5
Chad Rosier authored Aug 10, 2015
```
llvm-svn: 244465
```
caed6db5
fix minsize detection: minsize attribute implies optimizing for size · 10294b59
Sanjay Patel authored Aug 10, 2015
```
llvm-svn: 244464
```
10294b59
fix minsize detection: minsize attribute implies optimizing for size · 0f12d71b
Sanjay Patel authored Aug 10, 2015
```
llvm-svn: 244463
```
0f12d71b
Fully apply David Blaikie suggestion and add comment explaining why. · 0b4c9693
Yaron Keren authored Aug 10, 2015
```
llvm-svn: 244461
```
0b4c9693
fix minsize detection: minsize attribute implies optimizing for size · 68b0325a
Sanjay Patel authored Aug 10, 2015
```
llvm-svn: 244460
```
68b0325a
fix minsize detection: minsize attribute implies optimizing for size · 9a9003d9
Sanjay Patel authored Aug 10, 2015
```
llvm-svn: 244458
```
9a9003d9
Add missing include guard to FuzzerInternal.h, NFC. · 347663b2
Yaron Keren authored Aug 10, 2015
```
llvm-svn: 244457
```
347663b2