Commits · 72f18bbcffe3a57fc8f23c2f4e5aa5779eec0425 · Roger Ferrer / llvm-epi-0.8

Apr 12, 2012

Fixed a case of ARM disassembly getting an assert on a bad encoding · 72f18bbc
Kevin Enderby authored Apr 11, 2012
```
of a VST instruction.

llvm-svn: 154544
```
72f18bbc

Fix bugs in lowering of FCOPYSIGN nodes. · 4f5c8421

Akira Hatanaka authored Apr 11, 2012

- FCOPYSIGN nodes that have operands of different types were not handled.
- Different code was generated depending on the endianness of the target.

Additionally, code is added that emits INS and EXT instructions if they are
supported by the target (they are R2 instructions).
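
As a rough illustration of the bit-level operation behind an FCOPYSIGN with operands of different types, the sketch below takes the magnitude from a 32-bit float and the sign from a 64-bit double. The helper names (`ext`, `ins`, `copysign_f32_from_f64`) are hypothetical and only mimic the effect of the R2 EXT and INS instructions; this is not the code the backend emits.

```python
import struct

def ext(value, pos, size):
    """Extract `size` bits of `value` starting at bit `pos` (EXT-like)."""
    return (value >> pos) & ((1 << size) - 1)

def ins(dest, src, pos, size):
    """Insert the low `size` bits of `src` into `dest` at bit `pos` (INS-like)."""
    mask = ((1 << size) - 1) << pos
    return (dest & ~mask) | ((src << pos) & mask)

def copysign_f32_from_f64(magnitude, sign_source):
    # Reinterpret each operand as a raw integer of its own width.
    mag_bits = struct.unpack("<I", struct.pack("<f", magnitude))[0]
    sign_bits = struct.unpack("<Q", struct.pack("<d", sign_source))[0]
    sign = ext(sign_bits, 63, 1)         # sign bit of the f64 operand
    result = ins(mag_bits, sign, 31, 1)  # overwrite the sign bit of the f32 operand
    return struct.unpack("<f", struct.pack("<I", result))[0]
```

The sketch works on integer values rather than byte positions, so the result does not depend on byte order.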

llvm-svn: 154540

4f5c8421

Apr 11, 2012

Typo. · cc899f3b
Chad Rosier authored Apr 11, 2012
```
llvm-svn: 154522
```
cc899f3b

ARM 'vuzp.32 Dd, Dm' is a pseudo-instruction. · 6e536de1

Jim Grosbach authored Apr 11, 2012

While there is an encoding for it in VUZP, the result of that is undefined,
so we should avoid it. Define the instruction as a pseudo for VTRN.32
instead, as the ARM ARM indicates.

rdar://11222366

llvm-svn: 154511

6e536de1

ARM 'vzip.32 Dd, Dm' is a pseudo-instruction. · 4640c816

Jim Grosbach authored Apr 11, 2012

While there is an encoding for it in VZIP, the result of that is undefined,
so we should avoid it. Define the instruction as a pseudo for VTRN.32
instead, as the ARM ARM indicates.
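
A small sketch of why the substitution is safe, which applies equally to the vuzp.32 entry: on a D register holding two 32-bit elements, transpose, zip and unzip all produce the same pair of results. The functions below are simplified list models written for this illustration, not an emulation of the real instructions.

```python
def vtrn(d, m):
    """Transpose: swap the odd elements of d with the even elements of m."""
    d, m = list(d), list(m)
    for i in range(0, len(d), 2):
        d[i + 1], m[i] = m[i], d[i + 1]
    return d, m

def vzip(d, m):
    """Interleave the elements of d and m."""
    inter = [x for pair in zip(d, m) for x in pair]
    return inter[:len(d)], inter[len(d):]

def vuzp(d, m):
    """De-interleave the concatenation of d and m."""
    cat = list(d) + list(m)
    return cat[0::2], cat[1::2]

d, m = ["d0", "d1"], ["m0", "m1"]
# With two elements per register all three operations coincide.
assert vtrn(d, m) == vzip(d, m) == vuzp(d, m)
```

With four or more elements per register the three operations give different results, so the equivalence only holds for the two-element `.32 Dd, Dm` form.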

rdar://11221911

llvm-svn: 154505

4640c816

Fix the build under Debian GNU/Hurd. · 14ada946
Sylvestre Ledru authored Apr 11, 2012
```
Thanks to Pino Toscano for the patch

llvm-svn: 154500
```
14ada946

Cache the hash value of the operands in the MDNode. · 2335a5cb

Benjamin Kramer authored Apr 11, 2012

FoldingSet is implemented as a chained hash table. When there is a hash
collision during insertion, which is common as we fill the table until a
load factor of 2.0 is hit, we walk the chained elements, comparing every
operand with the new element's operands. This can be very expensive if the
MDNode has many operands.

We sacrifice a word of space in MDNode to cache the full hash value, reducing
compares on collision to a minimum. MDNode grows from 28 to 32 bytes + operands
on x86. On x86_64 the new bits fit nicely into existing padding, not growing
the struct at all.

The actual speedup depends a lot on the test case and is typically between
1% and 2% for C++ code with clang -c -O0 -g.
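
The idea can be sketched with a toy chained hash set. The `Node` and `ChainedSet` classes are hypothetical stand-ins, not the FoldingSet or MDNode API; the point is only that comparing the cached full hash first skips the operand-by-operand comparison for most colliding entries.

```python
class Node:
    """A set element that caches the hash of its operands."""
    def __init__(self, operands):
        self.operands = tuple(operands)
        self.cached_hash = hash(self.operands)  # one extra word per node

class ChainedSet:
    def __init__(self, num_buckets=4):
        self.buckets = [[] for _ in range(num_buckets)]
        self.operand_compares = 0  # counts the expensive comparisons

    def get_or_insert(self, operands):
        operands = tuple(operands)
        h = hash(operands)
        chain = self.buckets[h % len(self.buckets)]
        for node in chain:
            # Cheap check first: nodes with different full hashes cannot be equal.
            if node.cached_hash != h:
                continue
            self.operand_compares += 1
            if node.operands == operands:
                return node
        node = Node(operands)
        chain.append(node)
        return node
```

With a single bucket every insertion collides, yet inserting distinct operand lists performs no operand comparisons unless their full hashes happen to match.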

llvm-svn: 154497

2335a5cb

FoldingSet: Push the hash through FoldingSetTraits::Equals, so clients can use it. · 63057a5f
Benjamin Kramer authored Apr 11, 2012
```
llvm-svn: 154496
```
63057a5f
Compute hashes directly with hash_combine instead of taking a detour through FoldingSetNodeID. · 7a426b5f
Benjamin Kramer authored Apr 11, 2012
```
llvm-svn: 154495
```
7a426b5f
remove unused argument · 372cf151
Nadav Rotem authored Apr 11, 2012
```
llvm-svn: 154494
```
372cf151
Add a C binding to the Target and TargetMachine classes to allow for emitting · 264d2e71
Duncan Sands authored Apr 11, 2012
```
binary and assembly. Patch by Carlo Kok.  Emitting was inspired by but not based
on the D llvm bindings. 

llvm-svn: 154493
```
264d2e71
Add two statistics to help track how we are computing the inline cost. · 7ae90d4d
Chandler Carruth authored Apr 11, 2012
```
Yea, 'NumCallerCallersAnalyzed' isn't a great name, suggestions welcome.

llvm-svn: 154492
```
7ae90d4d

Reapply 154397. Original message: · 9d376b65

Nadav Rotem authored Apr 11, 2012

Fix a dagcombine optimization which assumes that the vsetcc result type is always
of the same size as the compared values. This is true for SSE/AVX/NEON but not
for all targets.

llvm-svn: 154490

9d376b65

Add more fused mul+add/sub patterns. rdar://10139676 · 5efc4422
Evan Cheng authored Apr 11, 2012
```
llvm-svn: 154484
```
5efc4422

Reapply 154396 after fixing a test. · 9bc178ac

Nadav Rotem authored Apr 11, 2012

Original message:
Modify the code that lowers shuffles to blends from using blendvXX to vblendXX.
blendvXX uses a register for the selection while vblendXX uses an immediate.
On Sandy Bridge they still have the same latency and execute on the same execution ports.
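
The difference between the two forms can be sketched on plain lists. The function names are made up for this illustration, and the selection rules are simplified models: one bit of the immediate per lane for the immediate form, and the sign of each mask lane for the register form.

```python
def blend_imm(a, b, imm):
    """Immediate form: bit i of `imm` selects lane i from b, otherwise from a."""
    return [b[i] if (imm >> i) & 1 else a[i] for i in range(len(a))]

def blend_var(a, b, mask):
    """Register form: a negative mask lane selects lane i from b."""
    return [b[i] if mask[i] < 0 else a[i] for i in range(len(a))]

def shuffle_mask_to_imm(shuffle_mask, num_lanes):
    """Convert a blend-like shuffle mask into an immediate.

    Index i picks a[i]; index i + num_lanes picks b[i].
    """
    imm = 0
    for lane, idx in enumerate(shuffle_mask):
        if idx % num_lanes != lane:
            raise ValueError("shuffle is not a blend")
        if idx >= num_lanes:
            imm |= 1 << lane
    return imm
```

Because a shuffle mask is known at compile time, the selection can be folded into the immediate, and no register has to be loaded with a mask.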

llvm-svn: 154483

9bc178ac

Clean up ARM fused multiply + add/sub support some more: rename some isel · 48346c1c

Evan Cheng authored Apr 11, 2012

predicates.
Also remove NEON2 since it's not really useful and it is confusing. If
NEON + VFP4 implies NEON2 but NEON2 doesn't imply NEON + VFP4, what does it
really mean?

rdar://10139676

llvm-svn: 154480

48346c1c

Fix an overly indented line. Remove an 'else' after an 'if' that returns. · 692d5849
Craig Topper authored Apr 11, 2012
```
llvm-svn: 154479
```
692d5849
Inline implVisitAluOverflow by introducing a nested switch to convert the intrinsic to a nodetype. · bc680061
Craig Topper authored Apr 11, 2012
```
llvm-svn: 154478
```
bc680061
Optimize code a bit by calling push_back only once in some loops. Reduces compiled code size a bit. · 3ef01cdb
Craig Topper authored Apr 11, 2012
```
llvm-svn: 154473
```
3ef01cdb
Match (fneg (fma)) to vfnma. rdar://10139676 · 67a09fc3
Evan Cheng authored Apr 11, 2012
```
llvm-svn: 154469
```
67a09fc3
Add retw and lretw instructions. Also, fix Intel syntax parsing for all · 74c282b5
Charles Davis authored Apr 11, 2012
```
ret instructions.

llvm-svn: 154468
```
74c282b5
Fix ARM disassembly of VLD instructions with writebacks. And add a test case · d2980cd0
Kevin Enderby authored Apr 11, 2012
```
for all opcodes handled by DecodeVLDInstruction() in ARMDisassembler.cpp.

llvm-svn: 154459
```
d2980cd0
ARM add missing Thumb1 two-operand aliases for shift-by-immediate. · ad66de15
Jim Grosbach authored Apr 11, 2012
```
rdar://11222742

llvm-svn: 154457
```
ad66de15

Fix a number of problems with ARM fused multiply add/subtract instructions. · aca6c822

Evan Cheng authored Apr 11, 2012

1. The new instruction itinerary entries are not properly described.
2. The asm parser can't handle vfms and vfnms.
3. There were no assembler, disassembler test cases.
4. HasNEON2 has the wrong assembler predicate.
rdar://10139676

llvm-svn: 154456

aca6c822

Tweak MachineLICM heuristics for cheap instructions. · 645bdd4b

Jakob Stoklund Olesen authored Apr 11, 2012

Allow cheap instructions to be hoisted if they are register pressure
neutral or better. This happens if the instruction is the last loop use
of another virtual register.

Only expensive instructions are allowed to increase loop register
pressure.
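
A minimal sketch of the rule as described here, with a hypothetical `may_hoist` helper and a deliberately crude pressure model (one register per definition, one register freed per operand whose last loop use is this instruction). The real pass considers more than this.

```python
def may_hoist(is_cheap, defs, last_loop_uses):
    """Decide whether hoisting is acceptable from a register pressure view.

    defs: number of virtual registers the instruction defines; once hoisted
          they are live across the whole loop.
    last_loop_uses: number of operands for which this instruction is the
          last use inside the loop; hoisting ends their live range before
          the loop.
    """
    pressure_change = defs - last_loop_uses
    if is_cheap:
        # Cheap instructions must be pressure neutral or better.
        return pressure_change <= 0
    # Only expensive instructions may increase loop register pressure.
    return True
```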

llvm-svn: 154455

645bdd4b

Only check for PHI uses inside the current loop. · a3e86a60

Jakob Stoklund Olesen authored Apr 11, 2012

Hoisting a value that is used by a PHI in the loop will introduce a
copy because the live range is extended to cross the PHI.

The same applies to PHIs in exit blocks.

Also use this opportunity to make HasLoopPHIUse() non-recursive.

llvm-svn: 154454

a3e86a60

Move the constant-folding support for FP_ROUND in SelectionDAG from the... · 6f1ee163

Owen Anderson authored Apr 10, 2012

Move the constant-folding support for FP_ROUND in SelectionDAG from the one-operand version of getNode() to the two-operand version, since it became a two-operand node at some point.
Zap a testcase that this allows us to completely fold away.

llvm-svn: 154447

6f1ee163

[tsan] two more compile-time optimizations: · 5ba61ac6

Kostya Serebryany authored Apr 10, 2012

- don't instrument reads from constant globals.
Saves ~1.5% of instrumented instructions on CPU2006
(counting static instructions, not their execution).
- don't instrument reads from vtable (which is a global constant too).
Saves ~5%.

I did not measure the run-time impact of this,
but it is certainly non-negative.

llvm-svn: 154444

5ba61ac6

Apr 10, 2012

Handle llvm.fma.* intrinsics. rdar://10914096 · d0007f3c
Evan Cheng authored Apr 10, 2012
```
llvm-svn: 154439
```
d0007f3c
Add a comment noting that the fdiv -> fmul conversion won't generate · 4f53074c
Duncan Sands authored Apr 10, 2012
```
multiplication by a denormal, and some tests checking that.

llvm-svn: 154431
```
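
A sketch of the kind of check this refers to, assuming double precision and a hypothetical helper name `exact_reciprocal`: rewriting `x / c` as `x * (1/c)` is only considered when `c` is a power of two and the reciprocal is a finite, normal number.

```python
import math
import sys

def exact_reciprocal(c):
    """Return 1/c if x / c can be rewritten as x * (1/c), else None."""
    if c == 0.0 or math.isinf(c) or math.isnan(c):
        return None
    mantissa, _ = math.frexp(abs(c))
    if mantissa != 0.5:
        return None  # not a power of two, so 1/c is not exact
    r = 1.0 / c
    if math.isinf(r) or abs(r) < sys.float_info.min:
        return None  # reciprocal overflowed or is a denormal
    return r
```

For example, dividing by 4.0 can become multiplying by 0.25, while dividing by 2.0**1023 is left alone because its reciprocal is a denormal.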
4f53074c

The MDString class stored a StringRef to the string which was already in a · c4c568b2

Bill Wendling authored Apr 10, 2012

StringMap. This was redundant and unnecessarily bloated the MDString class.

Because the MDString class is a "Value" and will never have a "name", and
because the Name field in the Value class is a pointer to a StringMap entry, we
repurpose the Name field for an MDString. It stores the StringMap entry in the
Name field, and uses the normal methods to get the string (name) back.

PR12474

llvm-svn: 154429

c4c568b2

Whitespace. · f7345b02
Chad Rosier authored Apr 10, 2012
```
llvm-svn: 154427
```
f7345b02
Revert r154396, which looks to be the real culprit behind the bot failures. · 235a7a17
Chad Rosier authored Apr 10, 2012
```
llvm-svn: 154426
```
235a7a17
Temporarily revert this patch to see if it brings the buildbots back. · 65ada95b
Eric Christopher authored Apr 10, 2012
```
llvm-svn: 154425
```
65ada95b

[tsan] compile-time instrumentation: do not instrument a read if · bf2de80b

Kostya Serebryany authored Apr 10, 2012

a write to the same temp follows in the same BB.
Also add stats printing.

On Spec CPU2006 this optimization saves roughly 4% of instrumented reads
(which is 3% of all instrumented accesses):
Writes            : 161216
Reads             : 446458
Reads-before-write: 18295
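
The percentages follow from the counts above, and the selection itself can be sketched as a backward walk over a basic block. The `reads_to_instrument` function and its `(op, address)` tuple model are assumptions made for this illustration; anything else the real pass has to account for is not modeled.

```python
def reads_to_instrument(block):
    """Return the indices of reads that still need instrumentation.

    block: list of (op, address) tuples in program order, with op being
    "read" or "write". A read is skipped when a write to the same address
    follows later in the same block.
    """
    written_later = set()
    keep = []
    for i in range(len(block) - 1, -1, -1):
        op, addr = block[i]
        if op == "write":
            written_later.add(addr)
        elif op == "read" and addr not in written_later:
            keep.append(i)
    return sorted(keep)

writes, reads, reads_before_write = 161216, 446458, 18295
share_of_reads = 100 * reads_before_write / reads               # about 4.1
share_of_all = 100 * reads_before_write / (writes + reads)      # about 3.0
```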

llvm-svn: 154418

bf2de80b

To ensure that we have more accurate line information for a block · e9abba71

Eric Christopher authored Apr 10, 2012

don't elide the branch instruction if it's the only one in the block,
otherwise it's ok.

PR9796 and rdar://11215207

llvm-svn: 154417

e9abba71

Revert r154397, which was causing make check failures on the buildbots. · 3efc8f22
Owen Anderson authored Apr 10, 2012
```
llvm-svn: 154414
```
3efc8f22

ARM fix cc_out operand handling for t2SUBrr instructions. · df5a2447

Jim Grosbach authored Apr 10, 2012

We were incorrectly conflating some add variants which don't have a
cc_out operand with the mirroring sub encodings, which do. Part of the
awesome non-orthogonality legacy of thumb1. Similarly, handling of
add/sub of an immediate was sometimes incorrectly removing the cc_out
operand for add/sub register variants.

rdar://11216577

llvm-svn: 154411

df5a2447

Remove unused variable. · 27351366
David Blaikie authored Apr 10, 2012
```
llvm-svn: 154398
```
27351366
Fix a dagcombine optimization which assumes that the vsetcc result type is always · 065564d8
Nadav Rotem authored Apr 10, 2012
```
of the same size as the compared values. This is true for SSE/AVX/NEON but not
for all targets.

llvm-svn: 154397
```
065564d8