Commits · 7f4c9d142986b6dab625c8a3034cfc57ce54f6d9 · Roger Ferrer / llvm-epi-0.8

Apr 12, 2012

Emit abs.s or abs.d only if -enable-no-nans-fp-math is supplied by user. · 7f4c9d14
Akira Hatanaka authored Apr 11, 2012
```
Invalid operation is signaled if the operand of these instructions is NaN.

llvm-svn: 154545
```
Fixed a case of ARM disassembly getting an assert on a bad encoding · 72f18bbc
Kevin Enderby authored Apr 11, 2012
```
of a VST instruction.

llvm-svn: 154544
```

Fix bugs in lowering of FCOPYSIGN nodes. · 4f5c8421
Akira Hatanaka authored Apr 11, 2012

- FCOPYSIGN nodes that have operands of different types were not handled.
- Different code was generated depending on the endianness of the target.

Additionally, code is added that emits INS and EXT instructions if the
target supports them (they are R2 instructions).

llvm-svn: 154540
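
As a rough illustration of the two cases named above, the sketch below copies the sign of a `float` onto a `double` (operands of different types) using a bitfield extract and a bitfield insert, which is the kind of work EXT and INS do. The helpers `ext`, `ins`, and `copysign_mixed` are hypothetical names, and the code is not the backend's lowering. It operates on whole integer values, so the result does not depend on the byte order of the host.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical bitfield extract: return `size` bits of `v` starting at `pos`.
std::uint64_t ext(std::uint64_t v, unsigned pos, unsigned size) {
    const std::uint64_t mask = (size >= 64) ? ~0ULL : ((1ULL << size) - 1);
    return (v >> pos) & mask;
}

// Hypothetical bitfield insert: overwrite `size` bits of `v` at `pos`
// with the low bits of `field`.
std::uint64_t ins(std::uint64_t v, std::uint64_t field, unsigned pos, unsigned size) {
    const std::uint64_t mask = (size >= 64) ? ~0ULL : ((1ULL << size) - 1);
    return (v & ~(mask << pos)) | ((field & mask) << pos);
}

// Copy the sign of a 32-bit float onto a 64-bit double.
double copysign_mixed(double magnitude, float sign) {
    std::uint32_t sbits;
    std::uint64_t mbits;
    std::memcpy(&sbits, &sign, sizeof sbits);
    std::memcpy(&mbits, &magnitude, sizeof mbits);
    const std::uint64_t s = ext(sbits, 31, 1);  // sign bit of the float
    mbits = ins(mbits, s, 63, 1);               // becomes sign bit of the double
    std::memcpy(&magnitude, &mbits, sizeof magnitude);
    return magnitude;
}
```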


Apr 11, 2012
- Remove incorrect comment. · b4722bba
  Jim Grosbach authored Apr 11, 2012
```
llvm-svn: 154533
```
- Tidy up. Remove hard tab characters. · 3263a07d
  Jim Grosbach authored Apr 11, 2012
```
llvm-svn: 154532
```
- Tidy up. Whitespace. · dac4a95b
  Jim Grosbach authored Apr 11, 2012
```
llvm-svn: 154531
```
- Fix pasto. · 63fa02ea
  Benjamin Kramer authored Apr 11, 2012
```
llvm-svn: 154527
```
- Typo. · cc899f3b
  Chad Rosier authored Apr 11, 2012
```
llvm-svn: 154522
```
- TableGen's regpressure: emit per-registerclass weight limits. · 97254150
  Andrew Trick authored Apr 11, 2012
```
llvm-svn: 154518
```
- ARM 'vuzp.32 Dd, Dm' is a pseudo-instruction. · 6e536de1
  Jim Grosbach authored Apr 11, 2012
```
While there is an encoding for it in VUZP, the result of that is undefined,
so we should avoid it. Define the instruction as a pseudo for VTRN.32
instead, as the ARM ARM indicates.

rdar://11222366

llvm-svn: 154511
```
- TableGen'd regpressure: register unit set pruning. · a5eee987
  Andrew Trick authored Apr 11, 2012
```
The pruning is more complete if it is not done incrementally. The code
is also a tad less convoluted.

llvm-svn: 154510
```
- ARM 'vzip.32 Dd, Dm' is a pseudo-instruction. · 4640c816
  Jim Grosbach authored Apr 11, 2012
```
While there is an encoding for it in VZIP, the result of that is undefined,
so we should avoid it. Define the instruction as a pseudo for VTRN.32
instead, as the ARM ARM indicates.

rdar://11221911

llvm-svn: 154505
```
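The reason VTRN.32 can stand in for both the VUZP.32 and VZIP.32 forms on D registers is that, with only two lanes per register, interleaving, de-interleaving, and transposing all produce the same pair of results. The sketch below models the three operations on plain arrays to show this; `Vec`, `transpose`, `zip`, and `unzip` are illustrative names rather than anything taken from the ARM backend.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <utility>

// Models a vector register holding N 32-bit lanes (N == 2 for a D register).
template <std::size_t N>
using Vec = std::array<std::uint32_t, N>;

// Transpose: treat (d, m) as 2x2 blocks and swap the off-diagonal lanes.
template <std::size_t N>
std::pair<Vec<N>, Vec<N>> transpose(Vec<N> d, Vec<N> m) {
    for (std::size_t i = 0; i + 1 < N; i += 2) std::swap(d[i + 1], m[i]);
    return {d, m};
}

// Zip: interleave the lanes of d and m, then split the result in half.
template <std::size_t N>
std::pair<Vec<N>, Vec<N>> zip(const Vec<N>& d, const Vec<N>& m) {
    std::array<std::uint32_t, 2 * N> z{};
    for (std::size_t i = 0; i < N; ++i) {
        z[2 * i] = d[i];
        z[2 * i + 1] = m[i];
    }
    Vec<N> lo{}, hi{};
    for (std::size_t i = 0; i < N; ++i) {
        lo[i] = z[i];
        hi[i] = z[N + i];
    }
    return {lo, hi};
}

// Unzip: concatenate d and m, then gather even lanes and odd lanes.
template <std::size_t N>
std::pair<Vec<N>, Vec<N>> unzip(const Vec<N>& d, const Vec<N>& m) {
    std::array<std::uint32_t, 2 * N> c{};
    for (std::size_t i = 0; i < N; ++i) {
        c[i] = d[i];
        c[N + i] = m[i];
    }
    Vec<N> even{}, odd{};
    for (std::size_t i = 0; i < N; ++i) {
        even[i] = c[2 * i];
        odd[i] = c[2 * i + 1];
    }
    return {even, odd};
}
```

With two lanes all three functions agree; with four lanes they diverge, which is why the aliasing only applies to the two-lane `.32` D-register forms.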
- Fix the build under Debian GNU/Hurd. · 14ada946
  Sylvestre Ledru authored Apr 11, 2012
```
Thanks to Pino Toscano for the patch

llvm-svn: 154500
```
- Cache the hash value of the operands in the MDNode. · 2335a5cb
  Benjamin Kramer authored Apr 11, 2012
```
FoldingSet is implemented as a chained hash table. When there is a hash
collision during insertion, which is common as we fill the table until a
load factor of 2.0 is hit, we walk the chained elements, comparing every
operand with the new element's operands. This can be very expensive if the
MDNode has many operands.

We sacrifice a word of space in MDNode to cache the full hash value, reducing
compares on collision to a minimum. MDNode grows from 28 to 32 bytes + operands
on x86. On x86_64 the new bits fit nicely into existing padding, not growing
the struct at all.

The actual speedup depends a lot on the test case and is typically between
1% and 2% for C++ code with clang -c -O0 -g.

llvm-svn: 154497
```
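The idea in this message can be sketched outside LLVM: a chained hash table whose nodes remember their full hash can reject most colliding chain entries with one integer compare, and only falls back to an operand-by-operand comparison when the full hashes match. The code below is a simplified stand-in, not FoldingSet or MDNode; `Node`, `NodeSet`, `hash_operands`, and `full_compares_after_inserting` are hypothetical names, and the hash is a simple FNV-1a-style mix chosen for illustration.

```cpp
#include <cstddef>
#include <cstdint>
#include <forward_list>
#include <vector>

// Hypothetical node: operands plus one extra word caching the full hash.
struct Node {
    std::vector<int> operands;
    std::uint64_t hash;
};

// FNV-1a-style mix over the operands (a stand-in for a real hash function).
std::uint64_t hash_operands(const std::vector<int>& ops) {
    std::uint64_t h = 0xcbf29ce484222325ULL;
    for (int op : ops) {
        h ^= static_cast<std::uint64_t>(static_cast<std::uint32_t>(op));
        h *= 0x100000001b3ULL;
    }
    return h;
}

class NodeSet {
public:
    explicit NodeSet(std::size_t buckets) : table_(buckets) {}

    // Return the existing node with these operands, or insert a new one.
    const Node& get_or_insert(const std::vector<int>& ops) {
        const std::uint64_t h = hash_operands(ops);
        auto& chain = table_[h % table_.size()];
        for (const Node& n : chain) {
            if (n.hash != h) continue;  // cheap reject using the cached hash
            ++full_compares_;           // only now compare every operand
            if (n.operands == ops) return n;
        }
        chain.push_front(Node{ops, h});
        return chain.front();
    }

    std::size_t full_compares() const { return full_compares_; }

private:
    std::vector<std::forward_list<Node>> table_;
    std::size_t full_compares_ = 0;
};

// Insert `count` distinct single-operand nodes and report how many full
// operand comparisons were needed along the way.
std::size_t full_compares_after_inserting(std::size_t buckets, int count) {
    NodeSet set(buckets);
    for (int i = 0; i < count; ++i) set.get_or_insert({i});
    return set.full_compares();
}
```

Even with a single bucket, where every insertion walks one long chain, distinct nodes in this sketch are told apart by their cached hashes without touching the operand lists.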
- FoldingSet: Push the hash through FoldingSetTraits::Equals, so clients can use it. · 63057a5f
  Benjamin Kramer authored Apr 11, 2012
```
llvm-svn: 154496
```
- Compute hashes directly with hash_combine instead of taking a detour through FoldingSetNodeID. · 7a426b5f
  Benjamin Kramer authored Apr 11, 2012
```
llvm-svn: 154495
```
- remove unused argument · 372cf151
  Nadav Rotem authored Apr 11, 2012
```
llvm-svn: 154494
```
- Add a C binding to the Target and TargetMachine classes to allow for emitting · 264d2e71
  Duncan Sands authored Apr 11, 2012
```
binary and assembly. Patch by Carlo Kok.  Emitting was inspired by but not based
on the D llvm bindings. 

llvm-svn: 154493
```
- Add two statistics to help track how we are computing the inline cost. · 7ae90d4d
  Chandler Carruth authored Apr 11, 2012
```
Yea, 'NumCallerCallersAnalyzed' isn't a great name, suggestions welcome.

llvm-svn: 154492
```
- Reapply 154397. Original message: · 9d376b65
  Nadav Rotem authored Apr 11, 2012
```
Fix a dagcombine optimization which assumes that the vsetcc result type is always
of the same size as the compared values. This is true for SSE/AVX/NEON but not
for all targets.

llvm-svn: 154490
```
- Comment typo fix. · a4b12563
  Duncan Sands authored Apr 11, 2012
```
llvm-svn: 154488
```
- Add more fused mul+add/sub patterns. rdar://10139676 · 5efc4422
  Evan Cheng authored Apr 11, 2012
```
llvm-svn: 154484
```
- Reapply 154396 after fixing a test. · 9bc178ac
  Nadav Rotem authored Apr 11, 2012
```
Original message:
Modify the code that lowers shuffles to blends from using blendvXX to vblendXX.
blendvXX uses a register for the selection while vblendXX uses an immediate.
On Sandy Bridge they still have the same latency and execute on the same execution ports.

llvm-svn: 154483
```
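The difference between the two blend forms can be modeled in scalar code: the variable form reads its per-lane selector from the sign bit of a third vector held in a register, while the immediate form takes the selector bits as a compile-time constant. The sketch below is a plain C++ model using hypothetical names (`V4`, `blend_var`, `blend_imm`); it does not use the hardware intrinsics and says nothing about how the lowering code itself is written.

```cpp
#include <array>
#include <cmath>
#include <cstddef>

using V4 = std::array<float, 4>;  // models four single-precision lanes

// Variable blend: lane i comes from b when the sign bit of mask[i] is set.
V4 blend_var(const V4& a, const V4& b, const V4& mask) {
    V4 r{};
    for (std::size_t i = 0; i < 4; ++i) r[i] = std::signbit(mask[i]) ? b[i] : a[i];
    return r;
}

// Immediate blend: lane i comes from b when bit i of the constant Imm is set.
template <unsigned Imm>
V4 blend_imm(const V4& a, const V4& b) {
    V4 r{};
    for (std::size_t i = 0; i < 4; ++i) r[i] = ((Imm >> i) & 1u) ? b[i] : a[i];
    return r;
}
```

A shuffle mask is known at compile time, so the immediate form can express it directly without first materializing a selector vector in a register.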
- Clean up ARM fused multiply + add/sub support some more: rename some isel · 48346c1c
  Evan Cheng authored Apr 11, 2012
```
predicates.
Also remove NEON2 since it's not really useful and it is confusing. If
NEON + VFP4 implies NEON2 but NEON2 doesn't imply NEON + VFP4, what does it
really mean?

rdar://10139676

llvm-svn: 154480
```
- Fix an overly indented line. Remove an 'else' after an 'if' that returns. · 692d5849
  Craig Topper authored Apr 11, 2012
```
llvm-svn: 154479
```
- Inline implVisitAluOverflow by introducing a nested switch to convert the intrinsic to a node type. · bc680061
  Craig Topper authored Apr 11, 2012
```
llvm-svn: 154478
```
- TableGen'd regpressure: emit the weighted pressure limit. · b1a92d3b
  Andrew Trick authored Apr 11, 2012
```
llvm-svn: 154477
```
- Table-generated register pressure fixes. · 0d94c73c
  Andrew Trick authored Apr 11, 2012
```
Handle mixing allocatable and unallocatable registers gracefully.
Simplify the pruning of register unit sets.

llvm-svn: 154474
```
- Optimize code a bit by calling push_back only once in some loops. Reduces compiled code size a bit. · 3ef01cdb
  Craig Topper authored Apr 11, 2012
```
llvm-svn: 154473
```
- Match (fneg (fma)) to vfnma. rdar://10139676 · 67a09fc3
  Evan Cheng authored Apr 11, 2012
```
llvm-svn: 154469
```
- Add retw and lretw instructions. Also, fix Intel syntax parsing for all · 74c282b5
  Charles Davis authored Apr 11, 2012
```
ret instructions.

llvm-svn: 154468
```
- Merge fma.ll into fusedMAC.ll · d0f61cbe
  Evan Cheng authored Apr 11, 2012
```
llvm-svn: 154466
```
- Fix ARM disassembly of VLD instructions with writebacks. And add a test case · d2980cd0
  Kevin Enderby authored Apr 11, 2012
```
for all opcodes handled by DecodeVLDInstruction() in ARMDisassembler.cpp.

llvm-svn: 154459
```
- ARM add missing Thumb1 two-operand aliases for shift-by-immediate. · ad66de15
  Jim Grosbach authored Apr 11, 2012
```
rdar://11222742

llvm-svn: 154457
```
- Fix a number of problems with ARM fused multiply add/subtract instructions. · aca6c822
  Evan Cheng authored Apr 11, 2012
```
1. The new instruction itinerary entries are not properly described.
2. The asm parser can't handle vfms and vfnms.
3. There were no assembler or disassembler test cases.
4. HasNEON2 has the wrong assembler predicate.
rdar://10139676

llvm-svn: 154456
```
- Tweak MachineLICM heuristics for cheap instructions. · 645bdd4b
  Jakob Stoklund Olesen authored Apr 11, 2012
```
Allow cheap instructions to be hoisted if they are register pressure
neutral or better. This happens if the instruction is the last loop use
of another virtual register.

Only expensive instructions are allowed to increase loop register
pressure.

llvm-svn: 154455
```
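The rule described in this message can be written down as a small decision function. The sketch below is a deliberately simplified model under stated assumptions: `Candidate`, `pressure_delta`, and `may_hoist` are hypothetical names, register pressure is reduced to a single integer, and none of the other checks a real hoisting pass would perform are represented.

```cpp
// Hypothetical summary of an instruction being considered for hoisting.
struct Candidate {
    bool is_cheap;       // cheap to recompute
    int defs;            // virtual registers it defines
    int last_loop_uses;  // operands whose last use inside the loop is this instruction
};

// Net change in loop register pressure if the instruction is hoisted:
// each def becomes live across the loop, while each operand that had its
// last loop use here no longer needs to stay live in the loop.
int pressure_delta(const Candidate& c) {
    return c.defs - c.last_loop_uses;
}

// Cheap instructions may be hoisted only when pressure-neutral or better;
// only expensive instructions may increase loop register pressure.
bool may_hoist(const Candidate& c) {
    if (pressure_delta(c) <= 0) return true;
    return !c.is_cheap;
}
```

For example, a cheap instruction that defines one register and is the last loop use of another has a delta of zero and qualifies, while the same instruction with no dying operand does not.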
- Only check for PHI uses inside the current loop. · a3e86a60
  Jakob Stoklund Olesen authored Apr 11, 2012
```
Hoisting a value that is used by a PHI in the loop will introduce a
copy because the live range is extended to cross the PHI.

The same applies to PHIs in exit blocks.

Also use this opportunity to make HasLoopPHIUse() non-recursive.

llvm-svn: 154454
```
- Fix test to be register assignment invariant. · 0bcf8f4b
  Jakob Stoklund Olesen authored Apr 11, 2012
```
llvm-svn: 154453
```
- TableGen/reginfo potential bug: typo from previous checkin. · f8b1a666
  Andrew Trick authored Apr 10, 2012
```
llvm-svn: 154452
```
- Move the constant-folding support for FP_ROUND in SelectionDAG from the... · 6f1ee163
  Owen Anderson authored Apr 10, 2012
```
Move the constant-folding support for FP_ROUND in SelectionDAG from the one-operand version of getNode() to the two-operand version, since it became a two-operand node at some point.
Zap a testcase that this allows us to completely fold away.

llvm-svn: 154447
```