Commits · 11284ea499c5f2f1ef81b162503e5b4928d152f7 · Roger Ferrer / llvm-epi-0.8

Aug 31, 2008

Another situation where ROTR is cheaper than ROTL. · 11284ea4
Bill Wendling authored Aug 31, 2008
```
llvm-svn: 55577
```
11284ea4
For this pattern, ROTR is the cheaper option. · 4822a7ac
Bill Wendling authored Aug 31, 2008
```
llvm-svn: 55576
```
4822a7ac

- Fix comment so that it describes how the code really works: · fc724164

Bill Wendling authored Aug 31, 2008

   // fold (or (shl x, (*ext y)), (srl x, (*ext (sub 32, y)))) ->
   //   (rotl x, y)
   // fold (or (shl x, (*ext y)), (srl x, (*ext (sub 32, y)))) ->
   //   (rotr x, (sub 32, y))

Example: (x == 0xDEADBEEF and y == 4)

    (x << 4) | (x >> 28)
 => 0xEADBEEF0 | 0x0000000D
 => 0xEADBEEFD

    (rotl x, 4)
 => 0xEADBEEFD

    (rotr x, 28)
 => 0xEADBEEFD

- Fix comment and code for second version. It wasn't using the rot* properly.

   // fold (or (shl x, (*ext (sub 32, y))), (srl x, (*ext y))) ->
   //   (rotr x, y)
   // fold (or (shl x, (*ext (sub 32, y))), (srl x, (*ext y))) ->
   //   (rotl x, (sub 32, y))

    (x << 28) | (x >> 4)
 => 0xF0000000 | 0x0DEADBEE
 => 0xFDEADBEE

    (rotr x, 4)
 => 0xFDEADBEE

    (rotl x, 28)
 => 0xFDEADBEE

llvm-svn: 55575

fc724164
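
The two folds described in this commit message can be checked with a small standalone sketch. It is not the DAGCombiner code: `rotl32`, `rotr32`, `shlOrSrl` and `srlOrShl` are hypothetical helper names, the width is fixed at 32 bits, and the `*ext` wrappers around the shift amount are ignored.

```cpp
#include <cassert>
#include <cstdint>

// Rotate a 32-bit value left by n bits (n is taken modulo 32).
std::uint32_t rotl32(std::uint32_t x, unsigned n) {
  n &= 31u;
  return n == 0 ? x : (x << n) | (x >> (32u - n));
}

// Rotate a 32-bit value right by n bits (n is taken modulo 32).
std::uint32_t rotr32(std::uint32_t x, unsigned n) {
  n &= 31u;
  return n == 0 ? x : (x >> n) | (x << (32u - n));
}

// First pattern: (or (shl x, y), (srl x, (sub 32, y))).
// Only meaningful for 0 < y < 32; a 32-bit shift by 32 is undefined in C++.
std::uint32_t shlOrSrl(std::uint32_t x, unsigned y) {
  return (x << y) | (x >> (32u - y));
}

// Second pattern: (or (shl x, (sub 32, y)), (srl x, y)).
std::uint32_t srlOrShl(std::uint32_t x, unsigned y) {
  return (x << (32u - y)) | (x >> y);
}

// Check both folds for every legal shift amount.
void checkRotateFolds(std::uint32_t x) {
  for (unsigned y = 1; y < 32; ++y) {
    assert(shlOrSrl(x, y) == rotl32(x, y));
    assert(shlOrSrl(x, y) == rotr32(x, 32u - y));
    assert(srlOrShl(x, y) == rotr32(x, y));
    assert(srlOrShl(x, y) == rotl32(x, 32u - y));
  }
}
```

Calling `checkRotateFolds(0xDEADBEEFu)` exercises the example value used in the message; either rotate direction is a valid lowering, so a target can pick whichever is cheaper.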

typo · 66ccf603
Gabor Greif authored Aug 30, 2008
```
llvm-svn: 55574
```
66ccf603

Aug 30, 2008

fix some 80-col violations · e12264bf
Gabor Greif authored Aug 30, 2008
```
llvm-svn: 55571
```
e12264bf

Re-apply 55467 with fix. If copy is being replaced by remat'ed def, transfer... · a3771d5b

Evan Cheng authored Aug 30, 2008

Re-apply 55467 with fix. If copy is being replaced by remat'ed def, transfer the implicit defs onto the remat'ed instruction.

llvm-svn: 55564

a3771d5b

Fold isRematerializable checks into isSafeToReMat. · 542ac629
Evan Cheng authored Aug 30, 2008
```
llvm-svn: 55563
```
542ac629

Transform (x << (y&31)) -> (x << y). This takes advantage of the fact x86... · cfb7f3ab

Evan Cheng authored Aug 30, 2008

Transform (x << (y&31)) -> (x << y). This takes advantage of the fact that the second operand (shift count) of x86 shift instructions is limited to the range 0 to 31 (or 0 to 63 in the x86-64 case).

llvm-svn: 55558

cfb7f3ab
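
Why the mask can be dropped is easiest to see with a model of the instruction. The sketch below assumes a hypothetical helper, `x86Shl32`, that mimics a 32-bit x86 shift by using only the low five bits of the count; it is not LLVM code, and plain C++ `x << y` must not be used for this because shifting by 32 or more is undefined at the language level.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model of a 32-bit x86 SHL: the hardware looks only at the
// low five bits of the count.
std::uint32_t x86Shl32(std::uint32_t x, std::uint32_t count) {
  return x << (count & 31u);
}

// Before the transform: the count is masked explicitly, then shifted.
std::uint32_t shlMasked(std::uint32_t x, std::uint32_t y) {
  return x86Shl32(x, y & 31u);
}

// After the transform: the explicit mask is gone, the instruction masks anyway.
std::uint32_t shlUnmasked(std::uint32_t x, std::uint32_t y) {
  return x86Shl32(x, y);
}

// The two forms agree for every count, including counts of 32 and above.
void checkMaskIsRedundant(std::uint32_t x) {
  for (std::uint32_t y = 0; y < 256; ++y)
    assert(shlMasked(x, y) == shlUnmasked(x, y));
}
```

The 64-bit case would follow the same shape with a mask of 63.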

Fix an issue where a use might be selected before a def, and then we didn't... · 6f0c51d9

Owen Anderson authored Aug 30, 2008

Fix an issue where a use might be selected before a def, and then we didn't respect the pre-chosen vreg
assignment when selecting the def. This is the naive solution to the problem: insert a copy to the pre-chosen
vreg. Other solutions might be preferable, such as:
1) Passing the dest reg into FastEmit_. However, this would require the higher-level code to know about reg classes, which it currently doesn't.
2) Selecting blocks in reverse postorder. This has some compile time cost for computing the order, and we'd need to measure its impact.

llvm-svn: 55555

6f0c51d9
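
The "naive solution" from this message can be sketched with a toy emitter. Everything here is hypothetical (the `Emitter` struct, its string-based "instructions", the `regForUse` and `selectDef` names) and only illustrates the idea of copying into a pre-chosen vreg; it is not the FastISel implementation.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// Toy instruction emitter; values are identified by name.
struct Emitter {
  unsigned nextVReg = 1;
  std::map<std::string, unsigned> valueMap;  // value name -> chosen vreg
  std::vector<std::string> code;             // emitted pseudo-instructions

  unsigned createVReg() { return nextVReg++; }

  // A use that is selected before its def reserves a vreg for the value.
  unsigned regForUse(const std::string& value) {
    auto it = valueMap.find(value);
    if (it != valueMap.end()) return it->second;
    unsigned reg = createVReg();
    valueMap[value] = reg;
    return reg;
  }

  // Selecting the def always produces the result in a fresh vreg. If a use
  // already picked a vreg for this value, emit a copy into that vreg so the
  // earlier choice is respected.
  unsigned selectDef(const std::string& value) {
    unsigned result = createVReg();
    code.push_back("v" + std::to_string(result) + " = op " + value);
    auto it = valueMap.find(value);
    if (it == valueMap.end()) {
      valueMap[value] = result;
      return result;
    }
    assert(it->second != result && "fresh vreg must differ from the reserved one");
    code.push_back("v" + std::to_string(it->second) + " = copy v" +
                   std::to_string(result));
    return it->second;
  }
};
```

When `regForUse("a")` runs before `selectDef("a")`, the emitted code ends with a copy into the reserved vreg; when the def is selected first, no copy is needed.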

Fix 80 col. violations. · 894be333
Evan Cheng authored Aug 29, 2008
```
llvm-svn: 55551
```
894be333
Back out 55498. It broke Apple-style bootstrapping. · 5e7658c2
Evan Cheng authored Aug 29, 2008
```
llvm-svn: 55549
```
5e7658c2

Aug 29, 2008
- Add a target callback for FastISel. · d58f3e36
  Dan Gohman authored Aug 28, 2008
```
llvm-svn: 55512
```
  d58f3e36
Aug 28, 2008

erect abstraction boundaries for accessing SDValue members, rename Val -> Node to reflect semantics · f304a7aa
Gabor Greif authored Aug 28, 2008
```
llvm-svn: 55504
```
f304a7aa
Implement null and undef values for FastISel. · c45733f1
Dan Gohman authored Aug 28, 2008
```
llvm-svn: 55500
```
c45733f1

Optimize DAGCombiner's worklist processing. Previously it started · f27e33ba

Dan Gohman authored Aug 28, 2008

its work by putting all nodes in the worklist, requiring a big
dynamic allocation. Now, DAGCombiner just iterates over the AllNodes
list and maintains a worklist for nodes that are newly created or
need to be revisited. This allows the worklist to stay small in most
cases, so it can be a SmallVector.

This has the side effect of making DAGCombine not miss a folding
opportunity in alloca-align-rounding.ll.

llvm-svn: 55498

f27e33ba
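
The worklist scheme described here can be sketched in a few lines. This is a simplified model under stated assumptions: `Node`, `Visit` and `runCombine` are hypothetical names, `std::vector` stands in for LLVM's `SmallVector`, and the "combine" step is whatever callback the caller passes in.

```cpp
#include <cassert>
#include <functional>
#include <list>
#include <vector>

// Toy stand-in for a DAG node; not LLVM's SDNode.
struct Node {
  int value;
};

// A visit may push nodes onto 'revisit' when they need another look.
using Visit = std::function<void(Node&, std::vector<Node*>&)>;

// Walk the node list once; the worklist only ever holds the current node
// plus nodes that a visit asked to see again, so it stays small.
int runCombine(std::list<Node>& allNodes, const Visit& visit) {
  std::vector<Node*> worklist;
  int visits = 0;
  for (Node& node : allNodes) {
    worklist.push_back(&node);
    while (!worklist.empty()) {
      Node* current = worklist.back();
      worklist.pop_back();
      assert(current && "worklist holds only live nodes");
      ++visits;
      visit(*current, worklist);
    }
  }
  return visits;
}

// Example visit: halve even values and ask to be revisited after each change.
void halveWhileEven(Node& node, std::vector<Node*>& revisit) {
  if (node.value != 0 && node.value % 2 == 0) {
    node.value /= 2;
    revisit.push_back(&node);
  }
}
```

The contrast with the earlier approach is that nothing proportional to the total node count is ever allocated for the worklist itself.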

Move CaseBlock, JumpTable, and BitTestBlock to be members of · 17da6719

Dan Gohman authored Aug 28, 2008

SelectionDAGLowering instead of being in an anonymous namespace.
This fixes warnings about SelectionDAGLowering having fields
using anonymous namespaces.

llvm-svn: 55497

17da6719

Fix a FastISel bug where the instructions from lowering the arguments · 360c57f6
Dan Gohman authored Aug 28, 2008
```
were being emitted after the first instructions of the entry block.

llvm-svn: 55496
```
360c57f6
Reduce the size of the Parts vector. · 6c8a99a7
Rafael Espindola authored Aug 28, 2008
```
llvm-svn: 55483
```
6c8a99a7
Hook up support for fast-isel of trunc instructions, using the newly working... · d8a82b75
Owen Anderson authored Aug 28, 2008
```
Hook up support for fast-isel of trunc instructions, using the newly working support for EXTRACT_SUBREG.

llvm-svn: 55482
```
d8a82b75

FastEmitInst_extractsubreg doesn't need to be passed the register class. It... · 9cd1a5e5

Owen Anderson authored Aug 28, 2008

FastEmitInst_extractsubreg doesn't need to be passed the register class.  It can get it from MachineRegisterInfo instead.

llvm-svn: 55476

9cd1a5e5

Revert r55467; it causes regressions in UnitTests/Vector/divides, · 04cf2e45
Dan Gohman authored Aug 28, 2008
```
Benchmarks/sim/sim, and others on x86-64.

llvm-svn: 55475
```
04cf2e45
Correctly resize the Parts array. · 029c1c84
Rafael Espindola authored Aug 28, 2008
```
llvm-svn: 55471
```
029c1c84

If a copy isn't coalesced, but its src is defined by trivial computation.... · 69756020

Evan Cheng authored Aug 28, 2008

If a copy isn't coalesced but its src is defined by a trivial computation, re-materialize the src to replace the copy.

llvm-svn: 55467

69756020

Split the ATOMIC NodeTypes to include the size, e.g. · 41be0d44

Dale Johannesen authored Aug 28, 2008

ATOMIC_LOAD_ADD_{8,16,32,64} instead of ATOMIC_LOAD_ADD.
Increased the hard-coded constant OpActionsCapacity to match.
Large but boring; no functional change.

This is to support partial-word atomics on ppc; i8 is
not a valid type there, so by the time we get to lowering, the
ATOMIC_LOAD nodes look the same whether the type was i8 or i32.
The information can be added to the AtomicSDNode, but that is the
largest SDNode; I don't fully understand the SDNode allocation,
but it is sensitive to the largest node size, so increasing
that must be bad.  This is the alternative.

llvm-svn: 55457

41be0d44
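
A minimal sketch of the idea, assuming a trimmed-down opcode enum: the real `ISD::NodeType` has many more entries and different numeric values, and the helper names `atomicLoadAddFor` and `atomicWidthInBits` are hypothetical. The point is that the access width travels in the opcode, so it survives even after the value type has been widened.

```cpp
#include <cassert>

// Hypothetical, trimmed-down opcode enum with the size encoded in the name.
enum NodeType {
  ATOMIC_LOAD_ADD_8,
  ATOMIC_LOAD_ADD_16,
  ATOMIC_LOAD_ADD_32,
  ATOMIC_LOAD_ADD_64
};

// Choose the opcode from the memory access width in bits.
NodeType atomicLoadAddFor(unsigned bits) {
  switch (bits) {
  case 8:  return ATOMIC_LOAD_ADD_8;
  case 16: return ATOMIC_LOAD_ADD_16;
  case 32: return ATOMIC_LOAD_ADD_32;
  case 64: return ATOMIC_LOAD_ADD_64;
  default:
    assert(false && "unsupported atomic width");
    return ATOMIC_LOAD_ADD_32;
  }
}

// Recover the access width from the opcode alone, without consulting the
// (possibly already widened) value type.
unsigned atomicWidthInBits(NodeType opcode) {
  switch (opcode) {
  case ATOMIC_LOAD_ADD_8:  return 8;
  case ATOMIC_LOAD_ADD_16: return 16;
  case ATOMIC_LOAD_ADD_32: return 32;
  case ATOMIC_LOAD_ADD_64: return 64;
  }
  return 0;  // not reached for valid opcodes
}
```

This trades a larger opcode space for keeping the node itself the same size, which is the trade-off the message argues for.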

Reorganize the lifetimes of the major objects SelectionDAGISel · e1a9a780

Dan Gohman authored Aug 27, 2008

works with.

SelectionDAG, FunctionLoweringInfo, and SelectionDAGLowering
objects now get created once per SelectionDAGISel instance, and
can be reused across blocks and across functions. Previously,
they were created and destroyed each time they were needed.

This reorganization simplifies the handling of PHI nodes, and
also SwitchCases, JumpTables, and BitTestBlocks. This
simplification has the side effect of fixing a bug in FastISel
where successor PHI nodes weren't being updated correctly.

This is also a step towards making the transition from FastISel
into and out of SelectionDAG faster, and also making
plain SelectionDAG faster on code with lots of little blocks.

llvm-svn: 55450

e1a9a780

Add a helper method that will be used to support EXTRACT_SUBREG for selecting truncs in fast-isel. · 5f57bc22
Owen Anderson authored Aug 27, 2008
```
llvm-svn: 55439
```
5f57bc22

Aug 27, 2008
- Move the check whether it's worth remating to caller. · f016b263
  Evan Cheng authored Aug 27, 2008
```
llvm-svn: 55434
```
  f016b263
- Fix FastISel's bitcast code for the case where getRegForValue fails. · 61cfa309
  Dan Gohman authored Aug 27, 2008
```
llvm-svn: 55431
```
  61cfa309
- Refactor isSafeToReMat out of 2addr pass. · 57dc0785
  Evan Cheng authored Aug 27, 2008
```
llvm-svn: 55430
```
  57dc0785
- Use TargetLowering to get the types in fast isel, which handles pointer types... · 90609850
  Owen Anderson authored Aug 27, 2008
```
Use TargetLowering to get the types in fast isel, which handles pointer types correctly for our purposes.

llvm-svn: 55428
```
  90609850
- Don't check TLI.getOperationAction. The FastISel way is to · d01789be
  Dan Gohman authored Aug 27, 2008
```
just try to do the action and let the tablegen-generated code
determine if there is target support for an operation.

llvm-svn: 55427
```
  d01789be
- Add a new FastISel method, getRegForValue, which takes care of · b0b5a274
  Dan Gohman authored Aug 27, 2008
```
the details of materializing constants and other values into
registers, and make use of it in several places.

llvm-svn: 55426
```
  b0b5a274
- Add a comment about the current floating-point constant code in FastISel. · f2a6c157
  Dan Gohman authored Aug 27, 2008
```
llvm-svn: 55425
```
  f2a6c157
- Optimize ScheduleDAGRRList's topological sort to use one pass instead · 3a3a52de
  Dan Gohman authored Aug 27, 2008
```
of two, and to not need a scratch std::vector. Also, compute the ordering
immediately in the result array, instead of in another scratch std::vector
that is copied to the result array.

llvm-svn: 55421
```
  3a3a52de
- Optimize ScheduleDAG's ComputeDepths and ComputeHeights to not need · 9cbdedcb
  Dan Gohman authored Aug 27, 2008
```
a scratch std::vector.

llvm-svn: 55420
```
  9cbdedcb
- Remove the std::ostream form of PseudoSourceValue's print, · a5b15bd0
  Dan Gohman authored Aug 27, 2008
```
which isn't needed anymore.

llvm-svn: 55419
```
  a5b15bd0
- Basic FastISel support for floating-point constants. · 5ca269e6
  Dan Gohman authored Aug 27, 2008
```
llvm-svn: 55401
```
  5ca269e6
- Fix handling of inttoptr and ptrtoint when unhandled operands are present. · 54aff7bb
  Owen Anderson authored Aug 27, 2008
```
llvm-svn: 55400
```
  54aff7bb
- Add support for fast isel of inttoptr and ptrtoint in the cases where truncation is not needed. · 14054925
  Owen Anderson authored Aug 27, 2008
```
llvm-svn: 55399
```
  14054925
- Factor out a large amount of the cast handling code in fast isel into helper methods. · ca1711a5
  Owen Anderson authored Aug 26, 2008
```
This simultaneously makes the code simpler and adds support for sext as well.

llvm-svn: 55398
```
  ca1711a5
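
For the ScheduleDAGRRList topological-sort change listed above (llvm-svn: 55421), the following is a generic sketch of writing the ordering straight into the result array. It is Kahn's algorithm over a plain adjacency list, not the scheduler's code; `topoOrder`, `succs` and `pending` are hypothetical names, and the sketch still keeps a per-node counter array, so it illustrates only the "no separate worklist, no copy into the result" part of the change.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// succs[u] lists the successors of node u. Returns the nodes in topological
// order; if the graph contains a cycle the result is shorter than succs.size().
std::vector<int> topoOrder(const std::vector<std::vector<int>>& succs) {
  const std::size_t n = succs.size();

  // Number of not-yet-emitted predecessors for each node.
  std::vector<int> pending(n, 0);
  for (const auto& out : succs) {
    for (int v : out) {
      assert(v >= 0 && static_cast<std::size_t>(v) < n && "edge out of range");
      ++pending[v];
    }
  }

  // The result array doubles as the work queue: entries before 'head' are
  // fully processed, entries from 'head' onward are ready but not expanded.
  std::vector<int> order;
  order.reserve(n);
  for (std::size_t u = 0; u < n; ++u)
    if (pending[u] == 0) order.push_back(static_cast<int>(u));

  for (std::size_t head = 0; head < order.size(); ++head)
    for (int v : succs[order[head]])
      if (--pending[v] == 0) order.push_back(v);

  return order;
}
```

Because ready nodes are appended to `order` as soon as their last predecessor is emitted, no second pass or scratch ordering vector is needed.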