Commits · d5d63597853462446e0d3f34b06518eb4e1644f0 · Roger Ferrer / llvm-epi-0.8

Aug 11, 2011

ARM LDRT assembly parsing and encoding. · d5d63597
Jim Grosbach authored Aug 10, 2011
```
llvm-svn: 137282
```
d5d63597
Tidy up. 80 columns. · d3f7bcd4
Jim Grosbach authored Aug 10, 2011
```
llvm-svn: 137277
```
d3f7bcd4

Andrew Trick authored Aug 10, 2011

An algorithm for incrementally updating LoopInfo within a
LoopPassManager. The incremental update should be extremely cheap in
most cases and can be used in places where it's not feasible to
regenerate the entire loop forest.

- "Unloop" is a node in the loop tree whose last backedge has been removed.
- Perform reverse dataflow on the block inside Unloop to propagate the
  nearest loop from the block's successors.
- For reducible CFG, each block in unloop is visited exactly
  once. This is because unloop no longer has a backedge and blocks
  within subloops don't change parents.
- Immediate subloops are summarized by the nearest loop reachable from
  their exits or exits within nested subloops.
- At completion the unloop blocks each have a new parent loop, and
  each immediate subloop has a new parent.

llvm-svn: 137276

d3530b91

ARM LDRH(immediate) assembly parsing and encoding support. · cd4dd255
Jim Grosbach authored Aug 10, 2011
```
llvm-svn: 137260
```
cd4dd255

Aug 10, 2011
- ARM LDRD(register) assembly parsing and encoding. · 1d9d5e93
  Jim Grosbach authored Aug 10, 2011
```
Add support for literal encoding of #-0 along the way.

llvm-svn: 137254
```
  1d9d5e93
- · bb23a4a9
  Devang Patel authored Aug 10, 2011
```
Distinguish between two copies of one inlined variable. Take 2.

llvm-svn: 137253
```
  bb23a4a9
- While extending definition range of a debug variable, consult lexical scopes... · 37a62058
  Devang Patel authored Aug 10, 2011
```
While extending definition range of a debug variable, consult lexical scopes also. There is no point extending debug variable out side its lexical block. This provides 6x compile time speedup in some cases.

llvm-svn: 137250
```
  37a62058
- Revert unintentional parts of previous check-in. · e30746c8
  Devang Patel authored Aug 10, 2011
```
llvm-svn: 137249
```
  e30746c8
- Start using LexicalScopes utility. No intetional functionality change. · 7e62302f
  Devang Patel authored Aug 10, 2011
```
llvm-svn: 137246
```
  7e62302f
- Fix typo. Not quite sure how that slipped in there. · f7164b2c
  Jim Grosbach authored Aug 10, 2011
```
llvm-svn: 137245
```
  f7164b2c
- ARM LDRD(immediate) assembly parsing and encoding support. · 5b96b806
  Jim Grosbach authored Aug 10, 2011
```
llvm-svn: 137244
```
  5b96b806
- When performing a truncating store, it is sometimes possible to rearrange the · 410a11fe
  Nadav Rotem authored Aug 10, 2011
```
data in-register prior to saving to memory.  When we reorder the data in memory
we prevent the need to save multiple scalars to memory, making a single regular
store.

llvm-svn: 137238
```
  410a11fe
- Provide utility to extract and use lexical scoping information from machine instructions. · e1649c31
  Devang Patel authored Aug 10, 2011
```
llvm-svn: 137237
```
  e1649c31
- Add initial support for decoding NEON instructions in Thumb2 mode. · c86a5bd2
  Owen Anderson authored Aug 10, 2011
```
llvm-svn: 137236
```
  c86a5bd2
- Comments. Thanks for the spell check Nick! · 6dbb0607
  Andrew Trick authored Aug 10, 2011
```
Also, my apologies for spoiling the autocomplete on SimplifyInstructions.cpp. I couldn't think of a better filename.

llvm-svn: 137229
```
  6dbb0607
- The following X86 pattern is incorrect: · 3ff111c1
  Bruno Cardoso Lopes authored Aug 10, 2011
```
def : Pat<(X86Movss VR128:$src1,
                   (bc_v4i32 (v2i64 (load addr:$src2)))),
          (MOVLPSrm VR128:$src1, addr:$src2)>;
This matches a MOVSS dag with a MOVLPS instruction. However, MOVSS will replace only the low 32 bits of the register, while the MOVLPS instruction will replace the low 64 bits. A testcase is added and illustrates the bug and also modified the one that was already present. Patch by Tanya Lattner.

llvm-svn: 137227
```
  3ff111c1
- Whitespace. · cad9f2af
  Eli Friedman authored Aug 10, 2011
```
llvm-svn: 137226
```
  cad9f2af
- Tabs --> spaces. · 1531e5cd
  Owen Anderson authored Aug 10, 2011
```
llvm-svn: 137225
```
  1531e5cd
- Cleanups based on Nick Lewycky's feedback. · 5d69f63b
  Owen Anderson authored Aug 10, 2011
```
llvm-svn: 137224
```
  5d69f63b
- Rewrite some ARM InstrInfo functions to be most accepting of arbitrary... · 732f82c4
  Owen Anderson authored Aug 10, 2011
```
Rewrite some ARM InstrInfo functions to be most accepting of arbitrary register subclasses.  Hopefully this fixes some buildbots.

llvm-svn: 137223
```
  732f82c4
- Add support for the R and Q constraints. · 36a3abc6
  Rafael Espindola authored Aug 10, 2011
```
llvm-svn: 137217
```
  36a3abc6
- Clarify a comment. · 527bd079
  Bob Wilson authored Aug 10, 2011
```
llvm-svn: 137204
```
  527bd079
- Invoke SimplifyIndVar when we partially unroll a loop. Fixes PR10534. · 4d0040ba
  Andrew Trick authored Aug 10, 2011
```
llvm-svn: 137203
```
  4d0040ba
- Cleanup. Make ScalarEvolution an explicit argument of the · e629d008
  Andrew Trick authored Aug 10, 2011
```
SimplifyIndVar utility since it is required.

llvm-svn: 137202
```
  e629d008
- SimplifyIndVar: make foldIVUser iterative to fold a chain of operands. · 74664d5e
  Andrew Trick authored Aug 10, 2011
```
llvm-svn: 137199
```
  74664d5e
- Update CMake build. · 0b0e47d6
  Benjamin Kramer authored Aug 10, 2011
```
llvm-svn: 137198
```
  0b0e47d6
- Added a SimplifyIndVar utility to simplify induction variable users · 3ec331ea
  Andrew Trick authored Aug 10, 2011
```
based on ScalarEvolution without changing the induction variable phis.

This utility is the main tool of IndVarSimplifyPass, but the pass also
restructures induction variables in strange ways that are sensitive to
pass ordering. This provides a way for other loop passes to simplify
new uses of induction variables created during transformation. The
utility may be used by any pass that preserves ScalarEvolution. Soon
LoopUnroll will use it.

The net effect in this checkin is to cleanup the IndVarSimplify pass
by factoring out the SimplifyIndVar algorithm into a standalone utility.

llvm-svn: 137197
```
  3ec331ea
- Cleanup. Added LoopBlocksDFS::perform for simple clients. · 78b40c3f
  Andrew Trick authored Aug 10, 2011
```
llvm-svn: 137195
```
  78b40c3f
- Fix a bug in vpermilps mask checking. Fix PR10560 · 278ffd7d
  Bruno Cardoso Lopes authored Aug 10, 2011
```
llvm-svn: 137194
```
  278ffd7d
- Fix the LoopUnroller to handle nontrivial loops and partial unrolling. · b72bbe2a
  Andrew Trick authored Aug 10, 2011
```
These are not individual bug fixes. I had to rewrite a good chunk of
the unroller to make it sane. I think it was getting lucky on trivial
completely unrolled loops with no early exits. I included some fairly
simple unit tests for partial unrolling. I didn't do much stress
testing, so it may not be perfect, but should be usable now.

llvm-svn: 137190
```
  b72bbe2a
- Push GPRnopc through a large number of instruction definitions to tighten operand decoding. · 8059f0cf
  Owen Anderson authored Aug 10, 2011
```
llvm-svn: 137189
```
  8059f0cf
- Trim an unneeded header. · b91e4899
  Jakob Stoklund Olesen authored Aug 09, 2011
```
llvm-svn: 137184
```
  b91e4899
- Promote VMOVS to VMOVD when possible. · 6a14dc01
  Jakob Stoklund Olesen authored Aug 09, 2011
```
On Cortex-A8, we use the NEON v2f32 instructions for f32 arithmetic. For
better latency, we also send D-register copies down the NEON pipeline by
translating them to vorr instructions.

This patch promotes even S-register copies to D-register copies when
possible so they can also go down the NEON pipeline.  Example:

        vldr.32 s0, LCPI0_0
    loop:
        vorr    d1, d0, d0
    loop2:
        ...
        vadd.f32        d1, d1, d16

The vorr instruction looked like this after regalloc:

    %S2<def> = COPY %S0, %D1<imp-def>

Copies involving odd S-registers, and copies that don't define the full
D-register are left alone.

llvm-svn: 137182
```
  6a14dc01
- Tighten operand checking of register-shifted-register operands. · 92b942b1
  Owen Anderson authored Aug 09, 2011
```
llvm-svn: 137180
```
  92b942b1
- Add 256-bit support for v8i32, v4i64 and v4f64 ISD::SELECT. Fix PR10556 · 72323966
  Bruno Cardoso Lopes authored Aug 09, 2011
```
llvm-svn: 137179
```
  72323966
- Tighten operand checking on memory barrier instructions. · e008931b
  Owen Anderson authored Aug 09, 2011
```
llvm-svn: 137176
```
  e008931b
- VMCore/BasicBlock.cpp: Don't assume BasicBlock::iterator might end with a... · 4f041651
  NAKAMURA Takumi authored Aug 09, 2011
```
VMCore/BasicBlock.cpp: Don't assume BasicBlock::iterator might end with a non-PHInode Instruction in successors.

Frontends(eg. clang) might pass incomplete form of IR, to step off the way beyond iterator end. In the case I had met, it took infinite loop due to meeting bogus PHInode.

Thanks to Jay Foad and John McCall.

llvm-svn: 137175
```
  4f041651
- Fix whitespace. · 5b64b810
  NAKAMURA Takumi authored Aug 09, 2011
```
llvm-svn: 137174
```
  5b64b810
- Tighten operand checking on CPS instructions. · 3d2e0e9d
  Owen Anderson authored Aug 09, 2011
```
llvm-svn: 137172
```
  3d2e0e9d
- Representation of 'atomic load' and 'atomic store' in IR. · 59b66883
  Eli Friedman authored Aug 09, 2011
```
llvm-svn: 137170
```
  59b66883