Commits · 52340ac5f8dbc5eed11bc5cc21c0175cbc4fcc1e · Roger Ferrer / llvm-epi-0.8

Oct 23, 2011
- Oops! Fix test I forgot to submit as part of r142735. · 52340ac5
  Nick Lewycky authored Oct 22, 2011
```
llvm-svn: 142736
```
  52340ac5

Oct 22, 2011

A non-escaping malloc in the entry block is not unlike an alloca. Do dead-store · 32f8051d
Nick Lewycky authored Oct 22, 2011
```
elimination on them too.

llvm-svn: 142735
```
32f8051d
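A minimal sketch of the kind of code this commit message describes, assuming a heap allocation made at function entry whose pointer is never stored or passed anywhere else. The function name `compute` is hypothetical, and which exact stores the pass removes is an assumption here, not something the commit message states.

```cpp
#include <cstdlib>

// Hypothetical example: `p` is allocated at function entry and never escapes,
// so the allocation behaves much like a local (alloca-style) object.
int compute(int x) {
    int *p = static_cast<int *>(std::malloc(sizeof(int)));
    if (!p) return x + 1;   // allocation failed: fall back to the plain value

    *p = x + 1;             // this store is read on the next line
    int result = *p;

    *p = 0;                 // never read again before the memory goes away:
                            // a candidate for dead-store elimination
    std::free(p);
    return result;
}
```

Because nothing outside `compute` can observe the allocation, the final `*p = 0` has no visible effect, which is the same reasoning that already applies to stack objects.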

Make SCEV's brute force analysis stronger in two ways. Firstly, we should be · a6674c7f

Nick Lewycky authored Oct 22, 2011

able to constant fold load instructions where the argument is a constant.
Second, we should be able to watch multiple PHI nodes through the loop; this
patch only supports PHIs in loop headers, more can be done here.

With this patch, we now constant evaluate:
  static const int arr[] = {1, 2, 3, 4, 5};
  int test() {
    int sum = 0;
    for (int i = 0; i < 5; ++i) sum += arr[i];
    return sum;
  }

llvm-svn: 142731

a6674c7f

Fix pr11193. · e649d665

Nadav Rotem authored Oct 22, 2011

SHL inserts zeros from the right, so even when the original
sign_extend_inreg value was only 1 bit wide, we still need the sra.

llvm-svn: 142724

e649d665
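The shift-left / arithmetic-shift-right pairing mentioned above can be sketched in C++ as follows. This is an illustration of the general technique on 32-bit values, not the legalizer code from the commit; the helper name `sign_extend_inreg` and its signature are assumptions chosen to mirror the node name.

```cpp
#include <cassert>
#include <cstdint>

// Sign-extend the low `bits` bits of `value` to a full 32-bit integer.
// Assumes 1 <= bits <= 32 and an arithmetic right shift for signed values
// (guaranteed since C++20, and the behavior of mainstream compilers before).
inline int32_t sign_extend_inreg(uint32_t value, unsigned bits) {
    assert(bits >= 1 && bits <= 32);
    const unsigned shift = 32 - bits;
    // SHL moves the field's sign bit to bit 31 and fills the low bits with zeros.
    const uint32_t shifted = value << shift;
    // SRA copies bit 31 back down; a logical shift would lose the sign.
    return static_cast<int32_t>(shifted) >> shift;
}
```

For a 1-bit field holding 1, the left shift produces `0x80000000`, and only the arithmetic right shift turns that into -1; a logical shift would give 1 instead.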

Assembly parsing for 4-register sequential variant of VLD2. · 11c0b347
Jim Grosbach authored Oct 21, 2011
```
llvm-svn: 142704
```
11c0b347
Assembly parsing for 2-register sequential variant of VLD2. · 118b38cb
Jim Grosbach authored Oct 21, 2011
```
llvm-svn: 142691
```
118b38cb

Oct 21, 2011

Remap blockaddress correctly when inlining a function. Fixes PR10162. · 688db1d6
Eli Friedman authored Oct 21, 2011
```
llvm-svn: 142684
```
688db1d6
Assembly parsing for 4-register variant of VLD1. · 846bcff7
Jim Grosbach authored Oct 21, 2011
```
llvm-svn: 142682
```
846bcff7
Assembly parsing for 3-register variant of VLD1. · c4360fe5
Jim Grosbach authored Oct 21, 2011
```
llvm-svn: 142675
```
c4360fe5

Extend instcombine's shufflevector simplification to handle more cases where... · ce818277

Eli Friedman authored Oct 21, 2011

Extend instcombine's shufflevector simplification to handle more cases where the input and output vectors have different sizes.  Patch by Xiaoyi Guo.

llvm-svn: 142671

ce818277

ARM VLD parsing and encoding. · 2f2e3c47

Jim Grosbach authored Oct 21, 2011

Next step in the ongoing saga of NEON load/store assembly parsing. Handle
VLD1 instructions that take a two-register register list.

Adjust the instruction definitions to only have the single encoded register
as an operand. The super-register from the pseudo is kept as an implicit def,
so passes which come after pseudo-expansion still know that the instruction
defines the other subregs.

llvm-svn: 142670

2f2e3c47

Fix pr11194. When promoting and splitting integers we need to use · 5e00bb5f

Nadav Rotem authored Oct 21, 2011

ZExtPromotedInteger and SExtPromotedInteger based on the operation we legalize.

SetCC return type needs to be legalized via PromoteTargetBoolean.

llvm-svn: 142660

5e00bb5f

Don't hard code the desired alignment for loops -- it isn't 16-bytes on · 70a38058
Chandler Carruth authored Oct 21, 2011
```
all x86 systems. Sorry for the breakage.

llvm-svn: 142656
```
70a38058
1. Fix the widening of SETCC in WidenVecOp_SETCC. Use the correct return CC type. · d315157f
Nadav Rotem authored Oct 21, 2011
```
2. Fix a typo in CONCAT_VECTORS which exposed the bug in #1.

llvm-svn: 142648
```
d315157f

Add loop aligning to MachineBlockPlacement based on review discussion so · 8b9737cb

Chandler Carruth authored Oct 21, 2011

it's a bit more plausible to use this instead of CodePlacementOpt. The
code for this was shamelessly stolen from CodePlacementOpt, and then
trimmed down a bit. There doesn't seem to be much utility in returning
true/false from this pass as we may or may not have rewritten all of the
blocks. Also, the statistic of counting how many loops were aligned
doesn't seem terribly important so I removed it. If folks would like it
to be included, I'm happy to add it back.

This was probably the most egregious of the missing features, and now
I'm going to start gathering some performance numbers and looking at
specific loop structures that have different layout between the two.

Test is updated to include both basic loop alignment and nested loop
alignment.

llvm-svn: 142645

8b9737cb

Add a very basic test for MachineBlockPlacement. This is essentially the · ddfeaafd

Chandler Carruth authored Oct 21, 2011

canonical example I used when developing it, and is one of the primary
motivating real-world use cases for __builtin_expect (when buried under
a macro).

I'm working on more test cases here, but I'm trying to make sure both
that the pass is doing the right thing with the test cases and that they
aren't too brittle to changes elsewhere in the code generation pipeline.

Feedback and/or suggestions on how to test this are very welcome.
Especially feedback on whether testing the block comments is a good
strategy; I couldn't find any good examples to steal from but all the
other ideas I had were a lot uglier or more fragile.

llvm-svn: 142644

ddfeaafd
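For readers unfamiliar with the pattern, a minimal sketch of `__builtin_expect` wrapped in a macro might look like this. The macro names `LIKELY`/`UNLIKELY` and the function `sum_nonnegative` are hypothetical and are not taken from the test added in this commit; the builtin itself is a GCC/Clang extension.

```cpp
#include <cstddef>

// Hypothetical wrapper macros; `!!(x)` normalizes any truthy value to 0 or 1.
#define LIKELY(x)   __builtin_expect(!!(x), 1)
#define UNLIKELY(x) __builtin_expect(!!(x), 0)

// Hypothetical function with a hot loop and a rarely taken error exit.
int sum_nonnegative(const int *data, std::size_t n) {
    int sum = 0;
    for (std::size_t i = 0; i < n; ++i) {
        if (UNLIKELY(data[i] < 0))
            return -1;          // cold path: hinted as unlikely
        sum += data[i];         // hot path: expected to run almost every time
    }
    return sum;
}
```

The hint does not change the result; it only tells the compiler which successor block is expected, which is the information a block placement pass can use when laying out code.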

Remove intrinsics for X86 BLSI, BLSMSK, and BLSR and replace with... · 039a7906
Craig Topper authored Oct 21, 2011
```
Remove intrinsics for X86 BLSI, BLSMSK, and BLSR and replace with custom isel lowering code.

llvm-svn: 142642
```
039a7906
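The three BMI instructions named above correspond to simple bit-manipulation identities, which is presumably why plain lowering code can stand in for dedicated intrinsics. The sketch below shows those identities on 32-bit values; the function names are illustrative and the actual isel code in the commit is not reproduced here.

```cpp
#include <cstdint>

// BLSI: isolate the lowest set bit (x & -x).
inline uint32_t blsi(uint32_t x)   { return x & (0u - x); }

// BLSMSK: mask of all bits up to and including the lowest set bit (x ^ (x - 1)).
inline uint32_t blsmsk(uint32_t x) { return x ^ (x - 1u); }

// BLSR: reset (clear) the lowest set bit (x & (x - 1)).
inline uint32_t blsr(uint32_t x)   { return x & (x - 1u); }
```

Unsigned arithmetic is used so that the `x == 0` cases wrap around in a well-defined way instead of overflowing.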
Revert r142618, r142622, and r142624, which were based on an incorrect reading of the ARMv7 docs. · 16c8fc51
Owen Anderson authored Oct 20, 2011
```
llvm-svn: 142626
```
16c8fc51
Fix decoding tests for fixed MSR encodings. · 608c60c7
Owen Anderson authored Oct 20, 2011
```
llvm-svn: 142624
```
608c60c7

Oct 20, 2011
- Fix tests for corrected MSR encodings. · 48da0ed4
  Owen Anderson authored Oct 20, 2011
```
llvm-svn: 142622
```
  48da0ed4
- ARM VLD1/VST1 (one register, no writeback) assembly parsing and encoding. · 9036c5cf
  Jim Grosbach authored Oct 20, 2011
```
llvm-svn: 142583
```
  9036c5cf
- Tidy up formatting. · 3ad44e50
  Jim Grosbach authored Oct 20, 2011
```
llvm-svn: 142582
```
  3ad44e50
- ARM VTBX (one register) assembly parsing and encoding. · 8db25984
  Jim Grosbach authored Oct 20, 2011
```
llvm-svn: 142581
```
  8db25984
- Refactor code from inlining and globalopt that checks whether a function... · 1923a330
  Eli Friedman authored Oct 20, 2011
```
Refactor code from inlining and globalopt that checks whether a function definition is unused, and enhance it so it can tell that functions which are only used by a blockaddress are in fact dead.  This probably doesn't happen much on most code, but the Linux kernel's _THIS_IP_ can trigger this issue with blockaddress.  (GlobalDCE can also handle the given testcase, but we only run that at -O3.)  Found while looking at PR11180.

llvm-svn: 142572
```
  1923a330
- "@string = constant i8 0" is a value i8* string of length zero. Analyze that · 46209882
  Nick Lewycky authored Oct 20, 2011
```
correctly in GetStringLength, fixing PR11181!

llvm-svn: 142558
```
  46209882
- Revert 142337. Thumb1 still doesn't support dynamic stack realignment. :( · add38c12
  Chad Rosier authored Oct 20, 2011
```
llvm-svn: 142557
```
  add38c12
- Fix TLS lowering bug. The CopyFromReg must be glued to the TLSCALL. rdar://10291355 · 54d678ff
  Evan Cheng authored Oct 19, 2011
```
llvm-svn: 142550
```
  54d678ff

Oct 19, 2011

Improve code generation for vselect on SSE2: · 8824472a

Nadav Rotem authored Oct 19, 2011

When checking the availability of instructions using the TLI, a 'promoted'
instruction IS available. It means that the value is bitcasted to another type
for which there is an operation. The correct check for the availability of an
instruction is to check if it should be expanded.

llvm-svn: 142542

8824472a

Fix parsing of a line with only a # in it. · e0d09083
Rafael Espindola authored Oct 19, 2011
```
llvm-svn: 142537
```
e0d09083

Use literal pool loads instead of MOVW/MOVT for materializing global addresses... · 2d768fd3

James Molloy authored Oct 19, 2011

Use literal pool loads instead of MOVW/MOVT for materializing global addresses when optimizing for size.

On spec/gcc, this caused a codesize improvement of ~1.9% for ARM mode and ~4.9% for Thumb(2) mode. This is
codesize including literal pools.

The pools themselves doubled in size for ARM mode and quintupled for Thumb mode, suggesting that there
is still perhaps redundancy in LLVM's use of constant pools that could be decreased by sharing entries.

Fixes PR11087.

llvm-svn: 142530

2d768fd3

Add Paste Test · 13c8360c
David Greene authored Oct 19, 2011
```
This tests TableGen's paste functionality.

llvm-svn: 142526
```
13c8360c

Add NAME Member · d699161a

David Greene authored Oct 19, 2011

Add a Value named "NAME" to each Record.  This will be set to the def or defm
name when instantiating multiclasses.  This will replace the #NAME# processing
hack once paste functionality is in place.

llvm-svn: 142518

d699161a

Generalize the reading of probability metadata to work for both branches · deac50cb

Chandler Carruth authored Oct 19, 2011

and switches, with arbitrary numbers of successors. Still optimized for
the common case of 2 successors for a conditional branch.

Add a test case for switch metadata showing up in the BlockFrequencyInfo pass.

llvm-svn: 142493

deac50cb

Teach the BranchProbabilityInfo analysis pass to read any metadata · d27a7a94

Chandler Carruth authored Oct 19, 2011

encoding of probabilities. In the absence of metadata, it continues to
fall back on static heuristics.

This allows __builtin_expect, after lowering through llvm.expect to
a branch instruction's metadata, to actually enter the branch
probability model. This is one component of resolving PR2577.

llvm-svn: 142492

d27a7a94

Add pass printing support to BlockFrequencyInfo pass. The implementation · 343fad44

Chandler Carruth authored Oct 19, 2011

layer already had support for printing the results of this analysis, but
the wiring was missing.

Now that printing the analysis works, actually bring some of this
analysis, and the BranchProbabilityInfo analysis that it wraps, under
test! I'm planning on fixing some bugs and doing other work here, so
having a nice place to add regression tests and a way to observe the
results is really useful.

llvm-svn: 142491

343fad44

Add support for the vector-widening of vselect and vector-setcc · 6652e22b
Nadav Rotem authored Oct 19, 2011
```
llvm-svn: 142488
```
6652e22b
Rename PEXTR to PEXT. Add intrinsics for BMI instructions. · ef309c33
Craig Topper authored Oct 19, 2011
```
llvm-svn: 142480
```
ef309c33
Added testcase for <rdar://problem/10215997> · 20a04e74
Lang Hames authored Oct 18, 2011
```
llvm-svn: 142462
```
20a04e74
Add additional element-promotion tests. · 0d339335
Nadav Rotem authored Oct 18, 2011
```
llvm-svn: 142442
```
0d339335
Fix a bug in the legalization of vector anyext-load and trunc-store. Mem Index starts with zero. · 75c2229f
Nadav Rotem authored Oct 18, 2011
```
llvm-svn: 142434
```
75c2229f