Commits · 9bc48e521518407efb7910ac81be022cf66c91f6 · Roger Ferrer / llvm-epi-0.8

Jan 11, 2012

Disable the transformation I added in r147936 to see if it fixes some · 9bc48e52

Chandler Carruth authored Jan 11, 2012

strange build bot failures that look like a miscompile into an infloop.
I'll investigate this tomorrow, but I'd both like to know whether my
patch is the culprit, and get the bots back to green.

llvm-svn: 147945

9bc48e52

Hoist a really redundant code pattern into a helper function, and delete · 3eacfb83
Chandler Carruth authored Jan 11, 2012
```
lots of lines of code. No functionality changed.

llvm-svn: 147942
```
3eacfb83
Simplify the AND-rooted mask+shift checking code to match that of the · b0049f4a
Chandler Carruth authored Jan 11, 2012
```
SRL-rooted code.

llvm-svn: 147941
```
b0049f4a

Unify the interface of the three mask+shift transform helpers, and · 3dbcda84

Chandler Carruth authored Jan 11, 2012

factor the differences that were hiding in one of them into its other
caller, the SRL handling code. No change in behavior.

llvm-svn: 147940

3dbcda84

Clarify and make explicit some of the requirements for transforming · aa01e666

Chandler Carruth authored Jan 11, 2012

mask+shift pairs at the beginning of the ISD::AND case block, and then
hoist the final pattern into a helper function, simplifying and
reflowing it appropriately. This should have no observable behavior
change, but several simplifications fell out of this such as directly
computing the new mask constant, etc.

llvm-svn: 147939

aa01e666

Fix undefined code and reenable test case. · 60399837

Jakob Stoklund Olesen authored Jan 11, 2012

I don't think the compact encoding code is right, but at least is has
defined behavior now.

llvm-svn: 147938

60399837

Hoist the logic to transform shift+mask combinations into sub-register · 51d3076b

Chandler Carruth authored Jan 11, 2012

extracts and scaled addressing modes into its own helper function. No
functionality changed here, just hoisting and layout fixes falling out
of that hoisting.

llvm-svn: 147937

51d3076b

Teach the X86 instruction selection to do some heroic transforms to · 55b2cdee

Chandler Carruth authored Jan 11, 2012

detect a pattern which can be implemented with a small 'shl' embedded in
the addressing mode scale. This happens in real code as follows:

  unsigned x = my_accelerator_table[input >> 11];

Here we have some lookup table that we look into using the high bits of
'input'. Each entity in the table is 4-bytes, which means this
implicitly gets turned into (once lowered out of a GEP):

  *(unsigned*)((char*)my_accelerator_table + ((input >> 11) << 2));

The shift right followed by a shift left is canonicalized to a smaller
shift right and masking off the low bits. That hides the shift right
which x86 has an addressing mode designed to support. We now detect
masks of this form, and produce the longer shift right followed by the
proper addressing mode. In addition to saving a (rather large)
instruction, this also reduces stalls in Intel chips on benchmarks I've
measured.

In order for all of this to work, one part of the DAG needs to be
canonicalized *still further* than it currently is. This involves
removing pointless 'trunc' nodes between a zextload and a zext. Without
that, we end up generating spurious masks and hiding the pattern.

llvm-svn: 147936

55b2cdee

Improved compile time: · 82165698

Stepan Dyatkovskiy authored Jan 11, 2012

1. Size heuristics changed. Now we calculate number of unswitching
branches only once per loop.
2. Some checks was moved from UnswitchIfProfitable to
processCurrentLoop, since it is not changed during processCurrentLoop
iteration. It allows decide to skip some loops at an early stage.
Extended statistics:
- Added total number of instructions analyzed.

llvm-svn: 147935

82165698

llvm/test/CodeGen/X86/zext-fold.ll: Relax an expression in stack offset. · 0e60839e
NAKAMURA Takumi authored Jan 11, 2012
```
llvm-svn: 147928
```
0e60839e
llvm/test/CodeGen/X86/sub-with-overflow.ll: Add explicit -mtriple=i686-linux. · 9e823f6a
NAKAMURA Takumi authored Jan 11, 2012
```
llvm-svn: 147927
```
9e823f6a
Clarified the SCEV getSmallConstantTripCount interface with in-your-face comments. · e81211f4
Andrew Trick authored Jan 11, 2012
```
This interface is misleading and dangerous, but it is actually what we need for unrolling.

llvm-svn: 147926
```
e81211f4
Add big endian mips support. Based on a patch by Jack Carter. · 647841b1
Rafael Espindola authored Jan 11, 2012
```
llvm-svn: 147924
```
647841b1
Add the skeleton of an asm parser for mips. · 870c4e92
Rafael Espindola authored Jan 11, 2012
```
llvm-svn: 147923
```
870c4e92

ARM Ld/St Optimizer fix. · 642f0f6a

Andrew Trick authored Jan 11, 2012

Allow LDRD to be formed from pairs with different LDR encodings. This was the original intention of the pass. Somewhere along the way, the LDR opcodes were refined which broke the optimization. We really don't care what the original opcodes are as long as they both map to the same LDRD and the immediate still fits.

Fixes rdar://10435045 ARMLoadStoreOptimization cannot handle mixed LDRi8/LDRi12

llvm-svn: 147922

642f0f6a

Disable test that seems to expose an unrelated Linux issue. · 05ff7f06
Jakob Stoklund Olesen authored Jan 11, 2012
```
llvm-svn: 147921
```
05ff7f06

Detect when a value is undefined on an edge to a landing pad. · 8b1d023a

Jakob Stoklund Olesen authored Jan 11, 2012

Consider this code:

int h() {
  int x;
  try {
    x = f();
    g();
  } catch (...) {
    return x+1;
  }
  return x;
}

The variable x is undefined on the first edge to the landing pad, but it
has the f() return value on the second edge to the landing pad.

SplitAnalysis::getLastSplitPoint() would assume that the return value
from f() was live into the landing pad when f() throws, which is of
course impossible.

Detect these cases, and treat them as if the landing pad wasn't there.
This allows spill code to be inserted after the function call to f().

<rdar://problem/10664933>

llvm-svn: 147912

8b1d023a

Exclusively use SplitAnalysis::getLastSplitPoint(). · 67aec124

Jakob Stoklund Olesen authored Jan 11, 2012

Delete the alternative implementation in LiveIntervalAnalysis.

These functions computed the same thing, but SplitAnalysis caches the
result.

llvm-svn: 147911

67aec124

Avoid CSE of instructions which define physical registers across MBBs unless · d9725a38
Evan Cheng authored Jan 11, 2012
```
the physical registers are not allocatable.

llvm-svn: 147902
```
d9725a38

If the global variable is removed by the linker, then don't constant merge it · c7915519

Bill Wendling authored Jan 11, 2012

with other symbols.

An object in the __cfstring section is suppoed to be filled with CFString
objects, which have a pointer to ___CFConstantStringClassReference followed by a
pointer to a __cstring. If we allow the object in the __cstring section to be
merged with another global, then it could end up in any section. Because the
linker is going to remove these symbols in the final executable, we shouldn't
bother to merge them.
<rdar://problem/10564621>

llvm-svn: 147899

c7915519

Don't avoid recursing for pointer types, just reference types. Expand on · 43a11829
Eric Christopher authored Jan 11, 2012
```
the comment.

Fixes constvars.exp on the gdb test builder.

llvm-svn: 147897
```
43a11829
Add test case for r147881. · a415140a
Chad Rosier authored Jan 10, 2012
```
llvm-svn: 147891
```
a415140a

Jan 10, 2012

Fixed order of operands in comment to match code. · 995c6332
Lang Hames authored Jan 10, 2012
```
llvm-svn: 147890
```
995c6332

Default stack alignment for 32bit x86 should be 4 Bytes, not 8 Bytes. · 96cd35cf

Joerg Sonnenberger authored Jan 10, 2012

Add a test that checks the stack alignment of a simple function for
Darwin, Linux and NetBSD for 32bit and 64bit mode.

llvm-svn: 147888

96cd35cf

Consider unknown alignment caused by OptimizeThumb2Instructions(). · 20f1dd5f

Jakob Stoklund Olesen authored Jan 10, 2012

This function runs after all constant islands have been placed, and may
shrink some instructions to their 2-byte forms.  This can actually cause
some constant pool entries to move out of range because of growing
alignment padding.

Treat instructions that may be shrunk the same as inline asm - they
erode the known alignment bits.

Also reinstate an old assertion in verify(). It is correct now that
basic block offsets include alignments.

Add a single large test case that will hopefully exercise many parts of
the constant island pass.

<rdar://problem/10670199>

llvm-svn: 147885

20f1dd5f

80 col violation. · da46832e
Evan Cheng authored Jan 10, 2012
```
llvm-svn: 147884
```
da46832e
Add missing VEX predicates to VMOVSDto64rr/VMOVSDto64mr. This fixes a few · 1a8f0ccd
Chad Rosier authored Jan 10, 2012
```
failing test cases on our internal AVX nightly tester.
rdar://10663637

llvm-svn: 147881
```
1a8f0ccd
Let asm parser query asm syntax dialect. · 227b6279
Devang Patel authored Jan 10, 2012
```
llvm-svn: 147880
```
227b6279

This is the matching change for the data structure name changes for the · f7d77069

Kevin Enderby authored Jan 10, 2012

functional change in r147860 to use DW_TAG_label's instead TAG_subprogram's.
This only changes names and updates comments.  No functional change.

llvm-svn: 147877

f7d77069

ARM updating VST2 pseudo-lowering fixed vs. register update. · 74ac7d50
Jim Grosbach authored Jan 10, 2012
```
rdar://10663487

llvm-svn: 147876
```
74ac7d50
Fix some leftover control reaches end of non-void function warnings. · 233149cf
Benjamin Kramer authored Jan 10, 2012
```
llvm-svn: 147874
```
233149cf
Teach the triple library about the androideabi environment. · 9a7510af
Chandler Carruth authored Jan 10, 2012
```
Patch by Evgeniy Stepanov.

llvm-svn: 147871
```
9a7510af
Move default case for covered enum outside of switch. · ad5b42c0
Richard Smith authored Jan 10, 2012
```
llvm-svn: 147870
```
ad5b42c0

For i386, don't use the generic code. · d5ab0260

Bill Wendling authored Jan 10, 2012

As the comment around 7746 says, it's better to use the x87 extended precision
here than SSE. And the generic code doesn't know how to do that. It also regains
the speed lost for the uint64_to_float.c testcase.
<rdar://problem/10669858>

llvm-svn: 147869

d5ab0260

Fix a -Wreturn-type warning in g++. · 3f103541
Richard Smith authored Jan 10, 2012
```
llvm-svn: 147867
```
3f103541
Cleanup these asserts to follow common LLVM style and coding · 4c0ee749
Chandler Carruth authored Jan 10, 2012
```
conventions. Also, clarify the grouping of one of the asserts to silence
-Wparentheses.

llvm-svn: 147863
```
4c0ee749

Add 'llvm_unreachable' to passify GCC's understanding of the constraints · f3e8502c

Chandler Carruth authored Jan 10, 2012

of several newly un-defaulted switches. This also helps optimizers
(including LLVM's) recognize that every case is covered, and we should
assume as much.

llvm-svn: 147861

f3e8502c

Various crash reporting tools have a problem with the dwarf generated for · 8d4a2204

Kevin Enderby authored Jan 10, 2012

assembly source when it generates the TAG_subprogram dwarf debug info for
the labels that have nothing between them as in this bit of assembly source:

% cat ZeroLength.s 
_func1:
_func2:
 nop

One solution would be to not emit the subsequent labels with the same address
and use the next label with a different address or the end of the section for
the AT_high_pc value of the TAG_subprogram.

Turns out in llvm-mc it is not possible in all cases to determine of two
symbols have the same value at the point we put out the TAG_subprogram dwarf
debug info.

So we will have llvm-mc instead of putting out TAG_subprogram's put out
DW_TAG_label's.  And the DW_TAG_label does not have a AT_high_pc value which
avoids the problem.

This commit is only the functional change to make the diffs clear as to what is
really being changed.  The next commit will be to clean up the names of such
things like MCGenDwarfSubprogramEntry to something like MCGenDwarfLabelEntry.

rdar://10666925

llvm-svn: 147860

8d4a2204

Add definition for intel asm variant. · 67bf992a

Devang Patel authored Jan 10, 2012

Right now, this just adds additional entries in match table. The parser does not use them yet.

llvm-svn: 147859

67bf992a

Record asm variant id in MatchEntry and check it while matching instruction. · 9bdc505c
Devang Patel authored Jan 10, 2012
```
llvm-svn: 147858
```
9bdc505c