Commits · 8543ba3e527241a4c51fbce999430a0e71d94a5d · Roger Ferrer / llvm-epi-0.8

Apr 12, 2013

SLPVectorizer: add support for vectorization of diamond shaped trees. We now... · 8543ba3e

Nadav Rotem authored Apr 12, 2013

SLPVectorizer: add support for vectorization of diamond shaped trees. We now perform a preliminary traversal of the graph to collect values with multiple users and check where the users came from. 

llvm-svn: 179414

8543ba3e

CostModel: increase the default cost of supported floating point operations... · 87a0af6e

Nadav Rotem authored Apr 12, 2013

CostModel: increase the default cost of supported floating point operations from 1 to 2. Fixed a few tests that changed because the cost of one insert plus a vector operation on two doubles is now lower than the cost of two scalar operations on doubles.

llvm-svn: 179413

87a0af6e

Add debug prints. · 4da0ab1d
Nadav Rotem authored Apr 12, 2013
```
llvm-svn: 179412
```
4da0ab1d
Add support for additional vector instructions in the interpreter. · e4b8aa00
Nadav Rotem authored Apr 12, 2013
```
patch by Veselov, Yuri <Yuri.Veselov@intel.com>.

llvm-svn: 179409
```
e4b8aa00
[ms-inline asm] Move this logic into a static function as it's only applicable · d383db51
Chad Rosier authored Apr 12, 2013
```
when parsing MS-style inline assembly.  No functional change intended.

llvm-svn: 179407
```
d383db51

[ms-inline asm] Address the FIXME for ImmDisp before brackets. This · e9902d83

Chad Rosier authored Apr 12, 2013

is a follow on to r179393 and r179399.  Test case to be added on
the clang side.
Part of rdar://13453209

llvm-svn: 179403

e9902d83

[ms-inline asm] Have the [ Symbol ] case fall into the more general logic. This · 152749ce
Chad Rosier authored Apr 12, 2013
```
is a follow on to r179393.  Test case to be added on the clang side.
Part of rdar://13453209

llvm-svn: 179399
```
152749ce

ARM: Correct printing of pre-indexed operands. · c313220b

Quentin Colombet authored Apr 12, 2013

According to the ARM reference manual, constant offsets are mandatory for pre-indexed addressing modes.
The MC disassembler was not obeying this when the offset is 0.
It was producing instructions like: str r0, [r1]!.
Correct syntax is: str r0, [r1, #0]!.

This change modifies the dumping of operands so that the offset is always printed, regardless of its value, when pre-indexed addressing mode is used.
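
As a rough illustration of the printing rule described above, the following Python sketch contrasts the old and new behaviour. It is not the MC disassembler code; both function names are hypothetical.

```python
def format_pre_indexed_old(base: str, offset: int) -> str:
    """Behaviour described as incorrect: a zero offset is dropped."""
    if offset == 0:
        return f"[{base}]!"
    return f"[{base}, #{offset}]!"


def format_pre_indexed(base: str, offset: int) -> str:
    """Behaviour after the change: the offset is always printed."""
    return f"[{base}, #{offset}]!"


# Hypothetical usage with the operands from the commit message.
old_text = "str r0, " + format_pre_indexed_old("r1", 0)
new_text = "str r0, " + format_pre_indexed("r1", 0)
```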

Patch by Mihail Popa <Mihail.Popa@arm.com>

llvm-svn: 179398

c313220b

[ms-inline asm] Add support for operands that include both a symbol and an · 175d0aee

Chad Rosier authored Apr 12, 2013

immediate displacement.  Specifically, add support for generating the proper IR.
We've been able to parse this for some time now.  Test case to be added on the
clang side.
Part of rdar://13453209

llvm-svn: 179393

175d0aee

PPC: Remove (broken) nested implicit definition lists · 1b58f335

Hal Finkel authored Apr 12, 2013

TableGen will not combine nested list 'let' bindings into a single list, and
instead uses only the inner scope. As a result, several instruction definitions
were missing implicit register defs that were in outer scopes. This de-nests
these scopes and makes all instructions have only one let binding which sets
implicit register definitions.

llvm-svn: 179392

1b58f335

Add a comment about the PPC Interpretation64Bit bit · 2277196f
Hal Finkel authored Apr 12, 2013
```
llvm-svn: 179391
```
2277196f
Hexagon: Set isPredicatedNew flag on predicate new instructions. · ce1be113
Jyotsna Verma authored Apr 12, 2013
```
llvm-svn: 179388
```
ce1be113
Hexagon: Set isPredicatedFalse flag for all the instructions with negated predication. · bea8327f
Jyotsna Verma authored Apr 12, 2013
```
llvm-svn: 179387
```
bea8327f

Simplify (A & ~B) in icmp if A is a power of 2 · 1a08accb

David Majnemer authored Apr 12, 2013

The transform will execute like so:
(A & ~B) == 0 --> (A & B) != 0
(A & ~B) != 0 --> (A & B) == 0
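
The equivalence can be checked exhaustively for a small bit width. The Python sketch below is not LLVM code; the 8-bit width and the helper name `transform_holds` are assumptions chosen for illustration.

```python
WIDTH = 8  # assumed bit width for the check; not taken from the commit
MASK = (1 << WIDTH) - 1


def transform_holds(a: int, b: int) -> bool:
    """Check both rewrites for one (a, b) pair."""
    not_b = ~b & MASK
    eq_case = ((a & not_b) == 0) == ((a & b) != 0)
    ne_case = ((a & not_b) != 0) == ((a & b) == 0)
    return eq_case and ne_case


# With A restricted to powers of 2, the rewrite holds for every B.
powers_of_two = [1 << i for i in range(WIDTH)]
all_hold = all(
    transform_holds(a, b) for a in powers_of_two for b in range(MASK + 1)
)

# With A = 3 (not a power of 2) and B = 1, both (A & ~B) and (A & B)
# are nonzero, so the rewrite would be wrong.
counterexample_fails = not transform_holds(3, 1)
```

The counterexample shows why the power-of-2 precondition matters: with a single bit set in A, exactly one of `A & B` and `A & ~B` is nonzero.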

llvm-svn: 179386

1a08accb

[ms-inline asm] Add the implementation for the AOK_Delete kind, which was added · ff10ed17
Chad Rosier authored Apr 12, 2013
```
in r179325.  Test case coming shortly on the clang side.
Part of rdar://13453209

llvm-svn: 179383
```
ff10ed17

LoopVectorizer: integer division is not a reduction operation · f9cea17f

Arnold Schwaighofer authored Apr 12, 2013

Don't classify idiv/udiv as a reduction operation. Integer division is lossy.
For example: (1 / 2) * 4 != 4 / 2.

Example:

int a[] = { 2, 5, 2, 2 };
int x = 80;

for (int i = 0; i < 4; ++i)
  x /= a[i];

Scalar:
  x /= 2 // = 40
  x /= 5 // = 8
  x /= 2 // = 4
  x /= 2 // = 2

Vectorized:

 <80, 1> / <2,5> //= <40,0>
 <40, 0> / <2,2> //= <20,0>

 20*0 = 0
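
The mismatch above can be reproduced with a small simulation. This Python sketch only models the arithmetic; the function names, the two-lane width, and the final multiply across lanes are assumptions that mirror the example, not the vectorizer's implementation. Python's `//` matches C integer division here because all values are positive.

```python
def scalar_reduce(x, values):
    """Divide x by each value in order, as the scalar loop does."""
    for v in values:
        x //= v
    return x


def vectorized_reduce(x, values, lanes=2):
    """Model a division 'reduction' split across lanes.

    Lane 0 starts with x and the other lanes with 1. Assumes that
    len(values) is a multiple of lanes.
    """
    acc = [x] + [1] * (lanes - 1)
    for i in range(0, len(values), lanes):
        chunk = values[i:i + lanes]
        acc = [a // v for a, v in zip(acc, chunk)]
    # Combine the lanes by multiplication, as in the example above.
    result = 1
    for a in acc:
        result *= a
    return result


a = [2, 5, 2, 2]
scalar_result = scalar_reduce(80, a)
vector_result = vectorized_reduce(80, a)
```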

radar://13640654

llvm-svn: 179381

f9cea17f

Revert broken pieces of r179373. · dae08512

Benjamin Kramer authored Apr 12, 2013

You can't copy an OwningPtr, and move semantics aren't available in C++98.

llvm-svn: 179374

dae08512

Replace uses of the deprecated std::auto_ptr with OwningPtr. · 95777550
Andy Gibbs authored Apr 12, 2013
```
llvm-svn: 179373
```
95777550

Fix a disconcerting bug in Value::isUsedInBasicBlock, which gave wrong answers... · eee73f5f

Benjamin Kramer authored Apr 12, 2013

Fix a disconcerting bug in Value::isUsedInBasicBlock, which gave wrong answers for blocks larger than 3 instrs.

Also add a unit test. PR15727.

llvm-svn: 179370

eee73f5f

Add PPC instruction record forms and associated query functions · 654d43b4

Hal Finkel authored Apr 12, 2013

This is prep. work for the implementation of optimizeCompare. Many PPC
instructions have 'record' forms (in almost all cases, this means that the RC
bit is set) that cause the result of the instruction to be compared with zero,
and the result of that comparison saved in a predefined condition register. In
order to add the record forms of the instructions without too much
copy-and-paste, the relevant functions have been refactored into multiclasses
which define both the record and normal forms.

Also, two TableGen-generated mapping functions have been added which allow
querying the instruction code for the record form given the normal form (and
vice versa).

No functionality change intended.

llvm-svn: 179356

654d43b4

Don't disable block layout when forcing block alignment. · c0adc9fd
Nadav Rotem authored Apr 12, 2013
```
llvm-svn: 179355
```
c0adc9fd

Add a flag to align all basic blocks in the function. · c3b0f50a

Nadav Rotem authored Apr 12, 2013

When debugging performance regressions we often ask ourselves if the regression
that we see is due to poor isel/sched/ra or due to some micro-architectural
problem. When comparing two code sequences one good way to rule out front-end
bottlenecks (and other issues) is to force code alignment. This pass adds
a flag that forces the alignment of all of the basic blocks in the program.

llvm-svn: 179353

c3b0f50a

Add 179294 back, but don't use bit fields so that it works on big endian hosts. · ecf13205

Rafael Espindola authored Apr 12, 2013

Original message:

Print more information about relocations.

With this patch llvm-readobj now prints if a relocation is pcrel, its length,
if it is extern and if it is scattered.

It also refactors the code a bit to use bit fields instead of shifts and
masks all over the place.
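
The re-landed version avoids bit fields, whose layout depends on the host. Decoding with explicit shifts and masks might look like the Python sketch below. The field layout is an assumption based on the little-endian Mach-O `relocation_info` word (24-bit symbol number, then pcrel, length, extern, and type); the function name is hypothetical and this is not the llvm-readobj code.

```python
import struct


def decode_relocation_word(raw: bytes) -> dict:
    """Decode the second 32-bit word of a relocation entry.

    Reading the word with an explicit byte order ("<I") and then
    shifting and masking gives the same result on any host.
    """
    (word,) = struct.unpack("<I", raw)
    return {
        "symbolnum": word & 0xFFFFFF,   # bits 0-23 (assumed layout)
        "pcrel": (word >> 24) & 0x1,    # bit 24
        "length": (word >> 25) & 0x3,   # bits 25-26
        "extern": (word >> 27) & 0x1,   # bit 27
        "type": (word >> 28) & 0xF,     # bits 28-31
    }


# Hypothetical relocation word built from known field values.
example_word = 5 | (1 << 24) | (2 << 25) | (1 << 27) | (3 << 28)
decoded = decode_relocation_word(struct.pack("<I", example_word))
```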

llvm-svn: 179345

ecf13205

[ms-inline asm] Add support for using the LENGTH, TYPE, and SIZE operators with · b67f8057

Chad Rosier authored Apr 11, 2013

variables that use namespace alias qualifiers.  Test case coming on clang side
shortly.
Part of rdar://13499009

llvm-svn: 179343

b67f8057

[ms-inline asm] Add support for using offsetof operator with variables that use · ae7ecd6d
Chad Rosier authored Apr 11, 2013
```
namespace alias qualifiers.  Test case coming on clang side shortly.
Part of rdar://13499009

llvm-svn: 179339
```
ae7ecd6d

Aliasing rules for struct-path aware TBAA. · 06a9d50a

Manman Ren authored Apr 11, 2013

Added PathAliases to check if two struct-path tags can alias.
Added command line option -struct-path-tbaa.

llvm-svn: 179337

06a9d50a

[ms-inline asm] Pass a StringRef reference to ParseIntelVarWithQualifier so we · ce03189b

Chad Rosier authored Apr 11, 2013

can build up the identifier string.  No test case as support for looking up
these types of identifiers hasn't been implemented on the clang side.
Part of rdar://13499009

llvm-svn: 179336

ce03189b

Apr 11, 2013

[ms-inline asm] Remove brackets from around a symbol reference in the target · 8fb83300

Chad Rosier authored Apr 11, 2013

specific logic.  This makes the code much less fragile.  Test case coming on the
clang side in a moment.
rdar://13634327

llvm-svn: 179323

8fb83300

Fix undefined behavior in AArch64 · 93849399

David Majnemer authored Apr 11, 2013

A64Imms::isLogicalImmBits and A64Imms::isLogicalImm will attempt to
execute shifts that perform undefined behavior. Instead of attempting
to perform the 64-bit rotation, treat it as a no-op.
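
The general pattern can be sketched as follows. In C and C++, shifting a 64-bit value by 64 is undefined, so a rotation written as two shifts needs a guard for the full-width case. The Python sketch below only illustrates that guard; the function name is hypothetical, Python itself has no such undefined behavior, and this is not the A64Imms code.

```python
def rotate_right(value: int, amount: int, width: int = 64) -> int:
    """Rotate value right by amount bits within a width-bit word."""
    mask = (1 << width) - 1
    value &= mask
    amount %= width
    if amount == 0:
        # A rotation by 0 or by the full width is the identity. Returning
        # early avoids the shift by `width` that would be undefined in C++.
        return value
    return ((value >> amount) | (value << (width - amount))) & mask


full_rotation = rotate_right(0x0123456789ABCDEF, 64)
by_four = rotate_right(0x0123456789ABCDEF, 4)
```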

llvm-svn: 179317

93849399

Optimize icmp involving addition better · b81cd63c

David Majnemer authored Apr 11, 2013

Allows LLVM to optimize sequences like the following:

%add = add nsw i32 %x, 1
%cmp = icmp sgt i32 %add, %y

into:

%cmp = icmp sge i32 %x, %y

as well as:

%add1 = add nsw i32 %x, 20
%add2 = add nsw i32 %y, 57
%cmp = icmp sge i32 %add1, %add2

into:

%add = add nsw i32 %y, 37
%cmp = icmp sle i32 %add, %x
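
Both rewrites rely on the `nsw` flag, that is, on the additions not overflowing. A brute-force check in Python, where integers do not wrap, is sketched below; the helper names, the tested range, and the 8-bit wrapping model are assumptions for illustration.

```python
def first_rewrite_holds(x: int, y: int) -> bool:
    # (x + 1) sgt y  <=>  x sge y
    return ((x + 1) > y) == (x >= y)


def second_rewrite_holds(x: int, y: int) -> bool:
    # (x + 20) sge (y + 57)  <=>  (y + 37) sle x
    return ((x + 20) >= (y + 57)) == ((y + 37) <= x)


values = range(-64, 65)
both_hold = all(
    first_rewrite_holds(x, y) and second_rewrite_holds(x, y)
    for x in values
    for y in values
)


def wrap8(v: int) -> int:
    """Model signed 8-bit wraparound, i.e. an add without nsw."""
    return (v + 128) % 256 - 128


# If the add may wrap, the first rewrite breaks: with x = y = 127,
# x + 1 wraps to -128, which is not greater than y, yet x >= y holds.
wrapping_breaks = (wrap8(127 + 1) > 127) != (127 >= 127)
```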

llvm-svn: 179316

b81cd63c

[mips] Custom-lower i64 MULHS and MULHU nodes. Remove the code which selects · 4f1130eb
Akira Hatanaka authored Apr 11, 2013
```
multiply instructions in MipsSEDAGToDAGISel.

This patch was supposed to be part of r178403.

llvm-svn: 179314
```
4f1130eb
[mips] Clean up MipsISelDAGToDAG.cpp and MipsISelLowering.cpp. · 52f79fcd
Akira Hatanaka authored Apr 11, 2013
```
- Rename function.
- Pass iterator by value.
- Remove header include.

No functionality changes.

llvm-svn: 179312
```
52f79fcd
Revert my last two commits while I debug what is wrong in a big endian host. · e2742a03
Rafael Espindola authored Apr 11, 2013
```
llvm-svn: 179303
```
e2742a03

Print more information about relocations. · 708a44d4

Rafael Espindola authored Apr 11, 2013

With this patch llvm-readobj now prints if a relocation is pcrel, its length,
if it is extern and if it is scattered.

It also refactors the code a bit to use bit fields instead of shifts and
masks all over the place.

llvm-svn: 179294

708a44d4

Fix for wrong instcombine on vector insert/extract · a95f8749

Benjamin Kramer authored Apr 11, 2013

When trying to collapse sequences of insertelement/extractelement
instructions into single shuffle instructions, there is one specific
case where the Instruction Combiner wrongly updates the resulting
Mask of shuffle indexes.

The problem is in function CollectShuffleElements.

If we have a sequence of insert/extract element instructions
like the one below:

  %tmp1 = extractelement <4 x float> %LHS, i32 0
  %tmp2 = insertelement <4 x float> %RHS, float %tmp1, i32 1
  %tmp3 = extractelement <4 x float> %RHS, i32 2
  %tmp4 = insertelement <4 x float> %tmp2, float %tmp3, i32 3

Where:
  . %RHS will have a mask of [4,5,6,7]
  . %LHS will have a mask of [0,1,2,3]

The Mask of shuffle indexes is wrongly computed to [4,1,6,7]
instead of [4,0,6,7].
When analyzing %tmp2 in order to compute the Mask for the
resulting shuffle instruction, the algorithm forgets to update
the mask index at position 1 with the index associated to the
element extracted from %LHS by instruction %tmp1.
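
The two masks can be compared with a small model of the shuffle. The Python sketch below is only an illustration; `shufflevector` here is a hypothetical helper that indexes into the concatenation of both operands, following the mask numbering given above (%LHS is 0 to 3, %RHS is 4 to 7).

```python
def shufflevector(lhs, rhs, mask):
    """Select elements from lhs + rhs according to mask."""
    combined = list(lhs) + list(rhs)
    return [combined[i] for i in mask]


lhs = ["l0", "l1", "l2", "l3"]
rhs = ["r0", "r1", "r2", "r3"]

# %tmp1 = extractelement %LHS, 0
tmp1 = lhs[0]
# %tmp2 = insertelement %RHS, %tmp1, 1
tmp2 = list(rhs)
tmp2[1] = tmp1

correct = shufflevector(lhs, rhs, [4, 0, 6, 7])
wrong = shufflevector(lhs, rhs, [4, 1, 6, 7])
```

With the wrong mask, position 1 picks up element 1 of %LHS instead of element 0, so the shuffle no longer matches `%tmp2`.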

Patch by Andrea DiBiagio!

llvm-svn: 179291

a95f8749

Add a function to check if an argument list is too long. · cd848c08

Rafael Espindola authored Apr 11, 2013

This will be used in clang to decide if it should create an @file or not. It
will be tested on the clang side.

Patch by Nathan Froyd.

llvm-svn: 179285

cd848c08

[ASan] Allow disabling init-order checks for globals by source file name. · a28f36c2
Alexey Samsonov authored Apr 11, 2013
```
llvm-svn: 179280
```
a28f36c2
Add braces around || in && to pacify GCC. · e7c45bc6
Benjamin Kramer authored Apr 11, 2013
```
llvm-svn: 179275
```
e7c45bc6
Rename the C function to create a SLPVectorizerPass to something sane and... · c86fdf12
Benjamin Kramer authored Apr 11, 2013
```
Rename the C function to create a SLPVectorizerPass to something sane and expose it in the header file.

llvm-svn: 179272
```
c86fdf12

Optimize vector select from all 0s or all 1s · 55658d42

Michael Liao authored Apr 11, 2013

As packed comparisons in AVX/SSE produce all 0s or all 1s in each SIMD lane,
vector select could be simplified to AND/OR or removed if one or both values
being selected is all 0s or all 1s.
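
The identities behind this simplification can be checked per lane. In the Python sketch below, a lane is modelled as an 8-bit value and the comparison mask is either all 0s or all 1s; the lane width and the `select` helper are assumptions for illustration, not the backend code.

```python
LANE = 0xFF  # assumed 8-bit lane; all 1s


def select(mask: int, a: int, b: int) -> int:
    """Bitwise select: take a where mask bits are 1, b where they are 0."""
    return (mask & a) | (~mask & LANE & b)


checks = []
for mask in (0, LANE):  # a packed comparison yields all 0s or all 1s
    for v in range(LANE + 1):
        # Selecting between v and all 0s is an AND with the mask.
        checks.append(select(mask, v, 0) == (mask & v))
        # Selecting between all 1s and v is an OR with the mask.
        checks.append(select(mask, LANE, v) == (mask | v))
    # Selecting between all 1s and all 0s is the mask itself.
    checks.append(select(mask, LANE, 0) == mask)

identities_hold = all(checks)
```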

llvm-svn: 179267

55658d42