Commits · 01cfa94212a1e352145319554047aea3a637d709 · Lorenzo Albano / LLVM bpEVL

Dec 05, 2013

Apply transformation on OS X 10.9+ and iOS 7.0+: pow(10, x) -> __exp10(x) · 01cfa942
Yi Jiang authored Dec 05, 2013
```
llvm-svn: 196544
```
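
The rewrite rests on the fact that `pow(10, x)` is just a base-10 exponential, which a dedicated routine can compute directly. As a rough illustration only, the sketch below compares `std::pow(10.0, x)` with `exp10_sketch`, a hypothetical portable stand-in built on `std::exp2`. It does not call Apple's `__exp10`, and the tolerance is an assumption chosen for the demonstration.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical stand-in for a base-10 exponential (NOT Apple's __exp10):
// 10^x == 2^(x * log2(10)).
double exp10_sketch(double x) {
    return std::exp2(x * std::log2(10.0));
}

// Returns true when pow(10, x) and the stand-in agree to within a
// relative tolerance (the default tolerance is an assumption).
bool agrees_with_pow(double x, double rel_tol = 1e-12) {
    const double expected = std::pow(10.0, x);
    const double actual = exp10_sketch(x);
    return std::fabs(expected - actual) <= rel_tol * std::fabs(expected);
}
```

Calling `agrees_with_pow(3.0)` or `agrees_with_pow(-2.5)` checks that the two formulations match for those inputs.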

Added two new test case decorator functions: @skipIfRemote and @skipIfRemoteDueToDeadlock. · 12514560

Greg Clayton authored Dec 05, 2013

@skipIfRemote is used to decorate test cases that don't make sense to run remotely.

@skipIfRemoteDueToDeadlock marks tests that currently deadlock the test suite when run on remote systems and still need to be investigated. These issues should be fixed soon.

llvm-svn: 196543


llvm-cov: Further improved error messages. · 4c9f19d6
Yuchen Wu authored Dec 05, 2013
```
llvm-svn: 196542
```
llvm-cov: Conformed headers. · c3e64247
Yuchen Wu authored Dec 05, 2013
```
llvm-svn: 196541
```
80-column wrap a comment in a test · 6ceed44d
Nico Weber authored Dec 05, 2013
```
llvm-svn: 196540
```

Move test to X86 dir · e593fea5

Renato Golin authored Dec 05, 2013

Test is platform independent, but I don't want to force vector-width, or
that could spoil the pragma test.

llvm-svn: 196539


For NetBSD, use arm1176jzf-s as default CPU for ARMv6. · 706c2816
Joerg Sonnenberger authored Dec 05, 2013
```
llvm-svn: 196538
```

Add #pragma vectorize enable/disable to LLVM · 729a3ae9

Renato Golin authored Dec 05, 2013

The intended behaviour is to force vectorization on the presence
of the flag (either turn on or off), and to continue the behaviour
as expected in its absence. Tests were added to make sure that all
cases are covered in opt. No tests were added in other tools with
the assumption that they should use the PassManagerBuilder in the
same way.

This patch also removes the outdated -late-vectorize flag, which was
on by default and not helping much.

The pragma metadata is being attached to the same place as other loop
metadata, but nothing forbids one from attaching it to a function
(to enable #pragma optimize) or basic blocks (to hint the basic-block
vectorizers), etc. The logic should be the same all around.

Patches to Clang to produce the metadata will be produced after the
initial implementation is agreed upon and committed. Patches to other
vectorizers (such as SLP and BB) will be added once we're happy with
the pass manager changes.

llvm-svn: 196537


Check hint registers for interference only once before evictions · 73f3d33d
Aditya Nandakumar authored Dec 05, 2013
```
llvm-svn: 196536
```
Implemented vget/vset_lane_f16 intrinsics · 6a8b8b5f
Ana Pazos authored Dec 05, 2013
```
llvm-svn: 196535
```
Mark auto-generated file as auto-generated, as suggested by Alp. · e7f52499
Richard Smith authored Dec 05, 2013
```
llvm-svn: 196534
```
Implemented vget/vset_lane_f16 intrinsics · 6b0a8c50
Ana Pazos authored Dec 05, 2013
```
llvm-svn: 196533
```
Pass down the target CPU to the system assembler for NetBSD/ARM. · 1f94da5a
Joerg Sonnenberger authored Dec 05, 2013
```
llvm-svn: 196532
```

ObjectiveC: Don't warn when method implemented in · f3077a29

Fariborz Jahanian authored Dec 05, 2013

category is declared in category's primary
class's super class. Because the super class is
expected to implement the method. // rdar://15580969

llvm-svn: 196531


llvm-cov: Changed extension from .llcov to .gcov. · 9af3938b
Yuchen Wu authored Dec 05, 2013
```
llvm-svn: 196530
```
[c-index-test] For the '-remap-file' option use a comma for separator which is more · 5899e891
Argyrios Kyrtzidis authored Dec 05, 2013
```
Windows friendly than the colon.

llvm-svn: 196529
```
Revert part of GCC warning fix to fix debug build. · 79d55f5c
Matt Arsenault authored Dec 05, 2013
```
The typedef is used inside the DEBUG(), and apparently can't be moved
inside of it.

llvm-svn: 196528
```

Fix assertion failure left over from changes to move away from "darwin" triples. · 83e723a9

Bob Wilson authored Dec 05, 2013

I happened to notice this while trying to write a test for an iOS simulator
target. I suspect we just missed this when we added separate "macosx" and "ios"
triples instead of the generic "darwin" OS.

llvm-svn: 196527


Fix minor GCC warnings. · c44a3ff6
Matt Arsenault authored Dec 05, 2013
```
Unused typedefs and unused variables.

llvm-svn: 196526
```
Extends StringExtractorGDBRemote to support debugger packets. · 0fefa676
Jean-Daniel Dupas authored Dec 05, 2013
```
CC: lldb-commits

Differential Revision: http://llvm-reviews.chandlerc.com/D2337

llvm-svn: 196525
```

Change std::deque => std::vector. No functionality change. · 2bf0173b

Michael Gottesman authored Dec 05, 2013

There is no reason to use std::deque here over std::vector. Given the
performance differences between the two, it makes sense to change deque to
vector.

llvm-svn: 196524

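
The commit message does not show the affected code, so the following is only a sketch under the assumption that the container is accessed solely at its back, as a LIFO worklist would be. In that situation `std::vector` offers every operation needed while keeping its elements in contiguous storage. The function name and the re-queue rule are hypothetical.

```cpp
#include <cassert>
#include <vector>

// Drains a LIFO worklist. Only back(), pop_back() and push_back() are
// used, so std::vector is sufficient and std::deque adds nothing.
int sum_worklist(std::vector<int> worklist) {
    int total = 0;
    while (!worklist.empty()) {
        const int item = worklist.back();
        worklist.pop_back();
        total += item;
        // Illustrative rule: re-queue half of every item greater than 1,
        // showing that the list may grow while it is being drained.
        if (item > 1)
            worklist.push_back(item / 2);
    }
    return total;
}
```

For example, `sum_worklist({4})` visits 4, 2 and 1 in turn.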

Document that dllexported symbols are preserved by optimization passes. · f5b769e4
Yunzhong Gao authored Dec 05, 2013
```
llvm-svn: 196523
```

Fix non-deterministic behavior. · cdbde3aa

Rafael Espindola authored Dec 05, 2013

We use CSEBlocks to initialize a worklist:

SmallVector<BasicBlock *, 8> CSEWorkList(CSEBlocks.begin(), CSEBlocks.end());

so it must have a deterministic order.

llvm-svn: 196520

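
A worklist copied from a set inherits that set's iteration order, so a set ordered by pointer value or hash layout can yield a different order from run to run. One common remedy is an insertion-ordered set; LLVM's `SetVector` is a container of that kind, although the commit text does not say which container the fix actually used. The `OrderedSet` class and `make_worklist` helper below are hypothetical stand-ins, with `int` standing in for `BasicBlock *`.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_set>
#include <vector>

// Minimal insertion-ordered set: the vector records the order, the hash
// set rejects duplicates. Iteration order depends only on the order of
// insertion, never on pointer values or hash layout.
template <typename T>
class OrderedSet {
public:
    // Returns true if the value was newly inserted.
    bool insert(const T &value) {
        if (!seen_.insert(value).second)
            return false;
        order_.push_back(value);
        return true;
    }
    typename std::vector<T>::const_iterator begin() const { return order_.begin(); }
    typename std::vector<T>::const_iterator end() const { return order_.end(); }
    std::size_t size() const { return order_.size(); }

private:
    std::vector<T> order_;
    std::unordered_set<T> seen_;
};

// Builds a worklist with the same range-constructor pattern quoted above.
std::vector<int> make_worklist(const OrderedSet<int> &blocks) {
    return std::vector<int>(blocks.begin(), blocks.end());
}
```

Inserting 3, 1, 3, 2 and then calling `make_worklist` gives the elements back in first-insertion order with the duplicate dropped.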

Rename DwarfUnits to DwarfFile to help avoid some naming confusion. · f8194853
Eric Christopher authored Dec 05, 2013
```
llvm-svn: 196519
```
AttributeList: tweak the conditional order to avoid two strcmps · a31d1dd1
Alp Toker authored Dec 05, 2013
```
llvm-svn: 196518
```

MI-Sched: Model "reserved" processor resources. · 5a22df49

Andrew Trick authored Dec 05, 2013

This allows a target to use MI-Sched as an in-order scheduler that
will model strict resource conflicts without defining a processor
itinerary. Instead, the target can now use the new per-operand machine
model and define in-order resources with BufferSize=0. For example,
this would allow restricting the type of operations that can be formed
into a dispatch group. (Normally NumMicroOps is sufficient to enforce
dispatch groups).

If the intent is to model latency in an in-order pipeline, as opposed to
resource conflicts, then a resource with BufferSize=1 should be
defined instead.

This feature is only casually tested as there are no in-tree targets
using it yet. However, Hal will be experimenting with POWER7.

llvm-svn: 196517


MI-Sched: handle latency of in-order operations with the new machine model. · 880e573d

Andrew Trick authored Dec 05, 2013

The per-operand machine model allows the target to define "unbuffered"
processor resources. This change is a quick, cheap way to model stalls
caused by the latency of operations that use such resources. This only
applies when the processor's micro-op buffer size is non-zero
(Out-of-Order). We can't precisely model in-order stalls during
out-of-order execution, but this is an easy and effective
heuristic. It benefits cortex-a9 scheduling when using the new
machine model, which is not yet on by default.

MI-Sched for armv7 was evaluated on Swift (and was left disabled only because
of a performance bug related to predication). However, we never
evaluated Cortex-A9 performance on MI-Sched in its current form. This
change adds MI-Sched functionality to reach performance goals on
A9. The only remaining change is to allow MI-Sched to run as a PostRA
pass.

I evaluated performance using a set of options to estimate the performance impact once MI-Sched is the default on armv7:
-mcpu=cortex-a9 -disable-post-ra -misched-bench -scheditins=false

For a simple saxpy loop I see a 1.7x speedup. Here are the llvm-testsuite results:
(min run time over 2 runs, filtering tiny changes)

Speedups:
| Benchmarks/BenchmarkGame/recursive         |  52.39% |
| Benchmarks/VersaBench/beamformer           |  20.80% |
| Benchmarks/Misc/pi                         |  19.97% |
| Benchmarks/Misc/mandel-2                   |  19.95% |
| SPEC/CFP2000/188.ammp                      |  18.72% |
| Benchmarks/McCat/08-main/main              |  18.58% |
| Benchmarks/Misc-C++/Large/sphereflake      |  18.46% |
| Benchmarks/Olden/power                     |  17.11% |
| Benchmarks/Misc-C++/mandel-text            |  16.47% |
| Benchmarks/Misc/oourafft                   |  15.94% |
| Benchmarks/Misc/flops-7                    |  14.99% |
| Benchmarks/FreeBench/distray               |  14.26% |
| SPEC/CFP2006/470.lbm                       |  14.00% |
| mediabench/mpeg2/mpeg2dec/mpeg2decode      |  12.28% |
| Benchmarks/SmallPT/smallpt                 |  10.36% |
| Benchmarks/Misc-C++/Large/ray              |   8.97% |
| Benchmarks/Misc/fp-convert                 |   8.75% |
| Benchmarks/Olden/perimeter                 |   7.10% |
| Benchmarks/Bullet/bullet                   |   7.03% |
| Benchmarks/Misc/mandel                     |   6.75% |
| Benchmarks/Olden/voronoi                   |   6.26% |
| Benchmarks/Misc/flops-8                    |   5.77% |
| Benchmarks/Misc/matmul_f64_4x4             |   5.19% |
| Benchmarks/MiBench/security-rijndael       |   5.15% |
| Benchmarks/Misc/flops-6                    |   5.10% |
| Benchmarks/Olden/tsp                       |   4.46% |
| Benchmarks/MiBench/consumer-lame           |   4.28% |
| Benchmarks/Misc/flops-5                    |   4.27% |
| Benchmarks/mafft/pairlocalalign            |   4.19% |
| Benchmarks/Misc/himenobmtxpa               |   4.07% |
| Benchmarks/Misc/lowercase                  |   4.06% |
| SPEC/CFP2006/433.milc                      |   3.99% |
| Benchmarks/tramp3d-v4                      |   3.79% |
| Benchmarks/FreeBench/pifft                 |   3.66% |
| Benchmarks/Ptrdist/ks                      |   3.21% |
| Benchmarks/Adobe-C++/loop_unroll           |   3.12% |
| SPEC/CINT2000/175.vpr                      |   3.12% |
| Benchmarks/nbench                          |   2.98% |
| SPEC/CFP2000/183.equake                    |   2.91% |
| Benchmarks/Misc/perlin                     |   2.85% |
| Benchmarks/Misc/flops-1                    |   2.82% |
| Benchmarks/Misc-C++-EH/spirit              |   2.80% |
| Benchmarks/Misc/flops-2                    |   2.77% |
| Benchmarks/NPB-serial/is                   |   2.42% |
| Benchmarks/ASC_Sequoia/CrystalMk           |   2.33% |
| Benchmarks/BenchmarkGame/n-body            |   2.28% |
| Benchmarks/SciMark2-C/scimark2             |   2.27% |
| Benchmarks/Olden/bh                        |   2.03% |
| skidmarks10/skidmarks                      |   1.81% |
| Benchmarks/Misc/flops                      |   1.72% |

Slowdowns:
| Benchmarks/llubenchmark/llu                | -14.14% |
| Benchmarks/Polybench/stencils/seidel-2d    |  -5.67% |
| Benchmarks/Adobe-C++/functionobjects       |  -5.25% |
| Benchmarks/Misc-C++/oopack_v1p8            |  -5.00% |
| Benchmarks/Shootout/hash                   |  -2.35% |
| Benchmarks/Prolangs-C++/ocean              |  -2.01% |
| Benchmarks/Polybench/medley/floyd-warshall |  -1.98% |
| Polybench/linear-algebra/kernels/3mm       |  -1.95% |
| Benchmarks/McCat/09-vor/vor                |  -1.68% |

llvm-svn: 196516


Machine model comments. Explain a ProcessorUnit's BufferSize. · 093bdd17
Andrew Trick authored Dec 05, 2013
```
llvm-svn: 196515
```
Fix the A9 machine model. VTRN writes two registers. · ff199a4b
Andrew Trick authored Dec 05, 2013
```
llvm-svn: 196514
```
comment typo and reformat · bb1247b9
Andrew Trick authored Dec 05, 2013
```
llvm-svn: 196513
```

clang-format vsix cmake build: use ${LLVM_TOOLS_BINARY_DIR}/${CMAKE_CFG_INTDIR} · 6a3816a6

Hans Wennborg authored Dec 05, 2013

as the location for grabbing clang-format.exe, and also output the .vsix here.

This allows us to find clang-format.exe when building from a MSVC Solution.

llvm-svn: 196512


Check the initial line number without going through PresumedLoc · 52937abc

Alp Toker authored Dec 05, 2013

No practical difference in this case and would return 1 either way, but this is
more self-explanatory.

llvm-svn: 196511


Fix a tranche of comment, test and doc typos · f6a24ce4
Alp Toker authored Dec 05, 2013
```
llvm-svn: 196510
```
Add a default constructor to get deterministic behavior. · 4cc2b873
Rafael Espindola authored Dec 05, 2013
```
Should fix the msan and valgrind bots.

llvm-svn: 196509
```
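
The commit does not name the affected type, so this is a generic sketch of why a default constructor can restore determinism. A default-initialized object with scalar members and no constructor holds indeterminate values, and tools such as msan and valgrind report reads of them. The `EdgeInfo` type and its fields are hypothetical.

```cpp
#include <cassert>

// Hypothetical record type; the names are illustrative only.
struct EdgeInfo {
    int weight;
    bool visited;

    // Without this constructor, `EdgeInfo e;` would leave both fields
    // with indeterminate values, and reading them would be undefined
    // behavior that can differ from run to run.
    EdgeInfo() : weight(0), visited(false) {}
};

// Default-constructs a record and reads a field back; this is
// well-defined thanks to the constructor above.
int default_weight() {
    EdgeInfo e;
    return e.weight;
}
```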

SLPVectorizer: An in-tree vectorized entry cannot also be a scalar external use · 7ee53cac

Arnold Schwaighofer authored Dec 05, 2013

We were creating external uses for scalar values in MustGather entries that also
had a ScalarToTreeEntry (they also are present in a vectorized tuple). This
meant we would keep a value 'alive' both as a scalar and as a vectorized value, causing havoc.
This is not necessary because when we create a MustGather vector we explicitly
create external uses entries for the insertelement instructions of the
MustGather vector elements.

Fixes PR18129.

radar://15582184

llvm-svn: 196508


[tsan] fix PR18146: sometimes a variable written into vptr could have an integer type (after other optimizations) · 2460c3fc

Kostya Serebryany authored Dec 05, 2013

llvm-svn: 196507


PR16532: work around old GCC bug in interception_type_test.cc · 5ca3de6e
Alexey Samsonov authored Dec 05, 2013
```
llvm-svn: 196506
```
Use !! to convert to a boolean value. · d2014f19
Rui Ueyama authored Dec 05, 2013
```
llvm-svn: 196505
```
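
The `!!` idiom applies logical negation twice, so any non-zero value becomes `true` and zero becomes `false`. The sketch below shows it on a bitmask test. The flag constant and function name are hypothetical and are not taken from the commit.

```cpp
#include <cassert>

// Hypothetical flag bit, for illustration only.
const unsigned kFlagExported = 0x4;

// `flags & kFlagExported` evaluates to 0 or 4. The first `!` maps that
// to true/false inverted, the second `!` inverts it back, leaving a
// proper bool with the conversion written out explicitly.
bool is_exported(unsigned flags) {
    return !!(flags & kFlagExported);
}
```

Writing the conversion explicitly makes the intent visible at the call site instead of relying on an implicit integer-to-bool conversion.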

[PECOFF] Handle .lib files as if they are grouped by --{start,end}-group. · 16c025e2

Rui Ueyama authored Dec 05, 2013

Currently we do not de-duplicate library files specified by the /defaultlib option.
As a result, the same files are added multiple times to the input graph. In
particular, some popular files, such as kernel32.lib or oldnames.lib, are added
more than 10 times during linking of LLD. That makes the linker slower, as it
needs to parse the same file again and again.

This patch solves the issue by de-duplicating. The same file will be added only
once to the input graph. This patch improved the LLD linking time from 10.5
seconds to 7.7 seconds on my 4-core Core i7 MacBook Pro.

llvm-svn: 196504

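
The de-duplication described above can be sketched as a "seen" set consulted before each addition. This is not LLD's actual code: `InputGraphSketch` is a hypothetical stand-in for the input graph, and it compares paths as exact strings, whereas a real Windows linker would presumably need to normalize case and directory components first.

```cpp
#include <cassert>
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-in for the linker's input graph: an ordered list of
// library paths in which each path appears at most once.
class InputGraphSketch {
public:
    // Adds the library unless it was already added.
    // Returns true if the file was added, false if it was a duplicate.
    bool add_library(const std::string &path) {
        if (!seen_.insert(path).second)
            return false;  // already queued, so it is not parsed again
        files_.push_back(path);
        return true;
    }

    const std::vector<std::string> &files() const { return files_; }

private:
    std::set<std::string> seen_;      // exact-string comparison (assumption)
    std::vector<std::string> files_;  // preserves first-seen order
};
```

Adding `kernel32.lib` a second time returns `false` and leaves the file list unchanged, so the file would be parsed only once.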

[NVPTX] Fix off-by-one error when creating the VT list for an SDNode · 4459717b
Justin Holewinski authored Dec 05, 2013
```
llvm-svn: 196503
```