Commits · fe4e088dfb6747f614ff5d48107d04e9976b9be4 · Roger Ferrer / llvm-epi-0.8

Nov 14, 2013

Rafael Espindola authored Nov 14, 2013

There is nothing special about quotes and newlines from the object
file point of view, only the assembler has to worry about expanding
the \n and \".

This patch then removes the special handling from the Mangler.

llvm-svn: 194667

fe4e088d

R600/SIFixSGPRCopies.cpp: Fix \param to \return. [-Wdocumentation] · b88288f6
NAKAMURA Takumi authored Nov 14, 2013
```
llvm-svn: 194662
```
b88288f6
Whitespace. · 78e80cd1
NAKAMURA Takumi authored Nov 14, 2013
```
llvm-svn: 194661
```
78e80cd1
[AArch64 neon] support poly64 and relevant intrinsic functions. · afc8bdfd
Kevin Qin authored Nov 14, 2013
```
llvm-svn: 194659
```
afc8bdfd
Implement aarch64 neon instruction class SIMD misc. · aec95baf
Kevin Qin authored Nov 14, 2013
```
llvm-svn: 194656
```
aec95baf
Add dyn_cast<> support to YAML I/O's IO class · dd34f77c
Nick Kledzik authored Nov 14, 2013
```
llvm-svn: 194655
```
dd34f77c

Added BlockFrequencyInfo::view for displaying the block frequency propagation graph via graphviz. · fd8aee76

Michael Gottesman authored Nov 14, 2013

This is useful for debugging issues in the BlockFrequency implementation since
one can easily visualize where probability mass and other errors occur in the
propagation.

llvm-svn: 194654

fd8aee76

Implement AArch64 NEON instruction set AdvSIMD (table). · bb60ccf3
Jiangning Liu authored Nov 14, 2013
```
llvm-svn: 194648
```
bb60ccf3

Fixing a heisenbug where the memory dependence analysis behaves differently · 5cbcf56a

Yunzhong Gao authored Nov 14, 2013

with and without -g.

Adding a test case to make sure that the threshold used in the memory
dependence analysis is respected. The test case also checks that debug
intrinsics are not counted towards this threshold.

Differential Revision: http://llvm-reviews.chandlerc.com/D2141

llvm-svn: 194646

5cbcf56a

Add simple support for tags in YAML I/O · 1e6033ca
Nick Kledzik authored Nov 14, 2013
```
llvm-svn: 194644
```
1e6033ca

llvm-cov: Slightly improved error checking. · 7981f5b8

Yuchen Wu authored Nov 14, 2013

- readInt() should check all 4 bytes can be read, not just 1.
- In the event of false data in the gcno file, it was possible to index
  into a non-existent index of SmallVector, causing assertion error.

llvm-svn: 194639

7981f5b8

llvm-cov: Removed StringMap holding GCOVLines. · d738beec

Yuchen Wu authored Nov 14, 2013

According to the hazy gcov documentation, it appeared to be technically
possible for lines within a block to belong to different source files.
However, upon further investigation, gcov does not actually support
multiple source files for a single block.

This change removes a level of separation between blocks and lines by
replacing the StringMap of GCOVLines with a SmallVector of ints
representing line numbers. This also means that the GCOVLines class is
no longer needed.

This paves the way for supporting the "-a" option, which will output
block information.

llvm-svn: 194637

d738beec

llvm-cov: Replaced asserts with proper error handling. · e28da84c

Yuchen Wu authored Nov 14, 2013

Unified the interface for read functions. They all return a boolean
indicating if the read from file succeeded. Functions that previously
returned the read value now store it into a variable that is passed in
by reference instead. Callers will need to check the return value to
detect if an error occurred.

Also added a new test which ensures that no assertions occur when file
contains invalid data. llvm-cov should return with error code 1 upon
failure.

llvm-svn: 194635

e28da84c

Fixed 80+ violations. · 857d6e3d
Michael Gottesman authored Nov 14, 2013
```
llvm-svn: 194634
```
857d6e3d
R600: Fix uninitialized variable usage · 415ef6db
Tom Stellard authored Nov 13, 2013
```
llvm-svn: 194632
```
415ef6db
Take care of long short branch immediate instructions for mips16 in · 4b7afe55
Reed Kotler authored Nov 13, 2013
```
constant islands.

llvm-svn: 194630
```
4b7afe55

R600/SI: Add support for private address space load/store · 81d871de

Tom Stellard authored Nov 13, 2013

Private address space is emulated using the register file with
MOVRELS and MOVRELD instructions.

llvm-svn: 194626

81d871de

R600/SI: Prefer SALU instructions for bit shift operations · 8216602a

Tom Stellard authored Nov 13, 2013

All shift operations will be selected as SALU instructions and then
if necessary lowered to VALU instructions in the SIFixSGPRCopies pass.

This allows us to do more operations on the SALU which will improve
performance and is also required for implementing private memory
using indirect addressing, since the private memory pointers must stay
in the scalar registers.

This patch includes some fixes from Matt Arsenault.

llvm-svn: 194625

8216602a

Nov 13, 2013

add more comments around the delinearization of arrays · 7ee14724
Sebastian Pop authored Nov 13, 2013
```
llvm-svn: 194612
```
7ee14724
Simplify code. No functionality change. · 9dca4b3e
Jakub Staszak authored Nov 13, 2013
```
llvm-svn: 194602
```
9dca4b3e
Use StringRef instead of std::string · 86a7492f
Jakub Staszak authored Nov 13, 2013
```
llvm-svn: 194601
```
86a7492f

[AArch64] Add support for legacy AArch32 NEON scalar shift by immediate · d3ae5f89

Chad Rosier authored Nov 13, 2013

instructions.  This patch does not include the shift right and accumulate
instructions.  A number of non-overloaded intrinsics have been remove in favor
of their overloaded counterparts.

llvm-svn: 194598

d3ae5f89

Enable generating legacy IT block for AArch32 · 0da5cc07

Weiming Zhao authored Nov 13, 2013

By default, the behavior of IT block generation will be determinated
dynamically base on the arch (armv8 vs armv7). This patch adds backend
options: -arm-restrict-it and -arm-no-restrict-it.  The former one
restricts the generation of IT blocks (the same behavior as thumbv8) for
both arches. The later one allows the generation of legacy IT block (the
same behavior as ARMv7 Thumb2) for both arches.

Clang will support -mrestrict-it and -mno-restrict-it, which is
compatible with GCC.

llvm-svn: 194592

0da5cc07

DIEHash: Move header include to be first in the implementation file to flush... · 9208b5ed

David Blaikie authored Nov 13, 2013

DIEHash: Move header include to be first in the implementation file to flush out header inclusion ordering issues

llvm-svn: 194588

9208b5ed

[SystemZ] Add the general form of BCR · 09de091c
Richard Sandiford authored Nov 13, 2013
```
At the moment this is just the MC support.

llvm-svn: 194585
```
09de091c
Move Delinearization pass into an anonymous namespace. · 9e501ec2
Benjamin Kramer authored Nov 13, 2013
```
llvm-svn: 194582
```
9e501ec2
Make sure LLVMLoadLibraryPermanently gets an extern "C" symbol. · 505d2408
Benjamin Kramer authored Nov 13, 2013
```
Otherwise it's impossible to use it. Also don't include C++ headers in
a C header.

llvm-svn: 194581
```
505d2408

Remove AllowQuotesInName and friends from MCAsmInfo. · fdc88137

Rafael Espindola authored Nov 13, 2013

Accepting quotes is a property of an assembler, not of an object file. For
example, ELF can support any names for sections and symbols, but the gnu
assembler only accepts quotes in some contexts and llvm-mc in a few more.

LLVM should not produce different symbols based on a guess about which assembler
will be reading the code it is printing.

llvm-svn: 194575

fdc88137

Don't call doFinalization from verifyFunction. · 156227ac

Rafael Espindola authored Nov 13, 2013

verifyFunction needs to call doInitialization to collect metadata and avoid
crashing when verifying debug info in a function.

But it should not call doFinalization since that is where the verifier will
check declarations, variables and aliases, which is not desirable when one
only wants to verify a function.

A possible cleanup would be to split the class into a ModuleVerifier and
FunctionVerifier.

Issue reported by Ilia Filippov. Patch by Michael Kruse.

llvm-svn: 194574

156227ac

Fix bug in .gpword directive parsing. · e10c1125
Vladimir Medic authored Nov 13, 2013
```
llvm-svn: 194570
```
e10c1125
Support for microMIPS trap instruction with immediate operands. · ccb70caa
Zoran Jovanovic authored Nov 13, 2013
```
llvm-svn: 194569
```
ccb70caa
Fix -Wdelete-non-virtual-dtor warnings by making SampleProfile methods non-virtual · aa19c0a1
Alexey Samsonov authored Nov 13, 2013
```
llvm-svn: 194568
```
aa19c0a1

SampleProfileLoader pass. Initial setup. · 8d6568b5

Diego Novillo authored Nov 13, 2013

This adds a new scalar pass that reads a file with samples generated
by 'perf' during runtime. The samples read from the profile are
incorporated and emmited as IR metadata reflecting that profile.

The profile file is assumed to have been generated by an external
profile source. The profile information is converted into IR metadata,
which is later used by the analysis routines to estimate block
frequencies, edge weights and other related data.

External profile information files have no fixed format, each profiler
is free to define its own. This includes both the on-disk representation
of the profile and the kind of profile information stored in the file.
A common kind of profile is based on sampling (e.g., perf), which
essentially counts how many times each line of the program has been
executed during the run.

The SampleProfileLoader pass is organized as a scalar transformation.
On startup, it reads the file given in -sample-profile-file to
determine what kind of profile it contains.  This file is assumed to
contain profile information for the whole application. The profile
data in the file is read and incorporated into the internal state of
the corresponding profiler.

To facilitate testing, I've organized the profilers to support two file
formats: text and native. The native format is whatever on-disk
representation the profiler wants to support, I think this will mostly
be bitcode files, but it could be anything the profiler wants to
support. To do this, every profiler must implement the
SampleProfile::loadNative() function.

The text format is mostly meant for debugging. Records are separated by
newlines, but each profiler is free to interpret records as it sees fit.
Profilers must implement the SampleProfile::loadText() function.

Finally, the pass will call SampleProfile::emitAnnotations() for each
function in the current translation unit. This function needs to
translate the loaded profile into IR metadata, which the analyzer will
later be able to use.

This patch implements the first steps towards the above design. I've
implemented a sample-based flat profiler. The format of the profile is
fairly simplistic. Each sampled function contains a list of relative
line locations (from the start of the function) together with a count
representing how many samples were collected at that line during
execution. I generate this profile using perf and a separate converter
tool.

Currently, I have only implemented a text format for these profiles. I
am interested in initial feedback to the whole approach before I send
the other parts of the implementation for review.

This patch implements:

- The SampleProfileLoader pass.
- The base ExternalProfile class with the core interface.
- A SampleProfile sub-class using the above interface. The profiler
  generates branch weight metadata on every branch instructions that
  matches the profiles.
- A text loader class to assist the implementation of
  SampleProfile::loadText().
- Basic unit tests for the pass.

Additionally, the patch uses profile information to compute branch
weights based on instruction samples.

This patch converts instruction samples into branch weights. It
does a fairly simplistic conversion:

Given a multi-way branch instruction, it calculates the weight of
each branch based on the maximum sample count gathered from each
target basic block.

Note that this assignment of branch weights is somewhat lossy and can be
misleading. If a basic block has more than one incoming branch, all the
incoming branches will get the same weight. In reality, it may be that
only one of them is the most heavily taken branch.

I will adjust this assignment in subsequent patches.

llvm-svn: 194566

8d6568b5

XCore target: implement exception handling · a83c0482
Robert Lytton authored Nov 13, 2013
```
llvm-svn: 194564
```
a83c0482
This patch fixes a bug in floating point operands parsing, when instruction... · 77ffd7af
Vladimir Medic authored Nov 13, 2013
```
This patch fixes a bug in floating point operands parsing, when instruction alias uses default register operand.

llvm-svn: 194562
```
77ffd7af
Mips16InstrInfo.cpp: Use <cctype> instead of <ctype.h> · 435f62a8
NAKAMURA Takumi authored Nov 13, 2013
```
Also, prune <stdlib.h>, seems stray.

llvm-svn: 194557
```
435f62a8

Allow the code which returns the length for inline assembler to know · 5c8ae095

Reed Kotler authored Nov 13, 2013

specifically about the .space directive. This allows us to force large
blocks of code to appear in test cases for things like constant islands
without having to make giant test cases to force things like long 
branches to take effect.

llvm-svn: 194555

5c8ae095

R600: Fix selection failure on EXTLOAD · 00a0d6f6
Matt Arsenault authored Nov 13, 2013
```
llvm-svn: 194547
```
00a0d6f6

SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too. · 34c652d3

Juergen Ributzka authored Nov 13, 2013

This patch reapplies r193676 with an additional fix for the Hexagon backend. The
SystemZ backend has already been fixed by r194148.

The Type Legalizer recognizes that VSELECT needs to be split, because the type
is to wide for the given target. The same does not always apply to SETCC,
because less space is required to encode the result of a comparison. As a result
VSELECT is split and SETCC is unrolled into scalar comparisons.

This commit fixes the issue by checking for VSELECT-SETCC patterns in the DAG
Combiner. If a matching pattern is found, then the result mask of SETCC is
promoted to the expected vector mask type for the given target. Now the type
legalizer will split both VSELECT and SETCC.

This allows the following X86 DAG Combine code to sucessfully detect the MIN/MAX
pattern. This fixes PR16695, PR17002, and <rdar://problem/14594431>.

Reviewed by Nadav

llvm-svn: 194542

34c652d3

Introduce an AnalysisManager which is like a pass manager but with a lot · 74015a70

Chandler Carruth authored Nov 13, 2013

more smarts in it. This is where most of the interesting logic that used
to live in the implicit-scheduling-hackery of the old pass manager will
live.

Like the previous commits, note that this is a very early prototype!
I expect substantial changes before this is ready to use.

The core of the design is the following:

- We have an AnalysisManager which can be used across a series of
  passes over a module.
- The code setting up a pass pipeline registers the analyses available
  with the manager.
- Individual transform passes can check than an analysis manager
  provides the analyses they require in order to fail-fast.
- There is *no* implicit registration or scheduling.
- Analysis passes are different from other passes: they produce an
  analysis result that is cached and made available via the analysis
  manager.
- Cached results are invalidated automatically by the pass managers.
- When a transform pass requests an analysis result, either the analysis
  is run to produce the result or a cached result is provided.

There are a few aspects of this design that I *know* will change in
subsequent commits:
- Currently there is no "preservation" system, that needs to be added.
- All of the analysis management should move up to the analysis library.
- The analysis management needs to support at least SCC passes. Maybe
  loop passes. Living in the analysis library will facilitate this.
- Need support for analyses which are *both* module and function passes.
- Need support for pro-actively running module analyses to have cached
  results within a function pass manager.
- Need a clear design for "immutable" passes.
- Need support for requesting cached results when available and not
  re-running the pass even if that would be necessary.
- Need more thorough testing of all of this infrastructure.

There are other aspects that I view as open questions I'm hoping to
resolve as I iterate a bit on the infrastructure, and especially as
I start writing actual passes against this.
- Should we have separate management layers for function, module, and
  SCC analyses? I think "yes", but I'm not yet ready to switch the code.
  Adding SCC support will likely resolve this definitively.
- How should the 'require' functionality work? Should *that* be the only
  way to request results to ensure that passes always require things?
- How should preservation work?
- Probably some other things I'm forgetting. =]

Look forward to more patches in shorter order now that this is in place.

llvm-svn: 194538

74015a70