Commits · 31d093c70564fe6afbaf63beaa76eb5b47ac4f47 · Roger Ferrer / llvm-epi-0.8

Sep 22, 2013

ISelDAG: spot chain cycles involving MachineNodes · 31d093c7

Tim Northover authored Sep 22, 2013

Previously, the DAGISel function WalkChainUsers was spotting that it
had entered already-selected territory by whether a node was a
MachineNode (amongst other things). Since it's fairly common practice
to insert MachineNodes during ISelLowering, this was not the correct
check.

Looking around, it seems that other nodes get their NodeId set to -1
upon selection, so this makes sure the same thing happens to all
MachineNodes and uses that characteristic to determine whether we
should stop looking for a loop during selection.
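
The change in the check can be illustrated with a small standalone model. This is only a sketch and not the SelectionDAGISel code: `Node`, `markSelected`, `isAlreadySelectedOld`, `isAlreadySelected` and `countUnselectedChainUsers` are hypothetical names, and the old check is reduced to its MachineNode part. Only the "NodeId is -1 once selected" convention comes from the message above.

```cpp
#include <vector>

// Hypothetical stand-in for a DAG node; not LLVM's real SDNode class.
struct Node {
    int NodeId = 0;              // set to -1 once the node has been selected
    bool IsMachineNode = false;  // true for target-specific (machine) nodes
    std::vector<Node*> ChainUsers;
};

// Mark a node as selected, following the convention from the commit message.
inline void markSelected(Node& N) { N.NodeId = -1; }

// Old check (simplified assumption): every machine node counts as selected,
// which is wrong for machine nodes inserted during lowering.
inline bool isAlreadySelectedOld(const Node& N) { return N.IsMachineNode; }

// New check: only the NodeId decides.
inline bool isAlreadySelected(const Node& N) { return N.NodeId == -1; }

// Count the chain users that a walk would still have to look at.
inline int countUnselectedChainUsers(const Node& N) {
    int Count = 0;
    for (const Node* U : N.ChainUsers)
        if (!isAlreadySelected(*U))
            ++Count;
    return Count;
}
```

In this model, a machine node created before selection keeps a non-negative NodeId, so the walk continues through it instead of stopping early.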

This should fix PR15840.

llvm-svn: 191165


[Sparc] Add support for TLS in sparc. · cb1dca60
Venkatraman Govindaraju authored Sep 22, 2013
```
llvm-svn: 191164
```

X86: Use R_X86_64_TPOFF64 for FK_Data_8 · 7b1cdb98

David Majnemer authored Sep 22, 2013

Summary:
LLVM would crash when trying to come up with a relocation type for
assembly like:
movabsq $V@TPOFF, %rax

Instead, we say the relocation type is R_X86_64_TPOFF64.
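
A rough sketch of the mapping, assuming a much simplified model of the ELF object writer: `Modifier` and `relocFor8ByteData` are hypothetical names, and only the two relocation numbers (taken from the x86-64 psABI) are real.

```cpp
// Symbol modifiers relevant to this sketch (hypothetical enum).
enum class Modifier { None, TPOFF };

// Relocation numbers as defined by the x86-64 ELF psABI.
enum Reloc : unsigned {
    R_X86_64_NONE = 0,
    R_X86_64_64 = 1,
    R_X86_64_TPOFF64 = 18
};

// Pick a relocation for an 8-byte data fixup (FK_Data_8 in the commit title).
inline Reloc relocFor8ByteData(Modifier M) {
    switch (M) {
    case Modifier::None:
        return R_X86_64_64;       // plain 64-bit absolute address
    case Modifier::TPOFF:
        return R_X86_64_TPOFF64;  // 64-bit offset from the thread pointer
    }
    return R_X86_64_NONE;         // not reached for the modifiers above
}
```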

Fixes PR17274.

Reviewers: dblaikie, nrieck, rafael

CC: llvm-commits

Differential Revision: http://llvm-reviews.chandlerc.com/D1717

llvm-svn: 191163


[SPARC] Make functions with GLOBAL_OFFSET_TABLE access as non-leaf functions. · 7e7eb8ce
Venkatraman Govindaraju authored Sep 22, 2013
```
llvm-svn: 191160
```
[Sparc] Emit .register directive to declare the use of global registers %g2, %g4, %g6 and %g7. · e9ef5122
Venkatraman Govindaraju authored Sep 22, 2013
```
llvm-svn: 191158
```

Correct the pre-increment load latencies in the PPC A2 itinerary · 25415c2b

Hal Finkel authored Sep 22, 2013

Pre-increment loads are microcoded on the A2, and the address increment occurs
only after the load completes. As a result, the latency of the GPR address
update is an additional 2 cycles on top of the load latency.

llvm-svn: 191156


[Sparc] Fix lowering FABS on fp128 (long double) on pre-v9 targets. · 829aec59
Venkatraman Govindaraju authored Sep 21, 2013
```
llvm-svn: 191154
```

Sep 21, 2013

SROA: Handle casts involving vectors of pointers and integer scalars. · 90901a35

Benjamin Kramer authored Sep 21, 2013

SROA wants to convert any types of equivalent widths but it's not possible to
convert vectors of pointers to an integer scalar with a single cast. As a
workaround we add a bitcast to the corresponding int ptr type first. This type
of cast used to be an edge case but has become common with SLP vectorization.
Fixes PR17271.

llvm-svn: 191143


Revert "SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too." · f043a653
Juergen Ributzka authored Sep 21, 2013
```
This reverts commit r191130.

llvm-svn: 191138
```
Remove alignment restrictions from FMA load folding. · 58f6e64e
Craig Topper authored Sep 21, 2013
```
llvm-svn: 191136
```
SLPVectorizer: Fix multiline comment warning · d743feef
Arnold Schwaighofer authored Sep 21, 2013
```
llvm-svn: 191135
```

ELF: Parse types in directives like binutils gas · f90c3b5a

David Majnemer authored Sep 21, 2013

Allow binutils .type and .section directives to take the following
forms:
- @<type>
- %<type>
- "<type>"
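
The three accepted spellings could be recognized roughly as follows. This is a standalone sketch, not the MC assembly parser: `parseDirectiveType` is a hypothetical helper that works on an already isolated token.

```cpp
#include <string>

// Return the bare type name from a token written as @type, %type or "type".
// An empty string signals a token that matches none of the three forms.
inline std::string parseDirectiveType(const std::string& Tok) {
    if (Tok.size() < 2)
        return "";
    if (Tok[0] == '@' || Tok[0] == '%')
        return Tok.substr(1);                  // strip the one-character prefix
    if (Tok.size() >= 3 && Tok.front() == '"' && Tok.back() == '"')
        return Tok.substr(1, Tok.size() - 2);  // strip the surrounding quotes
    return "";
}
```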

llvm-svn: 191134


Fix the buildbot · c2551eb4
Juergen Ributzka authored Sep 21, 2013
```
llvm-svn: 191133
```

[X86] Emulate AVX 256bit MIN/MAX support by splitting the vector. · ab930591

Juergen Ributzka authored Sep 21, 2013

With AVX, 256bit vectors are valid vectors and therefore the Type Legalizer doesn't
split the VSELECT and SETCC nodes. AVX only supports MIN/MAX on 128bit vectors
and this fix enables vector splitting for this special case in the X86 DAG
Combiner.
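
The effect of the split can be modelled in plain C++. This sketch does not use the DAG Combiner API: `V4`, `V8`, `min128` and `min256BySplitting` are hypothetical names, 32-bit integer lanes are an arbitrary choice, and `min128` stands in for the 128bit MIN operation the target supports.

```cpp
#include <algorithm>
#include <array>
#include <cstddef>
#include <cstdint>

using V4 = std::array<std::int32_t, 4>;  // models a 128bit vector
using V8 = std::array<std::int32_t, 8>;  // models a 256bit vector

// Lane-wise minimum on the narrow type, standing in for the native operation.
inline V4 min128(const V4& A, const V4& B) {
    V4 R{};
    for (std::size_t I = 0; I < 4; ++I)
        R[I] = std::min(A[I], B[I]);
    return R;
}

// Emulate the wide minimum: split both inputs, operate on halves, rejoin.
inline V8 min256BySplitting(const V8& A, const V8& B) {
    V4 ALo{}, AHi{}, BLo{}, BHi{};
    for (std::size_t I = 0; I < 4; ++I) {
        ALo[I] = A[I];
        AHi[I] = A[I + 4];
        BLo[I] = B[I];
        BHi[I] = B[I + 4];
    }
    const V4 Lo = min128(ALo, BLo);
    const V4 Hi = min128(AHi, BHi);
    V8 R{};
    for (std::size_t I = 0; I < 4; ++I) {
        R[I] = Lo[I];
        R[I + 4] = Hi[I];
    }
    return R;
}
```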

This fix is related to PR16695, PR17002, and <rdar://problem/14594431>.

llvm-svn: 191131


SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too. · e9a80fc9

Juergen Ributzka authored Sep 21, 2013

The Type Legalizer recognizes that VSELECT needs to be split, because the type
is too wide for the given target. The same does not always apply to SETCC,
because less space is required to encode the result of a comparison. As a result
VSELECT is split and SETCC is unrolled into scalar comparisons.

This commit fixes the issue by checking for VSELECT-SETCC patterns in the DAG
Combiner. If a matching pattern is found, then the result mask of SETCC is
promoted to the expected vector mask for the given target. This mask usually has
the same size as the VSELECT return type (except for Intel KNL). Now the type
legalizer will split both VSELECT and SETCC.

This allows the following X86 DAG Combine code to successfully detect the MIN/MAX
pattern. This fixes PR16695, PR17002, and <rdar://problem/14594431>.

llvm-svn: 191130


Initialize BSSSection explicitly in InitMachOMCObjectFileInfo() to appease msvc. · 68fa6f9d
NAKAMURA Takumi authored Sep 21, 2013
```
This can revert r191087.

llvm-svn: 191128
```
Set .reorder for the stub so that gas takes care of delay slot processing. · 78fb291e
Reed Kotler authored Sep 21, 2013
```
llvm-svn: 191125
```

Reapply "SLPVectorizer: Handle more horizontal reductions (disabled)" · 500242d4

Arnold Schwaighofer authored Sep 21, 2013

Reapply r191108 with a fix for a memory corruption error I introduced.  Of
course, we can't reference the scalars that we replace by vectorizing and then
call their eraseFromParent method. I only 'needed' the scalars to get the
DebugLoc. Just store the DebugLoc before actually vectorizing instead. As a nice
side effect, this also simplifies the interface between BoUpSLP and the
HorizontalReduction class to returning a value pointer (the vectorized tree
root).
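
The ordering issue can be shown with a minimal model. This is a sketch only: `Inst` and `vectorize` are hypothetical, a `std::string` stands in for the DebugLoc, and clearing the container stands in for calling eraseFromParent on each scalar.

```cpp
#include <memory>
#include <string>
#include <vector>

// Hypothetical instruction that carries nothing but a debug location.
struct Inst {
    std::string DebugLoc;
};

// Replace the scalars by one vector instruction. The debug location is
// copied out first, because the scalars no longer exist after erasing.
inline Inst vectorize(std::vector<std::unique_ptr<Inst>>& Scalars) {
    const std::string Loc =
        Scalars.empty() ? std::string() : Scalars.front()->DebugLoc;
    Scalars.clear();   // models erasing the replaced scalars
    return Inst{Loc};  // the new root reuses the saved location
}
```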

radar://14607682

llvm-svn: 191123


LoopVectorizer: Only allow vectorization of intrinsics. We can't know for sure... · 3371172a

Nadav Rotem authored Sep 21, 2013

LoopVectorizer: Only allow vectorization of intrinsics. We can't know for sure that the functions 'abs' or 'round' are the functions from libm.

rdar://15012650

llvm-svn: 191122


Revert "SLPVectorizer: Handle more horizontal reductions (disabled)" · f1dfbfdd

Arnold Schwaighofer authored Sep 21, 2013

This reverts commit r191108.

The horizontal.ll test case fails under libgmalloc. Thanks Shuxin for pointing
this out to me.

llvm-svn: 191121


Move emission of the debug string table to early in the debug · 9cd26af8

Eric Christopher authored Sep 20, 2013

info finalization to greatly reduce the number of fixups that the
assembler has to handle in order to improve compile time.

llvm-svn: 191119


Resurrect r191017 "GVN proceeds in the presence of dead code" plus a fix to PR17307 & 17308. · 6e35094b

Shuxin Yang authored Sep 20, 2013

The problem with r191017 is that when GVN fabricates a value number for a dead instruction (in order
to keep the following expr-PRE happy), it forgets to fabricate a leader-table entry for it as well.
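
The invariant behind the fix, sketched with a toy data structure. `ValueTable`, `assign` and `hasLeader` are hypothetical, and instructions are identified by strings; the real GVN tables are more involved.

```cpp
#include <map>
#include <string>

// Toy model of a value-number table paired with a leader table.
struct ValueTable {
    std::map<std::string, unsigned> Numbers;  // instruction -> value number
    std::map<unsigned, std::string> Leaders;  // value number -> leader
    unsigned Next = 1;

    // Hand out a value number and record a leader for it in the same step,
    // so that a later lookup by number never comes back empty.
    unsigned assign(const std::string& Inst) {
        auto It = Numbers.find(Inst);
        if (It != Numbers.end())
            return It->second;
        const unsigned N = Next++;
        Numbers[Inst] = N;
        Leaders[N] = Inst;
        return N;
    }

    bool hasLeader(unsigned N) const { return Leaders.count(N) != 0; }
};
```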

llvm-svn: 191118


MC: Tidy up. · 4b905844

Jim Grosbach authored Sep 20, 2013

Clean up some simple code quality issues. Bring internal naming
conventions up to current standard, fix inconsistent formatting, and
tidy up a couple of odd constructs.

llvm-svn: 191117


Migrate addGlobalName to the .cpp file as an intermediate step · 9c58f317
Eric Christopher authored Sep 20, 2013
```
to further work.

llvm-svn: 191113
```
InstCombine: Remove unused argument. No functionality change. · 0e2d162d
Benjamin Kramer authored Sep 20, 2013
```
llvm-svn: 191112
```

Sep 20, 2013

[mips] MUL should clobber HI0 and LO0. · ff1fbda4
Akira Hatanaka authored Sep 20, 2013
```
I cannot think of a test case that reliably triggers this bug.

llvm-svn: 191109
```

SLPVectorizer: Handle more horizontal reductions (disabled) · 47249631

Arnold Schwaighofer authored Sep 20, 2013

Match reductions starting at binary operation feeding into a phi. The code
handles trees like

 r += v1 + v2 + v3 ...

and

 r += v1
 r += v2
 ...

and

 r *= v1 + v2 + ...

We currently only handle associative operations (add, fadd fast).

The code can now also handle reductions feeding into stores.

 a[i] = v1 + v2 + v3 + ...

The code is currently disabled behind the flag "-slp-vectorize-hor".  The cost
model for most architectures is not there yet.
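
To make the pattern concrete, here is a sketch of a four-element reduction written once as the scalar chain and once in the shape a horizontal reduction would take. `reduceScalarChain` and `reduceHorizontal` are hypothetical names, and integers are used so that reassociation is exact. This models the idea only, not the vectorizer's output.

```cpp
#include <array>

// Scalar form: r += v1; r += v2; ... as a serial chain of additions.
inline int reduceScalarChain(int R, const std::array<int, 4>& V) {
    R += V[0];
    R += V[1];
    R += V[2];
    R += V[3];
    return R;
}

// Horizontal form: add the upper half onto the lower half, then combine
// the two partial sums. This relies on the operation being associative.
inline int reduceHorizontal(int R, const std::array<int, 4>& V) {
    const int S0 = V[0] + V[2];
    const int S1 = V[1] + V[3];
    return R + (S0 + S1);
}
```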

I found one opportunity of a horizontal reduction feeding a phi in TSVC
(LoopRerolling-flt) and there are several opportunities where reductions feed
into stores.

radar://14607682

llvm-svn: 191108


Revert r191017, it results in segmentation faults in Qt. · 1fbe3236
Joerg Sonnenberger authored Sep 20, 2013
```
llvm-svn: 191104
```

InstCombine: Canonicalize (gep i8* X, -(ptrtoint Y)) to (sub (ptrtoint X), (ptrtoint Y)) · e6461e30

Benjamin Kramer authored Sep 20, 2013

The GEP pattern is what SCEV expander emits for "ugly geps". The latter is what
you get for pointer subtraction in C code. The rest of instcombine already
knows how to deal with that so just canonicalize on that.
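
The equivalence of the two forms can be checked on plain integers. This is a sketch that ignores the pointer/integer type distinction of the IR: `gepForm` and `subForm` are hypothetical helpers, and addresses are modelled as `std::uintptr_t` with wrap-around arithmetic.

```cpp
#include <cstdint>

// GEP form: X advanced by the negated integer value of Y.
inline std::uintptr_t gepForm(std::uintptr_t X, std::uintptr_t Y) {
    const std::uintptr_t NegY = 0 - Y;  // unsigned negation wraps around
    return X + NegY;
}

// Canonical form: a plain subtraction of the two integer values.
inline std::uintptr_t subForm(std::uintptr_t X, std::uintptr_t Y) {
    return X - Y;
}
```

Both functions return the same value for every input, because unsigned addition of a negated value and unsigned subtraction agree modulo the word size.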

llvm-svn: 191090


Revert "llvm-c: Add LLVMGetPointerToFunction" · fc8ca533
Anders Waldenborg authored Sep 20, 2013
```
This reverts r191030

llvm-svn: 191075
```
Lift alignment restrictions on load/store folding of VEXTRACTI128/VINSERTI128. · 9a3915a7
Craig Topper authored Sep 20, 2013
```
llvm-svn: 191073
```

Allow subtarget selection of the default MachineScheduler and document the interface. · 978674b2

Andrew Trick authored Sep 20, 2013

The global registry is used to allow command line override of the
scheduler selection, but does not work well as the normal selection
API. For example, the same LLVM process should be able to target
multiple targets or subtargets.

llvm-svn: 191071


Revert r191062; the build break was also fixed in a different (incompatible) way in r191060. · 3e9a6d34
Richard Smith authored Sep 20, 2013
```
llvm-svn: 191065
```
Unbreak Clang build after r191050: don't pass a StringRef to snprintf. · 2767048b
Richard Smith authored Sep 20, 2013
```
llvm-svn: 191062
```

DebugInfo: GDBIndexEntry*String conversion functions now return const char*... · efd0bcb7

David Blaikie authored Sep 20, 2013

DebugInfo: GDBIndexEntry*String conversion functions now return const char* for easy use with llvm::format

This was previously invoking UB by passing a user-defined type to
format. Thanks to Jordan Rose for pointing this out.
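
A sketch of the pattern using `std::snprintf` in place of llvm::format, which likewise forwards its arguments to a printf-style call. `EntryKind`, `kindString` and `describe` are hypothetical stand-ins for the real GDBIndexEntry helpers.

```cpp
#include <cstdio>
#include <string>

// Hypothetical enumeration standing in for an index entry kind.
enum class EntryKind { None, Type, Variable, Function };

// Returning const char* lets the result be passed straight to "%s".
// Passing a class type such as a string wrapper through varargs is not safe.
inline const char* kindString(EntryKind K) {
    switch (K) {
    case EntryKind::None:     return "NONE";
    case EntryKind::Type:     return "TYPE";
    case EntryKind::Variable: return "VARIABLE";
    case EntryKind::Function: return "FUNCTION";
    }
    return "UNKNOWN";
}

// Format one entry with a printf-style call.
inline std::string describe(EntryKind K) {
    char Buf[32];
    std::snprintf(Buf, sizeof(Buf), "kind=%s", kindString(K));
    return std::string(Buf);
}
```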

llvm-svn: 191060


Add braces to suppress Clang's dangling-else warning. · 9d117ab7
David Blaikie authored Sep 20, 2013
```
These violations were introduced in r191049

llvm-svn: 191059
```

DebugInfo: constrain gnu pubnames test further · ac30f9e8

David Blaikie authored Sep 19, 2013

Ensures that the pubnames entries actually refer to the intended
entities. This test could be more flexible if there was a way to do
multiline FileCheck matches with captures (in that way the test wouldn't
need to have hardcoded offset values and would thus be resilient to
changes in the layout of the DIEs in this CU).

llvm-svn: 191055


Added support for generating DWARF .debug_aranges sections automatically. · 21101b32
Richard Mitton authored Sep 19, 2013
```
llvm-svn: 191052
```

Rename ConvergingScheduler to GenericScheduler. · 665d3ec3

Andrew Trick authored Sep 19, 2013

This was an experimental scheduler a year ago. It's now used by
several subtargets, both in-order and out-of-order, and it
is about to be enabled by default for x86 and armv7. It will be the
new GenericScheduler for subtargets that don't provide their own
SchedulingStrategy.

llvm-svn: 191051


DebugInfo: llvm-dwarfdump support for gnu_pubnames section · 404d3047
David Blaikie authored Sep 19, 2013
```
llvm-svn: 191050
```