Commits · e41f37d99db1e341b3cf24ed49f21b58d63c15c4 · Roger Ferrer / llvm-epi-0.8

Sep 17, 2013
- Use reference instead of copy. · ec2ffa92
  Jakub Staszak authored Sep 16, 2013
```
llvm-svn: 190813
```
  ec2ffa92
Sep 16, 2013

[PowerPC] Fix PR17155 - Ignore COPY_TO_REGCLASS during emit. · c763c224

Bill Schmidt authored Sep 16, 2013

Fast-isel generates a COPY_TO_REGCLASS for widening f32 to f64, which
is a nop on PPC64.  This is needed to keep the register class system
happy, but on the fast-isel path it is not removed before emit as it
is for DAG select.  Ignore this op when emitting instructions.

llvm-svn: 190795

c763c224

Don't vectorize if there are outside loop users of the induction variable. · 53e622ce

Arnold Schwaighofer authored Sep 16, 2013

We would have to compute the pre increment value, either by computing it on
every loop iteration or by splitting the edge out of the loop and inserting a
computation for it there.

For now, just give up vectorizing such loops.

Fixes PR17179.

llvm-svn: 190790

53e622ce

[msan] Check return value of main(). · 604293fb
Evgeniy Stepanov authored Sep 16, 2013
```
llvm-svn: 190782
```
604293fb
This patch implements Mips load/store instructions from/to coprocessor 2. Test cases are added. · 05bcde6d
Vladimir Medic authored Sep 16, 2013
```
llvm-svn: 190780
```
05bcde6d
ARM: Deduplicate ConstantPoolValues. · 2ef689ca
Benjamin Kramer authored Sep 16, 2013
```
llvm-svn: 190779
```
2ef689ca

[SystemZ] Improve extload handling · 109a7c6f

Richard Sandiford authored Sep 16, 2013

The port originally had special patterns for extload, mapping them to the
same instructions as sextload.  It seemed neater to have patterns that
match "an extension that is allowed to be signed" and "an extension that
is allowed to be unsigned".

This was originally meant to be a clean-up, but it does improve the handling
of promoted integers a little, as shown by args-06.ll.

llvm-svn: 190777

109a7c6f

Make F16C feature flag imply AVX rather than just checking both at the patterns. · a6d204ec
Craig Topper authored Sep 16, 2013
```
llvm-svn: 190775
```
a6d204ec

Implement function prefix data as an IR feature. · 3fa50f9b

Peter Collingbourne authored Sep 16, 2013

Previous discussion:
http://lists.cs.uiuc.edu/pipermail/llvmdev/2013-July/063909.html

Differential Revision: http://llvm-reviews.chandlerc.com/D1191

llvm-svn: 190773

3fa50f9b

PPC: Don't restrict lvsl generation to after type legalization · 40c34781

Hal Finkel authored Sep 15, 2013

This is a re-commit of r190764, with an extra check to make sure that we're not
performing the transformation on illegal types (a small test case has been
added for this as well).

Original commit message:

The PPC backend uses a target-specific DAG combine to turn unaligned Altivec
loads into a permutation-based sequence when possible. Unfortunately, the
target-specific DAG combine is not always called on all loads of interest
(sometimes the routines in DAGCombine call CombineTo such that the new node and
users are not added to the worklist); allowing the combine to trigger early
(before type legalization) mitigates this problem. Because the autovectorizers
only create legal vector types, I don't expect a lot of cases where this
optimization is enabled by type legalization in practice.

llvm-svn: 190771

40c34781

Replace some unnecessary vector copies with references. · 7d605268
Benjamin Kramer authored Sep 15, 2013
```
llvm-svn: 190770
```
7d605268

Sep 15, 2013

ELF: Add support for the exclude section bit for gas compat. · ac511cac
Benjamin Kramer authored Sep 15, 2013
```
llvm-svn: 190769
```
ac511cac

MC: Add support for '?' flags in .section directives · a4b521b7

David Majnemer authored Sep 15, 2013

Summary:
The '?' flag uses the last section group if the last had a section
group.  We treat combining an explicit section group and the '?' as a
hard error.

This fixes PR17198.

Reviewers: rafael, bkramer

Reviewed By: bkramer

CC: llvm-commits

Differential Revision: http://llvm-reviews.chandlerc.com/D1686

llvm-svn: 190768

a4b521b7

Fix alignment of unwind data. · 8539b463

Kai Nacke authored Sep 15, 2013

For alignment purposes, the instruction array will always have an even
number of entries, with the final entry potentially unused (in which
case the array will be one longer than indicated by the count of unwind
codes field).

Reviewed by Anton Korobeynikov, Charles Davis and Nico Rieck.

llvm-svn: 190767

8539b463

Generate IMAGE_REL_AMD64_ADDR32NB relocations for SEH · 74adc8a4

Kai Nacke authored Sep 15, 2013

 data structures.

The Win64 EH data structures must be of type IMAGE_REL_AMD64_ADDR32NB
instead of IMAGE_REL_AMD64_ADDR32. This is easiely achieved by adding
the VK_COFF_IMGREL32 modifier to the symbol reference.
Change also references to start and end of the SEH range of a function
as offsets to start of the function.

Reviewed by Jim Grosbach, Charles Davis and Nico Rieck.

llvm-svn: 190766

74adc8a4

Revert r190764: PPC: Don't restrict lvsl generation to after type legalization · 31025a63

Hal Finkel authored Sep 15, 2013

This is causing test-suite failures.

Original commit message:

The PPC backend uses a target-specific DAG combine to turn unaligned Altivec
loads into a permutation-based sequence when possible. Unfortunately, the
target-specific DAG combine is not always called on all loads of interest
(sometimes the routines in DAGCombine call CombineTo such that the new node and
users are not added to the worklist); allowing the combine to trigger early
(before type legalization) mitigates this problem. Because the autovectorizers
only create legal vector types, I don't expect a lot of cases where this
optimization is enabled by type legalization in practice.

llvm-svn: 190765

31025a63

PPC: Don't restrict lvsl generation to after type legalization · 2945d4e9

Hal Finkel authored Sep 15, 2013

The PPC backend uses a target-specific DAG combine to turn unaligned Altivec
loads into a permutation-based sequence when possible. Unfortunately, the
target-specific DAG combine is not always called on all loads of interest
(sometimes the routines in DAGCombine call CombineTo such that the new node and
users are not added to the worklist); allowing the combine to trigger early
(before type legalization) mitigates this problem. Because the autovectorizers
only create legal vector types, I don't expect a lot of cases where this
optimization is enabled by type legalization in practice.

llvm-svn: 190764

2945d4e9

Prevent assert in CombinerGlobalAA with null values · 31658834

Hal Finkel authored Sep 15, 2013

DAGCombiner::isAlias can be called with SrcValue1 or SrcValue2 null, and we
can't use AA in this case (if we try, then the casting code in AA will assert).

llvm-svn: 190763

31658834

Expand the mask capability for deciding which functions are mips16 and mips32 · 65553152
Reed Kotler authored Sep 15, 2013
```
so it can be better used for general interoperability testing between mips32
and mips16.

llvm-svn: 190762
```
65553152
Remove unused StringRef that no compiler warned about, I wonder why. · 43cc98a7
Benjamin Kramer authored Sep 14, 2013
```
llvm-svn: 190759
```
43cc98a7

Sep 14, 2013

Add the remaining Intel SHA instructions · 8eb45a4e

Ben Langmuir authored Sep 14, 2013

Also assembly/disassembly tests, and for sha256rnds2, aliases with an explicit
xmm0 dependency.

llvm-svn: 190754

8eb45a4e

Fix spelling. · 042f10ce
Robert Wilhelm authored Sep 14, 2013
```
llvm-svn: 190750
```
042f10ce
Fix spelling. · 516be56f
Robert Wilhelm authored Sep 14, 2013
```
llvm-svn: 190749
```
516be56f

Remove the long, long defunct IR block placement pass. · ebeac5cb

Chandler Carruth authored Sep 14, 2013

This pass was based on the previous (essentially unused) profiling
infrastructure and the assumption that by ordering the basic blocks at
the IR level in a particular way, the correct layout would happen in the
end. This sometimes worked, and mostly didn't. It also was a really
naive implementation of the classical paper that dates from when branch
predictors were primarily directional and when loop structure wasn't
commonly available. It also didn't factor into the equation
non-fallthrough branches and other machine level details.

Anyways, for all of these reasons and more, I wrote
MachineBlockPlacement, which completely supercedes this pass. It both
uses modern profile information infrastructure, and actually works. =]

llvm-svn: 190748

ebeac5cb

Fixed bug when generating Load Upper Immediate microMIPS instruction. · fc26cfcd
Zoran Jovanovic authored Sep 14, 2013
```
llvm-svn: 190746
```
fc26cfcd
Support for microMIPS DIV instructions. · 3671a544
Zoran Jovanovic authored Sep 14, 2013
```
llvm-svn: 190745
```
3671a544
Support for misc microMIPS instructions. · ab852781
Zoran Jovanovic authored Sep 14, 2013
```
llvm-svn: 190744
```
ab852781

Make PrettyStackTraceEntry use ManagedStatic for its ThreadLocal. · 67d97093

Filip Pizlo authored Sep 13, 2013

This was somewhat tricky because ~PrettyStackTraceEntry() may run after
llvm_shutdown() has been called. This is rare and only happens for a common idiom
used in the main() functions of command-line tools. This works around the idiom by
skipping the stack clean-up if the PrettyStackTraceHead ManagedStatic is not
constructed (i.e. llvm_shutdown() has been called).

llvm-svn: 190730

67d97093

Sep 13, 2013

Add missing break statement in PPCISelLowering · c3cfbf86
Hal Finkel authored Sep 13, 2013
```
As it turns out, not a problem in practice, but it should be there.

llvm-svn: 190720
```
c3cfbf86

Adds support for Atom Silvermont (SLM) - -march=slm · 3fe264d6

Preston Gurd authored Sep 13, 2013

Implements Instruction scheduler latencies for Silvermont,
using latencies from the Intel Silvermont Optimization Guide.

Auto detects SLM.

Turns on post RA scheduler when generating code for SLM.

llvm-svn: 190717

3fe264d6

[Peephole] Rewrite copies to avoid cross register banks copies. · cf71c632

Quentin Colombet authored Sep 13, 2013

By definition copies across register banks are not coalescable. Still, it may be
possible to get rid of such a copy when the value is available in another
register of the same register file.
Consider the following example, where capital and lower letters denote different
register file:
b = copy A <-- cross-bank copy
...
C = copy b <-- cross-bank copy

This could have been optimized this way:
b = copy A  <-- cross-bank copy
...
C = copy A <-- same-bank copy

Note: b and C's definitions may be in different basic blocks.

This patch adds a peephole optimization that looks through a chain of copies
leading to a cross-bank copy and reuses a source that is on the same register
file if available.

This solution could also be used to get rid of some copies (e.g., A could have
been used instead of C). However, we do not do so because:
- It may over constrain the coloring of the source register for coalescing.
- The register allocator may not be able to find a nice split point for the
  longer live-range, leading to more spill.

<rdar://problem/14742333>

llvm-svn: 190713

cf71c632

[ARMv8] Change hasV8Fp to hasFPARMv8, and other command line options · ccd04894
Joey Gouly authored Sep 13, 2013
```
to be more consistent.

llvm-svn: 190692
```
ccd04894
[msan] Add source file:line to stack origin reports. · 0435ecd1
Evgeniy Stepanov authored Sep 13, 2013
```
Compiler part.

llvm-svn: 190689
```
0435ecd1
[ARMv8] Emit the proper .fpu directive. · 3c0e5567
Joey Gouly authored Sep 13, 2013
```
Patch by Bradley Smith!

llvm-svn: 190683
```
3c0e5567
Test commit to verify that commit access works. · def5d347
Zoran Jovanovic authored Sep 13, 2013
```
llvm-svn: 190676
```
def5d347
[SystemZ] Use getTarget{Insert,Extract}Subreg rather than getMachineNode · d8163208
Richard Sandiford authored Sep 13, 2013
```
Just a clean-up, no behavioral change intended.

llvm-svn: 190673
```
d8163208
[SystemZ] Try to fold shifts into TMxx · 030c1657
Richard Sandiford authored Sep 13, 2013
```
E.g. "SRL %r2, 2; TMLL %r2, 1" => "TMLL %r2, 4".

llvm-svn: 190672
```
030c1657
Avoid a compiler warning about Found not being used when assertions are · c9e95ad0
Duncan Sands authored Sep 13, 2013
```
disabled.

llvm-svn: 190668
```
c9e95ad0

AArch64: use RegisterOperand for NEON registers. · 635a9790

Tim Northover authored Sep 13, 2013

Previously we modelled VPR128 and VPR64 as essentially identical
register-classes containing V0-V31 (which had Q0-Q31 as "sub_alias"
sub-registers). This model is starting to cause significant problems
for code generation, particularly writing EXTRACT/INSERT_SUBREG
patterns for converting between the two.

The change here switches to classifying VPR64 & VPR128 as
RegisterOperands, which are essentially aliases for RegisterClasses
with different parsing and printing behaviour. This fits almost
exactly with their real status (VPR128 == FPR128 printed strangely,
VPR64 == FPR64 printed strangely).

llvm-svn: 190665

635a9790

Move operator to end of previous line to match coding standards. · 21a916b6
Craig Topper authored Sep 13, 2013
```
llvm-svn: 190659
```
21a916b6