- Jan 10, 2012
Chandler Carruth authored
Patch by Evgeniy Stepanov. llvm-svn: 147871
Richard Smith authored
llvm-svn: 147870
Bill Wendling authored
As the comment around line 7746 says, it's better to use x87 extended precision here than SSE, and the generic code doesn't know how to do that. It also regains the speed lost for the uint64_to_float.c testcase. <rdar://problem/10669858> llvm-svn: 147869
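To see why extended precision matters here, consider the double-rounding hazard in converting a uint64_t to float: going through an intermediate double rounds twice, while the x87 unit's 80-bit format can hold the full 64-bit integer and round once. A minimal sketch, with an illustrative value chosen to expose the difference (not taken from the testcase):

```cpp
#include <cstdint>
#include <cstdio>

int main() {
  // 2^63 + 2^39 + 1: rounding to double first drops the +1, leaving a
  // value exactly halfway between two floats; round-to-even then picks
  // 2^63. A single correctly rounded conversion sees the +1 and rounds
  // up to 2^63 + 2^40 instead.
  uint64_t u = (1ULL << 63) + (1ULL << 39) + 1;
  float once = (float)u;          // one rounding step
  float twice = (float)(double)u; // two rounding steps
  printf("%a vs %a\n", once, twice); // differ by one ulp
}
```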
Richard Smith authored
llvm-svn: 147867
Chandler Carruth authored
conventions. Also, clarify the grouping of one of the asserts to silence -Wparentheses. llvm-svn: 147863
Chandler Carruth authored
of several newly un-defaulted switches. This also helps optimizers (including LLVM's) recognize that every case is covered, so they can assume as much. llvm-svn: 147861
Kevin Enderby authored
assembly source when it generates the TAG_subprogram dwarf debug info for labels that have nothing between them, as in this bit of assembly source:

% cat ZeroLength.s
_func1:
_func2:
        nop

One solution would be to not emit the subsequent labels with the same address, and to use the next label with a different address (or the end of the section) for the AT_high_pc value of the TAG_subprogram. It turns out that in llvm-mc it is not possible in all cases to determine if two symbols have the same value at the point we put out the TAG_subprogram dwarf debug info. So we will have llvm-mc put out DW_TAG_label's instead of TAG_subprogram's; a DW_TAG_label does not have an AT_high_pc value, which avoids the problem. This commit is only the functional change, to make the diffs clear as to what is really being changed. The next commit will clean up the names of things like MCGenDwarfSubprogramEntry to something like MCGenDwarfLabelEntry. rdar://10666925 llvm-svn: 147860
Devang Patel authored
Right now, this just adds additional entries in the match table. The parser does not use them yet. llvm-svn: 147859
David Blaikie authored
llvm-svn: 147855
Nadav Rotem authored
Fix a bug in the legalization of shuffle vectors. When we emulate shuffles using BUILD_VECTOR nodes, we may be creating a BUILD_VECTOR of a different type; make sure to cast it back. llvm-svn: 147851
Benjamin Kramer authored
llvm-svn: 147846
Craig Topper authored
Fix a crash in AVX2 when trying to broadcast a double into a 128-bit vector. There is no vbroadcastsd xmm, but we do need to support 64-bit integers broadcast into xmm. Also factor the AVX check into the isVectorBroadcast function; this makes more sense since the AVX2 check was already inside. llvm-svn: 147844
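The two 128-bit cases named above, written with standard <immintrin.h> intrinsics as a hedged reconstruction (the commit doesn't show the exact IR that crashed):

```cpp
#include <immintrin.h>

// Splat a double into a 128-bit vector: there is no vbroadcastsd xmm
// form, so this cannot lower to that broadcast instruction (the case
// that used to crash).
__m128d splat_pd(double x) { return _mm_set1_pd(x); }

// Splat a 64-bit integer into a 128-bit vector: this still needs to be
// supported under AVX2 (which provides vpbroadcastq for it).
__m128i splat_epi64(long long x) { return _mm_set1_epi64x(x); }
```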
Craig Topper authored
Remove hasXMM/hasXMMInt functions. Move callers to hasSSE1/hasSSE2. This is the final piece to remove the AVX hack that disabled SSE. llvm-svn: 147843
Craig Topper authored
Remove hasSSE*orAVX functions and change all callers to use just hasSSE*. AVX is now an SSE level and no longer disables SSE checks. llvm-svn: 147842
Craig Topper authored
Instruction selection priority fixes to remove the XMM/XMMInt/orAVX predicates. Another commit will remove orAVX functions from X86SubTarget. llvm-svn: 147841
Evan Cheng authored
define physical registers. It's currently very restrictive, only catching cases where the CE is in an immediate (and only) predecessor. But it catches a surprisingly large number of cases. rdar://10660865 llvm-svn: 147827
Andrew Trick authored
These heuristics are sufficient for enabling IV chains by default. Performance analysis has been done for i386, x86_64, and thumbv7. The optimization is rarely important, but can significantly speed up certain cases by eliminating spill code within the loop. Unrolled loops are prime candidates for IV chains. In many cases, the final code could still be improved with more target-specific optimization following LSR.

The goal of this feature is for LSR to make the best choice of induction variables. Instruction selection may not completely take advantage of this feature yet; as a result, there could be cases of slight code size increase. Code size can be worse on x86 because it doesn't support postincrement addressing. In fact, when chains are formed, you may see redundant address-plus-stride additions in the addressing mode; GenerateIVChains tries to compensate for the common cases. On ARM, code size increase can be mitigated by using postincrement addressing, but downstream codegen currently misses some opportunities. llvm-svn: 147826
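As a rough illustration of a prime candidate (an invented example, not taken from the performance analysis), consider an unrolled reduction in which every load address is a fixed offset from the previous one:

```cpp
// With IV chains, LSR can keep one induction variable and materialize
// each address as the previous address plus a small constant, instead
// of giving every use its own base + i*stride expression (which tends
// to create spill code inside the loop).
float sum4(const float *a, int n) {
  float s = 0.0f;
  for (int i = 0; i < n; i += 4) {
    s += a[i];
    s += a[i + 1];
    s += a[i + 2];
    s += a[i + 3];
  }
  return s;
}
```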
Jakob Stoklund Olesen authored
On Thumb, the displacement computation hardware uses the address of the current instruction rounded down to a multiple of 4. Include this rounding in the UserOffset we compute for each instruction. When inline asm is present, the instruction alignment may not be known; constrain the maximum displacement instead in that case. This makes it possible for CreateNewWater() and OffsetIsInRange() to agree about the valid displacements; when they disagree, infinite looping happens. As always, test cases for this stuff are insane. <rdar://problem/10660175> llvm-svn: 147825
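A minimal sketch of the rounding rule, with an invented helper name (the exact bias bookkeeping in ARMConstantIslands differs):

```cpp
#include <cstdint>

// Thumb PC-relative addressing bases the displacement on the current
// instruction's address rounded down to a multiple of 4, plus the
// pipeline bias of 4. UserOffset must model the same rounding, or
// CreateNewWater() and OffsetIsInRange() can disagree and loop forever.
uint64_t thumbPCBase(uint64_t InstAddr) {
  return (InstAddr & ~UINT64_C(3)) + 4;
}
```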
Rafael Espindola authored
llvm-svn: 147820
- Jan 09, 2012
Jakob Stoklund Olesen authored
The pass is prone to looping, and it is better to crash than loop forever, even in a -Asserts build. <rdar://problem/10660175> llvm-svn: 147806
Devang Patel authored
llvm-svn: 147805
Andrew Trick authored
After collecting chains, check if any should be materialized. If so, hide the chained IV users from the LSR solver. LSR will only solve for the head of the chain; GenerateIVChains will then materialize the chained IV users by computing each IV relative to its previous value in the chain.

In theory, chained IV users could be exposed to LSR's solver, but that would be considerably more complicated to implement, and I'm not aware of a case where we need it. In practice it's more important to intelligently prune the search space of nontrivial loops before running the solver; otherwise the solver is often forced to prune the most optimal solutions. Hiding the chained users does this well, so LSR is more likely to find the best IV for the chain as a whole. llvm-svn: 147801
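In source-level terms (an invented example, not LLVM IR), materializing a chain amounts to rewriting the first loop into the second:

```cpp
// Before: both stores re-derive their address from the primary IV i,
// so LSR would have to solve for each use independently.
void before(float *p, int n) {
  for (int i = 0; i < n; ++i) {
    p[2 * i] = 0.0f;
    p[2 * i + 1] = 1.0f;
  }
}

// After: only the chain head q is an induction variable; the second
// store is computed relative to the previous value in the chain.
void after(float *p, int n) {
  float *q = p;
  for (int i = 0; i < n; ++i) {
    q[0] = 0.0f; // chain head
    q[1] = 1.0f; // head + one element
    q += 2;
  }
}
```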
Andrew Trick authored
This collects a set of IV uses within the loop whose values can be computed relative to each other in a sequence. Following checkins will make use of this information. llvm-svn: 147797
Devang Patel authored
AsmParser holds info specific to the target parser. AsmParserVariant holds info specific to the asm variants supported by the target. llvm-svn: 147787
Andrew Trick authored
llvm-svn: 147785
Devang Patel authored
Patch by Joe Groff! llvm-svn: 147781
Benjamin Kramer authored
llvm-svn: 147779
Benjamin Kramer authored
This subsumes several other transforms while enabling us to catch more cases. llvm-svn: 147777
Chandler Carruth authored
this subtraction will result in small negative numbers at worst, which become very large positive numbers on assignment and are thus caught by the <=4 check on the next line. The >0 check was clearly intended to catch these as negative numbers. Spotted by inspection, and impossible to trigger given the shift widths that can be used. llvm-svn: 147773
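The idiom being described, reduced to a standalone sketch (names and the exact bound are invented to mirror the commit message):

```cpp
#include <cassert>

// If Lo > Hi, the difference is a small negative number, but assigning
// it to an unsigned variable wraps it to a huge positive value, so the
// single "<= 4" bound rejects both too-wide and "negative" results; a
// separate "> 0" check adds nothing.
void checkWidth(unsigned Hi, unsigned Lo) {
  unsigned Width = Hi - Lo;
  assert(Width <= 4 && "width out of range");
}
```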
Craig Topper authored
Remove AVX hack in X86Subtarget. AVX/AVX2 are now treated as an SSE level. Predicate functions have been altered to maintain previous names and behavior. llvm-svn: 147770
Craig Topper authored
llvm-svn: 147769
Craig Topper authored
Reorder a bunch of patterns to put the AVX version first thus giving it priority over the SSE version. Another step towards trying to remove the AVX hack that disables SSE from X86Subtarget. llvm-svn: 147768
Craig Topper authored
Clean up patterns for MOVNT*. Not sure why there were floating point types on MOVNTPS and MOVNTDQ. And v4i64 was completely missing. llvm-svn: 147767
Craig Topper authored
Mark MOVNTI as being supported in SSE2 OR AVX mode. This instruction has no AVX equivalent so we should use the SSE version. llvm-svn: 147766
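For reference, the standard intrinsic that lowers to MOVNTI, shown as a hedged illustration (the commit itself only touches instruction predicates):

```cpp
#include <emmintrin.h>

// _mm_stream_si32 performs a non-temporal store from a general-purpose
// register via MOVNTI. There is no VEX-encoded form, so the SSE2
// encoding must remain selectable even when compiling for AVX.
void store_nt(int *p, int v) { _mm_stream_si32(p, v); }
```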
Craig Topper authored
Move SSE2 logical operations PAND/POR/PXOR/PANDN above SSE1 logical operations ANDPS/ORPS/XORPS/ANDNPS. This fixes a pattern ordering issue that meant that the SSE2 instructions could never be directly selected since the SSE1 patterns would always match first. This is largely moot with the ExeDepsFix pass, but I'm trying to audit for all such ordering issues. llvm-svn: 147765
Craig Topper authored
Change some places that were checking for AVX OR SSE1/2 to use hasXMM/hasXMMInt instead. Also fix one place that checked SSE3 but accidentally excluded AVX to use hasSSE3orAVX. This is a step towards removing the AVX hack from X86Subtarget.h. llvm-svn: 147764
Rafael Espindola authored
llvm-svn: 147763
Craig Topper authored
Don't disable MMX support when AVX is enabled. Fix predicates for MMX instructions that were added along with SSE instructions to check for AVX in addition to SSE level. llvm-svn: 147762
Craig Topper authored
llvm-svn: 147758
- Jan 08, 2012
Benjamin Kramer authored
We still save an instruction when just the "and" part is replaced. Also change the code to match the comments more closely. llvm-svn: 147753