- Jan 10, 2012
-
Craig Topper authored
Instruction selection priority fixes to remove the XMM/XMMInt/orAVX predicates. Another commit will remove orAVX functions from X86SubTarget. llvm-svn: 147841
-
Evan Cheng authored
define physical registers. It's currently very restrictive, only catching cases where the CE is in an immediate (and only) predecessor. But it catches a surprisingly large number of cases. rdar://10660865 llvm-svn: 147827
-
Andrew Trick authored
These heuristics are sufficient for enabling IV chains by default. Performance analysis has been done for i386, x86_64, and thumbv7. The optimization is rarely important, but can significantly speed up certain cases by eliminating spill code within the loop. Unrolled loops are prime candidates for IV chains. In many cases, the final code could still be improved with more target specific optimization following LSR. The goal of this feature is for LSR to make the best choice of induction variables. Instruction selection may not completely take advantage of this feature yet. As a result, there could be cases of slight code size increase. Code size can be worse on x86 because it doesn't support postincrement addressing. In fact, when chains are formed, you may see redundant address plus stride addition in the addressing mode. GenerateIVChains tries to compensate for the common cases. On ARM, code size increase can be mitigated by using postincrement addressing, but downstream codegen currently misses some opportunities. llvm-svn: 147826
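A rough C++ illustration (not taken from the commit; names invented) of the kind of loop where IV chains matter: in an unrolled body, each access can be formed as "previous address + stride" rather than recomputed from the canonical induction variable, which is where ARM postincrement addressing helps and where x86 may otherwise see an extra stride addition.

```cpp
// Hypothetical example: a manually unrolled loop whose addresses form a
// natural IV chain.  Each later access can be computed from the previous
// one instead of from the canonical induction variable i.
#include <cstddef>

void scale(float *a, std::size_t n, float k) {
  for (std::size_t i = 0; i + 3 < n; i += 4) {
    a[i] *= k;       // chain head: a + i
    a[i + 1] *= k;   // previous address + 1
    a[i + 2] *= k;   // previous address + 1
    a[i + 3] *= k;   // previous address + 1
  }
}
```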
-
Jakob Stoklund Olesen authored
On Thumb, the displacement computation hardware uses the address of the current instruction rounded down to a multiple of 4. Include this rounding in the UserOffset we compute for each instruction. When inline asm is present, the instruction alignment may not be known. Constrain the maximum displacement instead in that case. This makes it possible for CreateNewWater() and OffsetIsInRange() to agree about the valid displacements. When they disagree, infinite looping happens. As always, test cases for this stuff are insane. <rdar://problem/10660175> llvm-svn: 147825
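A minimal sketch, with an invented function name rather than the actual ARM constant island pass code, of the rule being described: the offset used for displacement checks rounds the instruction address down to a multiple of 4 on Thumb, and falls back to a tighter displacement limit when inline asm makes the alignment unknown.

```cpp
#include <cstdint>

// Offset the displacement hardware would actually use for a Thumb
// instruction at InstrOffset: the address rounded down to a multiple of 4.
uint64_t getThumbUserOffset(uint64_t InstrOffset, bool AlignmentKnown) {
  if (AlignmentKnown)
    return InstrOffset & ~uint64_t(3);
  // With inline asm present the alignment is unknown; the caller would
  // instead subtract slack from the maximum allowed displacement.
  return InstrOffset;
}
```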
-
Rafael Espindola authored
llvm-svn: 147820
-
- Jan 09, 2012
-
Jakob Stoklund Olesen authored
The pass is prone to looping, and it is better to crash than loop forever, even in a -Asserts build. <rdar://problem/10660175> llvm-svn: 147806
-
Devang Patel authored
llvm-svn: 147805
-
Devang Patel authored
llvm-svn: 147802
-
Andrew Trick authored
After collecting chains, check if any should be materialized. If so, hide the chained IV users from the LSR solver. LSR will only solve for the head of the chain. GenerateIVChains will then materialize the chained IV users by computing the IV relative to its previous value in the chain. In theory, chained IV users could be exposed to LSR's solver. This would be considerably more complicated to implement, and I'm not aware of a case where we need it. In practice it's more important to intelligently prune the search space of nontrivial loops before running the solver; otherwise the solver is often forced to prune the best solutions. Hiding the chained users does this well, so that LSR is more likely to find the best IV for the chain as a whole. llvm-svn: 147801
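A hypothetical source-level before/after (not from the patch) of what materializing a chain means: LSR solves only for the chain head, and the remaining users are emitted as increments of the previous chain value instead of as independent expressions of the canonical IV.

```cpp
#include <cstdint>

// Before: every access is an independent function of the canonical IV i.
int64_t sumIndexed(const int32_t *a, int64_t n) {
  int64_t s = 0;
  for (int64_t i = 0; i + 2 < n; i += 3)
    s += a[i] + a[i + 1] + a[i + 2];
  return s;
}

// After (conceptually): only the chain head p is solved for; the chained
// users are materialized relative to the previous value in the chain.
int64_t sumChained(const int32_t *a, int64_t n) {
  int64_t s = 0;
  for (const int32_t *p = a, *e = a + (n - n % 3); p < e; p += 3) {
    const int32_t *q = p + 1; // previous chain value + 1
    const int32_t *r = q + 1; // previous chain value + 1
    s += *p + *q + *r;
  }
  return s;
}
```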
-
Andrew Trick authored
This collects a set of IV uses within the loop whose values can be computed relative to each other in a sequence. Following checkins will make use of this information. llvm-svn: 147797
-
Devang Patel authored
AsmParser holds info specific to the target parser. AsmParserVariant holds info specific to the asm variants supported by the target. llvm-svn: 147787
-
Andrew Trick authored
llvm-svn: 147785
-
Devang Patel authored
Patch by Joe Groff! llvm-svn: 147781
-
Benjamin Kramer authored
llvm-svn: 147779
-
Benjamin Kramer authored
This subsumes several other transforms while enabling us to catch more cases. llvm-svn: 147777
-
Chandler Carruth authored
this subtraction will result in small negative numbers at worst, which become very large positive numbers on assignment and are thus caught by the <=4 check on the next line. The >0 check was clearly intended to catch these as negative numbers. Spotted by inspection, and impossible to trigger given the shift widths that can be used. llvm-svn: 147773
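A contrived C++ sketch (not the actual DAG-combine code) of the observation: when the subtraction goes "negative" in an unsigned type, it wraps to a huge value that already fails the <=4 range check, so a separate >0 check adds nothing.

```cpp
#include <cstdint>
#include <cstdio>

bool widthDeltaOK(uint32_t FromBits, uint32_t ToBits) {
  uint32_t Delta = FromBits - ToBits; // wraps to a huge value if ToBits > FromBits
  return Delta <= 4;                  // the wrapped "negative" case fails here too
}

int main() {
  std::printf("%d\n", widthDeltaOK(16, 14)); // 1: small positive delta
  std::printf("%d\n", widthDeltaOK(32, 8));  // 0: delta too large
  std::printf("%d\n", widthDeltaOK(8, 16));  // 0: "negative" delta wraps and is rejected
}
```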
-
Chandler Carruth authored
llvm-svn: 147772
-
Craig Topper authored
Remove AVX hack in X86Subtarget. AVX/AVX2 are now treated as an SSE level. Predicate functions have been altered to maintain previous names and behavior. llvm-svn: 147770
-
Craig Topper authored
llvm-svn: 147769
-
Craig Topper authored
Reorder a bunch of patterns to put the AVX version first thus giving it priority over the SSE version. Another step towards trying to remove the AVX hack that disables SSE from X86Subtarget. llvm-svn: 147768
-
Craig Topper authored
Clean up patterns for MOVNT*. Not sure why there were floating point types on MOVNTPS and MOVNTDQ. And v4i64 was completely missing. llvm-svn: 147767
-
Craig Topper authored
Mark MOVNTI as being supported in SSE2 OR AVX mode. This instruction has no AVX equivalent so we should use the SSE version. llvm-svn: 147766
-
Craig Topper authored
Move SSE2 logical operations PAND/POR/PXOR/PANDN above SSE1 logical operations ANDPS/ORPS/XORPS/ANDNPS. This fixes a pattern ordering issue that meant that the SSE2 instructions could never be directly selected since the SSE1 patterns would always match first. This is largely moot with the ExeDepsFix pass, but I'm trying to audit for all such ordering issues. llvm-svn: 147765
-
Craig Topper authored
Change some places that were checking for AVX OR SSE1/2 to use hasXMM/hasXMMInt instead. Also fix one place that checked for SSE3 but accidentally excluded AVX so that it uses hasSSE3orAVX. This is a step towards removing the AVX hack from X86Subtarget.h llvm-svn: 147764
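A rough sketch of the predicate style the commit refers to; the function names come from the commit message, but the struct and the predicate bodies here are assumptions about what "the corresponding SSE level or AVX" means.

```cpp
// Illustrative only: centralizing the "SSE level or AVX" checks in
// predicates instead of repeating them at every call site.
struct SubtargetSketch {
  bool HasSSE1 = false, HasSSE2 = false, HasSSE3 = false, HasAVX = false;
  bool hasXMM() const { return HasSSE1 || HasAVX; }        // XMM FP ops
  bool hasXMMInt() const { return HasSSE2 || HasAVX; }     // XMM integer ops
  bool hasSSE3orAVX() const { return HasSSE3 || HasAVX; }
};
```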
-
Rafael Espindola authored
llvm-svn: 147763
-
Craig Topper authored
Don't disable MMX support when AVX is enabled. Fix predicates for MMX instructions that were added along with SSE instructions to check for AVX in addition to SSE level. llvm-svn: 147762
-
Craig Topper authored
llvm-svn: 147758
-
- Jan 08, 2012
-
Benjamin Kramer authored
We still save an instruction when just the "and" part is replaced. Also change the code to match comments more closely. llvm-svn: 147753
-
Evan Cheng authored
llvm-svn: 147752
-
Evan Cheng authored
safely proven not to have been clobbered. No small test case possible. llvm-svn: 147751
-
Benjamin Kramer authored
InstCombine: If we have a bit test and a sign test anded/ored together, merge the sign bit into the bit test. This is common in bit field code, e.g. checking if the first or the last bit of a bit field is set. llvm-svn: 147749
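A hand-written illustration (not the InstCombine code itself) of the rewrite: a bit test OR'd with a sign test can become a single bit test whose mask also contains the sign bit, e.g. when checking the first or last bit of a 32-bit bit field.

```cpp
#include <cassert>
#include <cstdint>

bool before(int32_t X) {
  return (X & 1) != 0 || X < 0;              // bit test OR sign test
}

bool after(int32_t X) {
  return (uint32_t(X) & 0x80000001u) != 0;   // one test; mask includes the sign bit
}

int main() {
  // Spot-check that the two forms agree across the 32-bit range.
  for (int64_t v = INT32_MIN; v <= INT32_MAX; v += 65537)
    assert(before(int32_t(v)) == after(int32_t(v)));
}
```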
-
Victor Umansky authored
llvm-svn: 147748
-
Rafael Espindola authored
llvm-svn: 147745
-
- Jan 07, 2012
-
Rafael Espindola authored
the produced assembly when using CFI just a bit more readable. llvm-svn: 147743
-
Benjamin Kramer authored
-8 bytes on x86_64, no change on x86. llvm-svn: 147742
-
Jakob Stoklund Olesen authored
Darwin doesn't do static, and ELF targets only support static. llvm-svn: 147740
-
Craig Topper authored
llvm-svn: 147739
-
Benjamin Kramer authored
llvm-svn: 147738
-
Craig Topper authored
llvm-svn: 147734
-
Craig Topper authored
llvm-svn: 147733
-