Commits · c07fe69850d6b98f6dd468689c2e9e03138e73eb · Roger Ferrer / llvm-epi-0.8

Jan 12, 2012

Add intel style operand parser skeleton. · 46831de2
Devang Patel authored Jan 12, 2012
```
This is a work in progress.

llvm-svn: 148002
```
46831de2

Switch all of the uses of my InsertDAGNode helper to follow the exact · eb21da06

Chandler Carruth authored Jan 12, 2012

same pattern. We already had this pattern is a few places, but others
tried to make a rough approximation of an actual DAG structure. As not
everywhere went to this trouble, nothing could rely on this being done.
In fact, I've checked all references to these node Ids, and the ones
that are using the topo-sort properties are actually satisfied with
a strict-weak-ordering. The requirement appears to be that Use >= Def.

I've added a big blurb of comments to this bit of the transform to
clarify why the order is so important for the next reader of the code.

I'm starting with this change as it is very small, and trivially
reverted if something breaks or the >= above really does need to be >.
If that proves the case, we can hide the problem by reverting this
patch, but the problem exists elsewhere as well, and so a more
comprehensive solution will be needed.

llvm-svn: 148001

eb21da06

Jan 11, 2012

Fix assert. · d284c1d8
Eric Christopher authored Jan 11, 2012
```
llvm-svn: 147966
```
d284c1d8

Support segmented stacks on mac. · d90466bc

Rafael Espindola authored Jan 11, 2012

This uses TLS slot 90, which actually belongs to JavaScriptCore. We only support
frames with static size
Patch by Brian Anderson.

llvm-svn: 147960

d90466bc

Generate the segmented stack prologue for fastcc too. · 4eecacb9
Rafael Espindola authored Jan 11, 2012
```
Patch by Brian Anderson.

llvm-svn: 147958
```
4eecacb9

Revert r147945 which disabled an addressing mode transformation. I had · 3212a342

Chandler Carruth authored Jan 11, 2012

hoped this would revive one of the llvm-gcc selfhost build bots, but it
didn't so it doesn't appear that my transform is the culprit.

If anyone else is seeing failures, please let me know!

llvm-svn: 147957

3212a342

Use unsigned comparison in segmented stack prologue. · 2b89448d

Rafael Espindola authored Jan 11, 2012

This is a comparison of two addresses, and GCC does the comparison unsigned.

Patch by Brian Anderson.

llvm-svn: 147954

2b89448d

Explicitly set the scale to 1 on some segstack prologue instrs. · 6635ae1c
Rafael Espindola authored Jan 11, 2012
```
Patch by Brian Anderson.

llvm-svn: 147952
```
6635ae1c
Add XOP Intrinsics and tests · 21f83d9f
Jan Sjödin authored Jan 11, 2012
```
llvm-svn: 147949
```
21f83d9f

Fix a bug in the lowering of BUILD_VECTOR for AVX. SCALAR_TO_VECTOR does not... · baae7e45

Nadav Rotem authored Jan 11, 2012

Fix a bug in the lowering of BUILD_VECTOR for AVX. SCALAR_TO_VECTOR does not zero untouched elements. Use INSERT_VECTOR_ELT instead.

llvm-svn: 147948

baae7e45

Disable the transformation I added in r147936 to see if it fixes some · 9bc48e52

Chandler Carruth authored Jan 11, 2012

strange build bot failures that look like a miscompile into an infloop.
I'll investigate this tomorrow, but I'd both like to know whether my
patch is the culprit, and get the bots back to green.

llvm-svn: 147945

9bc48e52

Hoist a really redundant code pattern into a helper function, and delete · 3eacfb83
Chandler Carruth authored Jan 11, 2012
```
lots of lines of code. No functionality changed.

llvm-svn: 147942
```
3eacfb83
Simplify the AND-rooted mask+shift checking code to match that of the · b0049f4a
Chandler Carruth authored Jan 11, 2012
```
SRL-rooted code.

llvm-svn: 147941
```
b0049f4a

Unify the interface of the three mask+shift transform helpers, and · 3dbcda84

Chandler Carruth authored Jan 11, 2012

factor the differences that were hiding in one of them into its other
caller, the SRL handling code. No change in behavior.

llvm-svn: 147940

3dbcda84

Clarify and make explicit some of the requirements for transforming · aa01e666

Chandler Carruth authored Jan 11, 2012

mask+shift pairs at the beginning of the ISD::AND case block, and then
hoist the final pattern into a helper function, simplifying and
reflowing it appropriately. This should have no observable behavior
change, but several simplifications fell out of this such as directly
computing the new mask constant, etc.

llvm-svn: 147939

aa01e666

Fix undefined code and reenable test case. · 60399837

Jakob Stoklund Olesen authored Jan 11, 2012

I don't think the compact encoding code is right, but at least is has
defined behavior now.

llvm-svn: 147938

60399837

Hoist the logic to transform shift+mask combinations into sub-register · 51d3076b

Chandler Carruth authored Jan 11, 2012

extracts and scaled addressing modes into its own helper function. No
functionality changed here, just hoisting and layout fixes falling out
of that hoisting.

llvm-svn: 147937

51d3076b

Teach the X86 instruction selection to do some heroic transforms to · 55b2cdee

Chandler Carruth authored Jan 11, 2012

detect a pattern which can be implemented with a small 'shl' embedded in
the addressing mode scale. This happens in real code as follows:

  unsigned x = my_accelerator_table[input >> 11];

Here we have some lookup table that we look into using the high bits of
'input'. Each entity in the table is 4-bytes, which means this
implicitly gets turned into (once lowered out of a GEP):

  *(unsigned*)((char*)my_accelerator_table + ((input >> 11) << 2));

The shift right followed by a shift left is canonicalized to a smaller
shift right and masking off the low bits. That hides the shift right
which x86 has an addressing mode designed to support. We now detect
masks of this form, and produce the longer shift right followed by the
proper addressing mode. In addition to saving a (rather large)
instruction, this also reduces stalls in Intel chips on benchmarks I've
measured.

In order for all of this to work, one part of the DAG needs to be
canonicalized *still further* than it currently is. This involves
removing pointless 'trunc' nodes between a zextload and a zext. Without
that, we end up generating spurious masks and hiding the pattern.

llvm-svn: 147936

55b2cdee

Add big endian mips support. Based on a patch by Jack Carter. · 647841b1
Rafael Espindola authored Jan 11, 2012
```
llvm-svn: 147924
```
647841b1
Add the skeleton of an asm parser for mips. · 870c4e92
Rafael Espindola authored Jan 11, 2012
```
llvm-svn: 147923
```
870c4e92

ARM Ld/St Optimizer fix. · 642f0f6a

Andrew Trick authored Jan 11, 2012

Allow LDRD to be formed from pairs with different LDR encodings. This was the original intention of the pass. Somewhere along the way, the LDR opcodes were refined which broke the optimization. We really don't care what the original opcodes are as long as they both map to the same LDRD and the immediate still fits.

Fixes rdar://10435045 ARMLoadStoreOptimization cannot handle mixed LDRi8/LDRi12

llvm-svn: 147922

642f0f6a

Jan 10, 2012

Fixed order of operands in comment to match code. · 995c6332
Lang Hames authored Jan 10, 2012
```
llvm-svn: 147890
```
995c6332

Default stack alignment for 32bit x86 should be 4 Bytes, not 8 Bytes. · 96cd35cf

Joerg Sonnenberger authored Jan 10, 2012

Add a test that checks the stack alignment of a simple function for
Darwin, Linux and NetBSD for 32bit and 64bit mode.

llvm-svn: 147888

96cd35cf

Consider unknown alignment caused by OptimizeThumb2Instructions(). · 20f1dd5f

Jakob Stoklund Olesen authored Jan 10, 2012

This function runs after all constant islands have been placed, and may
shrink some instructions to their 2-byte forms.  This can actually cause
some constant pool entries to move out of range because of growing
alignment padding.

Treat instructions that may be shrunk the same as inline asm - they
erode the known alignment bits.

Also reinstate an old assertion in verify(). It is correct now that
basic block offsets include alignments.

Add a single large test case that will hopefully exercise many parts of
the constant island pass.

<rdar://problem/10670199>

llvm-svn: 147885

20f1dd5f

Add missing VEX predicates to VMOVSDto64rr/VMOVSDto64mr. This fixes a few · 1a8f0ccd
Chad Rosier authored Jan 10, 2012
```
failing test cases on our internal AVX nightly tester.
rdar://10663637

llvm-svn: 147881
```
1a8f0ccd
ARM updating VST2 pseudo-lowering fixed vs. register update. · 74ac7d50
Jim Grosbach authored Jan 10, 2012
```
rdar://10663487

llvm-svn: 147876
```
74ac7d50
Fix some leftover control reaches end of non-void function warnings. · 233149cf
Benjamin Kramer authored Jan 10, 2012
```
llvm-svn: 147874
```
233149cf
Move default case for covered enum outside of switch. · ad5b42c0
Richard Smith authored Jan 10, 2012
```
llvm-svn: 147870
```
ad5b42c0

For i386, don't use the generic code. · d5ab0260

Bill Wendling authored Jan 10, 2012

As the comment around 7746 says, it's better to use the x87 extended precision
here than SSE. And the generic code doesn't know how to do that. It also regains
the speed lost for the uint64_to_float.c testcase.
<rdar://problem/10669858>

llvm-svn: 147869

d5ab0260

Fix a -Wreturn-type warning in g++. · 3f103541
Richard Smith authored Jan 10, 2012
```
llvm-svn: 147867
```
3f103541

Add 'llvm_unreachable' to passify GCC's understanding of the constraints · f3e8502c

Chandler Carruth authored Jan 10, 2012

of several newly un-defaulted switches. This also helps optimizers
(including LLVM's) recognize that every case is covered, and we should
assume as much.

llvm-svn: 147861

f3e8502c

Add definition for intel asm variant. · 67bf992a

Devang Patel authored Jan 10, 2012

Right now, this just adds additional entries in match table. The parser does not use them yet.

llvm-svn: 147859

67bf992a

Remove unnecessary default cases in switches that cover all enum values. · edbb58c5
David Blaikie authored Jan 10, 2012
```
llvm-svn: 147855
```
edbb58c5
Add definitions for AMD's bobcat (aka btver1) · 077ae1d7
Benjamin Kramer authored Jan 10, 2012
```
llvm-svn: 147846
```
077ae1d7

Fix a crash in AVX2 when trying to broadcast a double into a 128-bit vector.... · 430f3f1b

Craig Topper authored Jan 10, 2012

Fix a crash in AVX2 when trying to broadcast a double into a 128-bit vector. There is no vbroadcastsd xmm, but we do need to support 64-bit integers broadcasted into xmm. Also factor the AVX check into the isVectorBroadcast function. This makes more sense since the AVX2 check was already inside.

llvm-svn: 147844

430f3f1b

Remove hasXMM/hasXMMInt functions. Move callers to hasSSE1/hasSSE2. This is... · b0c0f72a

Craig Topper authored Jan 10, 2012

Remove hasXMM/hasXMMInt functions. Move callers to hasSSE1/hasSSE2. This is the final piece to remove the AVX hack that disabled SSE.

llvm-svn: 147843

b0c0f72a

Remove hasSSE*orAVX functions and change all callers to use just hasSSE*. AVX... · d97bbd7b

Craig Topper authored Jan 10, 2012

Remove hasSSE*orAVX functions and change all callers to use just hasSSE*. AVX is now an SSE level and no longer disables SSE checks.

llvm-svn: 147842

d97bbd7b

Instruction selection priority fixes to remove the XMM/XMMInt/orAVX... · eb8f9e9e

Craig Topper authored Jan 10, 2012

Instruction selection priority fixes to remove the XMM/XMMInt/orAVX predicates. Another commit will remove orAVX functions from X86SubTarget.

llvm-svn: 147841

eb8f9e9e

Accurately model hardware alignment rounding. · f09a3165

Jakob Stoklund Olesen authored Jan 10, 2012

On Thumb, the displacement computation hardware uses the address of the
current instruction rouned down to a multiple of 4.  Include this
rounding in the UserOffset we compute for each instruction.

When inline asm is present, the instruction alignment may not be known.
Constrain the maximum displacement instead in that case.

This makes it possible for CreateNewWater() and OffsetIsInRange() to
agree about the valid displacements.  When they disagree, infinite
looping happens.

As always, test cases for this stuff are insane.

<rdar://problem/10660175>

llvm-svn: 147825

f09a3165

Remove the logging streamer. · 5cb98f10
Rafael Espindola authored Jan 10, 2012
```
llvm-svn: 147820
```
5cb98f10