Commits · b878caa5e2ac6b96a79e011f77f25fdbb6e61ccb · Roger Ferrer / llvm-epi-0.8

Jul 21, 2011

Add support for 256-bit versions of VPERMIL instruction. This is a new · b878caa5

Bruno Cardoso Lopes authored Jul 21, 2011

instruction introduced in AVX, which can operate on 128-bit and 256-bit vectors.
It considers a 256-bit vector as two independent 128-bit lanes. It can permute
any 32-bit or 64-bit elements inside a lane, and restricts the second lane to
have the same permutation as the first one. With the improved splat support
introduced earlier today, adding codegen for this instruction enables more
efficient 256-bit code:

Instead of:
  vextractf128  $0, %ymm0, %xmm0
  punpcklbw %xmm0, %xmm0
  punpckhbw %xmm0, %xmm0
  vinsertf128 $0, %xmm0, %ymm0, %ymm1
  vinsertf128 $1, %xmm0, %ymm1, %ymm0
  vextractf128  $1, %ymm0, %xmm1
  shufps  $1, %xmm1, %xmm1
  movss %xmm1, 28(%rsp)
  movss %xmm1, 24(%rsp)
  movss %xmm1, 20(%rsp)
  movss %xmm1, 16(%rsp)
  vextractf128  $0, %ymm0, %xmm0
  shufps  $1, %xmm0, %xmm0
  movss %xmm0, 12(%rsp)
  movss %xmm0, 8(%rsp)
  movss %xmm0, 4(%rsp)
  movss %xmm0, (%rsp)
  vmovaps (%rsp), %ymm0
We get:
  vextractf128  $0, %ymm0, %xmm0
  punpcklbw %xmm0, %xmm0
  punpckhbw %xmm0, %xmm0
  vinsertf128 $0, %xmm0, %ymm0, %ymm1
  vinsertf128 $1, %xmm0, %ymm1, %ymm0
  vpermilps $85, %ymm0, %ymm0

llvm-svn: 135662

b878caa5
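
To make the lane behaviour described in this commit concrete, the following is a minimal C++ sketch that emulates the immediate form of `vpermilps` on eight floats. The function name `emulateVPermilPS256` is made up for illustration and is not part of LLVM. With the immediate 85 (binary 01010101) used in the assembly above, every element of a lane is replaced by element 1 of that same lane.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Emulates the immediate form of VPERMILPS on a 256-bit vector of 8 floats.
// The vector is treated as two independent 128-bit lanes of four elements.
// Each 2-bit field of imm8 selects a source element inside the same lane,
// and both lanes use the same selection.
// NOTE: illustrative helper only; the name is hypothetical, not an LLVM API.
std::array<float, 8> emulateVPermilPS256(const std::array<float, 8> &src,
                                         std::uint8_t imm8) {
  std::array<float, 8> dst{};
  for (int lane = 0; lane < 2; ++lane) {
    for (int i = 0; i < 4; ++i) {
      const int sel = (imm8 >> (2 * i)) & 0x3;  // index within the lane
      dst[lane * 4 + i] = src[lane * 4 + sel];
    }
  }
  return dst;
}
```

Calling it on the elements 0 through 7 with immediate 85 splats element 1 across the low lane and element 5 across the high lane.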

Improve splat promotion to handle AVX types: v32i8 and v16i16. Also · fb4920eb

Bruno Cardoso Lopes authored Jul 21, 2011

refactor the code and add a bunch of comments. The final shuffle
emitted by handling 256-bit types is suitable for the VPERM shuffle
instruction, which is going to be introduced in the next commit (with
a testcase which covers this commit).

llvm-svn: 135661

fb4920eb

Add additional patterns for vextractf128 instruction · 18a8d25b
Bruno Cardoso Lopes authored Jul 21, 2011
```
llvm-svn: 135660
```
18a8d25b
Add additional patterns for vinsertf128 instruction · 2389881b
Bruno Cardoso Lopes authored Jul 21, 2011
```
llvm-svn: 135659
```
2389881b
Add v16i16 type to VR256 class · 0a57b225
Bruno Cardoso Lopes authored Jul 21, 2011
```
llvm-svn: 135658
```
0a57b225
Move code around. No functionality changes · e6f88326
Bruno Cardoso Lopes authored Jul 21, 2011
```
llvm-svn: 135657
```
e6f88326
Tidy up code · 0bdeacf0
Bruno Cardoso Lopes authored Jul 21, 2011
```
llvm-svn: 135656
```
0bdeacf0
Mark instructions which are part of the frame setup with the MachineInstr::FrameSetup flag. · 28b6e12d
Bill Wendling authored Jul 21, 2011
```
llvm-svn: 135645
```
28b6e12d
Remove unused function. · ed93564c
Bill Wendling authored Jul 20, 2011
```
llvm-svn: 135635
```
ed93564c
Remove the now defunct getCompactUnwindEncoding method from the frame lowering code. · 01bd7d9d
Bill Wendling authored Jul 20, 2011
```
llvm-svn: 135634
```
01bd7d9d

Jul 20, 2011

Goodbye TargetAsmInfo. This eliminates the last bit of CodeGen and Target in llvm-mc. · bbf3b0de

Evan Cheng authored Jul 20, 2011

There is still a bit more refactoring left to do in Targets. But we are now very
close to fixing all the layering issues in MC.

llvm-svn: 135611

bbf3b0de

Extend the hack for _GLOBAL_OFFSET_TABLE_ slightly; PR10389. · ae60b6b0
Eli Friedman authored Jul 20, 2011
```
llvm-svn: 135607
```
ae60b6b0

- Move CodeModel from a TargetMachine global option to MCCodeGenInfo. · efd9b424

Evan Cheng authored Jul 20, 2011

- Introduce JITDefault code model. This tells targets to set a different default
  code model for JIT. This eliminates the ugly hack in TargetMachine where
  the code model is changed after construction.

llvm-svn: 135580

efd9b424

X86Subtarget.h: Assume "x86_64-cygwin", though it has not been released yet,... · b66d2555

NAKAMURA Takumi authored Jul 20, 2011

X86Subtarget.h: Assume "x86_64-cygwin", though it has not been released yet, to appease test/CodeGen/X86 on cygwin.

llvm-svn: 135564

b66d2555

Jul 19, 2011
- Introduce MCCodeGenInfo, which keeps information that can affect codegen · 2129f596
  Evan Cheng authored Jul 19, 2011
```
(including compilation, assembly). Move relocation model Reloc::Model from
TargetMachine to MCCodeGenInfo so it's accessible even without TargetMachine.

llvm-svn: 135468
```
  2129f596
- Move getInitialFrameState from TargetFrameInfo to MCAsmInfo (suggestions for · 67c033e6
  Evan Cheng authored Jul 18, 2011
```
better location welcome).

llvm-svn: 135438
```
  67c033e6
Jul 18, 2011
- Sink getDwarfRegNum, getLLVMRegNum, getSEHRegNum from TargetRegisterInfo down · d60fa58b
  Evan Cheng authored Jul 18, 2011
```
to MCRegisterInfo. Also initialize the mapping at construction time.

This patch eliminates TargetRegisterInfo from TargetAsmInfo. It's another step
towards fixing the layering violation.

llvm-svn: 135424
```
  d60fa58b
- Be smarter with VCVTSS2SD. Also place the patterns close to the · 50c1d981
  Bruno Cardoso Lopes authored Jul 18, 2011
```
definitions.

llvm-svn: 135407
```
  50c1d981
- Add AVX 128-bit sqrt versions · 4208cace
  Bruno Cardoso Lopes authored Jul 18, 2011
```
llvm-svn: 135404
```
  4208cace
- Land David Blaikie's patch to de-constify Type, with a few tweaks. · 229907cd
  Chris Lattner authored Jul 18, 2011
```
llvm-svn: 135375
```
  229907cd
Jul 16, 2011

Add AVX 128-bit patterns for sint_to_fp · 44800401
Bruno Cardoso Lopes authored Jul 16, 2011
```
llvm-svn: 135332
```
44800401

Fix a couple of things: · 8df9cfc2

Bruno Cardoso Lopes authored Jul 15, 2011

1) Promote non-legal 256-bit loads to v4i64. This lets us
canonicalize the loads and handle things the same way we handle
them for 128-bit registers. Despite what one of the removed comments
explained, the load promotion would not mess with VPERM; it's only a
matter of doing the appropriate bitcasts when this instruction is
introduced. Also make LOAD v8i32 legal.

2) Doing 1) exposed two bugs:
- v4i64 was being promoted to itself for several opcodes (introduced
in r124447 by David Greene) causing endless recursion and the stack to
explode.
- there was no support for allOnes BUILD_VECTORs and ANDNP would fail to
match because it was generating early target constant pools during
lowering.

3) The testcases are already checked in; doing 1) exposed the
bugs in the current testcases.

4) Tidy up code to be more clear and explicit about AVX.

llvm-svn: 135313

8df9cfc2

Add a few patterns for 256-bit bitcasts. No testcases now, they are · 1fe1377e
Bruno Cardoso Lopes authored Jul 15, 2011
```
coming together with other tests.

llvm-svn: 135312
```
1fe1377e

Jul 15, 2011

PR10370: Make sure we know how to relax push correctly on x86-64. · 3846acc9
Eli Friedman authored Jul 15, 2011
```
llvm-svn: 135303
```
3846acc9

Remove an unnecessary header from this file. I don't think this header · 65667dbf

Chandler Carruth authored Jul 15, 2011

was really intended, and it may have been required prior to some of the
recent refactors. Including it however causes LLVMX86Desc to need
symbols from LLVMX86CodeGen, forming a dependency cycle. This was masked
in almost all builds: Clang, and GCC w/ optimizations didn't actually
emit the symbols!

llvm-svn: 135242

65667dbf

Move some parts of TargetAsmInfo down to MCAsmInfo. This is not the greatest · a83b37a9
Evan Cheng authored Jul 15, 2011
```
solution but it is a small step towards removing the horror that is
TargetAsmInfo.

llvm-svn: 135237
```
a83b37a9

Major update to CMake build to reflect changes in r135219 in the · 9a0001ae

Chandler Carruth authored Jul 15, 2011

backend. Moved some MCAsmInfo files down into the MCTargetDesc
sublibraries, removed some (I suspect long) dead files from other parts
of the CMake build, etc. Also copied the include directory hack from the
Makefile.

Finally, updated the lib deps. I spot checked this, and think it's
correct, but review appreciated there.

llvm-svn: 135234

9a0001ae

Rename createAsmInfo to createMCAsmInfo and move registration code to... · 1705ab00

Evan Cheng authored Jul 14, 2011

Rename createAsmInfo to createMCAsmInfo and move registration code to MCTargetDesc to prepare for next round of changes.

llvm-svn: 135219

1705ab00

* Redo the permutation encoding for frameless stacks to be more like what the · 2d825b5e
Bill Wendling authored Jul 14, 2011
```
  unwind library expects.
* Comment the permutation encoding for frameless stacks.

llvm-svn: 135202
```
2d825b5e

Jul 14, 2011

Port operand types for ARM and X86 over from EDIS to the .td files. · 9654eef4
Benjamin Kramer authored Jul 14, 2011
```
llvm-svn: 135198
```
9654eef4
Next round of MC refactoring. This patch factors MC table instantiations, MC · bc153d49
Evan Cheng authored Jul 14, 2011
```
registration and creation code into XXXMCDesc libraries.

llvm-svn: 135184
```
bc153d49

Check register class matching instead of width of type matching · 92464be2

Eric Christopher authored Jul 14, 2011

when determining validity of matching constraint. Allow i1
types access to the GR8 reg class for x86.

Fixes PR10352 and rdar://9777108

llvm-svn: 135180

92464be2

Add 256-bit load/store recognition and matching in several places. · 6778597d
Bruno Cardoso Lopes authored Jul 14, 2011
```
llvm-svn: 135171
```
6778597d

· 771f2967

Nadav Rotem authored Jul 14, 2011

[VECTOR-SELECT]
During type legalization we often use the SIGN_EXTEND_INREG SDNode.
When this SDNode is legalized during the LegalizeVector phase, it is
scalarized because non-simple types are automatically marked to be expanded.
In this patch we add support for lowering SIGN_EXTEND_INREG manually.
This fixes CodeGen/X86/vec_sext.ll when running with the '-promote-elements'
flag.

llvm-svn: 135144

771f2967
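
As a reminder of what a sign-extend-in-register operation computes on a single scalar, here is a small C++ sketch. It is not the lowering added by this commit; the helper name `signExtendInReg` and the choice of 32-bit containers are assumptions made for illustration. The low `bits` bits of the value are kept, and the top bit of that field is replicated into the remaining upper bits.

```cpp
#include <cassert>
#include <cstdint>

// Sign-extends the low `bits` bits of `value` to a full 32-bit integer.
// Precondition (assumed by this sketch): 1 <= bits <= 32.
// NOTE: illustrative helper only; not an LLVM function.
std::int32_t signExtendInReg(std::uint32_t value, unsigned bits) {
  const std::uint64_t mask = (std::uint64_t{1} << bits) - 1;  // low-bit mask
  const std::uint64_t sign = std::uint64_t{1} << (bits - 1);  // sign bit of the field
  const std::uint64_t low = value & mask;
  // XOR-then-subtract propagates the field's sign bit into the upper bits.
  return static_cast<std::int32_t>(static_cast<std::uint32_t>((low ^ sign) - sign));
}
```

A vector version applies the same computation to every element, which is what avoiding scalarization during vector legalization amounts to in effect.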

Fix up assertion in r135018 so it doesn't trigger on 32-bit; when we're in... · bc2ae1c8

Eli Friedman authored Jul 14, 2011

Fix up assertion in r135018 so it doesn't trigger on 32-bit; when we're in 32-bit, it doesn't matter whether the operation overflows because the computed address is not wider than the immediate.

llvm-svn: 135120

bc2ae1c8

Add code to handle a "frameless" unwind stack. · d11ea81d

Bill Wendling authored Jul 13, 2011

The frameless unwind stack has a special encoding, the algorithm for which is in
"permuteEncode".

llvm-svn: 135103

d11ea81d
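
The commit message does not spell out the algorithm in "permuteEncode". As general background only, one common way to pack a permutation into a single small integer is to compute its rank in the factorial number system (a Lehmer code). The C++ sketch below shows that generic technique; the function name `encodePermutation` is hypothetical, and this is not claimed to match the LLVM routine.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Encodes a permutation of 0..n-1 as its rank in the factorial number system.
// For each position, count how many later elements are smaller, then combine
// those counts as mixed-radix digits.
// NOTE: generic illustration; not the algorithm used by the commit.
std::uint32_t encodePermutation(const std::vector<unsigned> &perm) {
  std::uint32_t code = 0;
  const std::size_t n = perm.size();
  for (std::size_t i = 0; i < n; ++i) {
    unsigned smaller = 0;
    for (std::size_t j = i + 1; j < n; ++j)
      if (perm[j] < perm[i])
        ++smaller;
    // Horner-style accumulation: the digit at position i has radix (n - i).
    code = code * static_cast<std::uint32_t>(n - i) + smaller;
  }
  return code;
}
```

With this scheme a permutation of six items maps to one of 720 ranks, so it fits in 10 bits.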

Jul 13, 2011

Make X86ISD::ANDNP more general and Codegen 256-bit VANDNP. A more · 9613b649
Bruno Cardoso Lopes authored Jul 13, 2011
```
general version of X86ISD::ANDNP also opened the room for a little bit
of refactoring.

llvm-svn: 135088
```
9613b649
The target specific node PANDN name is misleading. That happens because · 7ba479d2
Bruno Cardoso Lopes authored Jul 13, 2011
```
it's later selected to an ANDNPD/ANDNPS instruction instead of the PANDN
instruction. Rename it.

llvm-svn: 135087
```
7ba479d2
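
For readers unfamiliar with the operation behind the renamed node: ANDNPS, ANDNPD and PANDN all compute a bitwise AND of the second operand with the complement of the first. A one-line C++ sketch on a single 32-bit element follows; the helper name `andNot` is made up for illustration.

```cpp
#include <cassert>
#include <cstdint>

// Bitwise "and-not" as performed per element by ANDNPS/ANDNPD and PANDN:
// the first operand is inverted, then ANDed with the second.
// NOTE: illustrative helper only; not an LLVM function.
std::uint32_t andNot(std::uint32_t a, std::uint32_t b) {
  return ~a & b;
}
```

Note the operand order: it is the first operand that gets inverted, which is easy to get backwards when writing patterns.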

Make sure we don't combine a large displacement and a frame index in the same... · 344ec797

Eli Friedman authored Jul 13, 2011

Make sure we don't combine a large displacement and a frame index in the same addressing mode on x86-64.  It can overflow, leading to a crash/miscompile.

<rdar://problem/9763308>

llvm-svn: 135084

344ec797
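
The overflow concern in this commit can be illustrated with a small C++ sketch. An x86-64 addressing mode carries a signed 32-bit displacement, and a frame index is later replaced by a concrete stack offset that is added to that displacement. The function names and the 31-bit headroom below are assumptions chosen for illustration; they are not taken from the commit itself.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// True if `value` fits in the signed 32-bit displacement field.
bool fitsInInt32(std::int64_t value) {
  return value >= std::numeric_limits<std::int32_t>::min() &&
         value <= std::numeric_limits<std::int32_t>::max();
}

// Hypothetical conservative check. When the base is a frame index, the real
// frame offset is added later, so keep the displacement inside a narrower
// range (assumed here: signed 31 bits) to leave room for that addition.
bool canFoldDisplacement(std::int64_t disp, bool baseIsFrameIndex) {
  if (!fitsInInt32(disp))
    return false;
  if (baseIsFrameIndex) {
    const std::int64_t limit = std::int64_t{1} << 30;  // assumed headroom
    return disp >= -limit && disp < limit;
  }
  return true;
}
```

Under these assumptions a displacement of 0x7fffffff is accepted with a register base but rejected when the base is a frame index.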

Refactor out checking for displacements on x86-64 addressing modes. No... · ef67e7d6

Eli Friedman authored Jul 13, 2011

Refactor out checking for displacements on x86-64 addressing modes. No functionality change. Refactoring in preparation for an additional safety check in FoldOffsetIntoAddress.

Part of <rdar://problem/9763308>.

llvm-svn: 135079

ef67e7d6