Commits · 7adee1a01a39b891a69cf740aa87184f08effe5a · Roger Ferrer / llvm-epi-0.8

Nov 23, 2011

X86: Use btq for bit tests if the immediate can't be encoded in 32 bits. · ebcb4518

Benjamin Kramer authored Nov 23, 2011

Before:
	movabsq	$4294967296, %rax       ## encoding: [0x48,0xb8,0x00,0x00,0x00,0x00,0x01,0x00,0x00,0x00]
	testq	%rax, %rdi              ## encoding: [0x48,0x85,0xf8]
	jne	LBB0_2                  ## encoding: [0x75,A]

After:
	btq	$32, %rdi               ## encoding: [0x48,0x0f,0xba,0xe7,0x20]
	jb	LBB0_2                  ## encoding: [0x72,A]

btq is usually slower than testq because it doesn't fuse with the jump, but here we're better off
saving one register and a giant movabsq.

llvm-svn: 145103
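
As a source-level illustration (a sketch, not the commit's own test case), a single-bit test against `1 << 32` is the kind of code that produces the sequences above: the mask 4294967296 does not fit a sign-extended 32-bit immediate, so `testq` needs it in a register, while `btq` only needs the bit index. The names `fits_simm32` and `bit32_set` are hypothetical and exist only for this example.

```cpp
#include <cstdint>

// Hypothetical helper: true if `v` fits in a sign-extended 32-bit immediate,
// which is the widest immediate a 64-bit testq can encode.
constexpr bool fits_simm32(std::uint64_t v) {
    const auto s = static_cast<std::int64_t>(v);
    return s >= INT32_MIN && s <= INT32_MAX;
}

// Tests bit 32 of x. The mask is 4294967296, the same constant that the
// "Before" listing materializes with movabsq.
bool bit32_set(std::uint64_t x) {
    return (x & (std::uint64_t{1} << 32)) != 0;
}
```

Whether a given compiler version actually emits `btq` for `bit32_set` depends on the target and optimization settings.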

I added several lines in the X86 code generator that allow choosing · 779ba6d7

Elena Demikhovsky authored Nov 23, 2011

VSHUFPS/VSHUFPD instructions while lowering a VECTOR_SHUFFLE node. A commuted VSHUFP mask is checked as well.

The patch was reviewed by Bruno.

llvm-svn: 145099

Fix PR11422. · 02845410

Jakob Stoklund Olesen authored Nov 23, 2011

This was a bug in keeping track of the available domains when merging
domain values.

The wrong domain mask caused ExecutionDepsFix to try to move VANDPSYrr
to the integer domain which is only available in AVX2.

Also add an assertion to catch future attempts at emitting AVX2
instructions.

llvm-svn: 145096
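
A toy model of the idea, under the assumption that available domains are tracked as a bitmask and that merging two values keeps only the domains both can use. This is not the ExecutionDepsFix code; the constants and `merge_domains` are hypothetical names for illustration.

```cpp
#include <cstdint>

// Hypothetical domain bits, one per execution domain.
constexpr std::uint8_t kPackedSingle = 1u << 0;
constexpr std::uint8_t kPackedDouble = 1u << 1;
constexpr std::uint8_t kPackedInt    = 1u << 2;

// Assumed merge rule: the merged value may only move to a domain that is
// available to both inputs, so the masks are intersected.
constexpr std::uint8_t merge_domains(std::uint8_t a, std::uint8_t b) {
    return static_cast<std::uint8_t>(a & b);
}
```

In this model, merging a value limited to the two floating-point domains with one that also allows the integer domain must not leave the integer bit set; a mask that did would permit the kind of illegal move the commit describes.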

Nov 22, 2011

More fixes to the X86InstComments for shuffle instructions. In particular add... · 83c45926

Craig Topper authored Nov 22, 2011

More fixes to the X86InstComments for shuffle instructions. In particular add AVX flavors of many instructions and fix the destination operand for some of the existing AVX entries.

llvm-svn: 145063

Fix shuffle decoding logic to handle UNPCKLPS/UNPCKLPD on 256-bit vectors... · ccb70975

Craig Topper authored Nov 22, 2011

Fix shuffle decoding logic to handle UNPCKLPS/UNPCKLPD on 256-bit vectors correctly. Add support for decoding UNPCKHPS/UNPCKHPD for AVX 128-bit and 256-bit forms.

llvm-svn: 145055

Add methods for querying minimum SSE version along with AVX. Simplifies all... · f5639777

Craig Topper authored Nov 22, 2011

Add methods for querying minimum SSE version along with AVX. Simplifies all the places that had to check a version of SSE and AVX.

llvm-svn: 145053
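
A minimal sketch of what such a combined query could look like, assuming the SSE level and the AVX flag are tracked separately. `ToySubtarget` and its members are hypothetical stand-ins written for this example, not LLVM's subtarget class.

```cpp
// Hypothetical SSE levels, ordered so that a higher level implies the lower ones.
enum class SSELevel { None, SSE1, SSE2, SSE3, SSSE3, SSE41, SSE42 };

// Hypothetical stand-in for a subtarget description.
struct ToySubtarget {
    SSELevel sse = SSELevel::None;
    bool avx = false;

    bool hasSSE2() const { return sse >= SSELevel::SSE2; }
    bool hasAVX() const { return avx; }

    // Combined query: callers that only need "at least SSE2, in either
    // encoding" ask one question instead of repeating both checks.
    bool hasSSE2orAVX() const { return hasSSE2() || hasAVX(); }
};
```

A call site that previously tested two conditions then reduces to a single `hasSSE2orAVX()` call.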

Nov 21, 2011
- Lowering for v32i8 to VPUNPCKLBW/VPUNPCKHBW when AVX2 is enabled. · 6270d072
  Craig Topper authored Nov 21, 2011
```
llvm-svn: 145028
```
- Add support for lowering 256-bit shuffles to VPUNPCKL/H for i16, i32, i64 if AVX2 is enabled. · 669199ca
  Craig Topper authored Nov 21, 2011
```
llvm-svn: 145026
```
- Make LowerSIGN_EXTEND_INREG split 256-bit vectors when AVX1 is enabled and use... · a065238c
  Craig Topper authored Nov 21, 2011
```
Make LowerSIGN_EXTEND_INREG split 256-bit vectors when AVX1 is enabled and use AVX2 shifts when AVX2 is enabled.

llvm-svn: 145022
```
Nov 20, 2011

Add code for lowering v32i8 shifts by a splat to AVX2 immediate shift... · e79761df

Craig Topper authored Nov 20, 2011

Add code for lowering v32i8 shifts by a splat to AVX2 immediate shift instructions. Remove 256-bit splat handling from LowerShift as it was already handled by PerformShiftCombine.

llvm-svn: 145005
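
AVX2 has no byte-granular shift instruction, so a byte shift by a uniform amount is commonly built from a 16-bit lane shift followed by a mask. The scalar model below sketches that technique for a left shift; it is an assumption about the kind of sequence involved, not code from the commit, and `shl_bytes_via_words` is a hypothetical name.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Shift every byte of a 32-byte vector left by n (0..7) using only 16-bit
// lane shifts plus a per-byte mask, modeling a word shift followed by an AND.
std::array<std::uint8_t, 32> shl_bytes_via_words(
    const std::array<std::uint8_t, 32>& v, unsigned n) {
    std::array<std::uint8_t, 32> out{};
    // Clears the low n bits of each byte, which may hold bits that crossed
    // over from the neighbouring byte in the same 16-bit lane.
    const auto mask = static_cast<std::uint8_t>(0xFFu << n);
    for (std::size_t i = 0; i < v.size(); i += 2) {
        // Assemble a little-endian 16-bit lane and shift it as one unit.
        auto lane = static_cast<std::uint16_t>(v[i] | (v[i + 1] << 8));
        lane = static_cast<std::uint16_t>(lane << n);
        out[i] = static_cast<std::uint8_t>((lane & 0xFF) & mask);
        out[i + 1] = static_cast<std::uint8_t>((lane >> 8) & mask);
    }
    return out;
}
```

Without the mask, the high bits of each even byte would leak into the low bits of its odd neighbour.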

Nov 19, 2011
- Use 256-bit vcmpeqd for creating an all ones vector when AVX2 is enabled. · a3a65836
  Craig Topper authored Nov 19, 2011
```
llvm-svn: 145004
```
- Remove some of the special classes that worked around an old tablegen... · bac86038
  Craig Topper authored Nov 19, 2011
```
Remove some of the special classes that worked around an old tablegen limitation of not being able to remove redundant bitconverts from patterns.

llvm-svn: 145003
```
- Custom lower AVX2 variable shift intrinsics to shl/srl/sra nodes and remove the intrinsic patterns. · 3af6ae08
  Craig Topper authored Nov 19, 2011
```
llvm-svn: 144999
```
- Synthesize SSSE3/AVX 128-bit horizontal integer add/sub instructions from... · f984efbf
  Craig Topper authored Nov 19, 2011
```
Synthesize SSSE3/AVX 128-bit horizontal integer add/sub instructions from add/sub of appropriate shuffle vectors.

llvm-svn: 144989
```
- Collapse X86 PSIGNB/PSIGNW/PSIGND node types. · 81390be0
  Craig Topper authored Nov 19, 2011
```
llvm-svn: 144988
```
- Extend VPBLENDVB and VPSIGN lowering to work for AVX2. · de6b73bb
  Craig Topper authored Nov 19, 2011
```
llvm-svn: 144987
```
- Remove unused parameters from the AVX maskmov classes. · 66e2b5a6
  Craig Topper authored Nov 19, 2011
```
llvm-svn: 144985
```
Nov 18, 2011
- Add AVX2 vpbroadcast support · 1ec141d0
  Nadav Rotem authored Nov 18, 2011
```
llvm-svn: 144967
```
Nov 17, 2011

Fix SSE/AVX integer comparison patterns to understand that all integer vector... · f41e1d02

Craig Topper authored Nov 17, 2011

Fix SSE/AVX integer comparison patterns to understand that all integer vector loads are promoted to i64 vector loads so patterns need a bitconvert. Also slightly simplify the AVX2 variable shift patterns by using the predefined bitconvert pattern fragments.

llvm-svn: 144896

Remove seemingly unnecessary duplicate VROUND definitions. · f17b6005
Craig Topper authored Nov 17, 2011
```
llvm-svn: 144885
```

Turn on vzeroupper insertion on call boundaries for AVX; it works as far as I... · 20439a42

Eli Friedman authored Nov 17, 2011

Turn on vzeroupper insertion on call boundaries for AVX; it works as far as I know, and I'd like to see wider testing.

llvm-svn: 144867

Nov 16, 2011
- Another missing X86ISD::MOVLPD pattern. rdar://10450317 · 011538dc
  Evan Cheng authored Nov 16, 2011
```
llvm-svn: 144839
```
- Added missing comment about new custom lowering of DEC64 · 48784ed5
  Pete Cooper authored Nov 16, 2011
```
llvm-svn: 144811
```
- Sink codegen optimization level into MCCodeGenInfo alongside relocation model · ecb2908b
  Evan Cheng authored Nov 16, 2011
```
and code model. This eliminates the need to pass OptLevel flag all over the
place and makes it possible for any codegen pass to use this information.

llvm-svn: 144788
```
- Fix the execution domain on a bunch of SSE/AVX instructions. · 3ed7d9ee
  Craig Topper authored Nov 16, 2011
```
llvm-svn: 144784
```
- Remove code to enable execution dependency fix pass on VR256. VR128 is sufficient after r144636. · 07d8b5e2
  Craig Topper authored Nov 16, 2011
```
llvm-svn: 144777
```
Nov 15, 2011

AVX: Add support for vbroadcast from BUILD_VECTOR and refactor some of the vbroadcast code. · 37010002
Nadav Rotem authored Nov 15, 2011
```
llvm-svn: 144720
```
Added custom lowering for load->dec->store sequence in x86 when the EFLAGS register is used · 7c7ba1ba
Pete Cooper authored Nov 15, 2011
```
by later instructions.

Only done for DEC64m right now.

Fixes <rdar://problem/6172640>

llvm-svn: 144705
```
Remove some unnecessary includes of PseudoSourceValue.h. · 0745e645
Jay Foad authored Nov 15, 2011
```
llvm-svn: 144631
```

Fix PR11370 for real. Prevents converting 256-bit FP instruction to AVX2... · 649d1c5e

Craig Topper authored Nov 15, 2011

Fix PR11370 for real. Prevents converting 256-bit FP instruction to AVX2 256-bit integer instructions when AVX2 isn't enabled.

llvm-svn: 144629

Properly qualify AVX2 specific parts of execution dependency table. Also... · 05baa85f

Craig Topper authored Nov 15, 2011

Properly qualify AVX2 specific parts of execution dependency table. Also enable converting between 256-bit PS/PD operations when AVX1 is enabled. Fixes PR11370.

llvm-svn: 144622

Break false dependencies before partial register updates. · f8ad336b

Jakob Stoklund Olesen authored Nov 15, 2011

Two new TargetInstrInfo hooks let the target tell ExecutionDepsFix
about instructions with partial register updates that cause unwanted
false dependencies.

The ExecutionDepsFix pass will break the false dependencies if the
updated register was written in the previous N instructions.

The small loop added to sse-domains.ll runs twice as fast with
dependency-breaking instructions inserted.

llvm-svn: 144602
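
The "written in the previous N instructions" rule can be modeled with a small scan over an instruction list. This is a toy sketch under stated assumptions, not the pass itself: `ToyInstr`, `find_break_points`, and the single fixed `window` parameter are all hypothetical.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical toy instruction: the register it writes, and whether the write
// updates only part of that register (and so depends on its old value).
struct ToyInstr {
    int def_reg;
    bool partial_update;
};

// Returns the indices of partial-update instructions whose destination
// register was written within the previous `window` instructions. In this
// model, those are the points where a dependency-breaking instruction
// would be inserted.
std::vector<std::size_t> find_break_points(const std::vector<ToyInstr>& code,
                                           std::size_t window) {
    std::vector<std::size_t> result;
    for (std::size_t i = 0; i < code.size(); ++i) {
        if (!code[i].partial_update) continue;
        const std::size_t first = i > window ? i - window : 0;
        for (std::size_t j = first; j < i; ++j) {
            if (code[j].def_reg == code[i].def_reg) {
                result.push_back(i);
                break;
            }
        }
    }
    return result;
}
```

A larger window catches more stale writes at the cost of inserting more extra instructions; the model leaves that trade-off to the caller.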

Nov 14, 2011
- Add a missing pattern for X86ISD::MOVLPD. rdar://10436044 · fb13d32b
  Evan Cheng authored Nov 14, 2011
```
llvm-svn: 144566
```
- Changed SSE4/AVX <2 x i64> extract and insert ops to be Custom lowered · 890e02e8
  Pete Cooper authored Nov 14, 2011
```
Constant idx case is still done in tablegen but other cases are then expanded

Fixes <rdar://problem/10435460>

llvm-svn: 144557
```
- Add AVX2 version of instructions to load folding tables. Also add a bunch of... · 182b00a2
  Craig Topper authored Nov 14, 2011
```
Add AVX2 version of instructions to load folding tables. Also add a bunch of missing SSE/AVX instructions.

llvm-svn: 144525
```
- Add neverHasSideEffects, mayLoad, and mayStore to many patternless SSE/AVX... · a331515c
  Craig Topper authored Nov 14, 2011
```
Add neverHasSideEffects, mayLoad, and mayStore to many patternless SSE/AVX instructions. Remove MMX check from LowerVECTOR_SHUFFLE since MMX vector types won't go through it anyway.

llvm-svn: 144522
```
Nov 13, 2011
- Add BLSI, BLSMSK, and BLSR to getTargetNodeName. · b8bcb473
  Craig Topper authored Nov 13, 2011
```
llvm-svn: 144502
```
Nov 12, 2011

Add more AVX2 shift lowering support. Move AVX2 variable shift to use patterns... · 3dc75f9e
Craig Topper authored Nov 12, 2011
```
Add more AVX2 shift lowering support. Move AVX2 variable shift to use patterns instead of custom lowering code.

llvm-svn: 144457
```

build: Attempt to rectify inconsistencies between CMake and LLVMBuild versions... · 52823cc9

Daniel Dunbar authored Nov 12, 2011

build: Attempt to rectify inconsistencies between CMake and LLVMBuild versions of explicit dependencies.
 - The hope is that we have a tool/test to verify these are accurate (and tight) soon.

llvm-svn: 144444

Nov 11, 2011
- Add lowering for AVX2 shift instructions. · ea28a34c
  Craig Topper authored Nov 11, 2011
```
llvm-svn: 144380
```