Commits · 0a928fa32ea6604caa553c3998301fda00f88433 · Roger Ferrer / llvm-epi-0.8

Sep 26, 2012
- Remove hasNoAVX method. Can just invert hasAVX instead. · 0a928fa3
  Craig Topper authored Sep 26, 2012
```
llvm-svn: 164664
```
  0a928fa3
Sep 04, 2012

Preston Gurd authored Sep 04, 2012

- CodeGenPrepare pass for identifying div/rem ops
- Backend specifies the type mapping using addBypassSlowDivType
- Enabled only for Intel Atom with O2 32-bit -> 8-bit
- Replace IDIV with instructions which test its value and use DIVB if the value
is positive and less than 256.
- In the case when the quotient and remainder of a divide are used a DIV
and a REM instruction will be present in the IR. In the non-Atom case
they are both lowered to IDIVs and CSE removes the redundant IDIV instruction,
using the quotient and remainder from the first IDIV. However,
due to this optimization CSE is not able to eliminate redundant
IDIV instructions because they are located in different basic blocks.
This is overcome by calculating both the quotient (DIV) and remainder (REM)
in each basic block that is inserted by the optimization and reusing the result
values when a subsequent DIV or REM instruction uses the same operands.
- Test cases check for the presents of the optimization when calculating
either the quotient, remainder,  or both.

Patch by Tyler Nowicki!

llvm-svn: 163150

cdf540d5

Aug 30, 2012

Introduce 'UseSSEx' to force SSE legacy encoding · bbd10792

Michael Liao authored Aug 30, 2012

- Add 'UseSSEx' to force SSE legacy insn not being selected when AVX is
  enabled.

  As the penalty of inter-mixing SSE and AVX instructions, we need
  prevent SSE legacy insn from being generated except explicitly
  specified through some intrinsics. For patterns supported by both
  SSE and AVX, so far, we force AVX insn will be tried first relying on
  AddedComplexity or position in td file. It's error-prone and
  introduces bugs accidentally.

  'UseSSEx' is disabled when AVX is turned on. For SSE insns inherited
  by AVX, we need this predicate to force VEX encoding or SSE legacy
  encoding only.

  For insns not inherited by AVX, we still use the previous predicates,
  i.e. 'HasSSEx'. So far, these insns fall into the following
  categories:
  * SSE insns with MMX operands
  * SSE insns with GPR/MEM operands only (xFENCE, PREFETCH, CLFLUSH,
    CRC, and etc.)
  * SSE4A insns.
  * MMX insns.
  * x87 insns added by SSE.

2 test cases are modified:

 - test/CodeGen/X86/fast-isel-x86-64.ll
   AVX code generation is different from SSE one. 'vcvtsi2sdq' cannot be
   selected by fast-isel due to complicated pattern and fast-isel
   fallback to materialize it from constant pool.

 - test/CodeGen/X86/widen_load-1.ll
   AVX code generation is different from SSE one after fixing SSE/AVX
   inter-mixing. Exec-domain fixing prefers 'vmovapd' instead of
   'vmovaps'.

llvm-svn: 162919

bbd10792

Aug 24, 2012
- Custom lower FMA intrinsics to target specific nodes and remove the patterns. · 663d160a
  Craig Topper authored Aug 24, 2012
```
llvm-svn: 162534
```
  663d160a
Aug 23, 2012
- Favor FMA3 over FMA4 if both are enabled. · 4a4634d6
  Craig Topper authored Aug 23, 2012
```
llvm-svn: 162454
```
  4a4634d6
Aug 01, 2012
- Whitespace. · 24c19d20
  Chad Rosier authored Aug 01, 2012
```
llvm-svn: 161122
```
  24c19d20
Jun 03, 2012
- Rename FMA3 feature flag to just FMA to match gcc so it can be added to clang. · 79dbb0c6
  Craig Topper authored Jun 03, 2012
```
llvm-svn: 157903
```
  79dbb0c6
May 31, 2012

X86: Rename the CLMUL target feature to PCLMUL. · a0396e45

Benjamin Kramer authored May 31, 2012

It was renamed in gcc/gas a while ago and causes all kinds of
confusion because it was named differently in llvm and clang.

llvm-svn: 157745

a0396e45

Apr 23, 2012

This patch fixes a problem which arose when using the Post-RA scheduler · 9a091475

Preston Gurd authored Apr 23, 2012

on X86 Atom. Some of our tests failed because the tail merging part of
the BranchFolding pass was creating new basic blocks which did not
contain live-in information. When the anti-dependency code in the Post-RA
scheduler ran, it would sometimes rename the register containing
the function return value because the fact that the return value was
live-in to the subsequent block had been lost. To fix this, it is necessary
to run the RegisterScavenging code in the BranchFolding pass.

This patch makes sure that the register scavenging code is invoked
in the X86 subtarget only when post-RA scheduling is being done.
Post RA scheduling in the X86 subtarget is only done for Atom.

This patch adds a new function to the TargetRegisterClass to control
whether or not live-ins should be preserved during branch folding.
This is necessary in order for the anti-dependency optimizations done
during the PostRASchedulerList pass to work properly when doing
Post-RA scheduling for the X86 in general and for the Intel Atom in particular.

The patch adds and invokes the new function trackLivenessAfterRegAlloc()
instead of using the existing requiresRegisterScavenging().
It changes BranchFolding.cpp to call trackLivenessAfterRegAlloc() instead of
requiresRegisterScavenging(). It changes the all the targets that
implemented requiresRegisterScavenging() to also implement
trackLivenessAfterRegAlloc().  

It adds an assertion in the Post RA scheduler to make sure that post RA
liveness information is available when it is needed.

It changes the X86 break-anti-dependencies test to use –mcpu=atom, in order
to avoid running into the added assertion.

Finally, this patch restores the use of anti-dependency checking
(which was turned off temporarily for the 3.1 release) for
Intel Atom in the Post RA scheduler.

Patch by Andy Zhang!

Thanks to Jakob and Anton for their reviews.

llvm-svn: 155395

9a091475

Mar 17, 2012
- Reorder includes in Target backends to following coding standards. Remove some... · b25fda95
  Craig Topper authored Mar 17, 2012
```
Reorder includes in Target backends to following coding standards. Remove some superfluous forward declarations.

llvm-svn: 152997
```
  b25fda95
Feb 19, 2012
- some comment fix for X86 and ARM · e1d61969
  Jia Liu authored Feb 19, 2012
```
llvm-svn: 150902
```
  e1d61969
Feb 18, 2012
- Emacs-tag and some comment fix for all ARM, CellSPU, Hexagon, MBlaze, MSP430,... · b22310fd
  Jia Liu authored Feb 18, 2012
```
Emacs-tag and some comment fix for all ARM, CellSPU, Hexagon, MBlaze, MSP430, PPC, PTX, Sparc, X86, XCore.

llvm-svn: 150878
```
  b22310fd
Feb 07, 2012
- Use LEA to adjust stack ptr for Atom. Patch by Andy Zhang. · 1b81fddd
  Evan Cheng authored Feb 07, 2012
```
llvm-svn: 150008
```
  1b81fddd
Feb 05, 2012

Begin fleshing out more convenience predicates in llvm::Triple and · ebd90c58

Chandler Carruth authored Feb 05, 2012

convert at least one client over to use them. Subsequent patches both to
LLVM and Clang will try to convert more people over to a common set of
predicates.

This round of predicates is focused on OS-categorization predicates.

llvm-svn: 149815

ebd90c58

Feb 02, 2012

Instruction scheduling itinerary for Intel Atom. · 8523b16f

Andrew Trick authored Feb 01, 2012

Adds an instruction itinerary to all x86 instructions, giving each a default latency of 1, using the InstrItinClass IIC_DEFAULT.

Sets specific latencies for Atom for the instructions in files X86InstrCMovSetCC.td, X86InstrArithmetic.td, X86InstrControl.td, and X86InstrShiftRotate.td. The Atom latencies for the remainder of the x86 instructions will be set in subsequent patches.

Adds a test to verify that the scheduler is working.

Also changes the scheduling preference to "Hybrid" for i386 Atom, while leaving x86_64 as ILP.

Patch by Preston Gurd!

llvm-svn: 149558

8523b16f

Jan 10, 2012

Remove hasXMM/hasXMMInt functions. Move callers to hasSSE1/hasSSE2. This is... · b0c0f72a

Craig Topper authored Jan 10, 2012

Remove hasXMM/hasXMMInt functions. Move callers to hasSSE1/hasSSE2. This is the final piece to remove the AVX hack that disabled SSE.

llvm-svn: 147843

b0c0f72a

Remove hasSSE*orAVX functions and change all callers to use just hasSSE*. AVX... · d97bbd7b

Craig Topper authored Jan 10, 2012

Remove hasSSE*orAVX functions and change all callers to use just hasSSE*. AVX is now an SSE level and no longer disables SSE checks.

llvm-svn: 147842

d97bbd7b

Instruction selection priority fixes to remove the XMM/XMMInt/orAVX... · eb8f9e9e

Craig Topper authored Jan 10, 2012

Instruction selection priority fixes to remove the XMM/XMMInt/orAVX predicates. Another commit will remove orAVX functions from X86SubTarget.

llvm-svn: 147841

eb8f9e9e

Jan 09, 2012

Remove AVX hack in X86Subtarget. AVX/AVX2 are now treated as an SSE level.... · f287a450

Craig Topper authored Jan 09, 2012

Remove AVX hack in X86Subtarget. AVX/AVX2 are now treated as an SSE level. Predicate functions have been altered to maintain previous names and behavior.

llvm-svn: 147770

f287a450

Dec 09, 2011
- Remove hasSSE1orAVX(). It's the same as hasXMM(). · 557cda7f
  Evan Cheng authored Dec 09, 2011
```
llvm-svn: 146246
```
  557cda7f
Dec 08, 2011

Many of the SSE patterns should not be selected when AVX is available. This... · 4d1a2d44

Evan Cheng authored Dec 08, 2011

Many of the SSE patterns should not be selected when AVX is available. This led to the following code in X86Subtarget.cpp

if (HasAVX)
X86SSELevel = NoMMXSSE;

This is so patterns that are predicated on hasSSE3, etc. would not be selected when avx is available. Instead, the AVX variant is selected.
However, this breaks instructions which do not have AVX variants.

The right way to fix this is for the SSE but not-AVX patterns to predicate on something like hasSSE3() && !hasAVX().
Then we can take out the hack in X86Subtarget.cpp. Patterns which do not have AVX variants do not need to change.

However, we need to audit all the patterns before we make the change. This patch is workaround that fixes one specific case,
the prefetch instructions. rdar://10538297

llvm-svn: 146163

4d1a2d44

Dec 02, 2011
- Add XOP feature flag. · 1280eb1d
  Jan Sjödin authored Dec 02, 2011
```
llvm-svn: 145682
```
  1280eb1d
Nov 22, 2011

Add methods for querying minimum SSE version along with AVX. Simplifies all... · f5639777

Craig Topper authored Nov 22, 2011

Add methods for querying minimum SSE version along with AVX. Simplifies all the places that had to check a version of SSE and AVX.

llvm-svn: 145053

f5639777

Oct 30, 2011
- Add intrinsics and feature flag for read/write FS/GS base instructions. Also add AVX2 feature flag. · 228d9131
  Craig Topper authored Oct 30, 2011
```
llvm-svn: 143319
```
  228d9131
Oct 18, 2011
- Remove NaClMode · 49045ddb
  David Meyer authored Oct 18, 2011
```
llvm-svn: 142338
```
  49045ddb
Oct 16, 2011
- Add X86 BZHI instruction as well as BMI2 feature detection. · aea148c3
  Craig Topper authored Oct 16, 2011
```
llvm-svn: 142122
```
  aea148c3
Oct 14, 2011

Add X86 TZCNT instruction and patterns to select it. Also added core-avx2... · 3657fe4b

Craig Topper authored Oct 14, 2011

Add X86 TZCNT instruction and patterns to select it. Also added core-avx2 processor which is gcc's name for Haswell.

llvm-svn: 141939

3657fe4b

Oct 13, 2011

Revert r141854 because it was causing failures: · 063f55ff

Bill Wendling authored Oct 13, 2011

http://lab.llvm.org:8011/builders/llvm-x86_64-linux/builds/101

--- Reverse-merging r141854 into '.':
U    test/MC/Disassembler/X86/x86-32.txt
U    test/MC/Disassembler/X86/simple-tests.txt
D    test/CodeGen/X86/bmi.ll
U    lib/Target/X86/X86InstrInfo.td
U    lib/Target/X86/X86ISelLowering.cpp
U    lib/Target/X86/X86.td
U    lib/Target/X86/X86Subtarget.h

llvm-svn: 141857

063f55ff

Add X86 TZCNT instruction and patterns to select it. Also added core-avx2... · 8cc93880

Craig Topper authored Oct 13, 2011

Add X86 TZCNT instruction and patterns to select it. Also added core-avx2 processor which is gcc's name for Haswell.

llvm-svn: 141854

8cc93880

Oct 11, 2011
- Add X86 LZCNT instruction. Including instruction selection support. · 271064e8
  Craig Topper authored Oct 11, 2011
```
llvm-svn: 141651
```
  271064e8
Oct 09, 2011
- Add Ivy Bridge 16-bit floating point conversion instructions for the X86 disassembler. · fe9179fa
  Craig Topper authored Oct 09, 2011
```
llvm-svn: 141505
```
  fe9179fa
Oct 03, 2011

Add support for MOVBE and RDRAND instructions for the assembler and... · 786bdb9e

Craig Topper authored Oct 03, 2011

Add support for MOVBE and RDRAND instructions for the assembler and disassembler. Includes feature flag checking, but no instrinsic support. Fixes PR10832, PR11026 and PR11027.

llvm-svn: 141007

786bdb9e

Sep 05, 2011

Add a new MC bit for NaCl (Native Client) mode. NaCl requires that certain · 73df7e38

Nick Lewycky authored Sep 05, 2011

instructions are more aligned than the CPU requires, and adds some additional
directives, to follow in future patches. Patch by David Meyer!

llvm-svn: 139125

73df7e38

Aug 26, 2011
- Add support for generating CMPXCHG16B on x86-64 for the cmpxchg IR instruction. · 5e570427
  Eli Friedman authored Aug 26, 2011
```
llvm-svn: 138660
```
  5e570427
Jul 20, 2011

X86Subtarget.h: Assume "x86_64-cygwin", though it has not been released yet,... · b66d2555

NAKAMURA Takumi authored Jul 20, 2011

X86Subtarget.h: Assume "x86_64-cygwin", though it has not been released yet, to appease test/CodeGen/X86 on cygwin.

llvm-svn: 135564

b66d2555

Jul 09, 2011
- Restore old behavior. Always auto-detect features unless cpu or features are specified. · 60fc0fca
  Evan Cheng authored Jul 08, 2011
```
llvm-svn: 134757
```
  60fc0fca
Jul 07, 2011
- Add Mode64Bit feature and sink it down to MC layer. · 13bcc6c1
  Evan Cheng authored Jul 07, 2011
```
llvm-svn: 134641
```
  13bcc6c1
- Compute feature bits at time of MCSubtargetInfo initialization. · 1a72add6
  Evan Cheng authored Jul 07, 2011
```
llvm-svn: 134606
```
  1a72add6
Jul 02, 2011
- Rename XXXGenSubtarget.inc to XXXGenSubtargetInfo.inc for consistency. · c9c090d7
  Evan Cheng authored Jul 01, 2011
```
llvm-svn: 134281
```
  c9c090d7
Jul 01, 2011
- Rename TargetSubtarget to TargetSubtargetInfo for consistency. · 0d639a28
  Evan Cheng authored Jul 01, 2011
```
llvm-svn: 134259
```
  0d639a28