Commits · 90e0eaffa8eac92450b4ae41249c9640ec6dfcd9 · Roger Ferrer / llvm-epi-0.8

Sep 01, 2012
- Teach DAG combine a number of tricks to simplify FMA expressions in fast-math mode. · 90e0eaff
  Owen Anderson authored Sep 01, 2012
```
llvm-svn: 163051
```
  90e0eaff
- Fix typo · ec385012
  Michael Liao authored Sep 01, 2012
```
llvm-svn: 163049
```
  ec385012
- SelectionDAG: when constructing VZEXT_LOAD from other loads, make sure its · 26c5d0f6
  Manman Ren authored Aug 31, 2012
```
output chain is correctly set up.

As an example, if the original load must happen before later stores, we need
to make sure the constructed VZEXT_LOAD is constrained to be before the stores.

rdar://11457792

llvm-svn: 163036
```
  26c5d0f6
- Mark FMA4 instructions as commutable and add them to the folding tables. · 908e6851
  Craig Topper authored Aug 31, 2012
```
llvm-svn: 163035
```
  908e6851
- Remove an unused argument. The MCInst opcode is set in the ConvertToMCInst() · 451ef13c
  Chad Rosier authored Aug 31, 2012
```
function nowadays.

llvm-svn: 163030
```
  451ef13c
- Add selection of RegOp2MemOpTable3 to canFoldMemoryOperand · 7573c8f0
  Craig Topper authored Aug 31, 2012
```
llvm-svn: 163029
```
  7573c8f0
Aug 31, 2012

Add MachineInstr::tieOperands, remove setIsTied(). · 5c8eda0e

Jakob Stoklund Olesen authored Aug 31, 2012

Manage tied operands entirely internally to MachineInstr. This makes it
possible to change the representation of tied operands, as I will do
shortly.

The constraint that tied uses and defs must be in the same order was too
restrictive.

llvm-svn: 163021

5c8eda0e

Fix PR12359 · 3224543b

Michael Liao authored Aug 31, 2012

- In addition to the undefined case, if V2 is a zero vector, skip the 2nd PSHUFB
  and the POR as well, since PSHUFB zeroes elements with negative indices.

  Patch by Sriram Murali <sriram.murali@intel.com>

llvm-svn: 163018

3224543b
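The reasoning in the PR12359 fix can be illustrated with a scalar model of PSHUFB. This is only a sketch, not the X86 lowering code: `Vec16`, `pshufb`, `por`, `shuffleTwo` and `shuffleWithZero` are hypothetical names, and the index convention (0..15 picks from V1, 16..31 picks from V2) is an assumption made for the example.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

using Vec16 = std::array<std::uint8_t, 16>;

// Scalar model of PSHUFB: a control byte with its top bit set (negative as a
// signed byte) yields zero; otherwise its low four bits pick a source byte.
Vec16 pshufb(const Vec16 &src, const Vec16 &ctrl) {
  Vec16 out{};
  for (std::size_t i = 0; i < 16; ++i)
    out[i] = (ctrl[i] & 0x80) ? 0 : src[ctrl[i] & 0x0F];
  return out;
}

// Scalar model of POR: bytewise OR.
Vec16 por(const Vec16 &a, const Vec16 &b) {
  Vec16 out{};
  for (std::size_t i = 0; i < 16; ++i)
    out[i] = static_cast<std::uint8_t>(a[i] | b[i]);
  return out;
}

// Hypothetical control mask for V1: lanes taken from V2 get 0x80 (zeroed).
Vec16 controlForV1(const std::array<int, 16> &idx) {
  Vec16 ctrl{};
  for (std::size_t i = 0; i < 16; ++i) {
    assert(idx[i] >= 0 && idx[i] < 32);
    ctrl[i] = static_cast<std::uint8_t>(idx[i] < 16 ? idx[i] : 0x80);
  }
  return ctrl;
}

// General two-input shuffle: PSHUFB on each input, then POR.
Vec16 shuffleTwo(const Vec16 &v1, const Vec16 &v2,
                 const std::array<int, 16> &idx) {
  Vec16 ctrl2{};
  for (std::size_t i = 0; i < 16; ++i)
    ctrl2[i] = static_cast<std::uint8_t>(idx[i] >= 16 ? idx[i] - 16 : 0x80);
  return por(pshufb(v1, controlForV1(idx)), pshufb(v2, ctrl2));
}

// When V2 is all zeros, the second PSHUFB and the POR contribute nothing.
Vec16 shuffleWithZero(const Vec16 &v1, const std::array<int, 16> &idx) {
  return pshufb(v1, controlForV1(idx));
}
```

With an all-zero V2, `shuffleTwo` and `shuffleWithZero` return the same vector, which is why the second PSHUFB and the POR can be dropped in that case.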

The instruction DINS may be transformed into DINSU or DINSM depending · b3f3b17e

Jack Carter authored Aug 31, 2012

on the size of the field and its position in the 64-bit word.

This patch allows support of the dins transformations with mips64 direct
object output.

DINS: 0 <= msb < 32, 0 <= lsb < 32, 0 <= pos < 32, 1 <= size <= 32
The field is entirely contained in the right-most word of the doubleword

DINSM: 32 <= msb < 64, 0 <= lsb < 32, 0 <= pos < 32, 2 <= size <= 64
The field straddles the words of the doubleword

DINSU: 32 <= msb < 64, 32 <= lsb < 64, 32 <= pos < 64, 1 <= size <= 32
The field is entirely contained in the left-most word of the doubleword

llvm-svn: 163010

b3f3b17e
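The three ranges listed in the DINS commit above can be read as a selection rule on `pos` and `size`. The following is a minimal sketch of that rule, not the code from the patch: `DinsVariant` and `selectDinsVariant` are hypothetical names, and it assumes `lsb = pos` and `msb = pos + size - 1`.

```cpp
#include <cassert>

enum class DinsVariant { DINS, DINSM, DINSU };

// Hypothetical helper: choose the encoding for a field of `size` bits starting
// at bit `pos` of a 64-bit doubleword, following the ranges in the commit
// message. Assumes lsb == pos and msb == pos + size - 1.
DinsVariant selectDinsVariant(unsigned pos, unsigned size) {
  assert(size >= 1 && pos < 64 && pos + size <= 64 &&
         "field must lie inside the doubleword");
  const unsigned msb = pos + size - 1;
  if (pos >= 32)
    return DinsVariant::DINSU; // field entirely in the left-most word
  if (msb >= 32)
    return DinsVariant::DINSM; // field straddles both words
  return DinsVariant::DINS;    // field entirely in the right-most word
}
```

For example, `selectDinsVariant(31, 2)` yields `DINSM`, because the field starts in the right-most word and ends in the left-most one.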

Move the GCOVFormat enums into their own namespace per the LLVM coding standard. · 6bbe4896
Bill Wendling authored Aug 31, 2012
```
llvm-svn: 163008
```
6bbe4896
Add a comment to explain what's really going on. · 9d1fc367
Chad Rosier authored Aug 31, 2012
```
llvm-svn: 163005
```
9d1fc367
The ConvertToMCInst() function can't fail, so remove the now dead Match_ConversionFail enum. · a8f3c4fe
Chad Rosier authored Aug 31, 2012
```
llvm-svn: 163002
```
a8f3c4fe
Mark FMA3 instructions as commutable so that the operands to the multiply part can be commuted. · c0387f6b
Craig Topper authored Aug 31, 2012
```
llvm-svn: 163001
```
c0387f6b

Use CloneMachineInstr to make a new MI in commuteInstruction to make the code... · a8227cb7

Craig Topper authored Aug 31, 2012

Use CloneMachineInstr to make a new MI in commuteInstruction to make the code tolerant of instructions with more than two input operands.

llvm-svn: 163000

a8227cb7

Add support for converting llvm.fma to fma4 instructions. · c30fdbc4
Craig Topper authored Aug 31, 2012
```
llvm-svn: 162999
```
c30fdbc4

Don't enforce ordered inline asm operands. · 96f87069

Jakob Stoklund Olesen authored Aug 31, 2012

I was too optimistic, inline asm can have tied operands that don't
follow the def order.

Fixes PR13742.

llvm-svn: 162998

96f87069

Clean up ProfileDataLoader a bit. · e7e52357

Benjamin Kramer authored Aug 31, 2012

- Overloading operator<< for raw_ostream and pointers is dangerous: it alters
  the behavior of code that includes the header.
- Remove unused ID.
- Use LLVM's byte swapping helpers instead of a hand-coded one.
- Make ReadProfilingData work directly on a pointer.

No functionality change.

llvm-svn: 162992

e7e52357

Cleanups due to feedback. No functionality change. Patch by Alistair. · 5aed004c
Bill Wendling authored Aug 31, 2012
```
llvm-svn: 162979
```
5aed004c
Clean up AddedComplexity further after adding UseSSEx · 969f3913
Michael Liao authored Aug 31, 2012
```
llvm-svn: 162973
```
969f3913

Fix a couple of typos in EmitAtomic. · d3bda3c5

Jakob Stoklund Olesen authored Aug 31, 2012

Thumb2 instructions are mostly constrained to rGPR, not tGPR which is
for Thumb1.

rdar://problem/12203728

llvm-svn: 162968

d3bda3c5

X86: Fix encoding of 'movd %xmm0, %rax' · e423e865

Jim Grosbach authored Aug 31, 2012

The assembly string for the VMOVPQIto64rr instruction incorrectly lacked the 'v'
prefix, resulting in mis-assembly of the vanilla movd instruction.

llvm-svn: 162963

e423e865

With the fix in r162954/162955 every cvt function returns true. Thus, have · 98cfa104
Chad Rosier authored Aug 31, 2012
```
the ConvertToMCInst() return void, rather than a bool.  Update all the cvt
functions as well.

llvm-svn: 162961
```
98cfa104

Take account of boolean vector contents when promoting a build vector from i1... · e969340f

Pete Cooper authored Aug 30, 2012

Take account of boolean vector contents when promoting a build vector from i1 to some other type.  rdar://problem/12210060

llvm-svn: 162960

e969340f

Teach the DAG combiner to turn chains of FADDs (x+x+x+x+...) into FMULs by... · cc61f87c

Owen Anderson authored Aug 30, 2012

Teach the DAG combiner to turn chains of FADDs (x+x+x+x+...) into FMULs by constants.  This is only enabled in unsafe FP math mode, since it does not preserve rounding effects for all such constants.

llvm-svn: 162956

cc61f87c
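The FADD-chain combine above can be pictured with two scalar functions. This is an illustrative sketch only, not the DAG combiner code; `faddChain` and `fmulByCount` are hypothetical names. The chain rounds after every addition while the multiply rounds once, so the two forms are not guaranteed to agree bit-for-bit for every input, which is consistent with the commit restricting the transform to unsafe FP math mode.

```cpp
#include <cassert>

// Left-to-right chain x + x + ... + x with n terms, rounding after each add.
float faddChain(float x, int n) {
  assert(n >= 1);
  float sum = x;
  for (int i = 1; i < n; ++i)
    sum += x;
  return sum;
}

// Combined form: a single multiply by the constant n, rounding once.
float fmulByCount(float x, int n) {
  assert(n >= 1);
  return x * static_cast<float>(n);
}
```

For inputs where every intermediate sum is exactly representable, such as `x = 1.5f` with four terms, both functions return the same value.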

Fix for r162954. Return the Error. · db482ef7
Chad Rosier authored Aug 30, 2012
```
llvm-svn: 162955
```
db482ef7
Move a check to the validateInstruction() function where it more properly belongs. · 8513ffbb
Chad Rosier authored Aug 30, 2012
```
llvm-svn: 162954
```
8513ffbb
Typo. · 5eec49fe
Chad Rosier authored Aug 30, 2012
```
llvm-svn: 162952
```
5eec49fe

Aug 30, 2012

· ea973bda

Nadav Rotem authored Aug 30, 2012

Currently, targets that do not support selects with scalar conditions and vector operands scalarize the code. ARM is such a target
because it does not support CMOV of vectors. To implement this efficiently, we broadcast the condition bit and use a sequence of NAND-OR
to select between the two operands. This is the same sequence we use for targets that don't have vector BLENDs (like SSE2).

rdar://12201387

llvm-svn: 162926

ea973bda
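The broadcast-and-mask idea described above can be sketched on a four-lane integer vector. This is a scalar illustration under assumptions, not the legalizer code: `Vec4` and `selectByMask` are hypothetical names, and the lane type and count are chosen only for the example.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

using Vec4 = std::array<std::uint32_t, 4>;

// Select between two vectors with a scalar condition, without branching per
// lane: broadcast the condition to an all-ones or all-zeros mask, then
// combine (a AND mask) OR (b AND NOT mask).
Vec4 selectByMask(bool cond, const Vec4 &a, const Vec4 &b) {
  const std::uint32_t mask = cond ? 0xFFFFFFFFu : 0u; // broadcast condition
  Vec4 out{};
  for (std::size_t i = 0; i < out.size(); ++i)
    out[i] = (a[i] & mask) | (b[i] & ~mask);
  return out;
}
```

Calling `selectByMask(true, a, b)` returns `a` and `selectByMask(false, a, b)` returns `b`, using only bitwise operations on whole vectors.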

Introduce 'UseSSEx' to force SSE legacy encoding · bbd10792

Michael Liao authored Aug 30, 2012

- Add 'UseSSEx' to prevent SSE legacy insns from being selected when AVX is
  enabled.

  Because inter-mixing SSE and AVX instructions incurs a penalty, we need
  to prevent SSE legacy insns from being generated unless explicitly
  specified through some intrinsics. For patterns supported by both
  SSE and AVX, so far we have forced AVX insns to be tried first by relying
  on AddedComplexity or position in the td file. That is error-prone and
  introduces bugs accidentally.

  'UseSSEx' is disabled when AVX is turned on. For SSE insns inherited
  by AVX, we need this predicate to force VEX encoding or SSE legacy
  encoding only.

  For insns not inherited by AVX, we still use the previous predicates,
  i.e. 'HasSSEx'. So far, these insns fall into the following
  categories:
  * SSE insns with MMX operands
  * SSE insns with GPR/MEM operands only (xFENCE, PREFETCH, CLFLUSH,
    CRC, etc.)
  * SSE4A insns.
  * MMX insns.
  * x87 insns added by SSE.

Two test cases are modified:

 - test/CodeGen/X86/fast-isel-x86-64.ll
   AVX code generation is different from the SSE one. 'vcvtsi2sdq' cannot be
   selected by fast-isel due to its complicated pattern, so fast-isel
   falls back to materializing it from the constant pool.

 - test/CodeGen/X86/widen_load-1.ll
   AVX code generation is different from the SSE one after fixing SSE/AVX
   inter-mixing. Exec-domain fixing prefers 'vmovapd' over
   'vmovaps'.

llvm-svn: 162919

bbd10792

Apply "/Og-" also to MSC15(aka VS9) on VMCore/Function.cpp. · fa814380
NAKAMURA Takumi authored Aug 30, 2012
```
llvm-svn: 162917
```
fa814380

PPCISelLowering.cpp: Fix r162725. · ac49029f

NAKAMURA Takumi authored Aug 30, 2012

[Tobias von Koch] What's happening here is that the CR6SET/CR6UNSET is breaking the chain of register copies glued to the function call (BL_SVR4 node). The scheduler then moves other instructions in between those and the function call, which isn't good!

Right. That's the case where there is no chain of register copies before the call, so InFlag == 0... Attached is a new revision of the patch which should fix this for good.

llvm-svn: 162916

ac49029f

PPCISelLowering.cpp: Whitespace. · 8ad54e04
NAKAMURA Takumi authored Aug 30, 2012
```
llvm-svn: 162915
```
8ad54e04
test · 30c3e14e
Michael Ilseman authored Aug 30, 2012
```
llvm-svn: 162914
```
30c3e14e

LoopRotate: Also rotate loops with multiple exits. · afdfdb5c

Benjamin Kramer authored Aug 30, 2012

The old PHI updating code in loop-rotate was replaced with SSAUpdater a while
ago; it has no problems with complex PHIs. What had to be fixed is detecting
whether a loop was already rotated and updating dominators when multiple exits
were present.

This change increases overall code size a bit, mostly due to additional loop
unrolling opportunities. Passes test-suite and selfhost with -verify-dom-info.
Fixes PR7447.

Thanks to Andy for the input on the domtree updating code.

llvm-svn: 162912

afdfdb5c

InstCombine: Fix comment to reflect the code. · d4a64716
Benjamin Kramer authored Aug 30, 2012
```
llvm-svn: 162911
```
d4a64716

Don't use MCInstrDesc flags for implicit operands. · 0eecbbeb

Jakob Stoklund Olesen authored Aug 30, 2012

When a MachineInstr is constructed, its implicit operands are added
first, then the explicit operands are inserted before the implicits.

MCInstrDesc has operand flags like early clobber and operand ties that
apply to the explicit operands.

Don't look at those flags when the implicit operands are first added in
the explicit operands' positions.

llvm-svn: 162910

0eecbbeb

Whitespace · f54e3aae
Alexey Samsonov authored Aug 30, 2012
```
llvm-svn: 162907
```
f54e3aae
It is illegal to transform (sdiv (ashr X c1) c2) -> (sdiv X (2^c1 * c2)), · d5f5777b
Nadav Rotem authored Aug 30, 2012
```
because sdiv, like C division, always rounds towards zero, while ashr rounds
towards negative infinity.

Thanks Dirk and Ben.

llvm-svn: 162899
```
d5f5777b
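A small counterexample makes the rounding mismatch in the sdiv/ashr commit above concrete. This is a sketch for illustration, not code from the patch: `ashr`, `original` and `folded` are hypothetical helpers, and `ashr` is modelled as floor division by a power of two so that the example does not depend on how a compiler shifts negative values.

```cpp
#include <cassert>

// Arithmetic shift right modelled as floor(x / 2^c): rounds towards negative
// infinity, unlike C++ integer division, which rounds towards zero.
int ashr(int x, int c) {
  assert(c >= 0 && c < 31);
  const int d = 1 << c;
  int q = x / d;
  if (x % d != 0 && x < 0)
    --q;
  return q;
}

// (sdiv (ashr X c1) c2)
int original(int x, int c1, int c2) { return ashr(x, c1) / c2; }

// (sdiv X (2^c1 * c2)), the transform the commit declares illegal.
int folded(int x, int c1, int c2) { return x / ((1 << c1) * c2); }
```

With `X = -3`, `c1 = 1` and `c2 = 2`, `original` gives `-1` (the shift produces `-2`, then `-2 / 2`), while `folded` gives `0` (`-3 / 4` rounds towards zero). For non-negative `X` the two forms agree.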
Add support for moving pure S-register to NEON pipeline if desired · ca9f384f
Tim Northover authored Aug 30, 2012
```
llvm-svn: 162898
```
ca9f384f

Refactor fetching file/line info from DWARFContext to simplify the · 45be793e

Alexey Samsonov authored Aug 30, 2012

code and allow better code reuse. Make the code a bit more conforming
to LLVM code style.
No functionality change.

llvm-svn: 162895

45be793e