Commits · 742534c4dca641d1587a67f443e060af6ce2a2f5 · Roger Ferrer / llvm-epi-0.8

Sep 06, 2012

Release build: guard dump functions with "ifndef NDEBUG" · 742534c4
Manman Ren authored Sep 06, 2012
```
No functional change.

llvm-svn: 163339
```

Allow overlaps between virtreg and physreg live ranges. · 866908c4

Jakob Stoklund Olesen authored Sep 06, 2012

The RegisterCoalescer understands overlapping live ranges where one
register is defined as a copy of the other. With this change, register
allocators using LiveRegMatrix can do the same, at least for copies
between physical and virtual registers.

When a physreg is defined by a copy from a virtreg, allow those live
ranges to overlap:

  %CL<def> = COPY %vreg11:sub_8bit; GR32_ABCD:%vreg11
  %vreg13<def,tied1> = SAR32rCL %vreg13<tied0>, %CL<imp-use,kill>

We can assign %vreg11 to %ECX, overlapping the live range of %CL.

llvm-svn: 163336


Handle overlapping regunit intervals in LiveIntervals::addKillFlags(). · bb4bdd89

Jakob Stoklund Olesen authored Sep 06, 2012

We will soon allow virtual register live ranges to overlap regunit live
ranges when the physreg is defined as a copy of the virtreg:

  %EAX = COPY %vreg5
  FOO %vreg5
  BAR %EAX<kill>

There is no real interference since %vreg5 and %EAX have the same value
where they overlap.

This patch prevents addKillFlags from adding virtreg kill flags to FOO
where the assigned physreg is overlapping the virtual register live
range.

llvm-svn: 163335


Clear kill flags while computing live ranges. · 4aed4703

Jakob Stoklund Olesen authored Sep 06, 2012

Kill flags are difficult to maintain, and liveness queries are better
handled by live intervals.

Kill flags are reinserted after register allocation by addKillFlags().

llvm-svn: 163334


Don't cast away const needlessly. Found by gcc48 -Wcast-qual. · 4717a8d6
Roman Divacky authored Sep 06, 2012
```
llvm-svn: 163324
```
Disable stack coloring by default in order to resolve the i386 failures. · 9e3cc9f8
Nadav Rotem authored Sep 06, 2012
```
llvm-svn: 163316
```
Fix a few old-GCC warnings. No functional change. · a8e15b08
Nadav Rotem authored Sep 06, 2012
```
llvm-svn: 163309
```

Add a new optimization pass: Stack Coloring, that merges disjoint static... · 7c277da3

Nadav Rotem authored Sep 06, 2012

Add a new optimization pass: Stack Coloring, that merges disjoint static allocations (allocas). Allocas are known to be
disjoint if they are marked by disjoint lifetime markers (@llvm.lifetime.XXX intrinsics).
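
The merging idea can be illustrated with a minimal sketch. It is not the pass itself: the `Lifetime` struct and `assignSlots` function are hypothetical names, lifetimes are modelled as half-open instruction ranges, and alloca sizes and alignment are ignored. Allocas whose lifetimes never overlap are given the same stack slot.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical model: the half-open range [start, end) during which an
// alloca is live, as delimited by its lifetime markers.
struct Lifetime {
  unsigned start;
  unsigned end;
};

// Two half-open ranges overlap when each starts before the other ends.
bool overlaps(const Lifetime &a, const Lifetime &b) {
  return a.start < b.end && b.start < a.end;
}

// Greedy first-fit: returns, for each alloca, the index of the stack slot
// it shares. Sizes and alignment are ignored in this sketch.
std::vector<std::size_t> assignSlots(const std::vector<Lifetime> &allocas) {
  std::vector<std::vector<Lifetime>> slots; // lifetimes already in each slot
  std::vector<std::size_t> result;
  for (const Lifetime &lt : allocas) {
    assert(lt.start < lt.end && "lifetime must be non-empty");
    std::size_t chosen = slots.size();
    for (std::size_t s = 0; s < slots.size(); ++s) {
      bool clash = false;
      for (const Lifetime &other : slots[s]) {
        if (overlaps(lt, other)) {
          clash = true;
          break;
        }
      }
      if (!clash) {
        chosen = s;
        break;
      }
    }
    if (chosen == slots.size())
      slots.emplace_back(); // no compatible slot, open a new one
    slots[chosen].push_back(lt);
    result.push_back(chosen);
  }
  return result;
}
```

With lifetimes [0, 10), [10, 20) and [5, 15), the first two share a slot and the third needs its own.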

llvm-svn: 163299


[ms-inline asm] Use the asm dialect from the MI to set the parser dialect. · f24ae7b0
Chad Rosier authored Sep 05, 2012
```
llvm-svn: 163273
```
Cleanup a few magic numbers. · e53314f7
Chad Rosier authored Sep 05, 2012
```
llvm-svn: 163263
```
Stop casting away const qualifier needlessly. · ad06cee2
Roman Divacky authored Sep 05, 2012
```
llvm-svn: 163258
```
[ms-inline asm] We only need one bit to represent the AsmDialect in the · cbd2a198
Chad Rosier authored Sep 05, 2012
```
MachineInstr.

llvm-svn: 163257
```
Constify this properly. Found by gcc48 -Wcast-qual. · 9338344a
Roman Divacky authored Sep 05, 2012
```
llvm-svn: 163256
```
Constify SDNodeIterator and stop its only non-const user being cast stripped · 66526022
Roman Divacky authored Sep 05, 2012
```
of its constness. Found by gcc48 -Wcast-qual.

llvm-svn: 163254
```

Sep 05, 2012

[ms-inline asm] Propagate the asm dialect into the MachineInstr representation. · 994f4040
Chad Rosier authored Sep 05, 2012
```
llvm-svn: 163243
```
Remove unused typedefs gcc4.8 warns about. · 09c8a3dd
Roman Divacky authored Sep 05, 2012
```
llvm-svn: 163225
```

Fixed the DAG combiner to better handle the folding of AND nodes for vector... · 3f40d872

Silviu Baranga authored Sep 05, 2012

Fixed the DAG combiner to better handle the folding of AND nodes for vector types. The previous code was making the assumption that the length of the bitmask returned by isConstantSplat was equal to the size of the vector type. Now we first make sure that the splat value has at least the length of the vector lane type, then we only use as many fields as we have available in the splat value.

llvm-svn: 163203


Reorder the comments of EmitExceptionTable. · 1b170de7
Logan Chien authored Sep 05, 2012
```
llvm-svn: 163194
```

Convert vextracti128/vextractf128 intrinsics to extract_subvector at DAG build... · 2db2353b

Craig Topper authored Sep 05, 2012

Convert vextracti128/vextractf128 intrinsics to extract_subvector at DAG build time. The same was previously done for vinserti128/vinsertf128. Add patterns for folding these extract_subvectors with stores.

llvm-svn: 163192


Search the whole instruction for tied operands. · ade363e8

Jakob Stoklund Olesen authored Sep 04, 2012

Implicit uses can be dynamically tied to defs. This will soon be used
for predicated instructions on ARM.

llvm-svn: 163177


Sep 04, 2012

Typo. · d92e2bc2
Jakob Stoklund Olesen authored Sep 04, 2012
```
llvm-svn: 163154
```

Actually use the MachineOperand field for isRegTiedToDefOperand(). · 9fceda74

Jakob Stoklund Olesen authored Sep 04, 2012

The MachineOperand::TiedTo field was maintained, but not used.

This patch enables it in isRegTiedToDefOperand() and
isRegTiedToUseOperand(), which are the actual functions used by the
register allocator.

llvm-svn: 163153


Move tie checks into MachineVerifier::visitMachineOperand. · c7579cdd
Jakob Stoklund Olesen authored Sep 04, 2012
```
llvm-svn: 163152
```

Allow tied uses and defs in different orders. · 0a09da83

Jakob Stoklund Olesen authored Sep 04, 2012

After much agonizing, use a full 4 bits of precious MachineOperand space
to encode this. This uses existing padding, and doesn't grow
MachineOperand beyond its current 32 bytes.

This allows tied defs among the first 15 operands on a normal
instruction, just like the current MCInstrDesc constraint encoding.
Inline assembly needs to be able to tie more than the first 15 operands,
and gets special treatment.

Tied uses can appear beyond 15 operands, as long as they are tied to a
def that's in range.
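
A rough sketch of how a 4-bit field could cover the first 15 operands, under an assumed encoding that the message does not spell out: 0 means "not tied" and a value k means "tied to operand k - 1". The `Operand` struct and both helpers are hypothetical stand-ins, not the MachineOperand API.

```cpp
#include <cassert>

// Hypothetical stand-in for the relevant part of MachineOperand.
struct Operand {
  unsigned tiedTo : 4; // assumed encoding: 0 = not tied, k = operand k - 1
  Operand() : tiedTo(0) {}
};

// With 4 bits and 0 reserved, indices 0..14 (the first 15 operands)
// are representable.
const unsigned kMaxTiedIndex = 14;

// Record that `op` is tied to the operand at `partnerIdx`.
void tie(Operand &op, unsigned partnerIdx) {
  assert(partnerIdx <= kMaxTiedIndex && "partner index does not fit in 4 bits");
  op.tiedTo = partnerIdx + 1;
}

// Returns the partner's operand index, or -1 if `op` is not tied.
int tiedPartner(const Operand &op) {
  return op.tiedTo == 0 ? -1 : static_cast<int>(op.tiedTo) - 1;
}
```

Under this assumed encoding, a use at any position can record a def index of at most 14, which matches the statement that tied uses may appear beyond 15 operands as long as the def is in range.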

llvm-svn: 163151


Generic Bypass Slow Div · cdf540d5

Preston Gurd authored Sep 04, 2012

- CodeGenPrepare pass for identifying div/rem ops
- Backend specifies the type mapping using addBypassSlowDivType
- Enabled only for Intel Atom with O2 32-bit -> 8-bit
- Replace IDIV with instructions which test its value and use DIVB if the value
is positive and less than 256.
- In the case when the quotient and remainder of a divide are used a DIV
and a REM instruction will be present in the IR. In the non-Atom case
they are both lowered to IDIVs and CSE removes the redundant IDIV instruction,
using the quotient and remainder from the first IDIV. However,
due to this optimization CSE is not able to eliminate redundant
IDIV instructions because they are located in different basic blocks.
This is overcome by calculating both the quotient (DIV) and remainder (REM)
in each basic block that is inserted by the optimization and reusing the result
values when a subsequent DIV or REM instruction uses the same operands.
- Test cases check for the presence of the optimization when calculating
either the quotient, remainder, or both.
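
The control flow that the pass introduces can be sketched in plain C++. This is an illustration under assumptions, not the pass's output: `bypassSlowDiv` and `DivRem` are hypothetical names, and OR-ing the two operands before testing the upper bits is one assumed way to check that both are non-negative and below 256. Both quotient and remainder are produced on each path, mirroring the point about reusing results within a block.

```cpp
#include <cassert>
#include <cstdint>

struct DivRem {
  int32_t quot;
  int32_t rem;
};

// Hypothetical model of the bypass: take the cheap 8-bit unsigned divide
// when both operands fit, otherwise fall back to the full 32-bit divide.
DivRem bypassSlowDiv(int32_t dividend, int32_t divisor) {
  assert(divisor != 0 && "division by zero is undefined");
  const uint32_t bits =
      static_cast<uint32_t>(dividend) | static_cast<uint32_t>(divisor);
  if ((bits & 0xFFFFFF00u) == 0) {
    // Fast path: both values are in [0, 255], so an 8-bit divide suffices.
    const uint8_t a = static_cast<uint8_t>(dividend);
    const uint8_t b = static_cast<uint8_t>(divisor);
    return {static_cast<int32_t>(a / b), static_cast<int32_t>(a % b)};
  }
  // Slow path: ordinary signed 32-bit division (INT32_MIN / -1 is still
  // undefined here, as it is for the plain operators).
  return {dividend / divisor, dividend % divisor};
}
```

For example, `bypassSlowDiv(200, 7)` takes the fast path, while `bypassSlowDiv(-200, 7)` and `bypassSlowDiv(1000, 3)` fall through to the 32-bit divide.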

Patch by Tyler Nowicki!

llvm-svn: 163150


Sep 03, 2012
- IRBuilderify the SjLjEHPrepare pass. · 8d9890ab
  Benjamin Kramer authored Sep 03, 2012
```
No functionality change.

llvm-svn: 163115
```
- When updating live range endpoints, make sure to preserve the early clobber bit. · 90152701
  Lang Hames authored Sep 03, 2012
```
Fixes PR13719.

llvm-svn: 163107
```
Sep 02, 2012
- Fix a typo. · 10f6b880
  Nadav Rotem authored Sep 02, 2012
```
llvm-svn: 163094
```
- Generate better select code by allowing the target to use scalar select, and not sign-extend. · 500d691d
  Nadav Rotem authored Sep 02, 2012
```
llvm-svn: 163086
```
- Only legalise a VSELECT into bitwise operations if the vector mask bool is... · 2455e9c4
  Pete Cooper authored Sep 01, 2012
```
Only legalise a VSELECT into bitwise operations if the vector mask bool is zeros or all ones.  A vector bool with just ones isn't suitable for masking with.

No test case unfortunately, as I couldn't find a target which fit all
the conditions needed to hit this code.

llvm-svn: 163075
```
Sep 01, 2012
- Revert "Take account of boolean vector contents when promoting a build vector... · 2117ac40
  Pete Cooper authored Sep 01, 2012
```
Revert "Take account of boolean vector contents when promoting a build vector from i1 to some other type.  rdar://problem/12210060"

This reverts commit 5dd9e214fb92847e947f9edab170f9b4e52b908f.

Thanks to Duncan for explaining how this should have been done.

Conflicts:

	test/CodeGen/X86/vec_select.ll

llvm-svn: 163064
```
- Fix typo. · 64f361e0
  Logan Chien authored Sep 01, 2012
```
llvm-svn: 163059
```
- Teach DAG combine a number of tricks to simplify FMA expressions in fast-math mode. · 90e0eaff
  Owen Anderson authored Sep 01, 2012
```
llvm-svn: 163051
```
- Fix typo · ec385012
  Michael Liao authored Sep 01, 2012
```
llvm-svn: 163049
```
Aug 31, 2012

Add MachineInstr::tieOperands, remove setIsTied(). · 5c8eda0e

Jakob Stoklund Olesen authored Aug 31, 2012

Manage tied operands entirely internally to MachineInstr. This makes it
possible to change the representation of tied operands, as I will do
shortly.

The constraint that tied uses and defs must be in the same order was too
restrictive.

llvm-svn: 163021


Use CloneMachineInstr to make a new MI in commuteInstruction to make the code... · a8227cb7

Craig Topper authored Aug 31, 2012

Use CloneMachineInstr to make a new MI in commuteInstruction to make the code tolerant of instructions with more than two input operands.

llvm-svn: 163000


Don't enforce ordered inline asm operands. · 96f87069

Jakob Stoklund Olesen authored Aug 31, 2012

I was too optimistic: inline asm can have tied operands that don't
follow the def order.

Fixes PR13742.

llvm-svn: 162998


Take account of boolean vector contents when promoting a build vector from i1... · e969340f

Pete Cooper authored Aug 30, 2012

Take account of boolean vector contents when promoting a build vector from i1 to some other type.  rdar://problem/12210060

llvm-svn: 162960


Teach the DAG combiner to turn chains of FADDs (x+x+x+x+...) into FMULs by... · cc61f87c

Owen Anderson authored Aug 30, 2012

Teach the DAG combiner to turn chains of FADDs (x+x+x+x+...) into FMULs by constants.  This is only enabled in unsafe FP math mode, since it does not preserve rounding effects for all such constants.
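
The rewrite can be pictured with two small functions. Both names are hypothetical and the code only models the arithmetic, not the combiner: `addChain` evaluates x+x+...+x with n terms from left to right, and `mulByCount` is the single multiply the chain would be folded into.

```cpp
#include <cassert>

// Hypothetical model of the original expression: n copies of x added
// left to right, rounding after every addition.
float addChain(float x, unsigned n) {
  assert(n >= 1 && "need at least one term");
  float sum = x;
  for (unsigned i = 1; i < n; ++i)
    sum += x;
  return sum;
}

// Hypothetical model of the folded form: one multiply, one rounding.
float mulByCount(float x, unsigned n) {
  assert(n >= 1 && "need at least one term");
  return static_cast<float>(n) * x;
}
```

When every intermediate sum is exactly representable (for instance x = 1.5f with n = 4) the two forms agree. In general the chain rounds after each addition while the multiply rounds once, which is why the fold is restricted to unsafe FP math mode.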

llvm-svn: 162956


Aug 30, 2012

· ea973bda

Nadav Rotem authored Aug 30, 2012

Currently, targets that do not support selects with scalar conditions and vector operands scalarize the code. ARM is such a target
because it does not support CMOV of vectors. To implement this efficiently, we broadcast the condition bit and use a sequence of NAND-OR
to select between the two operands. This is the same sequence we use for targets that don't have vector BLENDs (like SSE2).
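
A minimal sketch of the broadcast-and-mask selection, assuming the sequence amounts to the usual and / and-not / or blend. `Vec4`, `broadcast`, `blend` and `selectVector` are hypothetical names, and a four-lane array stands in for a vector register. The assertion in `blend` reflects the requirement that each mask lane be all zeros or all ones.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

using Vec4 = std::array<uint32_t, 4>; // stand-in for a 4 x i32 vector

// Broadcast the scalar condition into every lane as all-ones or all-zeros.
Vec4 broadcast(bool cond) {
  Vec4 mask;
  mask.fill(cond ? 0xFFFFFFFFu : 0u);
  return mask;
}

// Lane-wise blend: (a & mask) | (b & ~mask). Only valid when each mask
// lane is all zeros or all ones; a lane holding just 1 would mix bits.
Vec4 blend(const Vec4 &mask, const Vec4 &a, const Vec4 &b) {
  Vec4 result{};
  for (std::size_t i = 0; i < result.size(); ++i) {
    assert((mask[i] == 0u || mask[i] == 0xFFFFFFFFu) &&
           "mask lane must be all zeros or all ones");
    result[i] = (a[i] & mask[i]) | (b[i] & ~mask[i]);
  }
  return result;
}

// select(cond, a, b) with a scalar condition and vector operands.
Vec4 selectVector(bool cond, const Vec4 &a, const Vec4 &b) {
  return blend(broadcast(cond), a, b);
}
```

Here `selectVector(true, a, b)` yields `a` and `selectVector(false, a, b)` yields `b`, without any per-lane branching.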

rdar://12201387

llvm-svn: 162926
