Commits · b5bf9b64d23a9077c39e9bf5fcd6dc005d0c4057 · Roger Ferrer / llvm-epi-0.8

Jan 28, 2013
- Extracted ObjCARC.cpp into its own library libLLVMObjCARCOpts in preparation... · 79d8d812
  Michael Gottesman authored Jan 28, 2013
```
Extracted ObjCARC.cpp into its own library libLLVMObjCARCOpts in preparation for refactoring the ARC Optimizer.

llvm-svn: 173647
```
  79d8d812
Jan 27, 2013
- Legalizer: Reword comment again, per Duncan's suggestion. · cf9dae17
  Benjamin Kramer authored Jan 27, 2013
```
llvm-svn: 173625
```
  cf9dae17
- Legalizer: Add an assert and tweak a comment to clarify the assumptions this code makes. · 084e675e
  Benjamin Kramer authored Jan 27, 2013
```
llvm-svn: 173620
```
  084e675e
- When the legalizer is splitting vector shifts, the result may not have the right shift amount type. · 05cc9396
  Benjamin Kramer authored Jan 27, 2013
```
Fix that by adding a cast to the shift expander. This came up with vector shifts
on sse-less X86 CPUs.

   <2 x i64>       = shl <2 x i64> <2 x i64>
-> i64,i64         = shl i64 i64; shl i64 i64
-> i32,i32,i32,i32 = shl_parts i32 i32 i64; shl_parts i32 i32 i64

Now we cast the last two i64s to the right type. Fixes the crash in PR14668.

llvm-svn: 173615
```
  05cc9396
Jan 25, 2013

Use const reference instead of vector copying. · c641adae
Jakub Staszak authored Jan 25, 2013
```
llvm-svn: 173497
```
c641adae

This patch aims to reduce compile time in LegalizeTypes by using SmallDenseMap, · 0959bb70

Preston Gurd authored Jan 25, 2013

with an initial number of elements,  instead of DenseMap, which has
zero initial elements, in order to avoid the copying of elements
when the size changes and to avoid allocating space every time
LegalizeTypes is run. This patch will not affect the memory footprint,
because DenseMap will increase the element size to 64
when the first element is added.

Patch by Wan Xiaofei.

llvm-svn: 173448

0959bb70

MIsched: Print block name. No functionality. · 54b2ce38
Andrew Trick authored Jan 25, 2013
```
llvm-svn: 173433
```
54b2ce38
MachineScheduler support for viewGraph. · ea9fd951
Andrew Trick authored Jan 25, 2013
```
llvm-svn: 173432
```
ea9fd951
ScheduleDAG: colorize the DOT graph and improve formatting. · b36388a1
Andrew Trick authored Jan 25, 2013
```
llvm-svn: 173431
```
b36388a1
ScheduleDAG: Added isBoundaryNode to conveniently detect a common corner case. · 646eeb66
Andrew Trick authored Jan 25, 2013
```
This fixes DAG subtree analysis at the boundary.

llvm-svn: 173427
```
646eeb66

SchedDFS: Complete support for nested subtrees. · ffc8097c

Andrew Trick authored Jan 25, 2013

Maintain separate per-node and per-tree book-keeping.
Track all instructions above a DAG node including nested subtrees.
Seperately track instructions within a subtree.
Record subtree parents.

llvm-svn: 173426

ffc8097c

MIsched: Improve the interface to SchedDFS analysis (subtrees). · e2c3f5c9

Andrew Trick authored Jan 25, 2013

Allow the strategy to select SchedDFS. Allow the results of SchedDFS
to affect initialization of the scheduler state.

llvm-svn: 173425

e2c3f5c9

SchedDFS: Initial support for nested subtrees. · 5b07eeb2

Andrew Trick authored Jan 25, 2013

This is mostly refactoring, along with adding an instruction count
within the subtrees and ensuring we only look at data edges.

llvm-svn: 173420

5b07eeb2

MISched: Add SchedDFSResult to ScheduleDAGMI to formalize the · 44f750a3
Andrew Trick authored Jan 25, 2013
```
interface and allow other strategies to select it.

llvm-svn: 173413
```
44f750a3

SchedDFS: Refactor and tweak the subtree selection criteria. · b52a8564

Andrew Trick authored Jan 25, 2013

For sanity, create a root when NumDataSuccs >= 4. Splitting large
subtrees will no longer be detrimental after my next checkin to handle
nested tree. A magic number of 4 is fine because single subtrees
seldom rejoin more than this. It makes subtrees easier to visualize
and heuristics more sane.

llvm-svn: 173399

b52a8564

Avoid creating duplicate CFG edges in the IfConversion pass. · e0ef4743
Jakob Stoklund Olesen authored Jan 24, 2013
```
Patch by Stefan Hepp.

llvm-svn: 173395
```
e0ef4743

Jan 24, 2013
- MachineScheduler: enable biasCriticalPath for all DAGs. · 92da4240
  Andrew Trick authored Jan 24, 2013
```
llvm-svn: 173318
```
  92da4240
- MIsched: Added biasCriticalPath. · d3b8629a
  Andrew Trick authored Jan 24, 2013
```
Allow schedulers to order DAG edges by critical path. This makes
DFS-based heuristics more stable and effective.

llvm-svn: 173317
```
  d3b8629a
Jan 23, 2013

Add the heuristic to differentiate SSPStrong from SSPRequired. · 7c8f96a9

Bill Wendling authored Jan 23, 2013

The requirements of the strong heuristic are:

* A Protector is required for functions which contain an array, regardless of
  type or length.

* A Protector is required for functions which contain a structure/union which
  contains an array, regardless of type or length.  Note, there is no limit to
  the depth of nesting.

* A protector is required when the address of a local variable (i.e., stack
  based variable) is exposed. (E.g., such as through a local whose address is
  taken as part of the RHS of an assignment or a local whose address is taken as
  part of a function argument.)

llvm-svn: 173231

7c8f96a9

Add the IR attribute 'sspstrong'. · d154e283

Bill Wendling authored Jan 23, 2013

SSPStrong applies a heuristic to insert stack protectors in these situations:

* A Protector is required for functions which contain an array, regardless of
  type or length.

* A Protector is required for functions which contain a structure/union which
  contains an array, regardless of type or length.  Note, there is no limit to
  the depth of nesting.

* A protector is required when the address of a local variable (i.e., stack
  based variable) is exposed. (E.g., such as through a local whose address is
  taken as part of the RHS of an assignment or a local whose address is taken as
  part of a function argument.)

This patch implements the SSPString attribute to be equivalent to
SSPRequired. This will change in a subsequent patch.

llvm-svn: 173230

d154e283

Jan 22, 2013

Make APFloat constructor require explicit semantics. · 29178a34

Tim Northover authored Jan 22, 2013

Previously we tried to infer it from the bit width size, with an added
IsIEEE argument for the PPC/IEEE 128-bit case, which had a default
value. This default value allowed bugs to creep in, where it was
inappropriate.

llvm-svn: 173138

29178a34

Jan 21, 2013

Introduce a new data structure, the SparseMultiSet, and changes to the MI scheduler to use it. · 3e3194f4

Michael Ilseman authored Jan 21, 2013

A SparseMultiSet adds multiset behavior to SparseSet, while retaining SparseSet's desirable properties. Essentially, SparseMultiSet provides multiset behavior by storing its dense data in doubly linked lists that are inlined into the dense vector. This allows it to provide good data locality as well as vector-like constant-time clear() and fast constant time find(), insert(), and erase(). It also allows SparseMultiSet to have a builtin recycler rather than keeping SparseSet's behavior of always swapping upon removal, which allows it to preserve more iterators. It's often a better alternative to a SparseSet of a growable container or vector-of-vector.

llvm-svn: 173064

3e3194f4

Jan 20, 2013

· 9450fcff

Nadav Rotem authored Jan 20, 2013

Revert 172708.

The optimization handles esoteric cases but adds a lot of complexity both to the X86 backend and to other backends.
This optimization disables an important canonicalization of chains of SEXT nodes and makes SEXT and ZEXT asymmetrical.
Disabling the canonicalization of consecutive SEXT nodes into a single node disables other DAG optimizations that assume
that there is only one SEXT node. The AVX mask optimizations is one example. Additionally this optimization does not update the cost model.

llvm-svn: 172968

9450fcff

The last of PR14471 - emission of constant floats · a39a76ef
David Blaikie authored Jan 20, 2013
```
llvm-svn: 172941
```
a39a76ef

Jan 18, 2013
- Split out DW_OP_addr for the split debug info DWARF5 proposal. · e9ec2458
  Eric Christopher authored Jan 18, 2013
```
llvm-svn: 172857
```
  e9ec2458
- Use AttributeSet accessor methods instead of Attribute accessor methods. · 658d24d2
  Bill Wendling authored Jan 18, 2013
```
Further encapsulation of the Attribute object. Don't allow direct access to the
Attribute object as an aggregate.

llvm-svn: 172853
```
  658d24d2
- Remove unused parameter. Also use the AttributeSet query methods instead of... · 4f972ea2
  Bill Wendling authored Jan 18, 2013
```
Remove unused parameter. Also use the AttributeSet query methods instead of the Attribute query methods.

llvm-svn: 172852
```
  4f972ea2
- [MC/Mach-O] Implement integrated assembler support for linker options. · 95856128
  Daniel Dunbar authored Jan 18, 2013
```
 - Also, fixup syntax errors in LangRef and missing newline in the MCAsmStreamer.

llvm-svn: 172837
```
  95856128
Jan 17, 2013

Optimization for the following SIGN_EXTEND pairs: · f6a30e05

Elena Demikhovsky authored Jan 17, 2013

v8i8  -> v8i64, 
v8i8  -> v8i32, 
v4i8  -> v4i64, 
v4i16 -> v4i64 
for AVX and AVX2.

Bug 14865.

llvm-svn: 172708

f6a30e05

Fix the assembly and dissassembly of DW_FORM_sec_offset. Found this by · 4c7765f1

Eric Christopher authored Jan 17, 2013

changing both the string of the dwo_name to be correct and the type of
the statement list.

Testcases all around.

llvm-svn: 172699

4c7765f1

Add the DW_AT_GNU_addr_base for the skeleton cu. Add support for · 18266171
Eric Christopher authored Jan 17, 2013
```
emitting the dwarf32 version of DW_FORM_sec_offset and correct
disassembler support.

llvm-svn: 172698
```
18266171
Move MachineTraceMetrics.h into include/llvm/CodeGen. · 965665bb
Jakob Stoklund Olesen authored Jan 17, 2013
```
Let targets use it.

llvm-svn: 172688
```
965665bb

Provide a place for targets to insert ILP optimization passes. · 213a2f8b

Jakob Stoklund Olesen authored Jan 17, 2013

Move the early if-conversion pass into this group.

ILP optimizations usually need to find the right balance between
register pressure and ILP using the MachineTraceMetrics analysis to
identify critical paths and estimate other costs. Such passes should run
together so they can share dominator tree and loop info analyses.

Besides if-conversion, future passes to run here here could include
expression height reduction and ARM's MLxExpansion pass.

llvm-svn: 172687

213a2f8b

Jan 16, 2013

Define metadata interfaces for describing a static data member · 4d23a4ae

Eric Christopher authored Jan 16, 2013

of a class. Emit static data member declarations and definitions
through correctly.

Part of PR14471.

Patch by Paul Robinson!

llvm-svn: 172590

4d23a4ae

Split address information for DWARF5 split dwarf proposal. This involves · 962c9089

Eric Christopher authored Jan 15, 2013

using the DW_FORM_GNU_addr_index and a separate .debug_addr section which
stays in the executable and is fully linked.

Sneak in two other small changes:

a) Print out the debug_str_offsets.dwo section.
b) Change form we're expecting the entries in the debug_str_offsets.dwo
   section to take from ULEB128 to U32.

Add tests for all of this in the fission-cu.ll test.

llvm-svn: 172578

962c9089

Jan 14, 2013

This patch addresses an incorrect transformation in the DAG combiner. · d006c693

Bill Schmidt authored Jan 14, 2013

The included test case is derived from one of the GCC compatibility tests.
The problem arises after the selection DAG has been converted to type-legalized
form. The combiner first sees a 64-bit load that can be converted into a
pre-increment form. The original load feeds into a SRL that isolates the
upper 32 bits of the loaded doubleword. This looks like an opportunity for
DAGCombiner::ReduceLoadWidth() to replace the 64-bit load with a 32-bit load.

However, this transformation is not valid, as the replacement load is not
a pre-increment load. The pre-increment load produces an extra result,
which feeds a subsequent add instruction. The replacement load only has
one result value, and this value is propagated to all uses of the pre-
increment load, including the add. Because the add is looking for the
second result value as its operand, it ends up attempting to add a constant
to a token chain, resulting in a crash.

So the patch simply disables this transformation for any load with more than
two result values.

llvm-svn: 172480

d006c693

Jan 12, 2013

When lowering an inreg sext first shift left, then right arithmetically. · 5ea0349e
Benjamin Kramer authored Jan 12, 2013
```
Shifting right two times will only yield zero. Should fix
SingleSource/UnitTests/SignlessTypes/factor.

llvm-svn: 172322
```
5ea0349e

Limit the search space in RAGreedy::tryEvict(). · 3dd236cd

Jakob Stoklund Olesen authored Jan 12, 2013

When tryEvict() is looking for a cheaper register in the allocation
order, skip the tail of too expensive registers when possible.

llvm-svn: 172281

3dd236cd

Precompute some information about register costs. · 8f644449

Jakob Stoklund Olesen authored Jan 12, 2013

Remember the minimum cost of the registers in an allocation order and
the number of registers at the end of the allocation order that have the
same cost per use.

This information can be used to limit the search space for
RAGreedy::tryEvict() when looking for a cheaper register.

llvm-svn: 172280

8f644449

Jan 11, 2013
- PPC: Implement efficient lowering of sign_extend_inreg. · dbe5c72d
  Nadav Rotem authored Jan 11, 2013
```
llvm-svn: 172269
```
  dbe5c72d