Commits · 8bcc971174766c582506c8232c55cf2fba5376b6 · Roger Ferrer / llvm-epi-0.8

Aug 29, 2012

Make MemoryBuiltins aware of TargetLibraryInfo. · 8bcc9711

Benjamin Kramer authored Aug 29, 2012

This disables malloc-specific optimization when -fno-builtin (or -ffreestanding)
is specified. This has been a problem for a long time but became more severe
with the recent memory builtin improvements.

Since the memory builtin functions are used everywhere, this required passing
TLI in many places. This means that functions that now have an optional TLI
argument, like RecursivelyDeleteTriviallyDeadFunctions, won't remove dead
mallocs anymore if the TLI argument is missing. I've updated most passes to do
the right thing.

Fixes PR13694 and probably others.

llvm-svn: 162841

8bcc9711

Convert FMA4 patterns to use target specific nodes instead of intrinsics to align with FMA3. · a999c662
Craig Topper authored Aug 29, 2012
```
llvm-svn: 162829
```
a999c662
Add virtual keywords for methods that override the base class. · 5f96ca51
Craig Topper authored Aug 29, 2012
```
llvm-svn: 162826
```
5f96ca51
Cleanup sloppy code. Jakob's review. · b57e2257
Andrew Trick authored Aug 29, 2012
```
llvm-svn: 162825
```
b57e2257
[arm-fast-isel] Add support for ARM PIC. · e87e559e
Jush Lu authored Aug 29, 2012
```
llvm-svn: 162823
```
e87e559e

Fix ARM vector copies of overlapping register tuples. · bd0073dd

Andrew Trick authored Aug 29, 2012

I have tested the fix, but have not been successfull in generating
a robust unit test. This can only be exposed through particular
register assignments.

llvm-svn: 162821

bd0073dd

cleanup · 4cc6949a
Andrew Trick authored Aug 29, 2012
```
llvm-svn: 162820
```
4cc6949a

Verify the tied operand flags. · dbbff789

Jakob Stoklund Olesen authored Aug 29, 2012

WHen running with -verify-machineinstrs, check that tied operands come
in matching use/def pairs, and that they are consistent with MCInstrDesc
when it applies.

llvm-svn: 162816

dbbff789

Maintain a vaild isTied bit as operands are added and removed. · 2b166645

Jakob Stoklund Olesen authored Aug 29, 2012

The isTied bit is set automatically when a tied use is added and
MCInstrDesc indicates a tied operand. The tie is broken when one of the
tied operands is removed.

llvm-svn: 162814

2b166645

Typo. · 3b1336ce
Chad Rosier authored Aug 28, 2012
```
llvm-svn: 162807
```
3b1336ce
Add comments on the literal value used. · 407d659f
Michael Liao authored Aug 28, 2012
```
llvm-svn: 162805
```
407d659f

Profile: set branch weight metadata with data generated from profiling. · abbb01ab

Manman Ren authored Aug 28, 2012

This patch implements ProfileDataLoader which loads profile data generated by
-insert-edge-profiling and updates branch weight metadata accordingly.

Patch by Alastair Murray.

llvm-svn: 162799

abbb01ab

Aug 28, 2012

The instruction DEXT may be transformed into DEXTU or DEXTM depending · cd6b0e13

Jack Carter authored Aug 28, 2012

on the size of the extraction and its position in the 64 bit word.

This patch allows support of the dext transformations with mips64 direct
object output.

0 <= msb < 32 0 <= lsb < 32 0 <= pos < 32 1 <= size <= 32
DINS
The field is entirely contained in the right-most word of the doubleword

32 <= msb < 64 0 <= lsb < 32 0 <= pos < 32 2 <= size <= 64
DINSM
The field straddles the words of the doubleword

32 <= msb < 64 32 <= lsb < 64 32 <= pos < 64 1 <= size <= 32
DINSU
The field is entirely contained in the left-most word of the doubleword

llvm-svn: 162782

cd6b0e13

Explicitly update the number of nodes to be traversed · 710e1a59
Michael Liao authored Aug 28, 2012
```
llvm-svn: 162780
```
710e1a59

Some instructions are passed to the assembler to be · c20a21b8

Jack Carter authored Aug 28, 2012

transformed to the final instruction variant. An
example would be dsrll which is transformed into 
dsll32 if the shift value is greater than 32.

For direct object output we need to do this transformation
in the codegen. If the instruction was inside branch
delay slot, it was being missed. This patch corrects this
oversight.

llvm-svn: 162779

c20a21b8

Emit word of zeroes after the last instruction as a start of the mandatory · 8c4b6a30

Roman Divacky authored Aug 28, 2012

traceback table on PowerPC64. This helps gdb handle exceptions. The other
mandatory fields are ignored by gdb and harder to implement so just add
there a FIXME.

Patch by Bill Schmidt. PR13641.

llvm-svn: 162778

8c4b6a30

Follow-up patch to r162731. · 206cefe6

Akira Hatanaka authored Aug 28, 2012

Fix a couple of bugs in mips' long branch pass.
This patch was supposed to be committed along with r162731, so I don't have a
new test case.

llvm-svn: 162777

206cefe6

Add a MachineOperand::isTied() flag. · e56c60c5

Jakob Stoklund Olesen authored Aug 28, 2012

While in SSA form, a MachineInstr can have pairs of tied defs and uses.
The tied operands are used to represent read-modify-write operands that
must be assigned the same physical register.

Previously, tied operand pairs were computed from fixed MCInstrDesc
fields, or by using black magic on inline assembly instructions.

The isTied flag makes it possible to add tied operands to any
instruction while getting rid of (some of) the inlineasm magic.

Tied operands on normal instructions are needed to represent predicated
individual instructions in SSA form. An extra <tied,imp-use> operand is
required to represent the output value when the instruction predicate is
false.

Adding a predicate to:

  %vreg0<def> = ADD %vreg1, %vreg2

Will look like:

  %vreg0<tied,def> = ADD %vreg1, %vreg2, pred:3, %vreg7<tied,imp-use>

The virtual register %vreg7 is the value given to %vreg0 when the
predicate is false. It will be assigned the same physreg as %vreg0.

This commit adds the isTied flag and sets it based on MCInstrDesc when
building an instruction. The flag is not used for anything yet.

llvm-svn: 162774

e56c60c5

Don't allow TargetFlags on MO_Register MachineOperands. · dba99d0d

Jakob Stoklund Olesen authored Aug 28, 2012

Register operands are manipulated by a lot of target-independent code,
and it is not always possible to preserve target flags. That means it is
not safe to use target flags on register operands.

None of the targets in the tree are using register operand target flags.
External targets should be using immediate operands to annotate
instructions with operand modifiers.

llvm-svn: 162770

dba99d0d

Add PPC Freescale e500mc and e5500 subtargets. · 742b535e

Hal Finkel authored Aug 28, 2012

Add subtargets for Freescale e500mc (32-bit) and e5500 (64-bit) to
the PowerPC backend.

Patch by Tobias von Koch.

llvm-svn: 162764

742b535e

InstCombine: Defensively avoid undefined shifts by limiting the amount to the bit width. · 1e1a1ded

Benjamin Kramer authored Aug 28, 2012

No test case, undefined shifts get folded early, but can occur when other
transforms generate a constant. Thanks to Duncan for bringing this up.

llvm-svn: 162755

1e1a1ded

InstCombine: Guard the transform introduced in r162743 against large ints and non-const shifts. · 9c0a807c
Benjamin Kramer authored Aug 28, 2012
```
llvm-svn: 162751
```
9c0a807c

· d457787f

Nadav Rotem authored Aug 28, 2012

Make sure that we don't call getZExtValue on values > 64 bits.
Thanks Benjamin for noticing this.

llvm-svn: 162749

d457787f

· 11935b29

Nadav Rotem authored Aug 28, 2012

Teach InstCombine to canonicalize  [SU]div+[AL]shl patterns.

For example:
  %1 = lshr i32 %x, 2
  %2 = udiv i32 %1, 100

rdar://12182093

llvm-svn: 162743

11935b29

The commutative flag is already correctly set within the multiclass. If we set · cc567180
Bill Wendling authored Aug 28, 2012
```
it here, then a 'register-memory' version would wrongly get the commutative
flag.
<rdar://problem/12180135>

llvm-svn: 162741
```
cc567180
Convert V_SETALLONES/AVX_SETALLONES/AVX2_SETALLONES to Post-RA pseudos. · 72f51c39
Craig Topper authored Aug 28, 2012
```
llvm-svn: 162740
```
72f51c39
Merge AVX_SET0PSY/AVX_SET0PDY/AVX2_SET0 into a single post-RA pseudo. · bd509eea
Craig Topper authored Aug 28, 2012
```
llvm-svn: 162738
```
bd509eea

Fix PR12312 · b7d85b63

Michael Liao authored Aug 28, 2012

- Add a target-specific DAG optimization to recognize a pattern PTEST-able.
  Such a pattern is a OR'd tree with X86ISD::OR as the root node. When
  X86ISD::OR node has only its flag result being used as a boolean value and
  all its leaves are extracted from the same vector, it could be folded into an
  X86ISD::PTEST node.

llvm-svn: 162735

b7d85b63

Remove extra MayLoad/MayStore flags from atomic_load/store. · 87cb471e

Jakob Stoklund Olesen authored Aug 28, 2012

These extra flags are not required to properly order the atomic
load/store instructions. SelectionDAGBuilder chains atomics as if they
were volatile, and SelectionDAG::getAtomic() sets the isVolatile bit on
the memory operands of all atomic operations.

The volatile bit is enough to order atomic loads and stores during and
after SelectionDAG.

This means we set mayLoad on atomic_load, mayStore on atomic_store, and
mayLoad+mayStore on the remaining atomic read-modify-write operations.

llvm-svn: 162733

87cb471e

Revert r162713: "Add ATOMIC_LDR* pseudo-instructions to model atomic_load on ARM." · b3de7b17

Jakob Stoklund Olesen authored Aug 28, 2012

This wasn't the right way to enforce ordering of atomics.

We are already setting the isVolatile bit on memory operands of atomic
operations which is good enough to enforce the correct ordering.

llvm-svn: 162732

b3de7b17

Fix mips' long branch pass. · b5af7121

Akira Hatanaka authored Aug 28, 2012

Instructions emitted to compute branch offsets now use immediate operands
instead of symbolic labels. This change was needed because there were problems
when R_MIPS_HI16/LO16 relocations were used to make shared objects.

llvm-svn: 162731

b5af7121

Split several PPC instruction classes. · 679c73cb

Hal Finkel authored Aug 28, 2012

Slight reorganisation of PPC instruction classes for scheduling. No
functionality change for existing subtargets.
 - Clearly separate load/store-with-update instructions from regular loads and stores.
 - Split IntRotateD -> IntRotateD and IntRotateDI
 - Split out fsub and fadd from FPGeneral -> FPAddSub
 - Update existing itineraries

Patch by Tobias von Koch.

llvm-svn: 162729

679c73cb

Fix bug 13532. · adb14f56

Akira Hatanaka authored Aug 28, 2012

In SelectionDAGLegalize::ExpandLegalINT_TO_FP, expand INT_TO_FP nodes without
using any f64 operations if f64 is not a legal type.

Patch by Stefan Kristiansson. 

llvm-svn: 162728

adb14f56

Allow remat of LI on PPC. · 686f2ee2

Hal Finkel authored Aug 28, 2012

Allow load-immediates to be rematerialised in the register coalescer for
PPC. This makes test/CodeGen/PowerPC/big-endian-formal-args.ll fail,
because it relies on a register move getting emitted. The immediate load is
equivalent, so change this test case.

Patch by Tobias von Koch.

llvm-svn: 162727

686f2ee2

Add the Freescale vendor to Triple. · b5d177e5

Hal Finkel authored Aug 28, 2012

Adds the vendor 'fsl' (used by Freescale SDK) to Triple. This will allow
clang support for Freescale cross-compile configurations.

Patch by Tobias von Koch.

llvm-svn: 162726

b5d177e5

Eliminate redundant CR moves on PPC32. · 5ab37803

Hal Finkel authored Aug 28, 2012

The 32-bit ABI requires CR bit 6 to be set if the call has fp arguments and
unset if it doesn't. The solution up to now was to insert a MachineNode to
set/unset the CR bit, which produces a CR vreg. This vreg was then copied
into CR bit 6. When the register allocator saw a bunch of these in the same
function, it allocated the set/unset CR bit in some random CR register (1
extra instruction) and then emitted CR moves before every vararg function
call, rather than just setting and unsetting CR bit 6 directly before every
vararg function call. This patch instead inserts a PPCcrset/PPCcrunset
instruction which are then matched by a dedicated instruction pattern.

Patch by Tobias von Koch.

llvm-svn: 162725

5ab37803

Optimize zext on PPC64. · e39526a7

Hal Finkel authored Aug 28, 2012

The zeroextend IR instruction is lowered to an 'and' node with an immediate
mask operand, which in turn gets legalised to a sequence of ori's & ands.
This can be done more efficiently using the rldicl instruction.

Patch by Tobias von Koch.

llvm-svn: 162724

e39526a7

More missing mayLoad flags on AVX multiclasses. · 89d6b29d
Jakob Stoklund Olesen authored Aug 28, 2012
```
llvm-svn: 162714
```
89d6b29d

Add ATOMIC_LDR* pseudo-instructions to model atomic_load on ARM. · b24cb8c5

Jakob Stoklund Olesen authored Aug 27, 2012

It is not safe to use normal LDR instructions because they may be
reordered by the scheduler. The ATOMIC_LDR pseudos have a mayStore flag
that prevents reordering.

Atomic loads are also prevented from participating in rematerialization
and load folding.

llvm-svn: 162713

b24cb8c5

Fix compile error when building with C++11 - clang thinks that PRIx64 is a... · ef271cce
Marshall Clow authored Aug 27, 2012
```
Fix compile error when building with C++11 - clang thinks that PRIx64 is a user-defined suffix or something

llvm-svn: 162704
```
ef271cce