Commits · 048f99de112c306c928acbc0ccf7fccc2d4f7b39 · Roger Ferrer / llvm-epi-0.8

May 27, 2013

Convert sqrt functions into sqrt instructions when -ffast-math is in effect. · 048f99de

Preston Gurd authored May 27, 2013

When -ffast-math is in effect (on Linux, at least), clang defines
__FINITE_MATH_ONLY__ > 0 when including <math.h>. This causes the
preprocessor to include <bits/math-finite.h>, which renames the sqrt functions.
For instance, "sqrt" is renamed as "__sqrt_finite". 

This patch adds the 3 new names in such a way that they will be treated
as equivalent to their respective original names.
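
The renaming described above can be pictured with a small sketch. This is not the code from the patch: the helper names are hypothetical, and the float and long double aliases are assumed to follow the same pattern as the "__sqrt_finite" name quoted in the message.

```cpp
#include <cassert>
#include <string>

// Hypothetical helper (not LLVM's actual code): map the "finite" aliases
// back to the standard sqrt names. Only "__sqrt_finite" is quoted in the
// commit message; the other two spellings are assumed by analogy.
std::string canonicalSqrtName(const std::string &Name) {
  if (Name == "__sqrt_finite")
    return "sqrt";
  if (Name == "__sqrtf_finite")
    return "sqrtf";
  if (Name == "__sqrtl_finite")
    return "sqrtl";
  return Name; // Anything else is left untouched.
}

// True when a call to Name may be treated like a call to a sqrt function.
bool isSqrtLike(const std::string &Name) {
  const std::string C = canonicalSqrtName(Name);
  return C == "sqrt" || C == "sqrtf" || C == "sqrtl";
}
```

With such a mapping, a later transformation only needs to ask isSqrtLike() and does not care which spelling the preprocessor produced.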

llvm-svn: 182739

048f99de

Add a cpu to try to bring back the atom bots. · cca5f562
Rafael Espindola authored May 27, 2013
```
llvm-svn: 182734
```
cca5f562

PPC: Add a isConsecutiveLS utility function · 8ebfe6c2

Hal Finkel authored May 27, 2013

isConsecutiveLS is a slightly more general form of
SelectionDAG::isConsecutiveLoad. In addition to handling stores, it does not
assume that equality of the chain operands is necessary. In the case of the PPC
backend, this chain condition is checked in a more general way by the
surrounding code.
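
As a rough illustration of the check, assuming a heavily simplified model: MemAccess and isConsecutiveAccess below are hypothetical names, not LLVM types or functions, and the base pointer is reduced to an integer id.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical, simplified memory access; not an LLVM type.
struct MemAccess {
  int BaseId;          // Identifies the base pointer.
  std::int64_t Offset; // Byte offset from that base.
  bool IsStore;        // Loads and stores are treated alike.
};

// True when A accesses the location Dist units of Bytes bytes away from B.
// No chain comparison is made here; the caller is assumed to check ordering.
bool isConsecutiveAccess(const MemAccess &A, const MemAccess &B,
                         unsigned Bytes, int Dist) {
  return A.BaseId == B.BaseId &&
         A.Offset == B.Offset + static_cast<std::int64_t>(Bytes) * Dist;
}
```

The sketch deliberately ignores IsStore and any notion of a chain, mirroring the point that the chain condition is left to the surrounding code.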

Mostly, this is part of the refactoring in preparation for supporting optimized
unaligned stores.

llvm-svn: 182723

8ebfe6c2

llvm-objdump.cpp: Appease MSC16 x64. utostr(n++) causes internal compiler error. · d5c2e60b
NAKAMURA Takumi authored May 27, 2013
```
llvm-svn: 182722
```
d5c2e60b

May 26, 2013

Prefer to duplicate PPC Altivec loads when expanding unaligned loads · 7d8a691b

Hal Finkel authored May 26, 2013

When expanding unaligned Altivec loads, we use the decremented offset trick to
prevent page faults. Unfortunately, if we have a sequence of consecutive
unaligned loads, this leads to suboptimal code generation because the 'extra'
load from the first unaligned load can be combined with the base load from the
second (but only if the decremented offset trick is not used for the first).
Search up and down the chain, through loads and token factors, looking for
consecutive loads, and if one is found, don't use the offset reduction trick.
These duplicate loads are later combined to yield the desired sequence (in the
future, we might want a more-powerful chain search, but that will require some
changes to allow the combiner routines to access the AA object).
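
A small address-level sketch of why the trick gets in the way, under the assumption that vectors are 16 bytes and addresses are plain integers (the function names are hypothetical):

```cpp
#include <cassert>
#include <cstddef>

// Round an address down to the enclosing 16-byte block.
std::size_t alignDown16(std::size_t Addr) {
  return Addr & ~static_cast<std::size_t>(15);
}

// Block read by the base load of an unaligned load at Addr.
std::size_t baseLoadBlock(std::size_t Addr) { return alignDown16(Addr); }

// Block read by the 'extra' load: Addr + 15 with the decremented offset
// trick, Addr + 16 without it.
std::size_t extraLoadBlock(std::size_t Addr, bool UseDecrementedOffset) {
  return alignDown16(Addr + (UseDecrementedOffset ? 15 : 16));
}
```

Without the trick, the extra load for P uses the same address expression as the base load for P + 16, so the two can be merged. With the trick, the two differ whenever P happens to be aligned, so they cannot be treated as duplicates in general.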

This should complete the initial implementation of the optimized unaligned
Altivec load expansion. There is some refactoring that should be done, but
that will happen when the unaligned store expansion is added.

llvm-svn: 182719

7d8a691b

Add LDC compiler to list of external OS projects using LLVM 3.3 · 4157b371
Kai Nacke authored May 26, 2013
```
llvm-svn: 182718
```
4157b371
Fix PR16143: Insert DEBUG_VALUE before terminator. · c66d26ad
Andrew Trick authored May 26, 2013
```
llvm-svn: 182717
```
c66d26ad
Fixed bug when tests in executable partially used absolute paths. · a035f3b2
Galina Kistanova authored May 26, 2013
```
llvm-svn: 182715
```
a035f3b2

Disable the StringMapEntry copy constructor, to make sure we · 4093afda

Chris Lattner authored May 25, 2013

reject things like: "for (auto Entry : SomeStringMap)".  Previously
this would copy the value but not the tail allocated string data
(the key).
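
A minimal sketch of the effect, using a hypothetical Entry type in place of the real StringMapEntry and the C++11 "= delete" spelling (how the constructor was actually disabled in the patch is not stated here):

```cpp
#include <cassert>
#include <list>
#include <type_traits>

// Hypothetical stand-in for an entry whose key data lives outside the
// object proper; this is not the real llvm::StringMapEntry.
struct Entry {
  explicit Entry(int V) : Value(V) {}
  Entry(const Entry &) = delete; // Copying is rejected at compile time.
  Entry &operator=(const Entry &) = delete;
  int Value;
};

static_assert(!std::is_copy_constructible<Entry>::value,
              "by-value iteration must not compile");

// Iterating by reference still works; `for (auto E : Entries)` would not
// compile because it needs the deleted copy constructor.
int sumValues(const std::list<Entry> &Entries) {
  int Sum = 0;
  for (const auto &E : Entries)
    Sum += E.Value;
  return Sum;
}
```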

llvm-svn: 182713

4093afda

May 25, 2013

Add support for DWARF line number table entries for values in the instruction · 80cbcd2d
Cameron Zwarich authored May 25, 2013
```
stream.

llvm-svn: 182712
```
80cbcd2d
Add some comments to the stringify function. · 5bed56d2
Eric Christopher authored May 25, 2013
```
llvm-svn: 182710
```
5bed56d2

PPC: Combine duplicate (offset) lvsl Altivec intrinsics · bc2ee4c4

Hal Finkel authored May 25, 2013

The lvsl permutation control instruction is a function only of the alignment of
the pointer operand (relative to the 16-byte natural alignment of Altivec
vectors). As a result, multiple lvsl intrinsics where the operands differ by a
multiple of 16 can be combined.
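
The observation can be sketched as follows, modelling the lvsl result only by the shift amount it encodes (lvslShift and sameLvslResult are hypothetical names, not intrinsics):

```cpp
#include <cassert>
#include <cstdint>

// The permutation control depends only on the low four address bits,
// i.e. the misalignment relative to a 16-byte boundary.
unsigned lvslShift(std::uintptr_t Addr) {
  return static_cast<unsigned>(Addr & 15u);
}

// Two lvsl results can be shared when both addresses have the same
// misalignment, which holds whenever they differ by a multiple of 16.
bool sameLvslResult(std::uintptr_t A, std::uintptr_t B) {
  return lvslShift(A) == lvslShift(B);
}
```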

llvm-svn: 182708

bc2ee4c4

Track IR ordering of SelectionDAG nodes 4/4. · 8972aba1
Andrew Trick authored May 25, 2013
```
Unit test cases for -pre-RA-sched=source.

llvm-svn: 182706
```
8972aba1

Track IR ordering of SelectionDAG nodes 3/4. · e2431c64

Andrew Trick authored May 25, 2013

Remove the old IR ordering mechanism and switch to new one.  Fix unit
test failures.

llvm-svn: 182704

e2431c64

Track IR ordering of SelectionDAG nodes 2/4. · ef9de2a7

Andrew Trick authored May 25, 2013

Change SelectionDAG::getXXXNode() interfaces as well as call sites of
these functions to pass in SDLoc instead of DebugLoc.

llvm-svn: 182703

ef9de2a7

Track IR ordering of SelectionDAG nodes 1/4. · 175143bf

Andrew Trick authored May 25, 2013

Use a field in the SelectionDAGNode object to track its IR ordering.
This adds fields and utility classes without changing existing
interfaces or functionality.
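
To illustrate the idea only: the Node type below is a hypothetical simplification, not the real SelectionDAG node class, and the sort is just one way such a field could be consumed.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical, simplified node; not the real SelectionDAG node class.
struct Node {
  const char *Name;
  unsigned IROrder; // Position of the originating instruction in the IR.
};

// Order nodes by the IR position they were created for. stable_sort keeps
// nodes that share an order in their existing relative order.
void sortBySourceOrder(std::vector<Node> &Nodes) {
  std::stable_sort(Nodes.begin(), Nodes.end(),
                   [](const Node &A, const Node &B) {
                     return A.IROrder < B.IROrder;
                   });
}
```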

llvm-svn: 182701

175143bf

Fix RecyclingAllocator::PrintStats to print the underlying allocator's stats. · fc1c5fe9
Andrew Trick authored May 25, 2013
```
llvm-svn: 182700
```
fc1c5fe9
Add to testsuite. · ba63e07f
Eric Christopher authored May 24, 2013
```
llvm-svn: 182693
```
ba63e07f

ArrayRef-ize MD5 and clean up a few variable names. · fcee6f0a

Eric Christopher authored May 24, 2013

Add a stringize method to make dumping a bit easier, and add a testcase
exercising a few different paths.

llvm-svn: 182692

fcee6f0a

PPC: Initial support for permutation-based unaligned Altivec loads · cf2e9080

Hal Finkel authored May 24, 2013

Altivec only directly supports aligned loads, but the loads have a strange
property: If given an unaligned address, they truncate the address to the next
lower aligned address, and load from there. This property, along with an extra
load and some special-purpose permutation-control instructions that generate
the appropriate permutations from the original unaligned address, allows
efficient lowering of unaligned loads. This code uses the trick explained in the
Apple Velocity Engine optimization overview document to prevent the needed
extra load from possibly causing a page fault if the original address happens
to be aligned.
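
The scheme can be emulated in scalar code. The sketch below is an assumption-laden model, not the backend's lowering: memory is a byte buffer whose offsets stand in for addresses, alignedLoad mimics the address truncation, and the byte selection loop stands in for the permute instruction.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

using Vec = std::array<std::uint8_t, 16>;

// Models an Altivec-style load: the low four bits of the offset are
// ignored, so the load always starts at a 16-byte boundary.
Vec alignedLoad(const std::uint8_t *Mem, std::size_t Offset) {
  Vec V;
  const std::size_t Base = Offset & ~static_cast<std::size_t>(15);
  for (std::size_t I = 0; I < 16; ++I)
    V[I] = Mem[Base + I];
  return V;
}

// Unaligned load built from two aligned loads plus a byte selection.
Vec unalignedLoad(const std::uint8_t *Mem, std::size_t Offset) {
  const std::size_t Shift = Offset & 15;
  const Vec Lo = alignedLoad(Mem, Offset);
  // Offset + 15 rather than + 16: when Offset is already aligned, this
  // reloads the same block instead of touching the next one.
  const Vec Hi = alignedLoad(Mem, Offset + 15);
  Vec R;
  for (std::size_t I = 0; I < 16; ++I) {
    const std::size_t J = Shift + I;
    R[I] = J < 16 ? Lo[J] : Hi[J - 16];
  }
  return R;
}
```

For an aligned offset both loads read the same block, which is what keeps the extra load from straying onto a following page.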

As noted in the FIXMEs, there are several additional optimizations that can be
performed to reduce the cost of these loads even more. These will be
implemented in future commits.

llvm-svn: 182691

cf2e9080

[Support] Remove Count{Leading,Trailing}Zeros_{32,64}. · a8db3f6f
Michael J. Spencer authored May 24, 2013
```
llvm-svn: 182690
```
a8db3f6f
Tidy up. Whitespace. · c161680c
Jim Grosbach authored May 24, 2013
```
llvm-svn: 182689
```
c161680c

Follow up of the introduction of MCSymbolizer. · f482805c

Quentin Colombet authored May 24, 2013

- Resurrect old MCDisassemble API to soften transition.
- Extend MCTargetDesc to set target specific symbolizer.

llvm-svn: 182688

f482805c

clang-formatted APFloat.h · 410bd525
Michael Gottesman authored May 24, 2013
```
llvm-svn: 182686
```
410bd525
clang-formatted APInt.h · 356ead3f
Michael Gottesman authored May 24, 2013
```
llvm-svn: 182685
```
356ead3f
MathExtras: Return the result of find(First|Last)Set in the input type. · 2ce482e6
Benjamin Kramer authored May 24, 2013
```
Otherwise ZB_Max returns a wrong result when sizeof(T) > sizeof(size_t).

llvm-svn: 182684
```
2ce482e6
Replace Count{Leading,Trailing}Zeros_{32,64} with count{Leading,Trailing}Zeros. · df1ecbd7
Michael J. Spencer authored May 24, 2013
```
llvm-svn: 182680
```
df1ecbd7
[Support][MathExtras] Fix literal type issues. · 795ecd2c
Michael J. Spencer authored May 24, 2013
```
llvm-svn: 182679
```
795ecd2c

May 24, 2013

Add missing header for atexit. · 4fd69975
Michael J. Spencer authored May 24, 2013
```
llvm-svn: 182672
```
4fd69975

[Support][MathExtras] Add missing include and disable... · 0d9d75f2

Michael J. Spencer authored May 24, 2013

[Support][MathExtras] Add missing include and disable _BitScan{Forward,Reverse}64 on non x64 MSVC systems.

llvm-svn: 182671

0d9d75f2

[objc-arc] KnownSafe does not imply that it is safe to perform code motion... · e67f40c5

Michael Gottesman authored May 24, 2013

[objc-arc] KnownSafe does not imply that it is safe to perform code motion across CFG edges since even if it is safe to remove RR pairs, we may still be able to move a retain/release into a loop.

rdar://13949644

llvm-svn: 182670

e67f40c5

[objc-arc] Make sure that multiple owners is propagated correctly through the... · 5a91bbf3

Michael Gottesman authored May 24, 2013

[objc-arc] Make sure that multiple owners is propagated correctly through the pass via the usage of a global data structure.

rdar://13750319

llvm-svn: 182669

5a91bbf3

[Support] Add type generic bit utilities to MathExtras.h · eb91eac9
Michael J. Spencer authored May 24, 2013
```
llvm-svn: 182667
```
eb91eac9

LoopVectorize: LoopSimplify can't canonicalize loops with an indirectbr in it,... · 6ac1e623

Benjamin Kramer authored May 24, 2013

LoopVectorize: LoopSimplify can't canonicalize loops with an indirectbr in it, don't assert on those cases.

Fixes PR16139.

llvm-svn: 182656

6ac1e623

Do not reserve space for the ColdEdges and NormalEdges vectors. · c2c44676

Diego Novillo authored May 24, 2013

Discussion and rationale at
http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130520/175698.html

llvm-svn: 182653

c2c44676

[SystemZ] Improve AsmParser handling of invalid instructions · dc5ed713

Richard Sandiford authored May 24, 2013

Previously, an invalid instruction like:

	foo     %r1, %r0

would generate the rather odd error message:

....: error: unknown token in expression
	foo     %r1, %r0
		^

We now get the more informative:

....: error: invalid instruction
	foo     %r1, %r0
	^

The same would happen if an address were used where a register was expected.
We now get "invalid operand for instruction" instead.

llvm-svn: 182644

dc5ed713

[SystemZ] Improve AsmParser register parsing · 675f8699

Richard Sandiford authored May 24, 2013

The idea is to make sure that:

(1) "register expected" is restricted to cases where ParseRegister()
    is called and the token obviously isn't a register.

(2) "invalid register" is restricted to cases where a register-like "%..."
    sequence is found, but the "..." makes no sense.

(3) the generic "invalid operand for instruction" is used in cases where
    the wrong register type is used (GPR instead of FPR, etc.).

(4) the new "invalid register pair" is used if the register has the right type,
    but is not a valid register pair.
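
The four cases can be sketched as a single classification function. This is a hypothetical illustration, not the AsmParser code: the register syntax is reduced to "%r" or "%f" followed by a number from 0 to 15, and "valid pair" is assumed to mean an even register number.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string>

// Returns the diagnostic text for Tok, or an empty string if it is accepted.
// WantKind is 'r' or 'f'; WantPair asks for a register pair.
std::string diagnoseRegister(const std::string &Tok, char WantKind,
                             bool WantPair) {
  // (1) The token obviously is not a register.
  if (Tok.empty() || Tok[0] != '%')
    return "register expected";
  // (2) Register-like "%..." sequence whose "..." makes no sense.
  if (Tok.size() < 3 || (Tok[1] != 'r' && Tok[1] != 'f'))
    return "invalid register";
  unsigned Num = 0;
  for (std::size_t I = 2; I < Tok.size(); ++I) {
    if (!std::isdigit(static_cast<unsigned char>(Tok[I])))
      return "invalid register";
    Num = Num * 10 + static_cast<unsigned>(Tok[I] - '0');
    if (Num > 15)
      return "invalid register";
  }
  // (3) Well-formed register of the wrong type.
  if (Tok[1] != WantKind)
    return "invalid operand for instruction";
  // (4) Right type, but not usable as a pair (assumed: odd number).
  if (WantPair && Num % 2 != 0)
    return "invalid register pair";
  return "";
}
```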

Testing of (1)-(3) is now restricted to regs-bad.s.  It uses a representative
instruction for each register class to make sure that only registers from
that class are accepted.

(4) is tested by both regs-bad.s (which checks all invalid register pairs)
and insn-bad.s (which tests one invalid pair for each instruction that
requires a pair).

While there, I changed "Number" to "Num" for consistency with the
operand class.

llvm-svn: 182643

675f8699

Run clang-format over the scalarizePHI function. · b34294d0
Joey Gouly authored May 24, 2013
```
llvm-svn: 182640
```
b34294d0

scalarizePHI needs to insert the next ExtractElement in the same block · 83699284

Joey Gouly authored May 24, 2013

as the BinaryOperator, *not* in the block where the IRBuilder is currently
inserting. Fixes a bug where scalarizePHI would create instructions
that would not dominate all uses.

llvm-svn: 182639

83699284

Add a new function attribute 'cold' to functions. · c6399539

Diego Novillo authored May 24, 2013

Other than recognizing the attribute, the patch does little else.
It changes the branch probability analyzer so that edges into
blocks postdominated by a cold function are given low weight.
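
A rough sketch of the weighting idea follows. The weight values and the function name are illustrative assumptions; the message does not state the numbers the analyzer uses.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Illustrative weights only; not taken from the patch.
const std::uint32_t ColdEdgeWeight = 1;
const std::uint32_t NormalEdgeWeight = 16;

// SuccIsCold[I] says whether successor I is postdominated by a call to a
// cold function. Edges into such blocks get the low weight, but only when
// the branch also has a non-cold successor to prefer.
std::vector<std::uint32_t> edgeWeights(const std::vector<bool> &SuccIsCold) {
  bool AnyCold = false;
  bool AnyNormal = false;
  for (std::size_t I = 0; I < SuccIsCold.size(); ++I) {
    if (SuccIsCold[I])
      AnyCold = true;
    else
      AnyNormal = true;
  }
  const bool Mixed = AnyCold && AnyNormal;
  std::vector<std::uint32_t> Weights;
  for (std::size_t I = 0; I < SuccIsCold.size(); ++I)
    Weights.push_back(Mixed && SuccIsCold[I] ? ColdEdgeWeight
                                             : NormalEdgeWeight);
  return Weights;
}
```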

Added analysis and code generation tests.  Added documentation for the
new attribute.

llvm-svn: 182638

c6399539