Commits · 4d3514ee94f31e4a39f85003d87e1abe873be1bf · Roger Ferrer / llvm-epi-0.8

May 05, 2013

[XCore] Add BLRB instructions. · 4d3514ee
Richard Osborne authored May 05, 2013
```
llvm-svn: 181152
```
4d3514ee

[XCore] Remove '-' from back branch asm syntax. · 53a04fe2

Richard Osborne authored May 05, 2013

Instead operands are treated as negative immediates
where the sign bit is implicit in the instruction
encoding.

llvm-svn: 181151

53a04fe2

Use remove_if to erase parts of a vector. Avoids O(n^2) worst cases. · 1ce5d800
Benjamin Kramer authored May 05, 2013
```
llvm-svn: 181150
```
1ce5d800
InlineSpiller: Remove quadratic behavior. · 391f5a6e
Benjamin Kramer authored May 05, 2013
```
No functionality change.

llvm-svn: 181149
```
391f5a6e

For ARM backend, fixed "byval" attribute support. · 8c02c982

Stepan Dyatkovskiy authored May 05, 2013

Now even the small structures could be passed within byval (small enough
to be stored in GPRs).
In regression tests next function prototypes are checked:

PR15293:
  %artz = type { i32 }
  define void @foo(%artz* byval %s)
  define void @foo2(%artz* byval %s, i32 %p, %artz* byval %s2)
foo: "s" stored in R0
foo2: "s" stored in R0, "s2" stored in R2.

Next AAPCS rules are checked:
5.5 Parameters Passing, C.4 and C.5,
"ParamSize" is parameter size in 32bit words:
-- NSAA != 0, NCRN < R4 and NCRN+ParamSize > R4.
   Parameter should be sent to the stack; NCRN := R4.
-- NSAA != 0, and NCRN < R4, NCRN+ParamSize < R4.
   Parameter stored in GPRs; NCRN += ParamSize.

llvm-svn: 181148

8c02c982

Add missing PatternMatch.cpp to CMakeLists.txt · eb4f2d63
Arnold Schwaighofer authored May 05, 2013
```
llvm-svn: 181147
```
eb4f2d63
PatternMatch: Fix documentation - 'function' not 'attribute' · 2c4508a9
Arnold Schwaighofer authored May 05, 2013
```
llvm-svn: 181146
```
2c4508a9

Remove a recently redundant transform from X86ISelLowering. · 66fb70de

David Majnemer authored May 05, 2013

X86ISelLowering has support to treat:
(icmp ne (and (xor %flags, -1), (shl 1, flag)), 0)

as if it were actually:
(icmp eq (and %flags, (shl 1, flag)), 0)

However, r179386 has code at the InstCombine level to handle this.

llvm-svn: 181145

66fb70de

LoopVectorize: Add support for floating point min/max reductions · d96e427e

Arnold Schwaighofer authored May 05, 2013

Add support for min/max reductions when "no-nans-float-math" is enabled. This
allows us to assume we have ordered floating point math and treat ordered and
unordered predicates equally.

radar://13723044

llvm-svn: 181144

d96e427e

PatternMatch: Matcher for (un)ordered floating point min/max · e972d03f

Arnold Schwaighofer authored May 05, 2013

Add support for matching 'ordered' and 'unordered' floating point min/max
constructs.

In LLVM we can express min/max functions as a combination of compare and select.
We have support for matching such constructs for integers but not for floating
point. In floating point math there is no total order because of the presence of
'NaN'. Therefore, we have to be careful to preserve the original fcmp semantics
when interpreting floating point compare select combinations as a minimum or
maximum function. The resulting 'ordered/unordered' floating point maximum
function has to select the same value as the select/fcmp combination it is based
on.

 ordered_max(x,y)   = max(x,y) iff x and y are not NaN, y otherwise
 unordered_max(x,y) = max(x,y) iff x and y are not NaN, x otherwise
 ordered_min(x,y)   = min(x,y) iff x and y are not NaN, y otherwise
 unordered_min(x,y) = min(x,y) iff x and y are not NaN, x otherwise

This matches the behavior of the underlying select(fcmp(olt/ult/.., L, R), L, R)
construct.

Any code using this predicate has to preserve this semantics.

A follow-up patch will use this to implement floating point min/max reductions
in the vectorizer.

radar://13723044

llvm-svn: 181143

e972d03f

LoopVectorizer: Cleanup of miminimum/maximum pattern match code · f5183729

Arnold Schwaighofer authored May 05, 2013

No need for setting the operands. The pointers are going to be bound by the
matcher.

radar://13723044

llvm-svn: 181142

f5183729

LoopVectorize: We don't need an identity element for min/max reductions · a670a0a3

Arnold Schwaighofer authored May 05, 2013

We can just use the initial element that feeds the reduction.

  max(max(x, y), z) == max(max(x,y), max(x,z))

radar://13723044

llvm-svn: 181141

a670a0a3

ArrayRef<T>() -> None cleanup · 010316ce
Dmitri Gribenko authored May 05, 2013
```
llvm-svn: 181140
```
010316ce
Replace ArrayRef<T>() with None, now that we have an implicit ArrayRef constructor from None · 44ebbd54
Dmitri Gribenko authored May 05, 2013
```
Patch by Robert Wilhelm.

llvm-svn: 181139
```
44ebbd54
Add ArrayRef constructor from None, and do the cleanups that this constructor enables · 3238fb75
Dmitri Gribenko authored May 05, 2013
```
Patch by Robert Wilhelm.

llvm-svn: 181138
```
3238fb75
whitespace · d61dcfc4
Nadav Rotem authored May 04, 2013
```
llvm-svn: 181137
```
d61dcfc4
Fix an odd comment. · 42932bdc
Nadav Rotem authored May 04, 2013
```
llvm-svn: 181136
```
42932bdc

May 04, 2013

AArch64: enable MCJIT and tests now that everything passes. · 7b55b97d

Tim Northover authored May 04, 2013

This removes dire warnings about AArch64 being unsupported and enables
the tests when appropriate on this platform.

llvm-svn: 181135

7b55b97d

AArch64: implement 64-bit absolute relocation in MCJIT · b23d8dbb

Tim Northover authored May 04, 2013

This is about the simplest relocation, but surprisingly rare in actual
code.

It occurs in (for example) the MCJIT test test-ptr-reloc.ll.

llvm-svn: 181134

b23d8dbb

AArch64: add stubs to support long function calls on MCJIT · 37cde975

Tim Northover authored May 04, 2013

As with global accesses, external functions could exist anywhere in
memory. Therefore the stub must create a complete 64-bit address. This
patch implements the fragment as (roughly):
    movz x16, #:abs_g3:somefunc
    movk x16, #:abs_g2_nc:somefunc
    movk x16, #:abs_g1_nc:somefunc
    movk x16, #:abs_g0_nc:somefunc
    br x16

In principle we could save 4 bytes by using a literal-load instead,
but it is unclear that would be more efficient and can only be tested
when real hardware is readily available.

This allows (for example) the MCJIT test 2003-05-07-ArgumentTest to
pass on AArch64.

llvm-svn: 181133

37cde975

AArch64: implement relocations for global access · 4d01c1e0

Tim Northover authored May 04, 2013

The large memory model (default and main viable for JIT) emits
addresses in need of relocation as
    movz x0, #:abs_g3:somewhere
    movk x0, #:abs_g2_nc:somewhere
    movk x0, #:abs_g1_nc:somewhere
    movk x0, #:abs_g0_nc:somewhere

To support this we must implement those four relocations in the
dynamic loader.

This allows (for example) the test-global.ll MCJIT test to pass on
AArch64.

llvm-svn: 181132

4d01c1e0

AArch64: implement first relocation required for MCJIT · fa1b2f85

Tim Northover authored May 04, 2013

R_AARCH64_PCREL32 is present in even trivial .eh_frame sections and so
is required to compile any function without the "nounwind" attribute.

This change implements very basic infrastructure in the RuntimeDyldELF
file and allows (for example) the test-shift.ll MCJIT test to pass
on AArch64.

llvm-svn: 181131

fa1b2f85

Build system changes to enable MCJIT on AArch64 · a958a570

Tim Northover authored May 04, 2013

These changes just allow AArch64 to take part in the MCJIT world when
built correctly.

llvm-svn: 181130

a958a570

AArch64: use __clear_cache under GCCish environments · 6c26b327

Tim Northover authored May 04, 2013

AArch64 is going to need some kind of cache-invalidation in order to
successfully JIT since it has a weak memory-model. This is provided by
a __clear_cache builtin in libgcc, which acts very much like the
32-bit ARM equivalent (on platforms where it exists).

llvm-svn: 181129

6c26b327

Fix buildbot failure on 64 bit linux due to std::max() having different · 2f75a0c0
Richard Osborne authored May 04, 2013
```
operand types.

llvm-svn: 181128
```
2f75a0c0
[XCore] Remove unused operand type. · 0a7abb65
Richard Osborne authored May 04, 2013
```
llvm-svn: 181127
```
0a7abb65

[XCore] Make use of the target independent global address offset folding. · 54ff84a8

Richard Osborne authored May 04, 2013

This let us to remove some custom code that matched constant offsets
from globals at instruction selection time as a special addressing mode.
No intended functionality change.

llvm-svn: 181126

54ff84a8

[XCore] Simplify code that checks for an aligned base plus a constant. · a282fa5b

Richard Osborne authored May 04, 2013

The code now makes use of ComputeMaskedBits,
SelectionDAG::isBaseWithConstantOffset and TargetLowering::isGAPlusOffset
where appropriate reducing the amount of logic needed in XCoreISelLowering.
No intended functionality change.

llvm-svn: 181125

a282fa5b

[XCore] Move lowering of thread local storage to a separate pass. · 8bbea9cd

Richard Osborne authored May 04, 2013

Thread local storage is not supported by the XMOS linker so we handle
thread local variables by lowering the variable to an array of n elements
(where n is the number of hardware threads per core, currently 8
for all XMOS devices) indexed by the the current thread ID.

Previously this lowering was spread across the XCoreISelLowering and the
XCoreAsmPrinter classes. Moving this to a separate pass should be much
cleaner.

llvm-svn: 181124

8bbea9cd

Properly parsing __declspec(safebuffers), though there is no semantic hookup. ... · 444eb6e2

Aaron Ballman authored May 04, 2013

Properly parsing __declspec(safebuffers), though there is no semantic hookup.  For more information about safebuffers, see MSDN: http://msdn.microsoft.com/en-us/library/dd778695(v=vs.110).aspx

llvm-svn: 181123

444eb6e2

Reverting r181004 since it has broken test/Sema/wchar.c. · d428ff46
Aaron Ballman authored May 04, 2013
```
llvm-svn: 181122
```
d428ff46

AArch64: assert code model is small for TLS accesses · 85dcbde2

Tim Northover authored May 04, 2013

Supporting TLS in the large memory model is rather difficult at the
moment, so make sure no-one gets into difficulties by mistake.

llvm-svn: 181121

85dcbde2

AArch64: support literal pool access in large memory model. · 885698a2
Tim Northover authored May 04, 2013
```
llvm-svn: 181120
```
885698a2
AArch64: support large code model for jump-tables · 8ff187df
Tim Northover authored May 04, 2013
```
llvm-svn: 181119
```
8ff187df
AArch64: implement support for blockaddress in large code model · 9fc1cddb
Tim Northover authored May 04, 2013
```
llvm-svn: 181118
```
9fc1cddb

AArch64: implement large code model access to global variables. · 2dbef345

Tim Northover authored May 04, 2013

The MOVZ/MOVK instruction sequence may not be the most efficient (a
literal-pool load could be better) but adding that would require
reinstating the ConstantIslands pass.

For now the sequence is correct, and that's enough. Beware, as of
commit GNU ld does not appear to support the relocations needed for
this. Its primary purpose (for now) will be to support JITed code,
since in that case there is no guarantee of where your code will end
up in memory relative to external symbols it references.

llvm-svn: 181117

2dbef345

[XCore] Use static relocation model by default. · df9e5741

Richard Osborne authored May 04, 2013

This allows us to get get rid of a hack in XCoreTargetObjectFile where the
the DataRel* sections were overridden.

llvm-svn: 181116

df9e5741

Moved pretty printer test for thread local storage in its own file · b2d998f3
Enea Zaffanella authored May 04, 2013
```
and specified the triple.

llvm-svn: 181115
```
b2d998f3
Lex: Fix quadratic behavior when unescaping _Pragma strings. · c2f5f29b
Benjamin Kramer authored May 04, 2013
```
No functionality change.

llvm-svn: 181114
```
c2f5f29b
In VarDecl nodes, store the thread storage class specifier as written. · acb8ecd6
Enea Zaffanella authored May 04, 2013
```
llvm-svn: 181113
```
acb8ecd6