Commits · 36b5eea258c6a30c389fca5aa62c89f3a2276329 · Roger Ferrer / llvm-epi-0.8

Feb 04, 2014

DIBuilder: simplify array generation to produce true zero-length arrays · 2c7a2684

David Blaikie authored Feb 03, 2014

For some anachronistic reason we were producing {i32 0} for zero-length
debug info arrays.

(this change is paired with a Clang change and may cause temporary
buildbot noise)

Let's not.

llvm-svn: 200721

2c7a2684

Feb 03, 2014

Add DEBUG_TYPE to SIAnnotateControlFlow · d5ab971b
Matt Arsenault authored Feb 03, 2014
```
llvm-svn: 200720
```
d5ab971b

inalloca: Don't remove dead arguments in the presence of inalloca args · d47a59a4

Reid Kleckner authored Feb 03, 2014

It disturbs the layout of the parameters in memory and registers,
leading to problems in the backend.

The plan for optimizing internal inalloca functions going forward is to
essentially SROA the argument memory and demote any captured arguments
(things that aren't trivially written by a load or store) to an indirect
pointer to a static alloca.

llvm-svn: 200717

d47a59a4

AArch64 & ARM: refactor crypto intrinsics to take scalars · 24979d8e

Tim Northover authored Feb 03, 2014

Some of the SHA instructions take a scalar i32 as one argument (largely because
they work on 160-bit hash fragments). This wasn't reflected in the IR
previously, with ARM and AArch64 choosing different types (<4 x i32> and <1 x
i32> respectively) which was ugly.

This makes all the affected intrinsics take a uniform "i32", allowing them to
become non-polymorphic at the same time.

llvm-svn: 200706

24979d8e

Expand vector bswap in LegalizeVectorOps · 5c968d94

Hal Finkel authored Feb 03, 2014

ISD::BSWAP was missing from the list of node types that should be expanded
element-wise.

llvm-svn: 200705

5c968d94

Undef'ing _WIN32_IE to silence an MSVC warning about redefining a macro value. · 42f6622b
Aaron Ballman authored Feb 03, 2014
```
No functional change intended.

llvm-svn: 200704
```
42f6622b
Add a note about Clang+LLVM on Sparc64. · 5a96c87b
Venkatraman Govindaraju authored Feb 03, 2014
```
llvm-svn: 200699
```
5a96c87b

Remove outdated & incorrect part of comment. · 309e77fb

Eli Bendersky authored Feb 03, 2014

This comment was copied over from another class in r34170, where it made sense.

llvm-svn: 200697

309e77fb

Don't use -ffunction-sections if -fno-function-sections is not supported in the compiler. · c7b7e256
Evgeniy Stepanov authored Feb 03, 2014
```
This will disable -ffunction-sections in older versions of Clang where it
breaks build of sanitizer runtime library.

llvm-svn: 200695
```
c7b7e256

Introduce SmallPtrSetImpl<T *> which allows insert, erase, count, and · 784de75c

Chandler Carruth authored Feb 03, 2014

iteration. This alows the majority of operations to be performed without
encoding a specific small size. It follows the model of
SmallVectorImpl<T>.

llvm-svn: 200688

784de75c

Rename the non-templated base class of SmallPtrSet to · 173bd7ed

Chandler Carruth authored Feb 03, 2014

'SmallPtrSetImplBase'. This more closely matches the organization of
SmallVector and should allow introducing a SmallPtrSetImpl which serves
the same purpose as SmallVectorImpl: isolating the element type from the
particular small size chosen. This in turn allows a lot of
simplification of APIs by not coding them against a specific small size
which is rarely needed.

llvm-svn: 200687

173bd7ed

Remove unnecessary include of AArch64GenInstrInfo.inc from... · e7a9ee5c

Craig Topper authored Feb 03, 2014

Remove unnecessary include of AArch64GenInstrInfo.inc from AArch64Disassembler.cpp. None of the GET_ defines were set that would make the include do anything.

llvm-svn: 200677

e7a9ee5c

Feb 02, 2014
- Lower llvm.expect intrinsic correctly for i1 · 1ff08e38
  Duncan P. N. Exon Smith authored Feb 02, 2014
```
LowerExpectIntrinsic previously only understood the idiom of an expect
intrinsic followed by a comparison with zero. For llvm.expect.i1, the
comparison would be stripped by the early-cse pass.

Patch by Daniel Micay.

llvm-svn: 200664
```
  1ff08e38
- Unaligned access is supported on ARMv6 and ARMv7 for the NetBSD target. · 4455ffc4
  Joerg Sonnenberger authored Feb 02, 2014
```
Patch from Matt Thomas.

llvm-svn: 200654
```
  4455ffc4
- [CMake] Move cmake_minimum_required(2.8.8) at the top. · 85d65ff4
  NAKAMURA Takumi authored Feb 02, 2014
```
Suggested by Stephen Kelly.

llvm-svn: 200645
```
  85d65ff4
- [CMake] Untabify. · 78293371
  NAKAMURA Takumi authored Feb 02, 2014
```
llvm-svn: 200644
```
  78293371
- TableGen/X86RecognizableInstr.h: Prune out-of-date "@param isSSE". [-Wdocumentation] · d8dd194f
  NAKAMURA Takumi authored Feb 02, 2014
```
llvm-svn: 200628
```
  d8dd194f
- Merge x86 HasOpSizePrefix/HasOpSize16Prefix into a 2-bit OpSize field with 0... · fa6298a1
  Craig Topper authored Feb 02, 2014
```
Merge x86 HasOpSizePrefix/HasOpSize16Prefix into a 2-bit OpSize field with 0 meaning no 0x66 prefix in any mode. Rename Opsize16->OpSize32 and OpSize->OpSize16. The classes now refer to their operand size rather than the mode in which they need a 0x66 prefix. Hopefully can merge REX_W into this as OpSize64.

llvm-svn: 200626
```
  fa6298a1
- Simplify some code since VEX and EVEX instructions never have HasOpSizePrefix. · 8e92e85a
  Craig Topper authored Feb 02, 2014
```
llvm-svn: 200625
```
  8e92e85a
- Merge HasVEXPrefix/HasEVEXPrefix/HasXOPPrefix into a 2-bit 'encoding' field in TSFlags. · d402df3c
  Craig Topper authored Feb 02, 2014
```
llvm-svn: 200624
```
  d402df3c
- Replace PPC instruction-size code with MCInstrDesc getSize · a7bbaf6d
  Hal Finkel authored Feb 02, 2014
```
As part of the cleanup done to enable the disassembler, the PPC instructions
now have a valid Size description field. This can now be used to replace some
custom logic in a few places to compute instruction sizes.

Patch by David Wiberg!

llvm-svn: 200623
```
  a7bbaf6d
- LoopVectorizer: Enable unrolling of conditional stores and the load/store · 17455633
  Arnold Schwaighofer authored Feb 02, 2014
```
unrolling heuristic per default

Benchmarking on x86_64 (thanks Chandler!) and ARM has shown those options speed
up some benchmarks while not causing any interesting regressions.

llvm-svn: 200621
```
  17455633
- Add some xfailed R600 tests for 64-bit private accesses. · 6e63dd27
  Matt Arsenault authored Feb 02, 2014
```
llvm-svn: 200620
```
  6e63dd27
- R600/SI: Fix insertelement with dynamic indices. · f5958dde
  Matt Arsenault authored Feb 02, 2014
```
This didn't work for any integer vectors, and didn't
work with some sizes of float vectors. This should now
work with all sizes of float and i32 vectors.

llvm-svn: 200619
```
  f5958dde
Feb 01, 2014

[Sparc] Set %o7 as the return address register instead of %i7 in... · 52b6473d

Venkatraman Govindaraju authored Feb 01, 2014

[Sparc] Set %o7 as the return address register instead of %i7 in MCRegisterInfo. Also, add CFI instructions to initialize the frame correctly.

llvm-svn: 200617

52b6473d

ARMTTI: We don't have 16 allocatable scalar registers · 445f7fb0

Arnold Schwaighofer authored Feb 01, 2014

This caused an regression on libquantum after enabling the new loop vectorizer
unroll heuristics.

llvm-svn: 200616

445f7fb0

MC: Fix .octa output for APInts with BitWidth > 128 · 6c9a6f9b
David Woodhouse authored Feb 01, 2014
```
llvm-svn: 200615
```
6c9a6f9b

MC: Add support for .octa · d6de0d99

David Woodhouse authored Feb 01, 2014

This is a minimal implementation which accepts only constants rather than
full expressions, but that should be perfectly sufficient for all known
users for now.

Patch from PaX Team <pageexec@freemail.hu>

llvm-svn: 200614

d6de0d99

MC: Add AsmLexer::BigNum token for integers greater than 64 bits · f42a6662

David Woodhouse authored Feb 01, 2014

This will be needed for .octa support, but we don't want to just use the
existing AsmLexer::Integer for it and then have to litter all its users
with explicit checks for the size, and make them use the new get APIntVal()
method.

So let the lexer produce an AsmLexer::Integer as before for numbers which
are small enough — which appears to cover what was previously a nasty
special case handling of numbers which don't fit in int64_t but *do* fit
in uint64_t.

Where the number is too large even for that, produce an AsmLexer::BigNum
instead. We do nothing with these except complain about them for now,
but that will be changed shortly...

Based on a patch from PaX Team <pageexec@freemail.hu>

llvm-svn: 200613

f42a6662

[LPM] Apply a really big hammer to fix PR18688 by recursively reforming · 1665152c

Chandler Carruth authored Feb 01, 2014

LCSSA when we promote to SSA registers inside of LICM.

Currently, this is actually necessary. The promotion logic in LICM uses
SSAUpdater which doesn't understand how to place LCSSA PHI nodes.
Teaching it to do so would be a very significant undertaking. It may be
worthwhile and I've left a FIXME about this in the code as well as
starting a thread on llvmdev to try to figure out the right long-term
solution.

For now, the PR needs to be fixed. Short of using the promition
SSAUpdater to place both the LCSSA PHI nodes and the promoted PHI nodes,
I don't see a cleaner or cheaper way of achieving this. Fortunately,
LCSSA is relatively lazy and sparse -- it should only update
instructions which need it. We can also skip the recursive variant when
we don't promote to SSA values.

llvm-svn: 200612

1665152c

Remove some unused #includes · fc49d198
Eli Bendersky authored Feb 01, 2014
```
llvm-svn: 200611
```
fc49d198
Silence GCC warnings. · 029750fb
Benjamin Kramer authored Feb 01, 2014
```
llvm-svn: 200610
```
029750fb

[inliner] Skip debug intrinsics even earlier in computing the inline · 6b4cc8b6

Chandler Carruth authored Feb 01, 2014

cost so that they don't impact the vector bonus. Fundamentally, counting
unsimplified instructions is just *wrong*; it will continue to introduce
instability as things which do not generate code bizarrely impact
inlining. For example, sufficiently nested inlined functions could turn
off the vector bonus with lifetime markers just like the debug
intrinsics do. =/

This is a short-term tactical fix. Long term, I think we need to remove
the vector bonus entirely. That's a separate patch and discussion
though.

The patch to fix this provided by Dario Domizioli. I've added some
comments about the planned direction and used a heavily pruned form of
debug info intrinsics for the test case. While this debug info doesn't
work or "do" anything useful, it lets us easily test all manner of
interference easily, and I suspect this will not be the last time we
want to craft a pattern where debug info interferes with the inliner in
a problematic way.

llvm-svn: 200609

6b4cc8b6

Simplify some x86 format classes and remove some ambiguities in their application. · da7160d6
Craig Topper authored Feb 01, 2014
```
llvm-svn: 200608
```
da7160d6
Update a .fill test to use the updated semantics. · 5a67e2b1
David Majnemer authored Feb 01, 2014
```
Something funny happened, this should've been part of r200606.

llvm-svn: 200607
```
5a67e2b1

MC: Improve the .fill directive's compatibility with GAS · 522d3db7

David Majnemer authored Feb 01, 2014

Per the GAS documentation, .fill should permit pattern widths that
aren't a power of two. While I was in the neighborhood, I added some
sanity checking. This change was motivated by a use of this construct
in the Linux Kernel.

llvm-svn: 200606

522d3db7

Hopefully fix mingw32 bots. · ad141abd

Peter Collingbourne authored Feb 01, 2014

For some reason this symbolic constant isn't defined in some versions of mingw32.

llvm-svn: 200605

ad141abd

Revert "[SLPV] Recognize vectorizable intrinsics during SLP vectorization ..." · a04504fe

Reid Kleckner authored Feb 01, 2014

This reverts commit r200576.  It broke 32-bit self-host builds by
vectorizing two calls to @llvm.bswap.i64, which we then fail to expand.

llvm-svn: 200602

a04504fe

[stackprotector] Implement the sspstrong rules for stack layout. · 24c7f063

Josh Magee authored Feb 01, 2014

This changes the PrologueEpilogInserter and LocalStackSlotAllocation passes to
follow the extended stack layout rules for sspstrong and sspreq.

The sspstrong layout rules are:
 1. Large arrays and structures containing large arrays (>= ssp-buffer-size)
are closest to the stack protector.
 2. Small arrays and structures containing small arrays (< ssp-buffer-size) are
2nd closest to the protector.
 3. Variables that have had their address taken are 3rd closest to the
protector.


Differential Revision: http://llvm-reviews.chandlerc.com/D2546

llvm-svn: 200601

24c7f063

Implement inalloca codegen for x86 with the new inalloca design · f5b76518

Reid Kleckner authored Jan 31, 2014

Calls with inalloca are lowered by skipping all stores for arguments
passed in memory and the initial stack adjustment to allocate argument
memory.

Now the frontend is responsible for the memory layout, and the backend
doesn't have to do any work.  As a result these changes are pretty
minimal.

Reviewers: echristo

Differential Revision: http://llvm-reviews.chandlerc.com/D2637

llvm-svn: 200596

f5b76518