Commits · e1eaecd52805c9a975c1a6cc6a85f216b0a48361 · Roger Ferrer / llvm-epi-0.8

Aug 12, 2013

Move stack protector names to the same place. · e1eaecd5
Bill Wendling authored Aug 12, 2013
```
llvm-svn: 188198
```
e1eaecd5

[stackprotector] Add in the stackprotector libcall. · 7dce16f6

Michael Gottesman authored Aug 12, 2013

We support this libcall on all platforms except for OpenBSD (See
lib/Codegen/StackProtector.cpp).

llvm-svn: 188193

7dce16f6

Fix PR16797 - Support PHINodes with multiple inputs from the same basic block. · e23147bb

Nadav Rotem authored Aug 12, 2013

Do not generate new vector values for the same entries because we know that the incoming values
from the same block must be identical.

llvm-svn: 188185

e23147bb

Fixes a bug when iterating on paths · 73537eac

Tareq A. Siraj authored Aug 12, 2013

This fixes the incorrect implementation of iterating on file/directory
paths.

Differential Review: http://llvm-reviews.chandlerc.com/D1277

llvm-svn: 188183

73537eac

This patch implements ei and di instructions for mips. Test cases are added. · 939877ee
Vladimir Medic authored Aug 12, 2013
```
llvm-svn: 188176
```
939877ee
Remove unused SpecialCaseList constructors · 15dc0af7
Alexey Samsonov authored Aug 12, 2013
```
llvm-svn: 188171
```
15dc0af7
Add SpecialCaseList::createOrDie() factory and use it in sanitizer passes · e4b5fb88
Alexey Samsonov authored Aug 12, 2013
```
llvm-svn: 188169
```
e4b5fb88

Remove all checking for the various terminfo headers (term.h and · 91219858

Chandler Carruth authored Aug 12, 2013

curses.h). Finding these headers is next to impossible. For example, on
Debian systems libtinfo-dev provides the terminfo reading library we
want, but *not* term.h. For the header, you have to use libncurses-dev.
And libncursesw-dev provides a *different* term.h in a different
location!

These headers aren't worth it. We want two functions the signatures of
which are clearly spec'ed in sys-v and other documentation. Just declare
them ourselves and call them. This should fix some debian builders and
provide better support for "minimal" debian systems that do want color
autodetection.

llvm-svn: 188165

91219858

[SystemZ] Use CLC and IPM to implement memcmp · 564681c8

Richard Sandiford authored Aug 12, 2013

For now this is restricted to fixed-length comparisons with a length
in the range [1, 256], as for memcpy() and MVC.

llvm-svn: 188163

564681c8

[SystemZ] Add a definition of the CLC instruction · 761703a2
Richard Sandiford authored Aug 12, 2013
```
llvm-svn: 188162
```
761703a2
[SystemZ] Add a definition of the IPM instruction · 87326c73
Richard Sandiford authored Aug 12, 2013
```
llvm-svn: 188161
```
87326c73

Target a minimal terminfo library rather than necessarily a full curses · f11f1e43

Chandler Carruth authored Aug 12, 2013

library for color support detection. This still will use a curses
library if that is all we have available on the system. This change
tries to use a smaller subset of the curses library, specifically the
subset that is on some systems split off into a separate library. For
example, if you install ncurses configured --with-tinfo, a 'libtinfo' is
install that provides just the terminfo querying functionality. That
library is now used instead of curses when it is available.

This happens to fix a build error on systems with that library because
when we tried to link ncurses into the binary, we didn't pull tinfo in
as well. =]

It should also provide an easy path for supporting the NetBSD
libterminfo library, but as I don't have access to a NetBSD system I'm
leaving adding that support to those folks.

llvm-svn: 188160

f11f1e43

Allow compatible extension attributes for tail calls · 707d68f0

Tim Northover authored Aug 12, 2013

If the tail-callee and caller give the same bits via the same signext/zeroext
attribute then a tail-call should be allowed, since the extension has already
been done by the callee.

llvm-svn: 188159

707d68f0

Remove global construction. const char* is sufficient here. · c9b7d47b
Benjamin Kramer authored Aug 12, 2013
```
No functionality change.

llvm-svn: 188158
```
c9b7d47b

Introduce factory methods for SpecialCaseList · 9e4fdd26

Alexey Samsonov authored Aug 12, 2013

Summary:
Doing work in constructors is bad: this change suggests to
call SpecialCaseList::create(Path, Error) instead of
"new SpecialCaseList(Path)". Currently the latter may crash with
report_fatal_error, which is undesirable - sometimes we want to report
the error to user gracefully - for example, if he provides an incorrect
file as an argument of Clang's -fsanitize-blacklist flag.

Reviewers: pcc

Reviewed By: pcc

CC: llvm-commits

Differential Revision: http://llvm-reviews.chandlerc.com/D1327

llvm-svn: 188156

9e4fdd26

Fix big-endian handling of integer-to-vector bitcasts in InstCombine · feb34713

Richard Sandiford authored Aug 12, 2013

These functions used to assume that the lsb of an integer corresponds
to vector element 0, whereas for big-endian it's the other way around:
the msb is in the first element and the lsb is in the last element.

Fixes MultiSource/Benchmarks/mediabench/gsm/toast for z.

llvm-svn: 188155

feb34713

Aug 11, 2013

Don't generate floating point stubs for mips16 code if the function · d265e888

Reed Kotler authored Aug 11, 2013

is actually an instrinsic that will not occur in libc. This list here
is not exhaustive but fixes the one places in test-suite where this occurs.
I have filed a bug against myself to research the full list and add them
to the array of such cases. In the future, actual stub generation will occur
in a later phase and we won't need this code because we will know at that time
during the compilation that in fact no helper function was even needed.

llvm-svn: 188149

d265e888

AVX-512: Added more tests for BROADCAST · 5fed3b95
Elena Demikhovsky authored Aug 11, 2013
```
llvm-svn: 188148
```
5fed3b95
AVX-512: Added VPERM* instructons and MOV* zmm-to-zmm instructions. · cf5b1458
Elena Demikhovsky authored Aug 11, 2013
```
Added a test for shuffles using VPERM.

llvm-svn: 188147
```
cf5b1458

Re-instate r187323 which fast-tracks promotable allocas as soon as the · d7cd7e36

Chandler Carruth authored Aug 11, 2013

SROA-based analysis has enough information. This should work now that
both mem2reg *and* the SSAUpdater-based AllocaPromoter have been updated
to be able to promote the types of allocas that the SROA analysis
detects.

I've included tests for the AllocaPromoter that were only possible to
write once we fast-tracked promotable allocas without rewriting them.
This includes a test both for r187347 and r188145.

Original commit log for r187323:
"""
Now that mem2reg understands how to cope with a slightly wider set of uses of
an alloca, we can pre-compute promotability while analyzing an alloca for
splitting in SROA. That lets us short-circuit the common case of a bunch of
trivially promotable allocas. This cuts 20% to 30% off the run time of SROA for
typical frontend-generated IR sequneces I'm seeing. It gets the new SROA to
within 20% of ScalarRepl for such code. My current benchmark for these numbers
is PR15412, but it fits the general pattern of IR emitted by Clang so it should
be widely applicable.
"""

llvm-svn: 188146

d7cd7e36

Finish fixing the SSAUpdater-based AllocaPromoter strategy in SROA to cope with · c17283b4

Chandler Carruth authored Aug 11, 2013

the more general set of patterns that are now handled by mem2reg and that we
can detect quickly while doing SROA's initial analysis. Notably, this allows it
to promote through no-op bitcast and GEP sequences. A core part of the
SSAUpdater approach is the ability to test whether a particular instruction is
part of the set being promoted. Testing this becomes significantly more complex
in the world where the operand to every load and store isn't the alloca itself.
I ended up using the approach of walking up the def-chain until we find the
alloca. I benchmarked this against keeping a set of pointer operands and
keeping a set of the loads and stores we care about, and this one seemed faster
although the difference was very small.

No test case yet because currently the rewriting always "fixes" the inputs to
not require this. The next patch which re-enables early promotion of easy cases
in SROA will include a test case that specifically exercises this aspect of the
alloca promoter.

llvm-svn: 188145

c17283b4

Reformat some bits of AllocaPromoter and simplify the name and type of · 45b136f4

Chandler Carruth authored Aug 11, 2013

our visiting datastructures in the AllocaPromoter/SSAUpdater path of
SROA. Also shift the order if clears around to be more consistent.

No functionality changed here, this is just a cleanup.

llvm-svn: 188144

45b136f4

Incorrect JAL instruction attributes caused the optimizer to make a wrong · 705c5951
Reed Kotler authored Aug 10, 2013
```
instruction move. Just affects static relocation. -static works fine now
with mips16 for the most part.

llvm-svn: 188143
```
705c5951

Aug 10, 2013

Revert r188119 "Kill some duplicated code for removing unreachable BBs." · 3dcdb89d

Arnold Schwaighofer authored Aug 10, 2013

It is breaking builbots with libgmalloc enabled on Mac OS X.

$ cd llvm ; mkdir release ; cd release
$ ../configure --enable-optimized —prefix=$PWD/install
$ make
$ make check
$ Release+Asserts/bin/llvm-lit -v --param use_gmalloc=1 --param \
  gmalloc_path=/usr/lib/libgmalloc.dylib \
  ../test/Instrumentation/DataFlowSanitizer/args-unreachable-bb.ll

llvm-svn: 188142

3dcdb89d

[Sparc] Enable xword directive in sparcv9. · b50bf5a0
Venkatraman Govindaraju authored Aug 10, 2013
```
llvm-svn: 188141
```
b50bf5a0
Teach ValueTracking about address spaces · f55e5e79
Matt Arsenault authored Aug 10, 2013
```
llvm-svn: 188140
```
f55e5e79
R600/SI: FMA is faster than fmul and fadd for f64 · d3a039fe
Niels Ole Salscheider authored Aug 10, 2013
```
llvm-svn: 188136
```
d3a039fe
R600/SI: Add FMA pattern · 6509ac65
Niels Ole Salscheider authored Aug 10, 2013
```
llvm-svn: 188135
```
6509ac65

Check for $PWD in llvm::sys::current_path. · 6ee16387

Rafael Espindola authored Aug 10, 2013

Some users (clang, libTooling) require this. After this patch we can remove
the calls to getenv("PWD") from clang.

llvm-svn: 188125

6ee16387

[objc-arc] Track if we encountered an additive overflow while computing... · d6ce6cbd

Michael Gottesman authored Aug 09, 2013

[objc-arc] Track if we encountered an additive overflow while computing {TopDown,BottomUp}PathCounts and do nothing if it occurred.

I fixed the aforementioned problems that came up on some of the linux boxes.
Major thanks to Nick Lewycky for his help debugging!

rdar://14590914

llvm-svn: 188122

d6ce6cbd

Kill some duplicated code for removing unreachable BBs. · 32090aba

Peter Collingbourne authored Aug 09, 2013

This moves removeUnreachableBlocksFromFn from SimplifyCFGPass.cpp
to Utils/Local.cpp and uses it to replace the implementation of
llvm::removeUnreachableBlocks, which appears to do a strict subset
of what removeUnreachableBlocksFromFn does.

Differential Revision: http://llvm-reviews.chandlerc.com/D1334

llvm-svn: 188119

32090aba

Aug 09, 2013

DataFlowSanitizer: Remove unreachable BBs so IR continues to verify · ae66d57b
Peter Collingbourne authored Aug 09, 2013
```
under the args ABI.

Differential Revision: http://llvm-reviews.chandlerc.com/D1316

llvm-svn: 188113
```
ae66d57b

Add another intrinsic that LLVM gives an incorrect prototype to. · be316cff

Reed Kotler authored Aug 09, 2013

I need to go through all the runtime routine list and see if there
are any more I need to add for mips16 floating point. Prototypes must
be correct or else I don't know to add a helper function call.

llvm-svn: 188106

be316cff

[stackprotector] Simplify SP Pass so that we emit different fail basic blocks... · 8afcf3a4

Michael Gottesman authored Aug 09, 2013

[stackprotector] Simplify SP Pass so that we emit different fail basic blocks for each fail condition.

This patch decouples the stack protector pass so that we can support stack
protector implementations that do not use the IR level generated stack protector
fail basic block.

No codesize increase is caused by this change since the MI level tail merge pass
properly merges together the fail condition blocks (see the updated test).

llvm-svn: 188105

8afcf3a4

Mark obviously const methods. Also use reference for parameters when possible. · 23ec6a97
Jakub Staszak authored Aug 09, 2013
```
llvm-svn: 188103
```
23ec6a97

Add a overload to CostTable which allows it to infer the size of the table. · 21585fd9

Benjamin Kramer authored Aug 09, 2013

Use it to avoid repeating ourselves too often. Also store MVT::SimpleValueType
in the TTI tables so they can be statically initialized, MVT's constructors
create bloated initialization code otherwise.

llvm-svn: 188095

21585fd9

DebugInfo: provide the ability to add members to a class after it has been constructed · f103c2f9

David Blaikie authored Aug 09, 2013

This is necessary to allow Clang to only emit implicit members when
there is code generated for them, rather than whenever they are ODR
used.

llvm-svn: 188082

f103c2f9

Make helper static and fix formatting. · df03449a
Benjamin Kramer authored Aug 09, 2013
```
llvm-svn: 188074
```
df03449a

This fixes the Thumb2 CPS assembly syntax. · 4c2801f7

Mihai Popa authored Aug 09, 2013

In Thumb1, only one variant is supported: CPS{effect} {flags}

Thumb2 supports three:
CPS{effect}.W {flags}
CPS{effect} {flags} {mode}
CPS {mode}

Canonically, .W should be used only when ambiguity is present between encodings of different width.
The wide suffix is still accepted for the latter two forms via aliases.

llvm-svn: 188071

4c2801f7

Fix assembling of Thumb2 branch instructions. · ad18d3ce

Mihai Popa authored Aug 09, 2013

The long encoding for Thumb2 unconditional branches is broken.
Additionally, there is no range checking for target operands; as such
for instructions originating in assembly code, only short Thumb encodings
are generated, regardless of the bitsize needed for the offset.

Adding range checking is non trivial due to the representation of Thumb
branch instructions. There is no true difference between conditional and
unconditional branches in terms of operands and syntax - even unconditional
branches have a predicate which is expected to match that of the IT block
they are in. Yet, the encodings and the permitted size of the offset differ.

Due to this, for any mnemonic there are really 4 encodings to choose for.

The problem cannot be handled in the parser alone or by manipulating td files.
Because the parser builds first a set of match candidates and then checks them
one by one, whatever tablegen-only solution might be found will ultimately be
dependent of the parser's evaluation order. What's worse is that due to the fact
that all branches have the same syntax and the same kinds of operands, that
order is governed by the lexicographical ordering of the names of operand
classes...

To circumvent all this, any necessary disambiguation is added to the instruction
validation pass.

llvm-svn: 188067

ad18d3ce