Commits · 12982a816c4be8702202cfcf5c3a83a29f70b5b9 · Roger Ferrer / llvm-epi

Nov 11, 2015

[X86] Replace LEAs with INC/DEC when profitable · 12982a81

Michael Kuperstein authored Nov 11, 2015

If possible and profitable, replace lea %reg, 1(%reg) and lea %reg, -1(%reg) with inc %reg and dec %reg respectively.

Patch by: anton.nadolsky@intel.com
Differential Revision: http://reviews.llvm.org/D14059

llvm-svn: 252722

12982a81

[ASan] Allow -fsanitize-recover=address. · 5bfeca12
Yury Gribov authored Nov 11, 2015
```
    
Differential Revision: http://reviews.llvm.org/D14243

llvm-svn: 252721
```
5bfeca12

Make test/Driver/biarch.c use FileCheck instead of grep · efb0c378

Artyom Skrobov authored Nov 11, 2015

Summary:
For clarity and ease of maintenance, I suggest porting this test
to use the same tooling as the rest of the tests.

Reviewers: joerg, rengolin, dougk, yaron.keren

Subscribers: cfe-commits

Differential Revision: http://reviews.llvm.org/D14548

llvm-svn: 252720

efb0c378

[ASan] Enable optional ASan recovery. · d7731988
Yury Gribov authored Nov 11, 2015
```
Differential Revision: http://reviews.llvm.org/D14242

llvm-svn: 252719
```
d7731988
Don't pass a member variable to a method. NFC. · 8e37f791
Rafael Espindola authored Nov 11, 2015
```
llvm-svn: 252718
```
8e37f791

Move relocate to the base class. · 9a6e4632

Rafael Espindola authored Nov 11, 2015

This is in preparation for adding .eh_frame support. They will have
another input section type but will also need to be relocated.

llvm-svn: 252717

9a6e4632

Simplify. NFC. · c240e366
Rafael Espindola authored Nov 11, 2015
```
llvm-svn: 252716
```
c240e366

sanitizer: speedup coverage by 33% · e38d3c8f

Dmitry Vyukov authored Nov 11, 2015

Atomic RMW is not necessary in InitializeGuardArray.
It is supposed to run when no user code runs.
And if user code runs concurrently, then the atomic
RMW won't help anyway. So replace it with non-atomic RMW.

InitializeGuardArray takes more than 50% of time during re2 fuzzing:

real	0m47.215s
51.56% a.out a.out [.] __sanitizer_reset_coverage

6.68%  a.out  a.out                [.] __sanitizer_cov
3.41%  a.out  a.out                [.] __sanitizer::internal_bzero_aligned16(void*, unsigned long)
1.79%  a.out  a.out                [.] __asan::Allocator::Allocate(unsigned long, unsigned long,
With this change:

real 0m31.661s
26.21% a.out a.out [.] sanitizer_reset_coverage
10.12% a.out a.out [.] sanitizer_cov

5.38%  a.out  a.out                [.] __sanitizer::internal_bzero_aligned16(void*, unsigned long)
2.53%  a.out  a.out                [.] __asan::Allocator::Allocate(unsigned long, unsigned long,
That's 33% speedup.

Reviewed in http://reviews.llvm.org/D14537

llvm-svn: 252715

e38d3c8f

test: Shorten test case to reduce 'make polly-check' time · 56e3fefb

Tobias Grosser authored Nov 11, 2015

Thinking more about the last commit I came to realize that for testing the
new functionality it is sufficient to verify that the iteration domains
we construct for a simple test case do not contain any of the complexity that
caused compile time issues for larger inputs.

llvm-svn: 252714

56e3fefb

ScopInfo: Pass domain constraints through error blocks · b76cd3cc

Tobias Grosser authored Nov 11, 2015

Previously, we just skipped error blocks during scop construction. With
this change we make sure we can construct domains for error blocks such that
these domains can be forwarded to subsequent basic blocks.

This change ensures that basic blocks that post-dominate and are dominated by
a basic block that branches to an error condition have the very same iteration
domain as the branching basic block. Before, this change we would construct
a domain that excludes all error conditions. Such domains could become _very_
complex and were undesirable to build.

Another solution would have been to drop these constraints using a
dominance/post-dominance check instead of modeling the error blocks. Such
a solution could also work in case of unreachable statements or infinite
loops in the scop. However, as we currently (to my believe incorrectly) model
unreachable basic blocks in the post-dominance tree, such a solution is not
yet feasible and requires first a change to LLVM's post-dominance tree
construction.

This commit addresses the most sever compile time issue reported in:
http://llvm.org/PR25458

llvm-svn: 252713

b76cd3cc

[X86] Add 'pause' builtin that's already in llvm and use it instead of inline... · fb79b5f2
Craig Topper authored Nov 11, 2015
```
[X86] Add 'pause' builtin that's already in llvm and use it instead of inline assembly to implement _mm_pause.

llvm-svn: 252712
```
fb79b5f2

[X86] Use __builtin_ia32_paddq and __builtin_ia32_psubq to implement a couple... · a5455524

Craig Topper authored Nov 11, 2015

[X86] Use __builtin_ia32_paddq and __builtin_ia32_psubq to implement a couple intrinsics that were supposed to operate on MMX registers. Otherwise we end up operating on GPRs. Throw in a test for _mm_mul_su32 while I was there.

llvm-svn: 252711

a5455524

[X86] Header formatting fixes. NFC · 880f60b7
Craig Topper authored Nov 11, 2015
```
llvm-svn: 252710
```
880f60b7
[X86] Fix feature flags on some MMX register instructions that really were... · b24a58e2
Craig Topper authored Nov 11, 2015
```
[X86] Fix feature flags on some MMX register instructions that really were introduced with SSE or SSE2.

llvm-svn: 252709
```
b24a58e2
[X86] Remove redundant MMX isel patterns. · 700a1a23
Craig Topper authored Nov 11, 2015
```
llvm-svn: 252708
```
700a1a23
Marked test_qRegisterInfo_returns_{one_valid_result,all_valid_results} XFAIL on Darwin. · fa93eb8f
Todd Fiala authored Nov 11, 2015
```
Tracked by:
https://llvm.org/bugs/show_bug.cgi?id=25486

llvm-svn: 252707
```
fa93eb8f

[FIX] Cast pre-loaded values correctly or reload them with adjusted type. · dcfedf35

Johannes Doerfert authored Nov 11, 2015

Especially for structs, the SAI object of a base pointer does not
describe all the types that the user might expect when he loads from
that base pointer. While we will still cast integers and pointers we
will now reload the value with the correct type if floating point and
non-floating point values are involved. However, there are now TODOs
where we use bitcasts instead of a proper conversion or reloading.

This fixes bug 25479.

llvm-svn: 252706

dcfedf35

[libFuzzer] better links · 240a159b
Kostya Serebryany authored Nov 11, 2015
```
llvm-svn: 252705
```
240a159b
[libFuzzer] more trophies · 65e7126f
Kostya Serebryany authored Nov 11, 2015
```
llvm-svn: 252704
```
65e7126f

Bump up test timeout interval on Darwin from 4 to 6 minutes. · 88722f71

Todd Fiala authored Nov 11, 2015

We have several tests that TIMEOUT under heavy load but just need a bit
more time to complete.

llvm-svn: 252703

88722f71

Mark TestCompletion.py test_symbol_name_dwarf XFAIL on Darwin. · f4500f0e

Todd Fiala authored Nov 11, 2015

This test fails most of the time when run under heavy load.  The dsym
variant doesn't seem to be failing.

Tracking XFAIL marker with:
https://llvm.org/bugs/show_bug.cgi?id=25485

llvm-svn: 252702

f4500f0e

[FIX] Create empty invariant equivalence classes · fc4bfc46

Johannes Doerfert authored Nov 11, 2015

  We now create all invariant equivalence classes for required invariant loads
  instead of creating them on-demand. This way we can check if a parameter
  references an invariant load that is actually not executed and was therefor
  not materialized. If that happens the parameter is not materialized either.

This fixes bug 25469.

llvm-svn: 252701

fc4bfc46

[X86] Add missing typecasts in intrinsic macros. This should make them more... · d619eaaa

Craig Topper authored Nov 11, 2015

[X86] Add missing typecasts in intrinsic macros. This should make them more robust against inputs that aren't already the right type.

llvm-svn: 252700

d619eaaa

Mark TestTerminal.py as XFAIL on OS X. · b9b0c9c6

Todd Fiala authored Nov 11, 2015

See the following tracking bug:
https://llvm.org/bugs/show_bug.cgi?id=25484

llvm-svn: 252699

b9b0c9c6

lit: Show all output with --show-all, even in combination with --succinct · 5bcb152f
Matthias Braun authored Nov 11, 2015
```
I missed an earlier exit for the --succinct case when I introduced the
-a option.

llvm-svn: 252698
```
5bcb152f

[X86] Change pointer type in AVX2 gather builtins to be the scalar type... · 19744ee6

Craig Topper authored Nov 11, 2015

[X86] Change pointer type in AVX2 gather builtins to be the scalar type instead of the vector type. This matches gcc and removes extras casts.

llvm-svn: 252697

19744ee6

Implement `internal_start/join_thread` on Mac OS X · 26f70505

Ismail Pazarbasi authored Nov 11, 2015

Summary: Depends on D9637

Test Plan:

Reviewers: kcc, glider, samsonov

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D9638

llvm-svn: 252696

26f70505

[tsan] Pass correct interposed function prefix to report function · fcb8c7e4

Ismail Pazarbasi authored Nov 11, 2015

Summary:
On Darwin, interposed functions are prefixed with "wrap_". On Linux,
they are prefixed with "__interceptor_".

Reviewers: dvyukov, samsonov, glider, kcc, kubabrecka

Subscribers: zaks.anna, llvm-commits

Differential Revision: http://reviews.llvm.org/D14512

llvm-svn: 252695

fcb8c7e4

ADT: Avoid relying on UB in ilist_node::getNextNode() · 4042d91b

Duncan P. N. Exon Smith authored Nov 11, 2015

Re-implement `ilist_node::getNextNode()` and `getPrevNode()` without
relying on the sentinel having a "next" pointer.  Instead, get access to
the owning list and compare against the `begin()` and `end()` iterators.

This only works when the node *can* get access to the owning list.  The
new support is in `ilist_node_with_parent<>`, and any class `Ty`
inheriting from `ilist_node<NodeTy>` that wants `getNextNode()` and/or
`getPrevNode()` should inherit from
`ilist_node_with_parent<NodeTy, ParentTy>` instead.  The requirements:

  - `NodeTy` must have a `getParent()` function that returns the parent.
  - `ParentTy` must have a `getSublistAccess()` static that, given a(n
    ignored) `NodeTy*` (to determine which list), returns a member field
    pointer to the appropriate `ilist<>`.

This isn't the cleanest way to get access to the owning list, but it
leverages the API already used in the IR hierarchy (see, e.g.,
`Instruction::getSublistAccess()`).

If anyone feels like ripping out the calls to `getNextNode()` and
`getPrevNode()` and replacing with direct iterator logic, they can also
remove the access function, etc., but as an incremental step, I'm
maintaining the API where it's currently used in tree.

If these requirements are *not* met, call sites with access to the ilist
can call `iplist<NodeTy>::getNextNode(NodeTy*)` directly, as in
ilistTest.cpp.

Why rewrite this?

The old code was broken, calling `getNext()` on a sentinel that possibly
didn't have a "next" pointer at all!  The new code avoids that
particular flavour of UB (see the commit message for r252538 for more
details about the "lucky" memory layout that made this function so
interesting).

There's still some UB here: the end iterator gets downcast to `NodeTy*`,
even when it's a sentinel (which is typically
`ilist_half_node<NodeTy*>`).  I'll tackle that in follow-up commits.
See this llvm-dev thread for more details:
http://lists.llvm.org/pipermail/llvm-dev/2015-October/091115.html

What's the danger?

There might be some code that relies on `getNextNode()` or
`getPrevNode()` *never* returning `nullptr` -- i.e., that relies on them
being broken when the sentinel is an `ilist_half_node<NodeTy>`.  I tried
to root out those cases with the audits I did leading up to r252380, but
it's possible I missed one or two.  I hope not.

(If (1) you have out-of-tree code, (2) you've reverted r252380
temporarily, and (3) you get some weird crashes with this commit, then I
recommend un-reverting r252380 and auditing the compile errors looking
for "strange" implicit conversions.)

llvm-svn: 252694

4042d91b

Reorder the check strings in test case following r252692. · 25aa9f09
Akira Hatanaka authored Nov 11, 2015
```
rdar://problem/19836465

llvm-svn: 252693
```
25aa9f09

Sort the enums in Attributes.h in case insensitive alphabetical order. · 5dda5926

Akira Hatanaka authored Nov 11, 2015

Sort the enums in preparation for moving the attributes to a table-gen
file.

rdar://problem/19836465

llvm-svn: 252692

5dda5926

Fix a FIXME about using std::is_sorted. · ed60b436
Eric Christopher authored Nov 11, 2015
```
llvm-svn: 252691
```
ed60b436

Add support for GCC's '__auto_type' extension, per the GCC manual: · e301ba2b

Richard Smith authored Nov 11, 2015

https://gcc.gnu.org/onlinedocs/gcc/Typeof.html

Differences from the GCC extension:
 * __auto_type is also permitted in C++ (but only in places where
   it could appear in C), allowing its use in headers that might
   be shared across C and C++, or used from C++98
 * __auto_type can be combined with a declarator, as with C++ auto
   (for instance, "__auto_type *p")
 * multiple variables can be declared in a single __auto_type
   declaration, with the C++ semantics (the deduced type must be
   the same in each case)

This patch also adds a missing restriction on applying typeof to
a bit-field, which GCC has historically rejected in C (due to
lack of clarity as to whether the operand should be promoted).
The same restriction also applies to __auto_type in C (in both
GCC and Clang).

This also fixes PR25449.

Patch by Nicholas Allegra!

llvm-svn: 252690

e301ba2b

Disabling sancov.cc on windows. · a543f77b

Mike Aizatsky authored Nov 11, 2015

It can't be built due to cxxabi missing. Will
fix later.

Differential Revision: http://reviews.llvm.org/D14559

llvm-svn: 252689

a543f77b

N3922: direct-list-initialization of an auto-typed variable no longer deduces a · 42b10572

Richard Smith authored Nov 11, 2015

std::initializer_list<T> type. Instead, the list must contain a single element
and the type is deduced from that.

In Clang 3.7, we warned by default on all the cases that would change meaning
due to this change. In Clang 3.8, we will support only the new rules -- per
the request in N3922, this change is applied as a Defect Report against earlier
versions of the C++ standard.

This change is not entirely trivial, because for lambda init-captures we
previously did not track the difference between direct-list-initialization and
copy-list-initialization. The difference was not previously observable, because
the two forms of initialization always did the same thing (the elements of the
initializer list were always copy-initialized regardless of the initialization
style used for the init-capture).

llvm-svn: 252688

42b10572

[WebAssembly] Support non-legal argument and return types. · 754cd11d
Dan Gohman authored Nov 11, 2015
```
llvm-svn: 252687
```
754cd11d
[elf2] Add support for local TLS symbols. · dc9c5df5
Michael J. Spencer authored Nov 11, 2015
```
llvm-svn: 252686
```
dc9c5df5
[elf2][x86-64] Add support for DTPOFF64 · ac2307b9
Michael J. Spencer authored Nov 11, 2015
```
llvm-svn: 252685
```
ac2307b9
[elf2][x86-64] Add support for DTPOFF32 · a5d9d1f1
Michael J. Spencer authored Nov 11, 2015
```
llvm-svn: 252684
```
a5d9d1f1

Sancov in C++. · 125e636b

Mike Aizatsky authored Nov 11, 2015

Summary:
First batch of sancov.py rewrite in C++.
Supports "-print" and "-covered_functions" commands.

Differential Revision: http://reviews.llvm.org/D14356

llvm-svn: 252683

125e636b