Commits · f0e58816e48c0bd0ddd97b87cb54b096feb8a8a8 · Roger Ferrer / llvm-epi

Jun 30, 2017

CREDITS.TXT: Update myself. · f0e58816
NAKAMURA Takumi authored Jun 30, 2017
```
llvm-svn: 306818
```
f0e58816
[X86] Updated 32-bit memcmp tests to run with/without SSE2 · e5e92322
Simon Pilgrim authored Jun 30, 2017
```
llvm-svn: 306816
```
e5e92322
Revert of r306525: "Canonicalize clamp of float types to minmax" · bde9b14c
Nikolai Bozhenov authored Jun 30, 2017
```
llvm-svn: 306815
```
bde9b14c

[YAML] - Teach yaml2obj/obj2yaml to work with numeric relocation values. · 892c6c86

George Rimar authored Jun 30, 2017

That may be useful if we want to produce or parse object containing
broken relocation values using yaml2obj/obj2yaml.

Previously that was impossible because only enum values were parsed
correctly, this patch allows to put any numeric value as a
relocation type.

Differential revision: https://reviews.llvm.org/D34758

llvm-svn: 306814

892c6c86

[DWARF] - Simplify HandleExpectedError implementation in DWARFDebugInfoTest · d8508b0a

George Rimar authored Jun 30, 2017

Current implementation looks a bit confusing. It looks like it should
report/print something on error, but it does not do that.
It silently drops a error message when creating triple, though
this behavior is fine generally.

For example if LLVM configured with -DLLVM_TARGETS_TO_BUILD=ARM and
our host is windows, it is expected that we will be unable to
create "i386-pc-windows-msvc" target.

Patch introduces isConfigurationSupported() function that checks
if current configuration is supported for each test and returns early if not.

llvm-svn: 306812

d8508b0a

Fixed misplaced table border in the docs. · 47092752
Ilya Biryukov authored Jun 30, 2017
```
llvm-svn: 306811
```
47092752

Added Dockerfiles to build clang from sources. · af351dae

Ilya Biryukov authored Jun 30, 2017

Reviewers: klimek, chandlerc, mehdi_amini

Reviewed By: klimek, mehdi_amini

Subscribers: mehdi_amini, jlebar, llvm-commits

Differential Revision: https://reviews.llvm.org/D34197

llvm-svn: 306810

af351dae

fix trivial typos, NFC · a89d4b5f
Hiroshi Inoue authored Jun 30, 2017
```
llvm-svn: 306808
```
a89d4b5f

[GlobalISel] Make multi-step legalization work. · b539ea53

Kristof Beyls authored Jun 30, 2017

In r301116, a custom lowering needed to be introduced to be able to
legalize 8 and 16-bit divisions on ARM targets without a division
instruction, since 2-step legalization (WidenScalar from 8 bit to 32
bit, then Libcall the 32-bit division) doesn't work.

This fixes this and makes this kind of multi-step legalization, where
first the size of the type needs to be changed and then some action is
needed that doesn't require changing the size of the type,
straighforward to specify.

Differential Revision: https://reviews.llvm.org/D32529

llvm-svn: 306806

b539ea53

[LV] Optimize for size when vectorizing loops with tiny trip count · 8d26f0a6

Ayal Zaks authored Jun 30, 2017

It may be detrimental to vectorize loops with very small trip count, as various
costs of the vectorized loop body as well as enclosing overheads including
runtime tests and scalar iterations may outweigh the gains of vectorizing. The
current cost model measures the cost of the vectorized loop body only, expecting
it will amortize other costs, and loops with known or expected very small trip
counts are not vectorized at all. This patch allows loops with very small trip
counts to be vectorized, but under OptForSize constraints, which ensure the cost
of the loop body is dominant, having no runtime guards nor scalar iterations.

Patch inspired by D32451.

Differential Revision: https://reviews.llvm.org/D34373

llvm-svn: 306803

8d26f0a6

[InstCombine] Add test cases to demonstrate failure to fold (a | b) ^ (~a |... · 97cd0173

Craig Topper authored Jun 30, 2017

[InstCombine] Add test cases to demonstrate failure to fold (a | b) ^ (~a | ~b) --> ~(a ^ b) and its commuted variants.

llvm-svn: 306801

97cd0173

[InstCombine] In foldXorToXor, move the commutable matcher from the LHS match... · 880bf826

Craig Topper authored Jun 30, 2017

[InstCombine] In foldXorToXor, move the commutable matcher from the LHS match to the RHS match. No meaningful change intended.

There are two conditions ORed here with similar checks and each contain two matches that must be true for the if to succeed. With the commutable match on the first half of the OR then both ifs basically have the same first part and only the second part distinguishs. With this change we move the commutable match to second half and make the first half unique.

This caused some tests to change because we now produce a commuted result, but this shouldn't matter in practice.

llvm-svn: 306800

880bf826

fix trivial typo; NFC · c3969644
Hiroshi Inoue authored Jun 30, 2017
```
llvm-svn: 306798
```
c3969644

Remove the BBVectorize pass. · 3545a9e1

Chandler Carruth authored Jun 30, 2017

It served us well, helped kick-start much of the vectorization efforts
in LLVM, etc. Its time has come and past. Back in 2014:
http://lists.llvm.org/pipermail/llvm-dev/2014-November/079091.html

Time to actually let go and move forward. =]

I've updated the release notes both about the removal and the
deprecation of the corresponding C API.

llvm-svn: 306797

3545a9e1

[llvm-readobj] Improve printouts for COFF ARM64 binaries · 43c85453
Martin Storsjö authored Jun 30, 2017
```
Differential Revision: https://reviews.llvm.org/D34835

llvm-svn: 306795
```
43c85453

[llvm-readobj] Include the PE magic value in printouts · 8ae07ac8

Martin Storsjö authored Jun 30, 2017

This is useful for a testcase in lld.

Differential Revision: https://reviews.llvm.org/D34836

llvm-svn: 306794

8ae07ac8

Revert "r306541 - Add zero-length check to memcpy/memset load store loop expansion" · 3b704ceb
Daniel Jasper authored Jun 30, 2017
```
Segfaults in non-optimized builds. I'll get a stack trace and a
reproducer to Teresa.

llvm-svn: 306793
```
3b704ceb
Revert "r306473 - re-commit r306336: Enable vectorizer-maximize-bandwidth by default." · 5ce1ce74
Daniel Jasper authored Jun 30, 2017
```
This still breaks PPC tests we have. I'll forward reproduction
instructions to dehao.

llvm-svn: 306792
```
5ce1ce74

Rewrite demangle memory handling. · dbb92cad

Eric Christopher authored Jun 30, 2017

The return of itaniumDemangle is allocated with malloc rather than new[]
and so using unique_ptr isn't called for here. As a note for the future
we should rewrite it to do this.

llvm-svn: 306788

dbb92cad

[SCEV] Use depth limit instead of local cache for SExt and ZExt · 8d0322e6

Max Kazantsev authored Jun 30, 2017

In rL300494 there was an attempt to deal with excessive compile time on
invocations of getSign/ZeroExtExpr using local caching. This approach only
helps if we request the same SCEV multiple times throughout recursion. But
in the bug PR33431 we see a case where we request different values all the time,
so caching does not help and the size of the cache grows enormously.

In this patch we remove the local cache for this methods and add the recursion
depth limit instead, as we do for arithmetics. This gives us a guarantee that the
invocation sequence is limited and reasonably short.

Differential Revision: https://reviews.llvm.org/D34273

llvm-svn: 306785

8d0322e6

Try to appease a buildbot. · 3fde2d36

Vedant Kumar authored Jun 30, 2017

The failure is:
C:\ps4-buildslave2\llvm-clang-x86_64-expensive-checks-win\llvm\unittests\ProfileData\CoverageMappingTest.cpp(244):
error C2668: 'llvm::make_unique': ambiguous call to overloaded function

http://lab.llvm.org:8011/builders/llvm-clang-x86_64-expensive-checks-win/builds/3489/

llvm-svn: 306784

3fde2d36

Reduce indenting and clean up comparisons around sign bit. · a95aac37
Eric Christopher authored Jun 30, 2017
```
llvm-svn: 306781
```
a95aac37

Change the type of Undecorated to unique_ptr<char[]> since we're looking at a... · 5dcdc7a3

Eric Christopher authored Jun 30, 2017

Change the type of Undecorated to unique_ptr<char[]> since we're looking at a null terminated string and not a single character.

Fixes an error in tcmalloc sized delete checking.

llvm-svn: 306780

5dcdc7a3

Reduce the complexity of the signbit/branch test functions. · 710c1c8f
Eric Christopher authored Jun 30, 2017
```
llvm-svn: 306779
```
710c1c8f

[Dominators] Don't compute DFS InOut numbers eagerly. · 837755cf

Jakub Kuderski authored Jun 30, 2017

Summary:
DFS InOut numbers currently get eagerly computer upon DomTree construction. They are only needed to answer dome dominance queries and they get invalidated by updates and recalculations. Because of that, it is faster in practice to compute them lazily when they are actually needed.

Clang built without this patch takes 6m 45s to boostrap on my machine, and with the patch applied 6m 38s.

Reviewers: sanjoy, dberlin, chandlerc

Reviewed By: dberlin

Subscribers: davide, llvm-commits

Differential Revision: https://reviews.llvm.org/D34296

llvm-svn: 306778

837755cf

Add a C API section to the release notes. · bc02ef17
Eric Christopher authored Jun 30, 2017
```
llvm-svn: 306777
```
bc02ef17

[Coverage] Remove two overloads of CoverageMapping::load. NFC. · cc34e619

Vedant Kumar authored Jun 30, 2017

These overloads are essentially dead, and pose a maintenance cost
without adding any benefit. This is coming up now because I'd like to
experiment with changing the way we store coverage mapping data, and
would rather not have to fix up the old overloads while doing so.

Testing: check-{llvm,profile}, build clang.
llvm-svn: 306776

cc34e619

[WebAssembly] Add support for exception handling instructions · ac62b05d

Heejin Ahn authored Jun 30, 2017

Summary:
This adds backend support for throw, rethrow, try, and try_end instructions.
This needs the corresponding clang builtin support:
https://reviews.llvm.org/D34783
This follows the Wasm exception handling proposal in
https://github.com/WebAssembly/exception-handling/blob/master/proposals/Exceptions.md

Reviewers: sunfish, dschuff

Reviewed By: dschuff

Subscribers: jfb, sbc100, jgravelle-google

Differential Revision: https://reviews.llvm.org/D34826

llvm-svn: 306774

ac62b05d

[DWARF] Move a couple of member functions to the DWARFUnit baseclass. NFC. · e60147c8
Wolfgang Pieb authored Jun 30, 2017
```
Reviewer: dblaikie

Differential revision: https://reviews.llvm.org/D34765

llvm-svn: 306771
```
e60147c8

Unified logic for computing target ABI in backend and front end by moving this... · ee837a59

Eric Christopher authored Jun 30, 2017

Unified logic for computing target ABI in backend and front end by moving this common code to Support/TargetParser.

Modeled Triple::GNU after front end code (aapcs abi) and updated tests that expect apcs abi.

Based heavily on a patch by Ana Pazos!

llvm-svn: 306768

ee837a59

[GISel]: New Opcode G_FLOG/G_FLOG2 · 20f62070
Aditya Nandakumar authored Jun 29, 2017
```
https://reviews.llvm.org/D34837

llvm-svn: 306766
```
20f62070

Hook the sample PGO machinery in the new PM · 2f31d0d8

Dehao Chen authored Jun 29, 2017

Summary: This patch hooks up SampleProfileLoaderPass with the new PM.

Reviewers: chandlerc, davidxl, davide, tejohnson

Reviewed By: chandlerc, tejohnson

Subscribers: tejohnson, llvm-commits, sanjoy

Differential Revision: https://reviews.llvm.org/D34720

llvm-svn: 306763

2f31d0d8

To help readability of mightUseCTR pull out the inline asm handling support into a function. · 56f481b7
Eric Christopher authored Jun 29, 2017
```
llvm-svn: 306762
```
56f481b7
Make the PPCCTRLoops pass depend on being able to access the TargetMachine and... · b16eacf5
Eric Christopher authored Jun 29, 2017
```
Make the PPCCTRLoops pass depend on being able to access the TargetMachine and clean up accordingly.

llvm-svn: 306761
```
b16eacf5

Remove redundant copy in recurrences · 0e35ea3b

Taewook Oh authored Jun 29, 2017

Summary:
If there is a chain of instructions formulating a recurrence, commuting operands can help removing a redundant copy. In the following example code,

```
BB#1: ; Loop Header
  %vreg0<def> = COPY %vreg13<kill>; GR32:%vreg0,%vreg13
  ...

BB#6: ; Loop Latch
  %vreg2<def> = COPY %vreg15<kill>; GR32:%vreg2,%vreg15
  %vreg10<def,tied1> = ADD32rr %vreg1<kill,tied0>, %vreg0<kill>, %EFLAGS<imp-def,dead>; GR32:%vreg10,%vreg1,%vreg0
  %vreg3<def,tied1> = ADD32rr %vreg2<kill,tied0>, %vreg10<kill>, %EFLAGS<imp-def,dead>; GR32:%vreg3,%vreg2,%vreg10
  CMP32ri8 %vreg3, 10, %EFLAGS<imp-def>; GR32:%vreg3
  %vreg13<def> = COPY %vreg3<kill>; GR32:%vreg13,%vreg3
  JL_1 <BB#1>, %EFLAGS<imp-use,kill>
```

Existing two-address generation pass generates following code:

```
BB#1:
  %vreg0<def> = COPY %vreg13<kill>; GR32:%vreg0,%vreg13
  ...

BB#6:
    Predecessors according to CFG: BB#5 BB#4
  %vreg2<def> = COPY %vreg15<kill>; GR32:%vreg2,%vreg15
  %vreg10<def> = COPY %vreg1<kill>; GR32:%vreg10,%vreg1
  %vreg10<def,tied1> = ADD32rr %vreg10<tied0>, %vreg0<kill>, %EFLAGS<imp-def,dead>; GR32:%vreg10,%vreg0
  %vreg3<def> = COPY %vreg10<kill>; GR32:%vreg3,%vreg10
  %vreg3<def,tied1> = ADD32rr %vreg3<tied0>, %vreg2<kill>, %EFLAGS<imp-def,dead>; GR32:%vreg3,%vreg2
  CMP32ri8 %vreg3, 10, %EFLAGS<imp-def>; GR32:%vreg3
  %vreg13<def> = COPY %vreg3<kill>; GR32:%vreg13,%vreg3
  JL_1 <BB#1>, %EFLAGS<imp-use,kill>
  JMP_1 <BB#7>
```

This is suboptimal because the assembly code generated has a redundant copy at the end of #BB6 to feed %vreg13 to BB#1:

```
.LBB0_6:
  addl  %esi, %edi
  addl  %ebx, %edi
  cmpl  $10, %edi
  movl  %edi, %esi
  jl  .LBB0_1
```

This redundant copy can be elimiated by making instructions in the recurrence chain to compute the value "into" the register that actually holds the feedback value. In this example, this can be achieved by commuting %vreg0 and %vreg1 to compute %vreg10. With that change, code after two-address generation becomes

```
BB#1:
  %vreg0<def> = COPY %vreg13<kill>; GR32:%vreg0,%vreg13
  ...

BB#6: derived from LLVM BB %bb7
    Predecessors according to CFG: BB#5 BB#4
  %vreg2<def> = COPY %vreg15<kill>; GR32:%vreg2,%vreg15
  %vreg10<def> = COPY %vreg0<kill>; GR32:%vreg10,%vreg0
  %vreg10<def,tied1> = ADD32rr %vreg10<tied0>, %vreg1<kill>, %EFLAGS<imp-def,dead>; GR32:%vreg10,%vreg1
  %vreg3<def> = COPY %vreg10<kill>; GR32:%vreg3,%vreg10
  %vreg3<def,tied1> = ADD32rr %vreg3<tied0>, %vreg2<kill>, %EFLAGS<imp-def,dead>; GR32:%vreg3,%vreg2
  CMP32ri8 %vreg3, 10, %EFLAGS<imp-def>; GR32:%vreg3
  %vreg13<def> = COPY %vreg3<kill>; GR32:%vreg13,%vreg3
  JL_1 <BB#1>, %EFLAGS<imp-use,kill>
  JMP_1 <BB#7>
```

and the final assembly does not have redundant copy:

```
.LBB0_6:
  addl  %edi, %eax
  addl  %ebx, %eax
  cmpl  $10, %eax
  jl  .LBB0_1
```

Reviewers: qcolombet, MatzeB, wmi

Reviewed By: wmi

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D31821

llvm-svn: 306758

0e35ea3b

[ThinkLTO] Invoke build(Thin)?LTOPreLinkDefaultPipeline. · 66470691

Tim Shen authored Jun 29, 2017

Previously it doesn't actually invoke the designated new PM builder
functions.

This patch moves NameAnonGlobalPass out from PassBuilder, as Chandler
points out that PassBuilder is used for non-O0 builds, and for
optimizations only.

Differential Revision: https://reviews.llvm.org/D34728

llvm-svn: 306756

66470691

[CFLAA] Remove unneded function declaration. NFCI. · f6b3d211
Davide Italiano authored Jun 29, 2017
```
llvm-svn: 306754
```
f6b3d211

Jun 29, 2017
- [SLPVectorizer] Moving Entry->NeedToGather check out of inner loop, · f05c73c1
  Dinar Temirbulatov authored Jun 29, 2017
```
                since it is invariant there. NFCI.

llvm-svn: 306749
```
  f05c73c1
- Revert "[mips] Fix multiprecision arithmetic." · dede76f4
  Simon Dardis authored Jun 29, 2017
```
This reverts commit r305389. This broke chromium builds, so reverting
while I investigate further.

llvm-svn: 306741
```
  dede76f4
- [AArch64] Silence an unused variable warning in Release builds. NFC. · 4c1bc656
  Chad Rosier authored Jun 29, 2017
```
llvm-svn: 306738
```
  4c1bc656