Commits · d0ee66c2e9decd064b84399fb81616a01b6d1f22 · Lorenzo Albano / LLVM bpEVL

Dec 01, 2016

Move most EH from MachineModuleInfo to MachineFunction · d0ee66c2

Matthias Braun authored Dec 01, 2016

Recommitting r288293 with some extra fixes for GlobalISel code.

Most of the exception handling members in MachineModuleInfo are actually
per-function data (they talk about the "current function"), so it is
better to keep them in the function instead of the module.

This is a necessary step to have machine module passes work properly.

Also:
- Rename TidyLandingPads() to tidyLandingPads()
- Use doxygen member groups instead of "//===- EH ---"... so it is clear
  where a group ends.
- I had to add an ugly const_cast in two places in the AsmPrinter
  because the available MachineFunction pointers are const, but the code
  wants to call tidyLandingPads() in between
  (markFunctionEnd()/endFunction()).

Differential Revision: https://reviews.llvm.org/D27227

llvm-svn: 288405

d0ee66c2

[CodeGen][ARM] Make sure the value and type used to create a bitcast · 6c299ca6

Akira Hatanaka authored Dec 01, 2016

have the same size.

This fixes an assertion that is triggered when an address of a boolean
variable is passed to __builtin_arm_ldrex or __builtin_arm_strex.

rdar://problem/29269006

llvm-svn: 288404

6c299ca6

Handle empty strings when looking for a CFString's encoding. · 0134152c
Sean Callanan authored Dec 01, 2016
```
Should fix the bots.

llvm-svn: 288403
```
0134152c
Fix a bug with llvm-size and the -m option with multiple files not printing the file names. · 5997c948
Kevin Enderby authored Dec 01, 2016
```
llvm-svn: 288402
```
5997c948
Fix unused variable warning in Release builds. NFC. · 6a8704c1
Benjamin Kramer authored Dec 01, 2016
```
llvm-svn: 288401
```
6a8704c1

Fix module map to create a module for the configured header Config/abi-breaking.h · 9676d5ed

Mehdi Amini authored Dec 01, 2016

A client of a header that relies on ABI breaking should get the macro
exported there.
Before this, the unittest for Support/Error including Support/Error.h
didn't get the macro exported by the Support module, because the
latter only re-exports its submodules and included modules, not
textual headers.

Hopefully, it'll also fix the build with local submodule visibility,
since LLVM_Utils contains two submodules: ADT and Support. They
both include abi-breaking.h, which defines a symbol. The textual
inclusion led to a double definition of the symbol, which broke
the parent module.

Differential Revision: https://reviews.llvm.org/D27273

llvm-svn: 288400

9676d5ed

This change removes the dependency on DwarfDebug that was used for... · 35630c33

Greg Clayton authored Dec 01, 2016

This change removes the dependency on DwarfDebug that was used for DW_FORM_ref_addr by making a new DIEUnit class in DIE.cpp.

The DIEUnit class represents a compile or type unit and it owns the unit DIE as an instance variable. This allows anyone with a DIE to get the unit DIE, and then get back to its DIEUnit without adding any new ivars to the DIE class.

Why was this needed? The DIE class has an Offset that is always the CU-relative DIE offset, not the "offset in debug info section" as was commented in the header file (the comment has been corrected). This is great for performance because most DIE references are compile-unit relative, so most code that accessed the DIE's offset didn't need to convert it into a compile-unit-relative offset because it already was one. When we needed to emit a DW_FORM_ref_addr, though, we needed to find the absolute offset of the DIE by finding the DIE's compile/type unit. That unit class did have the absolute debug info/type offset, which could be added to the CU-relative offset to compute the absolute offset. With this change we can easily get back to a DIE's DIEUnit, which has this needed offset. Prior to this change, doing so required having a DwarfDebug and calling:

  DwarfCompileUnit *DwarfDebug::lookupUnit(const DIE *CU) const;

Now we can use the DIEUnit class to do so without needing DwarfDebug. All clients now use DIEUnit objects (the DwarfDebug stack and the DwarfLinker). A follow-on patch for the DWARF generator will also take advantage of this.
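
The offset arithmetic described above can be sketched roughly as follows. This is a simplified illustration, not the code from the patch: `Die`, `Unit`, `getUnit` and `getAbsoluteOffset` are hypothetical names, and the sketch stores an explicit `OwningUnit` pointer for readability, whereas the message says the real change adds no new ivars to the DIE class.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical, simplified stand-ins for the classes named in the message.
struct Unit;

struct Die {
  uint32_t CuRelativeOffset = 0; // offset from the start of the owning unit
  Die *Parent = nullptr;         // null for the unit DIE itself
  Unit *OwningUnit = nullptr;    // only set on the unit DIE (sketch-only field)
};

struct Unit {
  uint64_t SectionOffset = 0; // where this unit starts in the debug section
  Die UnitDie;                // the unit owns its root DIE
  Unit() { UnitDie.OwningUnit = this; }
  Unit(const Unit &) = delete; // copying would leave OwningUnit dangling
  Unit &operator=(const Unit &) = delete;
};

// Walk up to the root DIE, then hop to the unit that owns it.
inline const Unit *getUnit(const Die &D) {
  const Die *Cur = &D;
  while (Cur->Parent)
    Cur = Cur->Parent;
  return Cur->OwningUnit;
}

// Absolute offset = unit's section offset + CU-relative DIE offset.
inline uint64_t getAbsoluteOffset(const Die &D) {
  const Unit *U = getUnit(D);
  assert(U && "DIE is not attached to a unit");
  return U->SectionOffset + D.CuRelativeOffset;
}
```

With a unit at section offset 0x100 and a child DIE at CU-relative offset 0x2a, `getAbsoluteOffset` yields 0x12a, and no lookup table keyed by DIE is needed.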

Differential Revision: https://reviews.llvm.org/D27170

llvm-svn: 288399

35630c33

[SLP] Fixed cost model for horizontal reduction. · 62af7252

Alexey Bataev authored Dec 01, 2016

Currently, when the cost of scalar operations is evaluated, the vector
type is used for scalar operations. This patch fixes this issue and
fixes evaluation of the vector operations cost.
Several tests showed that the vector cost model is too optimistic. It
allowed vectorization of 8 or fewer add/fadd operations, though scalar
code is faster. Actually, vector code provides better performance only
for 16 or more operations.

Differential Revision: https://reviews.llvm.org/D26277

llvm-svn: 288398

62af7252

[clang] Implement support for -defsym assembler option · 358faec6

Mandeep Singh Grang authored Dec 01, 2016

Summary:
Adds support for -Wa,-defsym,abc=1 option.
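
As a rough illustration of what a `sym=value` argument such as `abc=1` carries, here is a minimal sketch of splitting and validating it. `DefSym` and `parseDefSym` are hypothetical names for this example only; the actual driver and llvm-mc code is not shown in this message and may validate differently.

```cpp
#include <cassert>
#include <cerrno>
#include <cstddef>
#include <cstdint>
#include <cstdlib>
#include <string>

// Hypothetical result type for a parsed "name=value" pair.
struct DefSym {
  std::string Name;
  int64_t Value;
};

// Returns false if Arg is not of the form <name>=<integer>.
inline bool parseDefSym(const std::string &Arg, DefSym &Out) {
  std::size_t Eq = Arg.find('=');
  if (Eq == std::string::npos || Eq == 0 || Eq + 1 == Arg.size())
    return false; // missing '=', empty name, or empty value
  const std::string ValueText = Arg.substr(Eq + 1);
  char *End = nullptr;
  errno = 0;
  // Base 0 accepts decimal, 0x-prefixed hex, and 0-prefixed octal.
  long long V = std::strtoll(ValueText.c_str(), &End, 0);
  assert(End != nullptr);
  if (*End != '\0' || errno != 0)
    return false; // trailing junk or out-of-range value
  Out.Name = Arg.substr(0, Eq);
  Out.Value = V;
  return true;
}
```

In this sketch `abc=1` parses to the name `abc` with value 1, while `abc`, `=1` and `abc=xyz` are rejected.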

Related llvm patch: https://reviews.llvm.org/D26214

Reviewers: rafael, t.p.northover, colinl, echristo, compnerd, rengolin

Subscribers: mehdi_amini

Tags: #clang-c

Differential Revision: https://reviews.llvm.org/D26213

llvm-svn: 288397

358faec6

[llvm] Implement support for -defsym assembler option · 32360071

Mandeep Singh Grang authored Dec 01, 2016

Summary:
Changes to llvm-mc to move common logic to separate function.

Related clang patch: https://reviews.llvm.org/D26213

Reviewers: rafael, t.p.northover, colinl, echristo, rengolin

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D26214

llvm-svn: 288396

32360071

[X86][SSE] Moved shuffle mask widening/narrowing helper functions earlier in the file. · 17d5b6b4
Simon Pilgrim authored Dec 01, 2016
```
Will be necessary for a future patch.

llvm-svn: 288395
```
17d5b6b4
swiftcc: Add an api to query whether a target ABI stores swifterror in a register · b0f2c339
Arnold Schwaighofer authored Dec 01, 2016
```
llvm-svn: 288394
```
b0f2c339
Fix VS2015 build of clang-format-vsix by using NuGet to pull in required assemblies · 043d9e36
Hans Wennborg authored Dec 01, 2016
```
Also added a gitignore to help track the right items to commit.

Patch by Antonio Maiorano <amaiorano@gmail.com>!

llvm-svn: 288393
```
043d9e36
[libFuzzer] add a test for r288389 (-rss_limit_mb=0 means no limit). · 09f4fa52
Kostya Serebryany authored Dec 01, 2016
```
llvm-svn: 288392
```
09f4fa52
[SystemZ] Fix fallout from r288374 · d36b31d0
Ulrich Weigand authored Dec 01, 2016
```
Avoid undefined behavior due to too-large shift count.

llvm-svn: 288391
```
d36b31d0

[AsmParser] Diagnose empty symbol for .set directive · cf26d563

Weiming Zhao authored Dec 01, 2016

Summary: Diagnose empty symbol to avoid hitting assertion in MCContext::getOrCreateSymbol

Reviewers: eli.friedman, rengolin

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D26728

llvm-svn: 288390

cf26d563

[libFuzzer] treat -rss_limit_mb=0 as no limit · dc6b8ca8
Kostya Serebryany authored Dec 01, 2016
```
llvm-svn: 288389
```
dc6b8ca8

[TableGen] Ignore fake args for parsing-related arg counts. · 8a36ace5

George Burgess IV authored Dec 01, 2016

We should complain about the following:

```
void foo() __attribute__((unavailable("a", "b")));
```

Instead, we currently just ignore "b". (...We also end up ignoring "a",
because we assume elsewhere that this attribute can only have 1 or 0
args.)

This happens because `unavailable` has a fake enum arg, and
`AttributeList::{getMinArgs,getMaxArgs}` include fake args in their
counts.
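
The counting rule can be sketched in isolation like this. The types and function names below are hypothetical and only model the idea; they are not the TableGen emitter or the `AttributeList` API.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical model of one declared attribute argument.
struct ArgSpec {
  bool Optional;
  bool Fake; // never written in source, so the parser must not count it
};

struct ArgCounts {
  std::size_t Min;
  std::size_t Max;
};

// Count only the arguments a user can actually write.
inline ArgCounts parsedArgCounts(const std::vector<ArgSpec> &Args) {
  ArgCounts C{0, 0};
  for (const ArgSpec &A : Args) {
    if (A.Fake)
      continue; // fake args do not affect parsing-related counts
    ++C.Max;
    if (!A.Optional)
      ++C.Min;
  }
  assert(C.Min <= C.Max);
  return C;
}

inline bool acceptsArgCount(const std::vector<ArgSpec> &Args, std::size_t N) {
  ArgCounts C = parsedArgCounts(Args);
  return N >= C.Min && N <= C.Max;
}
```

Modeling `unavailable` as one optional real argument plus one fake argument, this accepts zero or one written arguments and rejects two, which is the diagnostic the example above should trigger.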

llvm-svn: 288388

8a36ace5

Remove another hack from the android toolchain file · c962c6ac
Pavel Labath authored Dec 01, 2016
```
This is no longer an issue with recent versions of the android ndk.

llvm-svn: 288387
```
c962c6ac

Handle UTF-16 and UTF-32 constant CFStrings · cd1eb72b

Sean Callanan authored Dec 01, 2016

We have a longstanding issue where the expression parser does not handle wide CFStrings (e.g., @"凸凹") correctly, producing the useless error message

Internal error [IRForTarget]: An Objective-C constant string's string initializer is not an array
error: warning: expression result unused
error: The expression could not be prepared to run in the target

This is just a side effect of the fact that we don't handle wide string constants when converting these to CFStringCreateWithBytes. That function takes the string's encoding as an argument, so I made it work and added a testcase.

https://reviews.llvm.org/D27291
<rdar://problem/13190557>

llvm-svn: 288386

cd1eb72b

Recommit r287403 (reverted in r287804): [lit] When setting SDKROOT on Darwin,... · 93f12aff

Kuba Mracek authored Dec 01, 2016

Recommit r287403 (reverted in r287804): [lit] When setting SDKROOT on Darwin, use '--sdk macosx' to find the right SDK path.

This should now be safe and not break any more bots. It's strictly better to use '--sdk macosx'; otherwise xcrun can return weird things, for example when you have Command Line Tools or the SDK installed into '/'.

llvm-svn: 288385

93f12aff

Add a blank line to make sphinx happy. · 7636149f
Malcolm Parsons authored Dec 01, 2016
```
llvm-svn: 288384
```
7636149f

Protect sequences test under libcpp-no-exceptions · de344ac8

Roger Ferrer Ibanez authored Dec 01, 2016

Replace throw with TEST_THROW and protect tests that do throw. Also add missing assert(false).
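
For readers unfamiliar with the pattern, here is a minimal sketch of how a throw-wrapping macro and a guarded test can look. `SKETCH_NO_EXCEPTIONS` and `SKETCH_THROW` are hypothetical stand-ins chosen for this example; they are not the definitions used by the libc++ test suite.

```cpp
#include <cassert>
#include <cstdlib>
#include <stdexcept>

// Hypothetical stand-ins: throw when exceptions exist, otherwise abort.
#ifndef SKETCH_NO_EXCEPTIONS
#define SKETCH_THROW(...) throw __VA_ARGS__
#else
#define SKETCH_THROW(...) std::abort()
#endif

inline int checkedDivide(int A, int B) {
  if (B == 0)
    SKETCH_THROW(std::invalid_argument("division by zero"));
  return A / B;
}

// Returns true when the throwing path behaved as expected, or when the
// check was skipped because exceptions are unavailable.
inline bool testDivideByZero() {
#ifndef SKETCH_NO_EXCEPTIONS
  try {
    checkedDivide(1, 0);
    assert(false && "expected an exception"); // the easily forgotten check
  } catch (const std::invalid_argument &) {
    return true;
  }
  return false;
#else
  return true; // test body skipped in no-exceptions builds
#endif
}
```

The `assert(false)` after the call that should throw is what makes the test fail if no exception is raised, which is the kind of missing check the message mentions.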

Differential Revision: https://reviews.llvm.org/D27252

llvm-svn: 288383

de344ac8

Protect futures test under libcpp-no-exceptions · f3fce920

Roger Ferrer Ibanez authored Dec 01, 2016

Skip tests that expect an exception to be thrown.

Differential Revision: https://reviews.llvm.org/D27253

llvm-svn: 288382

f3fce920

[GVN, OptDiag] Print the interesting instructions involved in missed load-elimination · 4ddb8c01

Adam Nemet authored Dec 01, 2016

[recommitting after the fix in r288307]

This includes the intervening store and the load/store that we're trying
to forward from in the optimization remark for the missed load
elimination.

This is hooked up under a new mode in ORE that allows for compile-time
budget for a bit more analysis to print more insightful messages.  This
mode is currently enabled for -fsave-optimization-record (-Rpass is
trickier since it is controlled in the front-end).

With this we can now print the red remark in http://lab.llvm.org:8080/artifacts/opt-view_test-suite/build/SingleSource/Benchmarks/Dhrystone/CMakeFiles/dry.dir/html/_org_test-suite_SingleSource_Benchmarks_Dhrystone_dry.c.html#L446

Differential Revision: https://reviews.llvm.org/D26490

llvm-svn: 288381

4ddb8c01

[GVN, OptDiag] Include the value that is forwarded in load elimination · 8b5fba80

Adam Nemet authored Dec 01, 2016

[recommitting after the fix in r288307]

This requires some changes to the opt-diag API.  Hal and I have
discussed this at the Dev Meeting and came up with a streaming delimiter
(setExtraArgs) to solve this.

Arguments after this delimiter are only included in the optimization
records and not in the remarks printed in the compiler output.  (Note
how, in the test, the content of the YAML file changes but the remarks
on the compiler output don't.)
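
The delimiter idea can be illustrated with a toy streaming class. This is a self-contained model written for this example; the `Remark` class and its methods are hypothetical and do not reproduce the real opt-diag API, only the behavior of a `setExtraArgs`-style marker.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Tag type returned by the delimiter; streaming it flips the remark's mode.
struct ExtraArgsTag {};
inline ExtraArgsTag setExtraArgs() { return ExtraArgsTag(); }

// Toy remark: arguments streamed before the delimiter are visible in the
// compiler output, arguments after it only appear in the saved record.
class Remark {
  std::vector<std::string> Args;
  std::size_t NumVisible = 0; // number of leading args shown by the compiler
  bool SawDelimiter = false;

  std::string join(std::size_t Count) const {
    assert(Count <= Args.size());
    std::string Out;
    for (std::size_t I = 0; I != Count; ++I)
      Out += Args[I];
    return Out;
  }

public:
  Remark &operator<<(const std::string &Arg) {
    Args.push_back(Arg);
    if (!SawDelimiter)
      NumVisible = Args.size();
    return *this;
  }
  Remark &operator<<(ExtraArgsTag) {
    SawDelimiter = true;
    return *this;
  }
  std::string compilerMessage() const { return join(NumVisible); }
  std::string recordMessage() const { return join(Args.size()); }
};
```

Streaming `"load of type " << "i32" << " eliminated" << setExtraArgs() << " in favor of " << "7"` into this toy class gives a compiler message that stops at "eliminated", while the record message includes the forwarded value.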

This implements the green GVN message with a bug fix at line
http://lab.llvm.org:8080/artifacts/opt-view_test-suite/build/SingleSource/Benchmarks/Dhrystone/CMakeFiles/dry.dir/html/_org_test-suite_SingleSource_Benchmarks_Dhrystone_dry.c.html#L446

The fix is that now we properly include the constant value in the
message: "load of type i32 eliminated in favor of 7"

Differential Revision: https://reviews.llvm.org/D26489

llvm-svn: 288380

8b5fba80

Protect optional test under libcpp-no-exceptions · c9a8a559

Roger Ferrer Ibanez authored Dec 01, 2016

Replace throw with TEST_THROW and skip tests that throw exceptions

Differential Revision: https://reviews.llvm.org/D27254

llvm-svn: 288379

c9a8a559

Protect std::ostream::sentry test under libcpp-no-exceptions · 9f1bcb65
Roger Ferrer Ibanez authored Dec 01, 2016
```
Skip test that throws an exception.

Differential Revision: https://reviews.llvm.org/D27255

llvm-svn: 288378
```
9f1bcb65
[SLP] Additional tests with the cost of vector operations. · fc617690
Alexey Bataev authored Dec 01, 2016
```
llvm-svn: 288377
```
fc617690
[change-namespace] don't generate replacements for files that don't match file pattern. · c265b02b
Eric Liu authored Dec 01, 2016
```
Reviewers: hokein

Subscribers: cfe-commits

Differential Revision: https://reviews.llvm.org/D27302

llvm-svn: 288376
```
c265b02b

[clang-tidy] Rename modernize-use-default to modernize-use-equals-default · bcf23661

Malcolm Parsons authored Dec 01, 2016

Reviewers: angelgarcia, aaron.ballman, alexfh

Subscribers: JDevlieghere, Prazek, mgorny, cfe-commits

Differential Revision: https://reviews.llvm.org/D26511

llvm-svn: 288375

bcf23661

[SystemZ] Fix applyFixup for 12-bit fixups · 55082cdd

Ulrich Weigand authored Dec 01, 2016

Now that we have fixups that only fill parts of a byte, it turns
out we have to mask off the bits outside the fixup area when
applying them.  Failing to do so caused invalid object code to
be emitted for bprp with a negative 12-bit displacement.
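
A minimal sketch of the masking step, under an assumed layout: the fixup covers the low `BitSize` bits of `Size` big-endian bytes, and any higher bits in those bytes belong to other fields. `applyFixup` here is a hypothetical free function for illustration, not the SystemZ backend method.

```cpp
#include <cassert>
#include <cstdint>

// OR a BitSize-bit value into Size big-endian bytes without touching the
// bits above the fixup field (assumed layout, see the note above).
inline void applyFixup(uint8_t *Data, unsigned Size, unsigned BitSize,
                       int64_t Value) {
  assert(Size <= 8 && BitSize <= Size * 8);
  uint64_t Bits = static_cast<uint64_t>(Value);
  // A negative Value is sign-extended to 64 bits; mask it down to the
  // field so the extra 1 bits cannot spill into neighbouring fields.
  if (BitSize < 64)
    Bits &= (uint64_t(1) << BitSize) - 1;
  for (unsigned I = 0; I != Size; ++I) {
    unsigned Shift = (Size - 1 - I) * 8;
    Data[I] |= static_cast<uint8_t>(Bits >> Shift);
  }
}
```

For example, applying a 12-bit fixup of -2 to the bytes `A0 00` gives `AF FE`: the upper nibble `A` survives. Without the mask, the first byte would become `FF` and corrupt the adjacent field.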

llvm-svn: 288374

55082cdd

Parallelize ICF to make LLD's ICF really fast. · c1835319

Rui Ueyama authored Dec 01, 2016

ICF is short for Identical Code Folding. It is a size optimization that
identifies two or more functions that happen to have the same contents
and merges them. It usually reduces output size by a few percent.

ICF is slow because it is a computationally intensive process. I tried
to parallelize it before but failed because I couldn't make a
parallelized version produce consistent outputs. Although it didn't
create broken executables, every invocation of the linker generated
slightly different output, and I couldn't figure out why.

I think I now understand what was going on, and also came up with a
simple algorithm to fix it. This patch is the result.

The result is very exciting. Chromium for example has 780,662 input
sections, of which 20,774 are reducible by ICF. LLD previously took
7.980 seconds for ICF. Now it finishes in 1.065 seconds.

As a result, LLD can now link a Chromium binary (output size 1.59 GB)
in 10.28 seconds on my machine with ICF enabled. Compared to gold
which takes 40.94 seconds to do the same thing, this is an amazing
number.

From here, I'll describe what we are doing for ICF, what the previous
problem was, and what I did in this patch.

In ICF, two sections are considered identical if they have the same
section flags, section data, and relocations. Relocations are tricky,
because two relocations are considered the same if they have the same
relocation type, values, and if they point to the same section _in
terms of ICF_.

Here is an example. If foo and bar defined below are compiled to the
same machine instructions, ICF can (and should) merge the two,
although their relocations point to each other.

  void foo() { bar(); }
  void bar() { foo(); }

This is not an easy problem to solve.

What we are doing in LLD is a kind of coloring algorithm. We repeatedly
color non-identical sections using different colors, and sections
that have the same color when the algorithm terminates are considered
identical. Here are the details:

  1. First, we color all sections using their hash values of section
  types, section contents, and numbers of relocations. At this moment,
  relocation targets are not taken into account. We just color
  sections that apparently differ in different colors.

  2. Next, for each color C, we visit sections having color C to see
  if their relocations are the same. Relocations are considered equal
  if their targets have the same color. We then recolor sections that
  have different relocation targets in new colors.

  3. If we recolor some section in step 2, relocations that were
  previously pointing to the same color targets may now be pointing to
  different colors. Therefore, repeat 2 until a convergence is
  obtained.
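
The three steps can be sketched sequentially as below. This is a minimal model for illustration, not LLD's implementation: `Section`, `LocalHash` and `colorSections` are hypothetical names, `LocalHash` stands in for the hash of section type and contents, and each round recolors all sections by their (own color, target colors) signature instead of visiting one color at a time.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <utility>
#include <vector>

struct Section {
  uint64_t LocalHash;              // stand-in for type and contents hash
  std::vector<std::size_t> Relocs; // indices of relocation target sections
};

inline std::vector<uint32_t> colorSections(const std::vector<Section> &Secs) {
  // Step 1: color by local properties only, ignoring relocation targets.
  std::vector<uint32_t> Color(Secs.size());
  std::map<std::pair<uint64_t, std::size_t>, uint32_t> Initial;
  for (std::size_t I = 0; I != Secs.size(); ++I) {
    std::pair<uint64_t, std::size_t> Key(Secs[I].LocalHash,
                                         Secs[I].Relocs.size());
    uint32_t Fresh = static_cast<uint32_t>(Initial.size());
    Color[I] = Initial.emplace(Key, Fresh).first->second;
  }
  std::size_t NumColors = Initial.size();

  // Steps 2 and 3: split colors whose members disagree on target colors,
  // and repeat until no color is split any further.
  for (;;) {
    typedef std::pair<uint32_t, std::vector<uint32_t> > Signature;
    std::map<Signature, uint32_t> Ids;
    std::vector<uint32_t> Refined(Secs.size());
    for (std::size_t I = 0; I != Secs.size(); ++I) {
      Signature Sig;
      Sig.first = Color[I];
      for (std::size_t T : Secs[I].Relocs) {
        assert(T < Secs.size() && "relocation target out of range");
        Sig.second.push_back(Color[T]);
      }
      uint32_t Fresh = static_cast<uint32_t>(Ids.size());
      Refined[I] = Ids.emplace(std::move(Sig), Fresh).first->second;
    }
    Color.swap(Refined);
    if (Ids.size() == NumColors)
      break; // a round that splits nothing means convergence
    NumColors = Ids.size();
  }
  return Color;
}
```

Because every round can only split existing colors, an unchanged color count means the partition is stable. Two mutually recursive sections with the same local hash, like foo and bar above, keep the same color throughout.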

Step 2 is a heavy operation. For Chromium, the first iteration of step
2 takes 2.882 seconds, and the second iteration takes 1.038 seconds,
and in total it needs 23 iterations.

Parallelizing step 1 is easy because we can color each section
independently. This patch does that.

Parallelizing step 2 is tricky. We could work on each color
independently, but we cannot recolor sections in place, because it
will break the invariant that two possibly-identical sections must
have the same color at any moment.

Consider sections S1, S2, S3, S4 in the same color C, where S1 and S2
are identical, S3 and S4 are identical, but S2 and S3 are not. Thread
A is about to recolor S1 and S2 in C'. After thread A recolors S1 in
C', but before it recolors S2 in C', another thread B might observe S1
and S2. Then thread B will conclude that S1 and S2 are different, and it
will wrongly split thread B's sections into smaller groups.
Over-splitting doesn't produce broken results, but it loses a chance to
merge some identical sections. That was the cause of the nondeterminism.

To fix the problem, I made sections have two colors, namely current
color and next color. At the beginning of each iteration, both colors
are the same. Each thread reads from current color and writes to next
color. In this way, we can prevent threads from reading partial
results. After each iteration, we flip current and next.
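
A minimal sketch of the current/next idea follows. It is not the code from this patch: `Sec`, `splitGroup`, `refineRound` and `runIcf` are hypothetical names, the sketch is single-threaded, and it processes color groups in forward or reverse order to stand in for arbitrary thread scheduling. Using the smallest member index as the new color is also an assumption made for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <set>
#include <utility>
#include <vector>

struct Sec {
  uint32_t Color[2];               // Color[Cur] is read, Color[Cur ^ 1] written
  std::vector<std::size_t> Relocs; // indices of relocation target sections
};

// Split one color group. It reads only Color[Cur] and writes only the
// Color[Cur ^ 1] of its own members, so groups never see partial results.
inline void splitGroup(std::vector<Sec> &Secs,
                       const std::vector<std::size_t> &Group, int Cur) {
  std::map<std::vector<uint32_t>, uint32_t> NewColor;
  for (std::size_t I : Group) { // Group is in ascending index order
    std::vector<uint32_t> Key;
    for (std::size_t T : Secs[I].Relocs) {
      assert(T < Secs.size());
      Key.push_back(Secs[T].Color[Cur]);
    }
    // New color = smallest member index of the subgroup (unique overall).
    uint32_t Fresh = static_cast<uint32_t>(I);
    Secs[I].Color[Cur ^ 1] = NewColor.emplace(std::move(Key), Fresh).first->second;
  }
}

// One iteration of step 2. Returns true if some group was split.
inline bool refineRound(std::vector<Sec> &Secs, int Cur, bool Reverse) {
  std::map<uint32_t, std::vector<std::size_t> > Groups;
  for (std::size_t I = 0; I != Secs.size(); ++I)
    Groups[Secs[I].Color[Cur]].push_back(I);
  if (Reverse) {
    for (auto It = Groups.rbegin(); It != Groups.rend(); ++It)
      splitGroup(Secs, It->second, Cur);
  } else {
    for (auto &G : Groups)
      splitGroup(Secs, G.second, Cur);
  }
  std::set<uint32_t> NextColors;
  for (const Sec &S : Secs)
    NextColors.insert(S.Color[Cur ^ 1]);
  return NextColors.size() != Groups.size();
}

inline std::vector<uint32_t> runIcf(std::vector<Sec> Secs, bool Reverse) {
  int Cur = 0;
  while (refineRound(Secs, Cur, Reverse))
    Cur ^= 1; // flip: this round's "next" is the following round's "current"
  std::vector<uint32_t> Result;
  for (const Sec &S : Secs)
    Result.push_back(S.Color[Cur ^ 1]);
  return Result;
}
```

Since each group's work depends only on the frozen current colors, visiting the groups in either order produces the same final colors, which is the property that makes a threaded version deterministic.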

This is a very simple solution and is implemented in less than 50
lines of code.

I tested this patch with Chromium and confirmed that this parallelized
ICF produces the same output as the non-parallelized one.

Differential Revision: https://reviews.llvm.org/D27247

llvm-svn: 288373

c1835319

[analyzer] Drop explicit mention of range constraint solver · 1a154e08

Dominic Chen authored Dec 01, 2016

Summary: The basic constraint solver was dropped in rL162384, leaving the range constraint solver as the default and only constraint solver. Explicitly specifying it is unnecessary, and makes it difficult to test with other solver backends.

Reviewers: zaks.anna, dcoughlin

Subscribers: cfe-commits

Differential Revision: https://reviews.llvm.org/D26694

llvm-svn: 288372

1a154e08

Revert "[SLP] Additional tests with the cost of vector operations." · e59a8351
Alexey Bataev authored Dec 01, 2016
```
This reverts commit a61718435fc4118c82f8aa6133fd81f803789c1e.

llvm-svn: 288371
```
e59a8351

[GVN] Basic optimization remark support · 4d2a6e59

Adam Nemet authored Dec 01, 2016

[recommitting after the fix in r288307]

Follow-on patches will add more interesting cases.

The goal of this patch-set is to get the GVN messages printed in
opt-viewer from Dhrystone as was presented in my Dev Meeting talk.  This
is the optimization view for the function (the last remark in the
function has a bug which is fixed in this series):
http://lab.llvm.org:8080/artifacts/opt-view_test-suite/build/SingleSource/Benchmarks/Dhrystone/CMakeFiles/dry.dir/html/_org_test-suite_SingleSource_Benchmarks_Dhrystone_dry.c.html#L430

Differential Revision: https://reviews.llvm.org/D26488

llvm-svn: 288370

4d2a6e59

[SLP] Additional tests with the cost of vector operations. · 2ff76847
Alexey Bataev authored Dec 01, 2016
```
llvm-svn: 288369
```
2ff76847
fixed type in Windows-specific code · e0a2c3e9
Andrey Churbanov authored Dec 01, 2016
```
llvm-svn: 288368
```
e0a2c3e9

[X86][SSE] Classify AND bitmasks as variable shuffle masks · 5fe62360

Simon Pilgrim authored Dec 01, 2016

They are loading the bitmasks from the constant pool so the cost is similar to loading a shuffle mask.

llvm-svn: 288367

5fe62360

Adds hasUnqualifiedDesugaredType to allow matching through type sugar. · a37e110d
Manuel Klimek authored Dec 01, 2016
```
Differential Revision: https://reviews.llvm.org/D27207

llvm-svn: 288366
```
a37e110d