Commits · 4c85e72ad3288ea55f09924d73eb3c3f7aba8269 · Roger Ferrer / llvm-epi

Jan 25, 2019

Build LLVM-C.dll by default on windows and enable in release package · 4c85e72a

Hans Wennborg authored Jan 25, 2019

With the fixes to the building of LLVM-C.dll in D56781 this should now
be safe to land. This will greatly simplify dealing with LLVM for people
that just want to use the C API on windows. This is a follow up from
D35077.

Patch by Jakob Bornecrantz!

Differential revision: https://reviews.llvm.org/D56774

llvm-svn: 352250

4c85e72a

[NFC] Test commit : fix typo. · 31f47b81
Alexey Lapshin authored Jan 25, 2019
```
llvm-svn: 352248
```
31f47b81

[RISCV] Add target DAG combine for bitcast fabs/fneg on RV32FD · 0092df06

Alex Bradbury authored Jan 25, 2019

DAGCombiner::visitBITCAST will perform:
 fold (bitconvert (fneg x)) -> (xor (bitconvert x), signbit)
 fold (bitconvert (fabs x)) -> (and (bitconvert x), (not signbit))

As shown in double-bitmanip-dagcombines.ll, this can be advantageous. But
RV32FD doesn't use bitcast directly (as i64 isn't a legal type), and instead
uses RISCVISD::SplitF64. This patch adds an equivalent DAG combine for
SplitF64.

llvm-svn: 352247

0092df06

[llvm] Opt-in flag for X86DiscriminateMemOps · 519f42d9

Mircea Trofin authored Jan 25, 2019

Summary:
Currently, if an instruction with a memory operand has no debug information,
X86DiscriminateMemOps will generate one based on the first line of the
enclosing function, or the last seen debug info.

This may cause confusion in certain debugging scenarios. The long term
approach would be to use the line number '0' in such cases, however, that
brings in challenges: the base discriminator value range is limited
(4096 values).

For the short term, adding an opt-in flag for this feature.

See bug 40319 (https://bugs.llvm.org/show_bug.cgi?id=40319)

Reviewers: dblaikie, jmorse, gbedwell

Reviewed By: dblaikie

Subscribers: aprantl, eraman, hiraditya

Differential Revision: https://reviews.llvm.org/D57257

llvm-svn: 352246

519f42d9

[GlobalISel][AArch64][NFC] Fix incorrect comment in selectUnmergeValues · 1f9bc285
Jessica Paquette authored Jan 25, 2019
```
s/scalar/vector/

llvm-svn: 352243
```
1f9bc285
Revert rL352238. · a34bcbf3
Alina Sbirlea authored Jan 25, 2019
```
llvm-svn: 352241
```
a34bcbf3

[RISCV] Add another potential combine to {double,float}-bitmanip-dagcombines.ll · d760910d

Alex Bradbury authored Jan 25, 2019

(fcopysign a, (fneg b)) will be expanded to bitwise operations by
DAGTypeLegalizer::SoftenFloatRes_FCOPYSIGN if the floating point type isn't
legal. Arguably it might be worth doing a combine even if it is legal.

llvm-svn: 352240

d760910d

[WarnMissedTransforms] Set default to 1. · 890a8e57

Alina Sbirlea authored Jan 25, 2019

Summary:
Set default value for retrieved attributes to 1, since the check is against 1.
Eliminates the warning noise generated when the attributes are not present.

Reviewers: sanjoy

Subscribers: jlebar, llvm-commits

Differential Revision: https://reviews.llvm.org/D57253

llvm-svn: 352238

890a8e57

Reapply: [RISCV] Set isAsCheapAsAMove for ADDI, ORI, XORI, LUI · 05a60643
Ana Pazos authored Jan 25, 2019
```
This reapplies commit r352010 with RISC-V test fixes.

llvm-svn: 352237
```
05a60643

[MBP] Don't move bottom block before header if it can't reduce taken branches · 81f3fd4b

Guozhi Wei authored Jan 25, 2019

If bottom of block BB has only one successor OldTop, in most cases it is profitable to move it before OldTop, except the following case:

-->OldTop<-
|    .    |
|    .    |
|    .    |
---Pred   |
     |    |
    BB-----

Move BB before OldTop can't reduce the number of taken branches, this patch detects this case and prevent the moving.

Differential Revision: https://reviews.llvm.org/D57067

llvm-svn: 352236

81f3fd4b

[X86] Combine masked store and truncate into masked truncating stores. · 4cf28bad

Craig Topper authored Jan 25, 2019

We also need to combine to masked truncating with saturation stores, but I'm leaving that for a future patch.

This does regress some tests that used truncate wtih saturation followed by a masked store. Those now use a truncating store and use min/max to saturate.

Differential Revision: https://reviews.llvm.org/D57218

llvm-svn: 352230

4cf28bad

[HotColdSplit] Introduce a cost model to control splitting behavior · db3f9774

Vedant Kumar authored Jan 25, 2019

The main goal of the model is to avoid *increasing* function size, as
that would eradicate any memory locality benefits from splitting. This
happens when:

  - There are too many inputs or outputs to the cold region. Argument
    materialization and reloads of outputs have a cost.

  - The cold region has too many distinct exit blocks, causing a large
    switch to be formed in the caller.

  - The code size cost of the split code is less than the cost of a
    set-up call.

A secondary goal is to prevent excessive overall binary size growth.

With the cost model in place, I experimented to find a splitting
threshold that works well in practice. To make warm & cold code easily
separable for analysis purposes, I moved split functions to a "cold"
section. I experimented with thresholds between [0, 4] and set the
default to the threshold which minimized geomean __text size.

Experiment data from building LNT+externals for X86 (N = 639 programs,
all sizes in bytes):

| Configuration | __text geom size | __cold geom size | TEXT geom size |
| **-Os**       | 1736.3           | 0, n=0           | 10961.6        |
| -Os, thresh=0 | 1740.53          | 124.482, n=134   | 11014          |
| -Os, thresh=1 | 1734.79          | 57.8781, n=90    | 10978.6        |
| -Os, thresh=2 | ** 1733.85 **    | 65.6604, n=61    | 10977.6        |
| -Os, thresh=3 | 1733.85          | 65.3071, n=61    | 10977.6        |
| -Os, thresh=4 | 1735.08          | 67.5156, n=54    | 10965.7        |
| **-Oz**       | 1554.4           | 0, n=0           | 10153          |
| -Oz, thresh=2 | ** 1552.2 **     | 65.633, n=61     | 10176          |
| **-O3**       | 2563.37          | 0, n=0           | 13105.4        |
| -O3, thresh=2 | ** 2559.49 **    | 71.1072, n=61    | 13162.4        |

Picking thresh=2 reduces the geomean __text section size by 0.14% at
-Os, -Oz, and -O3 and causes ~0.2% growth in the TEXT segment. Note that
TEXT size is page-aligned, whereas section sizes are byte-aligned.

Experiment data from building LNT+externals for ARM64 (N = 558 programs,
all sizes in bytes):

| Configuration | __text geom size | __cold geom size | TEXT geom size |
| **-Os**       | 1763.96          | 0, n=0           | 42934.9        |
| -Os, thresh=2 | ** 1760.9 **     | 76.6755, n=61    | 42934.9        |

Picking thresh=2 reduces the geomean __text section size by 0.17% at
-Os and causes no growth in the TEXT segment.

Measurements were done with D57082 (r352080) applied.

Differential Revision: https://reviews.llvm.org/D57125

llvm-svn: 352228

db3f9774

[MC] Teach the MachO object writer about N_FUNC_COLD · 13ef84fc

Vedant Kumar authored Jan 25, 2019

N_FUNC_COLD is a new MachO symbol attribute. It's a hint to the linker
to order a symbol towards the end of its section, to improve locality.

Example:

```
void a1() {}
__attribute__((cold)) void a2() {}
void a3() {}
int main() {
  a1();
  a2();
  a3();
  return 0;
}
```

A linker that supports N_FUNC_COLD will order _a2 to the end of the text
section. From `nm -njU` output, we see:

```
_a1
_a3
_main
_a2
```

Differential Revision: https://reviews.llvm.org/D57190

llvm-svn: 352227

13ef84fc

[opt-viewer] Add javascript to expand/hide full message for multiline remarks. · fd7ee479

Florian Hahn authored Jan 25, 2019

This patch adds support for displaying remarks with multiple
lines. For such remarks, it creates a hidden div
containing the message's lines except the first one in a <pre>
tag. It also prepends a link (with '+' as text) to the regular remark
line. This link can be used to show/hide the div containing the
full remark.

In combination with D57159, this allows for better displaying of
multiline remarks in the html pages generated by opt-viewer.

The Javascript is very simple and should be supported by any recent
major browser.

Reviewers: hfinkel, anemet, thegameg, serge-sans-paille

Reviewed By: anemet

Differential Revision: https://reviews.llvm.org/D57167

llvm-svn: 352223

fd7ee479

[x86] simplify logic in lowerShuffleWithUndefHalf(); NFCI · 0020f8bb

Sanjay Patel authored Jan 25, 2019

  
This seems unnecessarily complicated because we gave names to
opposite polarity bools and have code comments that don't really
line up with the logic. 

Step 1: remove UndefUpper and assert that it is the opposite of 
UndefLower after the initial early exit.

llvm-svn: 352217

0020f8bb

[DiagnosticInfo] Add support for preserving newlines in remark arguments. · ca95ee5e

Florian Hahn authored Jan 25, 2019

This patch adds a new type StringBlockVal which can be used to emit a
YAML block scalar, which preserves newlines in a multiline string. It
also updates  MappingTraits<DiagnosticInfoOptimizationBase::Argument> to
use it for argument values with more than a single newline.

This is helpful for remarks that want to display more in-depth
information in a more structured way.

Reviewers: thegameg, anemet

Reviewed By: anemet

Subscribers: hfinkel, hiraditya, llvm-commits

Differential Revision: https://reviews.llvm.org/D57159

llvm-svn: 352216

ca95ee5e

[TEST][COMMIT] - fix comment typo in AsmPrinter/DwarfDebug.cpp - NFC · 4db70d96
Tom Weaver authored Jan 25, 2019
```
llvm-svn: 352214
```
4db70d96
[TblGen][NFC] Fix documentation formatting · 2ee81933
Javed Absar authored Jan 25, 2019
```
llvm-svn: 352212
```
2ee81933

[RISCV][NFC] s/f32/f64 in double-arith.ll · c67515d5

Alex Bradbury authored Jan 25, 2019

The intrinsic names erroneously used the .f32 variant. As the return and
argument types were still double the intrinsics calls worked properly.

llvm-svn: 352211

c67515d5

[X86] Simplify X86ISD::ADD/SUB if we don't use the result flag · f56298f4

Simon Pilgrim authored Jan 25, 2019

Simplify to the generic ISD::ADD/SUB if we don't make use of the result flag.

This mainly helps with ADDCARRY/SUBBORROW intrinsics which get expanded to X86ISD::ADD/SUB but could be simplified further.

Noticed in some of the test cases in PR31754

Differential Revision: https://reviews.llvm.org/D57234

llvm-svn: 352210

f56298f4

[x86] narrow a shuffle that doesn't use or set any high elements · 21aa6ddc

Sanjay Patel authored Jan 25, 2019

This isn't the final fix for our reduction/horizontal codegen, but it takes care 
of a lot of the problems. After we narrow the shuffle, existing combines for 
insert/extract and binops kick in, and we end up with cheaper 128-bit ops.

The avg and mul reduction tests show an existing shuffle lowering hole for 
AVX2/AVX512. I think in its most minimal form this is:
https://bugs.llvm.org/show_bug.cgi?id=40434
...but we might need multiple fixes to get it right.

Differential Revision: https://reviews.llvm.org/D57156

llvm-svn: 352209

21aa6ddc

Revert r351954 "Add a value_type to ArrayRef." · b1201270
Clement Courbet authored Jan 25, 2019
```
This breaks arm self-hosted buildbots.

llvm-svn: 352206
```
b1201270

[JSON] Work around excess-precision issue when comparing T_Integer numbers. · 1e7491ea

Sam McCall authored Jan 25, 2019

Reviewers: bkramer

Subscribers: kristina, llvm-commits

Differential Revision: https://reviews.llvm.org/D57237

llvm-svn: 352204

1e7491ea

gn build: Merge r352149 · e4ed82d6
Nico Weber authored Jan 25, 2019
```
llvm-svn: 352202
```
e4ed82d6
gn build: Revert r352200, commit message was wrong · 0c828ccc
Nico Weber authored Jan 25, 2019
```
llvm-svn: 352201
```
0c828ccc
gn build: Merge r352148 · 74bb231b
Nico Weber authored Jan 25, 2019
```
llvm-svn: 352200
```
74bb231b

[RISCV] Add tests to demonstrate bitcasted fneg/fabs dagcombines · 38c4ec31

Alex Bradbury authored Jan 25, 2019

This target-independent code won't trigger for cases such as RV32FD where
custom SelectionDAG nodes are generated. These new tests demonstrate such
cases. Additionally, float-arith.ll was updated so that fneg.s, fsgnjn.s, and
fabs.s selection patterns are actually exercised.

llvm-svn: 352199

38c4ec31

Fix line endings and trim trailing whitespace. NFCI. · d6e1e356
Simon Pilgrim authored Jan 25, 2019
```
llvm-svn: 352198
```
d6e1e356

gitignore: ignore clangd index files. · 7852b710

Haojian Wu authored Jan 25, 2019

Reviewers: kadircet

Subscribers: ilya-biryukov, ioeric, MaskRay, jkorous, arphaman, llvm-commits

Differential Revision: https://reviews.llvm.org/D57227

llvm-svn: 352197

7852b710

[X86] Add addcarry/subborrow combine tests · d41ccddd
Simon Pilgrim authored Jan 25, 2019
```
Show failure to simplify cases with zero op/flags

llvm-svn: 352196
```
d41ccddd

[llvm-symbolizer] Add switch to adjust addresses by fixed offset · 759d5e67

James Henderson authored Jan 25, 2019

If a stack trace or similar has a list of addresses from an executable
or DSO loaded at a variable address (e.g. due to ASLR), the addresses
will not directly correspond to the addresses stored in the object file.
If a user wishes to use llvm-symbolizer, they have to subtract the load
address from every address. This is somewhat inconvenient, especially as
the output of --print-address will result in the adjusted address being
listed, rather than the address coming from the stack trace, making it
harder to map results between the two.

This change adds a new switch to llvm-symbolizer --adjust-vma which
takes an offset, which is then used to automatically do this
calculation. The printed address remains the input address (allowing for
easy mapping), whilst the specified offset is applied to the addresses
when performing the lookup.

The switch is conceptually similar to llvm-objdump's new switch of the
same name (see D57051), which in turn mirrors a GNU switch. There is no
equivalent switch in addr2line.

Reviewed by: grimar

Differential Revision: https://reviews.llvm.org/D57151

llvm-svn: 352195

759d5e67

[NFC] One more crashing test on LoopSimplifyCFG · 7822d25d
Max Kazantsev authored Jan 25, 2019
```
llvm-svn: 352194
```
7822d25d
Fix gcc -Wparentheses warning. NFCI. · dea6174b
Simon Pilgrim authored Jan 25, 2019
```
llvm-svn: 352193
```
dea6174b
Fix gcc -Wparentheses warning. NFCI. · cdf58092
Simon Pilgrim authored Jan 25, 2019
```
llvm-svn: 352191
```
cdf58092
[NFC] Add failing test on LCSSA forming · e5116e9b
Max Kazantsev authored Jan 25, 2019
```
llvm-svn: 352190
```
e5116e9b

[ARM GlobalISel] Support shifts for Thumb2 · 8976ad12

Diana Picus authored Jan 25, 2019

Same as ARM.

On this occasion we split some of the instruction select tests for more
complicated instructions into their own files, so we can reuse them for
ARM and Thumb mode. Likewise for the legalizer tests.

llvm-svn: 352188

8976ad12

[ARM GlobalISel] Remove rebase artifact from r351882. NFC · 23628c7b

Diana Picus authored Jan 25, 2019

r351882 introduced some superfluous calls to mark G_INTTOPTR and
G_PTRTOINT as legal (looks like a rebase mishap). Remove them.

llvm-svn: 352187

23628c7b

[TblGen] Extend !if semantics through new feature !cond · a3e3d852

Javed Absar authored Jan 25, 2019

This patch extends TableGen language with !cond operator.
Instead of embedding !if inside !if which can get cumbersome,
one can now use !cond.
Below is an example to convert an integer 'x' into a string:

    !cond(!lt(x,0) : "Negative",
          !eq(x,0) : "Zero",
          !eq(x,1) : "One,
          1        : "MoreThanOne")

Reviewed By: hfinkel, simon_tatham, greened
Differential Revision: https://reviews.llvm.org/D55758

llvm-svn: 352185

a3e3d852

[llvm-objcopy] Add support for -g as an alias for --strip-debug · 914e838e

Douglas Yung authored Jan 25, 2019

This change adds an option -g to llvm-objcopy which is an alias for the existing option --strip-debug.

This fixes PR40003.

Reviewed by: alexshap

Differential Revision: https://reviews.llvm.org/D57217

llvm-svn: 352182

914e838e

[llvm-mca][X86] Add missing shuffle tests · d36f7730

Simon Pilgrim authored Jan 25, 2019

Match the coverage of test\CodeGen\X86\avx512-shuffle-schedule.ll so we can get rid of -print-schedule (and fix PR37160) without losing schedule tests

llvm-svn: 352179

d36f7730