Commits · d41ccddda94df6aafe11c4f40f167244a756a665 · Roger Ferrer / llvm-epi

Jan 25, 2019

[X86] Add addcarry/subborrow combine tests · d41ccddd
Simon Pilgrim authored Jan 25, 2019
```
Show failure to simplify cases with zero op/flags

llvm-svn: 352196
```
d41ccddd

[llvm-symbolizer] Add switch to adjust addresses by fixed offset · 759d5e67

James Henderson authored Jan 25, 2019

If a stack trace or similar has a list of addresses from an executable
or DSO loaded at a variable address (e.g. due to ASLR), the addresses
will not directly correspond to the addresses stored in the object file.
If a user wishes to use llvm-symbolizer, they have to subtract the load
address from every address. This is somewhat inconvenient, especially as
the output of --print-address will result in the adjusted address being
listed, rather than the address coming from the stack trace, making it
harder to map results between the two.

This change adds a new switch to llvm-symbolizer --adjust-vma which
takes an offset, which is then used to automatically do this
calculation. The printed address remains the input address (allowing for
easy mapping), whilst the specified offset is applied to the addresses
when performing the lookup.

The switch is conceptually similar to llvm-objdump's new switch of the
same name (see D57051), which in turn mirrors a GNU switch. There is no
equivalent switch in addr2line.

Reviewed by: grimar

Differential Revision: https://reviews.llvm.org/D57151

llvm-svn: 352195

759d5e67

[NFC] One more crashing test on LoopSimplifyCFG · 7822d25d
Max Kazantsev authored Jan 25, 2019
```
llvm-svn: 352194
```
7822d25d
Fix gcc -Wparentheses warning. NFCI. · dea6174b
Simon Pilgrim authored Jan 25, 2019
```
llvm-svn: 352193
```
dea6174b
Fix gcc -Wparentheses warning. NFCI. · cdf58092
Simon Pilgrim authored Jan 25, 2019
```
llvm-svn: 352191
```
cdf58092
[NFC] Add failing test on LCSSA forming · e5116e9b
Max Kazantsev authored Jan 25, 2019
```
llvm-svn: 352190
```
e5116e9b

[ARM GlobalISel] Support shifts for Thumb2 · 8976ad12

Diana Picus authored Jan 25, 2019

Same as ARM.

On this occasion we split some of the instruction select tests for more
complicated instructions into their own files, so we can reuse them for
ARM and Thumb mode. Likewise for the legalizer tests.

llvm-svn: 352188

8976ad12

[ARM GlobalISel] Remove rebase artifact from r351882. NFC · 23628c7b

Diana Picus authored Jan 25, 2019

r351882 introduced some superfluous calls to mark G_INTTOPTR and
G_PTRTOINT as legal (looks like a rebase mishap). Remove them.

llvm-svn: 352187

23628c7b

[TblGen] Extend !if semantics through new feature !cond · a3e3d852

Javed Absar authored Jan 25, 2019

This patch extends TableGen language with !cond operator.
Instead of embedding !if inside !if which can get cumbersome,
one can now use !cond.
Below is an example to convert an integer 'x' into a string:

    !cond(!lt(x,0) : "Negative",
          !eq(x,0) : "Zero",
          !eq(x,1) : "One,
          1        : "MoreThanOne")

Reviewed By: hfinkel, simon_tatham, greened
Differential Revision: https://reviews.llvm.org/D55758

llvm-svn: 352185

a3e3d852

[llvm-objcopy] Add support for -g as an alias for --strip-debug · 914e838e

Douglas Yung authored Jan 25, 2019

This change adds an option -g to llvm-objcopy which is an alias for the existing option --strip-debug.

This fixes PR40003.

Reviewed by: alexshap

Differential Revision: https://reviews.llvm.org/D57217

llvm-svn: 352182

914e838e

[llvm-mca][X86] Add missing shuffle tests · d36f7730

Simon Pilgrim authored Jan 25, 2019

Match the coverage of test\CodeGen\X86\avx512-shuffle-schedule.ll so we can get rid of -print-schedule (and fix PR37160) without losing schedule tests

llvm-svn: 352179

d36f7730

[MSP430] Fix absolute addressing mode printing in AsmPrinter · 509d5c4a

Anton Korobeynikov authored Jan 25, 2019

Align checks for absolute addressing mode with its current
implementation (SR is used as a base register).

This fixes https://bugs.llvm.org/show_bug.cgi?id=39993

Patch by Kristina Bessonova!

Differential Revision: https://reviews.llvm.org/D56785

llvm-svn: 352178

509d5c4a

[NFC] Add test with multiple loops · 6f2a0c68
Max Kazantsev authored Jan 25, 2019
```
llvm-svn: 352176
```
6f2a0c68

[PowerPC] Enhance the fast selection of cmp instruction and clean up related asserts · 308a609c

Zi Xuan Wu authored Jan 25, 2019

Fast selection of llvm icmp and fcmp instructions is not handled well about VSX instruction support.

We'd use VSX float comparison instruction instead of non-vsx float comparison instruction
if the operand register class is VSSRC or VSFRC because i32 and i64 are mapped to VSSRC and
VSFRC correspondingly if VSX feature is opened.

If the target does not have corresponding VSX instruction comparison for some type,
just copy VSX-related register to common float register class and use non-vsx comparison instruction.

Differential Revision: https://reviews.llvm.org/D57078

llvm-svn: 352174

308a609c

[X86] Add non-masked versions of vpconflict intrinsics so we can use a select... · 6fd9af58

Craig Topper authored Jan 25, 2019

[X86] Add non-masked versions of vpconflict intrinsics so we can use a select in the header file in clang.

I'll remove and autoupgrade the old intrinsics in a future commit.

llvm-svn: 352172

6fd9af58

[RISCV] Custom-legalise i32 SDIV/UDIV/UREM on RV64M · 456d3798

Alex Bradbury authored Jan 25, 2019

Follow the same custom legalisation strategy as used in D57085 for
variable-length shifts (see that patch summary for more discussion). Although
we may lose out on some late-stage DAG combines, I think this custom
legalisation strategy is ultimately easier to reason about.

There are some codegen changes in rv64m-exhaustive-w-insts.ll but they are all
neutral in terms of the number of instructions.

Differential Revision: https://reviews.llvm.org/D57096

llvm-svn: 352171

456d3798

[LoopSimplifyCFG] Fix inconsistency in blocks in loop markup · 38cd9acb

Max Kazantsev authored Jan 25, 2019

2nd part of D57095 with the same reason, just in another place. We never
fold branches that are not immediately in the current loop, but this check
is missing in `IsEdgeLive` As result, it may think that the edge in subloop is
dead while it's live. It's a pessimization in the current stance.

Differential Revision: https://reviews.llvm.org/D57147
Reviewed By: rupprecht	

llvm-svn: 352170

38cd9acb

[RISCV] Custom-legalise 32-bit variable shifts on RV64 · 299d690a

Alex Bradbury authored Jan 25, 2019

The previous DAG combiner-based approach had an issue with infinite loops
between the target-dependent and target-independent combiner logic (see
PR40333). Although this was worked around in rL351806, the combiner-based
approach is still potentially brittle and can fail to select the 32-bit shift
variant when profitable to do so, as demonstrated in the pr40333.ll test case.

This patch instead introduces target-specific SelectionDAG nodes for
SHLW/SRLW/SRAW and custom-lowers variable i32 shifts to them. pr40333.ll is a
good example of how this approach can improve codegen.

This adds DAG combine that does SimplifyDemandedBits on the operands (only
lower 32-bits of first operand and lower 5 bits of second operand are read).
This seems better than implementing SimplifyDemandedBitsForTargetNode as there
is no guarantee that would be called (and it's not for e.g. the anyext return
test cases). Also implements ComputeNumSignBitsForTargetNode.

There are codegen changes in atomic-rmw.ll and atomic-cmpxchg.ll but the new
instruction sequences are semantically equivalent.

Differential Revision: https://reviews.llvm.org/D57085

llvm-svn: 352169

299d690a

AMDGPU/GlobalISel: Remove leftover setAction · 3b9a82ff
Matt Arsenault authored Jan 25, 2019
```
Also move G_GEP actions together.

llvm-svn: 352168
```
3b9a82ff
AMDGPU/GlobalISel: Scalarize add/sub · 3e08b772
Matt Arsenault authored Jan 25, 2019
```
llvm-svn: 352167
```
3e08b772
GlobalISel: fewerElementsVector for more cast types · e6cebd0d
Matt Arsenault authored Jan 25, 2019
```
llvm-svn: 352166
```
e6cebd0d
GlobalISel: fewerElementsVector for a few more trivial ops · 95fd95cf
Matt Arsenault authored Jan 25, 2019
```
llvm-svn: 352165
```
95fd95cf
AMDGPU/GlobalISel: Legalize smulh/umulh and scalarize mul · 5d622fbc
Matt Arsenault authored Jan 25, 2019
```
llvm-svn: 352162
```
5d622fbc
[HotColdSplit] Describe the pass in more detail, NFC · 9d70f2b9
Vedant Kumar authored Jan 25, 2019
```
llvm-svn: 352161
```
9d70f2b9

[HotColdSplit] Split more aggressively before/after cold invokes · 65de025d

Vedant Kumar authored Jan 25, 2019

While a cold invoke itself and its unwind destination can't be
extracted, code which unconditionally executes before/after the invoke
may still be profitable to extract.

With cost model changes from D57125 applied, this gives a 3.5% increase
in split text across LNT+externals on arm64 at -Os.

llvm-svn: 352160

65de025d

GlobalISel: Support fewerElementsVector for icmp/fcmp · 1b1e685f
Matt Arsenault authored Jan 25, 2019
```
Also legalize 64-bit compares for AMDGPU

llvm-svn: 352157
```
1b1e685f
GlobalISel: Implement fewerElementsVector for extensions · ca676343
Matt Arsenault authored Jan 25, 2019
```
llvm-svn: 352155
```
ca676343

hwasan: If we split the entry block, move static allocas back into the entry block. · 1a8acfb7

Peter Collingbourne authored Jan 25, 2019

Otherwise they are treated as dynamic allocas, which ends up increasing
code size significantly. This reduces size of Chromium base_unittests
by 2MB (6.7%).

Differential Revision: https://reviews.llvm.org/D57205

llvm-svn: 352152

1a8acfb7

gn build: Set is_clang to true in stage2 toolchains. · 0b247d18
Peter Collingbourne authored Jan 25, 2019
```
Differential Revision: https://reviews.llvm.org/D57202

llvm-svn: 352146
```
0b247d18
GlobalISel: Add convenience mutatations to scalarize · 990f5077
Matt Arsenault authored Jan 25, 2019
```
llvm-svn: 352143
```
990f5077

simplify COFF module assembly test and move it to Object · 6710cc7d

Bob Haarman authored Jan 25, 2019

Reviewers: pcc, rnk

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D57192

llvm-svn: 352142

6710cc7d

gn build: Build clang with -fno-strict-aliasing, make building with gcc much quieter · 0e7ba668

Nico Weber authored Jan 25, 2019

- gcc doesn't understand -Wstring-conversion, so pass that only to clang
- disable a few gcc warnings that are noisy and also disabled in the cmake build
- -Wstrict-aliasing pointed out that the cmake build builds clang with
  -fno-strict-aliasing, so do that too

Differential Revision: https://reviews.llvm.org/D57191

llvm-svn: 352141

0e7ba668

Try to address Windows bot failure after r352080 · a48cd9ae

Vedant Kumar authored Jan 25, 2019

See the bot error message reported in https://reviews.llvm.org/D57082.

Avoid trying to match full class names in -debug-pass-manager output,
because they aren't portable.

llvm-svn: 352138

a48cd9ae

GlobalISel: Add helper to LLT to get a scalar or vector · 7ba2d82c
Matt Arsenault authored Jan 25, 2019
```
llvm-svn: 352136
```
7ba2d82c
[GlobalISel][AArch64] Avoid unused variable warning for variable only used in assert · 653020d3
Benjamin Kramer authored Jan 24, 2019
```
llvm-svn: 352133
```
653020d3

[PowerPC] Exploit store instructions that store a single vector element · b9b75de0

Nemanja Ivanovic authored Jan 24, 2019

This patch exploits the instructions that store a single element from a vector
to preform a (store (extract_elt)). We already have code that does this with
ISA 3.0 instructions that were added to handle i8/i16 types. However, we had
never exploited the existing ones that handle f32/f64/i32/i64 types.

Differential revision: https://reviews.llvm.org/D56175

llvm-svn: 352131

b9b75de0

RegBankSelect: Fix use after free in r352123 · 6bab7ab1
Matt Arsenault authored Jan 24, 2019
```
llvm-svn: 352130
```
6bab7ab1
[GlobalISel][AArch64] Avoid unused function warnings in Release builds · 1411ecf0
Benjamin Kramer authored Jan 24, 2019
```
llvm-svn: 352129
```
1411ecf0
pdbutil: Remove unused variables · dcc96310
David Blaikie authored Jan 24, 2019
```
llvm-svn: 352128
```
dcc96310

[x86] move half-size shuffle mask creation to helper; NFC · 4c304b29

Sanjay Patel authored Jan 24, 2019

As noted in D57156, we want to check at least part of
this pattern earlier (in combining), so this will allow
the code to be shared instead of duplicated.

llvm-svn: 352127

4c304b29