Commits · be9c00347fa7dcfb78f9f3a6c6c61a00a9071d31 · Lorenzo Albano / LLVM bpEVL

Sep 12, 2017

bpf: add " ll" in the LD_IMM64 asmstring · be9c0034

Yonghong Song authored Sep 11, 2017



This partially revert previous fix in commit f5858045aa0b
("bpf: proper print imm64 expression in inst printer").

In that commit, the original suffix "ll" is removed from
LD_IMM64 asmstring. In the customer print method, the "ll"
suffix is printed if the rhs is an immediate. For example,
"r2 = 5ll" => "r2 = 5ll", and "r3 = varll" => "r3 = var".

This has an issue though for assembler. Since assembler
relies on asmstring to do pattern matching, it will not
be able to distiguish between "mov r2, 5" and
"ld_imm64 r2, 5" since both asmstring is "r2 = 5".
In such cases, the assembler uses 64bit load for all
"r = <val>" asm insts.

This patch adds back " ll" suffix for ld_imm64 with one
additional space for "#reg = #global_var" case.

Signed-off-by: Yonghong Song <yhs@fb.com>
Acked-by: Alexei Starovoitov <ast@kernel.org>
llvm-svn: 312978

be9c0034

Update testcases that are XFAILed on Darwin for llvm-dwarfdump changes. · 424aac36
Adrian Prantl authored Sep 11, 2017
```
llvm-svn: 312977
```
424aac36

[llvm-cov] Try to fix a test on Windows · 8eae1f9d

Vedant Kumar authored Sep 11, 2017

Failing bot:
http://lab.llvm.org:8011/builders/llvm-clang-x86_64-expensive-checks-win/builds/4791

This looks like another stderr redirection issue.

llvm-svn: 312975

8eae1f9d

llvm-dwarfdump: Make -brief the default and add a -verbose option instead. · 16aa4cf7
Adrian Prantl authored Sep 11, 2017
```
Differential Revision: https://reviews.llvm.org/D37717

llvm-svn: 312972
```
16aa4cf7
[CodeGen] Fix some Clang-tidy modernize-use-using and Include What You Use... · 32a40564
Eugene Zelenko authored Sep 11, 2017
```
[CodeGen] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC).

llvm-svn: 312971
```
32a40564

llvm-dwarfdump: Replace -debug-dump=sect option with individual options. · 7bc1b282

Adrian Prantl authored Sep 11, 2017

As discussed on llvm-dev in
http://lists.llvm.org/pipermail/llvm-dev/2017-September/117301.html
this changes the command line interface of llvm-dwarfdump to match the
one used by the dwarfdump utility shipping on macOS. In addition to
being shorter to type this format also has the advantage of allowing
more than one section to be specified at the same time.

In a nutshell, with this change

  $ llvm-dwarfdump --debug-dump=info
  $ llvm-dwarfdump --debug-dump=apple-objc

becomes

  $ dwarfdump --debug-info --apple-objc

Differential Revision: https://reviews.llvm.org/D37714

llvm-svn: 312970

7bc1b282

[llvm-cov] Allow hiding instantiation/region coverage from summary tables · 50479f60

Eli Friedman authored Sep 11, 2017

Region coverage is difficult to explain without going deep into how
coverage is implemented. Instantiation coverage is easier to explain,
but probably not useful in most cases (templates don't exist in C, and
most C++ code contains relatively few templates).

This patch adds the options "-show-region-summary" and
"-show-instantiation-summary" to allow hiding those columns.
"-show-instantiation-summary" is turned off by default.

llvm-svn: 312969

50479f60

LowerTypeTests: Add import/export support for targets without absolute symbol constants. · b9b60253
Peter Collingbourne authored Sep 11, 2017
```
The rationale is the same as for r312967.

Differential Revision: https://reviews.llvm.org/D37408

llvm-svn: 312968
```
b9b60253

WholeProgramDevirt: Add import/export support for targets without absolute symbol constants. · b15a35e6

Peter Collingbourne authored Sep 11, 2017

Not all targets support the use of absolute symbols to export
constants. In particular, ARM has a wide variety of constant encodings
that cannot currently be relocated by linkers. So instead of exporting
the constants using symbols, export them directly in the summary.
The values of the constants are left as zeroes on targets that support
symbolic exports.

This may result in more cache misses when targeting those architectures
as a result of arbitrary changes in constant values, but this seems
somewhat unavoidable for now.

Differential Revision: https://reviews.llvm.org/D37407

llvm-svn: 312967

b15a35e6

Sep 11, 2017

[llvm-cov] Don't attach exec counts to lines which start a skipped region · 71bc1afa

Vedant Kumar authored Sep 11, 2017

These lines by definition don't have an execution count.

This is the final part of the fix for:
https://bugs.llvm.org/show_bug.cgi?id=34166

llvm-svn: 312955

71bc1afa

[InstSimplify] fix some test names; NFC · bb1b1c97
Sanjay Patel authored Sep 11, 2017
```
Too much division...the quotient is the answer.

llvm-svn: 312943
```
bb1b1c97

[InstSimplify] add tests for possible sdiv/srem simplifications; NFC · a36eb771

Sanjay Patel authored Sep 11, 2017

As noted in PR34517, the handling of signed div/rem is not on par with
unsigned div/rem. Signed is harder to reason about, but it should be
possible to handle at least some of these using the same technique that
we use for unsigned: use icmp logic to see if there's a relationship
between the quotient and divisor.

llvm-svn: 312938

a36eb771

AMDGPU: Allow coldcc calls · 537bd3b9
Matt Arsenault authored Sep 11, 2017
```
llvm-svn: 312936
```
537bd3b9

[mips][microMIPS] add lapc instruction · d4f3723c

Petar Jovanovic authored Sep 11, 2017

Implement LAPC instruction for mips32r6, mips64r6 and micromips32r6.

Patch by Milos Stojanovic.

Differential Revision: https://reviews.llvm.org/D35984

llvm-svn: 312934

d4f3723c

Unmerge GEPs to reduce register pressure on IndirectBr edges. · 9364432c

Hiroshi Yamauchi authored Sep 11, 2017

Summary:
GEP merging can sometimes increase the number of live values and register
pressure across control edges and cause performance problems particularly if the
increased register pressure results in spills.

This change implements GEP unmerging around an IndirectBr in certain cases to
mitigate the issue. This is in the CodeGenPrepare pass (after all the GEP
merging has happened.)

With this patch, the Python interpreter loop runs faster by ~5%.

Reviewers: sanjoy, hfinkel

Reviewed By: hfinkel

Subscribers: eastig, junbuml, llvm-commits

Differential Revision: https://reviews.llvm.org/D36772

llvm-svn: 312930

9364432c

[AMDGPU] Produce madak and madmk from the two-address pass · 710da42b

Stanislav Mekhanoshin authored Sep 11, 2017

These two instructions are normally selected, but when the
two address pass converts mac into mad we end up with the
mad where we could have one of these.

Differential Revision: https://reviews.llvm.org/D37389

llvm-svn: 312928

710da42b

[X86] Remove portions of r275950 that are no longer needed with i1 not being a legal type · 7b02020c

Craig Topper authored Sep 11, 2017

Summary:
r275950 added support for turning (trunc (X >> N) to i1) into BT(X, N). But that's no longer necessary now that i1 isn't legal.

This patch removes the support for that, but preserves some of the refactorings done in that commit.

Reviewers: guyblank, RKSimon, spatel, zvi

Reviewed By: RKSimon

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D37673

llvm-svn: 312925

7b02020c

[SelectionDAG] Remove a check for type being a vector type after calling getShiftAmountTy. NFCI · 8dff57a0
Craig Topper authored Sep 11, 2017
```
getShiftAmountTy already returns the vector type when called for vectors.

llvm-svn: 312924
```
8dff57a0
X86 Tests: More AVX512 conversions tests. NFC · 255488a1
Zvi Rackover authored Sep 11, 2017
```
Adding more tests for AVX512 fp<->int conversions that were missing.

llvm-svn: 312921
```
255488a1

[ScalarEvolution] Refactor forgetLoop() to improve performance · ce90060d

Marcello Maggioni authored Sep 11, 2017

forgetLoop() has pretty bad performance because it goes over
the same instructions over and over again in particular when
nested loop are involved.
The refactoring changes the function to a not-recursive function
and reusing the allocation for data-structures and the Visited
set.

NFCI

Differential Revision: https://reviews.llvm.org/D37659

llvm-svn: 312920

ce90060d

Fix typo · eb4474cf
Matt Arsenault authored Sep 11, 2017
```
llvm-svn: 312919
```
eb4474cf

[X86][SSE] Add support for X86ISD::PACKSS to ComputeNumSignBitsForTargetNode · b092bd32

Simon Pilgrim authored Sep 11, 2017

Helps improve combineLogicBlendIntoPBLENDV support by allowing us to peek into through PACKSS truncations of vector comparison results.

Differential Revision: https://reviews.llvm.org/D37680

llvm-svn: 312916

b092bd32

[AMDGPU] exp should not be in WQM mode · 660ba2b8

Tim Renouf authored Sep 11, 2017

A mrt exp with vm=1 must be in exact (non-WQM) mode, as it also exports
the exec mask as the valid mask to determine which pixels to render.

This commit marks any exp as needing to be in exact mode.

Actually, if there are multiple mrt exps, only one needs to have vm=1,
and only that one needs to be in exact mode. But that is an optimization
for another day.

Differential Revision: https://reviews.llvm.org/D36305

llvm-svn: 312915

660ba2b8

[TableGen] Ensure that __lsan_is_turned_off isn't removed by DCE in llvm-tblgen · 8a1c2b41

Francis Ricci authored Sep 11, 2017

Summary:
Since asan is linked dynamically on Darwin, the weak interface symbol
is removed by -Wl,-dead_strip.

Reviewers: kcc, compnerd, aaron.ballman

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D37636

llvm-svn: 312914

8a1c2b41

[InstSimplify] reorder methods; NFC · fa877fd4

Sanjay Patel authored Sep 11, 2017

I'm trying to refactor some shared code for integer div/rem,
but I keep having to scroll through fdiv. The FP ops have
nothing in common with the integer ops, so I'm moving FP
below everything else. 

While here, improve a couple of comments and fix some formatting.

llvm-svn: 312913

fa877fd4

[X86][SSE] Add further test cases showing failure to compute sign bits through PACKSS · d0ff65b5

Simon Pilgrim authored Sep 11, 2017

Suggested in D37680

Note: had to drop AVX512VL tests as there is an infinite loop in the new tests that needs further investigation (not relevant to D37680).
llvm-svn: 312910

d0ff65b5

[X86][SKX][KNL] Updating several CodeGen tests to use the attr flag instead of mcpu flag · 3ddffced

Gadi Haber authored Sep 11, 2017

NFC.
 Updated 3 Codegen regression tests to use the -mattr flag instead of the -mcpu flags as follows:
 Instead of -mcpu=skx use -mattr=+avx512f,+avx512bw,+avx512vl,+avx512dq
 Instead of -mcpu=knl use -mattr=+avx512f

Reviewers: delena
Revision: https://reviews.llvm.org/D37674
llvm-svn: 312909

3ddffced

[ARM] Enable the use of SVC anywhere in an IT block · c429aabb
Andre Vieira authored Sep 11, 2017
```
Differential Revision: https://reviews.llvm.org/D37374

llvm-svn: 312908
```
c429aabb
[Interleved][Stride 3]Adding test for case the VF=64 target with AVX512. · 9707ba09
Michael Zuckerman authored Sep 11, 2017
```
llvm-svn: 312907
```
9707ba09
[X86][SSE] Add test showing failure to compute sign bits through PACKSS · f6fa1d03
Simon Pilgrim authored Sep 11, 2017
```
Prevents combineLogicBlendIntoPBLENDV from merging to PBLENDV

llvm-svn: 312906
```
f6fa1d03

[AVR] Enable the '__do_copy_data' function · 0fc5fe0a

Dylan McKay authored Sep 11, 2017

Also enables '__do_clear_bss'.

These functions are automaticalled called by the CRT if they are
declared.

We need these to be called otherwise RAM will start completely
uninitialised, even though we need to copy RAM variables from progmem to
RAM.

llvm-svn: 312905

0fc5fe0a

[GlobalISel][X86] G_ANYEXT support. · 1f14364d

Igor Breger authored Sep 11, 2017

Summary: G_ANYEXT support

Reviewers: zvi, delena

Reviewed By: delena

Subscribers: rovka, kristof.beyls, llvm-commits

Differential Revision: https://reviews.llvm.org/D37675

llvm-svn: 312903

1f14364d

Fixed a typo in llvm-cov/deferred-region.cpp test. · d386c299
Ilya Biryukov authored Sep 11, 2017
```
Input redirection was using `2&>1` instead of `2>&1`.

llvm-svn: 312902
```
d386c299
AMDGPU: trivial comment change · 6cb007fc
Tim Renouf authored Sep 11, 2017
```
... to check commit access for new committer.

llvm-svn: 312900
```
6cb007fc

[ARM] Use ADDCARRY / SUBCARRY · 12b20f23

Roger Ferrer Ibanez authored Sep 11, 2017

This is a preparatory step for D34515 and also is being recommitted as its
first version caused PR34045.

This change:
 - makes nodes ISD::ADDCARRY and ISD::SUBCARRY legal for i32
 - lowering is done by first converting the boolean value into the carry flag
   using (_, C) ← (ARMISD::ADDC R, -1) and converted back to an integer value
   using (R, _) ← (ARMISD::ADDE 0, 0, C). An ARMISD::ADDE between the two
   operations does the actual addition.
 - for subtraction, given that ISD::SUBCARRY second result is actually a
   borrow, we need to invert the value of the second operand and result before
   and after using ARMISD::SUBE. We need to invert the carry result of
   ARMISD::SUBE to preserve the semantics.
 - given that the generic combiner may lower ISD::ADDCARRY and
   ISD::SUBCARRYinto ISD::UADDO and ISD::USUBO we need to update their lowering
   as well otherwise i64 operations now would require branches. This implies
   updating the corresponding test for unsigned.
 - add new combiner to remove the redundant conversions from/to carry flags
   to/from boolean values (ARMISD::ADDC (ARMISD::ADDE 0, 0, C), -1) → C
 - fixes PR34045

Differential Revision: https://reviews.llvm.org/D35192

llvm-svn: 312898

12b20f23

Fixed a bug in splitting Scatter operation in the Type Legalizer. · cc477bbc

Elena Demikhovsky authored Sep 11, 2017

After the split of the Scatter operation, the order of the new instructions is well defined - Lo goes before Hi. Otherwise the semantic of Scatter (from LSB to MSB) is broken.
I'm chaining 2 nodes to prevent reordering.

Differential Revision https://reviews.llvm.org/D37670

llvm-svn: 312894

cc477bbc

[ORC] Kill off a dead typedef. · 70a6929f
Lang Hames authored Sep 11, 2017
```
llvm-svn: 312893
```
70a6929f

Sep 10, 2017
- Use llvm_unreachable for unknown TargetCostKind. · b1db6b7d
  Simon Pilgrim authored Sep 10, 2017
```
TargetTransformInfo::getInstructionCost's switch covers all TargetCostKind cases so we shouldn't return for a default case.

llvm-svn: 312888
```
  b1db6b7d
- [X86][SSE] Tidyup + clang-format combineX86ShuffleChain call. NFCI. · 5e2ed8be
  Simon Pilgrim authored Sep 10, 2017
```
llvm-svn: 312887
```
  5e2ed8be
- [X86][SSE] Move combineTo call out of combineX86ShufflesConstants. NFCI. · ff347d3e
  Simon Pilgrim authored Sep 10, 2017
```
Move towards making it possible to use the shuffle combines for cases where we don't want to call DCI.CombineTo() with the result.

llvm-svn: 312886
```
  ff347d3e