Commits · 69f6dfa0f8c54eacd02eebb2f2ef9f49c487675b · Roger Ferrer / llvm-epi

Nov 06, 2018

[LICM] Use ICFLoopSafetyInfo in LICM · 69f6dfa0

Max Kazantsev authored Nov 06, 2018

This patch makes LICM use `ICFLoopSafetyInfo` that is a smarter version
of LoopSafetyInfo that leverages power of Implicit Control Flow Tracking
to keep track of throwing instructions and give less pessimistic answers
to queries related to throws.

The ICFLoopSafetyInfo itself has been introduced in rL344601. This patch
enables it in LICM only.

Differential Revision: https://reviews.llvm.org/D50377
Reviewed By: apilipenko

llvm-svn: 346201

69f6dfa0

[NFC] Add motivating test case for revert in rL346198 · c210c65e
Max Kazantsev authored Nov 06, 2018
```
llvm-svn: 346199
```
c210c65e

Revert "[IndVars] Smart hard uses detection" · e059f445

Max Kazantsev authored Nov 06, 2018

This reverts commit 2f425e9c7946b9d74e64ebbfa33c1caa36914402.

It seems that the check that we still should do the transform if we
know the result is constant is missing in this code. So the logic that
has been deleted by this change is still sometimes accidentally useful.
I revert the change to see what can be done about it. The motivating
case is the following:

@Y = global [400 x i16] zeroinitializer, align 1

define i16 @foo() {
entry:
  br label %for.body

for.body:                                         ; preds = %entry, %for.body
  %i = phi i16 [ 0, %entry ], [ %inc, %for.body ]

  %arrayidx = getelementptr inbounds [400 x i16], [400 x i16]* @Y, i16 0, i16 %i
  store i16 0, i16* %arrayidx, align 1
  %inc = add nuw nsw i16 %i, 1
  %cmp = icmp ult i16 %inc, 400
  br i1 %cmp, label %for.body, label %for.end

for.end:                                          ; preds = %for.body
  %inc.lcssa = phi i16 [ %inc, %for.body ]
  ret i16 %inc.lcssa
}

We should be able to figure out that the result is constant, but the patch
breaks it.

Differential Revision: https://reviews.llvm.org/D51584

llvm-svn: 346198

e059f445

[LLVM-C] Fix Windows Build of Core · 6c7073f2

Robert Widmann authored Nov 06, 2018

strndup doesn't exist outside of GNU-land and modern macOSes.  Use
strdup instead as c_str() is guaranteed to be NUL-terminated.

llvm-svn: 346197

6c7073f2

[LLVM-C] Improve Intrinsics Bindings · d36f3b0f

Robert Widmann authored Nov 06, 2018

Summary:
Improve the intrinsic bindings with operations for

- Retrieving and automatically inserting the declaration of an intrinsic by ID
- Retrieving the name of a non-overloaded intrinsic by ID
- Retrieving the name of an overloaded intrinsic by ID and overloaded parameter types

Improve the echo test to copy non-overloaded intrinsics by ID.

Reviewers: whitequark, deadalnix

Reviewed By: whitequark

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D53626

llvm-svn: 346195

d36f3b0f

[X86] Autogenerate complete checks. NFC · 17057b52
Craig Topper authored Nov 06, 2018
```
llvm-svn: 346188
```
17057b52

Revert "[WebAssembly] Fixup `main` signature by default" · 5292d17e

Sam Clegg authored Nov 06, 2018

This reverts rL345880.  It caused some test failures on the
webassembly waterfall.  e.g. binaryen2.test_mainenv fails due
the fact that `envp` ends up being undef rather than 0.

Differential Revision: https://reviews.llvm.org/D54117

llvm-svn: 346187

5292d17e

Specify REQUIRES: default_triple in two debuginfo tests · d0534530
Justin Bogner authored Nov 06, 2018
```
These were failing when specifying LLVM_DEFAULT_TARGET_TRIPLE=''

llvm-svn: 346185
```
d0534530
TargetMachine: Move lib/CodeGen specific callbacks to LLVMTargetMachine; NFC · 7ab3a66a
Matthias Braun authored Nov 05, 2018
```
llvm-svn: 346184
```
7ab3a66a

MachineFunction: Store more specific reference to LLVMTargetMachine; NFC · 7a75a91b

Matthias Braun authored Nov 05, 2018

MachineFunction can only be used in code using lib/CodeGen, hence we
can keep a more specific reference to LLVMTargetMachine rather than just
TargetMachine around.

Do the same for references in ScheduleDAG and RegUsageInfoCollector.

llvm-svn: 346183

7a75a91b

MachineModuleInfo: Store more specific reference to LLVMTargetMachine; NFC · 3d849f67

Matthias Braun authored Nov 05, 2018

MachineModuleInfo can only be used in code using lib/CodeGen, hence we
can keep a more specific reference to LLVMTargetMachine rather than just
TargetMachine around.

llvm-svn: 346182

3d849f67

[DWARF] Support types CU list in .gdb_index dumping · 54d23a8e

Fangrui Song authored Nov 05, 2018

Some executables have non-empty types CU list and -gdb-index would report "<error reporting>" before.

llvm-svn: 346181

54d23a8e

[TargetLowering] Change TargetLoweringBase::getPreferredVectorAction to take... · 0b5f8169

Craig Topper authored Nov 05, 2018

[TargetLowering] Change TargetLoweringBase::getPreferredVectorAction to take an MVT instead of an EVT. NFC

The main caller of this already has an MVT and several targets called getSimpleVT inside without checking isSimple. This makes the simpleness explicit.

llvm-svn: 346180

0b5f8169

Nov 05, 2018

AMDGPU: Add sram-ecc feature · 108927b9
Konstantin Zhuravlyov authored Nov 05, 2018
```
Differential Revision: https://reviews.llvm.org/D53222

llvm-svn: 346177
```
108927b9
Revert "[GlobalISel] Refactor the artifact combiner a bit by using MIPatternMatch" · f668d32d
Volkan Keles authored Nov 05, 2018
```
This reverts r346166 as it breaks
test-suite-verify-machineinstrs-aarch64-globalisel-O0-g.

llvm-svn: 346175
```
f668d32d

[X86] Don't turn any_extend from a mask register into a sign_extend during... · def82a81

Craig Topper authored Nov 05, 2018

[X86] Don't turn any_extend from a mask register into a sign_extend during lowering. Add patterns to match any_extend during isel instead.

SimplifyDemandedBits can turn a sign_extend back into an any_extend and trigger an infinite loop. So instead legalize it the same way as a sign_extend, but preserve the opcode. Then just pattern match it the same as sign_extend during isel.

I don't have a reduced test case for such an infinite loop yet.

llvm-svn: 346170

def82a81

[InstSimplify] fold select (fcmp X, Y), X, Y · 14401078

Sanjay Patel authored Nov 05, 2018

This is NFCI for InstCombine because it calls InstSimplify, 
so I left the tests for this transform there. As noted in
the code comment, we can allow this fold more often by using
FMF and/or value tracking.

llvm-svn: 346169

14401078

[InstSimplify] add tests for select+fcmp; NFC · 72c2d355

Sanjay Patel authored Nov 05, 2018

These are translated from InstCombine's test file with the same name.
We should move the transform from InstCombine to InstSimplify.

llvm-svn: 346168

72c2d355

[GlobalISel] Refactor the artifact combiner a bit by using MIPatternMatch · 6b1162fa

Volkan Keles authored Nov 05, 2018

Reviewers: aditya_nandakumar

Reviewed By: aditya_nandakumar

Subscribers: rovka, kristof.beyls, llvm-commits

Differential Revision: https://reviews.llvm.org/D54116

llvm-svn: 346166

6b1162fa

[X86] Regenerate test checks in preparation for a patch. NFC · ab896b08

Craig Topper authored Nov 05, 2018

I'm preparing a patch to avoid creating critical edges in cmov expansion. Updating these tests to make the changes by the next patch easier to see.

llvm-svn: 346161

ab896b08

[COFF][LLD] Add link support for Microsoft precompiled headers OBJs · 71c43cea

Alexandre Ganea authored Nov 05, 2018

This change allows for link-time merging of debugging information from
Microsoft precompiled types OBJs compiled with cl.exe /Z7 /Yc and /Yu.

This fixes llvm.org/PR34278

Differential Revision: https://reviews.llvm.org/D45213

llvm-svn: 346154

71c43cea

Only call FlushFileBuffers() when writing executables on Windows · 3b9b4d21

Alexandre Ganea authored Nov 05, 2018

This is a follow-up for "r325274: Call FlushFileBuffers on output files."

Previously, FlushFileBuffers() was called in all cases when writing a file. The objective was to go around a bug in the Windows kernel (as described here: https://randomascii.wordpress.com/2018/02/25/compiler-bug-linker-bug-windows-kernel-bug/). However that is required only when writing EXEs, any other file type doesn't need flushing.

This patch calls FlushFileBuffers() only for EXEs. In addition, we completly disable FlushFileBuffers() for known Windows 10 versions that do not exhibit the original kernel bug.

Differential Revision: https://reviews.llvm.org/D53727

llvm-svn: 346152

3b9b4d21

[MergeICmps] Do not perform the transformation if GEP is used outside of block · 2b7ae47c

Taewook Oh authored Nov 05, 2018

Summary:
This patch prevents MergeICmps to performn the transformation if the address operand GEP of the load instruction has a use outside of the load's parent block. Without this patch, compiler crashes with the given test case because the use of `%first.i` is still around when the basic block is erased from https://github.com/llvm-mirror/llvm/blob/master/lib/Transforms/Scalar/MergeICmps.cpp#L620. I think checking `isUsedOutsideOfBlock` with `GEP` is the original intention of the code, as the checking for `LoadI` is already performed in the same function.

This patch is incomplete though, as this makes the pass overly conservative and fails the test `tuple-four-int8.ll`. I believe what needs to be done is checking if GEP has a use outside of block that is not the part of "Comparisons" chain. Submit the patch as of now to prevent compiler crash.

Reviewers: courbet, trentxintong

Reviewed By: courbet

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D54089

llvm-svn: 346151

2b7ae47c

[InstCombine] add/adjust tests for fcmp+select substitution; NFC · 1cfba9b5

Sanjay Patel authored Nov 05, 2018

There was no coverage for at least 2 out of the 4 patterns because
of fcmp canonicalization. The tests and code should be moved to
InstSimplify in a follow-up because this doesn't create any new values.

llvm-svn: 346150

1cfba9b5

[Power9] Add support for stxvw4x.be and stxvd2x.be intrinsics · 7509880b

Zaara Syeda authored Nov 05, 2018

On Power9, we don't have patterns to select the following intrinsics:
llvm.ppc.vsx.stxvw4x.be
llvm.ppc.vsx.stxvd2x.be

This patch adds support for these.

Differential Revision: https://reviews.llvm.org/D53581

llvm-svn: 346148

7509880b

[InstCombine] canonicalize -0.0 to +0.0 in fcmp · c26fd1e7

Sanjay Patel authored Nov 05, 2018

As stated in IEEE-754 and discussed in:
https://bugs.llvm.org/show_bug.cgi?id=38086
...the sign of zero does not affect any FP compare predicate.

Known regressions were fixed with:
rL346097 (D54001)
rL346143

The transform will help reduce pattern-matching complexity to solve:
https://bugs.llvm.org/show_bug.cgi?id=39475
...as well as improve CSE and codegen (a zero constant is almost always
easier to produce than 0x80..00).

llvm-svn: 346147

c26fd1e7

[InstCombine] loosen FP 0.0 constraint for fcmp+select substitution · 87aa1006

Sanjay Patel authored Nov 05, 2018

It looks like we correctly removed edge cases with 0.0 from D50714,
but we were a bit conservative because getBinOpIdentity() doesn't
distinguish between +0.0 and -0.0 and 'nsz' is effectively always
true for fcmp (see discussion in:
https://bugs.llvm.org/show_bug.cgi?id=38086

Without this change, we would get regressions by canonicalizing
to +0.0 in all fcmp, and that's a step towards solving:
https://bugs.llvm.org/show_bug.cgi?id=39475

llvm-svn: 346143

87aa1006

[InstCombine] adjust tests for select with FP identity op; NFC · 8b2a1f7f
Sanjay Patel authored Nov 05, 2018
```
These are mislabeled as negative tests.

llvm-svn: 346142
```
8b2a1f7f
[FPEnv] Add constrained CEIL/FLOOR/ROUND/TRUNC intrinsics · 9757d5d6
Cameron McInally authored Nov 05, 2018
```
Differential Revision: https://reviews.llvm.org/D53411

llvm-svn: 346141
```
9757d5d6

[ThinLTO] Add an option to disable (thin)lto internalization. · 7ca74448

Xin Tong authored Nov 05, 2018

Summary:
LTO and ThinLTO optimizes the IR differently.

One source of differences is the amount of internalizations that
can happen.

Add an option to enable/disable internalization so that other
differences can be studied in isolation. e.g. inlining.

There are other things lto and thinlto do differently, I will add
flags to enable/disable them as needed.

Reviewers: tejohnson, pcc, steven_wu

Subscribers: mehdi_amini, inglorion, steven_wu, dexonsmith, dang, llvm-commits

Differential Revision: https://reviews.llvm.org/D53294

llvm-svn: 346140

7ca74448

[TargetLowering] Begin generalizing TargetLowering::expandFP_TO_SINT support. NFCI. · 6bd468bd
Simon Pilgrim authored Nov 05, 2018
```
Prior to initial work to add vector expansion support, remove assumptions that we're working on scalar types.

llvm-svn: 346139
```
6bd468bd
[InstCombine] add/adjust tests for select with fsub identity op; NFC · 92a53eab
Sanjay Patel authored Nov 05, 2018
```
llvm-svn: 346138
```
92a53eab

[NFCI][FPEnv] Split constrained intrinsic tests · 51a91e86

Cameron McInally authored Nov 05, 2018

The constrained intrinsic tests have grown in number. Split off
the FMA tests into their own file to reduce double coverage.

Differential Revision: https://reviews.llvm.org/D53932

llvm-svn: 346137

51a91e86

[InstCombine] add tests for select with FP identity op; NFC · 278db2fb
Sanjay Patel authored Nov 05, 2018
```
llvm-svn: 346136
```
278db2fb

[Inliner] Penalise inlining of calls with loops at Oz · ba9f245b

David Green authored Nov 05, 2018

We currently seem to underestimate the size of functions with loops in them,
both in terms of absolute code size and in the difficulties of dealing with
such code. (Calls, for example, can be tail merged to further reduce
codesize). At -Oz, we can then increase code size by inlining small loops
multiple times.

This attempts to penalise functions with loops at -Oz by adding a CallPenalty
for each top level loop in the function. It uses LI (and hence DT) to calculate
the number of loops. As we are dealing with minsize, the inline threshold is
small and functions at this point should be relatively small, making the
construction of these cheap.

Differential Revision: https://reviews.llvm.org/D52716

llvm-svn: 346134

ba9f245b

[Mips] Supplement long branch pseudo instructions · 8d7c3517

Stefan Maksimovic authored Nov 05, 2018

Expand on LONG_BRANCH_LUi and LONG_BRANCH_(D)ADDiu pseudo
instructions by creating variants which support
less operands/accept GPR64Opnds as their operand in order
to appease the machine verifier pass.

Differential Revision: https://reviews.llvm.org/D53977

llvm-svn: 346133

8d7c3517

[NFC][ARM] Adding extra test for ARM CGP · 7275eec6
Sam Parker authored Nov 05, 2018
```
Added a reproducer that I received a while ago.

llvm-svn: 346132
```
7275eec6

[AMDGPU] Fix the new atomic optimizer in pixel shaders. · 233a02d0

Neil Henning authored Nov 05, 2018

The new atomic optimizer I previously added in D51969 did not work
correctly when a pixel shader was using derivatives, and had helper
lanes active.

To fix this we add an llvm.amdgcn.ps.live call that guards a branch
around the entire atomic operation - ensuring that all helper lanes are
inactive within the wavefront when we compute our atomic results.

I've added a test case that can cause derivatives, and exposes the
problem.

Differential Revision: https://reviews.llvm.org/D53930

llvm-svn: 346128

233a02d0

[CMake] Expose opt-remark tooling through libOptRemarks.dylib · 2ae1be72

Francis Visoiu Mistrih authored Nov 05, 2018

* Create an install target for it
* Add it under tools/opt-remarks
* Add an export file for the dylib
* Install the llvm-c/OptRemarks.h header
* Add an API to query its version

rdar://45458839

llvm-svn: 346127

2ae1be72

[ARM] Turn assert into condition in ARMCGP · fec793c9

Sam Parker authored Nov 05, 2018

Turn the assert in PrepareConstants into a conditon so that we can
handle mul instructions with negative immediates.

Differential Revision: https://reviews.llvm.org/D54094

llvm-svn: 346126

fec793c9