Commits · c26fd1e77258d2a1ebeff25638636291ac64671a · Roger Ferrer / llvm-epi

Nov 05, 2018

[InstCombine] canonicalize -0.0 to +0.0 in fcmp · c26fd1e7

Sanjay Patel authored Nov 05, 2018

As stated in IEEE-754 and discussed in:
https://bugs.llvm.org/show_bug.cgi?id=38086
...the sign of zero does not affect any FP compare predicate.

Known regressions were fixed with:
rL346097 (D54001)
rL346143

The transform will help reduce pattern-matching complexity to solve:
https://bugs.llvm.org/show_bug.cgi?id=39475
...as well as improve CSE and codegen (a zero constant is almost always
easier to produce than 0x80..00).

llvm-svn: 346147

c26fd1e7

[AST] Get aliased type info from an aliased TemplateSpecialization. · c50240da

Matt Davis authored Nov 05, 2018

Summary:
Previously the TemplateSpecialization instance for 'template_alias', in the example below, returned the type info of the  canonical type (int).  This ignored the type alias if the template type happen to be aliased. 

Before this patch, the assert would trigger with an  alignment of 4:
```
typedef int __attribute__(( aligned( 16 ) )) aligned_int;
template < typename >
using template_alias = aligned_int;
static_assert( alignof( template_alias<void>) == 16, "" );
```

This patch checks if the TemplateSpecialization type has an alias, and if so will return the type information for the aliased type, else the canonical type's info is returned (original behavior).  I believe that this is the desired behavior.  

Reviewers: aaron.ballman, rjmccall

Reviewed By: rjmccall

Subscribers: cfe-commits

Differential Revision: https://reviews.llvm.org/D54048

llvm-svn: 346146

c50240da

[UBsan] disable few tests for FreeBSD · a7b4ae9d

David Carlier authored Nov 05, 2018

Reviewers: krytarowsky, vitalybuka

Reviewed By: vitalybuka

Differential Revision: https://reviews.llvm.org/D54103

llvm-svn: 346145

a7b4ae9d

[Format] Add debugging to ObjC language guesser · 0675e87e

Ben Hamilton authored Nov 05, 2018

Summary:
To handle diagnosing bugs where ObjCHeaderStyleGuesser guesses
wrong, this diff adds a bit more debug logging to the Objective-C
language guesser.

Reviewers: krasimir

Reviewed By: krasimir

Subscribers: cfe-commits

Differential Revision: https://reviews.llvm.org/D54110

llvm-svn: 346144

0675e87e

[InstCombine] loosen FP 0.0 constraint for fcmp+select substitution · 87aa1006

Sanjay Patel authored Nov 05, 2018

It looks like we correctly removed edge cases with 0.0 from D50714,
but we were a bit conservative because getBinOpIdentity() doesn't
distinguish between +0.0 and -0.0 and 'nsz' is effectively always
true for fcmp (see discussion in:
https://bugs.llvm.org/show_bug.cgi?id=38086

Without this change, we would get regressions by canonicalizing
to +0.0 in all fcmp, and that's a step towards solving:
https://bugs.llvm.org/show_bug.cgi?id=39475

llvm-svn: 346143

87aa1006

[InstCombine] adjust tests for select with FP identity op; NFC · 8b2a1f7f
Sanjay Patel authored Nov 05, 2018
```
These are mislabeled as negative tests.

llvm-svn: 346142
```
8b2a1f7f
[FPEnv] Add constrained CEIL/FLOOR/ROUND/TRUNC intrinsics · 9757d5d6
Cameron McInally authored Nov 05, 2018
```
Differential Revision: https://reviews.llvm.org/D53411

llvm-svn: 346141
```
9757d5d6

[ThinLTO] Add an option to disable (thin)lto internalization. · 7ca74448

Xin Tong authored Nov 05, 2018

Summary:
LTO and ThinLTO optimizes the IR differently.

One source of differences is the amount of internalizations that
can happen.

Add an option to enable/disable internalization so that other
differences can be studied in isolation. e.g. inlining.

There are other things lto and thinlto do differently, I will add
flags to enable/disable them as needed.

Reviewers: tejohnson, pcc, steven_wu

Subscribers: mehdi_amini, inglorion, steven_wu, dexonsmith, dang, llvm-commits

Differential Revision: https://reviews.llvm.org/D53294

llvm-svn: 346140

7ca74448

[TargetLowering] Begin generalizing TargetLowering::expandFP_TO_SINT support. NFCI. · 6bd468bd
Simon Pilgrim authored Nov 05, 2018
```
Prior to initial work to add vector expansion support, remove assumptions that we're working on scalar types.

llvm-svn: 346139
```
6bd468bd
[InstCombine] add/adjust tests for select with fsub identity op; NFC · 92a53eab
Sanjay Patel authored Nov 05, 2018
```
llvm-svn: 346138
```
92a53eab

[NFCI][FPEnv] Split constrained intrinsic tests · 51a91e86

Cameron McInally authored Nov 05, 2018

The constrained intrinsic tests have grown in number. Split off
the FMA tests into their own file to reduce double coverage.

Differential Revision: https://reviews.llvm.org/D53932

llvm-svn: 346137

51a91e86

[InstCombine] add tests for select with FP identity op; NFC · 278db2fb
Sanjay Patel authored Nov 05, 2018
```
llvm-svn: 346136
```
278db2fb
[Tooling] Correct the total number of files being processed when `filter` is provided. · 63ecf073
Haojian Wu authored Nov 05, 2018
```
Reviewers: ioeric

Subscribers: cfe-commits

Differential Revision: https://reviews.llvm.org/D54104

llvm-svn: 346135
```
63ecf073

[Inliner] Penalise inlining of calls with loops at Oz · ba9f245b

David Green authored Nov 05, 2018

We currently seem to underestimate the size of functions with loops in them,
both in terms of absolute code size and in the difficulties of dealing with
such code. (Calls, for example, can be tail merged to further reduce
codesize). At -Oz, we can then increase code size by inlining small loops
multiple times.

This attempts to penalise functions with loops at -Oz by adding a CallPenalty
for each top level loop in the function. It uses LI (and hence DT) to calculate
the number of loops. As we are dealing with minsize, the inline threshold is
small and functions at this point should be relatively small, making the
construction of these cheap.

Differential Revision: https://reviews.llvm.org/D52716

llvm-svn: 346134

ba9f245b

[Mips] Supplement long branch pseudo instructions · 8d7c3517

Stefan Maksimovic authored Nov 05, 2018

Expand on LONG_BRANCH_LUi and LONG_BRANCH_(D)ADDiu pseudo
instructions by creating variants which support
less operands/accept GPR64Opnds as their operand in order
to appease the machine verifier pass.

Differential Revision: https://reviews.llvm.org/D53977

llvm-svn: 346133

8d7c3517

[NFC][ARM] Adding extra test for ARM CGP · 7275eec6
Sam Parker authored Nov 05, 2018
```
Added a reproducer that I received a while ago.

llvm-svn: 346132
```
7275eec6

[Tooling] Add "-filter" option to AllTUsExecution · cd5e59f5

Haojian Wu authored Nov 05, 2018

Summary: We can run the tools on a subset files of compilation database.

Reviewers: ioeric

Subscribers: cfe-commits

Differential Revision: https://reviews.llvm.org/D54092

llvm-svn: 346131

cd5e59f5

Reapply "Fix regression in behavior of clang -x c++-header -fmodule-name=XXX" · 5904c41e

Benjamin Kramer authored Nov 05, 2018

This reverts commit r345963. We have a path forward now.

Original commit message:
The driver accidentally stopped passing the input filenames on to -cc1
in this mode due to confusion over what action was being requested.

This change also fixes a couple of crashes I encountered when passing
multiple files to such a -cc1 invocation.

llvm-svn: 346130

5904c41e

Fix build on sparc64-linux-gnu. · 5f60861b
Martin Liska authored Nov 05, 2018
```
Differential Revision: https://reviews.llvm.org/D54030

llvm-svn: 346129
```
5f60861b

[AMDGPU] Fix the new atomic optimizer in pixel shaders. · 233a02d0

Neil Henning authored Nov 05, 2018

The new atomic optimizer I previously added in D51969 did not work
correctly when a pixel shader was using derivatives, and had helper
lanes active.

To fix this we add an llvm.amdgcn.ps.live call that guards a branch
around the entire atomic operation - ensuring that all helper lanes are
inactive within the wavefront when we compute our atomic results.

I've added a test case that can cause derivatives, and exposes the
problem.

Differential Revision: https://reviews.llvm.org/D53930

llvm-svn: 346128

233a02d0

[CMake] Expose opt-remark tooling through libOptRemarks.dylib · 2ae1be72

Francis Visoiu Mistrih authored Nov 05, 2018

* Create an install target for it
* Add it under tools/opt-remarks
* Add an export file for the dylib
* Install the llvm-c/OptRemarks.h header
* Add an API to query its version

rdar://45458839

llvm-svn: 346127

2ae1be72

[ARM] Turn assert into condition in ARMCGP · fec793c9

Sam Parker authored Nov 05, 2018

Turn the assert in PrepareConstants into a conditon so that we can
handle mul instructions with negative immediates.

Differential Revision: https://reviews.llvm.org/D54094

llvm-svn: 346126

fec793c9

[ARM][ARMCGP] Remove unecessary zexts and truncs · fcd8adab

Sam Parker authored Nov 05, 2018

r345840 slightly changed the way promotion happens which could
result in zext and truncs having the same source and destination
types. This fixes that issue.

We can now also remove the zext and trunc in the following case:
(zext (trunc (promoted op)), i32)

This means that we can no longer treat a value, that is only used by
a sink, to be safe to promote.

I've also added in some extra asserts and replaced a cast for a
dyn_cast.

Differential Revision: https://reviews.llvm.org/D54032

llvm-svn: 346125

fcd8adab

[mips][msa] Fix broken test · 76741336

Aleksandar Beserminji authored Nov 05, 2018

Test builtins-mips-msa-error.c wasn't reporting errors.
This patch fixes the test, so further test cases can be added.

Differential Revision: https://reviews.llvm.org/D53984

llvm-svn: 346124

76741336

Fix breakage on FrontendTest by initializing new field on constructor · 16950edc
Kadir Cetinkaya authored Nov 05, 2018
```
llvm-svn: 346123
```
16950edc
Exclude wasm target from Windows packaging due to PR39448 · d604ea53
Hans Wennborg authored Nov 05, 2018
```
llvm-svn: 346122
```
d604ea53
[NFC][x86][AArch64] extract-bits.ll: add test with 'ashr'. · 7db25f2b
Roman Lebedev authored Nov 05, 2018
```
llvm-svn: 346121
```
7db25f2b

[hwasan] Fix stack-uar.c after rCRT345110 · 342ca711

Fangrui Song authored Nov 05, 2018

Set -fno-discard-value-names so that the frame description string contains the variable name.

llvm-svn: 346120

342ca711

[DAGCombiner] Use tryFoldToZero to simplify some code and make it work... · 8f2f2a76

Craig Topper authored Nov 05, 2018

[DAGCombiner] Use tryFoldToZero to simplify some code and make it work correctly between LegalTypes and LegalOperations.

The original code avoided creating a zero vector after type legalization, but if we're after type legalization the type we have is legal. The real hazard we need to avoid is creating a build vector after op legalization. tryFoldToZero takes care of checking for this.

llvm-svn: 346119

8f2f2a76

[DAGCombiner] Remove an unused argument from tryFoldToZero. NFC · 8d64abdd
Craig Topper authored Nov 05, 2018
```
llvm-svn: 346118
```
8d64abdd

[AVR] Fix a backend bug that left extraneous operands after expansion · 4c5a5c8d

Dylan McKay authored Nov 05, 2018

This patch fixes a bug in the AVR FRMIDX expansion logic.

The expansion would leave a leftover operand from the original FRMIDX,
but now attached to a MOVWRdRr instruction. The MOVWRdRr instruction
did not expect this operand and so LLVM rejected the machine
instruction.

This would trigger an assertion:

    Assertion failed: ((isImpReg || Op.isRegMask() || MCID->isVariadic() ||
                        OpNo < MCID->getNumOperands() || isMetaDataOp) &&
                        "Trying to add an operand to a machine instr that is already done!"),
    function addOperand, file llvm/lib/CodeGen/MachineInstr.cpp

Tim fixed this so that now the FRMIDX is expanded correctly into
a well-formed MOVWRdRr.

Patch by Tim Neumann

llvm-svn: 346117

4c5a5c8d

[XRay] Update TSC math to handle wraparound · 1e255e7a

Dean Michael Berris authored Nov 05, 2018

Summary:
Prior to this change, we can run into situations where the TSC we're
getting when exiting a function is less than the TSC we got when
entering it. This would sometimes cause the counter for cumulative call
times overflow, which was erroneously also being stored as a signed
64-bit integer.

This change addresses both these issues while adding provisions for
tracking CPU migrations. We do this because moving from one CPU to
another doesn't guarantee that the timestamp counter for some
architectures aren't guaranteed to be synchronised. For the moment, we
leave the provisions there until we can update the data format to
include the counting of CPU migrations we can catch.

We update the necessary tests as well, ensuring that our expectations
for the cycle accounting to be met in case of counter wraparound.

Reviewers: mboerger

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D54088

llvm-svn: 346116

1e255e7a

[X86] Custom type legalize v2i8/v2i16/v2i32 mul to use to pmuludq. · 30b627e5

Craig Topper authored Nov 05, 2018

v2i8/v2i16/v2i32 are promoted to v2i64. pmuludq takes a v2i64 input and produces a v2i64 output. Since we don't about the upper bits of the type legalized multiply we can use the pmuludq to produce the multiply result for the bits we do care about.

llvm-svn: 346115

30b627e5

[AVR] Disallow the LDDWRdPtrQ instruction with Z as the destination · 9a9ae99b

Dylan McKay authored Nov 05, 2018

This is an AVR-specific workaround for a limitation of the register
allocator that only exposes itself on targets with high register
contention like AVR, which only has three pointer registers.

The three pointer registers are X, Y, and Z.
In most nontrivial functions, Y is reserved for the frame pointer,
as per the calling convention. This leaves X and Z. Some instructions,
such as LPM ("load program memory"), are only defined for the Z
register. Sometimes this just leaves X.

When the backend generates a LDDWRdPtrQ instruction with Z as the
destination pointer, it usually trips up the register allocator
with this error message:

  LLVM ERROR: ran out of registers during register allocation

This patch is a hacky workaround. We ban the LDDWRdPtrQ instruction
from ever using the Z register as an operand. This gives the
register allocator a bit more space to allocate, fixing the
regalloc exhaustion error.

Here is a description from the patch author Peter Nimmervoll

  As far as I understand the problem occurs when LDDWRdPtrQ uses
  the ptrdispregs register class as target register. This should work, but
  the allocator can't deal with this for some reason. So from my testing,
  it seams like (and I might be totally wrong on this) the allocator reserves
  the Z register for the ICALL instruction and then the register class
  ptrdispregs only has 1 register left and we can't use Y for source and
  destination. Removing the Z register from DREGS fixes the problem but
  removing Y register does not.

More information about the bug can be found on the avr-rust issue
tracker at https://github.com/avr-rust/rust/issues/37.

A bug has raised to track the removal of this workaround and a proper
fix; PR39553 at https://bugs.llvm.org/show_bug.cgi?id=39553.

Patch by Peter Nimmervoll

llvm-svn: 346114

9a9ae99b

[analyzer] Restrict AnalyzerOptions' interface so that non-checker objects have to be registered · 0a1f91c8

Kristof Umann authored Nov 05, 2018

One of the reasons why AnalyzerOptions is so chaotic is that options can be
retrieved from the command line whenever and wherever. This allowed for some
options to be forgotten for a looooooong time. Have you ever heard of
"region-store-small-struct-limit"? In order to prevent this in the future, I'm
proposing to restrict AnalyzerOptions' interface so that only checker options
can be retrieved without special getters. I would like to make every option be
accessible only through a getter, but checkers from plugins are a thing, so I'll
have to figure something out for that.

This also forces developers who'd like to add a new option to register it
properly in the .def file.

This is done by

* making the third checker pointer parameter non-optional, and checked by an
  assert to be non-null.
* I added new, but private non-checkers option initializers, meant only for
  internal use,
* Renamed these methods accordingly (mind the consistent name for once with
  getBooleanOption!):
  - getOptionAsString -> getCheckerStringOption,
  - getOptionAsInteger -> getCheckerIntegerOption
* The 3 functions meant for initializing data members (with the not very
  descriptive getBooleanOption, getOptionAsString and getOptionAsUInt names)
  were renamed to be overloads of the getAndInitOption function name.
* All options were in some way retrieved via getCheckerOption. I removed it, and
  moved the logic to getStringOption and getCheckerStringOption. This did cause
  some code duplication, but that's the only way I could do it, now that checker
  and non-checker options are separated. Note that the non-checker version
  inserts the new option to the ConfigTable with the default value, but the
  checker version only attempts to find already existing entries. This is how
  it always worked, but this is clunky and I might end reworking that too, so we
  can eventually get a ConfigTable that contains the entire configuration of the
  analyzer.

Differential Revision: https://reviews.llvm.org/D53483

llvm-svn: 346113

0a1f91c8

Ensure the correct order of evaluation in part 2. of PlistMacroExpansion · cb88cc67

Kristof Umann authored Nov 05, 2018

Windows buildbots break with the previous commit '[analyzer][PlistMacroExpansion]
Part 2.: Retrieving the macro name and primitive expansion'. This patch attempts
to solve this issue.

llvm-svn: 346112

cb88cc67

Reland '[analyzer][PlistMacroExpansion] Part 2.: Retrieving the macro name and primitive expansion' · 3800257f
Kristof Umann authored Nov 05, 2018
```
llvm-svn: 346111
```
3800257f
[X86] Fix typo in test comment. NFC · 60789b34
Craig Topper authored Nov 05, 2018
```
llvm-svn: 346110
```
60789b34

[LLDB] Fix FreeBSD/Darwin build · 511e1cf1

David Carlier authored Nov 04, 2018

Reviewers: JDevlieghere, tatyana-krasnukha

Reviwed By: tatyana-krasnukha

Differential Revision: https://reviews.llvm.org/D54084

llvm-svn: 346109

511e1cf1

[HotColdSplitting] Use TTI to inform outlining threshold · d2a895a9

Vedant Kumar authored Nov 04, 2018

Using TargetTransformInfo allows the splitting pass to factor in the
code size cost of instructions as it decides whether or not outlining is
profitable.

This did not regress the overall amount of outlining seen on the handful
of internal frameworks I tested.

Thanks to Jun Bum Lim for suggesting this!

Differential Revision: https://reviews.llvm.org/D53835

llvm-svn: 346108

d2a895a9