Commits · cb8c009801c927c4f6401d76bac45370c936d778 · Lorenzo Albano / LLVM bpEVL

Dec 16, 2017

[Driver, CodeGen] pass through and apply -fassociative-math · cb8c0098

Sanjay Patel authored Dec 16, 2017

There are 2 parts to getting the -fassociative-math command-line flag translated to LLVM FMF:

1. In the driver/frontend, we accept the flag and its 'no' inverse and deal with the 
   interactions with other flags like -ffast-math -fno-signed-zeros -fno-trapping-math. 
   This was mostly already done - we just need to translate the flag as a codegen option. 
   The test file is complicated because there are many potential combinations of flags here.
   Note that we are matching gcc's behavior that requires 'nsz' and no-trapping-math.

2. In codegen, we map the codegen option to FMF in the IR builder. This is simple code and 
   corresponding test.

For the motivating example from PR27372:

float foo(float a, float x) { return ((a + x) - x); }

$ ./clang -O2 27372.c -S -o - -ffast-math  -fno-associative-math -emit-llvm  | egrep 'fadd|fsub'
  %add = fadd nnan ninf nsz arcp contract float %0, %1
  %sub = fsub nnan ninf nsz arcp contract float %add, %2

So 'reassoc' is off as expected (and so is the new 'afn' but that's a different patch). 
This case now works as expected end-to-end although the underlying logic is still wrong:

$ ./clang  -O2 27372.c -S -o - -ffast-math  -fno-associative-math | grep xmm
	addss	%xmm1, %xmm0
	subss	%xmm1, %xmm0

We're not done because the case where 'reassoc' is set is ignored by optimizer passes. Example:

$ ./clang  -O2 27372.c -S -o - -fassociative-math -fno-signed-zeros -fno-trapping-math -emit-llvm  | grep fadd
  %add = fadd reassoc float %0, %1

$ ./clang -O2  27372.c -S -o - -fassociative-math -fno-signed-zeros -fno-trapping-math | grep xmm
	addss	%xmm1, %xmm0
	subss	%xmm1, %xmm0

Differential Revision: https://reviews.llvm.org/D39812

llvm-svn: 320920

cb8c0098

[X86] Implement kand/kandn/kor/kxor/kxnor/knot intrinsics using native IR. · 5028ace6
Craig Topper authored Dec 16, 2017
```
llvm-svn: 320919
```
5028ace6

[X86] Remove GCCBuiltin from kand/kandn/kor/kxor/kxnor/knot intrinsics so... · d2a2a39c

Craig Topper authored Dec 16, 2017

[X86] Remove GCCBuiltin from kand/kandn/kor/kxor/kxnor/knot intrinsics so clang can implement with native IR.

llvm-svn: 320918

d2a2a39c

[X86] Remove unneeded code for handling the old kunpck intrinsics. · 1c7d07c6
Craig Topper authored Dec 16, 2017
```
llvm-svn: 320917
```
1c7d07c6
[X86] Add the two files I forgot to commit in r320915. · 798f2c03
Craig Topper authored Dec 16, 2017
```
llvm-svn: 320916
```
798f2c03
[X86] Add builtins and tests for 128 and 256 bit vpopcntdq. · b846d1ff
Craig Topper authored Dec 16, 2017
```
llvm-svn: 320915
```
b846d1ff
Move Transforms/LoopVectorize/consecutive-ptr-cg-bug.ll into the X86 subdirectory · 92ea8acb
Hal Finkel authored Dec 16, 2017
```
This test depends on X86's TTI; move into the X86 subdirectory.

llvm-svn: 320914
```
92ea8acb

[LV] Extend InstWidening with CM_Widen_Recursive · 5444f409

Hal Finkel authored Dec 16, 2017

Changes to the original scalar loop during LV code gen cause the return value
of Legal->isConsecutivePtr() to be inconsistent with the return value during
legal/cost phases (further analysis and information of the bug is in D39346).
This patch is an alternative fix to PR34965 following the CM_Widen approach
proposed by Ayal and Gil in D39346. It extends InstWidening enum with
CM_Widen_Reverse to properly record the widening decision for consecutive
reverse memory accesses and, consequently, get rid of the
Legal->isConsetuviePtr() call in LV code gen. I think this is a simpler/cleaner
solution to PR34965 than the one in D39346.

Fixes PR34965.

Patch by Diego Caballero, thanks!

Differential Revision: https://reviews.llvm.org/D40742

llvm-svn: 320913

5444f409

Fixed warning 'function declaration isn’t a prototype [-Werror=strict-prototypes]' · 5f8c84c5
Galina Kistanova authored Dec 16, 2017
```
llvm-svn: 320912
```
5f8c84c5

[PowerPC, AsmParser] Enable the mnemonic spell corrector · e86a8b79

Hal Finkel authored Dec 16, 2017

r307148 added an assembly mnemonic spelling correction support and enabled it
on ARM. This enables that support on PowerPC as well.

Patch by Dmitry Venikov, thanks!

Differential Revision: https://reviews.llvm.org/D40552

llvm-svn: 320911

e86a8b79

[X86] Add 128 and 256-bit VPOPCNTDQ instructions. Adjust some tablegen classes LZCNT/POPCNT. · c0896059
Craig Topper authored Dec 16, 2017
```
I think when this instruction was first published it was only for a Knights CPU and thus VLX version was missing.

llvm-svn: 320910
```
c0896059
[LTO] Update tests for r320905 · 12f9b8cf
Vitaly Buka authored Dec 16, 2017
```
llvm-svn: 320909
```
12f9b8cf

[VerifyDiagnosticConsumer] support -verify=<prefixes> · 05e46484

Hal Finkel authored Dec 16, 2017

This mimics FileCheck's --check-prefixes option.

The default prefix is "expected". That is, "-verify" is equivalent to
"-verify=expected".

The goal is to permit exercising a single test suite source file with different
compiler options producing different sets of diagnostics.  While cpp can be
combined with the existing -verify to accomplish the same goal, source is often
easier to maintain when it's not cluttered with preprocessor directives or
duplicate passages of code. For example, this patch also rewrites some existing
clang tests to demonstrate the benefit of this feature.

Patch by Joel E. Denny, thanks!

Differential Revision: https://reviews.llvm.org/D39694

llvm-svn: 320908

05e46484

Remove trailing whitespace · fd563a03
Vitaly Buka authored Dec 16, 2017
```
llvm-svn: 320907
```
fd563a03

[WebAssembly] Return ArrayRef's rather than const std::vector& · 731a7664

Sam Clegg authored Dec 16, 2017

From working on lld I've learned this is generally the
preferred way for several reasons (e.g. more concise, improves
encapsulation).

Differential Revision: https://reviews.llvm.org/D41265

llvm-svn: 320906

731a7664

[LTO] Make processing of combined module more consistent · a5376f39

Vitaly Buka authored Dec 16, 2017

Summary:
1. Use stream 0 only for combined module. Previously if combined module was not
processes ThinLTO used the stream for own output. However small changes in input,
could trigger combined module  and shuffle outputs making life of llvm::LTO harder.

2. Always process combined module and write output to stream 0. Processing empty
combined module is cheap and allows llvm::LTO users to avoid implementing processing
which is already done in llvm::LTO.

Subscribers: mehdi_amini, inglorion, eraman, hiraditya

Differential Revision: https://reviews.llvm.org/D41267

llvm-svn: 320905

a5376f39

[TextDiagnosticBuffer] Fix diagnostic note emission order · cc15219f

Hal Finkel authored Dec 16, 2017

The frontend currently groups diagnostics from the command line according to
diagnostic level, but that places all notes last. Fix that by emitting such
diagnostics in the order they were generated.

Patch by Joel E. Denny, thanks!

Differential Revision: https://reviews.llvm.org/D40995

llvm-svn: 320904

cc15219f

Add another missing -enable-import-metadata to test · 160f4bb8

Teresa Johnson authored Dec 16, 2017

r320895 modified a test so that it needs -enable-import-metadata which
is false by default for NDEBUG, found another place that needs this
added.

llvm-svn: 320903

160f4bb8

[CodeGen] Specialize mixed-sign mul-with-overflow (fix PR34920) · fa5a0e59

Vedant Kumar authored Dec 16, 2017

This patch introduces a specialized way to lower overflow-checked
multiplications with mixed-sign operands. This fixes link failures and
ICEs on code like this:

  void mul(int64_t a, uint64_t b) {
    int64_t res;
    __builtin_mul_overflow(a, b, &res);
  }

The generic checked-binop irgen would use a 65-bit multiplication
intrinsic here, which requires runtime support for _muloti4 (128-bit
multiplication), and therefore fails to link on i386. To get an ICE
on x86_64, change the example to use __int128_t / __uint128_t.

Adding runtime and backend support for 65-bit or 129-bit checked
multiplication on all of our supported targets is infeasible.

This patch solves the problem by using simpler, specialized irgen for
the mixed-sign case.

llvm.org/PR34920, rdar://34963321

Testing: Apart from check-clang, I compared the output from this fairly
comprehensive test driver using unpatched & patched clangs:
https://gist.github.com/vedantk/3eb9c88f82e5c32f2e590555b4af5081

Differential Revision: https://reviews.llvm.org/D41149

llvm-svn: 320902

fa5a0e59

[SimplifyLibCalls] Inline calls to cabs when it's safe to do so · 2ff24731

Hal Finkel authored Dec 16, 2017

When unsafe algerbra is allowed calls to cabs(r) can be replaced by:

  sqrt(creal(r)*creal(r) + cimag(r)*cimag(r))

Patch by Paul Walker, thanks!

Differential Revision: https://reviews.llvm.org/D40069

llvm-svn: 320901

2ff24731

[LV] NFC patch for moving VP*Recipe class definitions from LoopVectorize.cpp to VPlan.h · 7333aa9f

Hal Finkel authored Dec 16, 2017

This is a small step forward to move VPlan stuff to where it should belong (i.e., VPlan.*):

1. VP*Recipe classes in LoopVectorize.cpp are moved to VPlan.h.
2. Many of VP*Recipe::print() and execute() definitions are still left in
LoopVectorize.cpp since they refer to things declared in LoopVectorize.cpp. To
be moved to VPlan.cpp at a later time.
3. InterleaveGroup class is moved from anonymous namespace to llvm namespace.
Referencing it in anonymous namespace from VPlan.h ended up in warning.

Patch by Hideki Saito, thanks!

Differential Revision: https://reviews.llvm.org/D41045

llvm-svn: 320900

7333aa9f

Add -enable-import-metadata to test · 4358a403

Teresa Johnson authored Dec 16, 2017

r320895 modified a test so that it needs -enable-import-metadata which
is false by default for NDEBUG.

llvm-svn: 320899

4358a403

[X86] Add back the assert from r320830 that was reverted in r320850 · 6b129fde
Craig Topper authored Dec 16, 2017
```
Hopefully r320864 has fixed the offending case that failed the assert.

llvm-svn: 320898
```
6b129fde
Fix NDEBUG build problem in r320895 · 69b2de84
Teresa Johnson authored Dec 16, 2017
```
Fix incorrect placement of #endif causing NDEBUG build failures.

llvm-svn: 320897
```
69b2de84

[COFF] Clean up debug option handling · a1f6fba4

Shoaib Meenai authored Dec 16, 2017

/debug and /debug:dwarf are orthogonal. An object file can contain both
CodeView and DWARF debug info, so the combination of /debug:dwarf and
/debug should generate both DWARF and a PDB, rather than /debug:dwarf
always suppressing PDB creation.

/nopdb is now redundant and can be removed. /debug /nopdb was previously
used to support DWARF, but specifying /debug:dwarf is entirely
equivalent to that combination now.

Differential Revision: https://reviews.llvm.org/D41310

llvm-svn: 320896

a1f6fba4

[ThinLTO] Enable importing of aliases as copy of aliasee · 81bbf742

Teresa Johnson authored Dec 16, 2017

Summary:
This implements a missing feature to allow importing of aliases, which
was previously disabled because alias cannot be available_externally.
We instead import an alias as a copy of its aliasee.

Some additional work was required in the IndexBitcodeWriter for the
distributed build case, to ensure that the aliasee has a value id
in the distributed index file (i.e. even when it is not being
imported directly).

This is a performance win in codes that have many aliases, e.g. C++
applications that have many constructor and destructor aliases.

Reviewers: pcc

Subscribers: mehdi_amini, inglorion, eraman, llvm-commits

Differential Revision: https://reviews.llvm.org/D40747

llvm-svn: 320895

81bbf742

[COFF] Update an outdated comment. NFC · c4fdbca6

Shoaib Meenai authored Dec 15, 2017

This comment dates from when LLD didn't produce actual PDBs, and is very
outdated now.

llvm-svn: 320894

c4fdbca6

Fix WebAssembly backend for some LLVM API changes · 21109249
David Blaikie authored Dec 15, 2017
```
llvm-svn: 320893
```
21109249

[COFF] Simplify hasArgs calls. NFC · 111db794

Shoaib Meenai authored Dec 15, 2017

We can just pass multiple options to hasArgs (which will check for any
of those options being present) instead of calling it multiple times.

llvm-svn: 320892

111db794

[CMake] darwin-debug is an hard dependency for tests on macOS. · 864a6edf
Davide Italiano authored Dec 15, 2017
```
Fixes a few failured on the testsuite with CMake.

llvm-svn: 320891
```
864a6edf

[TableGen][GlobalISel] Make the different Matcher comparable · 893e0f15

Quentin Colombet authored Dec 15, 2017

This opens refactoring opportunities in the match table now that we can
check that two predicates are the same.

NFC.

llvm-svn: 320890

893e0f15

[TableGen][GlobalISel] Fix unused variable warning in release mode · a646ef08
Quentin Colombet authored Dec 15, 2017
```
Introduced in r320887.

NFC.

llvm-svn: 320889
```
a646ef08
Revert "Recommit "[DWARFv5] Dump an MD5 checksum in the line-table header."" · 6d0484f2
Paul Robinson authored Dec 15, 2017
```
This reverts commit 0afef672f63f0e4e91938656bc73424a8c058bfc.
Still failing at runtime on bots.

llvm-svn: 320888
```
6d0484f2

[TableGen][GlobalISel] Have the predicate directly know which data they are dealing with · aad20be6

Quentin Colombet authored Dec 15, 2017

Prior to this patch, a predicate wouldn't make sense outside of its
rule. Indeed, it was only during emitting a rule that a predicate would
be made aware of the IDs of the data it is checking. Because of that,
predicates could not be moved around or compared between each other.

NFC.

llvm-svn: 320887

aad20be6

Dec 15, 2017

Recommit "[DWARFv5] Dump an MD5 checksum in the line-table header." · 5c8f7d7d

Paul Robinson authored Dec 15, 2017

Adds missing support for DW_FORM_data16.

Update of r320852, fixing the unittest to use a hand-coded struct
instead of std::array to guarantee data layout.

Differential Revision: https://reviews.llvm.org/D41090

llvm-svn: 320886

5c8f7d7d

Fix unused variable in non-assert builds · 042fed54
Matthias Braun authored Dec 15, 2017
```
llvm-svn: 320885
```
042fed54
MachineFunction: Return reference from getFunction(); NFC · f1caa283
Matthias Braun authored Dec 15, 2017
```
The Function can never be nullptr so we can return a reference.

llvm-svn: 320884
```
f1caa283

[MacOSX/Queues] Relax an overly aggressive assertion in a test. · 8539edb0

Davide Italiano authored Dec 15, 2017

"Default" is a valid QoS for a thread on older versions of macOS,
like the one installed in the bot.
Thanks to Jason Molenda for helping me figuring out the problem.

<rdar://problem/28346273>

llvm-svn: 320883

8539edb0

MachineFunction: Slight refactoring; NFC · 4684033a
Matthias Braun authored Dec 15, 2017
```
Slight cleanup/refactor in preparation for upcoming commit.

llvm-svn: 320882
```
4684033a

MachineModuleInfo: Remove unused function; NFC · 89488fff

Matthias Braun authored Dec 15, 2017

Remove the unused setModule() function; it would be dangerous if someone
actually used it as it wouldn't reset/recompute various other module
related data.

llvm-svn: 320881

89488fff