Commits · 5d0d30304c5f2305d8458625e5e46db6cc7f0a06 · Roger Ferrer / llvm-epi

Apr 01, 2018

AMDGPU: Make getTgtMemIntrinsic table-driven for resource-based intrinsics · 5d0d3030

Nicolai Haehnle authored Apr 01, 2018

Summary:
Avoids having to list all intrinsics manually.

This is in preparation for the new dimension-aware image intrinsics,
which I'd rather not have to list here by hand.

Change-Id: If7ced04998397ef68c4cb8f7de66b5050fb767e5

Reviewers: arsenm, rampitec, b-sumner

Subscribers: kzhuravl, wdng, mgorny, yaxunl, dstuttard, tpr, llvm-commits, t-tye

Differential Revision: https://reviews.llvm.org/D44937

llvm-svn: 328938

5d0d3030

TableGen: Support Intrinsic values in SearchableTable · 398c0b67

Nicolai Haehnle authored Apr 01, 2018

Summary:
We will use this in the AMDGPU backend in a subsequent patch
in the stack to lookup target-specific per-intrinsic information.

The generic CodeGenIntrinsic machinery is used to ensure that,
even though we don't calculate actual enum values here, we do
get the intrinsics in the right order for the binary search
index.

Change-Id: If61cd5587963a4c5a1cc53df1e59c5e4dec1f9dc

Reviewers: arsenm, rampitec, b-sumner

Subscribers: wdng, tpr, llvm-commits

Differential Revision: https://reviews.llvm.org/D44935

llvm-svn: 328937

398c0b67

TableGen: More helpful error messages · 24e3a4d6

Nicolai Haehnle authored Apr 01, 2018

Summary: Change-Id: I3c23f6f6597912423762780cd8c5315870412bbe

Reviewers: arsenm, rampitec, b-sumner

Subscribers: wdng, llvm-commits

Differential Revision: https://reviews.llvm.org/D44936

Change-Id: Ie62614a3e2d7774f46e4034478b28f57100a2c92
llvm-svn: 328936

24e3a4d6

[DebugInfo] Change std::sort to llvm::sort in response to r327219 · fe1d28e8

Mandeep Singh Grang authored Apr 01, 2018

Summary:
r327219 added wrappers to std::sort which randomly shuffle the container before sorting.
This will help in uncovering non-determinism caused due to undefined sorting
order of objects having the same key.

To make use of that infrastructure we need to invoke llvm::sort instead of std::sort.

Note: This patch is one of a series of patches to replace *all* std::sort to llvm::sort.
Refer the comments section in D44363 for a list of all the required patches.

Reviewers: echristo, zturner, samsonov

Reviewed By: echristo

Subscribers: JDevlieghere, llvm-commits

Differential Revision: https://reviews.llvm.org/D45134

llvm-svn: 328935

fe1d28e8

[ThinLTO] Add an import cutoff for debugging/triaging · 974706eb

Teresa Johnson authored Apr 01, 2018

Summary:
Adds -import-cutoff=N which will stop importing during the thin link
after N imports. Default is -1 (no  limit).

Reviewers: wmi

Subscribers: inglorion, llvm-commits

Differential Revision: https://reviews.llvm.org/D45127

llvm-svn: 328934

974706eb

[LoopRotate] Rotate loops with loop exiting latches · f80ebc8d

David Green authored Apr 01, 2018

If a loop has a loop exiting latch, it can be profitable
to rotate the loop if it leads to the simplification of
a phi node. Perform rotation in these cases even if loop
rotate itself didnt simplify the loop to get there.

Differential Revision: https://reviews.llvm.org/D44199

llvm-svn: 328933

f80ebc8d

[clang-tidy] Define __clang_analyzer__ macro for clang-tidy for compatibility... · c16815ca

Zinovy Nis authored Apr 01, 2018

[clang-tidy] Define __clang_analyzer__ macro for clang-tidy for compatibility with clang static analyzer

This macro is widely used in many well-known projects, ex. Chromium.
But it's not set for clang-tidy, so for ex. DCHECK in Chromium is not considered as [[no-return]], and a lot of false-positive warnings about nullptr dereferenced are emitted.
This patch fixes the issue by explicitly added macro definition.

Differential Revision: https://reviews.llvm.org/D44906

llvm-svn: 328932

c16815ca

[X86] Don't check for folding into a store when deciding if we can promote an i16 mul. · 9b8cd5fe
Craig Topper authored Apr 01, 2018
```
There's no RMW mul operation.

llvm-svn: 328931
```
9b8cd5fe

[X86] Check if the load and store are to the same pointer before preventing... · db6caabc

Craig Topper authored Apr 01, 2018

[X86] Check if the load and store are to the same pointer before preventing i16 RMW shifts and subtracts from being promoted.

llvm-svn: 328930

db6caabc

[X86] Add test case to show failure to promote i16 subtract when the LHS is a... · 3998041e

Craig Topper authored Apr 01, 2018

[X86] Add test case to show failure to promote i16 subtract when the LHS is a load and the result is stored to a different address.

We mistakenly believe we might be able to fold this as a RMW operation, but that doesn't end up happening.

llvm-svn: 328929

3998041e

[X86] Allow i16 subtracts to be promoted if the load is on the LHS and its not being stored. · ae2de57d
Craig Topper authored Apr 01, 2018
```
llvm-svn: 328928
```
ae2de57d

[X86] Add test case to show failure to promote i16 subtract because we... · 280f6313

Craig Topper authored Apr 01, 2018

[X86] Add test case to show failure to promote i16 subtract because we mistakenly believe the load can be folded. NFC

The left hand side of the subtract is a load, but we cna't fold those unless we also have a store.

llvm-svn: 328927

280f6313

[X86] Remove unneeded temporary variable. NFC · 9bc0d881

Craig Topper authored Apr 01, 2018

This Promote flag was alwasys set to true except in the default case. But in the default case we don't need to set PVT and can just return false.

llvm-svn: 328926

9bc0d881

[Analysis] Change std::sort to llvm::sort in response to r327219 · 97bcade7

Mandeep Singh Grang authored Apr 01, 2018

Summary:
r327219 added wrappers to std::sort which randomly shuffle the container before sorting.
This will help in uncovering non-determinism caused due to undefined sorting
order of objects having the same key.

To make use of that infrastructure we need to invoke llvm::sort instead of std::sort.

Note: This patch is one of a series of patches to replace *all* std::sort to llvm::sort.
Refer D44363 for a list of all the required patches.

Reviewers: sanjoy, dexonsmith, hfinkel, RKSimon

Reviewed By: dexonsmith

Subscribers: david2050, llvm-commits

Differential Revision: https://reviews.llvm.org/D44944

llvm-svn: 328925

97bcade7

Add missing include to ContinuousRangeMap.h · caa0e6b5
Eric Fiselier authored Apr 01, 2018
```
llvm-svn: 328924
```
caa0e6b5
Add missing include to Visibility.h · b97e3621
Eric Fiselier authored Apr 01, 2018
```
llvm-svn: 328923
```
b97e3621

Mar 31, 2018

Revert r328845, it caused crbug.com/827810. · e7c7d702
Nico Weber authored Mar 31, 2018
```
llvm-svn: 328922
```
e7c7d702

[DAGCombine] (float)((int) f) --> ftrunc (PR36617) · 6124cae8

Sanjay Patel authored Mar 31, 2018

fptosi / fptoui round towards zero, and that's the same behavior as ISD::FTRUNC, 
so replace a pair of casts with the equivalent node. We don't have to account for 
special cases (NaN, INF) because out-of-range casts are undefined.

Differential Revision: https://reviews.llvm.org/D44909

llvm-svn: 328921

6124cae8

[llvm-rtdyld] Fix the InputFileList cl::opt description: it accepts multiple · 9c755450
Lang Hames authored Mar 31, 2018
```
input files.

llvm-svn: 328920
```
9c755450

[analyzer] Unroll the loop when it has a unsigned counter. · f717d479

Henry Wong authored Mar 31, 2018

Summary:
The original implementation in the `LoopUnrolling.cpp` didn't consider the case where the counter is unsigned. This case is only handled in `simpleCondition()`, but this is not enough, we also need to deal with the unsinged counter with the counter initialization.

Since `IntegerLiteral` is `signed`, there is a `ImplicitCastExpr<IntegralCast>` in `unsigned counter = IntergerLiteral`. This patch add the `ignoringParenImpCasts()` in the `IntegerLiteral` matcher.

Reviewers: szepet, a.sidorin, NoQ, george.karpenkov

Reviewed By: szepet, george.karpenkov

Subscribers: xazax.hun, rnkovacs, cfe-commits, MTC

Differential Revision: https://reviews.llvm.org/D45086

llvm-svn: 328919

f717d479

[X86][Btver2] Add MMX_PSHUFB to the JWritePSHUFB InstRW entries · 3b8ad346
Simon Pilgrim authored Mar 31, 2018
```
llvm-svn: 328918
```
3b8ad346
Fix trailing whitespace. NFCI. · 8c8ebd79
Simon Pilgrim authored Mar 31, 2018
```
llvm-svn: 328917
```
8c8ebd79
Unbreak the build of the go bindings after r328839. · 824f36ed
Benjamin Kramer authored Mar 31, 2018
```
llvm-svn: 328916
```
824f36ed
[MIR-Canon] Adding support for local idempotent instruction hoisting. · 57c4f38c
Puyan Lotfi authored Mar 31, 2018
```
llvm-svn: 328915
```
57c4f38c

[X86] Add SchedRW for PMULLD · 13a0f83a

Craig Topper authored Mar 31, 2018

Summary:
It seems many CPUs don't implement this instruction as well as the other vector multiplies. Often using a multi uop flow. Silvermont in particular has a 7 uop flow with 11 cycle throughput. Sandy Bridge implements it as a single uop with 5 cycle latency and 1 cycle throughput. But Haswell and later use 2 uops with 10 cycle latency and 2 cycle throughput.

This patch adds a new X86SchedWritePair we can use to tag this instruction separately. I've provided correct information for Silvermont, Btver2, and Sandy Bridge. I've removed the InstRWs for SandyBridge. I've left Haswell/Broadwell/Skylake InstRWs in place because I wasn't sure how to account for the different load latency between 128 and 256 bits. I also left Znver1 InstRWs in place because the existing values don't match Agner's spreadsheet.

I also left a FIXME in the SandyBridge model because it being used for the "generic" model is too optimistic for the 256/512-bit versions since those are multiple uops on all known CPUs.

Reviewers: RKSimon, GGanesh, courbet

Reviewed By: RKSimon

Subscribers: gchatelet, gbedwell, andreadb, llvm-commits

Differential Revision: https://reviews.llvm.org/D44972

llvm-svn: 328914

13a0f83a

[analyzer] Hopefully fix the ARM buildbot. · 96871864
George Karpenkov authored Mar 31, 2018
```
llvm-svn: 328913
```
96871864

[analyzer] Fix assertion crash in CStringChecker · 6fe0f035

George Karpenkov authored Mar 31, 2018

An offset might be unknown.

rdar://39054939

Differential Revision: https://reviews.llvm.org/D45115

llvm-svn: 328912

6fe0f035

[analyzer] Cache offset computation for MemRegion · fa4d18c7

George Karpenkov authored Mar 31, 2018

Achieves almost a 200% speedup on the example where the performance of
visitors was problematic.

Performance on sqlite3 is unaffected.

rdar://38818362

Differential Revision: https://reviews.llvm.org/D45113

llvm-svn: 328911

fa4d18c7

[analyzer] Fix liveness calculation for C++17 structured bindings · 137ca91f

George Karpenkov authored Mar 31, 2018

C++ structured bindings for non-tuple-types are defined in a peculiar
way, where the resulting declaration is not a VarDecl, but a
BindingDecl.
That means a lot of existing machinery stops working.

rdar://36912381

Differential Revision: https://reviews.llvm.org/D44956

llvm-svn: 328910

137ca91f

[ThinLTO] Add an option to force summary call edges cold for debugging · db83aceb

Teresa Johnson authored Mar 31, 2018

Summary:
Useful to selectively disable importing into specific modules for
debugging/triaging/workarounds.

Reviewers: eraman

Subscribers: inglorion, llvm-commits

Differential Revision: https://reviews.llvm.org/D45062

llvm-svn: 328909

db83aceb

[ELF] Simplify read32. NFC · ef61d85f
Fangrui Song authored Mar 30, 2018
```
llvm-svn: 328908
```
ef61d85f
Fix a bunch of typoes. NFC · 956ee797
Fangrui Song authored Mar 30, 2018
```
llvm-svn: 328907
```
956ee797

[ASTImporter] Add test helper Fixture · dedda6fa

Peter Szecsi authored Mar 30, 2018

Add a helper test Fixture, so we can add tests which can check internal
attributes of AST nodes like getPreviousDecl(), isVirtual(), etc.
This enables us to check if a redeclaration chain is correctly built during
import, if the virtual flag is preserved during import, etc. We cannot check
such attributes with the existing testImport.
Also, this fixture makes it possible to import from several "From" contexts.

We also added several test cases here, some of them are disabled.
We plan to pass the disabled tests in other patches.

Patch by Gabor Marton!

Differential Revision: https://reviews.llvm.org/D43967

llvm-svn: 328906

dedda6fa

Mar 30, 2018

ELF: Place ordered sections in the middle of the unordered section list on... · 5ea6d50a

Peter Collingbourne authored Mar 30, 2018

ELF: Place ordered sections in the middle of the unordered section list on targets with limited-range branches.

It generally does not matter much where we place sections ordered
by --symbol-ordering-file relative to other sections. But if the
ordered sections are hot (which is the case already for some users
of --symbol-ordering-file, and is increasingly more likely to be
the case once profile-guided section layout lands) and the target
has limited-range branches, it is beneficial to place the ordered
sections in the middle of the output section in order to decrease
the likelihood that a range extension thunk will be required to call
a hot function from a cold function or vice versa.

That is what this patch does. After D44966 it reduces the size of
Chromium for Android's .text section by 60KB.

Differential Revision: https://reviews.llvm.org/D44969

llvm-svn: 328905

5ea6d50a

Prevent data races in concurrent ThinLTO processes. · 0b01dfbb

Ekaterina Romanova authored Mar 30, 2018

Make sure ThinLTO with caching doesn't use non-atomic writes to the cache file (to prevent data races and cache files corruption).

1. Place temp file to the same place where the caching directory is (instead of creating it the directory pointed to by TMP/TEMP variable). This will help to prevent using non-atomic rename and falling back to non-atomic "direct" write to the cache file.
2. if rename failed do not write to the cache file directly (direct write to the file is non-atomic and could cause data race conditions).
3. if cache file doesn't exist (e.g., because 'rename' failed or because some other reasons), bypass using the cache altogether.

Differential Revision: https://reviews.llvm.org/D45076

llvm-svn: 328904

0b01dfbb

[analyzer] Fix test triple in missing-bind-temporary.cpp. · d1fe360b

Artem Dergachev authored Mar 30, 2018

Otherwise the default triple for x86-windows-msvc2015 auto-inserts
__attribute__((thiscall)) to some calls.

Fixes the respective buildbot.

llvm-svn: 328903

d1fe360b

Initialize Elf Header to zero to ensure that bytes not assigned any value... · 7588a8e8

Rumeet Dhindsa authored Mar 30, 2018

Initialize Elf Header to zero to ensure that bytes not assigned any value later on are initialized properly.

Differential Revision: https://reviews.llvm.org/D44986

llvm-svn: 328902

7588a8e8

[WebAssembly] Register wasm passes with the PassRegistry · 40926451

Jacob Gravelle authored Mar 30, 2018

Summary:
This exposes WebAssembly passes for use on the command line (as
arguments to -print-before and the like).

Reviewers: dschuff, sunfish

Subscribers: MatzeB, jfb, sbc100, llvm-commits, aheejin

Differential Revision: https://reviews.llvm.org/D45103

llvm-svn: 328901

40926451

Minor cleanup in __kmp_atfork_child() · 1e6bb8d5

Jonathan Peyton authored Mar 30, 2018

This change removes the unnecessary lock operation on __kmp_initz_lock inside
the __kmp_atfork_child() function for Linux; the lock variable is initialized
in the same function later.

Patch by Hansang Bae

Differential Revision: https://reviews.llvm.org/D44949

llvm-svn: 328900

1e6bb8d5

[Hexagon] Fix testcase · 526fbf8e
Krzysztof Parzyszek authored Mar 30, 2018
```
llvm-svn: 328899
```
526fbf8e