Commits · a1032a0f7cd5d6eac98d692633b82a86e704792c · Lorenzo Albano / LLVM bpEVL

Jul 22, 2015

[PM/AA] Remove the last of the legacy update API from AliasAnalysis as · a1032a0f

Chandler Carruth authored Jul 22, 2015

part of simplifying its interface and usage in preparation for porting
to work with the new pass manager.

Note that this will likely expose that we have dead arguments, members,
and maybe even pass requirements for AA. I'll be cleaning those up in
seperate patches. This just zaps the actual update API.

Differential Revision: http://reviews.llvm.org/D11325

llvm-svn: 242881

a1032a0f

[PM/AA] Switch to an early-exit. NFC. This was split out of another · d86a4f5e

Chandler Carruth authored Jul 22, 2015

change because the diff is *useless*. I assure you, I just switched to
early-return in this function.

Cleanup in preparation for my next commit, as requested in code review!

llvm-svn: 242880

d86a4f5e

[PM/AA] Put the 'final' keyword in the correct place. And actually · 1ffd12e3
Chandler Carruth authored Jul 22, 2015
```
succeed at compiling my change before committing it too!

llvm-svn: 242879
```
1ffd12e3

[PM/AA] Replace the only use of the AliasAnalysis::deleteValue API (in · da7c1919

Chandler Carruth authored Jul 22, 2015

GlobalsModRef) with CallbackVHs that trigger the same behavior.

This is technically more expensive, but in benchmarking some LTO runs,
it seems unlikely to even be above the noise floor. The only way I was
able to measure the performance of GMR at all was to run nothing else
but this one analysis on a linked clang bitcode file. The call graph
analysis still took 5x more time than GMR, and this change at most made
GMR 2% slower (this is well within the noise, so its hard for me to be
sure that this is an actual change). However, in a real LTO run over the
same bitcode, the GMR run takes so little time that the pass timers
don't measure it.

With this, I can remove the last update API from the AliasAnalysis
interface, but I'll actually remove the interface hook point in
a follow-up commit.

Differential Revision: http://reviews.llvm.org/D11324

llvm-svn: 242878

da7c1919

AVX-512: Added intrinsics for VCVT* instructions. · a26f10ce

Elena Demikhovsky authored Jul 22, 2015

All SKX forms. All VCVT instructions for float/double/int/long types.

Differential Revision: http://reviews.llvm.org/D11343

llvm-svn: 242877

a26f10ce

[LoopUnswitch] Code refactoring to separate trivial loop unswitch and... · c0f3a158

Chen Li authored Jul 22, 2015

[LoopUnswitch] Code refactoring to separate trivial loop unswitch and non-trivial loop unswitch in processCurrentLoop()

Summary: The current code in LoopUnswtich::processCurrentLoop() mixes trivial loop unswitch and non-trivial loop unswitch together. It goes over all basic blocks in the loop and checks if a condition is trivial or non-trivial unswitch condition. However, trivial unswitch condition can only occur in the loop header basic block (where it controls whether or not the loop does something at all). This refactoring separate trivial loop unswitch and non-trivial loop unswitch. Before going over all basic blocks in the loop, it checks if the loop header contains a trivial unswitch condition. If so, unswitch it. Otherwise, go over all blocks like before but don't check trivial condition any more since they are not possible to be in the other blocks. This code has no functionality change.

Reviewers: meheff, reames, broune

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D11276

llvm-svn: 242873

c0f3a158

[BranchFolding] do not iterate the aliases of virtual registers · 20d73c6c

Jingyue Wu authored Jul 22, 2015

Summary:
MCRegAliasIterator only works for physical registers. So, do not run it
on virtual registers.

With this issue fixed, we can resurrect the BranchFolding pass in NVPTX
backend.

Reviewers: jholewinski, bkramer

Subscribers: henryhu, meheff, llvm-commits, jholewinski

Differential Revision: http://reviews.llvm.org/D11174

llvm-svn: 242871

20d73c6c

[SROA] Fix a nasty pile of bugs to do with big-endian, different alloca · ccffdaf7

Chandler Carruth authored Jul 22, 2015

types and loads, loads or stores widened past the size of an alloca,
etc.

This started off with a bug report about big-endian behavior with
bitfields and loads and stores to a { i32, i24 } struct. An initial
attempt to fix this was sent for review in D10357, but that didn't
really get to the root of the problem.

The core issue was that canConvertValue and convertValue in SROA were
handling different bitwidth integers by doing a zext of the integer. It
wouldn't do a trunc though, only a zext! This would in turn lead SROA to
form an i24 load from an i24 alloca, zext it to i32, and then use it.
This would at least produce the wrong value for big-endian systems.

One of my many false starts here was to correct the computation for
big-endian systems by shifting. But this doesn't actually work because
the original code has a 64-bit store to the entire 8 bytes, and a 32-bit
load of the last 4 bytes, and because the alloc size is 8 bytes, we
can't lose that last (least significant if bigendian) byte! The real
problem here is that we're forming an i24 load in SROA which is actually
not sufficiently wide to load all of the necessary bits here. The source
has an i32 load, and SROA needs to form that as well.

The straightforward way to do this is to disable the zext logic in
canConvertValue and convertValue, forcing us to actually load all
32-bits. This seems like a really good change, but it in turn breaks
several other parts of SROA.

First in the chain of knock-on failures, we had places where we were
doing integer-widening promotion even though some of the integer loads
or stores extended *past the end* of the alloca's memory! There was even
a comment about preventing this, but it only prevented the case where
the type had a different bit size from its store size. So I added checks
to handle the cases where we actually have a widened load or store and
to avoid trying to special integer widening promotion in those cases.

Second, we actually rely on the ability to promote in the face of loads
past the end of an alloca! This is important so that we can (for
example) speculate loads around PHI nodes to do more promotion. The bits
loaded are garbage, but as long as they aren't used and the alignment is
suitable high (which it wasn't in the test case!) this is "fine". And we
can't stop promoting here, lots of things stop working well if we do. So
we need to add specific logic to handle the extension (and truncation)
case, but *only* where that extension or truncation are over bytes that
*are outside the alloca's allocated storage* and thus totally bogus to
load or store.

And of course, once we add back this correct handling of extension or
truncation, we need to correctly handle bigendian systems to avoid
re-introducing the exact bug that started us off on this chain of misery
in the first place, but this time even more subtle as it only happens
along speculated loads atop a PHI node.

I've ported an existing test for PHI speculation to the big-endian test
file and checked that we get that part correct, and I've added several
more interesting big-endian test cases that should help check that we're
getting this correct.

Fun times.

llvm-svn: 242869

ccffdaf7

SetVector: add reverse_iterator support. · c519c9b8
Richard Smith authored Jul 22, 2015
```
llvm-svn: 242865
```
c519c9b8
[Fuzzer] Rely on $PATH expansion instead of hardcoding paths in tests. NFC. · 4800c2de
Alexey Samsonov authored Jul 21, 2015
```
llvm-svn: 242851
```
4800c2de
[Fuzzer] Clearly separate regular and DFSan tests. NFC. · dc324e16
Alexey Samsonov authored Jul 21, 2015
```
llvm-svn: 242850
```
dc324e16

[dsymutil] Implement ODR uniquing for C++ code. · 1c65094d

Frederic Riss authored Jul 21, 2015

This optimization allows the DWARF linker to reuse definition of
types it has emitted in previous CUs rather than reemitting them
in each CU that references them. The size and link time gains are
huge. For example when linking the DWARF for a debug build of
clang, this generates a ~150M dwarf file instead of a ~700M one
(the numbers date back a bit and must not be totally accurate
these days).

As with all the other parts of the llvm-dsymutil codebase, the
goal is to keep bit-for-bit compatibility with dsymutil-classic.
The code is littered with a lot of FIXMEs that should be
addressed once we can get rid of the compatibilty goal.

llvm-svn: 242847

1c65094d

MIR Serialization: Start serializing the CFI operands with .cfi_def_cfa_offset. · f4baeb51

Alex Lorenz authored Jul 21, 2015

This commit begins serialization of the CFI index machine operands by
serializing one kind of CFI instruction - the .cfi_def_cfa_offset instruction.

Reviewers: Duncan P. N. Exon Smith
llvm-svn: 242845

f4baeb51

Jul 21, 2015

Fix a performance problem in memcpyopt by removing a linear scan over ranges... · f836c89c

Nick Lewycky authored Jul 21, 2015

Fix a performance problem in memcpyopt by removing a linear scan over ranges when inserting a new range. No functionality change intended. Patch by Anthony Pesch!

llvm-svn: 242843

f836c89c

[MDA] change BlockScanLimit into a command line option. · d058ea92

Jingyue Wu authored Jul 21, 2015

Summary:
In the benchmark (https://github.com/vetter/shoc) we are researching,
the duplicated load is not eliminated because MemoryDependenceAnalysis
hit the BlockScanLimit. This patch change it into a command line option
instead of a hardcoded value.

Patched by Xuetian Weng. 

Test Plan: test/Analysis/MemoryDependenceAnalysis/memdep-block-scan-limit.ll

Reviewers: jingyue, reames

Subscribers: reames, llvm-commits

Differential Revision: http://reviews.llvm.org/D11366

llvm-svn: 242842

d058ea92

[AsmPrinter] Check for valid constants in handleIndirectSymViaGOTPCRel · e8640518
Bruno Cardoso Lopes authored Jul 21, 2015
```
Check whether BaseCst is valid before extracting a GlobalValue.
This fixes PR24163.

Patch by David Majnemer.

llvm-svn: 242840
```
e8640518
[Object][ELF] Handle files with no section header string table. · 402a4f10
Michael J. Spencer authored Jul 21, 2015
```
llvm-svn: 242839
```
402a4f10

[PPC64LE] More vector swap optimization TLC · 2be8054b

Bill Schmidt authored Jul 21, 2015

This makes one substantive change and a few stylistic changes to the
VSX swap optimization pass.

The substantive change is to permit LXSDX and LXSSPX instructions to
participate in swap optimization computations.  The previous change to
insert a swap following a SUBREG_TO_REG widening operation makes this
almost trivial.

I experimented with also permitting STXSDX and STXSSPX instructions.
This can be done using similar techniques:  we could insert a swap
prior to a narrowing COPY operation, and then permit these stores to
participate.  I prototyped this, but discovered that the pattern of a
narrowing COPY followed by an STXSDX does not occur in any of our
test-suite code.  So instead, I added commentary indicating that this
could be done.

Other TLC:
 - I changed SH_COPYSCALAR to SH_COPYWIDEN to more clearly indicate
 the direction of the copy.
 - I factored the insertion of swap instructions into a separate
 function.

Finally, I added a new test case to check that the scalar-to-vector
loads are working properly with swap optimization.

llvm-svn: 242838

2be8054b

MIR Parser: Reuse the function 'lexName' when lexing global value tokens. NFC. · c1fbb354

Alex Lorenz authored Jul 21, 2015

This commit refactors the function 'maybeLexGlobalValue' so that now it reuses
the function 'lexName' when lexing a named global value token.

llvm-svn: 242837

c1fbb354

[SCEV][NFC] Fix a typo in a comment. · 135e5b9d
Sanjoy Das authored Jul 21, 2015
```
llvm-svn: 242834
```
135e5b9d

Don't iterate over the program headers in the constructor of ELFFile. · bbfd90fc

Rafael Espindola authored Jul 21, 2015

Not every program needs this information.

In particular, it is necessary and sufficient for a static linker to scan the
section table.

llvm-svn: 242833

bbfd90fc

Remove oversight group. Replace with LLVM Foundation Board of Directors. · af346433
Tanya Lattner authored Jul 21, 2015
```
llvm-svn: 242830
```
af346433
Make printValue a member function. · 3a0b1dc8
Rafael Espindola authored Jul 21, 2015
```
We were already passing 3 values it can get from ELFDumper.

llvm-svn: 242829
```
3a0b1dc8
Remove always null argument. · c7b0ee2c
Rafael Espindola authored Jul 21, 2015
```
llvm-svn: 242828
```
c7b0ee2c

[RewriteStatepointsForGC] minor style cleanup · 6ff1a1e3

Philip Reames authored Jul 21, 2015

Use a named lambda for readability, common some code, remove a stale comments, and use llvm style variable names.

llvm-svn: 242827

6ff1a1e3

Add some utilities to iterator_range for trimming a range and constructing one from a container. · adc8d721
David Blaikie authored Jul 21, 2015
```
To be used in clang in a follow-up commit.

llvm-svn: 242823
```
adc8d721
Remove getDynamicSymbolName. · 87dff0e0
Rafael Espindola authored Jul 21, 2015
```
llvm-svn: 242821
```
87dff0e0
Remove getStaticSymbolName. · bd05101e
Rafael Espindola authored Jul 21, 2015
```
Every user now keeps track of the correct string table to use.

llvm-svn: 242818
```
bd05101e
Follow up to r242810. NFC. · fe5399fe
Chad Rosier authored Jul 21, 2015
```
llvm-svn: 242812
```
fe5399fe
[AArch64] Simplify the passing of arguments. NFC. · 96a18a96
Chad Rosier authored Jul 21, 2015
```
This is setup for future work planned for the AArch64 Load/Store Opt pass.

llvm-svn: 242810
```
96a18a96

Re-land 242726 to use RAII to do cleanup · 2f907557

Reid Kleckner authored Jul 21, 2015

The LooksLikeCodeInBug11395() codepath was returning without clearing
the ProcessedAllocas cache.

llvm-svn: 242809

2f907557

[RewriteStatepointsForGC] Hoist some code out of a loop · 94babb70
Philip Reames authored Jul 21, 2015
```
llvm-svn: 242808
```
94babb70

MergeFunc: Transfer the callee's attributes when replacing a direct caller · 36512330

Arnold Schwaighofer authored Jul 21, 2015

We insert a bitcast which obfuscates the getCalledFunction for the utility
function which looks up attributes from the called function. Loosing ABI
changing parameter attributes is a bad thing.

rdar://21516488

llvm-svn: 242807

36512330

MIR Serialization: Serialize the external symbol machine operands. · 6ede3744
Alex Lorenz authored Jul 21, 2015
```
Reviewers: Duncan P. N. Exon Smith
llvm-svn: 242806
```
6ede3744

[RewriteStatepointsForGC] Delete trivial code · 74ce2e76

Philip Reames authored Jul 21, 2015

A bit more code cleanup: delete some a trivial true assertion and supporting code, remove a redundant cast, and use count in assertions where feasible.

llvm-svn: 242805

74ce2e76

Remove dead code. · 9cb7933d
Rafael Espindola authored Jul 21, 2015
```
llvm-svn: 242804
```
9cb7933d

IR: Extract a function 'printLLVMNameWithoutPrefix' from 'PrintLLVMName'. NFC. · 2b7f2650

Alex Lorenz authored Jul 21, 2015

This commit extracts the code that prints out a name of an LLVM value without a
prefix from a function 'PrintLLVMName' into a publicly accessible function named
'printLLVMNameWithoutPrefix'.

This change would be useful for MIR serialization, as it would allow the MIR
printer to reuse this function to print out the names of the external symbol
machine operands.

Reviewers: Duncan P. N. Exon Smith
llvm-svn: 242803

2b7f2650

Remove always false parameter. · 34d17fb5
Rafael Espindola authored Jul 21, 2015
```
llvm-svn: 242802
```
34d17fb5
Use range loop. NFC. · fe35b9d4
Rafael Espindola authored Jul 21, 2015
```
llvm-svn: 242801
```
fe35b9d4
Replace the last uses of ELF::getSymbolName in llvm-readobj. · 8e54b3ee
Rafael Espindola authored Jul 21, 2015
```
llvm-svn: 242798
```
8e54b3ee