Commits · 329860e495667aa3caab095f5e6032ffc4d4dcfd · Roger Ferrer / llvm-epi

Jan 26, 2016

Saleem Abdulrasool authored Jan 26, 2016

Seems that the patch was rebased on top of another change which obsoleted the
change but wasnt caught.

Thanks to nbjoerg for pointing this out!

llvm-svn: 258821

329860e4

don't repeat names in documentation comments; NFC · f1ac8ba4
Sanjay Patel authored Jan 26, 2016
```
llvm-svn: 258820
```
f1ac8ba4
Update for LLVM change · 79dad1d0
Benjamin Kramer authored Jan 26, 2016
```
llvm-svn: 258819
```
79dad1d0
Reflect the MC/MCDisassembler split on the include/ level. · f57c1977
Benjamin Kramer authored Jan 26, 2016
```
No functional change, just moving code around.

llvm-svn: 258818
```
f57c1977

[OpenMP] Parsing + sema for defaultmap clause. · 3cf89040

Arpith Chacko Jacob authored Jan 26, 2016

Summary:
This patch adds parsing + sema for the defaultmap clause associated with the target directive (among others).

Reviewers: ABataev

Differential Revision: http://reviews.llvm.org/D16527

llvm-svn: 258817

3cf89040

[LibCallSimplifier] fold memset(malloc(x), 0, x) --> calloc(1, x) · 980b280f

Sanjay Patel authored Jan 26, 2016

This is a step towards solving PR25892:
https://llvm.org/bugs/show_bug.cgi?id=25892

It won't handle the reported case. As noted by the 'TODO' comments in the patch, 
we need to relax the hasOneUse() constraint and also match patterns that include
memset_chk() and the llvm.memset() intrinsic in addition to memset().

Differential Revision: http://reviews.llvm.org/D16337

llvm-svn: 258816

980b280f

Revert "[Driver] Make sure -fno-math-builtin option is being passed by the driver." · f662fb3d
Chad Rosier authored Jan 26, 2016
```
This reverts commit r258814.

llvm-svn: 258815
```
f662fb3d

[Driver] Make sure -fno-math-builtin option is being passed by the driver. · 17d2e878

Chad Rosier authored Jan 26, 2016

Support for the -fno-math-builtin option was added in r186899.  The codegen side
is being tested in test/CodeGen/nomathbuiltin.c.  The missing part was just
passing the option through the driver.

PR26317

llvm-svn: 258814

17d2e878

[Driver] Update FIXME comment now that PR4941 has been addressed. · 38fd54ed
Chad Rosier authored Jan 26, 2016
```
The actual fix should be addressed by someone who can test on Darwin.

llvm-svn: 258813
```
38fd54ed

Revert "Reapply commit r258404 with fix" · 61d5a184

Matthew Simpson authored Jan 26, 2016

This commit exposes a crash in computeKnownBits on the Chromium buildbots.
Reverting to investigate.

Reference: https://llvm.org/bugs/show_bug.cgi?id=26307
llvm-svn: 258812

61d5a184

Re-submit r256008 "Improve DWARFDebugFrame::parse to also handle __eh_frame." · 03a670c0
Igor Laevsky authored Jan 26, 2016
```
Originally this change was causing failures on windows buildbots.
But those problems were fixed in r258806.

llvm-svn: 258811
```
03a670c0
[WebAssembly] Fix a typo in a comment. · fb619e96
Dan Gohman authored Jan 26, 2016
```
llvm-svn: 258810
```
fb619e96

Unique phi write accesses · ee6a4fc6

Michael Kruse authored Jan 26, 2016

Ensure that there is at most one phi write access per PHINode and
ScopStmt. In particular, this would be possible for non-affine
subregions with multiple exiting blocks. We replace multiple MAY_WRITE
accesses by one MUST_WRITE access. The written value is constructed
using a PHINode of all exiting blocks. The interpretation of the PHI
WRITE's "accessed value" changed from the incoming value to the PHI like
for PHI READs since there is no unique incoming value.

Because region simplification shuffles around PHI nodes -- particularly
with exit node PHIs -- the PHINodes at analysis time does not always
exist anymore in the code generation pass. We instead remember the
incoming block/value pair in the MemoryAccess.

Differential Revision: http://reviews.llvm.org/D15681

llvm-svn: 258809

ee6a4fc6

Unique value read accesses · ad28e5a5

Michael Kruse authored Jan 26, 2016

Keep at most one value read MemoryAccess per value and statement;
multiple generated loads do not have any additional effect. As one such
MemoryAccess can cater multiple uses within the statement, the
AccessInstruction property is not unique any more and set to nullptr.

Differential Revision: http://reviews.llvm.org/D15510

llvm-svn: 258808

ad28e5a5

Unique value write accesses · 436db620

Michael Kruse authored Jan 26, 2016

Ensure there is at most one write access per definition of an
llvm::Value. Keep track of already created value write access by using
a (dense) map.

Replace addValueWriteAccess by ensureValueStore which can be uses more
liberally without worrying to add redundant accesses. It will be used,
e.g. in a logical correspondant for value reads -- ensureValueReload --
to ensure that the expected definition has been written when loading it.

Differential Revision: http://reviews.llvm.org/D15483

llvm-svn: 258807

436db620

[DebugInfo] Fix DWARFDebugFrame instruction operand ordering · 0e1605a3

Igor Laevsky authored Jan 26, 2016

We can't rely on the evalution order of function arguments.

Differential Revision: http://reviews.llvm.org/D16509

llvm-svn: 258806

0e1605a3

[OPENMP 4.5] Allow arrays in 'reduction' clause. · 1189bd02

Alexey Bataev authored Jan 26, 2016

OpenMP 4.5, alogn with array sections, allows to use variables of array type in reductions.

llvm-svn: 258804

1189bd02

[FIX] Domain generation error due to loops in non-affine regions · 6f50c29a
Johannes Doerfert authored Jan 26, 2016
```
llvm-svn: 258803
```
6f50c29a
[FIX] Build correct domain for non-affine region SCoPs · 432658d7
Johannes Doerfert authored Jan 26, 2016
```
llvm-svn: 258802
```
432658d7

Fix crashing on user-defined conversion. · dc84150e

Alexander Kornienko authored Jan 26, 2016

Summary: Fix the assertion failure for the user-defined conversion method. e.g.: operator bool()

Reviewers: alexfh, aaron.ballman

Subscribers: aaron.ballman, cfe-commits

Patch by Cong Liu!

Differential Revision: http://reviews.llvm.org/D16536

llvm-svn: 258801

dc84150e

[RenderScript] Provide option to specify a single allocation to print · b649b005

Ewan Crawford authored Jan 26, 2016

Patch replaces the 'renderscript allocation list' command flag --refresh, with a new option --id <ID>.
This new option only prints the details of a single allocation with a given id, rather than printing all the allocations.
Functionality from the removed '--refresh' flag will be moved into its own command in a subsequent commit.

llvm-svn: 258800

b649b005

BlockGenerators: Replace getNewScalarValue with getNewValue · f2cdd144

Tobias Grosser authored Jan 26, 2016

Both functions implement the same functionality, with the difference that
getNewScalarValue assumes that globals and out-of-scop scalars can be directly
reused without loading them from their corresponding stack slot. This is correct
for sequential code generation, but causes issues with outlining code e.g. for
OpenMP code generation. getNewValue handles such cases correctly.

Hence, we can replace getNewScalarValue with getNewValue. This is not only more
future proof, but also eliminates a bunch of code.

The only functionality that was available in getNewScalarValue that is lost
is the on-demand creation of scalar values. However, this is not necessary any
more as scalars are always loaded at the beginning of each basic block and will
consequently always be available when scalar stores are generated. As this was
not the case in older versions of Polly, it seems the on-demand loading is just
some older code that has not yet been removed.

Finally, generateScalarLoads also generated loads for values that are loop
invariant, available in GlobalMap and which are preferred over the ones loaded
in generateScalarLoads. Hence, we can just skip the code generation of such
scalar values, avoiding the generation of dead code.

Differential Revision: http://reviews.llvm.org/D16522

llvm-svn: 258799

f2cdd144

[X86][SSE] Add zero element and general 64-bit VZEXT_LOAD support to EltsFromConsecutiveLoads · 46696ef9

Simon Pilgrim authored Jan 26, 2016

This patch adds support for trailing zero elements to VZEXT_LOAD loads (and checks that no zero elts occur within the consecutive load).

It also generalizes the 64-bit VZEXT_LOAD load matching to work for loads other than 2x32-bit loads.

After this patch it will also be easier to add support for other basic load patterns like 32-bit VZEXT_LOAD loads, PMOVZX and subvector load insertion.

Differential Revision: http://reviews.llvm.org/D16217

llvm-svn: 258798

46696ef9

Fix compilations with msvc's /Zc:strictStrings · c9655d9b
Ismail Donmez authored Jan 26, 2016
```
llvm-svn: 258797
```
c9655d9b
Simplify. NFC. · 231b5e23
Rui Ueyama authored Jan 26, 2016
```
llvm-svn: 258796
```
231b5e23
Simplify. NFC. · 3ae28a47
Rui Ueyama authored Jan 26, 2016
```
llvm-svn: 258795
```
3ae28a47
AMDGPU: Add amdgcn cube builtins · cf70cb9d
Matt Arsenault authored Jan 26, 2016
```
llvm-svn: 258794
```
cf70cb9d

[X86] Mark LDS/LES as not being allowed in 64-bit mode. · b9c932f2

Craig Topper authored Jan 26, 2016

Their opcodes are used as part of the VEX prefix in 64-bit mode. Clearly the disassembler implicitly decoded them as AVX instructions in 64-bit mode, but I think the AsmParser would have encoded them.

llvm-svn: 258793

b9c932f2

Simplify. NFC. · d6cea14c

Rui Ueyama authored Jan 26, 2016

This new code should be logically equivalent to the previous code.

llvm-svn: 258792

d6cea14c

Reverting r258759 as it is breaking the OSX build · dd54a3a8
Enrico Granata authored Jan 26, 2016
```
llvm-svn: 258791
```
dd54a3a8
AMDGPU: Move AMDGPU intrinsics only used by R600 · bee7575e
Matt Arsenault authored Jan 26, 2016
```
llvm-svn: 258790
```
bee7575e

AMDGPU: Tidy minor td file issues · 382d945d

Matt Arsenault authored Jan 26, 2016

Make comments and indentation more consistent.

Rearrange a few things to be in a more consistent order,
such as organizing subtarget features from those describing
an actual device property, and those used as options.

llvm-svn: 258789

382d945d

AMDGPU: Make v32i8/v64i8 illegal types · c5f61529

Matt Arsenault authored Jan 26, 2016

Old intrinsics were forcing these, but they have now all
been removed. This fixes large i8 vector operations generally
being broken.

llvm-svn: 258788

c5f61529

AMDGPU: Remove old sample intrinsics · 018179fc

Matt Arsenault authored Jan 26, 2016

I did my best to try to update all the uses in tests that
just happened to use the old ones to the newer intrinsics.

I'm not sure I got all of the immediate operand conversions
correct, since the value seems to have been ignored by the
old pattern but I don't think it really matters.

llvm-svn: 258787

018179fc

AMDGPU: Add new amdgcn intrinsics for cube instructions · 051d6f9f

Matt Arsenault authored Jan 26, 2016

More cleanup to try to get all intrinsics using the correct
amdgcn prefix that are as close to the instruction as possible.

llvm-svn: 258786

051d6f9f

AMDGPU: Implement read_register and write_register intrinsics · 9a10cea7

Matt Arsenault authored Jan 26, 2016

Some of the special intrinsics now that now correspond to a instruction
also have special setting of some registers, e.g. llvm.SI.sendmsg sets
m0 as well as use s_sendmsg. Using these explicit register intrinsics
may be a better option.

Reading the exec mask and others may be useful for debugging. For this
I'm not sure this is entirely correct because we would want this to
be convergent, although it's possible this is already treated
sufficently conservatively.

llvm-svn: 258785

9a10cea7

AMDGPU: Note mesa version in release notes · cee02ccc
Matt Arsenault authored Jan 26, 2016
```
llvm-svn: 258784
```
cee02ccc
AMDGPU: Restore AMDGPU prefixed rsq intrinsic for now · 0c3e2338
Matt Arsenault authored Jan 26, 2016
```
Also move into backend intrinsics to discourage use of the old name.

llvm-svn: 258783
```
0c3e2338

Recommit: R258773 [OpenCL] Pipe builtin functions · bb4d8d30

Xiuli Pan authored Jan 26, 2016

Fix arc patch fuzz error.
Summary:
Support for the pipe built-in functions for OpenCL 2.0.
The pipe builtin functions may have infinite kinds of element types, one approach
would be to just generate calls that would always use generic types such as void*.
This patch is based on bader's opencl support patch on SPIR-V branch.

Reviewers: Anastasia, pekka.jaaskelainen

Subscribers: keryell, bader, cfe-commits

Differential Revision: http://reviews.llvm.org/D15914

llvm-svn: 258782

bb4d8d30

[WebAssembly] Optimize memcpy/memmove/memcpy calls. · bdf08d5d

Dan Gohman authored Jan 26, 2016

These calls return their first argument, but because LLVM uses an intrinsic
with a void return type, they can't use the returned attribute. Generalize
the store results pass to optimize these calls too.

llvm-svn: 258781

bdf08d5d