Commits · fcf01e6e5f856fc7f1deee1a3ff4a85c6b2f4220 · Roger Ferrer / llvm-epi

Dec 20, 2018

Add PLATFORM constants for iOS, tvOS, and watchOS simulators · f44d830e

Michael Trent authored Dec 20, 2018

Summary:
Add PLATFORM constants for iOS, tvOS, and watchOS simulators, as well
as human readable names for these constants, to the Mach-O file format
header files. 

rdar://46854119

Reviewers: ab, davide

Reviewed By: ab, davide

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D55905

llvm-svn: 349779

f44d830e

[BPF] Disable relocation for .BTF.ext section · 821c93d5

Yonghong Song authored Dec 20, 2018



Build llvm with assertion on, and then build bcc against this llvm.
Run any bcc tool with debug=8 (turning on -g for clang compilation),
you will get the following assertion errors,
  /home/yhs/work/llvm/lib/ExecutionEngine/RuntimeDyld/RuntimeDyldELF.cpp:888:
  void llvm::RuntimeDyldELF::resolveBPFRelocation(const llvm::SectionEntry&, uint64_t,
    uint64_t, uint32_t, int64_t): Assertion `Value <= (4294967295U)' failed.

The .BTF.ext ELF section uses Fixup's to get the instruction
offsets. The data width of the Fixup is 4 bytes since we only need
the insn offset within the section.

This caused the above error though since R_BPF_64_32 expects
4-byte value and the Runtime Dyld tried to resolve the actual
insn address which is 8 bytes.

Actually the offset within the section is all what we need.
Therefore, there is no need to perform any kind of relocation
for .BTF.ext section and such relocation will actually cause
incorrect result.

This patch changed BPFELFObjectWriter::getRelocType() such that
for Fixup Kind FK_Data_4, if the relocation Target is a temporary
symbol, let us skip the relocation (ELF::R_BPF_NONE).

Acked-by: Alexei Starovoitov <ast@kernel.org>
Signed-off-by: Yonghong Song <yhs@fb.com>
llvm-svn: 349778

821c93d5

· b17464e4

Brock Wyma authored Dec 20, 2018

[CodeView] Emit global variables within lexical scopes to limit visibility

Emit static locals within the correct lexical scope so variables with the same
name will not confuse the debugger into getting the wrong value.

Differential Revision: https://reviews.llvm.org/D55336

llvm-svn: 349777

b17464e4

[InstCombine] Preserve access-group metadata. · 19942710

Michael Kruse authored Dec 20, 2018

Preserve llvm.access.group metadata when combining store instructions.
This was forgotten in r349725.

Fixes llvm.org/PR40117

llvm-svn: 349774

19942710

[x86] add test to show missed movddup load fold; NFC · 18b008b5
Sanjay Patel authored Dec 20, 2018
```
llvm-svn: 349773
```
18b008b5
Test commit · 388bb86d
Amilendra Kodithuwakku authored Dec 20, 2018
```
Fix a simple typo.

llvm-svn: 349771
```
388bb86d
[Hexagon] Add patterns for funnel shifts · 30c42e2a
Krzysztof Parzyszek authored Dec 20, 2018
```
llvm-svn: 349770
```
30c42e2a

[SelectionDAGBuilder] Enable funnel shift building to custom rotates · b208255f

Simon Pilgrim authored Dec 20, 2018

This patch enables funnel shift -> rotate building for all ROTL/ROTR custom/legal operations.

AFAICT X86 was the last target that was missing modulo support (PR38243), but I've tried to CC stakeholders for every target that has ROTL/ROTR custom handling for their final OK.

Differential Revision: https://reviews.llvm.org/D55747

llvm-svn: 349765

b208255f

[RISCV] Properly evaluate fixup_riscv_pcrel_lo12 · eb3a64a4

Alex Bradbury authored Dec 20, 2018

This is a update to D43157 to correctly handle fixup_riscv_pcrel_lo12.

Notable changes:

Rebased onto trunk
Handle and test S-type
Test case pcrel-hilo.s is merged into relocations.s

D43157 description:
VK_RISCV_PCREL_LO has to be handled specially. The MCExpr inside is
actually the location of an auipc instruction with a VK_RISCV_PCREL_HI fixup
pointing to the real target.

Differential Revision: https://reviews.llvm.org/D54029
Patch by Chih-Mao Chen and Michael Spencer.

llvm-svn: 349764

eb3a64a4

[X86][AVX512] Don't custom lower v16i8 rotations. · 09c08117

Simon Pilgrim authored Dec 20, 2018

As discussed on D55747, the expansion to (wider) shifts is better on all AVX512 cases, not just BWI.

llvm-svn: 349763

09c08117

[SystemZ] "Generic" vector assembler instructions shoud clobber CC · 380bece7

Ulrich Weigand authored Dec 20, 2018

There are several vector instructions which may or may not set the
condition code register, depending on the value of an argument.

For codegen, we use two versions of the instruction, one that sets
CC and one that doesn't, which hard-code appropriate values of that
argument.  But we also have a "generic" version of the instruction
that is used for the assembler/disassembler.  These generic versions
should always be considered to clobber CC just to be safe.

llvm-svn: 349761

380bece7

Fix gcc7 -Wdangling-else warning. NFCI. · 84980f42
Simon Pilgrim authored Dec 20, 2018
```
llvm-svn: 349760
```
84980f42

[InstCombine] Make x86 PADDS/PSUBS constant folding tests generic · 8c4abd20

Simon Pilgrim authored Dec 20, 2018

As discussed on D55894, this replaces the existing PADDS/PSUBUS intrinsics with the the sadd/ssub.sat generic intrinsics and moves the tests out of the x86 subfolder.

PR40110 has been raised to fix the regression with constant folding vectors containing undef elements.

llvm-svn: 349759

8c4abd20

[gn build] Add build files for clang/lib/{Analysis,Edit,Sema} · 5f31de22
Nico Weber authored Dec 20, 2018
```
Differential Revision: https://reviews.llvm.org/D55913

llvm-svn: 349757
```
5f31de22
[gn build] Add build files for clang/lib/Lex and clang/lib/AST · 89a2a9f8
Nico Weber authored Dec 20, 2018
```
Differential Revision: https://reviews.llvm.org/D55912

llvm-svn: 349756
```
89a2a9f8

[SystemZ] Make better use of VLLEZ · 44d37ae3

Ulrich Weigand authored Dec 20, 2018

This patch fixes two deficiencies in current code that recognizes
the VLLEZ idiom:

- For the floating-point versions, we have ISel patterns that match
  on a bitconvert as the top node.  In more complex cases, that
  bitconvert may already have been merged into something else.
  Fix the patterns to match the inner nodes instead.

- For the 64-bit integer versions, depending on the surrounding code,
  we may get either a DAG tree based on JOIN_DWORDS or one based on
  INSERT_VECTOR_ELT.  Use a PatFrags to simply match both variants.

llvm-svn: 349749

44d37ae3

[SystemZ] Make better use of VGEF/VGEG · 8bb46b0f

Ulrich Weigand authored Dec 20, 2018

Current code in SystemZDAGToDAGISel::tryGather refuses to perform
any transformation if the Load SDNode has more than one use.  This
(erronously) counts uses of the chain result, which prevents the
optimization in many cases unnecessarily.  Fixed by this patch.

llvm-svn: 349748

8bb46b0f

Re-land r349731 "[CodeGen][ExpandMemcmp] Add an option for allowing overlapping loads. · 36a34803
Clement Courbet authored Dec 20, 2018
```
Update PPC ir following GEP->bitcat to bitcat->GEP->bitcat change.

llvm-svn: 349747
```
36a34803

[SystemZ] Make better use of VLDEB · f43b5100

Ulrich Weigand authored Dec 20, 2018

We already have special code (DAG combine support for FP_ROUND)
to recognize cases where we an use a vector version of VLEDB to
perform two floating-point truncates in parallel, but equivalent
support for VLEDB (vector floating-point extends) has been
missing so far.  This patch adds corresponding DAG combine
support for FP_EXTEND.

llvm-svn: 349746

f43b5100

[X86][SSE] Auto upgrade PADDS/PSUBS intrinsics to SADD_SAT/SSUB_SAT generic intrinsics (llvm) · 6bbf39b4
Simon Pilgrim authored Dec 20, 2018
```
Pulled out of D55894 to match the clang changes in D55890.

Differential Revision: https://reviews.llvm.org/D55890

llvm-svn: 349744
```
6bbf39b4
[X86] Update PADDSW/PSUBSW intrinsic usage with generic saturated intrinsics. · e85ad60e
Simon Pilgrim authored Dec 20, 2018
```
As discussed on D55894, this makes no difference to the actual test.

llvm-svn: 349742
```
e85ad60e
[llvm-objcopy] Use ELFOSABI_NONE instead of 0. NFC. · 3ac20a92
George Rimar authored Dec 20, 2018
```
This was requested during the review of D55886.
(sorry, forgot to address this)

llvm-svn: 349741
```
3ac20a92

[X86] Change 'simple nonmem' intrinsic test to not use PADDSW · 6199784c

Simon Pilgrim authored Dec 20, 2018

Those intrinsics will be autoupgraded soon to @llvm.sadd.sat generics (D55894), so to keep a x86-specific case I'm replacing it with @llvm.x86.sse2.pmulhu.w

llvm-svn: 349739

6199784c

[llvm-objcopy] - Do not drop the OS/ABI and ABIVersion fields of ELF header · 4ded7733

George Rimar authored Dec 20, 2018

This is https://bugs.llvm.org/show_bug.cgi?id=40005,

Patch teaches llvm-objcopy to preserve OS/ABI and ABIVersion fields of ELF header.
(Currently, it drops them to zero).

Differential revision: https://reviews.llvm.org/D55886

llvm-svn: 349738

4ded7733

[yaml2obj/obj2yaml] - Support dumping/parsing ABI version. · 6367d7a6

George Rimar authored Dec 20, 2018

These tools were assuming ABI version is 0,
that is not always true.

Patch teaches them to work with that field.

Differential revision: https://reviews.llvm.org/D55884

llvm-svn: 349737

6367d7a6

[InstCombine][AMDGPU] Handle more buffer intrinsics · deaacc17

Piotr Sobczak authored Dec 20, 2018

Summary:
Include the following intrinsics in the InsctCombine
simplification:

* amdgcn_raw_buffer_load
* amdgcn_raw_buffer_load_format
* amdgcn_struct_buffer_load
* amdgcn_struct_buffer_load_format

Change-Id: I14deceff74bcb21179baf6aa6e94bf39e7d63d5d

Reviewers: arsenm

Reviewed By: arsenm

Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, llvm-commits

Differential Revision: https://reviews.llvm.org/D55882

llvm-svn: 349735

deaacc17

[MSan] Don't emit __msan_instrument_asm_load() calls · 0e3b85a7

Alexander Potapenko authored Dec 20, 2018

LLVM treats void* pointers passed to assembly routines as pointers to
sized types.
We used to emit calls to __msan_instrument_asm_load() for every such
void*, which sometimes led to false positives.
A less error-prone (and truly "conservative") approach is to unpoison
only assembly output arguments.

llvm-svn: 349734

0e3b85a7

Revert r349731 "[CodeGen][ExpandMemcmp] Add an option for allowing overlapping loads." · e22cf4d7
Clement Courbet authored Dec 20, 2018
```
Forgot to update PowerPC tests for the GEP->bitcast change.

llvm-svn: 349733
```
e22cf4d7
[NFC] Fix trailing comma after function. · d4bd3eb8
Clement Courbet authored Dec 20, 2018
```
lib/Analysis/VectorUtils.cpp:482:2: warning: extra ‘;’ [-Wpedantic]

llvm-svn: 349732
```
d4bd3eb8

[CodeGen][ExpandMemcmp] Add an option for allowing overlapping loads. · 1bb6e1b0

Clement Courbet authored Dec 20, 2018

Summary:
This allows expanding {7,11,13,14,15,21,22,23,25,26,27,28,29,30,31}-byte memcmp
in just two loads on X86. These were previously calling memcmp.

Reviewers: spatel, gchatelet

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D55263

llvm-svn: 349731

1bb6e1b0

[HWASAN] Add support for memory intrinsics · 2d98eb1b
Eugene Leviant authored Dec 20, 2018
```
Differential revision: https://reviews.llvm.org/D55117

llvm-svn: 349728
```
2d98eb1b

[PowerPC] Implement the isSelectSupported() target hook · ca8db489

Kang Zhang authored Dec 20, 2018

Summary:
PowerPC has scalar selects (isel) and vector mask selects (xxsel). But PowerPC
does not have vector CR selects, PowerPC does not support scalar condition 
selects on vectors.
In addition to implementing this hook, isSelectSupported() should return false
when the SelectSupportKind is ScalarCondVectorVal, so that predictable selects
are converted into branch sequences.

Reviewed By: steven.zhang,  hfinkel

Differential Revision: https://reviews.llvm.org/D55754

llvm-svn: 349727

ca8db489

[DAGCombiner] Fix a place that was creating a SIGN_EXTEND with an extra operand. · bd788ce5
Craig Topper authored Dec 20, 2018
```
llvm-svn: 349726
```
bd788ce5

Introduce llvm.loop.parallel_accesses and llvm.access.group metadata. · 978ba615

Michael Kruse authored Dec 20, 2018

The current llvm.mem.parallel_loop_access metadata has a problem in that
it uses LoopIDs. LoopID unfortunately is not loop identifier. It is
neither unique (there's even a regression test assigning the some LoopID
to multiple loops; can otherwise happen if passes such as LoopVersioning
make copies of entire loops) nor persistent (every time a property is
removed/added from a LoopID's MDNode, it will also receive a new LoopID;
this happens e.g. when calling Loop::setLoopAlreadyUnrolled()).
Since most loop transformation passes change the loop attributes (even
if it just to mark that a loop should not be processed again as
llvm.loop.isvectorized does, for the versioned and unversioned loop),
the parallel access information is lost for any subsequent pass.

This patch unlinks LoopIDs and parallel accesses.
llvm.mem.parallel_loop_access metadata on instruction is replaced by
llvm.access.group metadata. llvm.access.group points to a distinct
MDNode with no operands (avoiding the problem to ever need to add/remove
operands), called "access group". Alternatively, it can point to a list
of access groups. The LoopID then has an attribute
llvm.loop.parallel_accesses with all the access groups that are parallel
(no dependencies carries by this loop).

This intentionally avoid any kind of "ID". Loops that are clones/have
their attributes modifies retain the llvm.loop.parallel_accesses
attribute. Access instructions that a cloned point to the same access
group. It is not necessary for each access to have it's own "ID" MDNode,
but those memory access instructions with the same behavior can be
grouped together.

The behavior of llvm.mem.parallel_loop_access is not changed by this
patch, but should be considered deprecated.

Differential Revision: https://reviews.llvm.org/D52116

llvm-svn: 349725

978ba615

[WebAssembly] Emit a splat for v128 IMPLICIT_DEF · feb18fe9

Thomas Lively authored Dec 20, 2018

Summary:
This is a code size savings and is also important to get runnable code
while engines do not support v128.const.

Reviewers: aheejin, dschuff

Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits

Differential Revision: https://reviews.llvm.org/D55910

llvm-svn: 349724

feb18fe9

Fix build errors introduced by r349712 on aarch64 bots. · 321bfb21
Amara Emerson authored Dec 20, 2018
```
llvm-svn: 349723
```
321bfb21

[WebAssembly] Gate unimplemented SIMD ops on flag · 8dbf29af

Thomas Lively authored Dec 20, 2018

Summary:
Gates v128.const, f32x4.sqrt, f32x4.div, i8x16.extract_lane_u, and
i16x8.extract_lane_u on the --wasm-enable-unimplemented-simd flag,
since these ops are not implemented yet in V8.

Reviewers: aheejin, dschuff

Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits

Differential Revision: https://reviews.llvm.org/D55904

llvm-svn: 349720

8dbf29af

AMDGPU: Make i1/i64/v2i32 and/or/xor legal · 43398837

Matt Arsenault authored Dec 20, 2018

The 64-bit types do depend on the register bank,
but that's another issue to deal with later.

llvm-svn: 349716

43398837

AMDGPU/GlobalISel: Fix ValueMapping tables for i1 · 8cc98bee

Matt Arsenault authored Dec 20, 2018

This was incorrectly selecting SGPR for any i1 values,
e.g. G_TRUNC to i1 from a VGPR was still an SGPR.

llvm-svn: 349715

8cc98bee

[X86] Disable custom widening of signed/unsigned add/sub saturation intrinsics... · 9ca2f560

Craig Topper authored Dec 20, 2018

[X86] Disable custom widening of signed/unsigned add/sub saturation intrinsics under -x86-experimental-vector-widening-legalization.

Generic legalization should take care of this.

llvm-svn: 349714

9ca2f560