Commits · 908c8b37c2be23f86fb6d13aeb59598e302eb5b3 · Lorenzo Albano / LLVM bpEVL

Sep 15, 2017

[X86] PR32755 : Improvement in CodeGen instruction selection for LEAs. · 908c8b37

Jatin Bhateja authored Sep 15, 2017

Summary:
   1/  Operand folding during complex pattern matching for LEAs has been
       extended, such that it promotes Scale to accommodate similar operand
       appearing in the DAG.
       e.g.
          T1 = A + B
          T2 = T1 + 10
          T3 = T2 + A
       For above DAG rooted at T3, X86AddressMode will no look like
          Base = B , Index = A , Scale = 2 , Disp = 10

   2/  During OptimizeLEAPass down the pipeline factorization is now performed over LEAs
       so that if there is an opportunity then complex LEAs (having 3 operands)
       could be factored out.
       e.g.
          leal 1(%rax,%rcx,1), %rdx
          leal 1(%rax,%rcx,2), %rcx
       will be factored as following
          leal 1(%rax,%rcx,1), %rdx
          leal (%rdx,%rcx)   , %edx

   3/ Aggressive operand folding for AM based selection for LEAs is sensitive to loops,
      thus avoiding creation of any complex LEAs within a loop.

Reviewers: lsaba, RKSimon, craig.topper, qcolombet

Reviewed By: lsaba

Subscribers: spatel, igorb, llvm-commits

Differential Revision: https://reviews.llvm.org/D35014

llvm-svn: 313343

908c8b37

[codeview] Use a type index of zero for static method "this" types · 87288b98
Reid Kleckner authored Sep 15, 2017
```
Otherwise VS won't show anything in the autos or watch window of static
methods.

llvm-svn: 313329
```
87288b98

Sep 14, 2017

Add AddresSpace to PseudoSourceValue. · 312ccf76
Jan Sjodin authored Sep 14, 2017
```
Differential Revision: https://reviews.llvm.org/D35089

llvm-svn: 313297
```
312ccf76

Remove usages of deprecated std::unary_function and std::binary_function. · 591aac7c

Benjamin Kramer authored Sep 14, 2017

These are removed in C++17. We still have some users of
unary_function::argument_type, so just spell that typedef out. No
functionality change intended.

Note that many of the argument types are actually wrong :)

llvm-svn: 313287

591aac7c

TableGen support for parameterized register class information · 779d98e1

Krzysztof Parzyszek authored Sep 14, 2017

This replaces TableGen's type inference to operate on parameterized
types instead of MVTs, and as a consequence, some interfaces have
changed:
- Uses of MVTs are replaced by ValueTypeByHwMode.
- EEVT::TypeSet is replaced by TypeSetByHwMode.

This affects the way that types and type sets are printed, and the
tests relying on that have been updated.

There are certain users of the inferred types outside of TableGen
itself, namely FastISel and GlobalISel. For those users, the way
that the types are accessed have changed. For typical scenarios,
these replacements can be used:
- TreePatternNode::getType(ResNo) -> getSimpleType(ResNo)
- TreePatternNode::hasTypeSet(ResNo) -> hasConcreteType(ResNo)
- TypeSet::isConcrete -> TypeSetByHwMode::isValueTypeByHwMode(false)

For more information, please refer to the review page.

Differential Revision: https://reviews.llvm.org/D31951

llvm-svn: 313271

779d98e1

[IfConversion] More simple, correct dead/kill liveness handling · 6ca02b25
Krzysztof Parzyszek authored Sep 14, 2017
```
Patch by Jesper Antonsson.

Differential Revision: https://reviews.llvm.org/D37611

llvm-svn: 313268
```
6ca02b25

[DAGCombine] (shl (or x, c1), c2) -> (or (shl x, c2), c1 << c2) · 8bd2d878

Simon Pilgrim authored Sep 14, 2017

We already have a combine for this pattern when the input to shl is add, so we just need to enable the transformation when the input is or.

Original patch by @tstellar

Differential Revision: https://reviews.llvm.org/D19325

llvm-svn: 313251

8bd2d878

[SelectionDAG] ComputeNumSignBits - cleanup ROTL/ROTR wrapping to match DAGCombine etc. · 523483e0

Simon Pilgrim authored Sep 14, 2017

Use RotAmt.urem(VTBits) instead of AND(RotAmt, VTBits - 1)

TBH I don't expect non-power-of-2 types to be created, but it makes the logic clearer and matches what we do in other rotation combines.

llvm-svn: 313245

523483e0

[XRay][CodeGen] Use the current function symbol as the associated symbol for... · 01fd7c8b

Dean Michael Berris authored Sep 14, 2017

[XRay][CodeGen] Use the current function symbol as the associated symbol for the instrumentation map

Summary:
XRay had been assuming that the previous section is the "text" section
of the function when lowering the instrumentation map. Unfortunately
this is not a safe assumption, because we may be coming from lowering
debug type information for the function being lowered.

This fixes an issue with combining -gsplit-dwarf, -generate-type-units,
-debug-compile and -fxray-instrument for sole member functions. When the
split dwarf section is stripped, we're left with references from the
xray_instr_map to the debug section. The change now uses the function's
symbol instead of the previous section's start symbol.

We found the bug while attempting to strip the split debug sections off
an XRay-instrumented object file, which had a peculiar edge-case for
single-function classes where the single function is being lowered.
Because XRay had assocaited the instrumentation map for a function to
the debug types section instead of the function's section, the objcopy
call will fail due to the misplaced reference from the xray_instr_map
section.

Reviewers: pcc, dblaikie, echristo

Subscribers: llvm-commits, aprantl

Differential Revision: https://reviews.llvm.org/D37791

llvm-svn: 313233

01fd7c8b

[codeview] Fold FIXME into comment, there's nothing to do. NFC · cd7bba02
Reid Kleckner authored Sep 13, 2017
```
llvm-svn: 313214
```
cd7bba02

Revert r312719 "[MachineCombiner] Update instruction depths incrementally for large BBs." · 06e2a384

Hans Wennborg authored Sep 13, 2017

This caused PR34596.

> [MachineCombiner] Update instruction depths incrementally for large BBs.
>
> Summary:
> For large basic blocks with lots of combinable instructions, the
> MachineTraceMetrics computations in MachineCombiner can dominate the compile
> time, as computing the trace information is quadratic in the number of
> instructions in a BB and it's relevant successors/predecessors.
>
> In most cases, knowing the instruction depth should be enough to make
> combination decisions. As we already iterate over all instructions in a basic
> block, the instruction depth can be computed incrementally. This reduces the
> cost of machine-combine drastically in cases where lots of instructions
> are combined. The major drawback is that AFAIK, computing the critical path
> length cannot be done incrementally. Therefore we only compute
> instruction depths incrementally, for basic blocks with more
> instructions than inc_threshold. The -machine-combiner-inc-threshold
> option can be used to set the threshold and allows for easier
> experimenting and checking if using incremental updates for all basic
> blocks has any impact on the performance.
>
> Reviewers: sanjoy, Gerolf, MatzeB, efriedma, fhahn
>
> Reviewed By: fhahn
>
> Subscribers: kiranchandramohan, javed.absar, efriedma, llvm-commits
>
> Differential Revision: https://reviews.llvm.org/D36619

llvm-svn: 313213

06e2a384

Allow target to decide when to cluster loads/stores in misched · 7fe9a5d9

Stanislav Mekhanoshin authored Sep 13, 2017

MachineScheduler when clustering loads or stores checks if base
pointers point to the same memory. This check is done through
comparison of base registers of two memory instructions. This
works fine when instructions have separate offset operand. If
they require a full calculated pointer such instructions can
never be clustered according to such logic.

Changed shouldClusterMemOps to accept base registers as well and
let it decide what to do about it.

Differential Revision: https://reviews.llvm.org/D37698

llvm-svn: 313208

7fe9a5d9

Sep 13, 2017

[codeview] VLAs and unsized arrays should use a size of zero · 89af112c

Reid Kleckner authored Sep 13, 2017

Previously we used a size of '1' for VLAs because we weren't sure what
MSVC did. However, MSVC does support declaring an array without a size,
for which it emits an array type with a size of zero. Clang emits the
same DI metadata for VLAs and arrays without bound, so we would describe
arrays without bound as having one element. This lead to Microsoft
debuggers only printing a single element.

Emitting a size of zero appears to cause these debuggers to search the
symbol information to find a definition of the variable with accurate
array bounds.

Fixes http://crbug.com/763580

llvm-svn: 313203

89af112c

[RegAlloc] Keep a copy of live interval for the spilled vregs in HoistSpillHelper. · c0d06646

Wei Mi authored Sep 13, 2017

This is to fix PR34502. After rL311401, the live range of spilled vreg will be
cleared. HoistSpill need to use the live range of the original vreg before splitting
to know the moving range of the spills. The patch saves a copy of live interval for
the spilled vreg inside of HoistSpillHelper.

Differential Revision: https://reviews.llvm.org/D37578

llvm-svn: 313197

c0d06646

[CodeGen] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). · 618c555b
Eugene Zelenko authored Sep 13, 2017
```
llvm-svn: 313194
```
618c555b

Mark static member functions as static in CodeViewDebug · d91bf399

Adrian McCarthy authored Sep 13, 2017

Summary:
To improve CodeView quality for static member functions, we need to make the
static explicit.  In addition to a small change in LLVM's CodeViewDebug to
return the appropriate MethodKind, this requires a small change in Clang to
note the staticness in the debug info metadata.

Subscribers: aprantl, hiraditya

Differential Revision: https://reviews.llvm.org/D37715

llvm-svn: 313192

d91bf399

[MachineScheduler] Put SchedRegion in an anonymous namespace. · 4eb2a96e

Mikael Holmen authored Sep 13, 2017

Summary: It pollutes the global namespace otherwise.

Patch by: Bevin Hansson

Reviewers: jonpa

Reviewed By: jonpa

Subscribers: MatzeB, javed.absar, llvm-commits

Differential Revision: https://reviews.llvm.org/D37555

llvm-svn: 313148

4eb2a96e

Sep 12, 2017

Remove -generate-dwarf-pub-sections flag. · 876da029

Peter Collingbourne authored Sep 12, 2017

This flag is unnecessary for testing because we can get the coverage
we need by adjusting CU attributes.

Differential Revision: https://reviews.llvm.org/D37725

llvm-svn: 313079

876da029

IR: Represent -ggnu-pubnames with a flag on the DICompileUnit. · b52e2366

Peter Collingbourne authored Sep 12, 2017

This allows the flag to be persisted through to LTO.

Differential Revision: https://reviews.llvm.org/D37655

llvm-svn: 313078

b52e2366

Update branch coalescing to be a PowerPC specific pass · 34e66217

Lei Huang authored Sep 12, 2017

Implementing this pass as a PowerPC specific pass. Branch coalescing utilizes
the analyzeBranch method which currently does not include any implicit operands.
This is not an issue on PPC but must be handled on other targets.

Pass is currently off by default. Enabled via -enable-ppc-branch-coalesce.

Differential Revision : https: // reviews.llvm.org/D32776

llvm-svn: 313061

34e66217

[WebAssembly] Remove flags from MCSectionWasm · 2176a9f2

Sam Clegg authored Sep 12, 2017

Looks like these were copied from the ELF sections but
don't apply to Wasm and were not used anywhere.

Also remove unused Wasm methods in MCContext.

Differential Revision: https://reviews.llvm.org/D37633

llvm-svn: 313058

2176a9f2

Revert "[DWARF] Incorrect prologue end line record." · 51529eb0

Robert Lougher authored Sep 12, 2017

This reverts commit r313047 as it is causing buildbot failure (lldb inline
stepping tests).

llvm-svn: 313057

51529eb0

[DWARF] Incorrect prologue end line record. · f696a22d

Robert Lougher authored Sep 12, 2017

A prologue-end line record is emitted with an incorrect associated address,
which causes a debugger to show the beginning of function body to be inside
the prologue.

Patch written by Carlos Alberto Enciso.

Differential Revision: https://reviews.llvm.org/D37625

llvm-svn: 313047

f696a22d

[CodeGen] Fix some Clang-tidy modernize-use-using and Include What You Use... · 32a40564
Eugene Zelenko authored Sep 11, 2017
```
[CodeGen] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC).

llvm-svn: 312971
```
32a40564

Sep 11, 2017

Unmerge GEPs to reduce register pressure on IndirectBr edges. · 9364432c

Hiroshi Yamauchi authored Sep 11, 2017

Summary:
GEP merging can sometimes increase the number of live values and register
pressure across control edges and cause performance problems particularly if the
increased register pressure results in spills.

This change implements GEP unmerging around an IndirectBr in certain cases to
mitigate the issue. This is in the CodeGenPrepare pass (after all the GEP
merging has happened.)

With this patch, the Python interpreter loop runs faster by ~5%.

Reviewers: sanjoy, hfinkel

Reviewed By: hfinkel

Subscribers: eastig, junbuml, llvm-commits

Differential Revision: https://reviews.llvm.org/D36772

llvm-svn: 312930

9364432c

[SelectionDAG] Remove a check for type being a vector type after calling getShiftAmountTy. NFCI · 8dff57a0
Craig Topper authored Sep 11, 2017
```
getShiftAmountTy already returns the vector type when called for vectors.

llvm-svn: 312924
```
8dff57a0
Fix typo · eb4474cf
Matt Arsenault authored Sep 11, 2017
```
llvm-svn: 312919
```
eb4474cf

Fixed a bug in splitting Scatter operation in the Type Legalizer. · cc477bbc

Elena Demikhovsky authored Sep 11, 2017

After the split of the Scatter operation, the order of the new instructions is well defined - Lo goes before Hi. Otherwise the semantic of Scatter (from LSB to MSB) is broken.
I'm chaining 2 nodes to prevent reordering.

Differential Revision https://reviews.llvm.org/D37670

llvm-svn: 312894

cc477bbc

Sep 09, 2017
- RegAllocFast: Fix warning; NFC · 6b2b88b0
  Matthias Braun authored Sep 09, 2017
```
llvm-svn: 312852
```
  6b2b88b0
- RegAllocFast: Cleanup; NFC · 864cf585
  Matthias Braun authored Sep 09, 2017
```
- Use range based for
- Variable names should start with upper case
- Add `const`
- Change class name to match filename
- Fix doxygen comments
- Use MCPhysReg instead of unsigned
- Use references instead of pointers where things cannot be nullptr
- Misc coding style improvements

llvm-svn: 312846
```
  864cf585
- RegAllocFast: Move vector to class level to avoid reallocation; NFC · a09d18de
  Matthias Braun authored Sep 09, 2017
```
llvm-svn: 312845
```
  a09d18de
- RegAllocFast: Remove write-only set; NFC · a5225e8c
  Matthias Braun authored Sep 09, 2017
```
llvm-svn: 312844
```
  a5225e8c
Sep 08, 2017

Fix a bug for rL312641. · 5d84d9b3

Wei Mi authored Sep 08, 2017

rL312641 Allowed llvm.memcpy/memset/memmove to be tail calls when parent
function return the intrinsics's first argument. However on arm-none-eabi
platform, llvm.memcpy will be expanded to __aeabi_memcpy which doesn't
have return value. The fix is to check the libcall name after expansion
to match "memcpy/memset/memmove" before allowing those intrinsic to be
tail calls.

llvm-svn: 312799

5d84d9b3

Preserve existing regs when adding pristines to LivePhysRegs/LiveRegUnits · f78eca8f
Krzysztof Parzyszek authored Sep 08, 2017
```
Differential Revision: https://reviews.llvm.org/D37600

llvm-svn: 312797
```
f78eca8f

Fix a crash when emitting debug info for multi-reg function arguments · 99ba9772

Adrian Prantl authored Sep 08, 2017

by reusing more of the existing machinery

This is a follow-up to r312169.
Thanks to Björn Pettersson for the testcase!

llvm-svn: 312773

99ba9772

[XRay][CodeGen][PowerPC] Fix tail exit codegen for XRay in PPC · 711dec26

Dean Michael Berris authored Sep 08, 2017

Summary:
This fixes code-gen for XRay in PPC. The regression wasn't caught by
codegen tests  which we add in this change.

What happened was the following:

- For tail exits, we used to unconditionally prepend the returns/exits
  with a pseudo-instruction that gets lowered to the instrumentation
  sled (and leave the actual return/exit instruction as-is).
- Changes to the XRay instrumentation pass caused the tail exits to
  suddenly also emit the tail exit pseudo-instruction, since the check
  for whether a return instruction was also a call instruction meant it
  was a tail exit instruction.
- None of the tests caught the regression either due to non-existent
  tests, or the tests being disabled/removed for continuous breakage.

This change re-introduces some of the basic tests and verifies that
we're back to a state that allows the back-end to generate appropriate
XRay instrumented binaries for PPC in the presence of tail exits.

Reviewers: echristo, timshen

Subscribers: nemanjai, kbarton, llvm-commits

Differential Revision: https://reviews.llvm.org/D37570

llvm-svn: 312772

711dec26

Sink some IntrinsicInst.h and Intrinsics.h out of llvm/include · 0e8c4bb0

Reid Kleckner authored Sep 07, 2017

Many of these uses can get by with forward declarations. Hopefully this
speeds up compilation after adding a single intrinsic.

llvm-svn: 312759

0e8c4bb0

Revert r312318, r312325, r312424, r312489 · c7828ebe

Richard Trieu authored Sep 07, 2017

r312318 - Debug info for variables whose type is shrinked to bool
r312325, r312424, r312489 - Test case for r312318

Revision 312318 introduced a null dereference bug.
Details in https://bugs.llvm.org/show_bug.cgi?id=34490

llvm-svn: 312758

c7828ebe

[DWARF] Line 0 should not have a discriminator. · bb921370

Paul Robinson authored Sep 07, 2017

It's meaningless and takes up extra space in the line table.

Differential Revision: https://reviews.llvm.org/D37364

llvm-svn: 312751

bb921370

Sep 07, 2017

DAG: Allow creating extract_vector_elt post-legalize · 61ec738b

Matt Arsenault authored Sep 07, 2017

Fixes some combine issues for AMDGPU where we weren't
getting the many extract_vector_elt combines expected
in a future patch.

This should really be checking isOperationLegalOrCustom on
the extract. That improves a number of x86 lit tests, but
a few get stuck in an infinite loop from one place
where a similar looking extract is created. I have a
different workaround in the backend for that which
keeps many of those improvements, but also adds a few
regressions.

llvm-svn: 312730

61ec738b