Skip to content
  1. Jan 18, 2019
  2. Jan 17, 2019
    • Matt Arsenault's avatar
      AMDGPU: Convert tests away from llvm.SI.load.const · 456b93b4
      Matt Arsenault authored
      llvm-svn: 351494
      456b93b4
    • Vedant Kumar's avatar
      [HotColdSplit] Allow outlining with live outputs · f529b507
      Vedant Kumar authored
      Prior to r348205, extracting code regions with live output values was
      disabled because of a miscompilation (PR39433). Lift the restriction as
      PR39433 has been addressed.
      
      Tested on LNT+externals, on a run of check-llvm in a stage2 build, and
      with a full build of iOS (with hot/cold splitting enabled).
      
      As a drive-by, remove an errant TODO.
      
      llvm-svn: 351492
      f529b507
    • Vedant Kumar's avatar
      [HotColdSplit] Consider resume instructions to be cold · b70e20db
      Vedant Kumar authored
      Resuming exception unwinding is roughly as unlikely as throwing an
      exception.
      
      Tested on LNT+externals (in particular, the C++ EH regression tests
      provide end-to-end test coverage), as well as with a full build of iOS.
      
      llvm-svn: 351491
      b70e20db
    • Vladimir Stefanovic's avatar
      [mips] Emit .reloc R_{MICRO}MIPS_JALR along with j(al)r(c) $25 · 3daf8bc9
      Vladimir Stefanovic authored
      The callee address is added as an optional operand (MCSymbol) in
      AdjustInstrPostInstrSelection() and then used by asm printer to insert:
      '.reloc tmplabel, R_MIPS_JALR, symbol
      tmplabel:'.
      Controlled with '-mips-jalr-reloc', default is true.
      
      Differential revision: https://reviews.llvm.org/D56694
      
      llvm-svn: 351485
      3daf8bc9
    • Vedant Kumar's avatar
      [HotColdSplit] Relax requirement that the cold sink block be extractable · 4541be06
      Vedant Kumar authored
      Relaxing this requirement creates opportunities to split code dominated
      by an EH pad.
      
      Tested on LNT+externals.
      
      llvm-svn: 351483
      4541be06
    • Vedant Kumar's avatar
      [HotColdSplit] Simplify tests by lowering their splitting thresholds · 32a014d0
      Vedant Kumar authored
      This gets rid of the brittle/mysterious calls to @sink()/@sideeffect()
      peppered throughout the test cases. They are no longer needed to force
      splitting to occur.
      
      llvm-svn: 351480
      32a014d0
    • Wei Mi's avatar
      [SampleFDO] Skip profile reading when flattened profile used in ThinLTO postlink · 3bcccdfe
      Wei Mi authored
      If the sample profile has no inlining hierachy information included, we call
      the sample profile is flattened. For flattened profile, in ThinLTO postlink
      phase, SampleProfileLoader's hot function inlining and profile annotation will
      do nothing, so it is better to save the effort to read in the profile and run
      the sample profile loader pass. It is helpful for reducing compile time when
      the flattened profile is huge.
      
      Differential Revision: https://reviews.llvm.org/D54819
      
      llvm-svn: 351476
      3bcccdfe
    • Reid Kleckner's avatar
      [InstCombine] Don't sink dynamic allocas · edd653bc
      Reid Kleckner authored
      Summary:
      InstCombine's sinking algorithm only thinks about memory. It doesn't
      think about non-memory constraints like stack object lifetime. It can
      sink dynamic allocas across a stacksave call, which may be used with
      stackrestore, which can incorrectly reduce the lifetime of the dynamic
      alloca.
      
      Fixes PR40365
      
      Reviewers: hfinkel, efriedma
      
      Subscribers: hiraditya, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D56872
      
      llvm-svn: 351475
      edd653bc
    • Sanjin Sijaric's avatar
      Fix the buildbot failure introduced by r351404 · 4d145029
      Sanjin Sijaric authored
      EXPENSIVE_CHECKS buildbots are failing due to r351404.
      
      Add x1 as live in to the funclet basic block for SEH funclets, as well as
      -verify-machineinstrs to the test case that triggered the failure.
      
      llvm-svn: 351472
      4d145029
    • Wouter van Oortmerssen's avatar
      [WebAssembly] Fixing 2 more symbol offset related tests. · 73a0c240
      Wouter van Oortmerssen authored
      llvm-svn: 351465
      73a0c240
    • Jonas Devlieghere's avatar
      [Test] Fix debug-loc-0.mir with EXPENSIVE_CHECKS · 032ef2dc
      Jonas Devlieghere authored
      The `llc` invocation was missing `-start-before=machine-cp`.
      
      llvm-svn: 351464
      032ef2dc
    • Wouter van Oortmerssen's avatar
      [WebAssembly] Fixed objdump not parsing function headers. · f3b762a0
      Wouter van Oortmerssen authored
      Summary:
      objdump was interpreting the function header containing the locals
      declaration as instructions. To parse these without injecting target
      specific code in objdump, MCDisassembler::onSymbolStart was added to
      be implemented by the WebAssembly implemention.
      
      WasmObjectFile now returns a code offset for the "address" of a symbol,
      rather than the index. This is also more in-line with what other
      targets do.
      
      Also ensured that the AsmParser correctly puts each function
      in its own segment to enable this test case.
      
      Reviewers: sbc100, dschuff
      
      Subscribers: jgravelle-google, aheejin, sunfish, rupprecht, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D56684
      
      llvm-svn: 351460
      f3b762a0
    • Teresa Johnson's avatar
      Revert "[ThinLTO] Add summary entries for index-based WPD" · 8d86f1ba
      Teresa Johnson authored
      Mistaken commit of something still under review!
      
      This reverts commit r351453.
      
      llvm-svn: 351455
      8d86f1ba
    • Teresa Johnson's avatar
      Add -dump-input=always to cfi-devirt test to debug flake · 0be9960f
      Teresa Johnson authored
      To help diagnose flaky bot failures in PR40351.
      
      llvm-svn: 351454
      0be9960f
    • Teresa Johnson's avatar
      [ThinLTO] Add summary entries for index-based WPD · 4fcf3b16
      Teresa Johnson authored
      Summary:
      If LTOUnit splitting is disabled, the module summary analysis computes
      the summary information necessary to perform single implementation
      devirtualization during the thin link with the index and no IR. The
      information collected from the regular LTO IR in the current hybrid WPD
      algorithm is summarized, including:
      1) For vtable definitions, record the function pointers and their offset
      within the vtable initializer (subsumes the information collected from
      IR by tryFindVirtualCallTargets).
      2) A record for each type metadata summarizing the vtable definitions
      decorated with that metadata (subsumes the TypeIdentiferMap collected
      from IR).
      
      Also added are the necessary bitcode records, and the corresponding
      assembly support.
      
      The index-based WPD will be sent as a follow-on.
      
      Depends on D53890.
      
      Reviewers: pcc
      
      Subscribers: mehdi_amini, Prazek, inglorion, eraman, steven_wu, dexonsmith, arphaman, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D54815
      
      llvm-svn: 351453
      4fcf3b16
    • James Henderson's avatar
      [llvm-readobj][ELF]Add demangling support · e50d9cb3
      James Henderson authored
      This change adds demangling support to the ELF side of llvm-readobj,
      under the switch --demangle/-C.
      
      The following places are demangled: symbol table dumps (static and
      dynamic), relocation dumps (static and dynamic), addrsig dumps, call
      graph profile dumps, and group section signature symbols.
      
      Although GNU readelf doesn't support demangling, it is still a useful
      feature to have, and brings it on a par with llvm-objdump's
      capabilities.
      
      This fixes https://bugs.llvm.org/show_bug.cgi?id=40054.
      
      Reviewed by: grimar, rupprecht
      
      Differential Revision: https://reviews.llvm.org/D56791
      
      llvm-svn: 351450
      e50d9cb3
    • Max Kazantsev's avatar
      [LoopSimplifyCFG] Form LCSSA when a parent loop becomes a sibling · 61a8d3fb
      Max Kazantsev authored
      During the transforms in LoopSimplifyCFG, when we remove a dead exiting edge, the
      parent loop may stop being reachable from the child loop, and therefore they become
      siblings. If the former child loop had uses of some values from its former parent loop,
      now such uses will require LCSSA Phis, even if they weren't needed before. So we must
      form LCSSA for all loops that stopped being ancestors of the current loop in this case.
      
      Differential Revision: https://reviews.llvm.org/D56144
      Reviewed By: fedor.sergeev
      
      llvm-svn: 351434
      61a8d3fb
    • Max Kazantsev's avatar
      [LoopSimplifyCFG] Fix order of deletion of complex dead subloops · 8b134169
      Max Kazantsev authored
      Function `DeleteDeadBlock` requires that all predecessors of a block
      being deleted have already been deleted, with the exception of a
      single-block loop. When we use it for removal of dead subloops that
      contain more than one block, we may not fulfull this requirement and
      fail an assertion.
      
      This patch replaces invocation of `DeleteDeadBlock` with a generalized
      version `DeleteDeadBlocks` that is able to deal with multiple dead blocks,
      even if they contain some cycles.
      
      Differential Revision: https://reviews.llvm.org/D56121
      Reviewed By: fedor.sergeev
      
      llvm-svn: 351433
      8b134169
    • Simon Pilgrim's avatar
      [X86][SSE] Add PR40340 test case · 6cc9c3cd
      Simon Pilgrim authored
      llvm-svn: 351430
      6cc9c3cd
    • Simon Pilgrim's avatar
      [X86] Add AVX512 test to insertps · 8260bf9d
      Simon Pilgrim authored
      Pre-commit for PR40340
      
      llvm-svn: 351429
      8260bf9d
    • Matt Arsenault's avatar
      Allow FP types for atomicrmw xchg · 0cb08e44
      Matt Arsenault authored
      llvm-svn: 351427
      0cb08e44
    • Diana Picus's avatar
      [ARM GlobalISel] Allow calls to varargs functions · d5c2499a
      Diana Picus authored
      Allow varargs functions to be called, both in arm and thumb mode. This
      boils down to choosing the correct calling convention, which we can
      easily test by making sure arm_aapcscc is used instead of
      arm_aapcs_vfpcc when the callee is variadic.
      
      llvm-svn: 351424
      d5c2499a
    • Alex Bradbury's avatar
      [RISCV] Add codegen support for RV64A · 07f1c623
      Alex Bradbury authored
      In order to support codegen RV64A, this patch:
      * Introduces masked atomics intrinsics for atomicrmw operations and cmpxchg
        that use the i64 type. These are ultimately lowered to masked operations
        using lr.w/sc.w, but we need to use these alternate intrinsics for RV64
        because i32 is not legal
      * Modifies RISCVExpandPseudoInsts.cpp to handle PseudoAtomicLoadNand64 and
        PseudoCmpXchg64
      * Modifies the AtomicExpandPass hooks in RISCVTargetLowering to sext/trunc as
        needed for RV64 and to select the i64 intrinsic IDs when necessary
      * Adds appropriate patterns to RISCVInstrInfoA.td
      * Updates test/CodeGen/RISCV/atomic-*.ll to show RV64A support
      
      This ends up being a fairly mechanical change, as the logic for RV32A is
      effectively reused.
      
      Differential Revision: https://reviews.llvm.org/D53233
      
      llvm-svn: 351422
      07f1c623
    • Sanjin Sijaric's avatar
      [ARM64][Windows] Share unwind codes between epilogues · b6940306
      Sanjin Sijaric authored
      There are cases where we have multiple epilogues that have the exact same unwind
      code sequence.  In that case, the epilogues can share the same unwind codes in
      the .xdata section.  This should get us past the assert "SEH unwind data
      splitting not yet implemented" in many cases.
      
      We still need to add support for generating multiple .pdata/.xdata sections for
      those functions that need to be split into fragments.
      
      Differential Revision: https://reviews.llvm.org/D56813
      
      llvm-svn: 351421
      b6940306
    • Thomas Lively's avatar
      [WebAssembly] Parse llvm.ident into producers section · cbda16eb
      Thomas Lively authored
      llvm-svn: 351413
      cbda16eb
    • Vedant Kumar's avatar
      [MergeFunc] Prevent silent miscompile of vararg functions · a9906c1e
      Vedant Kumar authored
      The function merging pass miscompiles identical vararg functions. The
      forwarding thunk it emits doesn't forward the full variable-length list
      of arguments. Disable merging for vararg functions for now.
      
      I've filed llvm.org/PR40345 to track the issue.
      
      rdar://47326238
      
      llvm-svn: 351411
      a9906c1e
    • Thomas Lively's avatar
      Revert "[WebAssembly] Parse llvm.ident into producers section" · 3cfcc94c
      Thomas Lively authored
      This reverts commit eccdbba3a02a33e13b5262e92200a33e2ead873d.
      
      llvm-svn: 351410
      3cfcc94c
    • Vedant Kumar's avatar
      [FunctionComparator] Consider tail call kinds · e21ab221
      Vedant Kumar authored
      Essentially, do not treat `call` and `musttail call` as the same thing.
      
      As a drive-by, fold CallInst and InvokeInst handling together using the
      CallSite helper.
      
      Differential Revision: https://reviews.llvm.org/D56815
      
      llvm-svn: 351405
      e21ab221
    • Sanjin Sijaric's avatar
      [SEH] [ARM64] Retrieve the frame pointer from SEH funclets · 685565ae
      Sanjin Sijaric authored
      The Windows ARM64 runtime passes the establisher frame to funclets as the first
      argument.
      
      llvm-svn: 351404
      685565ae
    • Thomas Lively's avatar
      [WebAssembly] Parse llvm.ident into producers section · a56c23c5
      Thomas Lively authored
      Summary:
      Everything before the word "version" is the tool, and everything after
      the word "version" is the version.
      
      Reviewers: aheejin, dschuff
      
      Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D56742
      
      llvm-svn: 351399
      a56c23c5
    • Jonas Devlieghere's avatar
      [AsmPrinter] Collapse .loc 0 0 directives · 669edb5c
      Jonas Devlieghere authored
      Currently we do not always collapse subsequent .loc 0 0 directives. The
      reason is that we were checking for a PrevInstLoc which is not set when
      we emit a line-0 record. We should only check the LastAsmLine, which
      seems to be created exactly for this purpose.
      
        // When we emit a line-0 record, we don't update PrevInstLoc; so look at
        // the last line number actually emitted, to see if it was line 0.
        unsigned LastAsmLine =
          Asm->OutStreamer->getContext().getCurrentDwarfLoc().getLine();
      
      Differential revision: https://reviews.llvm.org/D56767
      
      llvm-svn: 351395
      669edb5c
    • Wei Mi's avatar
      [PGO] Make pgo related options in opt more consistent. · c876e3d4
      Wei Mi authored
      Currently we have pgo options defined in PassManagerBuilder.cpp only for
      instrument pgo, but not for sample pgo. We also have pgo options defined
      in NewPMDriver.cpp in opt only for new pass manager and for all kinds of
      pgo. They have some inconsistency.
      
      To make the options more consistent and make tests writing easier, the
      patch let old pass manager to share the same pgo options with new pass
      manager in opt, and removes the options in PassManagerBuilder.cpp.
      
      Differential Revision: https://reviews.llvm.org/D56749
      
      llvm-svn: 351392
      c876e3d4
  3. Jan 16, 2019
    • Craig Topper's avatar
      [X86] Add X86ISD::VSHLV and X86ISD::VSRLV nodes for psllv and psrlv · 59abdf5f
      Craig Topper authored
      Previously we used ISD::SHL and ISD::SRL to represent these in SelectionDAG. ISD::SHL/SRL interpret an out of range shift amount as undefined behavior and will constant fold to undef. While the intrinsics are defined to return 0 for out of range shift amounts. A previous patch added a special node for VPSRAV to produce all sign bits.
      
      This was previously believed safe because undefs frequently get turned into 0 either from the constant pool or a desire to not have a false register dependency. But undef is treated specially in some optimizations. For example, its ignored in detection of vector splats. So if the ISD::SHL/SRL can be constant folded and all of the elements with in bounds shift amounts are the same, we might fold it to single element broadcast from the constant pool. This would not put 0s in the elements with out of bounds shift amounts.
      
      We do have an existing InstCombine optimization to use shl/lshr when the shift amounts are all constant and in bounds. That should prevent some loss of constant folding from this change.
      
      Patch by zhutianyang and Craig Topper
      
      Differential Revision: https://reviews.llvm.org/D56695
      
      llvm-svn: 351381
      59abdf5f
    • Changpeng Fang's avatar
      AMDGPU: Adjust the chain for loads writing to the HI part of a register. · fe9269f8
      Changpeng Fang authored
      Summary:
        For these loads that write to the HI part of a register, we should chain them to the op that writes to the LO part
      of the register to maintain the appropriate order.
      
      Reviewers:
        rampitec, arsenm
      
      Differential Revision:
        https://reviews.llvm.org/D56454
      
      llvm-svn: 351379
      fe9269f8
    • Craig Topper's avatar
      [X86] Add a one use check to the setcc inversion code in combineVSelectWithAllOnesOrZeros · e5b7cc8a
      Craig Topper authored
      If we're going to generate a new inverted setcc, we should make sure we will be able to remove the old setcc.
      
      Differential Revision: https://reviews.llvm.org/D56765
      
      llvm-svn: 351378
      e5b7cc8a
    • Craig Topper's avatar
      [X86] Add test case for D56765. NFC · 238ad13b
      Craig Topper authored
      llvm-svn: 351377
      238ad13b
    • Nikita Popov's avatar
      [X86] Add additional saturating add/sub vector tests; NFC · 5fa75a04
      Nikita Popov authored
      Additional tests for vNi32 and vNi64. I've added these for
      usub.sat before, this covers uadd.sat, ssub.sat and sadd.sat.
      
      llvm-svn: 351375
      5fa75a04
    • Mandeep Singh Grang's avatar
      [COFF, ARM64] Implement support for SEH extensions __try/__except/__finally · 33c49c0c
      Mandeep Singh Grang authored
      Summary:
      This patch supports MS SEH extensions __try/__except/__finally. The intrinsics localescape and localrecover are responsible for communicating escaped static allocas from the try block to the handler.
      
      We need to preserve frame pointers for SEH. So we create a new function/property HasLocalEscape.
      
      Reviewers: rnk, compnerd, mstorsjo, TomTan, efriedma, ssijaric
      
      Reviewed By: rnk, efriedma
      
      Subscribers: smeenai, jrmuizel, alex, majnemer, ssijaric, ehsan, dmajor, kristina, javed.absar, kristof.beyls, chrib, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D53540
      
      llvm-svn: 351370
      33c49c0c
    • Andrea Di Biagio's avatar
      [X86][BtVer2] Update latency of horizontal operations. · c5f0f530
      Andrea Di Biagio authored
      On Jaguar, horizontal adds/subs have local forwarding disable.
      That means, we pay a compulsory extra cycle of write-back stage, and the value
      is not available until the end of that stage.
      
      This patch changes the latency of horizontal operations by adding an extra
      cycle. With this patch, latency numbers now match what is reported by perf.
      
      I plan to send another patch to also 'fix' the latency of shuffle operations (on
      Jaguar, local forwarding is disabled for vector shuffles too).
      
      Differential Revision: https://reviews.llvm.org/D56777
      
      llvm-svn: 351366
      c5f0f530
Loading