Skip to content
  1. Mar 12, 2018
  2. Mar 09, 2018
  3. Mar 08, 2018
  4. Mar 07, 2018
  5. Mar 06, 2018
    • Craig Topper's avatar
      [TargetLowering] Rename DAGCombinerInfo::isAfterLegalizeVectorOps to... · 80d3bb3b
      Craig Topper authored
      [TargetLowering] Rename DAGCombinerInfo::isAfterLegalizeVectorOps to DAGCombiner::isAfterLegalizeDAG since that's what it checks. NFC
      
      The code checks Level == AfterLegalizeDAG which is the fourth and last of the possible DAG combine stages that we have.
      
      There is a Level called AfterLegalVectorOps, but that's the third DAG combine and it doesn't always run.
      
      A function called isAfterLegalVectorOps should imply it returns true in either of the DAG combines that runs after the legalize vector ops stage, but that's not what this function does.
      
      llvm-svn: 326832
      80d3bb3b
  6. Mar 05, 2018
  7. Mar 02, 2018
  8. Mar 01, 2018
  9. Feb 28, 2018
    • Chih-Hung Hsieh's avatar
      [TLS] use emulated TLS if the target supports only this mode · 9f9e4681
      Chih-Hung Hsieh authored
      Emulated TLS is enabled by llc flag -emulated-tls,
      which is passed by clang driver.
      When llc is called explicitly or from other drivers like LTO,
      missing -emulated-tls flag would generate wrong TLS code for targets
      that supports only this mode.
      Now use useEmulatedTLS() instead of Options.EmulatedTLS to decide whether
      emulated TLS code should be generated.
      Unit tests are modified to run with and without the -emulated-tls flag.
      
      Differential Revision: https://reviews.llvm.org/D42999
      
      llvm-svn: 326341
      9f9e4681
  10. Feb 24, 2018
  11. Feb 23, 2018
    • Stefan Pintilie's avatar
      [Power9] Add missing instructions to the Power 9 scheduler · 626b6510
      Stefan Pintilie authored
      This is the first in a series of patches that will define more
      instructions using InstRW so that we can move away from ItinRW
      and ultimately have a complete Power 9 scheduler.
      
      Differential Revision: https://reviews.llvm.org/D43635
      
      llvm-svn: 325956
      626b6510
    • Geoff Berry's avatar
      [MachineOperand][Target] MachineOperand::isRenamable semantics changes · f8bf2ec0
      Geoff Berry authored
      Summary:
      Add a target option AllowRegisterRenaming that is used to opt in to
      post-register-allocation renaming of registers.  This is set to 0 by
      default, which causes the hasExtraSrcRegAllocReq/hasExtraDstRegAllocReq
      fields of all opcodes to be set to 1, causing
      MachineOperand::isRenamable to always return false.
      
      Set the AllowRegisterRenaming flag to 1 for all in-tree targets that
      have lit tests that were effected by enabling COPY forwarding in
      MachineCopyPropagation (AArch64, AMDGPU, ARM, Hexagon, Mips, PowerPC,
      RISCV, Sparc, SystemZ and X86).
      
      Add some more comments describing the semantics of the
      MachineOperand::isRenamable function and how it is set and maintained.
      
      Change isRenamable to check the operand's opcode
      hasExtraSrcRegAllocReq/hasExtraDstRegAllocReq bit directly instead of
      relying on it being consistently reflected in the IsRenamable bit
      setting.
      
      Clear the IsRenamable bit when changing an operand's register value.
      
      Remove target code that was clearing the IsRenamable bit when changing
      registers/opcodes now that this is done conservatively by default.
      
      Change setting of hasExtraSrcRegAllocReq in AMDGPU target to be done in
      one place covering all opcodes that have constant pipe read limit
      restrictions.
      
      Reviewers: qcolombet, MatzeB
      
      Subscribers: aemerson, arsenm, jyknight, mcrosier, sdardis, nhaehnle, javed.absar, tpr, arichardson, kristof.beyls, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, jordy.potman.lists, apazos, sabuasal, niosHD, escha, nemanjai, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D43042
      
      llvm-svn: 325931
      f8bf2ec0
    • Stefan Pintilie's avatar
      [PowerPC] Code cleanup. Remove instructions that were withdrawn from Power 9. · 15e6b10e
      Stefan Pintilie authored
      The following set of instructions was originally planned to be added for Power 9
      and so code was added to support them. However, a decision was made later on to
      withdraw support for these instructions in the hardware.
      xscmpnedp
      xvcmpnesp
      xvcmpnedp
      This patch removes support for the instructions that were not added.
      
      Differential Revision: https://reviews.llvm.org/D43641
      
      llvm-svn: 325918
      15e6b10e
  12. Feb 22, 2018
    • Nemanja Ivanovic's avatar
      [PowerPC] Do not produce invalid CTR loop with an FRem · e54a9ee8
      Nemanja Ivanovic authored
      An FRem instruction inside a loop should prevent the loop from being converted
      into a CTR loop since this is not an operation that is legal on any PPC
      subtarget. This will always be a call to a library function which means the
      loop will be invalid if this instruction is in the body.
      
      Fixes PR36292.
      
      llvm-svn: 325739
      e54a9ee8
  13. Feb 20, 2018
  14. Feb 16, 2018
  15. Feb 09, 2018
  16. Feb 06, 2018
  17. Feb 05, 2018
    • Hiroshi Inoue's avatar
      [PowerPC] Check hot loop exit edge in PPCCTRLoops · c5ab1ab7
      Hiroshi Inoue authored
      PPCCTRLoops transform loops using mtctr/bdnz instructions if loop trip count is known and big enough to compensate for the cost of mtctr.
      But if there is a loop exit edge which is known to be frequently taken (by builtin_expect or by PGO), we should not transform the loop to avoid the cost of mtctr instruction. Here is an example of a loop with hot exit edge:
      
      for (unsigned i = 0; i < TripCount; i++) {
        // do something
        if (__builtin_expect(check(), 1))
          break;
        // do something
      }
      
      Differential Revision: https://reviews.llvm.org/D42637
      
      llvm-svn: 324229
      c5ab1ab7
  18. Feb 02, 2018
    • David Blaikie's avatar
      Remove non-modular header containing static utility functions · d8a6f93a
      David Blaikie authored
      The one place that uses these functions isn't particularly
      long/complicated, so it's easier to just have these inline at that
      location than trying to split it out into a true header. (in part also
      because of the use of the DEBUG macros, which make this not really a
      standalone header even if the static functions were made inline instead)
      
      llvm-svn: 324044
      d8a6f93a
  19. Feb 01, 2018
  20. Jan 31, 2018
  21. Jan 30, 2018
    • Zaara Syeda's avatar
      Re-commit : [PowerPC] Add handling for ColdCC calling convention and a pass to mark · 1f59ae31
      Zaara Syeda authored
      candidates with coldcc attribute.
      
      This recommits r322721 reverted due to sanitizer memory leak build bot failures.
      
      Original commit message:
      This patch adds support for the coldcc calling convention for Power.
      This changes the set of non-volatile registers. It includes a pass to stress
      test the implementation by marking all static directly called functions with
      the coldcc attribute through the option -enable-coldcc-stress-test. It also
      includes an option, -ppc-enable-coldcc, to add the coldcc attribute to
      functions which are cold at all call sites based on BlockFrequencyInfo when
      the containing function does not call any non cold functions.
      
      Differential Revision: https://reviews.llvm.org/D38413
      
      llvm-svn: 323778
      1f59ae31
  22. Jan 29, 2018
  23. Jan 23, 2018
    • Tim Shen's avatar
      [PPC] Avoid incorrect fp-i128-fp lowering. · 7abe9887
      Tim Shen authored
      Summary:
      Fix an issue that's similar to what D41411 fixed:
        float(__int128(float_var)) shouldn't be optimized to xscvdpsxds +
        xscvsxdsp, as they mean (float)(int64_t)float_var.
      
      Reviewers: jtony, hfinkel, echristo
      
      Subscribers: sanjoy, nemanjai, hiraditya, llvm-commits, kbarton
      
      Differential Revision: https://reviews.llvm.org/D42400
      
      llvm-svn: 323270
      7abe9887
  24. Jan 17, 2018
    • Zaara Syeda's avatar
      Revert [PowerPC] This reverts commit rL322721 · c9dc7b45
      Zaara Syeda authored
      Failing build bots. Revert the commit now.
      
      llvm-svn: 322748
      c9dc7b45
    • Zaara Syeda's avatar
      [PowerPC] Add handling for ColdCC calling convention and a pass to mark · 8e951fd2
      Zaara Syeda authored
      candidates with coldcc attribute.
      
      This patch adds support for the coldcc calling convention for Power.
      This changes the set of non-volatile registers. It includes a pass to stress
      test the implementation by marking all static directly called functions with
      the coldcc attribute through the option -enable-coldcc-stress-test. It also
      includes an option, -ppc-enable-coldcc, to add the coldcc attribute to
      functions which are cold at all call sites based on BlockFrequencyInfo when
      the containing function does not call any non cold functions.
      
      Differential Revision: https://reviews.llvm.org/D38413
      
      llvm-svn: 322721
      8e951fd2
  25. Jan 16, 2018
    • Guozhi Wei's avatar
      [PPC] Add a new register XER aliased to CARRY · e6fb4e1f
      Guozhi Wei authored
      When "xer" is specified as clobbered register in inline assembler, clang can accept it, but llvm simply ignore it when lowered to machine instructions. It may cause problems later in scheduler.
      
      This patch adds a new register XER aliased to CARRY, and adds it to register class CARRYRC. Now PPCTargetLowering::getRegForInlineAsmConstraint can return correct register number for inline asm constraint "{xer}", and scheduler behave correctly.
      
      Differential Revision: https://reviews.llvm.org/D41967
      
      llvm-svn: 322591
      e6fb4e1f
  26. Jan 12, 2018
  27. Jan 09, 2018
    • Stefan Pintilie's avatar
      [PowerPC] Manually schedule the prologue and epilogue · 17127008
      Stefan Pintilie authored
      This patch makes the following changes to the schedule of instructions in the
      prologue and epilogue.
      
      The stack pointer update is moved down in the prologue so that the callee saves
      do not have to wait for the update to happen.
      Saving the lr is moved down in the prologue to hide the latency of the mflr.
      The stack pointer is moved up in the epilogue so that restoring of the lr can
      happen sooner.
      The mtlr is moved up in the epilogue so that it is away form the blr at the end
      of the epilogue. The latency of the mtlr can now be hidden by the loads of the
      callee saved registers.
      
      This commit is almost identical to this one: r322036 except that two warnings
      that broke build bots have been fixed.
      
      The revision number is D41737 as before.
      
      llvm-svn: 322124
      17127008
    • Sean Fertile's avatar
      [PowerPC] Can not assume an intrinsic argument is a simple type. · 33a17762
      Sean Fertile authored
      The CTRLoop pass performs checks on the argument of certain libcalls/intrinsics,
      and assumes the arguments must be of a simple type. This isn't always the case
      though. For example if we unroll and vectorize a loop we may end up with vectors
      larger then the largest legal type, along with intrinsics that operate on those
      wider types. This happened in the ffmpeg build, where we unrolled a loop and
      ended up with a sqrt intrinsic that operated on V16f64, triggering an assertion.
      
      Differential Revision: https://reviews.llvm.org/D41758
      
      llvm-svn: 322055
      33a17762
    • Stefan Pintilie's avatar
      Revert "[PowerPC] Manually schedule the prologue and epilogue" · 7e10987b
      Stefan Pintilie authored
      [PowerPC] This reverts commit r322036.
      
      Failing build bots. Revert the commit now.
      
      llvm-svn: 322051
      7e10987b
  28. Jan 08, 2018
    • Stefan Pintilie's avatar
      [PowerPC] Manually schedule the prologue and epilogue · 55bfdd04
      Stefan Pintilie authored
      This patch makes the following changes to the schedule of instructions in the
      prologue and epilogue.
      
      The stack pointer update is moved down in the prologue so that the callee saves
      do not have to wait for the update to happen.
      Saving the lr is moved down in the prologue to hide the latency of the mflr.
      The stack pointer is moved up in the epilogue so that restoring of the lr can
      happen sooner.
      The mtlr is moved up in the epilogue so that it is away form the blr at the end
      of the epilogue. The latency of the mtlr can now be hidden by the loads of the
      callee saved registers.
      
      Differential Revision: https://reviews.llvm.org/D41737
      
      llvm-svn: 322036
      55bfdd04
  29. Jan 07, 2018
    • Craig Topper's avatar
      [PowerPC] Add an ISD::TRUNCATE to the legalization for ppc_is_decremented_ctr_nonzero · d461aefe
      Craig Topper authored
      Summary:
      I believe legalization is really expecting that ReplaceNodeResults will return something with the same type as the thing that's being legalized. Ultimately, it uses the output to replace the uses in the DAG so the type should match to make that work.
      
      There are two relevant cases here. When crbits are enabled, then i1 is a legal type and getSetCCResultType should return i1. In this case, the truncate will be between i1 and i1 and should be removed (SelectionDAG::getNode does this). Otherwise, getSetCCResultType will be i32 and the legalizer will promote the truncate to be i32 -> i32 which will be similarly removed.
      
      With this fixed we can remove some code from PromoteIntRes_SETCC that seemed to only exist to deal with the intrinsic being replaced with a larger type without changing the other operand. With the truncate being used for connectivity this doesn't happen anymore.
      
      Reviewers: hfinkel
      
      Reviewed By: hfinkel
      
      Subscribers: nemanjai, llvm-commits, kbarton
      
      Differential Revision: https://reviews.llvm.org/D41654
      
      llvm-svn: 321959
      d461aefe
  30. Jan 03, 2018
    • Alex Bradbury's avatar
      Thread MCSubtargetInfo through Target::createMCAsmBackend · b22f751f
      Alex Bradbury authored
      Currently it's not possible to access MCSubtargetInfo from a TgtMCAsmBackend. 
      D20830 threaded an MCSubtargetInfo reference through 
      MCAsmBackend::relaxInstruction, but this isn't the only function that would 
      benefit from access. This patch removes the Triple and CPUString arguments 
      from createMCAsmBackend and replaces them with MCSubtargetInfo.
      
      This patch just changes the interface without making any intentional 
      functional changes. Once in, several cleanups are possible:
      * Get rid of the awkward MCSubtargetInfo handling in ARMAsmBackend
      * Support 16-bit instructions when valid in MipsAsmBackend::writeNopData
      * Get rid of the CPU string parsing in X86AsmBackend and just use a SubtargetFeature for HasNopl
      * Emit 16-bit nops in RISCVAsmBackend::writeNopData if the compressed instruction set extension is enabled (see D41221)
      
      This change initially exposed PR35686, which has since been resolved in r321026.
      
      Differential Revision: https://reviews.llvm.org/D41349
      
      llvm-svn: 321692
      b22f751f
  31. Dec 30, 2017
    • Hiroshi Inoue's avatar
      [PowerPC] fix a bug in TCO eligibility check · ca3cdd7f
      Hiroshi Inoue authored
      If the callee and caller use different calling convensions, we cannot apply TCO if the callee requires arguments on stack; e.g. C calling convention and Fast CC use the same registers for parameter passing, but the stack offset is not necessarily same.
      
      This patch also recommit r319218 "[PowerPC] Allow tail calls of fastcc functions from C CallingConv functions." by @sfertile since the problem reported in r320106 should be fixed.
      
      Differential Revision: https://reviews.llvm.org/D40893
      
      llvm-svn: 321579
      ca3cdd7f
Loading