Skip to content
  1. Oct 24, 2016
  2. Oct 20, 2016
    • Sanjay Patel's avatar
      [Target] remove TargetRecip class; 2nd try · 0051efcf
      Sanjay Patel authored
      This is a retry of r284495 which was reverted at r284513 due to use-after-scope bugs
      caused by faulty usage of StringRef.
      
      This version also renames a pair of functions:
      getRecipEstimateDivEnabled()
      getRecipEstimateSqrtEnabled()
      as suggested by Eric Christopher.
      
      original commit msg:
      
      [Target] remove TargetRecip class; move reciprocal estimate isel functionality to TargetLowering
      
      This is a follow-up to https://reviews.llvm.org/D24816 - where we changed reciprocal estimates to be function attributes
      rather than TargetOptions.
      
      This patch is intended to be a structural, but not functional change. By moving all of the
      TargetRecip functionality into TargetLowering, we can remove all of the reciprocal estimate
      state, shield the callers from the string format implementation, and simplify/localize the
      logic needed for a target to enable this.
      
      If a function has a "reciprocal-estimates" attribute, those settings may override the target's
      default reciprocal preferences for whatever operation and data type we're trying to optimize.
      If there's no attribute string or specific setting for the op/type pair, just use the target
      default settings.
      
      As noted earlier, a better solution would be to move the reciprocal estimate settings to IR
      instructions and SDNodes rather than function attributes, but that's a multi-step job that
      requires infrastructure improvements. I intend to work on that, but it's not clear how long
      it will take to get all the pieces in place.
      
      Differential Revision: https://reviews.llvm.org/D25440
      
      llvm-svn: 284746
      0051efcf
    • Benjamin Kramer's avatar
      Do a sweep over move ctors and remove those that are identical to the default. · 2a8bef87
      Benjamin Kramer authored
      All of these existed because MSVC 2013 was unable to synthesize default
      move ctors. We recently dropped support for it so all that error-prone
      boilerplate can go.
      
      No functionality change intended.
      
      llvm-svn: 284721
      2a8bef87
  3. Oct 18, 2016
    • Sanjay Patel's avatar
      revert r284495: [Target] remove TargetRecip class · 19601fa5
      Sanjay Patel authored
      There's something wrong with the StringRef usage while parsing the attribute string.
      
      llvm-svn: 284513
      19601fa5
    • Sanjay Patel's avatar
      [Target] remove TargetRecip class; move reciprocal estimate isel functionality to TargetLowering · 08fff9ca
      Sanjay Patel authored
      This is a follow-up to D24816 - where we changed reciprocal estimates to be function attributes
      rather than TargetOptions.
      
      This patch is intended to be a structural, but not functional change. By moving all of the
      TargetRecip functionality into TargetLowering, we can remove all of the reciprocal estimate
      state, shield the callers from the string format implementation, and simplify/localize the
      logic needed for a target to enable this.
      
      If a function has a "reciprocal-estimates" attribute, those settings may override the target's
      default reciprocal preferences for whatever operation and data type we're trying to optimize.
      If there's no attribute string or specific setting for the op/type pair, just use the target
      default settings.
      
      As noted earlier, a better solution would be to move the reciprocal estimate settings to IR
      instructions and SDNodes rather than function attributes, but that's a multi-step job that
      requires infrastructure improvements. I intend to work on that, but it's not clear how long
      it will take to get all the pieces in place.
      
      Differential Revision: https://reviews.llvm.org/D25440
      
      llvm-svn: 284495
      08fff9ca
  4. Oct 14, 2016
    • Guozhi Wei's avatar
      [PPC] Shorter sequence to load 64bit constant with same hi/lo words · 0cd65429
      Guozhi Wei authored
      This is a patch to implement pr30640.
      
      When a 64bit constant has the same hi/lo words, we can use rldimi to copy the low word into high word of the same register.
      
      This optimization caused failure of test case bperm.ll because of not optimal heuristic in function SelectAndParts64. It chooses AND or ROTATE to extract bit groups from a register, and OR them together. This optimization lowers the cost of loading 64bit constant mask used in AND method, and causes different code sequence. But actually ROTATE method is better in this test case. The reason is in ROTATE method the final OR operation can be avoided since rldimi can insert the rotated bits into target register directly. So this patch also enhances SelectAndParts64 to prefer ROTATE method when the two methods have same cost and there are multiple bit groups need to be ORed together.
      
      Differential Revision: https://reviews.llvm.org/D25521
      
      llvm-svn: 284276
      0cd65429
  5. Oct 12, 2016
    • Tim Shen's avatar
      [PPCMIPeephole] Fix splat elimination · 4ff62b18
      Tim Shen authored
      Summary:
      In PPCMIPeephole, when we see two splat instructions, we can't simply do the following transformation:
        B = Splat A
        C = Splat B
      =>
        C = Splat A
      because B may still be used between these two instructions. Instead, we should make the second Splat a PPC::COPY and let later passes decide whether to remove it or not:
        B = Splat A
        C = Splat B
      =>
        B = Splat A
        C = COPY B
      
      Fixes PR30663.
      
      Reviewers: echristo, iteratee, kbarton, nemanjai
      
      Subscribers: mehdi_amini, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D25493
      
      llvm-svn: 283961
      4ff62b18
  6. Oct 11, 2016
  7. Oct 10, 2016
  8. Oct 09, 2016
  9. Oct 07, 2016
  10. Oct 04, 2016
    • Sanjay Patel's avatar
      [Target] move reciprocal estimate settings from TargetOptions to TargetLowering · bfdbea64
      Sanjay Patel authored
      The motivation for the change is that we can't have pseudo-global settings for
      codegen living in TargetOptions because that doesn't work with LTO.
      
      Ideally, these reciprocal attributes will be moved to the instruction-level via
      FMF, metadata, or something else. But making them function attributes is at least
      an improvement over the current state.
      
      The ingredients of this patch are:
      
          Remove the reciprocal estimate command-line debug option.
          Add TargetRecip to TargetLowering.
          Remove TargetRecip from TargetOptions.
          Clean up the TargetRecip implementation to work with this new scheme.
          Set the default reciprocal settings in TargetLoweringBase (everything is off).
          Update the PowerPC defaults, users, and tests.
          Update the x86 defaults, users, and tests.
      
      Note that if this patch needs to be reverted, the related clang patch checked in
      at r283251 should be reverted too.
      
      Differential Revision: https://reviews.llvm.org/D24816
      
      llvm-svn: 283252
      bfdbea64
    • Nemanja Ivanovic's avatar
      [Power9] Exploit D-Form VSX Scalar memory ops that target full VSX register set · 6354d235
      Nemanja Ivanovic authored
      This patch corresponds to review:
      
      The newly added VSX D-Form (register + offset) memory ops target the upper half
      of the VSX register set. The existing ones target the lower half. In order to
      unify these and have the ability to target all the VSX registers using D-Form
      operations, this patch defines Pseudo-ops for the loads/stores which are
      expanded post-RA. The expansion then choses the correct opcode based on the
      register that was allocated for the operation.
      
      llvm-svn: 283212
      6354d235
    • Nemanja Ivanovic's avatar
      [Power9] Part-word VSX integer scalar loads/stores and sign extend instructions · 11049f8f
      Nemanja Ivanovic authored
      This patch corresponds to review:
      https://reviews.llvm.org/D23155
      
      This patch removes the VSHRC register class (based on D20310) and adds
      exploitation of the Power9 sub-word integer loads into VSX registers as well
      as vector sign extensions.
      The new instructions are useful for a few purposes:
      
          Int to Fp conversions of 1 or 2-byte values loaded from memory
          Building vectors of 1 or 2-byte integers with values loaded from memory
          Storing individual 1 or 2-byte elements from integer vectors
      
      This patch implements all of those uses.
      
      llvm-svn: 283190
      11049f8f
  11. Oct 03, 2016
    • Hal Finkel's avatar
      [PowerPC] Account for the ELFv2 function prologue during branch selection · 530fa5fc
      Hal Finkel authored
      The PPC branch-selection pass, which performs branch relaxation, needs to
      account for the padding that might be introduced to satisfy block alignment
      requirements. We were assuming that the first block was at offset zero (i.e.
      had the alignment of the function itself), but under the ELFv2 ABI, a global
      entry function prologue is added to the first block, and it is a
      two-instruction sequence (i.e. eight-bytes long). If the function has 16-byte
      alignment, the fact that the first block is eight bytes offset from the start
      of the function is relevant to calculating where padding will be added in
      between later blocks.
      
      Unfortunately, I don't have a small test case.
      
      llvm-svn: 283086
      530fa5fc
  12. Oct 02, 2016
    • Hal Finkel's avatar
      [PowerPC] Refactor soft-float support, and enable PPC64 soft float · a9321059
      Hal Finkel authored
      This change enables soft-float for PowerPC64, and also makes soft-float disable
      all vector instruction sets for both 32-bit and 64-bit modes. This latter part
      is necessary because the PPC backend canonicalizes many Altivec vector types to
      floating-point types, and so soft-float breaks scalarization support for many
      operations. Both for embedded targets and for operating-system kernels desiring
      soft-float support, it seems reasonable that disabling hardware floating-point
      also disables vector instructions (embedded targets without hardware floating
      point support are unlikely to have Altivec, etc. and operating system kernels
      desiring not to use floating-point registers to lower syscall cost are unlikely
      to want to use vector registers either). If someone needs this to work, we'll
      need to change the fact that we promote many Altivec operations to act on
      v4f32. To make it possible to disable Altivec when soft-float is enabled,
      hardware floating-point support needs to be expressed as a positive feature,
      like the others, and not a negative feature, because target features cannot
      have dependencies on the disabling of some other feature. So +soft-float has
      now become -hard-float.
      
      Fixes PR26970.
      
      llvm-svn: 283060
      a9321059
  13. Oct 01, 2016
  14. Sep 27, 2016
    • Nemanja Ivanovic's avatar
      [Power9] Builtins for ELF v.2 API conformance - back end portion · 6f22b413
      Nemanja Ivanovic authored
      This patch corresponds to review:
      https://reviews.llvm.org/D24396
      
      This patch adds support for the "vector count trailing zeroes",
      "vector compare not equal" and "vector compare not equal or zero instructions"
      as well as "scalar count trailing zeroes" instructions. It also changes the
      vector negation to use XXLNOR (when VSX is enabled) so as not to increase
      register pressure (previously this was done with a splat immediate of all
      ones followed by an XXLXOR). This was done because the altivec.h
      builtins (patch to follow) use vector negation and the use of an additional
      register for the splat immediate is not optimal.
      
      llvm-svn: 282478
      6f22b413
  15. Sep 23, 2016
  16. Sep 22, 2016
  17. Sep 16, 2016
  18. Sep 14, 2016
Loading