Skip to content
  1. Nov 22, 2016
  2. Nov 20, 2016
  3. Nov 19, 2016
    • Daniel Sanders's avatar
      Check that emitted instructions meet their predicates on all targets except ARM, Mips, and X86. · 72db2a39
      Daniel Sanders authored
      Summary:
      * ARM is omitted from this patch because this check appears to expose bugs in this target.
      * Mips is omitted from this patch because this check either detects bugs or deliberate
        emission of instructions that don't satisfy their predicates. One deliberate
        use is the SYNC instruction where the version with an operand is correctly
        defined as requiring MIPS32 while the version without an operand is defined
        as an alias of 'SYNC 0' and requires MIPS2.
      * X86 is omitted from this patch because it doesn't use the tablegen-erated
        MCCodeEmitter infrastructure.
      
      Patches for ARM and Mips will follow.
      
      Depends on D25617
      
      Reviewers: tstellarAMD, jmolloy
      
      Subscribers: wdng, jmolloy, aemerson, rengolin, arsenm, jyknight, nemanjai, nhaehnle, tstellarAMD, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D25618
      
      llvm-svn: 287439
      72db2a39
  4. Nov 18, 2016
    • Ehsan Amiri's avatar
      [PPC] limit line width to 80 characters · 395be572
      Ehsan Amiri authored
      NFC. Forgot to fix this in the original commit.
      
      llvm-svn: 287350
      395be572
    • Ehsan Amiri's avatar
      [Power9] Add patterns for vnegd, vnegw · ff0942e6
      Ehsan Amiri authored
      Exploit new instructions by adding patterns to .td file.
      https://reviews.llvm.org/D26551
      
      llvm-svn: 287334
      ff0942e6
    • Ehsan Amiri's avatar
      [PPC][DAGCombine] Convert SETCC to subtract when the result is zero extended · 85818684
      Ehsan Amiri authored
      When we see a SETCC whose only users are zero extend operations, we can replace
      it with a subtraction. This results in doing all calculations in GPRs and
      avoids CR use.
      
      Currently we do this only for ULT, ULE, UGT and UGE condition codes. There are
      ways that this can be extended. For example for signed condition codes. In that
      case we will be introducing additional sign extend instructions, so more careful
      profitability analysis may be required.
      
      Another direction to extend this is for equal, not equal conditions. Also when
      users of SETCC are any_ext or sign_ext, we might be able to do something 
      similar.
      
      llvm-svn: 287329
      85818684
  5. Nov 16, 2016
    • Joerg Sonnenberger's avatar
      Always use relative jump table encodings on PowerPC64. · 8c1a9ac5
      Joerg Sonnenberger authored
      For the default, small and medium code model, use the existing
      difference from the jump table towards the label. For all other code
      models, setup the picbase and use the difference between the picbase and
      the block address.
      
      Overall, this results in smaller data tables at the expensive of one or
      two more arithmetic operation at the jump site. Given that we only create
      jump tables with a lot more than two entries, it is a net win in size.
      For larger code models the assumption remains that individual functions
      are no larger than 2GB.
      
      Differential Revision: https://reviews.llvm.org/D26336
      
      llvm-svn: 287059
      8c1a9ac5
  6. Nov 15, 2016
    • Zaara Syeda's avatar
      vector load store with length (left justified) llvm portion · a19c9e60
      Zaara Syeda authored
      llvm-svn: 286993
      a19c9e60
    • Tony Jiang's avatar
      [PowerPC] Implement BE VSX load/store builtins - llvm portion. · 5f850cd1
      Tony Jiang authored
      This patch implements all the overloads for vec_xl_be and vec_xst_be. On BE,
      they behaves exactly the same with vec_xl and vec_xst, therefore they are
      simply implemented by defining a matching macro. On LE, they are implemented
      by defining new builtins and intrinsics. For int/float/long long/double, it
      is just a load (lxvw4x/lxvd2x) or store(stxvw4x/stxvd2x). For char/char/short,
      we also need some extra shuffling before or after call the builtins to get the
      desired BE order. For int128, simply call vec_xl or vec_xst.
      
      llvm-svn: 286967
      5f850cd1
  7. Nov 14, 2016
  8. Nov 11, 2016
  9. Nov 04, 2016
    • Chandler Carruth's avatar
      Sink all of the code relying on the MachO MachineModuleInfo to live · 651f0192
      Chandler Carruth authored
      behind the test that the MachineModuleInfo analysis was
      actually available and can be used.
      
      While the MachO bits may well be reasonable to assume in the darwin
      assembly printer, the analysis isn't constructively guaranteed anywhere
      I could find so it seems safest to avoid crashing here.
      
      This issue was found with PVS-Studio. Pretty sure the Clang Static
      Anaylzer flags similar issues but we've probably never pointed it at
      this code effectively.
      
      llvm-svn: 285972
      651f0192
  10. Nov 03, 2016
  11. Nov 02, 2016
  12. Nov 01, 2016
  13. Oct 31, 2016
  14. Oct 28, 2016
  15. Oct 27, 2016
  16. Oct 26, 2016
  17. Oct 24, 2016
  18. Oct 20, 2016
    • Sanjay Patel's avatar
      [Target] remove TargetRecip class; 2nd try · 0051efcf
      Sanjay Patel authored
      This is a retry of r284495 which was reverted at r284513 due to use-after-scope bugs
      caused by faulty usage of StringRef.
      
      This version also renames a pair of functions:
      getRecipEstimateDivEnabled()
      getRecipEstimateSqrtEnabled()
      as suggested by Eric Christopher.
      
      original commit msg:
      
      [Target] remove TargetRecip class; move reciprocal estimate isel functionality to TargetLowering
      
      This is a follow-up to https://reviews.llvm.org/D24816 - where we changed reciprocal estimates to be function attributes
      rather than TargetOptions.
      
      This patch is intended to be a structural, but not functional change. By moving all of the
      TargetRecip functionality into TargetLowering, we can remove all of the reciprocal estimate
      state, shield the callers from the string format implementation, and simplify/localize the
      logic needed for a target to enable this.
      
      If a function has a "reciprocal-estimates" attribute, those settings may override the target's
      default reciprocal preferences for whatever operation and data type we're trying to optimize.
      If there's no attribute string or specific setting for the op/type pair, just use the target
      default settings.
      
      As noted earlier, a better solution would be to move the reciprocal estimate settings to IR
      instructions and SDNodes rather than function attributes, but that's a multi-step job that
      requires infrastructure improvements. I intend to work on that, but it's not clear how long
      it will take to get all the pieces in place.
      
      Differential Revision: https://reviews.llvm.org/D25440
      
      llvm-svn: 284746
      0051efcf
    • Benjamin Kramer's avatar
      Do a sweep over move ctors and remove those that are identical to the default. · 2a8bef87
      Benjamin Kramer authored
      All of these existed because MSVC 2013 was unable to synthesize default
      move ctors. We recently dropped support for it so all that error-prone
      boilerplate can go.
      
      No functionality change intended.
      
      llvm-svn: 284721
      2a8bef87
  19. Oct 18, 2016
    • Sanjay Patel's avatar
      revert r284495: [Target] remove TargetRecip class · 19601fa5
      Sanjay Patel authored
      There's something wrong with the StringRef usage while parsing the attribute string.
      
      llvm-svn: 284513
      19601fa5
    • Sanjay Patel's avatar
      [Target] remove TargetRecip class; move reciprocal estimate isel functionality to TargetLowering · 08fff9ca
      Sanjay Patel authored
      This is a follow-up to D24816 - where we changed reciprocal estimates to be function attributes
      rather than TargetOptions.
      
      This patch is intended to be a structural, but not functional change. By moving all of the
      TargetRecip functionality into TargetLowering, we can remove all of the reciprocal estimate
      state, shield the callers from the string format implementation, and simplify/localize the
      logic needed for a target to enable this.
      
      If a function has a "reciprocal-estimates" attribute, those settings may override the target's
      default reciprocal preferences for whatever operation and data type we're trying to optimize.
      If there's no attribute string or specific setting for the op/type pair, just use the target
      default settings.
      
      As noted earlier, a better solution would be to move the reciprocal estimate settings to IR
      instructions and SDNodes rather than function attributes, but that's a multi-step job that
      requires infrastructure improvements. I intend to work on that, but it's not clear how long
      it will take to get all the pieces in place.
      
      Differential Revision: https://reviews.llvm.org/D25440
      
      llvm-svn: 284495
      08fff9ca
  20. Oct 14, 2016
    • Guozhi Wei's avatar
      [PPC] Shorter sequence to load 64bit constant with same hi/lo words · 0cd65429
      Guozhi Wei authored
      This is a patch to implement pr30640.
      
      When a 64bit constant has the same hi/lo words, we can use rldimi to copy the low word into high word of the same register.
      
      This optimization caused failure of test case bperm.ll because of not optimal heuristic in function SelectAndParts64. It chooses AND or ROTATE to extract bit groups from a register, and OR them together. This optimization lowers the cost of loading 64bit constant mask used in AND method, and causes different code sequence. But actually ROTATE method is better in this test case. The reason is in ROTATE method the final OR operation can be avoided since rldimi can insert the rotated bits into target register directly. So this patch also enhances SelectAndParts64 to prefer ROTATE method when the two methods have same cost and there are multiple bit groups need to be ORed together.
      
      Differential Revision: https://reviews.llvm.org/D25521
      
      llvm-svn: 284276
      0cd65429
  21. Oct 12, 2016
    • Tim Shen's avatar
      [PPCMIPeephole] Fix splat elimination · 4ff62b18
      Tim Shen authored
      Summary:
      In PPCMIPeephole, when we see two splat instructions, we can't simply do the following transformation:
        B = Splat A
        C = Splat B
      =>
        C = Splat A
      because B may still be used between these two instructions. Instead, we should make the second Splat a PPC::COPY and let later passes decide whether to remove it or not:
        B = Splat A
        C = Splat B
      =>
        B = Splat A
        C = COPY B
      
      Fixes PR30663.
      
      Reviewers: echristo, iteratee, kbarton, nemanjai
      
      Subscribers: mehdi_amini, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D25493
      
      llvm-svn: 283961
      4ff62b18
  22. Oct 11, 2016
  23. Oct 10, 2016
  24. Oct 09, 2016
  25. Oct 07, 2016
Loading