Skip to content
  1. Nov 16, 2017
    • Yaxun Liu's avatar
      Fix APInt bit size in processDbgDeclares · 4d9a4d7a
      Yaxun Liu authored
      processDbgDeclares assumes pointer size is the same for different addr spaces.
      It uses pointer size for addr space 0 for all pointers, which causes assertion
      in stripAndAccumulateInBoundsConstantOffsets for amdgcn---amdgiz since
      pointer in addr space 5 has different size than in addr space 0.
      
      This patch fixes that.
      
      Differential Revision: https://reviews.llvm.org/D40085
      
      llvm-svn: 318370
      4d9a4d7a
    • Rong Xu's avatar
      [CodeGen] Fix the branch probability assertion in r318202 · e4572c6b
      Rong Xu authored
      Due to integer precision, we might have numerator greater than denominator in
      the branch probability scaling. Add a check to prevent this from happening.
      
      llvm-svn: 318353
      e4572c6b
  2. Nov 14, 2017
  3. Nov 13, 2017
  4. Nov 11, 2017
    • Craig Topper's avatar
      [SelectionDAG] Make getUniformBase in SelectionDAGBuilder fail if any of the... · bdb8db45
      Craig Topper authored
      [SelectionDAG] Make getUniformBase in SelectionDAGBuilder fail if any of the middle GEP indices are non-constant.
      
      This is a fix for a bug in r317947. We were supposed to check that all the indices are are constant 0, but instead we're only make sure that indices that are constant are 0. Non-constant indices are being ignored.
      
      llvm-svn: 317950
      bdb8db45
  5. Nov 10, 2017
  6. Nov 09, 2017
  7. Nov 08, 2017
    • Dan Gohman's avatar
      Add an @llvm.sideeffect intrinsic · 2c74fe97
      Dan Gohman authored
      This patch implements Chandler's idea [0] for supporting languages that
      require support for infinite loops with side effects, such as Rust, providing
      part of a solution to bug 965 [1].
      
      Specifically, it adds an `llvm.sideeffect()` intrinsic, which has no actual
      effect, but which appears to optimization passes to have obscure side effects,
      such that they don't optimize away loops containing it. It also teaches
      several optimization passes to ignore this intrinsic, so that it doesn't
      significantly impact optimization in most cases.
      
      As discussed on llvm-dev [2], this patch is the first of two major parts.
      The second part, to change LLVM's semantics to have defined behavior
      on infinite loops by default, with a function attribute for opting into
      potential-undefined-behavior, will be implemented and posted for review in
      a separate patch.
      
      [0] http://lists.llvm.org/pipermail/llvm-dev/2015-July/088103.html
      [1] https://bugs.llvm.org/show_bug.cgi?id=965
      [2] http://lists.llvm.org/pipermail/llvm-dev/2017-October/118632.html
      
      Differential Revision: https://reviews.llvm.org/D38336
      
      llvm-svn: 317729
      2c74fe97
    • Adrian Prantl's avatar
      Handle inlined variables in SelectionDAGBuilder::EmitFuncArgumentDbgValue(). · 93faeecd
      Adrian Prantl authored
      In 2010 a commit with no testcase and no further explanation
      explicitly disabled the handling of inlined variables in
      EmitFuncArgumentDbgValue(). I don't think there is a good reason for
      this any more and re-enabling this adds debug locations for variables
      associated with an LLVM function argument in functions that are
      inlined into the first basic block. The only downside of doing this is
      that we may insert a DBG_VALUE before the inlined scope, but (1) this
      could be filtered out later, and (2) LiveDebugValues will not
      propagate it into subsequent basic blocks if they don't dominate the
      variable's lexical scope, so this seems like a small price to pay.
      
      rdar://problem/26228128
      
      llvm-svn: 317702
      93faeecd
    • Matt Arsenault's avatar
      DAG: Add computeKnownBitsForFrameIndex · f6ee94c1
      Matt Arsenault authored
      Some of the AMDGPU stack addressing modes require knowing the sign
      bit is zero. We used to accomplish this by custom lowering
      frame indexes, and then putting an AssertZext around a
      TargetFrameIndex. This required specifically looking for
      the AssextZext + frame index pattern which was moderately
      disgusting. The same could probably be accomplished
      with a target specific node, but would still
      require special handling of frame indexes.
      
      llvm-svn: 317671
      f6ee94c1
    • David Blaikie's avatar
      Target/TargetInstrInfo.h -> CodeGen/TargetInstrInfo.h to match layering · 3f833edc
      David Blaikie authored
      This header includes CodeGen headers, and is not, itself, included by
      any Target headers, so move it into CodeGen to match the layering of its
      implementation.
      
      llvm-svn: 317647
      3f833edc
  8. Nov 07, 2017
  9. Nov 06, 2017
    • Sanjay Patel's avatar
      [IR] redefine 'UnsafeAlgebra' / 'reassoc' fast-math-flags and add 'trans' fast-math-flag · 629c4115
      Sanjay Patel authored
      As discussed on llvm-dev:
      http://lists.llvm.org/pipermail/llvm-dev/2016-November/107104.html
      and again more recently:
      http://lists.llvm.org/pipermail/llvm-dev/2017-October/118118.html
      
      ...this is a step in cleaning up our fast-math-flags implementation in IR to better match
      the capabilities of both clang's user-visible flags and the backend's flags for SDNode.
      
      As proposed in the above threads, we're replacing the 'UnsafeAlgebra' bit (which had the 
      'umbrella' meaning that all flags are set) with a new bit that only applies to algebraic 
      reassociation - 'AllowReassoc'.
      
      We're also adding a bit to allow approximations for library functions called 'ApproxFunc' 
      (this was initially proposed as 'libm' or similar).
      
      ...and we're out of bits. 7 bits ought to be enough for anyone, right? :) FWIW, I did 
      look at getting this out of SubclassOptionalData via SubclassData (spacious 16-bits), 
      but that's apparently already used for other purposes. Also, I don't think we can just 
      add a field to FPMathOperator because Operator is not intended to be instantiated. 
      We'll defer movement of FMF to another day.
      
      We keep the 'fast' keyword. I thought about removing that, but seeing IR like this:
      %f.fast = fadd reassoc nnan ninf nsz arcp contract afn float %op1, %op2
      ...made me think we want to keep the shortcut synonym.
      
      Finally, this change is binary incompatible with existing IR as seen in the 
      compatibility tests. This statement:
      "Newer releases can ignore features from older releases, but they cannot miscompile 
      them. For example, if nsw is ever replaced with something else, dropping it would be 
      a valid way to upgrade the IR." 
      ( http://llvm.org/docs/DeveloperPolicy.html#ir-backwards-compatibility )
      ...provides the flexibility we want to make this change without requiring a new IR 
      version. Ie, we're not loosening the FP strictness of existing IR. At worst, we will 
      fail to optimize some previously 'fast' code because it's no longer recognized as 
      'fast'. This should get fixed as we audit/squash all of the uses of 'isFast()'.
      
      Note: an inter-dependent clang commit to use the new API name should closely follow 
      commit.
      
      Differential Revision: https://reviews.llvm.org/D39304
      
      llvm-svn: 317488
      629c4115
  10. Nov 03, 2017
    • David Blaikie's avatar
      Move TargetFrameLowering.h to CodeGen where it's implemented · 1be62f03
      David Blaikie authored
      This header already includes a CodeGen header and is implemented in
      lib/CodeGen, so move the header there to match.
      
      This fixes a link error with modular codegeneration builds - where a
      header and its implementation are circularly dependent and so need to be
      in the same library, not split between two like this.
      
      llvm-svn: 317379
      1be62f03
  11. Nov 02, 2017
    • Ayman Musa's avatar
      [X86] Fix bug in legalize vector types - Split large loads · a37d1130
      Ayman Musa authored
      When splitting a large load to smaller legally-typed loads, the last load should be padded to reach the size of the previous one so a CONCAT_VECTORS node could reunite them again.
      The code currently pads the last load to reach the size of the first load (instead of the previous).
      
      Differential Revision: https://reviews.llvm.org/D38495
      
      Change-Id: Ib60b55ed26ce901fabf68108daf52683fbd5013f
      llvm-svn: 317206
      a37d1130
  12. Nov 01, 2017
  13. Oct 30, 2017
    • Simon Pilgrim's avatar
      Fix unused variable warnings. NFCI. · 9cd7abbc
      Simon Pilgrim authored
      llvm-svn: 316964
      9cd7abbc
    • Simon Pilgrim's avatar
      [SelectionDAG] Tidyup computeKnownBits extension/truncation cases. NFCI. · 80b37136
      Simon Pilgrim authored
      We don't need to extend/truncate the Known structure before calling computeKnownBits - it will reset at the start of the function.
      
      llvm-svn: 316962
      80b37136
    • Daniel Neilson's avatar
      Create instruction classes for identifying any atomicity of memory intrinsic. (NFC) · f9c7d29c
      Daniel Neilson authored
      Summary:
      For reference, see: http://lists.llvm.org/pipermail/llvm-dev/2017-August/116589.html
      
      This patch fleshes out the instruction class hierarchy with respect to atomic and
      non-atomic memory intrinsics. With this change, the relevant part of the class
      hierarchy becomes:
      
      IntrinsicInst
        -> MemIntrinsicBase (methods-only class)
          -> MemIntrinsic (non-atomic intrinsics)
            -> MemSetInst
            -> MemTransferInst
              -> MemCpyInst
              -> MemMoveInst
          -> AtomicMemIntrinsic (atomic intrinsics)
            -> AtomicMemSetInst
            -> AtomicMemTransferInst
              -> AtomicMemCpyInst
              -> AtomicMemMoveInst
          -> AnyMemIntrinsic (both atomicities)
            -> AnyMemSetInst
            -> AnyMemTransferInst
              -> AnyMemCpyInst
              -> AnyMemMoveInst
      
      This involves some class renaming:
          ElementUnorderedAtomicMemCpyInst -> AtomicMemCpyInst
          ElementUnorderedAtomicMemMoveInst -> AtomicMemMoveInst
          ElementUnorderedAtomicMemSetInst -> AtomicMemSetInst
      A script for doing this renaming in downstream trees is included below.
      
      An example of where the Any* classes should be used in LLVM is when reasoning
      about the effects of an instruction (ex: aliasing).
      
      ---
      Script for renaming AtomicMem* classes:
      PREFIXES="[<,([:space:]]"
      CLASSES="MemIntrinsic|MemTransferInst|MemSetInst|MemMoveInst|MemCpyInst"
      SUFFIXES="[;)>,[:space:]]"
      
      REGEX="(${PREFIXES})ElementUnorderedAtomic(${CLASSES})(${SUFFIXES})"
      REGEX2="visitElementUnorderedAtomic(${CLASSES})"
      
      FILES=$( grep -E "(${REGEX}|${REGEX2})" -r . | tr ':' ' ' | awk '{print $1}' | sort | uniq )
      
      SED_SCRIPT="s~${REGEX}~\1Atomic\2\3~g"
      SED_SCRIPT2="s~${REGEX2}~visitAtomic\1~g"
      
      for f in $FILES; do
          echo "Processing: $f"
          sed  -i ".bak" -E "${SED_SCRIPT};${SED_SCRIPT2};${EA_SED_SCRIPT};${EA_SED_SCRIPT2}" $f
      done
      
      Reviewers: sanjoy, deadalnix, apilipenko, anna, skatkov, mkazantsev
      
      Reviewed By: sanjoy
      
      Subscribers: hfinkel, jholewinski, arsenm, sdardis, nhaehnle, JDevlieghere, javed.absar, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D38419
      
      llvm-svn: 316950
      f9c7d29c
    • Simon Pilgrim's avatar
      017f896a
    • Simon Pilgrim's avatar
      [SelectionDAG] Add VSELECT support to computeKnownBits · 96a0b9ef
      Simon Pilgrim authored
      llvm-svn: 316944
      96a0b9ef
    • Simon Pilgrim's avatar
      5da11dfd
  14. Oct 29, 2017
  15. Oct 28, 2017
  16. Oct 27, 2017
  17. Oct 25, 2017
    • Matt Arsenault's avatar
      DAG: Fix creating select with wrong condition type · 8a752b77
      Matt Arsenault authored
      This code added in r297930 assumed that it could create
      a select with a condition type that is just an integer
      bitcast of the selected type. For AMDGPU any vselect is
      going to be scalarized (although the vector types are legal),
      and all select conditions must be i1 (the same as getSetCCResultType).
      
      This logic doesn't really make sense to me, but there's
      never really been a consistent policy in what the select
      condition mask type is supposed to be. Try to extend
      the logic for skipping the transform for condition types
      that aren't setccs. It doesn't seem quite right to me though,
      but checking conditions that seem more sensible (like whether the
      vselect is going to be expanded) doesn't work since this
      seems to depend on that also.
      
      llvm-svn: 316554
      8a752b77
    • Adrian Prantl's avatar
      Implement salavageDebugInfo functionality for SelectionDAG. · 2eb7cbf9
      Adrian Prantl authored
      Similar to how llvm::salvagDebugInfo hooks into InstCombine, this adds
      a hook that can be invoked before an SDNode that is associated with an
      SDDbgValue is erased to capture the effect of the deleted node in a
      DIExpression.
      
      The motivating example is an SDDebugValue attached to an ADD operation
      that gets folded into a LOAD+OFFSET operation.
      
      rdar://problem/32121503
      
      llvm-svn: 316525
      2eb7cbf9
  18. Oct 24, 2017
Loading