Skip to content
  1. Oct 10, 2017
  2. Oct 09, 2017
  3. Oct 08, 2017
  4. Oct 07, 2017
  5. Oct 05, 2017
    • Mikael Holmen's avatar
      Minor refactoring regarding Cast::isNoopCast(), NFC · 0ec1d25d
      Mikael Holmen authored
      Summary:
      FastISel::hasTrivialKill() was the only user of the "IntPtrTy" version of
      Cast::isNoopCast(). According to review comments in D37894 we could instead
      use the "DataLayout" version of the method, and thus get rid of the
      "IntPtrTy" versions of isNoopCast() completely.
      
      With the above done, the remaining isNoopCast() could then be simplified
      a bit more.
      
      Reviewers: arsenm
      
      Reviewed By: arsenm
      
      Subscribers: wdng, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D38497
      
      llvm-svn: 314969
      0ec1d25d
  6. Oct 04, 2017
    • Hans Wennborg's avatar
      Revert r314886 "[X86] Improvement in CodeGen instruction selection for LEAs... · 2a6c9adb
      Hans Wennborg authored
      Revert r314886 "[X86] Improvement in CodeGen instruction selection for LEAs (re-applying post required revision changes.)"
      
      It broke the Chromium / SQLite build; see PR34830.
      
      > Summary:
      >    1/  Operand folding during complex pattern matching for LEAs has been
      >        extended, such that it promotes Scale to accommodate similar operand
      >        appearing in the DAG.
      >        e.g.
      >          T1 = A + B
      >          T2 = T1 + 10
      >          T3 = T2 + A
      >        For above DAG rooted at T3, X86AddressMode will no look like
      >          Base = B , Index = A , Scale = 2 , Disp = 10
      >
      >    2/  During OptimizeLEAPass down the pipeline factorization is now performed over LEAs
      >        so that if there is an opportunity then complex LEAs (having 3 operands)
      >        could be factored out.
      >        e.g.
      >          leal 1(%rax,%rcx,1), %rdx
      >          leal 1(%rax,%rcx,2), %rcx
      >        will be factored as following
      >          leal 1(%rax,%rcx,1), %rdx
      >          leal (%rdx,%rcx)   , %edx
      >
      >    3/ Aggressive operand folding for AM based selection for LEAs is sensitive to loops,
      >       thus avoiding creation of any complex LEAs within a loop.
      >
      > Reviewers: lsaba, RKSimon, craig.topper, qcolombet, jmolloy
      >
      > Reviewed By: lsaba
      >
      > Subscribers: jmolloy, spatel, igorb, llvm-commits
      >
      >     Differential Revision: https://reviews.llvm.org/D35014
      
      llvm-svn: 314919
      2a6c9adb
    • Jatin Bhateja's avatar
      [X86] Improvement in CodeGen instruction selection for LEAs (re-applying post... · 3c29bacd
      Jatin Bhateja authored
      [X86] Improvement in CodeGen instruction selection for LEAs (re-applying post required revision changes.)
      
      Summary:
         1/  Operand folding during complex pattern matching for LEAs has been
             extended, such that it promotes Scale to accommodate similar operand
             appearing in the DAG.
             e.g.
               T1 = A + B
               T2 = T1 + 10
               T3 = T2 + A
             For above DAG rooted at T3, X86AddressMode will no look like
               Base = B , Index = A , Scale = 2 , Disp = 10
      
         2/  During OptimizeLEAPass down the pipeline factorization is now performed over LEAs
             so that if there is an opportunity then complex LEAs (having 3 operands)
             could be factored out.
             e.g.
               leal 1(%rax,%rcx,1), %rdx
               leal 1(%rax,%rcx,2), %rcx
             will be factored as following
               leal 1(%rax,%rcx,1), %rdx
               leal (%rdx,%rcx)   , %edx
      
         3/ Aggressive operand folding for AM based selection for LEAs is sensitive to loops,
            thus avoiding creation of any complex LEAs within a loop.
      
      Reviewers: lsaba, RKSimon, craig.topper, qcolombet, jmolloy
      
      Reviewed By: lsaba
      
      Subscribers: jmolloy, spatel, igorb, llvm-commits
      
          Differential Revision: https://reviews.llvm.org/D35014
      
      llvm-svn: 314886
      3c29bacd
  7. Oct 03, 2017
    • Bjorn Pettersson's avatar
      [DebugInfo] Handle endianness when moving debug info for split integer values (reapplied) · 90cc1b53
      Bjorn Pettersson authored
      Summary:
      Take the target's endianness into account when splitting the
      debug information in DAGTypeLegalizer::SetExpandedInteger.
      
      This patch fixes so that, for big-endian targets, the fragment
      expression corresponding to the high part of a split integer
      value is placed at offset 0, in order to correctly represent
      the memory address order.
      
      I have attached a PPC32 reproducer where the resulting DWARF
      pieces for a 64-bit integer were incorrectly reversed.
      
      Original patch was reverted due to using -stop-after=isel in
      the test case (but that is only working when AMDGPU target
      is included in the llc build). The test case has now been
      updated to use -stop-before=expand-isel-pseudos instead.
      
      Patch by: dstenb
      
      Reviewers: JDevlieghere, aprantl, dblaikie
      
      Reviewed By: JDevlieghere, aprantl, dblaikie
      
      Subscribers: nemanjai
      
      Differential Revision: https://reviews.llvm.org/D38172
      
      llvm-svn: 314781
      90cc1b53
    • Sjoerd Meijer's avatar
      ISel type legalization: add debug messages. NFCI. · 7a22a494
      Sjoerd Meijer authored
      This adds some more debug messages to the type legalizer and functions
      like PromoteNode, ExpandNode, ExpandLibCall in an attempt to make
      the debug messages a little bit more informative and useful.
      
      Differential Revision: https://reviews.llvm.org/D38450
      
      llvm-svn: 314773
      7a22a494
    • Tim Shen's avatar
      [PowerPC] Revert r314666. · 59465d29
      Tim Shen authored
      See https://reviews.llvm.org/D38172.
      
      I tried to XFAIL it, but sometimes XPASS triggers the bot. Simply
      revert it.
      
      llvm-svn: 314739
      59465d29
  8. Oct 02, 2017
  9. Sep 30, 2017
    • Yaxun Liu's avatar
      CodeGen: Fix pointer info in expandUnalignedLoad/Store · b33607e5
      Yaxun Liu authored
      Currently expandUnalignedLoad/Store uses place holder pointer info for temporary memory operand
      in stack, which does not have correct address space. This causes unaligned private double16 load/store to be
      lowered to flat_load instead of buffer_load for amdgcn target.
      
      This fixes failures of OpenCL conformance test basic/vload_private/vstore_private on target amdgcn---amdgizcl.
      
      Differential Revision: https://reviews.llvm.org/D35361
      
      llvm-svn: 314566
      b33607e5
  10. Sep 28, 2017
  11. Sep 27, 2017
  12. Sep 25, 2017
    • Craig Topper's avatar
      [SelectionDAG] Teach simplifyDemandedBits to handle shifts by constant splat vectors · 5bc10ede
      Craig Topper authored
      This teach simplifyDemandedBits to handle constant splat vector shifts.
      
      This required changing some uses of getZExtValue to getLimitedValue since we can't rely on legalization using getShiftAmountTy for the shift amount.
      
      I believe there may have been a bug in the ((X << C1) >>u ShAmt) handling where we didn't check if the inner shift was too large. I've fixed that here.
      
      I had to add new patterns to ARM because the zext/sext the patterns were trying to look for got turned into an any_extend with this patch. Happy to split that out too, but not sure how to test without this change.
      
      Differential Revision: https://reviews.llvm.org/D37665
      
      llvm-svn: 314139
      5bc10ede
    • Reid Kleckner's avatar
      [DebugInfo] Sort the SDDbgValue list before assuming it is in IR order · 8898cd8d
      Reid Kleckner authored
      Summary:
      This code iterates the 'Orders' vector in parallel with the DbgValue
      list, emitting all DBG_VALUEs that occurred between the last IR order
      insertion point and the next insertion point. This assumes the
      SDDbgValue list is sorted in IR order, which it usually is. However, it
      is not sorted when a node with a debug value is replaced with another
      one. When this happens, TransferDbgValues is called, and the new value
      is added to the end of the list.
      
      The problem can be solved by stably sorting the list by IR order.
      
      Reviewers: aprantl, Ka-Ka
      
      Reviewed By: aprantl
      
      Subscribers: MatzeB, hiraditya, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D38197
      
      llvm-svn: 314114
      8898cd8d
    • Reid Kleckner's avatar
      09e75c93
  13. Sep 22, 2017
  14. Sep 21, 2017
    • Craig Topper's avatar
      [DAGCombiner] Slightly simplify some code by using APInt::isMask() and... · de437925
      Craig Topper authored
      [DAGCombiner] Slightly simplify some code by using APInt::isMask() and countTrailingOnes instead of getting active bits and checking if all the bits below that make a mask.
      
      At least for the 64-bit and less case, we should be able to determine if we even have a mask without counting any bits. This also removes the need to explicitly check for 0 active bits, isMask will return false for 0.
      
      llvm-svn: 313908
      de437925
    • Reid Kleckner's avatar
      Re-land r313825: "[IR] Add llvm.dbg.addr, a control-dependent version of llvm.dbg.declare" · 0fe506bc
      Reid Kleckner authored
      The fix is to avoid invalidating our insertion point in
      replaceDbgDeclare:
           Builder.insertDeclare(NewAddress, DIVar, DIExpr, Loc, InsertBefore);
      +    if (DII == InsertBefore)
      +      InsertBefore = &*std::next(InsertBefore->getIterator());
           DII->eraseFromParent();
      
      I had to write a unit tests for this instead of a lit test because the
      use list order matters in order to trigger the bug.
      
      The reduced C test case for this was:
        void useit(int*);
        static inline void inlineme() {
          int x[2];
          useit(x);
        }
        void f() {
          inlineme();
          inlineme();
        }
      
      llvm-svn: 313905
      0fe506bc
    • Bjorn Pettersson's avatar
      [SelectionDAG] Pick correct frame index in LowerArguments · 0dde08c3
      Bjorn Pettersson authored
      Summary:
      SelectionDAGISel::LowerArguments is associating arguments
      with frame indices (FuncInfo->setArgumentFrameIndex). That
      information is later on used by EmitFuncArgumentDbgValue to
      create DBG_VALUE instructions that denotes that a variable
      can be found on the stack.
      
      I discovered that for our (big endian) out-of-tree target
      the association created by SelectionDAGISel::LowerArguments
      sometimes is wrong. I've seen this happen when a 64-bit value
      is passed on the stack. The argument will occupy two stack
      slots (frame index X, and frame index X+1). The fault is
      that a call to setArgumentFrameIndex is associating the
      64-bit argument with frame index X+1. The effect is that the
      debug information (DBG_VALUE) will point at the least significant
      part of the arguement on the stack. When printing the
      argument in a debugger I will get the wrong value.
      
      I managed to create a test case for PowerPC that seems to
      show the same kind of problem.
      
      The bugfix will look at the datalayout, taking endianness into
      account when examining a BUILD_PAIR node, assuming that the
      least significant part is in the first operand of the BUILD_PAIR.
      For big endian targets we should use the frame index from
      the second operand, as the most significant part will be stored
      at the lower address (using the highest frame index).
      
      Reviewers: bogner, rnk, hfinkel, sdardis, aprantl
      
      Reviewed By: aprantl
      
      Subscribers: nemanjai, aprantl, llvm-commits, igorb
      
      Differential Revision: https://reviews.llvm.org/D37740
      
      llvm-svn: 313901
      0dde08c3
    • Craig Topper's avatar
      [DAGCombiner] Remove duplicate code from visitZERO_EXTEND · 280f1337
      Craig Topper authored
      This exact block of code exists right below.
      
      Differential Revision: https://reviews.llvm.org/D38122
      
      llvm-svn: 313891
      280f1337
    • Daniel Jasper's avatar
      Revert r313825: "[IR] Add llvm.dbg.addr, a control-dependent version of llvm.dbg.declare" · 7d2f38d6
      Daniel Jasper authored
      .. as well as the two subsequent changes r313826 and r313875.
      
      This leads to segfaults in combination with ASAN. Will forward repro
      instructions to the original author (rnk).
      
      llvm-svn: 313876
      7d2f38d6
    • Craig Topper's avatar
      [SelectionDAG] Replace a flag that can never be true with an assert. · f0ba3003
      Craig Topper authored
      llvm-svn: 313847
      f0ba3003
    • Craig Topper's avatar
      eb0f71f2
  15. Sep 20, 2017
    • Reid Kleckner's avatar
      [IR] Add llvm.dbg.addr, a control-dependent version of llvm.dbg.declare · 3f547e87
      Reid Kleckner authored
      Summary:
      This implements the design discussed on llvm-dev for better tracking of
      variables that live in memory through optimizations:
        http://lists.llvm.org/pipermail/llvm-dev/2017-September/117222.html
      
      This is tracked as PR34136
      
      llvm.dbg.addr is intended to be produced and used in almost precisely
      the same way as llvm.dbg.declare is today, with the exception that it is
      control-dependent. That means that dbg.addr should always have a
      position in the instruction stream, and it will allow passes that
      optimize memory operations on local variables to insert llvm.dbg.value
      calls to reflect deleted stores. See SourceLevelDebugging.rst for more
      details.
      
      The main drawback to generating DBG_VALUE machine instrs is that they
      usually cause LLVM to emit a location list for DW_AT_location. The next
      step will be to teach DwarfDebug.cpp how to recognize more DBG_VALUE
      ranges as not needing a location list, and possibly start setting
      DW_AT_start_offset for variables whose lifetimes begin mid-scope.
      
      Reviewers: aprantl, dblaikie, probinson
      
      Subscribers: eraman, hiraditya, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D37768
      
      llvm-svn: 313825
      3f547e87
    • Saleem Abdulrasool's avatar
      CodeGen: support SwiftError SwiftCC on Windows x64 · 432b88e5
      Saleem Abdulrasool authored
      Add support for passing SwiftError through a register on the Windows x64
      calling convention.  This allows the use of swifterror attributes on
      parameters which is used by the swift front end for the `Error`
      parameter.  This partially enables building the swift standard library
      for Windows x86_64.
      
      llvm-svn: 313791
      432b88e5
    • Saleem Abdulrasool's avatar
      CodeGen: use range based for loops (NFC) · 399a4e9b
      Saleem Abdulrasool authored
      Simplify the RPOT traversal by using a range based for loop for the
      iterator dereference.
      
      llvm-svn: 313687
      399a4e9b
  16. Sep 19, 2017
    • Sanjay Patel's avatar
      [DAGCombiner] fold assertzexts separated by trunc · f31b1a00
      Sanjay Patel authored
      If we have an AssertZext of a truncated value that has already been AssertZext'ed, 
      we can assert on the wider source op to improve the zext-y knowledge:
       assert (trunc (assert X, i8) to iN), i1 --> trunc (assert X, i1) to iN
      
      This moves a fold from being Mips-specific to general combining, and x86 shows
      improvements.
      
      Differential Revision: https://reviews.llvm.org/D37017
      
      llvm-svn: 313577
      f31b1a00
  17. Sep 18, 2017
  18. Sep 15, 2017
    • Hans Wennborg's avatar
      Revert r313343 "[X86] PR32755 : Improvement in CodeGen instruction selection for LEAs." · 534bfbd3
      Hans Wennborg authored
      This caused PR34629: asserts firing when building Chromium. It also broke some
      buildbots building test-suite as reported on the commit thread.
      
      > Summary:
      >    1/  Operand folding during complex pattern matching for LEAs has been
      >        extended, such that it promotes Scale to accommodate similar operand
      >        appearing in the DAG.
      >        e.g.
      >           T1 = A + B
      >           T2 = T1 + 10
      >           T3 = T2 + A
      >        For above DAG rooted at T3, X86AddressMode will no look like
      >           Base = B , Index = A , Scale = 2 , Disp = 10
      >
      >    2/  During OptimizeLEAPass down the pipeline factorization is now performed over LEAs
      >        so that if there is an opportunity then complex LEAs (having 3 operands)
      >        could be factored out.
      >        e.g.
      >           leal 1(%rax,%rcx,1), %rdx
      >           leal 1(%rax,%rcx,2), %rcx
      >        will be factored as following
      >           leal 1(%rax,%rcx,1), %rdx
      >           leal (%rdx,%rcx)   , %edx
      >
      >    3/ Aggressive operand folding for AM based selection for LEAs is sensitive to loops,
      >       thus avoiding creation of any complex LEAs within a loop.
      >
      > Reviewers: lsaba, RKSimon, craig.topper, qcolombet
      >
      > Reviewed By: lsaba
      >
      > Subscribers: spatel, igorb, llvm-commits
      >
      > Differential Revision: https://reviews.llvm.org/D35014
      
      llvm-svn: 313376
      534bfbd3
    • Jatin Bhateja's avatar
      [X86] PR32755 : Improvement in CodeGen instruction selection for LEAs. · 908c8b37
      Jatin Bhateja authored
      Summary:
         1/  Operand folding during complex pattern matching for LEAs has been
             extended, such that it promotes Scale to accommodate similar operand
             appearing in the DAG.
             e.g.
                T1 = A + B
                T2 = T1 + 10
                T3 = T2 + A
             For above DAG rooted at T3, X86AddressMode will no look like
                Base = B , Index = A , Scale = 2 , Disp = 10
      
         2/  During OptimizeLEAPass down the pipeline factorization is now performed over LEAs
             so that if there is an opportunity then complex LEAs (having 3 operands)
             could be factored out.
             e.g.
                leal 1(%rax,%rcx,1), %rdx
                leal 1(%rax,%rcx,2), %rcx
             will be factored as following
                leal 1(%rax,%rcx,1), %rdx
                leal (%rdx,%rcx)   , %edx
      
         3/ Aggressive operand folding for AM based selection for LEAs is sensitive to loops,
            thus avoiding creation of any complex LEAs within a loop.
      
      Reviewers: lsaba, RKSimon, craig.topper, qcolombet
      
      Reviewed By: lsaba
      
      Subscribers: spatel, igorb, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D35014
      
      llvm-svn: 313343
      908c8b37
  19. Sep 14, 2017
  20. Sep 13, 2017
Loading