Skip to content
  1. Oct 04, 2017
    • Jatin Bhateja's avatar
      [X86] Improvement in CodeGen instruction selection for LEAs (re-applying post... · 3c29bacd
      Jatin Bhateja authored
      [X86] Improvement in CodeGen instruction selection for LEAs (re-applying post required revision changes.)
      
      Summary:
         1/  Operand folding during complex pattern matching for LEAs has been
             extended, such that it promotes Scale to accommodate similar operand
             appearing in the DAG.
             e.g.
               T1 = A + B
               T2 = T1 + 10
               T3 = T2 + A
             For above DAG rooted at T3, X86AddressMode will no look like
               Base = B , Index = A , Scale = 2 , Disp = 10
      
         2/  During OptimizeLEAPass down the pipeline factorization is now performed over LEAs
             so that if there is an opportunity then complex LEAs (having 3 operands)
             could be factored out.
             e.g.
               leal 1(%rax,%rcx,1), %rdx
               leal 1(%rax,%rcx,2), %rcx
             will be factored as following
               leal 1(%rax,%rcx,1), %rdx
               leal (%rdx,%rcx)   , %edx
      
         3/ Aggressive operand folding for AM based selection for LEAs is sensitive to loops,
            thus avoiding creation of any complex LEAs within a loop.
      
      Reviewers: lsaba, RKSimon, craig.topper, qcolombet, jmolloy
      
      Reviewed By: lsaba
      
      Subscribers: jmolloy, spatel, igorb, llvm-commits
      
          Differential Revision: https://reviews.llvm.org/D35014
      
      llvm-svn: 314886
      3c29bacd
    • Sean Eveson's avatar
      ea9dceed
    • George Rimar's avatar
      [MC] - Don't assert when non-english characters are used. · 099960d3
      George Rimar authored
      I found that llvm-mc does not like non-english characters even in comments,
      which it tries to tokenize.
      
      Problem happens because of functions like isdigit(), isalnum() which takes
      int argument and expects it is not negative.
      But at the same time MCParser uses char* to store input buffer poiner, char has signed value,
      so it is possible to pass negative value to one of functions from above and
      that triggers an assert. 
      Testcase for demonstration is provided.
      
      To fix the issue helper functions were introduced in StringExtras.h
      
      Differential revision: https://reviews.llvm.org/D38461
      
      llvm-svn: 314883
      099960d3
    • Mikael Holmen's avatar
      Recommit [UnreachableBlockElim] Use COPY if PHI input is undef · a1a3f5c5
      Mikael Holmen authored
      This time invoking llc with "-march=x86-64" in the testcase, so we don't assume
      the default target is x86.
      
      Summary:
      If we have
      
          %vreg0<def> = PHI %vreg2<undef>, <BB#0>, %vreg3, <BB#2>; GR32:%vreg0,%vreg2,%vreg3
          %vreg3<def,tied1> = ADD32ri8 %vreg0<kill,tied0>, 1, %EFLAGS<imp-def>; GR32:%vreg3,%vreg0
      
      then we can't just change %vreg0 into %vreg3, since %vreg2 is actually
      undef. We would have to also copy the undef flag to be able to change the
      register.
      
      Instead we deal with this case like other cases where we can't just
      replace the register: we insert a COPY. The code creating the COPY already
      copied all flags from the PHI input, so the undef flag will be transferred
      as it should.
      
      Reviewers: kparzysz
      
      Reviewed By: kparzysz
      
      Subscribers: llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D38235
      
      llvm-svn: 314882
      a1a3f5c5
    • Max Kazantsev's avatar
      [IRCE] Temporarily disable unsigned latch conditions by default · 8aacef6c
      Max Kazantsev authored
      We have found some corner cases connected to range intersection where IRCE makes
      a bad thing when the latch condition is unsigned. The fix for that will go as a follow up.
      This patch temporarily disables IRCE for unsigned latch conditions until the issue is fixed.
      
      The unsigned latch conditions were introduced to IRCE by rL310027.
      
      Differential Revision: https://reviews.llvm.org/D38529
      
      llvm-svn: 314881
      8aacef6c
    • Mikael Holmen's avatar
      Revert r314879 "[UnreachableBlockElim] Use COPY if PHI input is undef" · 75b1992f
      Mikael Holmen authored
      Build-bots broke on the new testcase. I'll investigate and fix.
      
      llvm-svn: 314880
      75b1992f
    • Mikael Holmen's avatar
      [UnreachableBlockElim] Use COPY if PHI input is undef · 65eb2f39
      Mikael Holmen authored
      Summary:
      If we have
      
          %vreg0<def> = PHI %vreg2<undef>, <BB#0>, %vreg3, <BB#2>; GR32:%vreg0,%vreg2,%vreg3
          %vreg3<def,tied1> = ADD32ri8 %vreg0<kill,tied0>, 1, %EFLAGS<imp-def>; GR32:%vreg3,%vreg0
      
      then we can't just change %vreg0 into %vreg3, since %vreg2 is actually
      undef. We would have to also copy the undef flag to be able to change the
      register.
      
      Instead we deal with this case like other cases where we can't just
      replace the register: we insert a COPY. The code creating the COPY already
      copied all flags from the PHI input, so the undef flag will be transferred
      as it should.
      
      Reviewers: kparzysz
      
      Reviewed By: kparzysz
      
      Subscribers: llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D38235
      
      llvm-svn: 314879
      65eb2f39
    • Martin Storsjö's avatar
      [X86] Fix using the SJLJ jump table on x86_64 · e14145dc
      Martin Storsjö authored
      The previous version didn't work if the jump table base address didn't
      fit in 32 bit, since it was encoded as an immediate offset. And in case
      the jump table is encoded as 32 bit label differences, we need to
      load and add them to the table base first.
      
      This solves the first half of the issues mentioned in PR34720.
      
      Also fix some of the errors pointed out by -verify-machineinstrs, by
      using GR32_NOSPRegClass.
      
      Differential Revision: https://reviews.llvm.org/D38333
      
      llvm-svn: 314876
      e14145dc
    • Adam Nemet's avatar
      Move verbosity check for remarks to the diag handler · f31b1f31
      Adam Nemet authored
      Test needs some slight adjustment because we no longer check the existence of
      BFI but rather that the actual hotness is set on the remark.  If entry_count
      is not set getBlockProfileCount returns None.
      
      llvm-svn: 314874
      f31b1f31
    • Tim Shen's avatar
      [FuzzerUtil] Partially revert D38481 on FuzzerUtil · 83fd6a12
      Tim Shen authored
      This is because lib/Fuzzer doesn't really depend on llvm infrastucture.
      It's not easy to access the llvm hardware_concurrency here.
      
      Differential Reivision: https://reviews.llvm.org/D38481
      
      llvm-svn: 314870
      83fd6a12
    • Adrian Prantl's avatar
      Add a manpage for llvm-dwarfdump. · f01b3f3f
      Adrian Prantl authored
      llvm-svn: 314863
      f01b3f3f
    • Rui Ueyama's avatar
      Simplify multikey_qsort function. · 15b83279
      Rui Ueyama authored
      This function implements the three-way radix quicksort algorithm.
      This patch simplifies the implementation by using MutableArrayRef.
      
      llvm-svn: 314858
      15b83279
    • Balaram Makam's avatar
      [AArch64] Use LateSimplifyCFG after expanding atomic operations. · e0c43152
      Balaram Makam authored
      Summary:
      After r308422 we defer optimizations that can destroy loop canonical forms to
      LateSimplifyCFG. Running LateSimplifyCFG after expanding atomic operations
      can exploit more control-flow opportunities.
      
      Reviewers: mcrosier, t.p.northover, efriedma
      
      Reviewed By: efriedma
      
      Subscribers: aemerson, rengolin, javed.absar, llvm-commits, kristof.beyls
      
      Differential Revision: https://reviews.llvm.org/D38262
      
      llvm-svn: 314857
      e0c43152
    • Adrian Prantl's avatar
      e5b93f2e
  2. Oct 03, 2017
Loading