Skip to content
  1. Jan 26, 2017
    • Sean Fertile's avatar
      [PPC] cleanup of mayLoad/mayStore flags and memory operands. · 3c8c385a
      Sean Fertile authored
      1) Explicitly sets mayLoad/mayStore property in the tablegen files on load/store
         instructions.
      2) Updated the flags on a number of intrinsics indicating that they write
          memory.
      3) Added SDNPMemOperand flags for some target dependent SDNodes so that they
         propagate their memory operand
      
      Review: https://reviews.llvm.org/D28818
      llvm-svn: 293200
      3c8c385a
    • Stanislav Mekhanoshin's avatar
      Replace addEarlyAsPossiblePasses callback with adjustPassManager · 81598117
      Stanislav Mekhanoshin authored
      This change introduces adjustPassManager target callback giving a
      target an opportunity to tweak PassManagerBuilder before pass
      managers are populated.
      
      This generalizes and replaces addEarlyAsPossiblePasses target
      callback. In particular that can be used to add custom passes to
      extension points other than EP_EarlyAsPossible.
      
      Differential Revision: https://reviews.llvm.org/D28336
      
      llvm-svn: 293189
      81598117
    • Nirav Dave's avatar
      Revert "In visitSTORE, always use FindBetterChain, rather than only when UseAA is enabled." · d32a421f
      Nirav Dave authored
      This reverts commit r293184 which is failing in LTO builds
      
      llvm-svn: 293188
      d32a421f
    • Serge Rogatch's avatar
      [XRay][Arm32] Reduce the portion of the stub and implement more staging for tail calls - in LLVM · e09ba748
      Serge Rogatch authored
      Summary:
      This patch provides more staging for tail calls in XRay Arm32 . When the logging part of XRay is ready for tail calls, its support in the core part of XRay Arm32 may be as easy as changing the number passed to the handler from 1 to 2.
      Coupled patch:
      - https://reviews.llvm.org/D28674
      
      Reviewers: dberris, rengolin
      
      Reviewed By: dberris
      
      Subscribers: llvm-commits, iid_iunknown, aemerson, rengolin, dberris
      
      Differential Revision: https://reviews.llvm.org/D28673
      
      llvm-svn: 293185
      e09ba748
    • Nirav Dave's avatar
      In visitSTORE, always use FindBetterChain, rather than only when UseAA is enabled. · de6516c4
      Nirav Dave authored
          * Simplify Consecutive Merge Store Candidate Search
      
          Now that address aliasing is much less conservative, push through
          simplified store merging search and chain alias analysis which only
          checks for parallel stores through the chain subgraph. This is cleaner
          as the separation of non-interfering loads/stores from the
          store-merging logic.
      
          When merging stores search up the chain through a single load, and
          finds all possible stores by looking down from through a load and a
          TokenFactor to all stores visited.
      
          This improves the quality of the output SelectionDAG and the output
          Codegen (save perhaps for some ARM cases where we correctly constructs
          wider loads, but then promotes them to float operations which appear
          but requires more expensive constant generation).
      
          Some minor peephole optimizations to deal with improved SubDAG shapes (listed below)
      
          Additional Minor Changes:
      
            1. Finishes removing unused AliasLoad code
      
            2. Unifies the chain aggregation in the merged stores across code
               paths
      
            3. Re-add the Store node to the worklist after calling
               SimplifyDemandedBits.
      
            4. Increase GatherAllAliasesMaxDepth from 6 to 18. That number is
               arbitrary, but seems sufficient to not cause regressions in
               tests.
      
            5. Remove Chain dependencies of Memory operations on CopyfromReg
               nodes as these are captured by data dependence
      
            6. Forward loads-store values through tokenfactors containing
                {CopyToReg,CopyFromReg} Values.
      
            7. Peephole to convert buildvector of extract_vector_elt to
               extract_subvector if possible (see
               CodeGen/AArch64/store-merge.ll)
      
            8. Store merging for the ARM target is restricted to 32-bit as
               some in some contexts invalid 64-bit operations are being
               generated. This can be removed once appropriate checks are
               added.
      
          This finishes the change Matt Arsenault started in r246307 and
          jyknight's original patch.
      
          Many tests required some changes as memory operations are now
          reorderable, improving load-store forwarding. One test in
          particular is worth noting:
      
            CodeGen/PowerPC/ppc64-align-long-double.ll - Improved load-store
            forwarding converts a load-store pair into a parallel store and
            a memory-realized bitcast of the same value. However, because we
            lose the sharing of the explicit and implicit store values we
            must create another local store. A similar transformation
            happens before SelectionDAG as well.
      
          Reviewers: arsenm, hfinkel, tstellarAMD, jyknight, nhaehnle
      
      llvm-svn: 293184
      de6516c4
    • Rafael Espindola's avatar
      Use shouldAssumeDSOLocal in classifyGlobalReference. · 82149a1a
      Rafael Espindola authored
      And teach shouldAssumeDSOLocal that ppc has no copy relocations.
      
      The resulting code handle a few more case than before. For example, it
      knows that a weak symbol can be resolved to another .o file, but it
      will still be in the main executable.
      
      llvm-svn: 293180
      82149a1a
    • Simon Pilgrim's avatar
      027bb453
    • Simon Pilgrim's avatar
      [X86][SSE] Pull out target shuffle resolve code into helper. NFCI. · 3057fd53
      Simon Pilgrim authored
      Pulled out code that removed unused inputs from a target shuffle mask into a helper function to allow it to be reused in a future commit.
      
      llvm-svn: 293175
      3057fd53
    • Valery Pykhtin's avatar
      [AMDGPU] Fix typo in GCNSchedStrategy · 75d1de90
      Valery Pykhtin authored
      Differential revision: https://reviews.llvm.org/D28980
      
      llvm-svn: 293171
      75d1de90
    • Simon Dardis's avatar
      Revert "[mips] N64 static relocation model support" · 5b67a4f7
      Simon Dardis authored
      This reverts commit r293164. There are multiple tests failing.
      
      llvm-svn: 293170
      5b67a4f7
    • Simon Dardis's avatar
      [mips] N64 static relocation model support · 09e65efd
      Simon Dardis authored
      This patch makes one change to GOT handling and two changes to N64's
      relocation model handling. Furthermore, the jumptable encodings have
      been corrected for static N64.
      
      Big GOT handling is now done via a new SDNode MipsGotHi - this node is
      unconditionally lowered to an lui instruction.
      
      The first change to N64's relocation handling is the lifting of the
      restriction that N64 always uses PIC. Now it is possible to target static
      environments.
      
      The second change adds support for 64 bit symbols and enables them by
      default. Previously N64 had patterns for sym32 mode only. In this mode all
      symbols are assumed to have 32 bit addresses. sym32 mode support
      is selectable with attribute 'sym32'. A follow on patch for clang will
      add the necessary frontend parameter.
      
      This partially resolves PR/23485.
      
      Thanks to Brooks Davis for reporting the issue!
      
      Reviewers: dsanders, seanbruno, zoran.jovanovic, vkalintiris
      
      Differential Revision: https://reviews.llvm.org/D23652
      
      llvm-svn: 293164
      09e65efd
    • Diana Picus's avatar
      [ARM] GlobalISel: Load i1, i8 and i16 args from stack · 278c722e
      Diana Picus authored
      Add support for loading i1, i8 and i16 arguments from the stack, with or without
      the ABI extension flags.
      
      When the ABI extension flags are present, we load a 4-byte value, otherwise we
      preserve the size of the load and let the instruction selector replace it with a
      LDRB/LDRH. This generates the same thing as DAGISel.
      
      Differential Revision: https://reviews.llvm.org/D27803
      
      llvm-svn: 293163
      278c722e
    • Craig Topper's avatar
      [AVX-512] Move the combine that runs combineBitcastForMaskedOp to the last DAG... · bad53cce
      Craig Topper authored
      [AVX-512] Move the combine that runs combineBitcastForMaskedOp to the last DAG combine phase where I had originally meant to put it.
      
      llvm-svn: 293157
      bad53cce
    • Craig Topper's avatar
      [X86] When bitcasting INSERT_SUBVECTOR/EXTRACT_SUBVECTOR to match masked... · f0bab7b7
      Craig Topper authored
      [X86] When bitcasting INSERT_SUBVECTOR/EXTRACT_SUBVECTOR to match masked operations, use the correct type for the immediate operand.
      
      llvm-svn: 293156
      f0bab7b7
    • Jonas Paulsson's avatar
      [TargetTransformInfo] Refactor and improve getScalarizationOverhead() · 8e2f948e
      Jonas Paulsson authored
      Refactoring to remove duplications of this method.
      
      New method getOperandsScalarizationOverhead() that looks at the present unique
      operands and add extract costs for them. Old behaviour was to just add extract
      costs for one operand of the type always, which still happens in
      getArithmeticInstrCost() if no operands are provided by the caller.
      
      This is a good start of improving on this, but there are more places
      that can be improved by using getOperandsScalarizationOverhead().
      
      Review: Hal Finkel
      https://reviews.llvm.org/D29017
      
      llvm-svn: 293155
      8e2f948e
    • Matt Arsenault's avatar
      AMDGPU: Fold fneg into round instructions · 53f0cc23
      Matt Arsenault authored
      llvm-svn: 293127
      53f0cc23
  2. Jan 25, 2017
  3. Jan 24, 2017
Loading