Skip to content
  1. Apr 05, 2016
    • Manman Ren's avatar
      Swift Calling Convention: swifterror target-independent change. · e221a870
      Manman Ren authored
      At IR level, the swifterror argument is an input argument with type
      ErrorObject**. For targets that support swifterror, we want to optimize it
      to behave as an inout value with type ErrorObject*; it will be passed in a
      fixed physical register.
      
      The main idea is to track the virtual registers for each swifterror value. We
      define swifterror values as AllocaInsts with swifterror attribute or a function
      argument with swifterror attribute.
      
      In SelectionDAGISel.cpp, we set up swifterror values (SwiftErrorVals) before
      handling the basic blocks.
      
      When iterating over all basic blocks in RPO, before actually visiting the basic
      block, we call mergeIncomingSwiftErrors to merge incoming swifterror values when
      there are multiple predecessors or to simply propagate them. There, we create a
      virtual register for each swifterror value in the entry block. For predecessors
      that are not yet visited, we create virtual registers to hold the swifterror
      values at the end of the predecessor. The assignments are saved in
      SwiftErrorWorklist and will be materialized at the end of visiting the basic
      block.
      
      When visiting a load from a swifterror value, we copy from the current virtual
      register assignment. When visiting a store to a swifterror value, we create a
      virtual register to hold the swifterror value and update SwiftErrorMap to
      track the current virtual register assignment.
      
      Differential Revision: http://reviews.llvm.org/D18108
      
      llvm-svn: 265433
      e221a870
  2. Mar 25, 2016
  3. Mar 19, 2016
  4. Mar 07, 2016
  5. Feb 02, 2016
  6. Jan 31, 2016
  7. Jan 30, 2016
  8. Jan 23, 2016
  9. Jan 14, 2016
    • David Majnemer's avatar
      [X86] Don't alter HasOpaqueSPAdjustment after we've relied on it · 3463e696
      David Majnemer authored
      We rely on HasOpaqueSPAdjustment not changing after we've calculated
      things based on it.  Things like whether or not we can use 'rep;movs' to
      copy bytes around, that sort of thing.  If it changes, invariants in the
      backend will quietly break.  This situation arose when we had a call to
      memcpy *and* a COPY of the FLAGS register where we would attempt to
      reference local variables using %esi, a register that was clobbered by
      the 'rep;movs'.
      
      This fixes PR26124.
      
      llvm-svn: 257730
      3463e696
  10. Jan 04, 2016
    • David Majnemer's avatar
      [X86] Make hasFP constant time · ca1c9f07
      David Majnemer authored
      We need a frame pointer if there is a push/pop sequence after the
      prologue in order to unwind the stack.  Scanning the instructions to
      figure out if this happened made hasFP not constant-time which is a
      violation of expectations.  Let's compute this up-front and reuse that
      computation when we need it.
      
      llvm-svn: 256730
      ca1c9f07
  11. Dec 16, 2015
  12. Dec 11, 2015
    • Manman Ren's avatar
      CXX_FAST_TLS calling convention: target independent portion. · abc7c1d1
      Manman Ren authored
      The access function has a short entry and a short exit, the initialization
      block is only run the first time. To improve the performance, we want to
      have a short frame at the entry and exit.
      
      We explicitly handle most of the CSRs via copies. Only the CSRs that are not
      handled via copies will be in CSR_SaveList.
      
      Frame lowering and prologue/epilogue insertion will generate a short frame
      in the entry and exit according to CSR_SaveList. The majority of the CSRs will
      be handled by register allcoator. Register allocator will try to spill and
      reload them in the initialization block.
      
      We add CSRsViaCopy, it will be explicitly handled during lowering.
      
      1> we first set FunctionLoweringInfo->SplitCSR if conditions are met (the target
         supports it for the given calling convention and the function has only return
         exits). We also call TLI->initializeSplitCSR to perform initialization.
      2> we call TLI->insertCopiesSplitCSR to insert copies from CSRsViaCopy to
         virtual registers at beginning of the entry block and copies from virtual
         registers to CSRsViaCopy at beginning of the exit blocks.
      3> we also need to make sure the explicit copies will not be eliminated.
      
      rdar://problem/23557469
      
      Differential Revision: http://reviews.llvm.org/D15340
      
      llvm-svn: 255353
      abc7c1d1
  13. Dec 03, 2015
  14. Nov 30, 2015
  15. Nov 24, 2015
    • Cong Hou's avatar
      Let SelectionDAG start to use probability-based interface to add successors. · 1938f2eb
      Cong Hou authored
      The patch in http://reviews.llvm.org/D13745 is broken into four parts:
      
      1. New interfaces without functional changes.
      2. Use new interfaces in SelectionDAG, while in other passes treat probabilities
      as weights.
      3. Use new interfaces in all other passes.
      4. Remove old interfaces.
      
      This the second patch above. In this patch SelectionDAG starts to use
      probability-based interfaces in MBB to add successors but other MC passes are
      still using weight-based interfaces. Therefore, we need to maintain correct
      weight list in MBB even when probability-based interfaces are used. This is
      done by updating weight list in probability-based interfaces by treating the
      numerator of probabilities as weights. This change affects many test cases
      that check successor weight values. I will update those test cases once this
      patch looks good to you.
      
      
      Differential revision: http://reviews.llvm.org/D14361
      
      llvm-svn: 253965
      1938f2eb
  16. Nov 20, 2015
  17. Nov 07, 2015
    • Joseph Tremoulet's avatar
      [WinEH] Update exception pointer registers · f748c893
      Joseph Tremoulet authored
      Summary:
      The CLR's personality routine passes these in rdx/edx, not rax/eax.
      
      Make getExceptionPointerRegister a virtual method parameterized by
      personality function to allow making this distinction.
      
      Similarly make getExceptionSelectorRegister a virtual method parameterized
      by personality function, for symmetry.
      
      
      Reviewers: pgavlin, majnemer, rnk
      
      Subscribers: jyknight, dsanders, llvm-commits
      
      Differential Revision: http://reviews.llvm.org/D14344
      
      llvm-svn: 252383
      f748c893
  18. Oct 13, 2015
  19. Oct 12, 2015
    • Reid Kleckner's avatar
      Don't call PrepareEHLandingPad on non EH pads · 9abb3c06
      Reid Kleckner authored
      This was a minor bug in r249492. Calling PrepareEHLandingPad on a
      non-landingpad was a no-op, but it attempted to get the generic pointer
      register class, which apparently doesn't exist for some targets.
      
      llvm-svn: 250068
      9abb3c06
  20. Oct 10, 2015
    • Reid Kleckner's avatar
      [WinEH] Delete the old landingpad implementation of Windows EH · 14e77350
      Reid Kleckner authored
      The new implementation works at least as well as the old implementation
      did.
      
      Also delete the associated preparation tests. They don't exercise
      interesting corner cases of the new implementation. All the codegen
      tests of the EH tables have already been ported.
      
      llvm-svn: 249918
      14e77350
  21. Oct 09, 2015
  22. Oct 07, 2015
  23. Oct 06, 2015
  24. Sep 10, 2015
  25. Sep 09, 2015
    • Chandler Carruth's avatar
      [PM/AA] Rebuild LLVM's alias analysis infrastructure in a way compatible · 7b560d40
      Chandler Carruth authored
      with the new pass manager, and no longer relying on analysis groups.
      
      This builds essentially a ground-up new AA infrastructure stack for
      LLVM. The core ideas are the same that are used throughout the new pass
      manager: type erased polymorphism and direct composition. The design is
      as follows:
      
      - FunctionAAResults is a type-erasing alias analysis results aggregation
        interface to walk a single query across a range of results from
        different alias analyses. Currently this is function-specific as we
        always assume that aliasing queries are *within* a function.
      
      - AAResultBase is a CRTP utility providing stub implementations of
        various parts of the alias analysis result concept, notably in several
        cases in terms of other more general parts of the interface. This can
        be used to implement only a narrow part of the interface rather than
        the entire interface. This isn't really ideal, this logic should be
        hoisted into FunctionAAResults as currently it will cause
        a significant amount of redundant work, but it faithfully models the
        behavior of the prior infrastructure.
      
      - All the alias analysis passes are ported to be wrapper passes for the
        legacy PM and new-style analysis passes for the new PM with a shared
        result object. In some cases (most notably CFL), this is an extremely
        naive approach that we should revisit when we can specialize for the
        new pass manager.
      
      - BasicAA has been restructured to reflect that it is much more
        fundamentally a function analysis because it uses dominator trees and
        loop info that need to be constructed for each function.
      
      All of the references to getting alias analysis results have been
      updated to use the new aggregation interface. All the preservation and
      other pass management code has been updated accordingly.
      
      The way the FunctionAAResultsWrapperPass works is to detect the
      available alias analyses when run, and add them to the results object.
      This means that we should be able to continue to respect when various
      passes are added to the pipeline, for example adding CFL or adding TBAA
      passes should just cause their results to be available and to get folded
      into this. The exception to this rule is BasicAA which really needs to
      be a function pass due to using dominator trees and loop info. As
      a consequence, the FunctionAAResultsWrapperPass directly depends on
      BasicAA and always includes it in the aggregation.
      
      This has significant implications for preserving analyses. Generally,
      most passes shouldn't bother preserving FunctionAAResultsWrapperPass
      because rebuilding the results just updates the set of known AA passes.
      The exception to this rule are LoopPass instances which need to preserve
      all the function analyses that the loop pass manager will end up
      needing. This means preserving both BasicAAWrapperPass and the
      aggregating FunctionAAResultsWrapperPass.
      
      Now, when preserving an alias analysis, you do so by directly preserving
      that analysis. This is only necessary for non-immutable-pass-provided
      alias analyses though, and there are only three of interest: BasicAA,
      GlobalsAA (formerly GlobalsModRef), and SCEVAA. Usually BasicAA is
      preserved when needed because it (like DominatorTree and LoopInfo) is
      marked as a CFG-only pass. I've expanded GlobalsAA into the preserved
      set everywhere we previously were preserving all of AliasAnalysis, and
      I've added SCEVAA in the intersection of that with where we preserve
      SCEV itself.
      
      One significant challenge to all of this is that the CGSCC passes were
      actually using the alias analysis implementations by taking advantage of
      a pretty amazing set of loop holes in the old pass manager's analysis
      management code which allowed analysis groups to slide through in many
      cases. Moving away from analysis groups makes this problem much more
      obvious. To fix it, I've leveraged the flexibility the design of the new
      PM components provides to just directly construct the relevant alias
      analyses for the relevant functions in the IPO passes that need them.
      This is a bit hacky, but should go away with the new pass manager, and
      is already in many ways cleaner than the prior state.
      
      Another significant challenge is that various facilities of the old
      alias analysis infrastructure just don't fit any more. The most
      significant of these is the alias analysis 'counter' pass. That pass
      relied on the ability to snoop on AA queries at different points in the
      analysis group chain. Instead, I'm planning to build printing
      functionality directly into the aggregation layer. I've not included
      that in this patch merely to keep it smaller.
      
      Note that all of this needs a nearly complete rewrite of the AA
      documentation. I'm planning to do that, but I'd like to make sure the
      new design settles, and to flesh out a bit more of what it looks like in
      the new pass manager first.
      
      Differential Revision: http://reviews.llvm.org/D12080
      
      llvm-svn: 247167
      7b560d40
    • Reid Kleckner's avatar
      [WinEH] Avoid creating MBBs for LLVM BBs that cannot contain code · 51189f0a
      Reid Kleckner authored
      Typically these are catchpads, which hold data used to decide whether to
      catch the exception or continue unwinding. We also shouldn't create MBBs
      for catchendpads, cleanupendpads, or terminatepads, since no real code
      can live in them.
      
      This fixes a problem where MI passes (like the register allocator) would
      try to put code into catchpad blocks, which are not executed by the
      runtime. In the new world, blocks ending in invokes now have many
      possible successors.
      
      llvm-svn: 247102
      51189f0a
  26. Sep 01, 2015
    • Cong Hou's avatar
      Distribute the weight on the edge from switch to default statement to edges... · 511298b9
      Cong Hou authored
      Distribute the weight on the edge from switch to default statement to edges generated in lowering switch.
      
      Currently, when edge weights are assigned to edges that are created when lowering switch statement, the weight on the edge to default statement (let's call it "default weight" here) is not considered. We need to distribute this weight properly. However, without value profiling, we have no idea how to distribute it. In this patch, I applied the heuristic that this weight is evenly distributed to successors.
      
      For example, given a switch statement with cases 1,2,3,5,10,11,20, and every edge from switch to each successor has weight 10. If there is a binary search tree built to test if n < 10, then its two out-edges will have weight 4x10+10/2 = 45 and 3x10 + 10/2 = 35 respectively (currently they are 40 and 30 without considering the default weight). Each distribution (which is 5 here) will be stored in each SwitchWorkListItem for further distribution.
      
      There are some exceptions:
      
      For a jump table header which doesn't have any edge to default statement, we don't distribute the default weight to it.
      For a bit test header which covers a contiguous range and hence has no edges to default statement, we don't distribute the default weight to it.
      When the branch checks a single value or a contiguous range with no edge to default statement, we don't distribute the default weight to it.
      In other cases, the default weight is evenly distributed to successors.
      
      Differential Revision: http://reviews.llvm.org/D12418
      
      llvm-svn: 246522
      511298b9
  27. Aug 31, 2015
    • Reid Kleckner's avatar
      [EH] Handle non-Function personalities like unknown personalities · e00faf8c
      Reid Kleckner authored
      Also delete and simplify a lot of MachineModuleInfo code that used to be
      needed to handle personalities on landingpads.  Now that the personality
      is on the LLVM Function, we no longer need to track it this way on MMI.
      Certainly it should not live on LandingPadInfo.
      
      llvm-svn: 246478
      e00faf8c
  28. Aug 28, 2015
  29. Aug 25, 2015
    • Cong Hou's avatar
      Remove the final bit test during lowering switch statement if all cases in bit... · cd595913
      Cong Hou authored
      Remove the final bit test during lowering switch statement if all cases in bit test cover a contiguous range.
      
      When lowering switch statement, if bit tests are used then LLVM will always generates a jump to the default statement in the last bit test. However, this is not necessary when all cases in bit tests cover a contiguous range. This is because when generating the bit tests header MBB, there is a range check that guarantees cases in bit tests won't go outside of [low, high], where low and high are minimum and maximum case values in the bit tests. This patch checks if this is the case and then doesn't emit jump to default statement and hence saves a bit test and a branch.
      
      Differential Revision: http://reviews.llvm.org/D12249
      
      llvm-svn: 245976
      cd595913
  30. Jul 28, 2015
  31. Jul 22, 2015
  32. Jul 16, 2015
  33. Jul 09, 2015
  34. Jun 26, 2015
  35. Jun 23, 2015
  36. Jun 22, 2015
    • Rafael Espindola's avatar
      Avoid a Symbol -> Name -> Symbol conversion. · 36b718fc
      Rafael Espindola authored
      Before this we were producing a TargetExternalSymbol from a MCSymbol.
      That meant extracting the symbol name and fetching the symbol again
      down the pipeline.
      
      This patch adds a DAG.getMCSymbol that lets the MCSymbol pass unchanged on the
      DAG.
      
      Doing so removes the need for MO_NOPREFIX and fixes the root cause of pr23900,
      allowing r240130 to be committed again.
      
      llvm-svn: 240300
      36b718fc
  37. Jun 19, 2015
Loading