Skip to content
  1. Dec 20, 2016
    • Adrian Prantl's avatar
      [IR] Remove the DIExpression field from DIGlobalVariable. · bceaaa96
      Adrian Prantl authored
      This patch implements PR31013 by introducing a
      DIGlobalVariableExpression that holds a pair of DIGlobalVariable and
      DIExpression.
      
      Currently, DIGlobalVariables holds a DIExpression. This is not the
      best way to model this:
      
      (1) The DIGlobalVariable should describe the source level variable,
          not how to get to its location.
      
      (2) It makes it unsafe/hard to update the expressions when we call
          replaceExpression on the DIGLobalVariable.
      
      (3) It makes it impossible to represent a global variable that is in
          more than one location (e.g., a variable with multiple
          DW_OP_LLVM_fragment-s).  We also moved away from attaching the
          DIExpression to DILocalVariable for the same reasons.
      
      This reapplies r289902 with additional testcase upgrades and a change
      to the Bitcode record for DIGlobalVariable, that makes upgrading the
      old format unambiguous also for variables without DIExpressions.
      
      <rdar://problem/29250149>
      https://llvm.org/bugs/show_bug.cgi?id=31013
      Differential Revision: https://reviews.llvm.org/D26769
      
      llvm-svn: 290153
      bceaaa96
  2. Dec 19, 2016
  3. Dec 17, 2016
  4. Dec 16, 2016
    • Adrian Prantl's avatar
      Revert "[IR] Remove the DIExpression field from DIGlobalVariable." · 73ec0656
      Adrian Prantl authored
      This reverts commit 289920 (again).
      I forgot to implement a Bitcode upgrade for the case where a DIGlobalVariable
      has not DIExpression. Unfortunately it is not possible to safely upgrade
      these variables without adding a flag to the bitcode record indicating which
      version they are.
      My plan of record is to roll the planned follow-up patch that adds a
      unit: field to DIGlobalVariable into this patch before recomitting.
      This way we only need one Bitcode upgrade for both changes (with a
      version flag in the bitcode record to safely distinguish the record
      formats).
      
      Sorry for the churn!
      
      llvm-svn: 289982
      73ec0656
    • Adrian Prantl's avatar
      [IR] Remove the DIExpression field from DIGlobalVariable. · 74a835cd
      Adrian Prantl authored
      This patch implements PR31013 by introducing a
      DIGlobalVariableExpression that holds a pair of DIGlobalVariable and
      DIExpression.
      
      Currently, DIGlobalVariables holds a DIExpression. This is not the
      best way to model this:
      
      (1) The DIGlobalVariable should describe the source level variable,
          not how to get to its location.
      
      (2) It makes it unsafe/hard to update the expressions when we call
          replaceExpression on the DIGLobalVariable.
      
      (3) It makes it impossible to represent a global variable that is in
          more than one location (e.g., a variable with multiple
          DW_OP_LLVM_fragment-s).  We also moved away from attaching the
          DIExpression to DILocalVariable for the same reasons.
      
      This reapplies r289902 with additional testcase upgrades.
      
      <rdar://problem/29250149>
      https://llvm.org/bugs/show_bug.cgi?id=31013
      Differential Revision: https://reviews.llvm.org/D26769
      
      llvm-svn: 289920
      74a835cd
    • Teresa Johnson's avatar
      [ThinLTO] Thin link efficiency: More efficient export list computation · edddca22
      Teresa Johnson authored
      Summary:
      Instead of checking whether a global referenced by a function being
      imported is defined in the same module, speculatively always add the
      referenced globals to the module's export list. After all imports are
      computed, for each module prune any not in its defined set from its
      export list.
      
      For a huge C++ app with aggressive importing thresholds, even with
      D27687 we spent a lot of time invoking modulePath() from
      exportGlobalInModule (modulePath() was still the 2nd hottest routine in
      profile). The reason is that with comdat/linkonce the summary lists for
      each GUID can be long. For the app in question, for example, we were
      invoking exportGlobalInModule almost 2 million times, and we traversed
      an average of 63 entries in the summary list each time.
      
      This patch reduced the thin link time for the app by about 10% (on top
      of D27687) when using aggressive importing thresholds, and about 3.5% on
      average with default importing thresholds.
      
      Reviewers: mehdi_amini
      
      Subscribers: llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D27755
      
      llvm-svn: 289918
      edddca22
    • Adrian Prantl's avatar
      Revert "[IR] Remove the DIExpression field from DIGlobalVariable." · 03c6d31a
      Adrian Prantl authored
      This reverts commit 289902 while investigating bot berakage.
      
      llvm-svn: 289906
      03c6d31a
    • Peter Collingbourne's avatar
      Add missing library dep. · 7a4be21d
      Peter Collingbourne authored
      llvm-svn: 289903
      7a4be21d
    • Adrian Prantl's avatar
      [IR] Remove the DIExpression field from DIGlobalVariable. · ce139357
      Adrian Prantl authored
      This patch implements PR31013 by introducing a
      DIGlobalVariableExpression that holds a pair of DIGlobalVariable and
      DIExpression.
      
      Currently, DIGlobalVariables holds a DIExpression. This is not the
      best way to model this:
      
      (1) The DIGlobalVariable should describe the source level variable,
          not how to get to its location.
      
      (2) It makes it unsafe/hard to update the expressions when we call
          replaceExpression on the DIGLobalVariable.
      
      (3) It makes it impossible to represent a global variable that is in
          more than one location (e.g., a variable with multiple
          DW_OP_LLVM_fragment-s).  We also moved away from attaching the
          DIExpression to DILocalVariable for the same reasons.
      
      <rdar://problem/29250149>
      https://llvm.org/bugs/show_bug.cgi?id=31013
      Differential Revision: https://reviews.llvm.org/D26769
      
      llvm-svn: 289902
      ce139357
    • Peter Collingbourne's avatar
      IPO: Introduce ThinLTOBitcodeWriter pass. · 1398a32e
      Peter Collingbourne authored
      This pass prepares a module containing type metadata for ThinLTO by splitting
      it into regular and thin LTO parts if possible, and writing both parts to
      a multi-module bitcode file. Modules that do not contain type metadata are
      written unmodified as a single module.
      
      All globals with type metadata are added to the regular LTO module, and
      the rest are added to the thin LTO module.
      
      Differential Revision: https://reviews.llvm.org/D27324
      
      llvm-svn: 289899
      1398a32e
    • Teresa Johnson's avatar
      [ThinLTO] Thin link efficiency improvement: don't re-export globals (NFC) · 19f2aa78
      Teresa Johnson authored
      Summary:
      We were reinvoking exportGlobalInModule numerous times redundantly.
      No need to re-export globals referenced by a global that was already
      imported from its module. This resulted in a large speedup in the thin
      link for a big application, particularly when importing aggressiveness
      was cranked up.
      
      Reviewers: mehdi_amini
      
      Subscribers: llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D27687
      
      llvm-svn: 289896
      19f2aa78
  5. Dec 15, 2016
    • Teresa Johnson's avatar
      [ThinLTO] Revert part of r289843 that belonged to another patch. · eb0ac241
      Teresa Johnson authored
      The code change for D27687 accidentally got committed along with the
      main change in r289843. Revert it temporarily, so that I can recommit it
      along with its test as intended.
      
      llvm-svn: 289875
      eb0ac241
    • Teresa Johnson's avatar
      [ThinLTO] Remove stale comment (NFC) · 0c3f57b1
      Teresa Johnson authored
      This should have been removed with r288446.
      
      llvm-svn: 289871
      0c3f57b1
    • Teresa Johnson's avatar
      [ThinLTO] Thin link efficiency: skip candidate added later with higher threshold (NFC) · 475b51a7
      Teresa Johnson authored
      Summary:
      Thin link efficiency improvement. After adding an importing candidate to
      the worklist we might have later added it again with a higher threshold.
      Skip it when popped from the worklist if we recorded a higher threshold
      than the current worklist entry, it will get processed again at the
      higher threshold when that entry is popped.
      
      This required adding the summary's GUID to the worklist, so that it can
      be used to query the recorded highest threshold for it when we pop from the
      worklist.
      
      Reviewers: mehdi_amini
      
      Subscribers: llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D27696
      
      llvm-svn: 289867
      475b51a7
    • Teresa Johnson's avatar
      [ThinLTO] Ensure callees get hot threshold when first seen on cold path · 1b859a23
      Teresa Johnson authored
      This is split out from D27696, since it turned out to be a bug fix and
      not part of the NFC efficiency change.
      
      Keep the same adjusted (possibly decayed) threshold in both the worklist
      and the ImportList. Otherwise if we encountered it first along a cold
      path, the callee would be added to the worklist with a lower decayed
      threshold than when it is later encountered along a hot path. But the
      logic uses the threshold recorded in the ImportList entry to check if
      we should re-add it, and without this patch the threshold recorded there
      is the same along both paths so we don't re-add it. Using the
      same possibly decayed threshold in the ImportList ensures we re-add it
      later with the higher non-decayed hot path threshold.
      
      llvm-svn: 289843
      1b859a23
    • Hal Finkel's avatar
      Remove the AssumptionCache · 3ca4a6bc
      Hal Finkel authored
      After r289755, the AssumptionCache is no longer needed. Variables affected by
      assumptions are now found by using the new operand-bundle-based scheme. This
      new scheme is more computationally efficient, and also we need much less
      code...
      
      llvm-svn: 289756
      3ca4a6bc
  6. Dec 14, 2016
    • Dehao Chen's avatar
      Only sets profile summary when it was not preset. · 40dd8c51
      Dehao Chen authored
      Summary: SampleProfileLoader pass may be invoked twice by LTO. The 2nd pass should not append more summary info as it is already preset by the 1st pass.
      
      Reviewers: eraman, davidxl
      
      Subscribers: mehdi_amini, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D27733
      
      llvm-svn: 289725
      40dd8c51
    • Dehao Chen's avatar
      Fix the bug in r289714 (NFC). · fb699619
      Dehao Chen authored
      llvm-svn: 289724
      fb699619
    • Dehao Chen's avatar
      Create SampleProfileLoader pass in llvm instead of clang · a99e082e
      Dehao Chen authored
      Summary: We used to create SampleProfileLoader pass in clang. This makes LTO/ThinLTO unable to add this pass in the linker plugin. This patch moves the SampleProfileLoader pass creation from clang to llvm pass manager builder.
      
      Reviewers: tejohnson, davidxl, dnovillo
      
      Subscribers: llvm-commits, mehdi_amini
      
      Differential Revision: https://reviews.llvm.org/D27743
      
      llvm-svn: 289714
      a99e082e
    • Geoff Berry's avatar
      [GVNHoist] Move GVNHoist to function simplification part of pipeline. · ca11a1e1
      Geoff Berry authored
      Summary:
      Move GVNHoist to later in the optimization pipeline, specifically, to
      the function simplification part of the pipeline.  The new pipeline
      location allows GVNHoist to run on a function after its callees have
      been inlined but before the function has been considered for inlining
      into its callers, exposing more opportunities for hoisting.
      
      Performance results on AArch64 kryo:
      Improvements:
        Benchmarks/CoyoteBench/fftbench  -24.952%
        spec2006/bzip2                    -4.071%
        internal bmark                    -3.177%
        Benchmarks/PAQ8p/paq8p            -1.754%
        spec2000/perlbmk                  -1.328%
        spec2006/h264ref                  -1.140%
      
      Regressions:
        internal bmark                    +1.818%
        Benchmarks/mafft/pairlocalalign   +1.084%
      
      Reviewers: sebpop, dberlin, hiraditya
      
      Subscribers: aemerson, mehdi_amini, mcrosier, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D27722
      
      llvm-svn: 289696
      ca11a1e1
    • Dehao Chen's avatar
      revert r289669 which breaks bots · 23025f84
      Dehao Chen authored
      llvm-svn: 289676
      23025f84
    • Dehao Chen's avatar
      Create SampleProfileLoader pass in llvm instead of clang · cb61c94d
      Dehao Chen authored
      Summary: We used to create SampleProfileLoader pass in clang. This makes LTO/ThinLTO unable to add this pass in the linker plugin. This patch moves the SampleProfileLoader pass creation from clang to llvm pass manager builder.
      
      Reviewers: tejohnson, davidxl, dnovillo
      
      Subscribers: llvm-commits, mehdi_amini
      
      Differential Revision: https://reviews.llvm.org/D27743
      
      llvm-svn: 289669
      cb61c94d
  7. Dec 13, 2016
  8. Dec 12, 2016
  9. Dec 09, 2016
  10. Dec 08, 2016
    • Evgeniy Stepanov's avatar
      CFI-icall on Thumb · 0c8957c1
      Evgeniy Stepanov authored
      Replace @progbits in the section directive with %progbits, because "@" starts a comment on arm/thumb.
      Use b.w branch instruction.
      Use .thumb_function and .thumb_set for proper arm/thumb interwork. This way jumptable entry addresses on thumb have bit 0 set (correctly). This does not affect CFI check math, because the address of the jumptable start also has that bit set.
      
      This does not work on thumbv5, because it does not support b.w, and the linker would not insert a veneer (trampoline?) to extend the range of b.n. We may need to do full-range plt-style jumptables on thumbv54, which are 12 bytes per entry. Another option is "push lr; bl; pop pc" (4 bytes) but that needs unwinding instructions, etc.
      
      Differential Revision: https://reviews.llvm.org/D27499
      
      llvm-svn: 289008
      0c8957c1
  11. Dec 07, 2016
    • Benjamin Kramer's avatar
      Try unbreaking the MSVC build. · b1332d8b
      Benjamin Kramer authored
      llvm-svn: 288907
      b1332d8b
    • Benjamin Kramer's avatar
      [LowerTypeTests] Use the TrailingObjects infrastructure for trailing objects. · 926ab5b0
      Benjamin Kramer authored
      Also avoid allocating ~3x as much memory as needed.
      
      llvm-svn: 288904
      926ab5b0
    • Peter Collingbourne's avatar
      LowerTypeTests: Improve performance by optimising type metadata queries. · 7357b2ad
      Peter Collingbourne authored
      Requesting metadata for a global is a relatively expensive operation as it
      involves a map lookup, but it's one that we need to do relatively frequently in
      this pass to collect the list of type metadata nodes associated with a global.
      This change improves the performance of type metadata queries by prebuilding
      data structures that keep the global together with its list of type metadata,
      and changing the pass to use that data structure wherever we were previously
      passing global references around.
      
      This change also eliminates some O(N^2) behavior by collecting the list of
      globals associated with each type identifier during the first pass over the
      list of globals rather than visiting each global to compute that list every
      time we add a new type identifier.
      
      Reduces pass runtime on a module containing Chrome's vtables from over 60s
      to 0.9s.
      
      Differential Revision: https://reviews.llvm.org/D27484
      
      llvm-svn: 288859
      7357b2ad
  12. Dec 02, 2016
    • Peter Collingbourne's avatar
      IR: Move NumElements field from {Array,Vector}Type to SequentialType. · bc070524
      Peter Collingbourne authored
      Now that PointerType is no longer a SequentialType, all SequentialTypes
      have an associated number of elements, so we can move that information to
      the base class, allowing for a number of simplifications.
      
      Differential Revision: https://reviews.llvm.org/D27122
      
      llvm-svn: 288464
      bc070524
    • Peter Collingbourne's avatar
      IR: Change PointerType to derive from Type rather than SequentialType. · 4568158c
      Peter Collingbourne authored
      As proposed on llvm-dev:
      http://lists.llvm.org/pipermail/llvm-dev/2016-October/106640.html
      
      This is for a couple of reasons:
      
      - Values of type PointerType are unlike the other SequentialTypes (arrays
        and vectors) in that they do not hold values of the element type. By moving
        PointerType we can unify certain aspects of how the other SequentialTypes
        are handled.
      - PointerType will have no place in the SequentialType hierarchy once
        pointee types are removed, so this is a necessary step towards removing
        pointee types.
      
      Differential Revision: https://reviews.llvm.org/D26595
      
      llvm-svn: 288462
      4568158c
    • Peter Collingbourne's avatar
      IR: Change the gep_type_iterator API to avoid always exposing the "current" type. · ab85225b
      Peter Collingbourne authored
      Instead, expose whether the current type is an array or a struct, if an array
      what the upper bound is, and if a struct the struct type itself. This is
      in preparation for a later change which will make PointerType derive from
      Type rather than SequentialType.
      
      Differential Revision: https://reviews.llvm.org/D26594
      
      llvm-svn: 288458
      ab85225b
    • Teresa Johnson's avatar
      [ThinLTO] Stop importing constant global vars as copies in the backend · 185b4ab6
      Teresa Johnson authored
      Summary:
      We were doing an optimization in the ThinLTO backends of importing
      constant unnamed_addr globals unconditionally as a local copy (regardless
      of whether the thin link decided to import them). This should be done in
      the thin link instead, so that resulting exported references are marked
      and promoted appropriately, but will need a summary enhancement to mark
      these variables as constant unnamed_addr.
      
      The function import logic during the thin link was trying to handle
      this proactively, by conservatively marking all values referenced in
      the initializer lists of exported global variables as also exported.
      However, this only handled values referenced directly from the
      initializer list of an exported global variable. If the value is itself
      a constant unnamed_addr variable, we could end up exporting its
      references as well. This caused multiple issues. The first is that the
      transitively exported references weren't promoted. Secondly, some could
      not be promoted/renamed (e.g. they had a section or other constraint).
      recursively, instead of just adding the first level of initializer list
      references to the ExportList directly.
      
      Remove this optimization and the associated handling in the function
      import backend. SPEC measurements indicate we weren't getting much
      from it in any case.
      
      Fixes PR31052.
      
      Reviewers: mehdi_amini
      
      Subscribers: krasin, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D26880
      
      llvm-svn: 288446
      185b4ab6
  13. Dec 01, 2016
  14. Nov 29, 2016
  15. Nov 25, 2016
  16. Nov 22, 2016
  17. Nov 21, 2016
  18. Nov 20, 2016
Loading