Skip to content
  1. Aug 20, 2017
  2. Aug 19, 2017
    • Teresa Johnson's avatar
      [ThinLTO] Fix ThinLTO crash · 73305f82
      Teresa Johnson authored
      Summary:
      Follow up to fix in r311023, which fixed the case where the combined
      index is written to disk. The same samplePGO logic exists for the
      in-memory index when computing imports, so we need to filter out
      GlobalVariable summaries there too.
      
      Reviewers: davidxl
      
      Subscribers: inglorion, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D36919
      
      llvm-svn: 311254
      73305f82
  3. Aug 14, 2017
  4. Aug 11, 2017
    • Eli Friedman's avatar
      [OptDiag] Updating Remarks in SampleProfile · 51cf2604
      Eli Friedman authored
      Updating remark API to newer OptimizationDiagnosticInfo API. This
      allows remarks to show up in diagnostic yaml file, and enables use
      of opt-viewer tool.
      
      Hotness information for remarks (L505 and L751) do not display hotness
      information, most likely due to profile information not being
      propagated yet. Unsure if this is the desired outcome.
      
      Patch by Tarun Rajendran.
      
      Differential Revision: https://reviews.llvm.org/D36127
      
      llvm-svn: 310763
      51cf2604
    • Xinliang David Li's avatar
      Fix typo /NFC · 24524f31
      Xinliang David Li authored
      llvm-svn: 310737
      24524f31
  5. Aug 09, 2017
  6. Aug 04, 2017
  7. Aug 02, 2017
    • Chandler Carruth's avatar
      [PM] Fix a bug where through CGSCC iteration we can get · 95055d8f
      Chandler Carruth authored
      infinite-inlining across multiple runs of the inliner by keeping a tiny
      history of internal-to-SCC inlining decisions.
      
      This is still a bit gross, but I don't yet have any fundamentally better
      ideas and numerous people are blocked on this to use new PM and ThinLTO
      together.
      
      The core of the idea is to detect when we are about to do an inline that
      has a chance of re-splitting an SCC which we have split before with
      a similar inlining step. That is a critical component in the inlining
      forming a cycle and so far detects all of the various cyclic patterns
      I can come up with as well as the original real-world test case (which
      comes from a ThinLTO build of libunwind).
      
      I've added some tests that I think really demonstrate what is going on
      here. They are essentially state machines that march the inliner through
      various steps of a cycle and check that we stop when the cycle is closed
      and that we actually did do inlining to form that cycle.
      
      A lot of thanks go to Eric Christopher and Sanjoy Das for the help
      understanding this issue and improving the test cases.
      
      The biggest "yuck" here is the layering issue -- the CGSCC pass manager
      is providing somewhat magical state to the inliner for it to use to make
      itself converge. This isn't great, but I don't honestly have a lot of
      better ideas yet and at least seems nicely isolated.
      
      I have tested this patch, and it doesn't block *any* inlining on the
      entire LLVM test suite and SPEC, so it seems sufficiently narrowly
      targeted to the issue at hand.
      
      We have come up with hypothetical issues that this patch doesn't cover,
      but so far none of them are practical and we don't have a viable
      solution yet that covers the hypothetical stuff, so proceeding here in
      the interim. Definitely an area that we will be back and revisiting in
      the future.
      
      Differential Revision: https://reviews.llvm.org/D36188
      
      llvm-svn: 309784
      95055d8f
  8. Jul 31, 2017
    • Peter Collingbourne's avatar
      Update phi nodes in LowerTypeTests control flow simplification · bcd204b4
      Peter Collingbourne authored
      D33925 added a control flow simplification for -O2 --lto-O0 builds that
      manually splits blocks and reassigns conditional branches but does not
      correctly update phi nodes. If the else case being branched to had
      incoming phi nodes the control-flow simplification would leave phi nodes
      in that BB with an unhandled predecessor.
      
      Patch by Vlad Tsyrklevich!
      
      Differential Revision: https://reviews.llvm.org/D36012
      
      llvm-svn: 309621
      bcd204b4
    • Florian Hahn's avatar
      Guard print() functions only used by dump() functions. · 6b3216aa
      Florian Hahn authored
      Summary:
      Since  r293359, most dump() function are only defined when
      `!defined(NDEBUG) || defined(LLVM_ENABLE_DUMP)` holds. print() functions
      only used by dump() functions are now unused in release builds,
      generating lots of warnings. This patch only defines some print()
      functions if they are used.
      
      Reviewers: MatzeB
      
      Reviewed By: MatzeB
      
      Subscribers: arsenm, mzolotukhin, nhaehnle, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D35949
      
      llvm-svn: 309553
      6b3216aa
  9. Jul 28, 2017
  10. Jul 27, 2017
    • whitequark's avatar
      [MergeFunctions] Remove alias support. · 9e8197ac
      whitequark authored
      The alias support was dead code since 2011. It was last touched
      in r124182, where it was reintroduced after being removed
      in r110434, and since then it was gated behind a HasGlobalAliases
      flag that was permanently stuck as `false`.
      
      It is also broken. I'm not sure if it bitrotted or was just broken
      in the first place because it appears to have never been tested,
      but the following IR results in a crash:
      
          define internal i32 @a(i32 %a, i32 %b) unnamed_addr {
            %c = add i32 %a, %b
            %d = xor i32 %a, %c
            ret i32 %c
          }
      
          define internal i32 @b(i32 %a, i32 %b) unnamed_addr {
            %c = add i32 %a, %b
            %d = xor i32 %a, %c
            ret i32 %c
          }
      
      It seems safe to remove buggy untested code that no one cared about
      for seven years.
      
      Differential Revision: https://reviews.llvm.org/D34802
      
      llvm-svn: 309313
      9e8197ac
    • Davide Italiano's avatar
      [FunctionImport] Prefer isa<> to dyn_cast<> as the value is not used. · 82c7d376
      Davide Italiano authored
      This change makes GCC7 happy again.
      
      llvm-svn: 309305
      82c7d376
    • David Blaikie's avatar
      ThinLTO: Don't import aliases of any kind (even linkonce_odr) · 2f0cc477
      David Blaikie authored
      Summary:
      Until a more advanced version of importing can be implemented for
      aliases (one that imports an alias as an available_externally definition
      of the aliasee), skip the narrow subset of cases that was possible but
      came at a cost: aliases of linkonce_odr functions could be imported
      because the linkonce_odr function could be safely duplicated from the
      source module. This came/comes at the cost of not being able to 'home'
      imported linkonce functions (they had to be emitted linkonce_odr in all
      the destination modules (even if they weren't used by an alias) rather
      than as available_externally - causing extra object size).
      
      Tangentially, this also was the only reason ThinLTO would emit multiple
      CUs in to the resulting DWARF - which happens to be a problem for
      Fission (there's a fix for this in GDB but not released yet, etc).
      (actually it's not the only reason - but I'm sending a patch to fix the
      other reason shortly)
      
      There's no reason to believe this particularly narrow alias importing
      was especially/meaningfully important, only that it was /possible/ to
      implement in this way. When a more general solution is done, it should
      still satisfy the DWARF concerns above, since the import will still be
      available_externally, and thus not create extra CUs.
      
      Since now all aliases are treated the same, I removed/simplified some
      test cases since they were testing corner cases where there are no
      longer any corners.
      
      Reviewers: tejohnson, mehdi_amini
      
      Differential Revision: https://reviews.llvm.org/D35875
      
      llvm-svn: 309278
      2f0cc477
  11. Jul 21, 2017
    • Haojie Wang's avatar
      ThinLTO Minimized Bitcode File Size Reduction · 1dec57d5
      Haojie Wang authored
      Summary: Currently the ThinLTO minimized bitcode file only strip the debug info, but there is still a lot of information in the minimized bit code file that will be not used for thin linker. In this patch, most of the extra information is striped to reduce the minimized bitcode file. Now only ModuleVersion, ModuleInfo, ModuleGlobalValueSummary, ModuleHash, Symtab and Strtab are left. Now the minimized bitcode file size is reduced to 15%-30% of the debug info stripped bitcode file size.
      
      Reviewers: danielcdh, tejohnson, pcc
      
      Reviewed By: pcc
      
      Subscribers: mehdi_amini, aprantl, inglorion, eraman, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D35334
      
      llvm-svn: 308760
      1dec57d5
  12. Jul 20, 2017
    • Peter Collingbourne's avatar
      LowerTypeTests: Drop function type metadata only if we're going to replace it. · 6f6788b9
      Peter Collingbourne authored
      Previously we were (mis)handling jump table members with a prevailing
      definition in a full LTO module and a non-prevailing definition in a
      ThinLTO module by dropping type metadata on those functions entirely,
      which would cause type tests involving such functions to fail.
      
      This patch causes us to drop metadata only if we are about to replace
      it with metadata from cfi.functions.
      
      We also want to replace metadata for available_externally functions,
      which can arise in the opposite scenario (prevailing ThinLTO
      definition, non-prevailing full LTO definition). The simplest way
      to handle that is to remove the definition; there's little value in
      keeping it around at this point (i.e. after most optimization passes
      have already run) and later code will try to use the function's linkage
      to create an alias, which would result in invalid IR if the function
      is available_externally.
      
      Fixes PR33832.
      
      Differential Revision: https://reviews.llvm.org/D35604
      
      llvm-svn: 308642
      6f6788b9
  13. Jul 19, 2017
    • Peter Collingbourne's avatar
      ThinLTOBitcodeWriter: Do not rewrite intrinsic functions when splitting modules. · 93fdaca5
      Peter Collingbourne authored
      Changing the type of an intrinsic may invalidate the IR.
      
      Differential Revision: https://reviews.llvm.org/D35593
      
      llvm-svn: 308500
      93fdaca5
    • Chandler Carruth's avatar
      [PM/LCG] Follow-up fix to r308088 to handle deletion of library · 06a86301
      Chandler Carruth authored
      functions.
      
      In the prior commit, we provide ordering to the LCG between functions
      and library function definitions that they might begin to call through
      transformations. But we still would delete these library functions from
      the call graph if they became dead during inlining.
      
      While this immediately crashed, it also exposed a loss of information.
      We shouldn't remove definitions of library functions that can still
      usefully participate in the LCG-powered CGSCC optimization process. If
      new call edges are formed, we want to have definitions to be called.
      
      We can still remove these functions if truly dead using global-dce, etc,
      but removing them during the CGSCC walk is premature.
      
      This fixes a crash in the new PM when optimizing some unusual libraries
      that end up with "internal" lib functions such as the code in the "R"
      language's libraries.
      
      llvm-svn: 308417
      06a86301
  14. Jul 17, 2017
  15. Jul 16, 2017
  16. Jul 15, 2017
    • Chandler Carruth's avatar
      Revert r308078 (and subsequent tweak in r308079) which introduces a test · d78a38ed
      Chandler Carruth authored
      that appears to exhibit non-determinism and is flaking on the bots
      pretty consistently.
      
      r308078: [ThinLTO] Ensure we always select the same function copy to import
      r308079: Require asserts in new test that uses debug flag
      llvm-svn: 308095
      d78a38ed
    • Teresa Johnson's avatar
      [ThinLTO] Ensure we always select the same function copy to import · 82b4fb1a
      Teresa Johnson authored
      Summary:
      Check if the first eligible callee is under the instruction threshold.
      Checking this on the first eligible callee ensures that we don't end
      up selecting different callees to import when we invoke this routine
      with different thresholds due to reaching the callee via paths that
      are shallower or hotter (when there are multiple copies, i.e. with
      weak or linkonce linkage). We don't want to leave the decision of which
      copy to import up to the backend.
      
      Reviewers: mehdi_amini
      
      Subscribers: inglorion, fhahn, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D35436
      
      llvm-svn: 308078
      82b4fb1a
  17. Jul 14, 2017
    • Jakub Kuderski's avatar
      [Dominators] Make IsPostDominator a template parameter · b292c22c
      Jakub Kuderski authored
      Summary:
      DominatorTreeBase used to have IsPostDominators (bool) member to indicate if the tree is a dominator or a postdominator tree. This made it possible to switch between the two 'modes' at runtime, but it isn't used in practice anywhere.
      
      This patch makes IsPostDominator a template argument. This way, it is easier to switch between different algorithms at compile-time based on this argument and design external utilities around it. It also makes it impossible to incidentally assign a postdominator tree to a dominator tree (and vice versa), and to further simplify template code in GenericDominatorTreeConstruction.
      
      Reviewers: dberlin, sanjoy, davide, grosser
      
      Reviewed By: dberlin
      
      Subscribers: mzolotukhin, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D35315
      
      llvm-svn: 308040
      b292c22c
  18. Jul 13, 2017
  19. Jul 12, 2017
    • Peter Collingbourne's avatar
      LowerTypeTests: When importing functions skip definitions where the summary contains a decl. · cacac6a1
      Peter Collingbourne authored
      This normally indicates mixed CFI + non-CFI compilation, and will
      result in us treating the function in the same way as a function
      defined outside of the LTO unit.
      
      Part of PR33752.
      
      Differential Revision: https://reviews.llvm.org/D35281
      
      llvm-svn: 307744
      cacac6a1
    • Davide Italiano's avatar
      [IPO] Temporarily rollback r307215. · b8ad3eeb
      Davide Italiano authored
      [GlobalOpt] Remove unreachable blocks before optimizing a function.
      While the change is presumably correct, it exposes a latent bug
      in DI which breaks on of the CFI checks. I'll analyze it further
      and try to understand what's going on.
      
      llvm-svn: 307729
      b8ad3eeb
    • Konstantin Zhuravlyov's avatar
      Enhance synchscope representation · bb80d3e1
      Konstantin Zhuravlyov authored
        OpenCL 2.0 introduces the notion of memory scopes in atomic operations to
        global and local memory. These scopes restrict how synchronization is
        achieved, which can result in improved performance.
      
        This change extends existing notion of synchronization scopes in LLVM to
        support arbitrary scopes expressed as target-specific strings, in addition to
        the already defined scopes (single thread, system).
      
        The LLVM IR and MIR syntax for expressing synchronization scopes has changed
        to use *syncscope("<scope>")*, where <scope> can be "singlethread" (this
        replaces *singlethread* keyword), or a target-specific name. As before, if
        the scope is not specified, it defaults to CrossThread/System scope.
      
        Implementation details:
          - Mapping from synchronization scope name/string to synchronization scope id
            is stored in LLVM context;
          - CrossThread/System and SingleThread scopes are pre-defined to efficiently
            check for known scopes without comparing strings;
          - Synchronization scope names are stored in SYNC_SCOPE_NAMES_BLOCK in
            the bitcode.
      
      Differential Revision: https://reviews.llvm.org/D21723
      
      llvm-svn: 307722
      bb80d3e1
  20. Jul 11, 2017
    • Chandler Carruth's avatar
      [PM/ThinLTO] Fix PR33536, a bug where the ThinLTO bitcode writer was · 01f0c8a8
      Chandler Carruth authored
      querying for analysis results on a function declaration rather than
      a definition.
      
      The only reason this worked previously is by chance -- because the way
      we got alias analysis results with the legacy PM, we happened to not
      compute a dominator tree and so we happened to not hit an assert even
      though it didn't make any real sense. Now we bail out before trying to
      compute alias analysis so that we don't hit these asserts.
      
      llvm-svn: 307625
      01f0c8a8
  21. Jul 10, 2017
    • Mikael Holmen's avatar
      [ArgumentPromotion] Change use of removed argument in llvm.dbg.value to undef · e0ced144
      Mikael Holmen authored
      Summary:
      This solves PR33641.
      
      When removing a dead argument we must also handle possibly existing calls
      to llvm.dbg.value that use the removed argument. Now we change the use
      of the otherwise dead argument to an undef for some other pass to cleanup
      later.
      
      If the calls are left untouched, they will later on cause errors:
       "function-local metadata used in wrong function"
      since the ArgumentPromotion rewrites the code by creating a new function
      with the wanted signature, but the metadata is not recreated so the new
      function may then erroneously use metadata from the old function.
      
      Reviewers: mstorsjo, rnk, arsenm
      
      Reviewed By: rnk
      
      Subscribers: wdng, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D34874
      
      llvm-svn: 307521
      e0ced144
  22. Jul 09, 2017
    • Chandler Carruth's avatar
      [PM] Finish implementing and fix a chain of bugs uncovered by testing · bd9c2903
      Chandler Carruth authored
      the invalidation propagation logic from an SCC to a Function.
      
      I wrote the infrastructure to test this but didn't actually use it in
      the unit test where it was designed to be used. =[ My bad. Once
      I actually added it to the test case I discovered that it also hadn't
      been properly implemented, so I've implemented it. The logic in the FAM
      proxy for an SCC pass to propagate invalidation follows the same ideas
      as the FAM proxy for a Module pass, but the implementation is a bit
      different to reflect the fact that it is forwarding just for an SCC.
      
      However, implementing this correctly uncovered a surprising "bug" (it
      was conservatively correct but relatively very expensive) in how we
      handle invalidation when splitting one SCC into multiple SCCs. We did an
      eager invalidation when in reality we should be deferring invaliadtion
      for the *current* SCC to the CGSCC pass manager and just invaliating the
      newly constructed SCCs. Otherwise we end up invalidating too much too
      soon. This was exposed by the inliner test case that I've updated. Now,
      we invalidate *just* the split off '(test1_f)' SCC when doing the CG
      update, and then the inliner finishes and invalidates the '(test1_g,
      test1_h)' SCC's analyses. The first few attempts at fixing this hit
      still more bugs, but all of those are covered by existing tests. For
      example, the inliner should also preserve the FAM proxy to avoid
      unnecesasry invalidation, and this is safe because the CG update
      routines it uses handle any necessary adjustments to the FAM proxy.
      
      Finally, the unittests for the CGSCC pass manager needed a bunch of
      updates where we weren't correctly preserving the FAM proxy because it
      hadn't been fully implemented and failing to preserve it didn't matter.
      
      Note that this doesn't yet fix the current crasher due to MemSSA finding
      a stale dominator tree, but without this the fix to that crasher doesn't
      really make any sense when testing because it relies on the proxy
      behavior.
      
      llvm-svn: 307487
      bd9c2903
  23. Jul 07, 2017
    • Dehao Chen's avatar
      Increase the import-threshold for crtical functions. · 64c46574
      Dehao Chen authored
      Summary: For interative sample-pgo, if a hot call site is inlined in the profiling binary, we should inline it in before profile annotation in the backend. Before that, the compile phase first collects all GUIDs that needs to be imported and creates virtual "hot" call edge in the summary. However, "hot" is not good enough to guarantee the callsites get inlined. This patch introduces "critical" call edge, and assign much higher importing threshold for those edges.
      
      Reviewers: tejohnson
      
      Reviewed By: tejohnson
      
      Subscribers: sanjoy, mehdi_amini, llvm-commits, eraman
      
      Differential Revision: https://reviews.llvm.org/D35096
      
      llvm-svn: 307439
      64c46574
  24. Jul 06, 2017
    • Davide Italiano's avatar
      [lib/LTO] Add a comment to explain where we set the linkage in the summary. · f4891d29
      Davide Italiano authored
      Pointed out by Teresa!
      
      llvm-svn: 307305
      f4891d29
    • Davide Italiano's avatar
      [LTO] Fix the interaction between linker redefined symbols and ThinLTO · 6a5fbe52
      Davide Italiano authored
      This is the same as r304719 but for ThinLTO.
      The substantial difference is that in this case we don't have
      whole visibility, just the summary.
      In the LTO case, when we got the resolution for the input file we
      could just see if the linker told us whether a symbol was linker
      redefined (using --wrap or --defsym) and switch the linkage directly
      for the GV.
      
      Here, we have the summary. So, we record that the linkage changed
      from <whatever it was> to $weakany to prevent IPOs across this symbol
      boundaries and actually just switch the linkage at FunctionImport time.
      
      This patch should also fixes the lld bits (as all the scaffolding for
      communicating if a symbol is linker redefined should be there & should
      be the same), but I'll make sure to add some tests there as well.
      
      Fixes PR33192.
      
      Differential Revision:  https://reviews.llvm.org/D35064
      
      llvm-svn: 307303
      6a5fbe52
    • Frederich Munch's avatar
      Avoid constructing GlobalExtensions only to find out it is empty. · 52dfcd18
      Frederich Munch authored
      Summary:
      GlobalExtensions is dereferenced twice, once for iteration and then a check if it is empty.
      As a ManagedStatic this dereference forces it's construction which is unnecessary.
      
      Reviewers: efriedma, davide, mehdi_amini
      
      Reviewed By: mehdi_amini
      
      Subscribers: chapuni, llvm-commits, mehdi_amini
      
      Differential Revision: https://reviews.llvm.org/D33381
      
      llvm-svn: 307229
      52dfcd18
    • Davide Italiano's avatar
      [GlobalOpt] Remove unreachable blocks before optimizing a function. · 7dd0694f
      Davide Italiano authored
      LLVM's definition of dominance allows instructions that are cyclic
      in unreachable blocks, e.g.:
      
        %pat = select i1 %condition, @global, i16* %pat
      
      because any instruction dominates an instruction in a block that's
      not reachable from entry.
      So, remove unreachable blocks from the function, because a) there's
      no point in analyzing them and b) GlobalOpt should otherwise grow
      some more complicated logic to break these cycles.
      
      Differential Revision:  https://reviews.llvm.org/D35028
      
      llvm-svn: 307215
      7dd0694f
  25. Jun 30, 2017
Loading