Skip to content
  1. Jan 25, 2021
    • Richard Smith's avatar
      Revert "[ObjC][ARC] Annotate calls with attributes instead of emitting retainRV" · 925ae8c7
      Richard Smith authored
      This reverts commit 53176c16, which
      introduceed a layering violation. LLVM's IR library can't include
      headers from Analysis.
      925ae8c7
    • Akira Hatanaka's avatar
      [ObjC][ARC] Annotate calls with attributes instead of emitting retainRV · 53176c16
      Akira Hatanaka authored
      or claimRV calls in the IR
      
      Background:
      
      This patch makes changes to the front-end and middle-end that are
      needed to fix a longstanding problem where llvm breaks ARC's autorelease
      optimization (see the link below) by separating calls from the marker
      instructions or retainRV/claimRV calls. The backend changes are in
      https://reviews.llvm.org/D92569.
      
      https://clang.llvm.org/docs/AutomaticReferenceCounting.html#arc-runtime-objc-autoreleasereturnvalue
      
      What this patch does to fix the problem:
      
      - The front-end annotates calls with attribute "clang.arc.rv"="retain"
        or "clang.arc.rv"="claim", which indicates the call is implicitly
        followed by a marker instruction and a retainRV/claimRV call that
        consumes the call result. This is currently done only when the target
        is arm64 and the optimization level is higher than -O0.
      
      - ARC optimizer temporarily emits retainRV/claimRV calls after the
        annotated calls in the IR and removes the inserted calls after
        processing the function.
      
      - ARC contract pass emits retainRV/claimRV calls after the annotated
        calls. It doesn't remove the attribute on the call since the backend
        needs it to emit the marker instruction. The retainRV/claimRV calls
        are emitted late in the pipeline to prevent optimization passes from
        transforming the IR in a way that makes it harder for the ARC
        middle-end passes to figure out the def-use relationship between the
        call and the retainRV/claimRV calls (which is the cause of PR31925).
      
      - The function inliner removes the autoreleaseRV call in the callee that
        returns the result if nothing in the callee prevents it from being
        paired up with the calls annotated with "clang.arc.rv"="retain/claim"
        in the caller. If the call is annotated with "claim", a release call
        is inserted since autoreleaseRV+claimRV is equivalent to a release. If
        it cannot find an autoreleaseRV call, it tries to transfer the
        attributes to a function call in the callee. This is important since
        ARC optimizer can remove the autoreleaseRV call returning the callee
        result, which makes it impossible to pair it up with the retainRV or
        claimRV call in the caller. If that fails, it simply emits a retain
        call in the IR if the call is annotated with "retain" and does nothing
        if it's annotated with "claim".
      
      - This patch teaches dead argument elimination pass not to change the
        return type of a function if any of the calls to the function are
        annotated with attribute "clang.arc.rv". This is necessary since the
        pass can incorrectly determine nothing in the IR uses the function
        return, which can happen since the front-end no longer explicitly
        emits retainRV/claimRV calls in the IR, and change its return type to
        'void'.
      
      Future work:
      
      - Use the attribute on x86-64.
      
      - Fix the auto upgrader to convert call+retainRV/claimRV pairs into
        calls annotated with the attributes.
      
      rdar://71443534
      
      Differential Revision: https://reviews.llvm.org/D92808
      53176c16
    • Wei Mi's avatar
      [SampleFDO] Report error when reading a bad/incompatible profile instead of · c9cd9a00
      Wei Mi authored
      turning off SampleFDO silently.
      
      Currently sample loader pass turns off SampleFDO optimization silently when
      it sees error in reading the profile. This behavior will defeat the tests
      which could have caught those bad/incompatible profile problems. This patch
      change the behavior to report error.
      
      Differential Revision: https://reviews.llvm.org/D95269
      c9cd9a00
  2. Jan 23, 2021
  3. Jan 22, 2021
  4. Jan 21, 2021
  5. Jan 20, 2021
  6. Jan 19, 2021
    • Florian Hahn's avatar
      [LoopRotate] Add PrepareForLTO stage, avoid rotating with inline cands. · 83daa497
      Florian Hahn authored
      D84108 exposed a bad interaction between inlining and loop-rotation
      during regular LTO, which is causing notable regressions in at least
      CINT2006/473.astar.
      
      The problem boils down to: we now rotate a loop just before the vectorizer
      which requires duplicating a function call in the preheader when compiling
      the individual files ('prepare for LTO'). But this then prevents further
      inlining of the function during LTO.
      
      This patch tries to resolve this issue by making LoopRotate more
      conservative with respect to rotating loops that have inline-able calls
      during the 'prepare for LTO' stage.
      
      I think this change intuitively improves the current situation in
      general. Loop-rotate tries hard to avoid creating headers that are 'too
      big'. At the moment, it assumes all inlining already happened and the
      cost of duplicating a call is equal to just doing the call. But with LTO,
      inlining also happens during full LTO and it is possible that a previously
      duplicated call is actually a huge function which gets inlined
      during LTO.
      
      From the perspective of LV, not much should change overall. Most loops
      calling user-provided functions won't get vectorized to start with
      (unless we can infer that the function does not touch memory, has no
      other side effects). If we do not inline the 'inline-able' call during
      the LTO stage, we merely delayed loop-rotation & vectorization. If we
      inline during LTO, chances should be very high that the inlined code is
      itself vectorizable or the user call was not vectorizable to start with.
      
      There could of course be scenarios where we inline a sufficiently large
      function with code not profitable to vectorize, which would have be
      vectorized earlier (by scalarzing the call). But even in that case,
      there probably is no big performance impact, because it should be mostly
      down to the cost-model to reject vectorization in that case. And then
      the version with scalarized calls should also not be beneficial. In a way,
      LV should have strictly more information after inlining and make more
      accurate decisions (barring cost-model issues).
      
      There is of course plenty of room for things to go wrong unexpectedly,
      so we need to keep a close look at actual performance and address any
      follow-up issues.
      
      I took a look at the impact on statistics for
      MultiSource/SPEC2000/SPEC2006. There are a few benchmarks with fewer
      loops rotated, but no change to the number of loops vectorized.
      
      Reviewed By: sanwou01
      
      Differential Revision: https://reviews.llvm.org/D94232
      83daa497
  7. Jan 18, 2021
  8. Jan 16, 2021
  9. Jan 15, 2021
  10. Jan 14, 2021
  11. Jan 13, 2021
    • Andrew Litteken's avatar
      [IROutliner] Adapting to hoisted bitcasts in CodeExtractor · 05b1a15f
      Andrew Litteken authored
      In commit 700d2417 the CodeExtractor
      was updated so that bitcasts that have lifetime markers that beginning
      outside of the region are deduplicated outside the region and are not
      used as an output.  This caused a discrepancy in the IROutliner, where
      in these cases there were arguments added to the aggregate function
      that were not needed causing assertion errors.
      
      The IROutliner queries the CodeExtractor twice to determine the inputs
      and outputs, before and after `findAllocas` is called with the same
      ValueSet for the outputs causing the duplication. This has been fixed
      with a dummy ValueSet for the first call.
      
      However, the additional bitcasts prevent us from using the same
      similarity relationships that were previously defined by the
      IR Similarity Analysis Pass. In these cases, we check whether the
      initial version of the region being analyzed for outlining is still the
      same as it was previously.  If it is not, i.e. because of the additional
      bitcast instructions from the CodeExtractor, we discard the region.
      
      Reviewers: yroux
      
      Differential Revision: https://reviews.llvm.org/D94303
      05b1a15f
    • Kazu Hirata's avatar
      [llvm] Remove redundant string initialization (NFC) · 12fc9ca3
      Kazu Hirata authored
      Identified with readability-redundant-string-init.
      12fc9ca3
  12. Jan 12, 2021
    • modimo's avatar
      [Inliner] Change inline remark format and update ReplayInlineAdvisor to use it · 2a49b7c6
      modimo authored
      This change modifies the source location formatting from:
      LineNumber.Discriminator
      to:
      LineNumber:ColumnNumber.Discriminator
      
      The motivation here is to enhance location information for inline replay that currently exists for the SampleProfile inliner. This will be leveraged further in inline replay for the CGSCC inliner in the related diff.
      
      The ReplayInlineAdvisor is also modified to read the new format and now takes into account the callee for greater accuracy.
      
      Testing:
      ninja check-llvm
      
      Reviewed By: mtrofin
      
      Differential Revision: https://reviews.llvm.org/D94333
      2a49b7c6
    • Florian Hahn's avatar
      [FunctionAttrs] Derive willreturn for fns with readonly` & `mustprogress`. · 6cd44b20
      Florian Hahn authored
      Similar to D94125, derive `willreturn` for functions that are `readonly` and
      `mustprogress` in FunctionAttrs.
      
      To quote the reasoning from D94125:
      
          Since D86233 we have `mustprogress` which, in combination with
          `readonly`, implies `willreturn`. The idea is that every side-effect
          has to be modeled as a "write". Consequently, `readonly` means there
          is no side-effect, and `mustprogress` guarantees that we cannot "loop"
          forever without side-effect.
      
      Reviewed By: jdoerfert, nikic
      
      Differential Revision: https://reviews.llvm.org/D94502
      6cd44b20
  13. Jan 11, 2021
    • Giorgis Georgakoudis's avatar
      [OpenMPOpt][WIP] Expand parallel region merging · 97517055
      Giorgis Georgakoudis authored
      The existing implementation of parallel region merging applies only to
      consecutive parallel regions that have speculatable sequential
      instructions in-between. This patch lifts this limitation to expand
      merging with any sequential instructions in-between, except calls to
      unmergable OpenMP runtime functions. In-between sequential instructions
      in the merged region are sequentialized in a "master" region and any
      output values are broadcasted to the following parallel regions and the
      sequential region continuation of the merged region.
      
      Reviewed By: jdoerfert
      
      Differential Revision: https://reviews.llvm.org/D90909
      97517055
  14. Jan 10, 2021
    • Florian Hahn's avatar
      [STLExtras] Use return type from operator* of the wrapped iter. · c701f85c
      Florian Hahn authored
      Currently make_early_inc_range cannot be used with iterators with
      operator* implementations that do not return a reference.
      
      Most notably in the LLVM codebase, this means the User iterator ranges
      cannot be used with make_early_inc_range, which slightly simplifies
      iterating over ranges while elements are removed.
      
      Instead of directly using BaseT::reference as return type of operator*,
      this patch uses decltype to get the actual return type of the operator*
      implementation in WrappedIteratorT.
      
      This patch also updates a few places to use make use of
      make_early_inc_range.
      
      Reviewed By: dblaikie
      
      Differential Revision: https://reviews.llvm.org/D93992
      c701f85c
  15. Jan 08, 2021
  16. Jan 06, 2021
    • Arthur Eubanks's avatar
      [CGSCC][Coroutine][NewPM] Properly support function splitting/outlining · 7fea561e
      Arthur Eubanks authored
      Previously when trying to support CoroSplit's function splitting, we
      added in a hack that simply added the new function's node into the
      original function's SCC (https://reviews.llvm.org/D87798). This is
      incorrect since it might be in its own SCC.
      
      Now, more similar to the previous design, we have callers explicitly
      notify the LazyCallGraph that a function has been split out from another
      one.
      
      In order to properly support CoroSplit, there are two ways functions can
      be split out.
      
      One is the normal expected "outlining" of one function into a new one.
      The new function may only contain references to other functions that the
      original did. The original function must reference the new function. The
      new function may reference the original function, which can result in
      the new function being in the same SCC as the original function. The
      weird case is when the original function indirectly references the new
      function, but the new function directly calls the original function,
      resulting in the new SCC being a parent of the original function's SCC.
      This form of function splitting works with CoroSplit's Switch ABI.
      
      The second way of splitting is more specific to CoroSplit. CoroSplit's
      Retcon and Async ABIs split the original function into multiple
      functions that all reference each other and are referenced by the
      original function. In order to keep the LazyCallGraph in a valid state,
      all new functions must be processed together, else some nodes won't be
      populated. To keep things simple, this only supports the case where all
      new edges are ref edges, and every new function references every other
      new function. There can be a reference back from any new function to the
      original function, putting all functions in the same RefSCC.
      
      This also adds asserts that all nodes in a (Ref)SCC can reach all other
      nodes to prevent future incorrect hacks.
      
      The original hacks in https://reviews.llvm.org/D87798 are no longer
      necessary since all new functions should have been registered before
      calling updateCGAndAnalysisManagerForPass.
      
      This fixes all coroutine tests when opt's -enable-new-pm is true by
      default. This also fixes PR48190, which was likely due to the previous
      hack breaking SCC invariants.
      
      Reviewed By: rnk
      
      Differential Revision: https://reviews.llvm.org/D93828
      7fea561e
  17. Jan 05, 2021
    • Arthur Eubanks's avatar
      [FuncAttrs] Infer noreturn · 8cf1cc57
      Arthur Eubanks authored
      A function is noreturn if all blocks terminating with a ReturnInst
      contain a call to a noreturn function. Skip looking at naked functions
      since there may be asm that returns.
      
      This can be further refined in the future by checking unreachable blocks
      and taking into account recursion. It looks like the attributor pass
      does this, but that is not yet enabled by default.
      
      This seems to help with code size under the new PM since PruneEH does
      not run under the new PM, missing opportunities to mark some functions
      noreturn, which in turn doesn't allow simplifycfg to clean up dead code.
      https://bugs.llvm.org/show_bug.cgi?id=46858.
      
      Reviewed By: rnk
      
      Differential Revision: https://reviews.llvm.org/D93946
      8cf1cc57
  18. Jan 04, 2021
    • Florian Hahn's avatar
      [SimplifyCFG] Enabled hoisting late in LTO pipeline. · c367258b
      Florian Hahn authored
      bb7d3af1 disabled hoisting in SimplifyCFG by default, but enabled it
      late in the pipeline. But it appears as if the LTO pipelines got missed.
      
      This patch adjusts the LTO pipelines to also enable hoisting in the
      later stages.
      
      Unfortunately there's no easy way to add a test for the change I think.
      
      Reviewed By: lebedev.ri
      
      Differential Revision: https://reviews.llvm.org/D93684
      c367258b
    • Florian Hahn's avatar
      [ArgPromotion] Delay dead GEP removal until doPromotion. · e0905553
      Florian Hahn authored
      Currently ArgPromotion removes dead GEPs as part of the legality check
      in isSafeToPromoteArgument. If no promotion happens, this means the pass
      claims no modifications happened, even though GEPs were removed.
      
      This patch fixes the issue by delaying removal of dead GEPs until
      doPromotion: isSafeToPromoteArgument can simply skips dead GEPs and
      the code in doPromotion dealing with GEPs is updated to account for
      dead GEPs. Once we committed to promotion, it should be safe to
      remove dead GEPs.
      
      Alternatively isSafeToPromoteArgument could return an additional boolean
      to indicate whether it made changes, but this is quite cumbersome and
      there should be no real benefit of weeding out some dead GEPs here if we
      do not perform promotion.
      
      I added a test for the case where dead GEPs need to be removed when
      promotion happens in 578c5a0c.
      
      Fixes PR47477.
      
      Reviewed By: jdoerfert
      
      Differential Revision: https://reviews.llvm.org/D93991
      e0905553
    • Andrew Litteken's avatar
      [IROutliner] Refactoring errors in the cost model from past patches. · 5c951623
      Andrew Litteken authored
      There were was the reuse of a variable that should not have been
      occurred due to confusion during committing patches.
      5c951623
    • Andrew Litteken's avatar
      [IROutliner] Removing a duplicate addition, causing overestimates in IROutliner. · 05e6ac4e
      Andrew Litteken authored
      There was an extra addition left over from a previous commit for the
      cost model, this removes it.
      05e6ac4e
  19. Jan 03, 2021
  20. Jan 02, 2021
  21. Dec 31, 2020
    • Andrew Litteken's avatar
      [IROutliner] Adding consistent function attribute merging · 1a9eb19a
      Andrew Litteken authored
      When combining extracted functions, they may have different function
      attributes. We want to make sure that we do not make any assumptions,
      or lose any information. This attempts to make sure that we consolidate
      function attributes to their most general case.
      
      Tests:
      llvm/test/Transforms/IROutliner/outlining-compatible-and-attribute-transfer.ll
      llvm/test/Transforms/IROutliner/outlining-compatible-or-attribute-transfer.ll
      
      Reviewers: jdoefert, paquette
      
      Differential Revision: https://reviews.llvm.org/D87301
      1a9eb19a
    • Fangrui Song's avatar
      [ThinLTO] Default -enable-import-metadata to false · a90b42b0
      Fangrui Song authored
      The default value is dependent on `-DLLVM_ENABLE_ASSERTIONS={off,on}` (D22167), which is
      error-prone. The few tests checking `!thinlto_src_module` can specify -enable-import-metadata explicitly.
      
      Reviewed By: tejohnson
      
      Differential Revision: https://reviews.llvm.org/D93959
      a90b42b0
Loading