Skip to content
  1. Jul 07, 2020
  2. Jul 04, 2020
    • sstefan1's avatar
      [OpenMPOpt] ICV Tracking · 6c4a5e92
      sstefan1 authored
      This is the first and most basic ICV Tracking implementation. For this
      first version, we only support deduplication within the same BB.
      
      Reviewers: jdoerfert, JonChesterfield, hamax97, jhuber6, uenoku,
      baziotis
      
      Differential Revision: https://reviews.llvm.org/D81788
      6c4a5e92
  3. Jul 02, 2020
  4. Jul 01, 2020
  5. Jun 29, 2020
  6. Jun 26, 2020
  7. Jun 24, 2020
    • Christopher Tetreault's avatar
      [SVE] Remove calls to VectorType::getNumElements from IPO · 3d123e17
      Christopher Tetreault authored
      Reviewers: efriedma, jdoerfert, sdesmalen, kmclaughlin
      
      Reviewed By: efriedma, jdoerfert
      
      Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits
      
      Tags: #llvm
      
      Differential Revision: https://reviews.llvm.org/D82219
      3d123e17
    • Teresa Johnson's avatar
      [WPD] Allow virtual calls to be analyzed with multiple type tests · d291bd51
      Teresa Johnson authored
      Summary:
      In D52514 I had fixed a bug with WPD after indirect call promotion, by
      checking that a type test being analyzed dominates potential virtual
      calls. With that fix I included a small effiency enhancement to avoid
      processing a devirt candidate multiple times (when there are multiple
      type tests). This latter change wasn't in response to any measured
      efficiency issues, it was merely theoretical. Unfortuantely, it turns
      out to limit optimization opportunities after inlining.
      
      Specifically, consider code that looks like:
      
      class A {
        virtual void foo();
      };
      class B : public A {
        void foo();
      }
      void callee(A *a) {
        a->foo(); // Call 1
      }
      void caller(B *b) {
        b->foo(); // Call 2
        callee(b);
      }
      
      After inlining callee into caller, because of the existing call to
      b->foo() in caller there will be 2 type tests in caller for the vtable
      pointer of b: the original type test against B from Call 2, and the
      inlined type test against A from Call 1. If the code was compiled with
      -fstrict-vtable-pointers, then after optimization WPD will see that
      both type tests are associated with the inlined virtual Call 1.
      With my earlier change to only process a virtual call against one type
      test, we may only consider virtual Call 1 against the base class A type
      test, which can't be devirtualized. With my change here to remove this
      restriction, it also gets considered for the type test against the
      derived class B type test, where it can be devirtualized.
      
      Note that if caller didn't include it's own earlier virtual call
      b->foo() we will not be able to devirtualize after inlining callee even
      after this fix, since there would not be a type test against B in the
      IR. As a future enhancement we can consider inserting type tests at call
      sites that pass pointers to classes with virtual calls, to enable
      context-sensitive devirtualization after inlining.
      
      Reviewers: pcc, vitalybuka, evgeny777
      
      Subscribers: Prazek, hiraditya, steven_wu, dexonsmith, llvm-commits
      
      Tags: #llvm
      
      Differential Revision: https://reviews.llvm.org/D79235
      d291bd51
    • sstefan1's avatar
      [OpenMPOpt] ICV macro definitions · 0f426935
      sstefan1 authored
      Summary:
      This defines some basic information about ICVs in `OMPKinds.def`.
      We also emit remarks with initial values for each function (which are default for now)
      as a way to test this.
      
      Reviewers: jdoerfert, JonChesterfield, hamax97, jhuber6
      
      Subscribers: yaxunl, hiraditya, guansong, llvm-commits
      
      Tags: #llvm
      
      Differential Revision: https://reviews.llvm.org/D82193
      0f426935
  8. Jun 23, 2020
  9. Jun 21, 2020
    • clfbbn's avatar
      [Attributor][NFC] Fix indentation · 10b05397
      clfbbn authored
      Summary: The patch D81022 seems to break the indentation of the `cleanupIR()` function. This patch fixes this problem
      
      Reviewers: jdoerfert, sstefan1, uenoku
      
      Reviewed By: jdoerfert
      
      Subscribers: hiraditya, uenoku, kuter, llvm-commits
      
      Tags: #llvm
      
      Differential Revision: https://reviews.llvm.org/D82260
      10b05397
    • Wenlei He's avatar
      [Remarks] Add callsite locations to inline remarks · 7c8a6936
      Wenlei He authored
      Summary:
      Add call site location info into inline remarks so we can differentiate inline sites.
      This can be useful for inliner tuning. We can also reconstruct full hierarchical inline
      tree from parsing such remarks. The messege of inline remark is also tweaked so we can
      differentiate SampleProfileLoader inline from CGSCC inline.
      
      Reviewers: wmi, davidxl, hoy
      
      Subscribers: hiraditya, cfe-commits, llvm-commits
      
      Tags: #clang, #llvm
      
      Differential Revision: https://reviews.llvm.org/D82213
      7c8a6936
  10. Jun 20, 2020
  11. Jun 18, 2020
    • Arthur Eubanks's avatar
      [GlobalOpt] Remove preallocated calls when possible · 91ef9305
      Arthur Eubanks authored
      When possible (e.g. internal linkage), strip preallocated attribute off
      parameters/arguments.
      This requires removing the "preallocated" operand bundle from the call
      site, replacing @llvm.call.preallocated.arg() with an alloca and a
      bitcast to i8*, and removing the @llvm.call.preallocated.setup(). Since
      @llvm.call.preallocated.arg() can be called multiple times with the same
      arg index, we create an alloca per arg index.
      We add a @llvm.stacksave() where the @llvm.call.preallocated.setup() was
      and a @llvm.stackrestore() after the preallocated call to prevent the
      stack from blowing up. This is valid because the argument would normally
      not exist on the stack after the call before the transformation.
      
      This does not currently handle all possible preallocated calls. We will
      need to figure out where to put @llvm.stackrestore() in the cases where
      there is no obvious place to put it, for example conditional
      preallocated calls, invokes.
      
      This sort of transformation may need to be moved to somewhere more
      accessible to accomodate similar transformations (like inlining) in the
      future.
      
      Reviewers: efriedma, hans
      
      Subscribers: hiraditya, llvm-commits
      
      Tags: #llvm
      
      Differential Revision: https://reviews.llvm.org/D80951
      91ef9305
    • Mehdi Amini's avatar
      Remove "unused" member ModuleSlice from `struct OpenMPOpt` · 77b79d79
      Mehdi Amini authored
      This is fixing warning from clang:
      
       warning: private field 'ModuleSlice' is not used [-Wunused-private-field]
        SmallPtrSetImpl<Function *> &ModuleSlice;
                                     ^
      
      Differential Revision: https://reviews.llvm.org/D82027
      77b79d79
  12. Jun 17, 2020
  13. Jun 16, 2020
  14. Jun 14, 2020
    • Sanjay Patel's avatar
      [PassManager] restore early-cse to vector cleanup · 098e48a6
      Sanjay Patel authored
      As noted in D80236 - the early-cse pass was included here before:
      D75145 / rG71a316883d50
      But it got moved outside of the "extra" option there, then it
      got dropped while adjusting -vector-combine:
      rG6438ea45e053
      rG57bb4787d72f
      
      So this is restoring the behavior and adding a test to prevent
      accidental changes again. I don't see an equivalent option for
      the new pass manager.
      098e48a6
  15. Jun 12, 2020
  16. Jun 10, 2020
  17. Jun 07, 2020
  18. Jun 05, 2020
  19. Jun 04, 2020
    • Yevgeny Rouban's avatar
      [Instruction] Remove setProfWeight() · 417bcb88
      Yevgeny Rouban authored
      Remove the function Instruction::setProfWeight() and make
      use of Instruction::copyMetadata(.., {LLVMContext::MD_prof}).
      This is correct for all use cases of setProfWeight() as it
      is applied to CallBase instructions only.
      This change results in prof metadata copied intact even if
      the source has "VP". The old pair of calls
      extractProfTotalWeight() + setProfWeight() resulted in
      setting branch_weights if the source had "VP" data.
      
      Reviewers: yamauchi, davidxl
      Tags: #llvm
      Differential Revision: https://reviews.llvm.org/D80987
      417bcb88
  20. Jun 03, 2020
    • Wei Mi's avatar
      [SampleFDO] Add use-sample-profile function attribute. · 7a6c8942
      Wei Mi authored
      When sampleFDO is enabled, people may expect they can use
      -fno-profile-sample-use to opt-out using sample profile for a certain file.
      That could be either for debugging purpose or for performance tuning purpose.
      However, when thinlto is enabled, if a function in file A compiled with
      -fno-profile-sample-use is imported to another file B compiled with
      -fprofile-sample-use, the inlined copy of the function in file B may still
      get its profile annotated.
      
      The inconsistency may even introduce profile unused warning because if the
      target is not compiled with explicit debug information flag, the function
      in file A won't have its debug information enabled (debug information will
      be enabled implicitly only when -fprofile-sample-use is used). After it is
      imported into file B which is compiled with -fprofile-sample-use, profile
      annotation for the outline copy of the function will fail because the
      function has no debug information, and that will trigger  profile unused
      warning.
      
      We add a new attribute use-sample-profile to control whether a function
      will use its sample profile no matter for its outline or inline copies.
      That will make the behavior of -fno-profile-sample-use consistent.
      
      Differential Revision: https://reviews.llvm.org/D79959
      7a6c8942
  21. Jun 01, 2020
    • Mircea Trofin's avatar
      [llvm][NFC] Cache FAM in InlineAdvisor · 999ea25a
      Mircea Trofin authored
      Summary:
      This simplifies the interface by storing the function analysis manager
      with the InlineAdvisor, and, thus, not requiring it be passed each time
      we inquire for an advice.
      
      Reviewers: davidxl, asbirlea
      
      Subscribers: eraman, hiraditya, llvm-commits
      
      Tags: #llvm
      
      Differential Revision: https://reviews.llvm.org/D80405
      999ea25a
    • Hiroshi Yamauchi's avatar
      [PGO] Improve the working set size heuristics under the partial sample PGO. · 6c27c61d
      Hiroshi Yamauchi authored
      Summary:
      The working set size heuristics (ProfileSummaryInfo::hasHugeWorkingSetSize)
      under the partial sample PGO may not be accurate because the profile is partial
      and the number of hot profile counters in the ProfileSummary may not reflect the
      actual working set size of the program being compiled.
      
      To improve this, the (approximated) ratio of the the number of profile counters
      of the program being compiled to the number of profile counters in the partial
      sample profile is computed (which is called the partial profile ratio) and the
      working set size of the profile is scaled by this ratio to reflect the working
      set size of the program being compiled and used for the working set size
      heuristics.
      
      The partial profile ratio is approximated based on the number of the basic
      blocks in the program and the NumCounts field in the ProfileSummary and computed
      through the thin LTO indexing. This means that there is the limitation that the
      scaled working set size is available to the thin LTO post link passes only.
      
      Reviewers: davidxl
      
      Subscribers: mgorny, eraman, hiraditya, steven_wu, dexonsmith, arphaman, dang, llvm-commits
      
      Tags: #llvm
      
      Differential Revision: https://reviews.llvm.org/D79831
      6c27c61d
  22. May 29, 2020
  23. May 27, 2020
  24. May 26, 2020
Loading