Skip to content
  1. Jun 07, 2021
  2. Jun 06, 2021
  3. Jun 05, 2021
  4. Jun 04, 2021
    • Fangrui Song's avatar
      [InstrProfiling] If no value profiling, make data variable private and (for Windows) use one comdat · 9e51d1f3
      Fangrui Song authored
      `__profd_*` variables are referenced by code only when value profiling is
      enabled. If disabled (e.g. default -fprofile-instr-generate), the symbols just
      waste space on ELF/Mach-O. We change the comdat symbol from `__profd_*` to
      `__profc_*` because an internal symbol does not provide deduplication features
      on COFF. The choice doesn't matter on ELF.
      
      (In -DLLVM_BUILD_INSTRUMENTED_COVERAGE=on build, there is now no `__profd_*` symbols.)
      
      On Windows this enables further optimization. We are no longer affected by the
      link.exe limitation: an external symbol in IMAGE_COMDAT_SELECT_ASSOCIATIVE can
      cause duplicate definition error.
      https://lists.llvm.org/pipermail/llvm-dev/2021-May/150758.html
      We can thus use llvm.compiler.used instead of llvm.used like ELF (D97585).
      This avoids many `/INCLUDE:` directives in `.drectve`.
      
      Here is rnk's measurement for Chrome:
      ```
      This reduced object file size of base_unittests.exe, compiled with coverage, optimizations, and gmlt debug info by 10%:
      
      #BEFORE
      
      $ find . -iname '*.obj' | xargs du -b | awk '{ sum += $1 } END { print sum}'
      1047758867
      
      $ du -cksh base_unittests.exe
      82M     base_unittests.exe
      82M     total
      
      # AFTER
      
      $ find . -iname '*.obj' | xargs du -b | awk '{ sum += $1 } END { print sum}'
      937886499
      
      $ du -cksh base_unittests.exe
      78M     base_unittests.exe
      78M     total
      ```
      
      The change is NFC for Mach-O.
      
      Reviewed By: davidxl, rnk
      
      Differential Revision: https://reviews.llvm.org/D103372
      9e51d1f3
    • Nikita Popov's avatar
      [IndVars] Don't forget value when inferring nowrap flags · 14f350da
      Nikita Popov authored
      When SimplifyIndVars infers IR nowrap flags from SCEV, this may
      happen in two ways: Either nowrap flags were already present in
      SCEV and just get transferred to IR. Or zero/sign extension of
      addrecs infers additional nowrap flags, and those get transferred
      to IR. In the latter case, calling forgetValue() ensures that the
      newly inferred nowrap flags get propagated to any other SCEV
      expressions based on the addrec. However, the invalidation can
      also have a major compile-time effect in some cases. For
      https://bugs.llvm.org/show_bug.cgi?id=50384 with n=512 compile-
      time drops from 7.1s to 0.8s without this invalidation. At the
      same time, removing the invalidation doesn't affect any codegen
      in test-suite.
      
      Differential Revision: https://reviews.llvm.org/D103424
      14f350da
    • Rong Xu's avatar
      [SampleFDO] New hierarchical discriminator for FS SampleFDO (llvm-profdata part) · 8d581857
      Rong Xu authored
      This patch was split from https://reviews.llvm.org/D102246
      [SampleFDO] New hierarchical discriminator for Flow Sensitive SampleFDO
      This is for llvm-profdata part of change. It sets the bit masks for the
      profile reader in llvm-profdata. Also add an internal option
      "-fs-discriminator-pass" for show and merge command to process the profile
      offline.
      
      This patch also moved setDiscriminatorMaskedBitFrom() to
      SampleProfileReader::create() to simplify the interface.
      
      Differential Revision: https://reviews.llvm.org/D103550
      8d581857
    • Adam Nemet's avatar
      [Matrix] Fix transpose-multiply folding if transpose has multiple uses · ffde966c
      Adam Nemet authored
      Don't add it to FusedInsts in this case.
      
      Differential Revision: https://reviews.llvm.org/D103627
      ffde966c
Loading