Skip to content
  1. Jun 01, 2020
    • Sanjay Patel's avatar
      [InstCombine] fix use of base VectorType; NFC · 26ebe936
      Sanjay Patel authored
      SimplifyDemandedVectorElts() bails out on ScalableVectorType
      anyway, but we can exit faster with the external check.
      
      Move this to a helper function because there are likely other
      vector folds that we can try here.
      26ebe936
    • Hiroshi Yamauchi's avatar
      [PGO] Improve the working set size heuristics under the partial sample PGO. · 6c27c61d
      Hiroshi Yamauchi authored
      Summary:
      The working set size heuristics (ProfileSummaryInfo::hasHugeWorkingSetSize)
      under the partial sample PGO may not be accurate because the profile is partial
      and the number of hot profile counters in the ProfileSummary may not reflect the
      actual working set size of the program being compiled.
      
      To improve this, the (approximated) ratio of the the number of profile counters
      of the program being compiled to the number of profile counters in the partial
      sample profile is computed (which is called the partial profile ratio) and the
      working set size of the profile is scaled by this ratio to reflect the working
      set size of the program being compiled and used for the working set size
      heuristics.
      
      The partial profile ratio is approximated based on the number of the basic
      blocks in the program and the NumCounts field in the ProfileSummary and computed
      through the thin LTO indexing. This means that there is the limitation that the
      scaled working set size is available to the thin LTO post link passes only.
      
      Reviewers: davidxl
      
      Subscribers: mgorny, eraman, hiraditya, steven_wu, dexonsmith, arphaman, dang, llvm-commits
      
      Tags: #llvm
      
      Differential Revision: https://reviews.llvm.org/D79831
      6c27c61d
    • Stanislav Mekhanoshin's avatar
      Process gep (phi ptr1, ptr2) in SROA · 745c6c84
      Stanislav Mekhanoshin authored
      Differential Revision: https://reviews.llvm.org/D79218
      745c6c84
    • Sanjay Patel's avatar
      [InstNamer] use 'i' for Instructions, not 'tmp' · dd54432a
      Sanjay Patel authored
      As discussed in https://bugs.llvm.org/show_bug.cgi?id=45951 and
      D80584, the name 'tmp' is almost always a bad choice, but we have
      a legacy of regression tests with that name because it was baked
      into utils/update_test_checks.py.
      
      This change makes -instnamer more consistent (already using "arg"
      and "bb", the common LLVM shorthand). And it avoids the conflict
      in telling users of the FileCheck script to run "-instnamer" to
      create a better regression test and having that cause a warn/fail
      in update_test_checks.py.
      dd54432a
    • Ehud Katz's avatar
      8a84158e
    • Ehud Katz's avatar
      [StructurizeCFG] Fix region nodes ordering · 85c30880
      Ehud Katz authored
      This is a reimplementation of the `orderNodes` function, as the old
      implementation didn't take into account all cases.
      The new implementation uses SCCs instead of Loops to take account of
      irreducible loops.
      
      Fix PR41509
      
      Differential Revision: https://reviews.llvm.org/D79037
      85c30880
  2. May 30, 2020
  3. May 29, 2020
    • Stanislav Mekhanoshin's avatar
      Revert "Process gep (phi ptr1, ptr2) in SROA" · af852d6f
      Stanislav Mekhanoshin authored
      This reverts commit f66a43c1.
      af852d6f
    • Stanislav Mekhanoshin's avatar
      Process gep (phi ptr1, ptr2) in SROA · f66a43c1
      Stanislav Mekhanoshin authored
      Differential Revision: https://reviews.llvm.org/D79218
      f66a43c1
    • Christopher Tetreault's avatar
      [SVE] Eliminate calls to default-false VectorType::get() from Vectorize · d2befc66
      Christopher Tetreault authored
      Reviewers: efriedma, c-rhodes, david-arm, fhahn
      
      Reviewed By: david-arm
      
      Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits
      
      Tags: #llvm
      
      Differential Revision: https://reviews.llvm.org/D80339
      d2befc66
    • Ehud Katz's avatar
      [Local] Prevent `invertCondition` from creating a redundant instruction · c710bb44
      Ehud Katz authored
      Prevent `invertCondition` from creating the inversion instruction, in
      case the given value is an argument which has already been inverted.
      Note that this approach has already been taken in case the given value
      is an instruction (and not an argument).
      
      Differential Revision: https://reviews.llvm.org/D80399
      c710bb44
    • Paul Robinson's avatar
      Preserve DbgLoc when DeadArgumentElimination rewrites a 'ret'. · 8c2d2d97
      Paul Robinson authored
      Fixes PR46002.
      8c2d2d97
    • Florian Hahn's avatar
      [SCCP] Switch to widen at PHIs, stores and call edges. · 01f999ae
      Florian Hahn authored
      Currently SCCP does not widen PHIs, stores or along call edges
      (arguments/return values), but on operations that directly extend ranges
      (like binary operators).
      
      This means PHIs, stores and call edges are not pessimized by widening
      currently, while binary operators are. The main reason for widening
      operators initially was that opting-out for certain operations was
      more straight-forward in the initial implementation (and it did not
      matter too much, as range support initially was only implemented for a
      very limited set of operations.
      
      During the discussion in D78391, it was suggested to consider flipping
      widening to PHIs, stores and along call edges. After adding support for
      tracking the number of range extensions in ValueLattice, limiting the
      number of range extensions per value is straight forward.
      
      This patch introduces a MaxWidenSteps option to the MergeOptions,
      limiting the number of range extensions per value. For PHIs, it seems
      natural allow an extension for each (active) incoming value plus 1. For
      the other cases, a arbitrary limit of 10 has been chosen initially. It would
      potentially make sense to set it depending on the users of a
      function/global, but that still needs investigating. This potentially
      leads to more state-changes and longer compile-times.
      
      The results look quite promising (MultiSource, SPEC):
      
      Same hash: 179 (filtered out)
      Remaining: 58
      Metric: sccp.IPNumInstRemoved
      
      Program                                        base    widen-phi diff
       test-suite...ks/Prolangs-C/agrep/agrep.test    58.00   82.00    41.4%
       test-suite...marks/SciMark2-C/scimark2.test    32.00   43.00    34.4%
       test-suite...rks/FreeBench/mason/mason.test     6.00    8.00    33.3%
       test-suite...langs-C/football/football.test   104.00  128.00    23.1%
       test-suite...cations/hexxagon/hexxagon.test    36.00   42.00    16.7%
       test-suite...CFP2000/177.mesa/177.mesa.test   214.00  249.00    16.4%
       test-suite...ngs-C/assembler/assembler.test    14.00   16.00    14.3%
       test-suite...arks/VersaBench/dbms/dbms.test    10.00   11.00    10.0%
       test-suite...oxyApps-C++/miniFE/miniFE.test    43.00   47.00     9.3%
       test-suite...ications/JM/ldecod/ldecod.test   179.00  195.00     8.9%
       test-suite...CFP2006/433.milc/433.milc.test   249.00  265.00     6.4%
       test-suite.../CINT2000/175.vpr/175.vpr.test    98.00  104.00     6.1%
       test-suite...peg2/mpeg2dec/mpeg2decode.test    70.00   74.00     5.7%
       test-suite...CFP2000/188.ammp/188.ammp.test    71.00   75.00     5.6%
       test-suite...ce/Benchmarks/PAQ8p/paq8p.test   111.00  117.00     5.4%
       test-suite...ce/Applications/Burg/burg.test    41.00   43.00     4.9%
       test-suite...000/197.parser/197.parser.test    66.00   69.00     4.5%
       test-suite...tions/lambda-0.1.3/lambda.test    23.00   24.00     4.3%
       test-suite...urce/Applications/lua/lua.test   301.00  313.00     4.0%
       test-suite...TimberWolfMC/timberwolfmc.test    76.00   79.00     3.9%
       test-suite...lications/ClamAV/clamscan.test   991.00  1030.00    3.9%
       test-suite...plications/d/make_dparser.test    53.00   55.00     3.8%
       test-suite...fice-ispell/office-ispell.test    83.00   86.00     3.6%
       test-suite...lications/obsequi/Obsequi.test    28.00   29.00     3.6%
       test-suite.../Prolangs-C/bison/mybison.test    56.00   58.00     3.6%
       test-suite.../CINT2000/254.gap/254.gap.test   170.00  176.00     3.5%
       test-suite.../Applications/lemon/lemon.test    30.00   31.00     3.3%
       test-suite.../CINT2000/176.gcc/176.gcc.test   1202.00 1240.00    3.2%
       test-suite...pplications/treecc/treecc.test    79.00   81.00     2.5%
       test-suite...chmarks/MallocBench/gs/gs.test   357.00  366.00     2.5%
       test-suite...eeBench/analyzer/analyzer.test   103.00  105.00     1.9%
       test-suite...T2006/445.gobmk/445.gobmk.test   1697.00 1724.00    1.6%
       test-suite...006/453.povray/453.povray.test   1812.00 1839.00    1.5%
       test-suite.../Benchmarks/Bullet/bullet.test   337.00  342.00     1.5%
       test-suite.../CINT2000/252.eon/252.eon.test   426.00  432.00     1.4%
       test-suite...T2000/300.twolf/300.twolf.test   214.00  217.00     1.4%
       test-suite...pplications/oggenc/oggenc.test   244.00  247.00     1.2%
       test-suite.../CINT2006/403.gcc/403.gcc.test   4008.00 4055.00    1.2%
       test-suite...T2006/456.hmmer/456.hmmer.test   175.00  177.00     1.1%
       test-suite...nal/skidmarks10/skidmarks.test   430.00  434.00     0.9%
       test-suite.../Applications/sgefa/sgefa.test   115.00  116.00     0.9%
       test-suite...006/447.dealII/447.dealII.test   1082.00 1091.00    0.8%
       test-suite...6/482.sphinx3/482.sphinx3.test   141.00  142.00     0.7%
       test-suite...ocBench/espresso/espresso.test   152.00  153.00     0.7%
       test-suite...3.xalancbmk/483.xalancbmk.test   4003.00 4025.00    0.5%
       test-suite...lications/sqlite3/sqlite3.test   548.00  551.00     0.5%
       test-suite...marks/7zip/7zip-benchmark.test   5522.00 5551.00    0.5%
       test-suite...nsumer-lame/consumer-lame.test   208.00  209.00     0.5%
       test-suite...:: External/Povray/povray.test   1556.00 1563.00    0.4%
       test-suite...000/186.crafty/186.crafty.test   298.00  299.00     0.3%
       test-suite.../Applications/SPASS/SPASS.test   2019.00 2025.00    0.3%
       test-suite...ications/JM/lencod/lencod.test   8427.00 8449.00    0.3%
       test-suite...6/464.h264ref/464.h264ref.test   6797.00 6813.00    0.2%
       test-suite...6/471.omnetpp/471.omnetpp.test   431.00  430.00    -0.2%
       test-suite...006/450.soplex/450.soplex.test   446.00  447.00     0.2%
       test-suite...0.perlbench/400.perlbench.test   1729.00 1727.00   -0.1%
       test-suite...000/255.vortex/255.vortex.test   3815.00 3819.00    0.1%
      
      Reviewers: efriedma, nikic, davide
      
      Reviewed By: efriedma
      
      Differential Revision: https://reviews.llvm.org/D79036
      01f999ae
    • David Sherwood's avatar
      [SVE] Remove getNumElements() warnings in InstCombiner::visitBitCast · f254f1d9
      David Sherwood authored
      Whilst trying to compile this test to assembly:
      
        CodeGen/aarch64-sve-intrinsics/acle_sve_reinterpret.c
      
      I discovered some warnings were firing in InstCombiner::visitBitCast
      due to calls to getNumElements() for scalable vector types. These
      calls only really made sense for fixed width vectors so I have fixed
      up the code appropriately.
      
      Differential Revision: https://reviews.llvm.org/D80559
      f254f1d9
    • Whitney Tsang's avatar
    • Whitney Tsang's avatar
      [LoopUnroll] Support loops with exiting block that is neither header nor · 1bc73b02
      Whitney Tsang authored
      latch.
      
      Summary: Remove the limitation in LoopUnrollPass that exiting block must
      be either header or latch.
      Reviewer: dmgreen, jdoerfert, Meinersbur, kbarton, bmahjour, etiotto,
      fhahn, efriedma
      Reviewed By: etiotto, fhahn, efriedma
      Subscribers: efriedma, lkail, xbolva00, hiraditya, zzheng, llvm-commits
      Tag: LLVM
      Differential Revision: https://reviews.llvm.org/D80477
      1bc73b02
  4. May 28, 2020
  5. May 27, 2020
Loading