Skip to content
  1. Oct 24, 2018
    • Matthias Braun's avatar
      SelectionDAG: Reuse bigger sized constants in memset expansion. · 4f82406c
      Matthias Braun authored
      When implementing memset's today we often see this pattern:
      $x0 = MOV 0xXYXYXYXYXYXYXYXY
      store $x0, ...
      $w1 = MOV 0xXYXYXYXY
      store $w1, ...
      
      We first create a 64bit constant in a 64bit register with all bytes the
      same and then create a 32bit constant with all bytes the same in a 32bit
      register. In many targets we could just access the lower byte of the
      64bit register instead.
      
      - Ideally this would be handled by the ConstantHoist pass but it runs
        too early when memset isn't expanded yet.
      - The memset expansion code already had this optimization implemented,
        however SelectionDAG constantfolding would constantfold the
        "trunc(bigconstnat)" pattern to "smallconstant".
      - This patch makes the memset expansion mark the constant as Opaque and
        stop DAGCombiner from constant folding in this situation. (Similar to
        how ConstantHoisting marks things as Opaque to avoid folding
        ADD/SUB/etc.)
      
      Differential Revision: https://reviews.llvm.org/D53181
      
      llvm-svn: 345102
      4f82406c
    • Lang Hames's avatar
      [ORC] Re-apply r345077 with fixes to remove ambiguity in lookup calls. · 23cb2e7f
      Lang Hames authored
      llvm-svn: 345098
      23cb2e7f
    • Teresa Johnson's avatar
      Revert "[ThinLTO] Fix a crash in lazy loading of Metadata" · 7c6344a6
      Teresa Johnson authored
      This reverts commit r345095. It was accidentally committed.
      
      llvm-svn: 345097
      7c6344a6
    • Teresa Johnson's avatar
      [hot-cold-split] Only perform splitting in ThinLTO backend post-link · d725335b
      Teresa Johnson authored
      Summary:
      Fix the new PM to only perform hot cold splitting once during ThinLTO,
      by skipping it in the pre-link phase.
      
      This was already fixed in the old PM by the move of the hot cold split
      pass later (after the early return when PrepareForThinLTO) by r344869.
      
      Reviewers: vsk, sebpop, hiraditya
      
      Subscribers: mehdi_amini, inglorion, eraman, steven_wu, dexonsmith, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D53611
      
      llvm-svn: 345096
      d725335b
    • Teresa Johnson's avatar
      [ThinLTO] Fix a crash in lazy loading of Metadata · 3513dc24
      Teresa Johnson authored
      Summary:
      This is a revised version of D41474.
      
      When the debug location is parsed in BitcodeReader::parseFunction, the
      scope and inlinedAt MDNodes are obtained via MDLoader->getMDNodeFwdRefOrNull(),
      which will create a forward ref if they were not yet loaded.
      Specifically, if one of these MDNodes is in the module level metadata
      block, and this is during ThinLTO importing, that metadata block is
      lazily loaded.
      
      Most places in that invoke getMDNodeFwdRefOrNull have a corresponding call
      to resolveForwardRefsAndPlaceholders which will take care of resolving them.
      E.g. places that call getMetadataFwdRefOrLoad, or at the end of parsing a
      function-level metadata block, or at the end of the initial lazy load of
      module level metadata in order to handle invocations of getMDNodeFwdRefOrNull
      for named metadata and global object attachments. However, the calls for
      the scope/inlinedAt of debug locations are not backed by any such call to
      resolveForwardRefsAndPlaceholders.
      
      To fix this, change the scope and inlinedAt parsing to instead use
      getMetadataFwdRefOrLoad, which will ensure the forward refs to lazily
      loaded metadata are resolved.
      
      Fixes PR35472.
      
      Reviewers: dexonsmith, Sunil_Srivastava, vsk
      
      Subscribers: inglorion, eraman, steven_wu, sebpop, mehdi_amini, dmikulin, vsk, hiraditya, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D53596
      
      llvm-svn: 345095
      3513dc24
    • Fangrui Song's avatar
      Actually fix test from r345085 REQUIRE: asserts · fa735b0e
      Fangrui Song authored
      llvm-svn: 345090
      fa735b0e
    • Fangrui Song's avatar
      Fix test after r345085 · 54b825ca
      Fangrui Song authored
      llvm-svn: 345089
      54b825ca
  2. Oct 23, 2018
    • Craig Topper's avatar
      [X86] Autogenerate comple checks. NFC · e01d516a
      Craig Topper authored
      llvm-svn: 345087
      e01d516a
    • Zhizhou Yang's avatar
      Print out DebugCounter info with -print-debug-counter · 13f76f84
      Zhizhou Yang authored
      Summary:
      This patch will print out {Counter, Skip, StopAfter} info of all passes which have DebugCounter set at destruction.
      
      It can be used to monitor how many times does certain transformation happen in a pass, and also help check if -debug-counter option is set correctly.
      
      Please refer to this [[ http://lists.llvm.org/pipermail/llvm-dev/2018-July/124722.html  | thread ]] for motivation.
      
      Reviewers: george.burgess.iv, davide, greened
      
      Reviewed By: greened
      
      Subscribers: kristina, llozano, mgorny, llvm-commits, mgrang
      
      Differential Revision: https://reviews.llvm.org/D50031
      
      llvm-svn: 345085
      13f76f84
    • Jonas Devlieghere's avatar
      [dwarfdump] Make incompatibility between -diff and -verbose explicit. · 3ef53e10
      Jonas Devlieghere authored
      Using -diff and -verbose together doesn't work today. We should audit
      where these two options interact and fix them. In the meantime we error
      out when the user try to specify both.
      
      llvm-svn: 345084
      3ef53e10
    • Matt Arsenault's avatar
      Fix typo in verifier error message · 9ef8e51c
      Matt Arsenault authored
      llvm-svn: 345083
      9ef8e51c
    • Peter Collingbourne's avatar
      CGP: Clear data structures at the end of a loop iteration instead of the beginning. · abd820a9
      Peter Collingbourne authored
      Clearing LargeOffsetGEPMap at the end fixes a bug where if a large
      offset GEP is in a dead basic block, we fail an assertion when trying
      to delete the block due to the asserting VH in LargeOffsetGEPMap.
      
      Differential Revision: https://reviews.llvm.org/D53464
      
      llvm-svn: 345082
      abd820a9
    • Jordan Rupprecht's avatar
      [llvm-objcopy] Fix use-after-move clang-tidy warning · ab9f6626
      Jordan Rupprecht authored
      llvm-svn: 345079
      ab9f6626
    • Reid Kleckner's avatar
      Revert r345077 "[ORC] Change how non-exported symbols are matched during lookup." · db367e95
      Reid Kleckner authored
      Doesn't build on Windows. The call to 'lookup' is ambiguous. Clang and
      MSVC agree, anyway.
      
      http://lab.llvm.org:8011/builders/clang-x64-windows-msvc/builds/787
      C:\b\slave\clang-x64-windows-msvc\build\llvm.src\unittests\ExecutionEngine\Orc\CoreAPIsTest.cpp(315): error C2668: 'llvm::orc::ExecutionSession::lookup': ambiguous call to overloaded function
      C:\b\slave\clang-x64-windows-msvc\build\llvm.src\include\llvm/ExecutionEngine/Orc/Core.h(823): note: could be 'llvm::Expected<llvm::JITEvaluatedSymbol> llvm::orc::ExecutionSession::lookup(llvm::ArrayRef<llvm::orc::JITDylib *>,llvm::orc::SymbolStringPtr)'
      C:\b\slave\clang-x64-windows-msvc\build\llvm.src\include\llvm/ExecutionEngine/Orc/Core.h(817): note: or       'llvm::Expected<llvm::JITEvaluatedSymbol> llvm::orc::ExecutionSession::lookup(const llvm::orc::JITDylibSearchList &,llvm::orc::SymbolStringPtr)'
      C:\b\slave\clang-x64-windows-msvc\build\llvm.src\unittests\ExecutionEngine\Orc\CoreAPIsTest.cpp(315): note: while trying to match the argument list '(initializer list, llvm::orc::SymbolStringPtr)'
      
      llvm-svn: 345078
      db367e95
    • Lang Hames's avatar
      [ORC] Change how non-exported symbols are matched during lookup. · 841796de
      Lang Hames authored
      In the new scheme the client passes a list of (JITDylib&, bool) pairs, rather
      than a list of JITDylibs. For each JITDylib the boolean indicates whether or not
      to match against non-exported symbols (true means that they should be found,
      false means that they should not). The MatchNonExportedInJD and MatchNonExported
      parameters on lookup are removed.
      
      The new scheme is more flexible, and easier to understand.
      
      This patch also updates JITDylib search orders to be lists of (JITDylib&, bool)
      pairs to match the new lookup scheme. Error handling is also plumbed through
      the LLJIT class to allow regression tests to fail predictably when a lookup from
      a lazy call-through fails.
      
      llvm-svn: 345077
      841796de
    • Michael Kruse's avatar
      [test-suite/doc] Add list of programs we might add. · 53c722df
      Michael Kruse authored
      Add a list of benchmarks, applications and algorithms which are under
      discussion to be added to the test-suite.
      
      The initial list includes the the benchmarks mentioned at
      https://llvm.org/PR34216, missing SPEC benchmarks, some image processing
      algorithms and a few others. The bug tracker only allows adding to the
      discussion, not removing, commenting, adding details to individual
      benchmarks.
      
      The first proposal was to add these benchmark into the test-suite
      repository, but after a discussion, adding it to llvm/docs/Proposals
      seem more appropriate. One advantage is that llvm.org will have a
      browsable web page with these suggestions.
      
      Suggested-by: Hal Finkel
      
      Differential Revision: https://reviews.llvm.org/D46714
      
      llvm-svn: 345074
      53c722df
    • Vedant Kumar's avatar
      [HotColdSplitting] Attach MinSize to outlined code · 50315461
      Vedant Kumar authored
      Outlined code is cold by assumption, so it makes sense to optimize it
      for minimal code size rather than performance.
      
      After r344869 moved the splitting pass to the end of the IR pipeline,
      this does not result in much of a code size reduction. This is probably
      because a comparatively small number backend transforms make use of the
      MinSize hint.
      
      Running LNT on x86_64, I see that 33/1020 binaries shrink for a total of
      919 bytes of TEXT reduction. I didn't measure a significant performance
      impact.
      
      Differential Revision: https://reviews.llvm.org/D53518
      
      llvm-svn: 345072
      50315461
    • Simon Pilgrim's avatar
      [X86][SSE] Revert rL343922 combinePMULDQ AddToWorklist (PR39398) · b6c57075
      Simon Pilgrim authored
      We can't add the MULDQ node back to the worklist after the demanded bits change has been committed in case the node has been removed entirely. This will have to wait until we have SimplifyDemandedBitsForTargetNode.
      
      llvm-svn: 345070
      b6c57075
    • Jordan Rupprecht's avatar
      [llvm-strip] Support -s alias for --strip-all. Make both strip and objcopy... · aaeaa0a8
      Jordan Rupprecht authored
      [llvm-strip] Support -s alias for --strip-all. Make both strip and objcopy case sensitive to support both -s (--strip-all) and -S (--strip-debug).
      
      Summary:
      GNU strip supports both `-s` and `-S` as aliases for `--strip-all` and `--strip-debug`, respectfully.
      
      As part of this, it turns out that strip/objcopy were accepting case insensitive command line args. I'm not sure if there was an explicit reason for this. The only others uses of this are llvm-cvtres/llvm-mt/llvm-lib, which are all tools specific for windows support. Forcing case sensitivity allows both aliases to exist, but seems like a good idea anyway.
      
      And as a surprise test case adjustment, the llvm-strip unit test was running with `-keep=unavailable_symbol`, despite `keep` not be a valid flag for strip. This is because there is a flag `-K` which, when case insensitivity is permitted, allows it to be interpreted as `-K` = `eep=unavailable_symbol` (e.g. to allow `-Kfoo` == `--keep-symbol=foo`).
      
      Reviewers: jakehehrlich, jhenderson, alexshap
      
      Reviewed By: jakehehrlich
      
      Subscribers: llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D53163
      
      llvm-svn: 345068
      aaeaa0a8
    • Simon Pilgrim's avatar
      [LegalizeDAG] Share Vector/Scalar CTPOP Expansion · 8c4796de
      Simon Pilgrim authored
      As suggested on D53258, this patch move the CTPOP expansion code from SelectionDAGLegalize to TargetLowering to allow it to be reused by the VectorLegalizer.
      
      Proper vector support will be added by D53258.
      
      llvm-svn: 345066
      8c4796de
    • Roman Lebedev's avatar
      X86DAGToDAGISel::matchBitExtract(): lambdas can't have default arguments. · 2fae9857
      Roman Lebedev authored
      As reported by ctopper.
      That is a gcc-only warning at the moment.
      
      llvm-svn: 345065
      2fae9857
    • Simon Pilgrim's avatar
      [LegalizeDAG] Share Vector/Scalar CTLZ Expansion · d705ba97
      Simon Pilgrim authored
      As suggested on D53258, this patch shares common CTLZ expansion code between VectorLegalizer and SelectionDAGLegalize by putting it in TargetLowering.
      
      Extension to D53474
      
      llvm-svn: 345060
      d705ba97
    • Daniel Sanders's avatar
      Fix MSVC build by correcting placement of declspec after r345056 · d0ef6898
      Daniel Sanders authored
      Going by the MSVC toolchains at godbolt.org, declspec comes after the template<...>.
      
      llvm-svn: 345059
      d0ef6898
    • Fangrui Song's avatar
      [IR] Fix -Wunused-function after r345052 · 531e3d0c
      Fangrui Song authored
      llvm-svn: 345057
      531e3d0c
    • Daniel Sanders's avatar
      [tblgen] Allow FixedLenDecoderEmitter to use APInt-like objects as InsnType · d300ba1e
      Daniel Sanders authored
      Summary:
      Some targets have very long encodings and uint64_t isn't sufficient. uint128_t
      isn't portable so such targets need to use an object instead.
      
      There is one catch with this at the moment, no string of bits extracted
      from the encoding may exceeed 64-bits. Fields are still permitted to
      exceed 64-bits so long as they aren't one contiguous string of bits. If
      this proves to be a problem then we can modify the generation of
      fieldFromInstruction() calls to account for it but for now I've added an
      assertion for this.
      
      InsnType must either be integral or an APInt-like object that must:
      * Have a static const max_size_in_bits equal to the number of bits in the encoding.
      * be default-constructible and copy-constructible
      * be constructible from a uint64_t (this is the key area the interface deviates
        from APInt since this constructor does not take the bit width)
      * be constructible from an APInt (this can be private)
      * be convertible to uint64_t
      * Support the ~, &,, ==, !=, and |= operators with other objects of the same type
      * Support shift (<<, >>) with signed and unsigned integers on the RHS
      * Support put (<<) to raw_ostream&
      
      Reviewers: bogner, charukcs
      
      Subscribers: nhaehnle, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D52100
      
      llvm-svn: 345056
      d300ba1e
    • Reid Kleckner's avatar
      [PDB] Fix -Wunused-private-field in DIA · 07589729
      Reid Kleckner authored
      llvm-svn: 345054
      07589729
    • Stefan Pintilie's avatar
      [Power9] Add __float128 support in the backend for bitcast to a i128 · 927e8bf3
      Stefan Pintilie authored
      Add support to allow bit-casting from f128 to i128 and then
      extracting 64 bits from the result.
      
      Differential Revision: https://reviews.llvm.org/D49507
      
      llvm-svn: 345053
      927e8bf3
    • Sanjay Patel's avatar
      [IR] remove fake binop queries for not/neg · 07076cfd
      Sanjay Patel authored
      The initial motivation is that we want to remove the
      fneg API because that would silently fail if we add
      an actual fneg instruction to IR. The same would be
      true for the integer ops, so we might as well get rid 
      of these too.
      
      We have a newer 'match' API that makes checking for
      these patterns simpler. It also works with vectors
      that may include undef elements in constants.
      
      If any out-of-tree users need updating, they can model
      their code changes on these commits:
      rL345050
      rL345043
      rL345042
      rL345041
      rL345036
      rL345030
      
      llvm-svn: 345052
      07076cfd
    • Sanjay Patel's avatar
      [InstCombine] use 'match' to simplify code · 95790c54
      Sanjay Patel authored
      There's probably some vector-with-undef-element pattern
      that shows an improvement, so this is probably not quite
      'NFC'.
      
      This is the last step towards removing the fake binop 
      queries for not/neg. Ie, there are no more uses of those
      functions in trunk. Fneg should follow.
      
      llvm-svn: 345050
      95790c54
    • Simon Pilgrim's avatar
    • Jordan Rupprecht's avatar
      [DebugInfo][GlobalOpt] Fix -debugify for globalopt shrinking globals to booleans. · 2fed6ac1
      Jordan Rupprecht authored
      Summary:
      TryToShrinkGlobalToBoolean, when possible, will split store <value> + load <value> into store <bool> + select <bool ? value : 0>. This preserves DebugLoc during that pass.
      
      Fixes PR37959. The test case here is the simplified .ll for:
      
      ```
      static int foo;
      int bar() {
        foo = 5;
        return foo;
      }
      ```
      
      Reviewers: dblaikie, gbedwell, aprantl
      
      Reviewed By: dblaikie
      
      Subscribers: mehdi_amini, JDevlieghere, dexonsmith, llvm-commits
      
      Tags: #debug-info
      
      Differential Revision: https://reviews.llvm.org/D53531
      
      llvm-svn: 345046
      2fed6ac1
    • Simon Pilgrim's avatar
      [CostModel][X86] Add transpose shuffle cost tests · f1d8b7c4
      Simon Pilgrim authored
      llvm-svn: 345045
      f1d8b7c4
    • Sanjay Patel's avatar
      [WebAssembly] use 'match' to simplify code; NFC · 47a52a05
      Sanjay Patel authored
      Vector types are not possible here because this code explicitly
      checks for a scalar type, but this is another step towards 
      completely removing the fake binop queries for not/neg/fneg.
      
      llvm-svn: 345043
      47a52a05
    • Sanjay Patel's avatar
      [Reassociate] replace fake binop queries with 'match' API · 5b6b090c
      Sanjay Patel authored
      We need to update this code before introducing an 'fneg' instruction in IR,
      so we might as well kill off the integer neg/not queries too.
      
      This is no-functional-change-intended for scalar code and most vector code. 
      For vectors, we can see that the 'match' API allows for undef elements in 
      constants, so we optimize those cases better.
      
      Ideally, there would be a test for each code diff, but I don't see evidence
      of that for the existing code, so I didn't try very hard to come up with new 
      vector tests for each code change.
      
      Differential Revision: https://reviews.llvm.org/D53533
      
      llvm-svn: 345042
      5b6b090c
    • Sanjay Patel's avatar
      [SelectionDAG] use 'match' to simplify code; NFC · ad12df82
      Sanjay Patel authored
        
      Vector types are not possible here because this code only starts
      matching from the scalar bool value of a conditional branch, but
      this is another step towards completely removing the fake binop
      queries for not/neg/fneg.
      
      llvm-svn: 345041
      ad12df82
    • Benjamin Kramer's avatar
      [LegalizeDAG] Remove unused variable · 1e212e8a
      Benjamin Kramer authored
      llvm-svn: 345040
      1e212e8a
    • Simon Pilgrim's avatar
      [LegalizeDAG] Share Vector/Scalar CTTZ Expansion · b975ff47
      Simon Pilgrim authored
      As suggested on D53258, this patch demonstrates sharing common CTTZ expansion code between VectorLegalizer and SelectionDAGLegalize by putting it in TargetLowering.
      
      I intend to move CTLZ and (scalar) CTPOP over as well and then update D53258 accordingly.
      
      Differential Revision: https://reviews.llvm.org/D53474
      
      llvm-svn: 345039
      b975ff47
    • Simon Pilgrim's avatar
      [SLPVectorizer] Add basic support for mul/and/or/xor horizontal reductions · 532a0f12
      Simon Pilgrim authored
      Expand arithmetic reduction to include mul/and/or/xor instructions.
      
      This patch just fixes the SLPVectorizer - the effective reduction costs for AVX1+ are still poor (see rL344846) and will need to be improved before SLP sees this as a valid transform - but we can already see the effect on SSE2 tests.
      
      This partially helps PR37731, but doesn't fix it all as it still falls over on the extraction/reduction order for some reason.
      
      Differential Revision: https://reviews.llvm.org/D53473
      
      llvm-svn: 345037
      532a0f12
    • Sanjay Patel's avatar
      [InstCombine] use 'match' to handle vectors and simplify code · 747feb28
      Sanjay Patel authored
      This is another step towards completely removing the fake 
      binop queries for not/neg/fneg.
      
      llvm-svn: 345036
      747feb28
    • Sanjay Patel's avatar
      [InstCombine] swap select profile metadata when swapping select ops · ad76c682
      Sanjay Patel authored
      llvm-svn: 345034
      ad76c682
Loading