Skip to content
  1. Mar 15, 2021
    • Hsiangkai Wang's avatar
      [RISCV] Support inline asm for vector instructions. · a81dff1e
      Hsiangkai Wang authored
      Types of fractional LMUL and LMUL=1 are all using VR register class. When
      using inline asm, it will use the first type in the register class as the
      type for the register. It is not necessary the same as the value type. We
      need to use INSERT_SUBVECTOR/EXTRACT_SUBVECToR/BITCAST to make it legal
      to put the value in the corresponding register class.
      
      Differential Revision: https://reviews.llvm.org/D97480
      a81dff1e
    • Stephen Kelly's avatar
      Revert "[AST] Add generator for source location introspection" · 6e303a98
      Stephen Kelly authored
      This reverts commit 91abaa1f.
      6e303a98
    • Craig Topper's avatar
      [RISCV] Give an explicit error if 'generic' CPU is passed instead of... · fcdf7f62
      Craig Topper authored
      [RISCV] Give an explicit error if 'generic' CPU is passed instead of 'generic-rv32' or 'generic-rv64'. Validate 64Bit feature against the triple.
      
      I encountered a project that uses llvm that passes "generic" by
      default. While I could fix that project, I wouldn't be surprised
      if other projects did something similar. So it seems like
      a good idea to provide a better error here.
      
      I've also added validation of the 64Bit feature against the
      triple so that we can catch a mismatched CPU before failing in
      a mysterious way. We can make it pretty far in isel because we
      calculate XLenVT from the triple and use that to set up the legal
      integer type.
      
      Reviewed By: luismarques, khchen
      
      Differential Revision: https://reviews.llvm.org/D98307
      fcdf7f62
    • Stephen Kelly's avatar
      [AST] Add generator for source location introspection · 91abaa1f
      Stephen Kelly authored
      Generate a json file containing descriptions of AST classes and their
      public accessors which return SourceLocation or SourceRange.
      
      Use the JSON file to generate a C++ API and implementation for accessing
      the source locations and method names for accessing them for a given AST
      node.
      
      This new API can be used to implement 'srcloc' output in clang-query:
      
        http://ce.steveire.com/z/m_kTIo
      
      The JSON file can also be used to generate bindings for other languages,
      such as Python and Javascript:
      
        https://steveire.wordpress.com/2019/04/30/the-future-of-ast-matching
      
      In this first version of this feature, only the accessors for Stmt
      classes are generated, not Decls, TypeLocs etc.  Those can be added
      after this change is reviewed, as this change is mostly about
      infrastructure of these code generators.
      
      Also in this version, the platforms/cmake configurations are excluded as
      much as possible so that support can be added iteratively.  Currently a
      break on any platform causes a revert of the entire feature.  This way,
      the `OR WIN32` can be removed in a future commit and if it breaks the
      buildbots, only that commit gets reverted, making the entire process
      easier to manage.
      
      Differential Revision: https://reviews.llvm.org/D93164
      91abaa1f
  2. Mar 14, 2021
  3. Mar 13, 2021
    • Philip Reames's avatar
      [CGP] Consolidate logic for getIVIncrement and isIVIncrement · 9228f2f3
      Philip Reames authored
      This fixes the bug demonstrated by the test case in the commit message of 8d20f2c2 (which was a revert of cf82700a).  The root issue was that we have two transforms which are inverses of each other.  We use one for simple induction variables (where we can use the post-inc form), and the other for everything else.  The problem was that the two transforms could disagree about whether something was an induction variable.
      
      The reverted commit made a change to one of the matcher routines which was used for one of the two transforms without updating the other matcher.  However, it's worth noting the existing code w/o the reverted change also has cases where the decision could differ between the two paths.
      
      The fix is simply to consolidate the code such that two paths must agree by construction, and to add an assert to catch any potential future re-divergence.
      
      Triggering the infinite loop requires side stepping the SunkAddrs cache.  The SunkAddrs cache has the effect of suppressing the iteration in the common case, but there are codepaths through CGP which restart iteration and clear this cache.
      
      Unfortunately, I have not been able to construct a standalone IR test case for this.  The original test case is a c++ program which when compiled by clang demonstrates the infinite loop, but all of my attempts at extracting an IR test case runnable through opt/llc have failed to reproduce.  (Including capturing the IR at point of the transform itself!)  I have no idea what weird state clang is creating here.
      
      I also tried creating a test case by hand, but gave up after about an hour of trying to find the right combination to dance through multiple transforms to create the end result needed to trip the bug.
      9228f2f3
    • Simonas Kazlauskas's avatar
      [InstCombine] Update GEP tests · 7ee96429
      Simonas Kazlauskas authored
      Adds a test for D98588 and updates the test checks.
      7ee96429
    • Nikita Popov's avatar
      [SROA] Regenerate test checks (NFC) · 6491e016
      Nikita Popov authored
      6491e016
    • Nikita Popov's avatar
      [MemCpyOpt] Handle read from lifetime.start with offset · 55566609
      Nikita Popov authored
      This fixes a regression from the MemDep-based implementation:
      MemDep completely ignores lifetime.start intrinsics that aren't
      MustAlias -- this is probably unsound, but it does mean that the
      MemDep based implementation successfully eliminated memcpy's from
      lifetime.start if the memcpy happens at an offset, rather than
      the base address of the alloca.
      
      Add a special case for the case where the lifetime.start spans the
      whole alloca (which is pretty much the only kind of lifetime.start
      that frontends ever emit), as we don't need to figure out our exact
      aliasing relationship in that case, the whole alloca is dead prior
      to the call.
      
      If this doesn't cover all practically relevant cases, then it
      would be possible to make use of the recently added PartialAlias
      clobber offsets to make this more precise.
      55566609
    • Nikita Popov's avatar
    • Craig Topper's avatar
      [DAGCombiner] Optimize 1-bit smulo to AND+SETNE. · 5b825433
      Craig Topper authored
      A 1-bit smulo overflows is both inputs are -1 since the result
      should be +1 which can't be represented in a signed 1 bit value.
      
      We can detect this with an AND and a setcc. The multiply result
      can also use the same AND.
      
      Reviewed By: RKSimon
      
      Differential Revision: https://reviews.llvm.org/D97634
      5b825433
    • Stefan Gränitz's avatar
    • kuterd's avatar
      [Attributor][fix] Remove problematic EXPENSIVE_CHECK · 44c1425c
      kuterd authored
      Remove the check that is causing compilation issues in
      some build configurations.
      44c1425c
    • Sanjay Patel's avatar
      [InstCombine] avoid creating an extra instruction in zext fold and possible inf-loop · 4224a369
      Sanjay Patel authored
      The structure of this fold is suspect vs. most of instcombine
      because it creates instructions and tries to delete them
      immediately after.
      
      If we don't have the operand types for the icmps, then we are
      not behaving as assumed. And as shown in PR49475, we can inf-loop.
      4224a369
    • Roman Lebedev's avatar
      Reland [SCEV] Improve modelling for (null) pointer constants · 78b8ce40
      Roman Lebedev authored
      This reverts commit 329aeb5d,
      and relands commit 61f006ac.
      
      This is a continuation of D89456.
      
      As it was suggested there, now that SCEV models `PtrToInt`,
      we can try to improve SCEV's pointer handling.
      In particular, i believe, i will need this in the future
      to further fix `SCEVAddExpr`operation type handling.
      
      This removes special handling of `ConstantPointerNull`
      from `ScalarEvolution::createSCEV()`, and add constant folding
      into `ScalarEvolution::getPtrToIntExpr()`.
      This way, `null` constants stay as such in SCEV's,
      but gracefully become zero integers when asked.
      
      Reviewed By: Meinersbur
      
      Differential Revision: https://reviews.llvm.org/D98147
      78b8ce40
    • Roman Lebedev's avatar
      [LSR] Don't try to fixup uses in 'EH pad' instructions · 6e9b9978
      Roman Lebedev authored
      The added test case crashes before this fix:
      ```
      opt: /repositories/llvm-project/llvm/lib/Transforms/Scalar/LoopStrengthReduce.cpp:5172: BasicBlock::iterator (anonymous namespace)::LSRInstance::AdjustInsertPositionForExpand(BasicBlock::iterator, const (anonymous namespace)::LSRFixup &, const (anonymous namespace)::LSRUse &, llvm::SCEVExpander &) const: Assertion `!isa<PHINode>(LowestIP) && !LowestIP->isEHPad() && !isa<DbgInfoIntrinsic>(LowestIP) && "Insertion point must be a normal instruction"' failed.
      ```
      This is fully analogous to the previous commit,
      with the pointer constant replaced to be something non-null.
      
      The comparison here can be strength-reduced,
      but the second operand of the comparison happens to be identical
      to the constant pointer in the `catch` case of `landingpad`.
      
      While LSRInstance::CollectLoopInvariantFixupsAndFormulae()
      already gave up on uses in blocks ending up with EH pads,
      it didn't consider this case.
      
      Eventually, `LSRInstance::AdjustInsertPositionForExpand()`
      will be called, but the original insertion point it will get
      is the user instruction itself, and it doesn't want to
      deal with EH pads, and asserts as much.
      
      It would seem that this basically never happens in-the-wild,
      otherwise it would have been reported already,
      so it seems safe to take the cautious approach,
      and just not deal with such users.
      6e9b9978
    • Roman Lebedev's avatar
      [NFC][LSR] Add test case that from https://reviews.llvm.org/D98147#2623549 · 23d591ef
      Roman Lebedev authored
      With that patch, this test fails with an assertion
      ```
      opt: /repositories/llvm-project/llvm/lib/Transforms/Scalar/LoopStrengthReduce.cpp:5169: BasicBlock::iterator (anonymous namespace)::LSRInstance::AdjustInsertPositionForExpand(BasicBlock::iterator, const (anonymous namespace)::LSRFixup &, const (anonymous namespace)::LSRUse &, llvm::SCEVExpander &) const: Assertion `!isa<PHINode>(LowestIP) && !LowestIP->isEHPad() && !isa<DbgInfoIntrinsic>(LowestIP) && "Insertion point must be a normal instruction"' failed.
      ```
      23d591ef
    • Nikita Popov's avatar
      [MemCpyOpt] Use AA to check for MustAlias between memset and memcpy · 2902bdee
      Nikita Popov authored
      Rather than checking for simple equality, check for MustAlias, as
      we do in other transforms. This catches equivalent GEPs.
      2902bdee
    • Nikita Popov's avatar
      [MemCpyOpt] Don't generate zero-size memset · 9080444f
      Nikita Popov authored
      If a memset destination is overwritten by a memcpy and the sizes
      are exactly the same, then the memset is simply dead. We can
      directly drop it, instead of replacing it with a memset of zero
      size, which is particularly ugly for the case of a dynamic size.
      9080444f
    • Nikita Popov's avatar
    • Nikita Popov's avatar
      [MemorySSA] Don't bail on phi starting access · b2f933a6
      Nikita Popov authored
      When calling getClobberingMemoryAccess() with MemoryLocation on a
      MemoryPHI starting access, the walker currently immediately bails
      and returns the starting access. This makes sense for the API that
      does not accept a location (as we wouldn't know what clobber we
      should be checking for), but doesn't make sense for the
      MemoryLocation-based API. This means that it can't look through
      a MemoryPHI if it's the starting access, but can if there is one
      more non-clobbering def in between. This patch removes the limitation.
      
      Differential Revision: https://reviews.llvm.org/D98557
      b2f933a6
    • luxufan's avatar
      change rvv frame layout · a9b9c64f
      luxufan authored
      This patch change the rvv frame layout that proposed in D94465. In patch D94465, In the eliminateFrameIndex function,
      to eliminate the rvv frame index, create temp virtual register is needed. This virtual register should be scavenged by class
      RegsiterScavenger. If the machine function has other unused registers, there is no problem. But if there isn't unused registers,
      we need a emergency spill slot. Because of the emergency spill slot belongs to the scalar local variables field, to access emergency
      spill slot, we need a temp virtual register again. This makes the compiler report the "Incomplete scavenging after 2nd pass" error.
      So I change the rvv frame layout as follows:
      
      ```
      |--------------------------------------|
      |   arguments passed on the stack      |
      |--------------------------------------|<--- fp
      |   callee saved registers             |
      |--------------------------------------|
      |   rvv vector objects(local variables |
      |   and outgoing arguments             |
      |--------------------------------------|
      |   realignment field                  |
      |--------------------------------------|
      |   scalar local variable(also contains|
      |   emergency spill slot)              |
      |--------------------------------------|<--- bp
      |   variable-sized local variables     |
      |--------------------------------------|<--- sp
      ```
      
      Differential Revision: https://reviews.llvm.org/D97111
      a9b9c64f
    • luxufan's avatar
      [RISCV] Remove redundancy -mattr=+d in test file · 5ddbd1fd
      luxufan authored
      Differential Revision: https://reviews.llvm.org/D97177
      5ddbd1fd
Loading