Skip to content
  1. Jun 01, 2020
  2. May 31, 2020
  3. May 30, 2020
    • Whitney Tsang's avatar
      [LoopUnroll] Add a test case for rG7873376bb36b. · 0fee91a1
      Whitney Tsang authored
      rG7873376bb36b fixes a build failure for allyesconfig.
      
      The problem happened when the single exiting block doesn't dominate the
      loop latch, then the immediate dominator of the exit block should not be
      the exiting block after unrolling. As the exiting block of
      different unrolled iteration can branch to the exit block, and the ith
      exiting block doesn't dominate (i+1)th exiting block, the immediate
      dominator of the exit block should not the nearest common dominator of
      the exiting block and the loop latch of the same iteration.
      
      Differential Revision: https://reviews.llvm.org/D80477
      0fee91a1
    • Philip Reames's avatar
    • zoecarver's avatar
      [DSE] Remove noop stores in MSSA. · 065bf124
      zoecarver authored
      Adds a simple fast-path check for the pattern:
      v = load ptr
      store v to ptr
      
      I took the tests from the bugzilla post, I can add more if needed (but I think these should be sufficent).
      
      Refs: https://bugs.llvm.org/show_bug.cgi?id=45795
      
      Differential Revision: https://reviews.llvm.org/D79391
      065bf124
    • Florian Hahn's avatar
      [BasicAA] Use known lower bounds for index values for size based check. · d99a1848
      Florian Hahn authored
      Currently, BasicAA does not exploit information about value ranges of
      indexes. For example, consider the 2 pointers %a = %base and
      %b = %base + %stride below, assuming they are used to access 4 elements.
      
      If we know that %stride >= 4, we know the accesses do not alias. If
      %stride is a constant, BasicAA currently gets that. But if the >= 4
      constraint is encoded using an assume, it misses the NoAlias.
      
      This patch extends DecomposedGEP to include an additional MinOtherOffset
      field, which tracks the constant offset similar to the existing
      OtherOffset, which the difference that it also includes non-negative
      lower bounds on the range of the index value. When checking if the
      distance between 2 accesses exceeds the access size, we can use this
      improved bound.
      
      For now this is limited to using non-negative lower bounds for indices,
      as this conveniently skips cases where we do not have a useful lower
      bound (because it is not constrained). We potential miss out in cases
      where the lower bound is constrained but negative, but that can be
      exploited in the future.
      
      Reviewers: sanjoy, hfinkel, reames, asbirlea
      
      Reviewed By: asbirlea
      
      Differential Revision: https://reviews.llvm.org/D76194
      d99a1848
    • Craig Topper's avatar
      [X86] Autogenerate complete checks. NFC · c65c1d78
      Craig Topper authored
      c65c1d78
    • Martin Storsjö's avatar
    • Martin Storsjö's avatar
      [AArch64] Treat x18 as callee-saved in functions with windows calling... · cf97e0ec
      Martin Storsjö authored
      [AArch64] Treat x18 as callee-saved in functions with windows calling convention on non-windows OSes
      
      Treat it as callee-saved, and always back it up. When windows code calls
      entry points in unix code, marked with the windows calling convention,
      that unix code can call other functions that isn't compiled with
      -ffixed-x18 which may clobber x18 freely. By backing it up and restoring
      it on return, we preserve the register across the function call,
      fulfilling this part of the windows calling convention on another OS.
      
      This isn't enough for making sure that x18 is preseved when non-windows
      code does a callback to windows code, but is a clear improvement over
      the current status quo. Additionally, wine is nowadays building many
      modules as PE DLLs, which avoids the callback issue altogether for those
      DLLs.
      
      Differential Revision: https://reviews.llvm.org/D61892
      cf97e0ec
    • Sourabh Singh Tomar's avatar
      [DWARF5] Added support for emission of .debug_macro.dwo section · 20c9bb44
      Sourabh Singh Tomar authored
      This patch adds support for emission of following DWARFv5 macro
      forms in .debug_macro.dwo section:
      
      - DW_MACRO_start_file
      - DW_MACRO_end_file
      - DW_MACRO_define_strx
      - DW_MACRO_undef_strx
      
      Reviewed By: dblaikie
      
      Differential Revision: https://reviews.llvm.org/D78866
      20c9bb44
    • Eric Christopher's avatar
      Fix full unrolling with new pass manager. · c554c5e1
      Eric Christopher authored
      Last we looked at this and couldn't come up with a reason to change
      it, but with a pragma for full loop unrolling we bypass every other
      loop unroll and then fail to fully unroll a loop when the pragma is set.
      
      Move the OnlyWhenForced out of the check and into the initialization
      of the full unroll pass in the new pass manager. This doesn't show up
      with the old pass manager.
      
      Add a new option to opt so that we can turn off loop unrolling
      manually since this is a difference between clang and opt.
      
      Tested with check-clang and check-llvm.
      c554c5e1
    • Carl Ritson's avatar
      [AMDGPU] Remove assertion on S1024 SGPR to VGPR spill · d0414778
      Carl Ritson authored
      Summary:
      Replace an assertion that blocks S1024 SGPR to VGPR spill.
      The assertion pre-dates S1024 and is not wave size dependent.
      
      Reviewers: arsenm, sameerds, rampitec
      
      Reviewed By: arsenm
      
      Subscribers: qcolombet, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits
      
      Tags: #llvm
      
      Differential Revision: https://reviews.llvm.org/D80783
      d0414778
    • Matt Arsenault's avatar
      AMDGPU: Optimize s_setreg_b32 to s_denorm_mode/s_round_mode · 0892a96a
      Matt Arsenault authored
      This is a custom inserter because it was less work than teaching
      tablegen a way to indicate that it is sometimes OK to have a no side
      effect instruction in the output of a side effecting pattern.
      
      The asm is needed to look like a read of the mode register to prevent
      it from being deleted. However, there seems to be a bug where the mode
      register def instructions are moved across the asm sideeffect by the
      post-RA scheduler.
      
      Another oddity is the immediate is formatted differently between
      s_denorm_mode and s_round_mode.
      0892a96a
    • Matt Arsenault's avatar
      AMDGPU: Add new baseline tests for setreg handling · 4f300d49
      Matt Arsenault authored
      Most of these should be identical and use a common prefix, but
      update_llc_test_checks is failing to generate shared checks for some
      reason.
      4f300d49
    • Matt Arsenault's avatar
      AMDGPU: Move MIMG MMO check to verifier · f012c58a
      Matt Arsenault authored
      f012c58a
Loading