Skip to content
  1. Aug 28, 2018
    • Ryan Taylor's avatar
      [AMDGPU] Add support for a16 modifiear for gfx9 · 1f334d00
      Ryan Taylor authored
      Summary:
      Adding support for a16 for gfx9. A16 bit replaces r128 bit for gfx9.
      
      Change-Id: Ie8b881e4e6d2f023fb5e0150420893513e5f4841
      
      Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, jfb, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D50575
      
      llvm-svn: 340831
      1f334d00
    • Mikael Holmen's avatar
      [CloneFunction] Constant fold terminators before checking single predecessor · 4d652c4c
      Mikael Holmen authored
      Summary:
      This fixes PR31105.
      
      There is code trying to delete dead code that does so by e.g. checking if
      the single predecessor of a block is the block itself.
      
      That check fails on a block like this
       bb:
         br i1 undef, label %bb, label %bb
      since that has two (identical) predecessors.
      
      However, after the check for dead blocks there is a call to
      ConstantFoldTerminator on the basic block, and that call simplifies the
      block to
       bb:
         br label %bb
      
      Therefore we now do the call to ConstantFoldTerminator before the check if
      the block is dead, so it can realize that it really is.
      
      The original behavior lead to the block not being removed, but it was
      simplified as above, and then we did a call to
          Dest->replaceAllUsesWith(&*I);
      with old and new being equal, and an assertion triggered.
      
      Reviewers: chandlerc, fhahn
      
      Reviewed By: fhahn
      
      Subscribers: eraman, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D51280
      
      llvm-svn: 340820
      4d652c4c
    • Alexandros Lamprineas's avatar
      [GVNHoist] Prune out useless CHI insertions · 484bd13e
      Alexandros Lamprineas authored
      Fix for the out-of-memory error when compiling SemaChecking.cpp
      with GVNHoist and ubsan enabled. I've used a cache for inserted
      CHIs to avoid excessive memory usage.
      
      Differential Revision: https://reviews.llvm.org/D50323
      
      llvm-svn: 340818
      484bd13e
    • Simon Pilgrim's avatar
      [X86][SSE] Improve variable scalar shift of vXi8 vectors (PR34694) · af985870
      Simon Pilgrim authored
      This patch creates the shift mask and actual shift using the vXi16 vector shift ops.
      
      Differential Revision: https://reviews.llvm.org/D51263
      
      llvm-svn: 340813
      af985870
    • Simon Pilgrim's avatar
      [X86][SSE] Avoid vector extraction/insertion for non-constant uniform shifts · f119e27d
      Simon Pilgrim authored
      As discussed on D51263, we're better off using byte shifts to clear the upper bits on pre-SSE41 hardware.
      
      llvm-svn: 340810
      f119e27d
    • Max Kazantsev's avatar
      [NFC] A loop can never contain Ret instruction · 0c4b84e2
      Max Kazantsev authored
      llvm-svn: 340808
      0c4b84e2
    • David Chisnall's avatar
      Fix in getAllocationDataForFunction · 5e52cadf
      David Chisnall authored
      Summary:
      Correct to use set like behaviour of AllocType.  Should check for
      subset, not precise value.
      
      Reviewers: theraven
      
      Reviewed By: theraven
      
      Subscribers: hiraditya, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D50959
      
      llvm-svn: 340807
      5e52cadf
    • Craig Topper's avatar
      [X86] Fix some comments to refer to KORTEST not KTEST. NFC · c1436db7
      Craig Topper authored
      KTEST is a different instruction. All of this code uses KORTEST.
      
      llvm-svn: 340799
      c1436db7
    • Craig Topper's avatar
      [DAGCombiner][AMDGPU][Mips] Fold bitcast with volatile loads if the resulting... · c7506b28
      Craig Topper authored
      [DAGCombiner][AMDGPU][Mips] Fold bitcast with volatile loads if the resulting load is legal for the target.
      
      Summary:
      I'm not sure if this patch is correct or if it needs more qualifying somehow. Bitcast shouldn't change the size of the load so it should be ok? We already do something similar for stores. We'll change the type of a volatile store if the resulting store is Legal or Custom. I'm not sure we should be allowing Custom there...
      
      I was playing around with converting X86 atomic loads/stores(except seq_cst) into regular volatile loads and stores during lowering. This would allow some special RMW isel patterns in X86InstrCompiler.td to be removed. But there's some floating point patterns in there that didn't work because we don't fold (f64 (bitconvert (i64 volatile load))) or (f32 (bitconvert (i32 volatile load))).
      
      Reviewers: efriedma, atanasyan, arsenm
      
      Reviewed By: efriedma
      
      Subscribers: jvesely, arsenm, sdardis, kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, arichardson, jrtc27, atanasyan, jfb, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D50491
      
      llvm-svn: 340797
      c7506b28
    • Craig Topper's avatar
      [InstCombine] Extend (add (sext x), cst) --> (sext (add x, cst')) and (add... · a6cd4b9b
      Craig Topper authored
      [InstCombine] Extend (add (sext x), cst) --> (sext (add x, cst')) and (add (zext x), cst) --> (zext (add x, cst')) to work for vectors
      
      Differential Revision: https://reviews.llvm.org/D51236
      
      llvm-svn: 340796
      a6cd4b9b
    • Kit Barton's avatar
      [PPC] Remove Darwin support from POWER backend. · 7c80f98b
      Kit Barton authored
      This patch issues an error message if Darwin ABI is attempted with the PPC
      backend. It also cleans up existing test cases, either converting the test to
      use an alternative triple or removing the test if the coverage is no longer
      needed.
      
      Updated Tests
      -------------
      The majority of test cases were updated to use a different triple that does not
      include the Darwin ABI. Many tests were also updated to use FileCheck, in place
      of grep.
      
      Deleted Tests
      -------------
      llvm/test/tools/dsymutil/PowerPC/sibling.test was originally added to test
      specific functionality of dsymutil using an object file created with an old
      version of llvm-gcc for a Powerbook G4. After a discussion with @JDevlieghere he
      suggested removing the test.
      
      llvm/test/CodeGen/PowerPC/combine_loads_from_build_pair.ll was converted from a
      PPC test to a SystemZ test, as the behavior is also reproducible there.
      
      All other tests that were deleted were specific to the darwin/ppc ABI and no
      longer necessary.
      
      Phabricator Review: https://reviews.llvm.org/D50988
      
      llvm-svn: 340795
      7c80f98b
    • David Blaikie's avatar
      Revert "[CodeGenPrepare] Scan past debug intrinsics to find select candidates (NFC)" · 7d306532
      David Blaikie authored
      This causes crashes due to the interleaved dbg.value intrinsics being
      left at the end of basic blocks, causing the actual terminators (br,
      etc) to be not where they should be (not at the end of the block),
      leading to later crashes.
      
      Further discussion on the original commit thread.
      
      This reverts commit r340368.
      
      llvm-svn: 340794
      7d306532
    • George Burgess IV's avatar
      [MemorySSA] Add NDEBUG checks to verifiers; NFC · 6a9aa02f
      George Burgess IV authored
      verify*() methods are intended to have no side-effects (unless we detect
      broken MSSA, in which case they assert()), and all of the other verify
      methods are wrapped by `#ifndef NDEBUG`.
      
      llvm-svn: 340793
      6a9aa02f
    • Sanjay Patel's avatar
      [InstCombine] fix formatting; NFC · c615910b
      Sanjay Patel authored
      llvm-svn: 340790
      c615910b
    • Sanjay Patel's avatar
      [InstCombine] allow shuffle+binop canonicalization with widening shuffles · 42d31c20
      Sanjay Patel authored
      This lines up with the behavior of an existing transform where if both 
      operands of the binop are shuffled, we allow moving the binop before the 
      shuffle regardless of whether the shuffle changes the size of the vector.
      
      llvm-svn: 340787
      42d31c20
    • Evandro Menezes's avatar
      [PATCH] [InstCombine] Fix issue in the simplification of pow() with nested exp{,2}() · 253991cf
      Evandro Menezes authored
      Fix the issue of duplicating the call to `exp{,2}()` when it's nested in
      `pow()`, as exposed by rL340462.
      
      Differential revision: https://reviews.llvm.org/D51194
      
      llvm-svn: 340784
      253991cf
    • George Burgess IV's avatar
      s/std::set/DenseSet/; NFC · aa09a82b
      George Burgess IV authored
      We only use this set for `insert` and `count`, so a hashing container
      seems better here.
      
      llvm-svn: 340783
      aa09a82b
    • Brendon Cahoon's avatar
      [Pipeliner] Fix incorrect phi values in the epilog and kernel · e3841eea
      Brendon Cahoon authored
      The code that generates the loop definition operand for phis
      in the epilog and kernel is incorrect in some cases.
      
      In the kernel, when a phi refers to another phi, the code that
      updates PhiOp2 needs to include the stage difference between
      the two phis.
      
      In the epilog, the check for using the loop definition instead
      of the phi definition uses the StageDiffAdj value (the difference
      between the phi stage and the loop definition stage), but the
      adjustment is not needed to determine if the current stage
      contains an iteration with the loop definition.
      
      Differential Revision: https://reviews.llvm.org/D51167
      
      llvm-svn: 340782
      e3841eea
    • Thomas Lively's avatar
      [WebAssembly] TableGen backend for stackifying instructions · 211874d2
      Thomas Lively authored
      Summary:
      The new stackification backend generates the giant switch statement
      used to translate instructions to their stackified forms. I did this
      because it was more interesting than adding all the different vector
      versions of the various SIMD instructions to the switch statment
      manually.
      
      Reviewers: aardappel, aheejin, dschuff
      
      Subscribers: mgorny, sbc100, jgravelle-google, sunfish, jfb, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D51318
      
      llvm-svn: 340781
      211874d2
  2. Aug 27, 2018
Loading