Skip to content
  1. Aug 28, 2018
    • Lang Hames's avatar
      [ORC] Add an addObjectFile method to LLJIT. · 37a66413
      Lang Hames authored
      The addObjectFile method adds the given object file to the JIT session, making
      its code available for execution.
      
      Support for the -extra-object flag is added to lli when operating in
      -jit-kind=orc-lazy mode to support testing of this feature.
      
      llvm-svn: 340870
      37a66413
    • Craig Topper's avatar
      [X86] Add intrinsics for KADD instructions · 9401fd0e
      Craig Topper authored
      These are intrinsics for supporting kadd builtins in clang. These builtins are already in gcc to implement intrinsics from icc. Though they are missing from the Intel Intrinsics Guide.
      
      This instruction adds two mask registers together as if they were scalar rather than a vXi1. We might be able to get away with a bitcast to scalar and a normal add instruction, but that would require DAG combine smarts in the backend to recoqnize add+bitcast. For now I'd prefer to go with the easiest implementation so we can get these builtins in to clang with good codegen.
      
      Differential Revision: https://reviews.llvm.org/D51370
      
      llvm-svn: 340869
      9401fd0e
    • Fangrui Song's avatar
      [AMDGPU] Fix -Wunused-variable when -DLLVM_ENABLE_ASSERTIONS=off · 9cca227d
      Fangrui Song authored
      llvm-svn: 340868
      9cca227d
    • Matt Morehouse's avatar
      Revert "[libFuzzer] Port to Windows" · bab8556f
      Matt Morehouse authored
      This reverts commit r340860 due to failing tests.
      
      llvm-svn: 340867
      bab8556f
    • Matt Arsenault's avatar
      AMDGPU: Don't delete instructions if S_ENDPGM has implicit uses · 755f41f3
      Matt Arsenault authored
      This can leave behind the uses with the defs removed.
      Since this should only really happen in tests, it's not worth the
      effort of trying to handle this.
      
      llvm-svn: 340866
      755f41f3
    • Aditya Nandakumar's avatar
      [GISel]: Add missing opcodes for overflow intrinsics · 6b4d343e
      Aditya Nandakumar authored
      https://reviews.llvm.org/D51197
      
      Currently, IRTranslator (and GISel) seems to be arbitrarily picking
      which overflow intrinsics get mapped into opcodes which either have a
      carry as an input or not.
      For intrinsics such as Intrinsic::uadd_with_overflow, translate it to an
      opcode (G_UADDO) which doesn't have any carry inputs (similar to LLVM
      IR).
      
      This patch adds 4 missing opcodes for completeness - G_UADDO, G_USUBO,
      G_SSUBE and G_SADDE.
      
      llvm-svn: 340865
      6b4d343e
    • Thomas Lively's avatar
      [WebAssembly][NFC] Document stackifier tablegen backend · adb6da10
      Thomas Lively authored
      Summary:
      Add comments to help readers avoid having to read tablegen backends to
      understand the code. Also remove unecessary breaks from the output.
      
      Reviewers: dschuff, aheejin
      
      Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D51371
      
      llvm-svn: 340864
      adb6da10
    • Andrea Di Biagio's avatar
      [llvm-mca] use llvm::any_of instead of std::any_of. NFC · 88d99a09
      Andrea Di Biagio authored
      llvm-svn: 340863
      88d99a09
    • Matt Arsenault's avatar
      AMDGPU: Force shrinking of add/sub even if the carry is used · 44a8a756
      Matt Arsenault authored
      The original motivating example uses a 64-bit add, so the carry
      is used. Insert a copy from VCC. This may allow shrinking of
      the used carry instruction. At worst, we are replacing a
      mov to materialize the constant with a copy of vcc.
      
      llvm-svn: 340862
      44a8a756
    • Thomas Lively's avatar
      [WebAssembly][NFC] Fix formatting from rL340781 · 94a91dba
      Thomas Lively authored
      Reviewers: aheejin
      
      Subscribers: dschuff, sbc100, jgravelle-google, sunfish, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D51367
      
      llvm-svn: 340861
      94a91dba
    • Matt Morehouse's avatar
      [libFuzzer] Port to Windows · c6fff3b6
      Matt Morehouse authored
      Summary:
      Port libFuzzer to windows-msvc.
      This patch allows libFuzzer targets to be built and run on Windows, using -fsanitize=fuzzer and/or fsanitize=fuzzer-no-link. It allows these forms of coverage instrumentation to work on Windows as well.
      It does not fix all issues, such as those with -fsanitize-coverage=stack-depth, which is not usable on Windows as of this patch.
      It also does not fix any libFuzzer integration tests. Nearly all of them fail to compile, fixing them will come in a later patch, so libFuzzer tests are disabled on Windows until them.
      
      Patch By: metzman
      
      Reviewers: morehouse, rnk
      
      Reviewed By: morehouse, rnk
      
      Subscribers: morehouse, kcc, eraman
      
      Differential Revision: https://reviews.llvm.org/D51022
      
      llvm-svn: 340860
      c6fff3b6
    • Matt Arsenault's avatar
      AMDGPU: Shrink insts to fold immediates · de6c421c
      Matt Arsenault authored
      This needs to be done in the SSA fold operands
      pass to be effective, so there is a bit of overlap
      with SIShrinkInstructions but I don't think this
      is practically avoidable.
      
      llvm-svn: 340859
      de6c421c
    • Thomas Lively's avatar
      [WebAssembly][NFC] Fix up SIMD bitwise tests · ec71e018
      Thomas Lively authored
      Summary:
      The updated tests were previously infallible because the SIMD bitwise
      operations do not contain vector types in their names.
      
      Reviewers: aheejin, dschuff
      
      Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D51369
      
      llvm-svn: 340858
      ec71e018
    • Thomas Lively's avatar
      [WebAssembly] v128.not · 995ad61f
      Thomas Lively authored
      Implementation and tests.
      
      llvm-svn: 340857
      995ad61f
    • Matt Arsenault's avatar
      AMDGPU: Move canShrink into TII · 35b1902b
      Matt Arsenault authored
      llvm-svn: 340855
      35b1902b
    • Nirav Dave's avatar
      [DAGCombine] Rework MERGE_VALUES to inline in single pass. NFCI. · 11e39fb6
      Nirav Dave authored
      Avoid hyperlinear cost of inlining MERGE_VALUE node by constructing
      temporary vector and doing a single replacement.
      
      llvm-svn: 340853
      11e39fb6
    • Nirav Dave's avatar
      [DAG] Avoid recomputing Divergence checks. NFCI. · 113f2b90
      Nirav Dave authored
      When making multiple updates to the same SDNode, recompute node
      divergence only once after all changes have been made.
      
      llvm-svn: 340852
      113f2b90
    • Nirav Dave's avatar
      [DAG] Fix updateDivergence calculation · 0b8cb46e
      Nirav Dave authored
      Check correct SDNode when deciding if we should update the divergence
      property.
      
      llvm-svn: 340851
      0b8cb46e
    • Matt Arsenault's avatar
      AMDGPU: Remove nan tests in class if src is nnan · 10de2775
      Matt Arsenault authored
      llvm-svn: 340850
      10de2775
    • Heejin Ahn's avatar
      [WebAssembly] Use getCalleeOpNo utility function (NFC) · 56e79dd0
      Heejin Ahn authored
      Reviewers: tlively
      
      Subscribers: dschuff, sbc100, jgravelle-google, sunfish, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D51366
      
      llvm-svn: 340848
      56e79dd0
    • Dean Michael Berris's avatar
      [XRay][docs] Chrome Trace Viewer Instructions · 41a21475
      Dean Michael Berris authored
      This patch adds an example on how to generate a Chrome Trace Viewer
      loadable trace from an XRay trace.
      
      llvm-svn: 340847
      41a21475
    • Sanjay Patel's avatar
      [InstCombine] fix baseline assertions · 60ffc2e9
      Sanjay Patel authored
      rL340842 contained the wrong version of the check lines.
      
      llvm-svn: 340846
      60ffc2e9
    • Craig Topper's avatar
      [X86] Mark the FUCOMI instructions as requiring CMOV to be enabled. NFCI · c73095e2
      Craig Topper authored
      These instructions were added on the PentiumPro along with CMOV.
      
      This was already comprehended by the lowering process which should emit an alternate sequence using FCOM and FNSTW. This just makes it an explicit error if that doesn't work for some reason.
      
      llvm-svn: 340844
      c73095e2
    • Dean Michael Berris's avatar
      [XRay][docs] Update instructions · f0a1088a
      Dean Michael Berris authored
      Add `xray_mode=xray-basic` to the list of options in the "further
      exploration" section of the doc.
      
      llvm-svn: 340843
      f0a1088a
    • Sanjay Patel's avatar
      [InstCombine] add tests for select narrowing (PR38691); NFC · c9756e5a
      Sanjay Patel authored
      llvm-svn: 340842
      c9756e5a
    • Stella Stamenova's avatar
      [lit, shtest-timeout] Always use an internal shell for the shtest-timeout to... · 8f8832d2
      Stella Stamenova authored
      [lit, shtest-timeout] Always use an internal shell for the shtest-timeout to diagnose buildbot failures
      
      Summary:
      Right now this test is failing on the builtbots on Windows but we have a very similar setup where the test passes. The test is meant to test that specifying a timeout works correctly by running an infnite loop and having it timeout - on the buildbot, the infinite loop doesn't actually execute. This change runs all of the tests in the set using an internal shell rather than an external shell. I expect this will make the test pass which means that either the way the external shell is invoked or the external shell setup on the buildbots is not correct. Regardless of whether the test passes with this change, we'll need to undo this change and have a real fix.
      
      @gkistanova was able to get logs from the buildbot to rule out a number of theories as to why this test is failing, but they didn't have enough information to confirm exactly what the issue is. The purpose of this change is to narrow it down, but if someone has a local repro and can aid in debugging, that would make it much speedier (and less prone to making the bots fail).
      
      Reviewers: gkistanova, asmith, zturner, modocache, rnk, delcypher
      
      Reviewed By: rnk
      
      Subscribers: delcypher, llvm-commits, gkistanova
      
      Differential Revision: https://reviews.llvm.org/D51326
      
      llvm-svn: 340840
      8f8832d2
    • Brian Cain's avatar
      [debuginfo] generate debug info with asm+.file · 3e0ca570
      Brian Cain authored
      Summary:
      For assembly input files, generate debug info even when the .file
      directive is present, provided it does not include a file-number
      argument.  Fixes PR38695.
      
      Reviewers: probinson, sidneym
      
      Subscribers: aprantl, hiraditya, JDevlieghere, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D51315
      
      llvm-svn: 340839
      3e0ca570
    • Simon Pilgrim's avatar
      [TableGen] CodeGenDAGPatterns::GenerateVariants - basic caching of matching predicates · 6a92b5e1
      Simon Pilgrim authored
      CodeGenDAGPatterns::GenerateVariants is a costly function in many tblgen commands (33.87% of the total runtime of x86 -gen-dag-isel), and due to the O(N^2) nature of the function, there are a high number of repeated comparisons of the pattern's vector<Predicate>.
      
      This initial patch at least avoids repeating these comparisons for every Variant in a pattern. I began investigating caching all the matches before entering the loop but hit issues with how best to store the data and how to update the cache as patterns were added.
      
      Saves around 15secs in debug builds of x86 -gen-dag-isel.
      
      Differential Revision: https://reviews.llvm.org/D51035
      
      llvm-svn: 340837
      6a92b5e1
    • Kirill Bobyrev's avatar
      [benchmark] Stop building benchmarks by default · a294dfa8
      Kirill Bobyrev authored
      Although the benchmark regex-related build issue seems to be
      fixed, it appears that benchmark library triggers some stage 2 clang-cl
      bugs:
      
      http://lab.llvm.org:8011/builders/clang-x86-windows-msvc2015/builds/13495/steps/build%20stage%202/logs/stdio
      
      The only sensible option now is to prevent benchmark library from
      building in the default configuration.
      
      llvm-svn: 340836
      a294dfa8
    • David Bolvansky's avatar
      [Inliner] Attribute callsites with inline remarks · c1b27b56
      David Bolvansky authored
      Summary:
      Sometimes reading an output *.ll file it is not easy to understand why some callsites are not inlined. We can read output of inline remarks (option --pass-remarks-missed=inline) and try correlating its messages with the callsites.
      
      An easier way proposed by this patch is to add to every callsite processed by Inliner an attribute with the latest message that describes the cause of not inlining this callsite. The attribute is called //inline-remark//. By default this feature is off. It can be switched on by the option //-inline-remark-attribute//.
      
      For example in the provided test the result method //@test1// has two callsites //@bar// and inline remarks report different inlining missed reasons:
        remark: <unknown>:0:0: bar not inlined into test1 because too costly to inline (cost=-5, threshold=-6)
        remark: <unknown>:0:0: bar not inlined into test1 because it should never be inlined (cost=never): recursive
      
      It is not clear which remark correspond to which callsite. With the inline remark attribute enabled we get the reasons attached to their callsites:
        define void @test1() {
          call void @bar(i1 true) #0
          call void @bar(i1 false) #2
          ret void
        }
        attributes #0 = { "inline-remark"="(cost=-5, threshold=-6)" }
        ..
        attributes #2 = { "inline-remark"="(cost=never): recursive" }
      
      Patch by: yrouban (Yevgeny Rouban)
      
      Reviewers: xbolva00, tejohnson, apilipenko
      
      Reviewed By: xbolva00, tejohnson
      
      Subscribers: eraman, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D50435
      
      llvm-svn: 340834
      c1b27b56
    • Craig Topper's avatar
      [X86] Fix copy paste mistake in vector-idiv-v2i32.ll. Add missing test case. · f1c11143
      Craig Topper authored
      Some of the test cases contained the same load twice instead of a different load.
      
      llvm-svn: 340833
      f1c11143
    • Ryan Taylor's avatar
      [AMDGPU] Add support for a16 modifiear for gfx9 · 1f334d00
      Ryan Taylor authored
      Summary:
      Adding support for a16 for gfx9. A16 bit replaces r128 bit for gfx9.
      
      Change-Id: Ie8b881e4e6d2f023fb5e0150420893513e5f4841
      
      Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, jfb, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D50575
      
      llvm-svn: 340831
      1f334d00
    • Andrea Di Biagio's avatar
      [llvm-mca] Initialize each element in vector TimelineView::UsedBuffers to a... · 4269d64b
      Andrea Di Biagio authored
      [llvm-mca] Initialize each element in vector TimelineView::UsedBuffers to a default invalid buffer descriptor. NFCI
      
      Also change the default buffer size for UsedBuffer entries to -1 (i.e. "unknown
      size"). No functional change intended.
      
      llvm-svn: 340830
      4269d64b
    • Kirill Bobyrev's avatar
      [benchmark] Fix buildbots failing to identify regex support · 3e331e0d
      Kirill Bobyrev authored
      This is cleanup after newly introduced google/benchmark library
      (rL340809). Many buildbots fail to identify regex engine support, so
      this should presumably fix the issue.
      
      llvm-svn: 340827
      3e331e0d
    • Pavel Labath's avatar
      Clarify comment in the string-offsets-table-order.ll test · 10d87abd
      Pavel Labath authored
      llvm-svn: 340826
      10d87abd
    • Andrea Di Biagio's avatar
      [llvm-mca][TimelineView] Force the same number of executions for every entry... · d17d371c
      Andrea Di Biagio authored
      [llvm-mca][TimelineView] Force the same number of executions for every entry in the 'wait-times' table.
      
      This patch also uses colors to highlight problematic wait-time entries.
      A problematic entry is an entry with an high wait time that tends to match (or
      exceed) the size of the scheduler's buffer.
      
      Color RED is used if an instruction had to wait an average number of cycles
      which is bigger than (or equal to) the size of the underlying scheduler's
      buffer.
      Color YELLOW is used if the time (in cycles) spend waiting for the
      operands or pipeline resources is bigger than half the size of the underlying
      scheduler's buffer.
      Color MAGENTA is used if an instruction does not consume buffer resources
      according to the scheduling model.
      
      llvm-svn: 340825
      d17d371c
    • Kristof Umann's avatar
      [ADT] ImmutableList no longer requires elements to be copy constructible · d0202395
      Kristof Umann authored
      ImmutableList used to require elements to have a copy constructor for no
      good reason, this patch aims to fix this.
      It also required but did not enforce its elements to be trivially
      destructible, so a new static_assert is added to guard against misuse.
      
      Differential Revision: https://reviews.llvm.org/D49985
      
      llvm-svn: 340824
      d0202395
    • Andrea Di Biagio's avatar
      [llvm-mca] Pass an instruction reference when notifying event listeners about... · 29c5d5aa
      Andrea Di Biagio authored
      [llvm-mca] Pass an instruction reference when notifying event listeners about reserved/released buffer resources. NFC
      
      llvm-svn: 340821
      29c5d5aa
    • Mikael Holmen's avatar
      [CloneFunction] Constant fold terminators before checking single predecessor · 4d652c4c
      Mikael Holmen authored
      Summary:
      This fixes PR31105.
      
      There is code trying to delete dead code that does so by e.g. checking if
      the single predecessor of a block is the block itself.
      
      That check fails on a block like this
       bb:
         br i1 undef, label %bb, label %bb
      since that has two (identical) predecessors.
      
      However, after the check for dead blocks there is a call to
      ConstantFoldTerminator on the basic block, and that call simplifies the
      block to
       bb:
         br label %bb
      
      Therefore we now do the call to ConstantFoldTerminator before the check if
      the block is dead, so it can realize that it really is.
      
      The original behavior lead to the block not being removed, but it was
      simplified as above, and then we did a call to
          Dest->replaceAllUsesWith(&*I);
      with old and new being equal, and an assertion triggered.
      
      Reviewers: chandlerc, fhahn
      
      Reviewed By: fhahn
      
      Subscribers: eraman, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D51280
      
      llvm-svn: 340820
      4d652c4c
    • Simon Pilgrim's avatar
      [TableGen] Use std::move where possible in InstructionMemo constructor. NFCI. · b7e10182
      Simon Pilgrim authored
      Requested in post-commit review for rL339670
      
      llvm-svn: 340819
      b7e10182
Loading