Skip to content
  1. Aug 06, 2016
    • Benjamin Kramer's avatar
      [LoadCombine] Simplify code with a brace init. NFC. · a3d4def8
      Benjamin Kramer authored
      llvm-svn: 277921
      a3d4def8
    • Simon Pilgrim's avatar
      [X86][SSE] Add 2 input shuffle support to matchBinaryVectorShuffle · f56309f1
      Simon Pilgrim authored
      Not actually used yet...
      
      llvm-svn: 277919
      f56309f1
    • Benjamin Kramer's avatar
      Move helpers into anonymous namespaces. NFC. · b7d3311c
      Benjamin Kramer authored
      llvm-svn: 277916
      b7d3311c
    • David Majnemer's avatar
      [CodeGen] Fix a -Wdocumentation warning · 70c93fa6
      David Majnemer authored
      A parameter was documented with the wrong name.
      No functionality change is intended.
      
      llvm-svn: 277915
      70c93fa6
    • David Majnemer's avatar
      [ValueTracking] Teach computeKnownBits about [su]min/max · a19d0f2f
      David Majnemer authored
      Reasoning about a select in terms of a min or max allows us to derive a
      tigher bound on the result.
      
      llvm-svn: 277914
      a19d0f2f
    • David Majnemer's avatar
      [CallGraphSCCPass] Use an ArrayRef instead of a pair of iterators · 1665d863
      David Majnemer authored
      No functional change is intended.
      
      llvm-svn: 277913
      1665d863
    • Chandler Carruth's avatar
      [ADT] Make the triple test 1000x faster through more focused test cases. · 9dd96d73
      Chandler Carruth authored
      The current approach isn't a long-term viable pattern. Given the set of
      architectures A, vendors V, operating systems O, and environments E, it
      does |A| * |V| * |O| * |E| * 4! tests. As LLVM grows, this test keeps
      getting slower, despite my working very hard to make it get some
      "optimizations" even in -O0 builds in order to lower the constant
      factors. Fundamentally, we're doing an unreasonable amount of work.i
      
      Looking at the specific thing being tested -- the goal seems very
      clearly to be testing the *permutations*, not the *combinations*. The
      combinations are driving up the complexity much more than anything else.
      
      Instead, test every possible value for a given triple entry in every
      permutation of *some* triple. This really seems to cover the core goal
      of the test. Every single possible triple component is tested in every
      position. But because we keep the rest of the triple constant, it does
      so in a dramatically more scalable amount of time. With this model we do
      (|A| + |V| + |O| + |E|) * 4! tests.
      
      For me on a debug build, this goes from running for 19 seconds to 19
      milliseconds, or a 1000x improvement. This makes a world of difference
      for the critical path of 'ninja check-llvm' and other extremely common
      workflows.
      
      Thanks to Renato, Dean, and David for the helpful review comments and
      helping me refine the explanation of the change.
      
      Differential Revision: https://reviews.llvm.org/D23156
      
      llvm-svn: 277912
      9dd96d73
    • Sanjoy Das's avatar
      [InstCombine] Don't coerce non-integral pointers to integers · ba04d3a6
      Sanjoy Das authored
      Reviewers: majnemer
      
      Subscribers: mcrosier, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D23231
      
      llvm-svn: 277910
      ba04d3a6
    • Matthias Braun's avatar
      Revert "(refs/bisect/bad) GVN-hoist: enable by default" · 9a0035d8
      Matthias Braun authored
      GVN-Hoist appears to miscompile llvm-testsuite
      SingleSource/Benchmarks/Misc/fbench.c at the moment.
      
      I filed http://llvm.org/PR28880
      
      This reverts commit r277786.
      
      llvm-svn: 277909
      9a0035d8
    • Gor Nishanov's avatar
      Part 4c: Coroutine Devirtualization: Devirtualize coro.resume and coro.destroy. · 31d8c9af
      Gor Nishanov authored
      Summary:
      This is the 4c patch of the coroutine series. CoroElide pass now checks if PostSplit coro.begin
      is referenced by coro.subfn.addr intrinsics. If so replace coro.subfn.addrs with an appropriate coroutine
      subfunction associated with that coro.begin.
      
      Documentation and overview is here: http://llvm.org/docs/Coroutines.html.
      
      Upstreaming sequence (rough plan)
      1.Add documentation. (https://reviews.llvm.org/D22603)
      2.Add coroutine intrinsics. (https://reviews.llvm.org/D22659)
      3.Add empty coroutine passes. (https://reviews.llvm.org/D22847)
      4.Add coroutine devirtualization + tests.
      ab) Lower coro.resume and coro.destroy (https://reviews.llvm.org/D22998)
      c) Do devirtualization <= we are here
      5.Add CGSCC restart trigger + tests.
      6.Add coroutine heap elision + tests.
      7.Add the rest of the logic (split into more patches)
      
      Reviewers: majnemer
      
      Subscribers: mehdi_amini, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D23229
      
      llvm-svn: 277908
      31d8c9af
    • Nico Weber's avatar
      Revert r277896. · c893e603
      Nico Weber authored
      It breaks ExecutionEngine/OrcLazy/weak-function.ll on most bots.
      
      Script:
      --
      ...
      --
      Exit Code: 1
      
      Command Output (stderr):
      --
      Could not find main function.
      
      llvm-svn: 277907
      c893e603
    • Kyle Butt's avatar
      CodeGen: If Convert blocks that would form a diamond when tail-merged. · 71cb44d9
      Kyle Butt authored
      The following function currently relies on tail-merging for if
      conversion to succeed. The common tail of cond_true and cond_false is
      extracted, and this then forms a diamond pattern that can be
      successfully if converted.
      
      If this block does not get extracted, either because tail-merging is
      disabled or the threshold is higher, we should still recognize this
      pattern and if-convert it.
      define i32 @t2(i32 %a, i32 %b) nounwind {
      entry:
      	%tmp1434 = icmp eq i32 %a, %b		; <i1> [#uses=1]
      	br i1 %tmp1434, label %bb17, label %bb.outer
      
      bb.outer:		; preds = %cond_false, %entry
      	%b_addr.021.0.ph = phi i32 [ %b, %entry ], [ %tmp10, %cond_false ]
      	%a_addr.026.0.ph = phi i32 [ %a, %entry ], [ %a_addr.026.0, %cond_false ]
      	br label %bb
      
      bb:		; preds = %cond_true, %bb.outer
      	%indvar = phi i32 [ 0, %bb.outer ], [ %indvar.next, %cond_true ]
      	%tmp. = sub i32 0, %b_addr.021.0.ph
      	%tmp.40 = mul i32 %indvar, %tmp.
      	%a_addr.026.0 = add i32 %tmp.40, %a_addr.026.0.ph
      	%tmp3 = icmp sgt i32 %a_addr.026.0, %b_addr.021.0.ph
      	br i1 %tmp3, label %cond_true, label %cond_false
      
      cond_true:		; preds = %bb
      	%tmp7 = sub i32 %a_addr.026.0, %b_addr.021.0.ph
      	%tmp1437 = icmp eq i32 %tmp7, %b_addr.021.0.ph
      	%indvar.next = add i32 %indvar, 1
      	br i1 %tmp1437, label %bb17, label %bb
      
      cond_false:		; preds = %bb
      	%tmp10 = sub i32 %b_addr.021.0.ph, %a_addr.026.0
      	%tmp14 = icmp eq i32 %a_addr.026.0, %tmp10
      	br i1 %tmp14, label %bb17, label %bb.outer
      
      bb17:		; preds = %cond_false, %cond_true, %entry
      	%a_addr.026.1 = phi i32 [ %a, %entry ], [ %tmp7, %cond_true ], [ %a_addr.026.0, %cond_false ]
      	ret i32 %a_addr.026.1
      }
      
      Without tail-merging or diamond-tail if conversion:
      LBB1_1:                                 @ %bb
                                              @ =>This Inner Loop Header: Depth=1
              cmp     r0, r1
              ble     LBB1_3
      @ BB#2:                                 @ %cond_true
                                              @   in Loop: Header=BB1_1 Depth=1
              subs    r0, r0, r1
              cmp     r1, r0
              it      ne
              cmpne   r0, r1
              bgt     LBB1_4
      LBB1_3:                                 @ %cond_false
                                              @   in Loop: Header=BB1_1 Depth=1
              subs    r1, r1, r0
              cmp     r1, r0
              bne     LBB1_1
      LBB1_4:                                 @ %bb17
              bx      lr
      
      With diamond-tail if conversion, but without tail-merging:
      @ BB#0:                                 @ %entry
              cmp     r0, r1
              it      eq
              bxeq    lr
      LBB1_1:                                 @ %bb
                                              @ =>This Inner Loop Header: Depth=1
              cmp     r0, r1
              ite     le
              suble   r1, r1, r0
              subgt   r0, r0, r1
              cmp     r1, r0
              bne     LBB1_1
      @ BB#2:                                 @ %bb17
              bx      lr
      
      llvm-svn: 277905
      71cb44d9
    • Kyle Butt's avatar
      IfConverter: Split ScanInstructions into 2 functions. · 54bf3cef
      Kyle Butt authored
      ScanInstructions is now 2 functions:
      AnalyzeBranches and ScanInstructions. ScanInstructions also now takes a
      pair of arguments delimiting the instructions to be scanned. This will
      be used for forked diamond support to re-scan only a portion of the
      block.
      
      llvm-svn: 277904
      54bf3cef
    • Kyle Butt's avatar
      IfConversion: Document countDuplicatedInstructions. NFC · 4f0e2879
      Kyle Butt authored
      llvm-svn: 277903
      4f0e2879
    • Kyle Butt's avatar
      IfConversion: factor out 2 functions to skip debug instrs. NFC · fe916828
      Kyle Butt authored
      Skipping debug instructions occurrs repeatedly, factor it out.
      
      llvm-svn: 277902
      fe916828
    • Michael Zolotukhin's avatar
      Revert "[LoopSimplify] Fix updating LCSSA after separating nested loops." · 09cf304e
      Michael Zolotukhin authored
      This reverts commit r277877.
      Try to appease clang-x64-ninja-win7 buildbot.
      
      llvm-svn: 277901
      09cf304e
    • Lang Hames's avatar
      [ORC] Add (partial) weak symbol support to the CompileOnDemand layer. · 62a45960
      Lang Hames authored
      This adds partial support for weak functions to the CompileOnDemandLayer by
      modifying the addLogicalModule method to check for existing stub definitions
      before building a new stub for a weak function. This scheme is sufficient to
      support ODR definitions, but fails for general weak definitions if strong
      definition is encountered after the first weak definition. (A more extensive
      refactor will be required to fully support weak symbols).
      
      This patch does *not* add weak symbol support to RuntimeDyld: I hope to add
      that in the near future.
      
      llvm-svn: 277896
      62a45960
    • Zachary Turner's avatar
      Fix a -Wunused-const-variable due to a bug in clang. · 83816cea
      Zachary Turner authored
      llvm-svn: 277893
      83816cea
    • Sanjoy Das's avatar
      [IRCE] Remove unused headers; NFC · b8c2ebea
      Sanjoy Das authored
      llvm-svn: 277892
      b8c2ebea
    • Sanjoy Das's avatar
      [IRCE] Preserve loop-simplify form · cf181867
      Sanjoy Das authored
      Fixes PR28764.  Right now there is no way to test this, but (as
      mentioned on the PR) with Michael Zolotukhin's yet to be checked in
      LoopSimplify verfier, 8 of the llvm-lit tests for IRCE crash.
      
      llvm-svn: 277891
      cf181867
    • Zachary Turner's avatar
      Resubmit "Make YAML support SmallVector" · 9e91c28b
      Zachary Turner authored
      This resubmits a3770391c5fb64108d565e12f61dd77ce71b5b4f,
      which was reverted due to breakages on non-Windows machines.
      
      Due to differences in template instantiation rules on Microsoft
      and non-Microsoft platforms, a member access restriction was
      triggering on non-Microsoft compilers.  Previously, a friend
      declaration for std::vector<> had been introduced into the
      DebugMap class to make the member access restriction pass,
      but the introduction of support for SmallVector<> meant that
      an additional friend declaration would need to be added.
      
      This didn't really make a lot of sense since the user of the
      macro is probably only using one type (SmallVector<>, vector<>,
      etc) and we could in theory add support for even more types
      to this macro in the future (e.g. std::deque), so rather than
      add another friend declaration, I just made the type being
      referenced a public nested typedef instead of a private nested
      typedef.
      
      llvm-svn: 277888
      9e91c28b
    • Sanjay Patel's avatar
      [InstCombine] refactor ctlz/cttz folds (NFCI) · 8e3ab17c
      Sanjay Patel authored
      Note that this fold really belongs in InstSimplify.
      Refactoring here anyway as an intermediate step because
      there's a planned addition to this function in D23134.
      
      Differential Revision: https://reviews.llvm.org/D23223
      
      llvm-svn: 277883
      8e3ab17c
    • Justin Bogner's avatar
      Revert "Make YAML support SmallVector" · 1219a60e
      Justin Bogner authored
      This breaks building dsymutil, causing my local build and many bots to
      fail.
      
      This reverts r277870.
      
      llvm-svn: 277881
      1219a60e
    • Daniel Berlin's avatar
      [MSSA] Use depth first iterator instead of custom version. · 7ac3d740
      Daniel Berlin authored
      Summary:
      Originally the plan was to use the custom worklist to do some block popping,
      and because we don't actually need a visited set. The custom one we have
      here is slightly broken, and it's not worth fixing vs using depth_first_iterator since we aren't going to go the route we originally
      were.
      
      Fixes PR28874
      Reviewers: george.burgess.iv
      
      Subscribers: llvm-commits, gberry
      
      Differential Revision: https://reviews.llvm.org/D23187
      
      llvm-svn: 277880
      7ac3d740
  2. Aug 05, 2016
Loading