Skip to content
  1. Dec 17, 2018
  2. Dec 16, 2018
  3. Dec 15, 2018
    • Simon Pilgrim's avatar
      [X86] Begin cleaning up combineOr -> SHLD/SHRD. NFCI. · 52c98240
      Simon Pilgrim authored
      In preparation for converting to funnel shifts.
      
      llvm-svn: 349286
      52c98240
    • Simon Pilgrim's avatar
      [X86] Lower to SHLD/SHRD on slow machines for optsize · ef7b5949
      Simon Pilgrim authored
      Use consistent rules for when to lower to SHLD/SHRD for slow machines - fixes a weird issue where funnel shift gets expanded but then X86ISelLowering's combineOr sees the optsize and combines to SHLD/SHRD, but now with the modulo amount guard......
      
      llvm-svn: 349285
      ef7b5949
    • Simon Pilgrim's avatar
      Fix -Wunused-variable warning. NFCI. · 9831d405
      Simon Pilgrim authored
      llvm-svn: 349265
      9831d405
    • Florian Hahn's avatar
      [SILoadStoreOptimizer] Use std::abs to avoid truncation. · abe32c91
      Florian Hahn authored
      Using regular abs() causes the following warning
      
      error: absolute value function 'abs' given an argument of type 'int64_t' (aka 'long') but has parameter of type 'int' which may cause truncation of value [-Werror,-Wabsolute-value]
              (uint32_t)abs(Dist) > MaxDist) {
                        ^
      lib/Target/AMDGPU/SILoadStoreOptimizer.cpp:1369:19: note: use function 'std::abs' instead
      
      which causes a bot to fail:
      http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux/builds/18284/steps/bootstrap%20clang/logs/stdio
      
      llvm-svn: 349224
      abe32c91
    • Craig Topper's avatar
      [X86] Rename hasNoSignedComparisonUses to hasNoSignFlagUses. Add the... · 1fc257d9
      Craig Topper authored
      [X86] Rename hasNoSignedComparisonUses to hasNoSignFlagUses. Add the instruction that only modify the O flag to the waiver list.
      
      The only caller of this turns CMP with 0 into TEST. CMP with 0 and TEST both set OF to 0 so we should have no issues with instructions that only use OF.
      
      Though I don't think there's any reason we would read just OF after a compare with 0 anyway. So this probably isn't an observable change.
      
      llvm-svn: 349223
      1fc257d9
    • Craig Topper's avatar
      [X86] Make hasNoCarryFlagUses/hasNoSignedComparisonUses take an SDValue that... · 5c304eac
      Craig Topper authored
      [X86] Make hasNoCarryFlagUses/hasNoSignedComparisonUses take an SDValue that indicates which result is the flag result. NFCI
      
      hasNoCarryFlagUses hardcoded that the flag result is 1 and used that to filter which uses were of interest. hasNoSignedComparisonUses just assumes the only result is flags and checks whether any user of the node is a CopyToReg instruction.
      
      After this patch we now do a result number check in both and rely on the caller to provide the result number.
      
      This shouldn't change behavior it was just an odd difference between the two functions that I noticed.
      
      llvm-svn: 349222
      5c304eac
    • Artem Belevich's avatar
      [NVPTX] Lower instructions that expand into libcalls. · 6d74bd63
      Artem Belevich authored
      The change is an effort to split and refactor abandoned
      D34708 into smaller parts.
      
      Here the behaviour of unsupported instructions is changed
      to match the behaviour of explicit intrinsics calls.
      Currently LLVM crashes with:
      > Assertion getInstruction() && "Not a call or invoke instruction!" failed.
      
      With this patch LLVM produces a more sensible error message:
      > Cannot select: ... i32 = ExternalSymbol'__foobar'
      
      Author: Denys Zariaiev <denys.zariaiev@gmail.com>
      
      Differential Revision: https://reviews.llvm.org/D55145
      
      llvm-svn: 349213
      6d74bd63
  4. Dec 14, 2018
    • Krzysztof Parzyszek's avatar
      [Hexagon] Add patterns for shifts of v2i16 · 26d994f5
      Krzysztof Parzyszek authored
      This fixes https://llvm.org/PR39983.
      
      llvm-svn: 349202
      26d994f5
    • Krzysztof Parzyszek's avatar
      c0fc0a97
    • Farhana Aleen's avatar
      [AMDGPU] Promote constant offset to the immediate by finding a new base with... · ce095c56
      Farhana Aleen authored
      [AMDGPU] Promote constant offset to the immediate by finding a new base with 13bit constant offset from the nearby instructions.
      
      Summary: Promote constant offset to immediate by recomputing the relative 13bit offset from nearby instructions.
       E.g.
        s_movk_i32 s0, 0x1800
        v_add_co_u32_e32 v0, vcc, s0, v2
        v_addc_co_u32_e32 v1, vcc, 0, v6, vcc
      
        s_movk_i32 s0, 0x1000
        v_add_co_u32_e32 v5, vcc, s0, v2
        v_addc_co_u32_e32 v6, vcc, 0, v6, vcc
        global_load_dwordx2 v[5:6], v[5:6], off
        global_load_dwordx2 v[0:1], v[0:1], off
        =>
        s_movk_i32 s0, 0x1000
        v_add_co_u32_e32 v5, vcc, s0, v2
        v_addc_co_u32_e32 v6, vcc, 0, v6, vcc
        global_load_dwordx2 v[5:6], v[5:6], off
        global_load_dwordx2 v[0:1], v[5:6], off offset:2048
      
      Author: FarhanaAleen
      
      Reviewed By: arsenm, rampitec
      
      Subscribers: llvm-commits, AMDGPU
      
      Differential Revision: https://reviews.llvm.org/D55539
      
      llvm-svn: 349196
      ce095c56
    • Evandro Menezes's avatar
      [AArch64] Simplify the scheduling predicates (NFC) · ea9d9008
      Evandro Menezes authored
      The instruction encodings make it unnecessary to distinguish extended W-form
      from X-form instructions.
      
      llvm-svn: 349185
      ea9d9008
    • Ehsan Amiri's avatar
      NFC. Adding an empty line to test the updated commit credentials. · de1742c2
      Ehsan Amiri authored
      llvm-svn: 349158
      de1742c2
    • Diana Picus's avatar
      [ARM GlobalISel] Thumb2: casts between int and ptr · 02c8343c
      Diana Picus authored
      Mark as legal and add tests. Nothing special to do.
      
      llvm-svn: 349147
      02c8343c
    • Diana Picus's avatar
      [ARM GlobalISel] Minor refactoring. NFCI · 813af0d2
      Diana Picus authored
      Refactor the ARMInstructionSelector to cache some opcodes in the
      constructor instead of checking all the time if we're in ARM or Thumb
      mode.
      
      llvm-svn: 349143
      813af0d2
    • Diana Picus's avatar
      [ARM GlobalISel] Allow simple binary ops in Thumb2 · 14dc3b29
      Diana Picus authored
      Mark G_ADD, G_SUB, G_MUL, G_AND, G_OR and G_XOR as legal for both ARM
      and Thumb2.
      
      Extract the legalizer tests for these opcodes into another file.
      
      Add tests for the instruction selector.
      
      llvm-svn: 349142
      14dc3b29
    • Craig Topper's avatar
      [DAGCombiner][X86] Prevent visitSIGN_EXTEND from returning N when (sext... · 257ce387
      Craig Topper authored
      [DAGCombiner][X86] Prevent visitSIGN_EXTEND from returning N when (sext (setcc)) already has the target desired type for the setcc
      
      Summary:
      If the setcc already has the target desired type we can reach the getSetCC/getSExtOrTrunc after the MatchingVecType check with the exact same types as the nodes we started with. This causes those causes VsetCC to be CSEd to N0 and the getSExtOrTrunc will CSE to N. When we return N, the caller will think that meant we called CombineTo and did our own worklist management. But that's not what happened. This prevents target hooks from being called for the node.
      
      To fix this, I've now returned SDValue if the setcc is already the desired type. But to avoid some regressions in X86 I've had to disable one of the target combines that wasn't being reached before in the case of a (sext (setcc)). If we get vector widening legalization enabled that entire function will be deleted anyway so hopefully this is only for the short term.
      
      Reviewers: RKSimon, spatel
      
      Subscribers: llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D55459
      
      llvm-svn: 349137
      257ce387
    • Craig Topper's avatar
      [X86] Demote EmitTest to a helper function of EmitCmp. Route all callers... · 178abc59
      Craig Topper authored
      [X86] Demote EmitTest to a helper function of EmitCmp. Route all callers except EmitCmp through EmitCmp.
      
      This requires the two callers to manifest a 0 to make EmitCmp call EmitTest.
      
      I'm looking into changing how we combine TEST and flag setting instructions to not be part of lowering. And instead be part of DAG combine or isel. Which will mean EmitTest will probably become gutted and maybe disappear entirely.
      
      llvm-svn: 349094
      178abc59
    • Evandro Menezes's avatar
      [AArch64] Fix Exynos predicates (NFC) · 6fe51ac9
      Evandro Menezes authored
      Fix the logic in the definition of the `ExynosShiftExPred` as a more
      specific version of `ExynosShiftPred`.  But, since `ExynosShiftExPred` is
      not used yet, this change has NFC.
      
      llvm-svn: 349091
      6fe51ac9
  5. Dec 13, 2018
Loading