Skip to content
  1. Aug 23, 2016
    • Pete Cooper's avatar
      Fix some more asserts after r279466. · 036b94da
      Pete Cooper authored
      That commit added a new version of Intrinsic::getName which should only
      be called when the intrinsic has no overloaded types.  There are several
      debugging paths, such as SDNode::dump which are printing the name of the
      intrinsic but don't have the overloaded types.  These paths should be ok
      to just print the name instead of crashing.
      
      The fix here is ultimately to just add a 'None' second argument as that
      calls the overload capable getName, which is less efficient, but this is a
      debugging path anyway, and not perf critical.
      
      Thanks to Björn Pettersson for pointing out that there were more crashes.
      
      llvm-svn: 279528
      036b94da
  2. Aug 20, 2016
  3. Aug 19, 2016
  4. Aug 17, 2016
  5. Aug 16, 2016
  6. Aug 15, 2016
  7. Aug 12, 2016
    • Duncan P. N. Exon Smith's avatar
      ADT: Remove all ilist_iterator => pointer casts, NFC · f197b1f7
      Duncan P. N. Exon Smith authored
      Remove all ilist_iterator to pointer casts.  There were two reasons for
      casts:
      
        - Checking for an uninitialized (i.e., null) iterator.  I added
          MachineInstrBundleIterator::isValid() to check for that case.
      
        - Comparing an iterator against the underlying pointer value while
          avoiding converting the pointer value to an iterator.  This is
          occasionally necessary in MachineInstrBundleIterator, since there is
          an assertion in the constructors that the underlying MachineInstr is
          not bundled (but we don't care about that if we're just checking for
          pointer equality).
      
      To support the latter case, I rewrote the == and != operators for
      ilist_iterator and MachineInstrBundleIterator.
      
        - The implicit constructors now use enable_if to exclude
          const-iterator => non-const-iterator conversions from overload
          resolution (previously it was a compiler error on instantiation, now
          it's SFINAE).
      
        - The == and != operators are now global (friends), and are not
          templated.
      
        - MachineInstrBundleIterator has overloads to compare against both
          const_pointer and const_reference.  This avoids the implicit
          conversions to MachineInstrBundleIterator that assert, instead just
          checking the address (and I added unit tests to confirm this).
      
      Notably, the only remaining uses of ilist_iterator::getNodePtrUnchecked
      are in ilist.h, and no code outside of ilist*.h directly relies on this
      UB end-iterator-to-pointer conversion anymore.  It's still needed for
      ilist_*sentinel_traits, but I'll clean that up soon.
      
      llvm-svn: 278478
      f197b1f7
    • David Majnemer's avatar
      Use the range variant of find instead of unpacking begin/end · 0d955d0b
      David Majnemer authored
      If the result of the find is only used to compare against end(), just
      use is_contained instead.
      
      No functionality change is intended.
      
      llvm-svn: 278433
      0d955d0b
  8. Aug 11, 2016
  9. Aug 10, 2016
  10. Aug 09, 2016
    • Simon Pilgrim's avatar
      [DAGCombiner] Better support for shifting large value type by constants · 76964e31
      Simon Pilgrim authored
      As detailed on D22726, much of the shift combining code assume constant values will fit into a uint64_t value and calls ConstantSDNode::getZExtValue where it probably shouldn't (leading to asserts). Using APInt directly avoids this problem but we encounter other assertions if we attempt to compare/operate on 2 APInt of different bitwidths.
      
      This patch adds a helper function to ensure that 2 APInt values are zero extended as required so that they can be safely used together. I've only added an initial example use for this to the '(SHIFT (SHIFT x, c1), c2) --> (SHIFT x, (ADD c1, c2))' combines. Further cases can easily be added as required.
      
      Differential Revision: https://reviews.llvm.org/D23007
      
      llvm-svn: 278141
      76964e31
  11. Aug 08, 2016
  12. Aug 04, 2016
    • Nikolai Bozhenov's avatar
      [X86] Heuristic to selectively build Newton-Raphson SQRT estimation · f679530b
      Nikolai Bozhenov authored
      On modern Intel processors hardware SQRT in many cases is faster than RSQRT
      followed by Newton-Raphson refinement. The patch introduces a simple heuristic
      to choose between hardware SQRT instruction and Newton-Raphson software
      estimation.
      
      The patch treats scalars and vectors differently. The heuristic is that for
      scalars the compiler should optimize for latency while for vectors it should
      optimize for throughput. It is based on the assumption that throughput bound
      code is likely to be vectorized.
      
      Basically, the patch disables scalar NR for big cores and disables NR completely
      for Skylake. Firstly, scalar SQRT has shorter latency than NR code in big cores.
      Secondly, vector SQRT has been greatly improved in Skylake and has better
      throughput compared to NR.
      
      Differential Revision: https://reviews.llvm.org/D21379
      
      llvm-svn: 277725
      f679530b
    • Diana Picus's avatar
      Typo fix in comment. NFC · ddddbc24
      Diana Picus authored
      llvm-svn: 277704
      ddddbc24
  13. Aug 03, 2016
    • Elliot Colp's avatar
      Disable shrinking of SNaN constants · 82b1468a
      Elliot Colp authored
      When expanding FP constants, we attempt to shrink doubles to floats and perform an extending load.
      However, on SystemZ, and possibly on other targets (I've only confirmed the problem on SystemZ), the FP extending load instruction may convert SNaN into QNaN, or may cause an exception. So in the general case, we would still like to shrink FP constants, but SNaNs should be left as doubles.
      
      Differential Revision: https://reviews.llvm.org/D22685
      
      llvm-svn: 277602
      82b1468a
  14. Aug 01, 2016
    • Michael Kuperstein's avatar
      [DAGCombine] Make sext(setcc) combine respect getBooleanContents · c97da7f3
      Michael Kuperstein authored
      We used to combine "sext(setcc x, y, cc) -> (select (setcc x, y, cc), -1, 0)"
      Instead, we should combine to (select (setcc x, y, cc), T, 0) where the value
      of T is 1 or -1, depending on the type of the setcc, and getBooleanContents()
      for the type if it is not i1.
      
      This fixes PR28504.
      
      llvm-svn: 277371
      c97da7f3
  15. Jul 30, 2016
  16. Jul 29, 2016
  17. Jul 28, 2016
  18. Jul 27, 2016
  19. Jul 26, 2016
  20. Jul 22, 2016
  21. Jul 19, 2016
    • Elena Demikhovsky's avatar
      AVX-512: Fixed BT instruction selection. · 2c0780b8
      Elena Demikhovsky authored
      The following condition expression ( a >> n) & 1 is converted to "bt a, n" instruction. It works on all intel targets.
      But on AVX-512 it was broken because the expression is modified to (truncate (a >>n) to i1).
      
      I added the new sequence (truncate (a >>n) to i1) to the BT pattern.
      
      Differential Revision: https://reviews.llvm.org/D22354
      
      llvm-svn: 275950
      2c0780b8
  22. Jul 18, 2016
    • Chih-Hung Hsieh's avatar
      [X86] Accept SELECT op code for x86-64 fp128 type · 4d9f2c15
      Chih-Hung Hsieh authored
      DAGTypeLegalizer::CanSkipSoftenFloatOperand should allow
      SELECT op code for x86_64 fp128 type for MME targets,
      so SoftenFloatOperand does not abort on SELECT op code.
      
      Differential Revision: http://reviews.llvm.org/D21758
      
      llvm-svn: 275818
      4d9f2c15
    • Simon Dardis's avatar
      [inlineasm] Propagate operand constraints to the backend · d32a2d30
      Simon Dardis authored
      When SelectionDAGISel transforms a node representing an inline asm
      block, memory constraint information is not preserved. This can cause
      constraints to be broken when a memory offset is of the form:
      
      offset + frame index
      
      when the frame is resolved.
      
      By propagating the constraints all the way to the backend, targets can
      enforce memory operands of inline assembly to conform to their constraints.
      
      For MIPSR6, some instructions had their offsets reduced to 9 bits from
      16 bits such as ll/sc. This becomes problematic when using inline assembly
      to perform atomic operations, as an offset can generated that is too big to
      encode in the instruction.
      
      Reviewers: dsanders, vkalintris
      
      Differential Review: https://reviews.llvm.org/D21615
      
      llvm-svn: 275786
      d32a2d30
  23. Jul 15, 2016
  24. Jul 14, 2016
  25. Jul 13, 2016
  26. Jul 11, 2016
Loading