Skip to content
  1. May 02, 2017
  2. May 01, 2017
  3. Apr 28, 2017
  4. Apr 24, 2017
  5. Apr 21, 2017
    • Hans Wennborg's avatar
      Re-commit r301040 "X86: Don't emit zero-byte functions on Windows" · 9b9a5358
      Hans Wennborg authored
      In addition to the original commit, tighten the condition for when to
      pad empty functions to COFF Windows.  This avoids running into problems
      when targeting e.g. Win32 AMDGPU, which caused test failures when this
      was committed initially.
      
      llvm-svn: 301047
      9b9a5358
    • Hans Wennborg's avatar
      Revert r301040 "X86: Don't emit zero-byte functions on Windows" · 04593000
      Hans Wennborg authored
      This broke almost all bots. Reverting while fixing.
      
      llvm-svn: 301041
      04593000
    • Hans Wennborg's avatar
      X86: Don't emit zero-byte functions on Windows · cb3e8107
      Hans Wennborg authored
      Empty functions can lead to duplicate entries in the Guard CF Function
      Table of a binary due to multiple functions sharing the same RVA,
      causing the kernel to refuse to load that binary.
      
      We had a terrific bug due to this in Chromium.
      
      It turns out we were already doing this for Mach-O in certain
      situations. This patch expands the code for that in
      AsmPrinter::EmitFunctionBody() and renames
      TargetInstrInfo::getNoopForMachoTarget() to simply getNoop() since it
      seems it was used for not just Mach-O anyway.
      
      Differential Revision: https://reviews.llvm.org/D32330
      
      llvm-svn: 301040
      cb3e8107
  6. Apr 20, 2017
  7. Apr 17, 2017
  8. Apr 12, 2017
    • Jonas Paulsson's avatar
      [SystemZ] TargetTransformInfo cost functions implemented. · fccc7d66
      Jonas Paulsson authored
      getArithmeticInstrCost(), getShuffleCost(), getCastInstrCost(),
      getCmpSelInstrCost(), getVectorInstrCost(), getMemoryOpCost(),
      getInterleavedMemoryOpCost() implemented.
      
      Interleaved access vectorization enabled.
      
      BasicTTIImpl::getCastInstrCost() improved to check for legal extending loads,
      in which case the cost of the z/sext instruction becomes 0.
      
      Review: Ulrich Weigand, Renato Golin.
      https://reviews.llvm.org/D29631
      
      llvm-svn: 300052
      fccc7d66
  9. Apr 11, 2017
  10. Apr 07, 2017
  11. Apr 05, 2017
    • Sanjay Patel's avatar
      [DAGCombiner] add and use TLI hook to convert and-of-seteq / or-of-setne to... · b2f1621b
      Sanjay Patel authored
      [DAGCombiner] add and use TLI hook to convert and-of-seteq / or-of-setne to bitwise logic+setcc (PR32401)
      
      This is a generic combine enabled via target hook to reduce icmp logic as discussed in:
      https://bugs.llvm.org/show_bug.cgi?id=32401
      
      It's likely that other targets will want to enable this hook for scalar transforms, 
      and there are probably other patterns that can use bitwise logic to reduce comparisons.
      
      Note that we are missing an IR canonicalization for these patterns, and we will probably
      prefer the pair-of-compares form in IR (shorter, more likely to fold).
      
      Differential Revision: https://reviews.llvm.org/D31483
      
      llvm-svn: 299542
      b2f1621b
    • Alex Bradbury's avatar
      Add MCContext argument to MCAsmBackend::applyFixup for error reporting · 866113c2
      Alex Bradbury authored
      A number of backends (AArch64, MIPS, ARM) have been using
      MCContext::reportError to report issues such as out-of-range fixup values in
      their TgtAsmBackend. This is great, but because MCContext couldn't easily be
      threaded through to the adjustFixupValue helper function from its usual
      callsite (applyFixup), these backends ended up adding an MCContext* argument
      and adding another call to applyFixup to processFixupValue. Adding an
      MCContext parameter to applyFixup makes this unnecessary, and even better -
      applyFixup can take a reference to MCContext rather than a potentially null
      pointer.
      
      Differential Revision: https://reviews.llvm.org/D30264
      
      llvm-svn: 299529
      866113c2
  12. Mar 31, 2017
  13. Mar 30, 2017
  14. Mar 29, 2017
  15. Mar 28, 2017
  16. Mar 23, 2017
  17. Mar 18, 2017
  18. Mar 16, 2017
    • Reid Kleckner's avatar
      Remove getArgumentList() in favor of arg_begin(), args(), etc · 45707d4d
      Reid Kleckner authored
      Users often call getArgumentList().size(), which is a linear way to get
      the number of function arguments. arg_size(), on the other hand, is
      constant time.
      
      In general, the fact that arguments are stored in an iplist is an
      implementation detail, so I've removed it from the Function interface
      and moved all other users to the argument container APIs (arg_begin(),
      arg_end(), args(), arg_size()).
      
      Reviewed By: chandlerc
      
      Differential Revision: https://reviews.llvm.org/D31052
      
      llvm-svn: 298010
      45707d4d
    • Hiroshi Inoue's avatar
      Test commit. · 138a3faa
      Hiroshi Inoue authored
      llvm-svn: 297959
      138a3faa
  19. Mar 15, 2017
  20. Mar 08, 2017
  21. Mar 07, 2017
  22. Mar 06, 2017
  23. Mar 04, 2017
    • Sanjay Patel's avatar
      [DAGCombiner] allow transforming (select Cond, C +/- 1, C) to (add(ext Cond), C) · 066f3208
      Sanjay Patel authored
      select Cond, C +/- 1, C --> add(ext Cond), C -- with a target hook.
      
      This is part of the ongoing process to obsolete D24480.  The motivation is to 
      canonicalize to select IR in InstCombine whenever possible, so we need to have a way to
      undo that easily in codegen.
       
      PowerPC is an obvious winner for this kind of transform because it has fast and complete 
      bit-twiddling abilities but generally lousy conditional execution perf (although this might
      have changed in recent implementations).
      
      x86 also sees some wins, but the effect is limited because these transforms already mostly
      exist in its target-specific combineSelectOfTwoConstants(). The fact that we see any x86 
      changes just shows that that code is a mess of special-case holes. We may be able to remove 
      some of that logic now.
      
      My guess is that other targets will want to enable this hook for most cases. The likely 
      follow-ups would be to add value type and/or the constants themselves as parameters for the
      hook. As the tests in select_const.ll show, we can transform any select-of-constants to 
      math/logic, but the general transform for any 2 constants needs one more instruction 
      (multiply or 'and').
      
      ARM is one target that I think may not want this for most cases. I see infinite loops there
      because it wants to use selects to enable conditionally executed instructions.
      
      Differential Revision: https://reviews.llvm.org/D30537
      
      llvm-svn: 296977
      066f3208
  24. Mar 03, 2017
  25. Mar 02, 2017
    • Guozhi Wei's avatar
      [PPC] Fix code generation for bswap(int32) followed by store16 · ed28e742
      Guozhi Wei authored
      This patch fixes pr32063.
      
      Current code in PPCTargetLowering::PerformDAGCombine can transform
      
      bswap
      store
      
      into a single PPCISD::STBRX instruction. but it doesn't consider the case that the operand size of bswap may be larger than store size. When it occurs, we need 2 modifications,
      
      1 For the last operand of PPCISD::STBRX, we should not use DAG.getValueType(N->getOperand(1).getValueType()), instead we should use cast<StoreSDNode>(N)->getMemoryVT().
      
      2 Before PPCISD::STBRX, we need to shift the original operand of bswap to the right side.
      
      Differential Revision: https://reviews.llvm.org/D30362
      
      llvm-svn: 296811
      ed28e742
    • Nemanja Ivanovic's avatar
      [PowerPC][ELFv2ABI] Allocate parameter area on-demand to reduce stack frame size · db8425ef
      Nemanja Ivanovic authored
      This patch reduces the stack frame size by not allocating the parameter area if
      it is not required. In the current implementation LowerFormalArguments_64SVR4
      already handles the parameter area, but LowerCall_64SVR4 does not
      (when calculating the stack frame size). What this patch does is make
      LowerCall_64SVR4 consistent with LowerFormalArguments_64SVR4.
      
      Committing on behalf of Hiroshi Inoue.
      
      Differential Revision: https://reviews.llvm.org/D29881
      
      llvm-svn: 296771
      db8425ef
  26. Feb 26, 2017
  27. Feb 24, 2017
  28. Feb 17, 2017
  29. Feb 11, 2017
Loading