Skip to content
  1. May 22, 2010
  2. May 06, 2010
  3. Apr 29, 2010
  4. Apr 27, 2010
  5. Apr 26, 2010
  6. Mar 31, 2010
  7. Mar 30, 2010
  8. Mar 25, 2010
    • Jakob Stoklund Olesen's avatar
      Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings. · 49e121d5
      Jakob Stoklund Olesen authored
      On Nehalem and newer CPUs there is a 2 cycle latency penalty on using a register
      in a different domain than where it was defined. Some instructions have
      equvivalents for different domains, like por/orps/orpd.
      
      The SSEDomainFix pass tries to minimize the number of domain crossings by
      changing between equvivalent opcodes where possible.
      
      This is a work in progress, in particular the pass doesn't do anything yet. SSE
      instructions are tagged with their execution domain in TableGen using the last
      two bits of TSFlags. Note that not all instructions are tagged correctly. Life
      just isn't that simple.
      
      The SSE execution domain issue is very similar to the ARM NEON/VFP pipeline
      issue handled by NEONMoveFixPass. This pass may become target independent to
      handle both.
      
      llvm-svn: 99524
      49e121d5
  9. Mar 24, 2010
  10. Feb 13, 2010
  11. Feb 12, 2010
  12. Feb 05, 2010
  13. Feb 03, 2010
  14. Jan 22, 2010
  15. Jan 13, 2010
    • Evan Cheng's avatar
      Add a quick pass to optimize sign / zero extension instructions. For targets... · 30bebff4
      Evan Cheng authored
      Add a quick pass to optimize sign / zero extension instructions. For targets where the pre-extension values are available in the subreg of the result of the extension, replace the uses of the pre-extension value with the result + extract_subreg.
      
      For now, this pass is fairly conservative. It only perform the replacement when both the pre- and post- extension values are used in the block. It will miss cases where the post-extension values are live, but not used.
      
      llvm-svn: 93278
      30bebff4
  16. Jan 12, 2010
  17. Dec 11, 2009
  18. Dec 05, 2009
  19. Dec 04, 2009
    • David Greene's avatar
      · 0508e435
      David Greene authored
      Have hasLoad/StoreFrom/ToStackSlot return the relevant MachineMemOperand.
      
      llvm-svn: 90608
      0508e435
  20. Nov 30, 2009
    • Bob Wilson's avatar
      Remove isProfitableToDuplicateIndirectBranch target hook. It is profitable · 505ddaa4
      Bob Wilson authored
      for all the processors where I have tried it, and even when it might not help
      performance, the cost is quite low.  The opportunities for duplicating
      indirect branches are limited by other factors so code size does not change
      much due to tail duplicating indirect branches aggressively.
      
      llvm-svn: 90144
      505ddaa4
  21. Nov 25, 2009
  22. Nov 14, 2009
  23. Nov 13, 2009
    • David Greene's avatar
      · 2f4c3742
      David Greene authored
      Fix a bootstrap failure.
      
      Provide special isLoadFromStackSlotPostFE and isStoreToStackSlotPostFE
      interfaces to explicitly request checking for post-frame ptr elimination
      operands.  This uses a heuristic so it isn't reliable for correctness.
      
      llvm-svn: 87047
      2f4c3742
  24. Nov 12, 2009
    • David Greene's avatar
      · 70fdd57d
      David Greene authored
      Add hasLoadFromStackSlot and hasStoreToStackSlot to return whether a
      machine instruction loads or stores from/to a stack slot.  Unlike
      isLoadFromStackSlot and isStoreFromStackSlot, the instruction may be
      something other than a pure load/store (e.g. it may be an arithmetic
      operation with a memory operand).  This helps AsmPrinter determine when
      to print a spill/reload comment.
      
      This is only a hint since we may not be able to figure this out in all
      cases.  As such, it should not be relied upon for correctness.
      
      Implement for X86.  Return false by default for other architectures.
      
      llvm-svn: 87026
      70fdd57d
  25. Oct 30, 2009
    • Dan Gohman's avatar
      Fix MachineLICM to use the correct virtual register class when · 49fa51d9
      Dan Gohman authored
      unfolding loads for hoisting.  getOpcodeAfterMemoryUnfold returns the
      opcode of the original operation without the load, not the load
      itself, MachineLICM needs to know the operand index in order to get
      the correct register class. Extend getOpcodeAfterMemoryUnfold to
      return this information.
      
      llvm-svn: 85622
      49fa51d9
  26. Oct 10, 2009
  27. Oct 09, 2009
  28. Oct 07, 2009
  29. Oct 05, 2009
  30. Sep 11, 2009
    • Evan Cheng's avatar
      It's not legal to fold a load from a narrower stack slot into a wider... · 3cad6283
      Evan Cheng authored
      It's not legal to fold a load from a narrower stack slot into a wider instruction. If done, the instruction does a 64-bit load and that's not
      safe. This can happen we a subreg_to_reg 0 has been coalesced. One
      exception is when the instruction that folds the load is a move, then we
      can simply turn it into a 32-bit load from the stack slot.                                                                                                                    
      
      rdar://7170444
      
      llvm-svn: 81494
      3cad6283
Loading