Skip to content
  1. Feb 26, 2010
    • Johnny Chen's avatar
      Added the follwoing 32-bit Thumb instructions for disassembly only: · 38e7bb6f
      Johnny Chen authored
      o Parallel addition and subtraction, signed/unsigned
      o Miscellaneous operations: QADD, QDADD, QSUB, QDSUB
      o Unsigned sum of absolute differences [and accumulate]: USAD8, USADA8
      o Signed/Unsigned saturate: SSAT, SSAT16, USAT, USAT16
      o Signed multiply accumulate long (halfwords): SMLAL<x><y>
      o Signed multiply accumulate/subtract [long] (dual): SMLAD[x], SMLALD[X], SMLSD[X], SMLSLD[X]
      o Signed dual multiply add/subtract [long]: SMUAD[X], SMUSD[X]
      
      llvm-svn: 97276
      38e7bb6f
  2. Feb 25, 2010
  3. Feb 16, 2010
  4. Feb 09, 2010
  5. Feb 02, 2010
  6. Jan 22, 2010
  7. Jan 19, 2010
  8. Jan 18, 2010
    • Jim Grosbach's avatar
      Patch by David Conrad: · 8546ec9c
      Jim Grosbach authored
      "On ARMv6T2 this turns cttz into rbit, clz instead of the 4 instruction
       sequence it is now."
      
      llvm-svn: 93758
      8546ec9c
  9. Jan 08, 2010
  10. Jan 05, 2010
  11. Dec 16, 2009
  12. Dec 15, 2009
  13. Dec 14, 2009
  14. Nov 24, 2009
    • Anton Korobeynikov's avatar
      Materialize global addresses via movt/movw pair, this is always better · 25229086
      Anton Korobeynikov authored
      than doing the same via constpool:
      1. Load from constpool costs 3 cycles on A9, movt/movw pair - just 2.
      2. Load from constpool might stall up to 300 cycles due to cache miss.
      3. Movt/movw does not use load/store unit.
      4. Less constpool entries => better compiler performance.
      
      This is only enabled on ELF systems, since darwin does not have needed
      relocations (yet).
      
      llvm-svn: 89720
      25229086
    • Jim Grosbach's avatar
      80 column violations · f890f516
      Jim Grosbach authored
      llvm-svn: 89718
      f890f516
  15. Nov 23, 2009
  16. Nov 20, 2009
  17. Nov 07, 2009
    • Evan Cheng's avatar
      - Add pseudo instructions tLDRpci_pic and t2LDRpci_pic which does a pc-relative · 207b2466
      Evan Cheng authored
        load of a GV from constantpool and then add pc. It allows the code sequence to
        be rematerializable so it would be hoisted by machine licm.
      - Add a late pass to break these pseudo instructions into a number of real
        instructions. Also move the code in Thumb2 IT pass that breaks up t2MOVi32imm
        to this pass. This is done before post regalloc scheduling to allow the
        scheduler to proper schedule these instructions. It also allow them to be
        if-converted and shrunk by later passes.
      
      llvm-svn: 86304
      207b2466
  18. Nov 04, 2009
  19. Nov 02, 2009
  20. Oct 31, 2009
  21. Oct 30, 2009
  22. Oct 29, 2009
  23. Oct 27, 2009
  24. Oct 21, 2009
  25. Oct 13, 2009
  26. Oct 01, 2009
  27. Sep 30, 2009
  28. Sep 28, 2009
Loading