Skip to content
  1. Jun 02, 2009
  2. May 26, 2009
    • Stefanus Du Toit's avatar
      Update CPU capabilities for AMD machines · 96180b53
      Stefanus Du Toit authored
      - added processors k8-sse3, opteron-sse3, athlon64-sse3, amdfam10, and
      barcelona with appropriate sse3/4a levels
      - added FeatureSSE4A for amdfam10 processors
      in X86Subtarget:
      - added hasSSE4A
      - updated AutoDetectSubtargetFeatures to detect SSE4A
      - updated GetCurrentX86CPU to detect family 15 with sse3 as k8-sse3 and
      family 10h as amdfam10
      
      New processor names match those used by gcc.
      
      Patch by Paul Redmond!
      
      llvm-svn: 72434
      96180b53
  3. Feb 03, 2009
    • Dan Gohman's avatar
      Change Feature64Bit to not imply FeatureSSE2. All x86-64 hardware has · 7403751e
      Dan Gohman authored
      SSE2, however it's possible to disable SSE2, and the subtarget support
      code thinks that if 64-bit implies SSE2 and SSE2 is disabled then
      64-bit should also be disabled. Instead, just mark all the 64-bit
      subtargets as explicitly supporting SSE2.
      
      Also, move the code that makes -march=x86-64 enable 64-bit support by
      default to only apply when there is no explicit subtarget. If you
      need to specify a subtarget and you want 64-bit code, you'll need to
      select a subtarget that supports 64-bit code.
      
      llvm-svn: 63575
      7403751e
  4. Jan 03, 2009
  5. Jan 02, 2009
  6. Nov 24, 2008
  7. Oct 15, 2008
  8. Oct 11, 2008
  9. Mar 01, 2008
  10. Feb 15, 2008
  11. Feb 03, 2008
  12. Dec 29, 2007
  13. Oct 12, 2007
  14. May 22, 2007
  15. May 06, 2007
  16. May 04, 2007
  17. Apr 25, 2007
  18. Apr 11, 2007
  19. Feb 26, 2007
  20. Oct 06, 2006
  21. Sep 08, 2006
  22. May 18, 2006
  23. May 17, 2006
  24. Feb 01, 2006
    • Evan Cheng's avatar
      - Use xor to clear integer registers (set R, 0). · 9e350cd6
      Evan Cheng authored
      - Added a new format for instructions where the source register is implied
        and it is same as the destination register. Used for pseudo instructions
        that clear the destination register.
      
      llvm-svn: 25872
      9e350cd6
  25. Jan 31, 2006
    • Chris Lattner's avatar
      * Fix 80-column violations · c642aa5e
      Chris Lattner authored
      * Rename hasSSE -> hasSSE1 to avoid my continual confusion with 'has any SSE'.
      * Add inline asm constraint specification.
      
      llvm-svn: 25854
      c642aa5e
  26. Jan 29, 2006
  27. Jan 27, 2006
  28. Jan 26, 2006
  29. Jul 15, 2005
  30. Jul 06, 2005
    • Nate Begeman's avatar
      First round of support for doing scalar FP using the SSE2 ISA extension and · 8a093360
      Nate Begeman authored
      XMM registers.  There are many known deficiencies and fixmes, which will be
      addressed ASAP.  The major benefit of this work is that it will allow the
      LLVM register allocator to allocate FP registers across basic blocks.
      
      The x86 backend will still default to x87 style FP.  To enable this work,
      you must pass -enable-sse-scalar-fp and either -sse2 or -sse3 to llc.
      
      An example before and after would be for:
      double foo(double *P) { double Sum = 0; int i; for (i = 0; i < 1000; ++i)
                              Sum += P[i]; return Sum; }
      
      The inner loop looks like the following:
      x87:
      .LBB_foo_1:     # no_exit
              fldl (%esp)
              faddl (%eax,%ecx,8)
              fstpl (%esp)
              incl %ecx
              cmpl $1000, %ecx
              #FP_REG_KILL
              jne .LBB_foo_1  # no_exit
      
      SSE2:
              addsd (%eax,%ecx,8), %xmm0
              incl %ecx
              cmpl $1000, %ecx
              #FP_REG_KILL
              jne .LBB_foo_1  # no_exit
      
      llvm-svn: 22340
      8a093360
  31. Oct 03, 2004
  32. Aug 11, 2004
  33. Aug 01, 2004
  34. Apr 13, 2004
  35. Apr 08, 2004
    • John Criswell's avatar
      Added the llvm.readport and llvm.writeport intrinsics for x86. These do · 10db062d
      John Criswell authored
      I/O port instructions on x86.  The specific code sequence is tailored to
      the parameters and return value of the intrinsic call.
      Added the ability for implicit defintions to be printed in the Instruction
      Printer.
      Added the ability for RawFrm instruction to print implict uses and
      defintions with correct comma output.  This required adjustment to some
      methods so that a leading comma would or would not be printed.
      
      llvm-svn: 12782
      10db062d
  36. Apr 01, 2004
  37. Feb 28, 2004
Loading