Skip to content
  1. Feb 28, 2008
  2. Feb 27, 2008
    • Dale Johannesen's avatar
      Handle load/store of misaligned vectors that are the · bf76a08e
      Dale Johannesen authored
      same size as an int type by doing a bitconvert of
      load/store of the int type (same algorithm as floating point).
      This makes them work for ppc Altivec.  There was some
      code that purported to handle loads of (some) vectors
      by splitting them into two smaller vectors, but getExtLoad
      rejects subvector loads, so this could never have worked;
      the patch removes it.
      
      llvm-svn: 47696
      bf76a08e
    • Dan Gohman's avatar
      Don't hard-code the mask size to be 32, which is incorrect on ppc64 · 26854f24
      Dan Gohman authored
      and was causing aborts with the new APInt changes. This may also be
      fixing an obscure ppc64 bug.
      
      llvm-svn: 47692
      26854f24
    • Evan Cheng's avatar
      This is done. · 3d17e4c4
      Evan Cheng authored
      llvm-svn: 47688
      3d17e4c4
    • Evan Cheng's avatar
      Fix a bug in dead spill slot elimination. · fdc732ab
      Evan Cheng authored
      llvm-svn: 47687
      fdc732ab
    • Dan Gohman's avatar
      Remove the `else', at Evan's insistence. · e5e32ec8
      Dan Gohman authored
      llvm-svn: 47686
      e5e32ec8
    • Duncan Sands's avatar
      Add a FIXME about the VECTOR_SHUFFLE evil hack. · ef40c5b2
      Duncan Sands authored
      llvm-svn: 47676
      ef40c5b2
    • Lauro Ramos Venancio's avatar
      Emit an error when a library is not found. It is the GNU ld behavior and it is... · 14241b29
      Lauro Ramos Venancio authored
      Emit an error when a library is not found. It is the GNU ld behavior and it is expected by the configure scripts.
      
      llvm-svn: 47674
      14241b29
    • Duncan Sands's avatar
      LegalizeTypes support for EXTRACT_VECTOR_ELT. The · e158a82f
      Duncan Sands authored
      approach taken is different to that in LegalizeDAG
      when it is a question of expanding or promoting the
      result type: for example, if extracting an i64 from
      a <2 x i64>, when i64 needs expanding, it bitcasts
      the vector to <4 x i32>, extracts the appropriate
      two i32's, and uses those for the Lo and Hi parts.
      Likewise, when extracting an i16 from a <4 x i16>,
      and i16 needs promoting, it bitcasts the vector to
      <2 x i32>, extracts the appropriate i32, twiddles
      the bits if necessary, and uses that as the promoted
      value.  This puts more pressure on bitcast legalization,
      and I've added the appropriate cases.  They needed to
      be added anyway since users can generate such bitcasts
      too if they want to.  Also, when considering various
      cases (Legal, Promote, Expand, Scalarize, Split) it is
      a pain that expand can correspond to Expand, Scalarize
      or Split, so I've changed the LegalizeTypes enum so it
      lists those different cases - now Expand only means
      splitting a scalar in two.
      The code produced is the same as by LegalizeDAG for
      all relevant testcases, except for
      2007-10-31-extractelement-i64.ll, where the code seems
      to have improved (see below; can an expert please tell
      me if it is better or not).
      Before < vs after >.
      
      <       subl    $92, %esp
      <       movaps  %xmm0, 64(%esp)
      <       movaps  %xmm0, (%esp)
      <       movl    4(%esp), %eax
      <       movl    %eax, 28(%esp)
      <       movl    (%esp), %eax
      <       movl    %eax, 24(%esp)
      <       movq    24(%esp), %mm0
      <       movq    %mm0, 56(%esp)
      ---
      >       subl    $44, %esp
      >       movaps  %xmm0, 16(%esp)
      >       pshufd  $1, %xmm0, %xmm1
      >       movd    %xmm1, 4(%esp)
      >       movd    %xmm0, (%esp)
      >       movq    (%esp), %mm0
      >       movq    %mm0, 8(%esp)
      
      <       subl    $92, %esp
      <       movaps  %xmm0, 64(%esp)
      <       movaps  %xmm0, (%esp)
      <       movl    12(%esp), %eax
      <       movl    %eax, 28(%esp)
      <       movl    8(%esp), %eax
      <       movl    %eax, 24(%esp)
      <       movq    24(%esp), %mm0
      <       movq    %mm0, 56(%esp)
      ---
      >       subl    $44, %esp
      >       movaps  %xmm0, 16(%esp)
      >       pshufd  $3, %xmm0, %xmm1
      >       movd    %xmm1, 4(%esp)
      >       movhlps %xmm0, %xmm0
      >       movd    %xmm0, (%esp)
      >       movq    (%esp), %mm0
      >       movq    %mm0, 8(%esp)
      
      <       subl    $92, %esp
      <       movaps  %xmm0, 64(%esp)
      ---
      >       subl    $44, %esp
      
      <       movl    16(%esp), %eax
      <       movl    %eax, 48(%esp)
      <       movl    20(%esp), %eax
      <       movl    %eax, 52(%esp)
      <       movaps  %xmm0, (%esp)
      <       movl    4(%esp), %eax
      <       movl    %eax, 60(%esp)
      <       movl    (%esp), %eax
      <       movl    %eax, 56(%esp)
      ---
      >       pshufd  $1, %xmm0, %xmm1
      >       movd    %xmm1, 4(%esp)
      >       movd    %xmm0, (%esp)
      >       movd    %xmm1, 12(%esp)
      >       movd    %xmm0, 8(%esp)
      
      <       subl    $92, %esp
      <       movaps  %xmm0, 64(%esp)
      ---
      >       subl    $44, %esp
      
      <       movl    24(%esp), %eax
      <       movl    %eax, 48(%esp)
      <       movl    28(%esp), %eax
      <       movl    %eax, 52(%esp)
      <       movaps  %xmm0, (%esp)
      <       movl    12(%esp), %eax
      <       movl    %eax, 60(%esp)
      <       movl    8(%esp), %eax
      <       movl    %eax, 56(%esp)
      ---
      >       pshufd  $3, %xmm0, %xmm1
      >       movd    %xmm1, 4(%esp)
      >       movhlps %xmm0, %xmm0
      >       movd    %xmm0, (%esp)
      >       movd    %xmm1, 12(%esp)
      >       movd    %xmm0, 8(%esp)
      
      llvm-svn: 47672
      e158a82f
    • Duncan Sands's avatar
      LegalizeTypes support for legalizing the mask · 2111bd2e
      Duncan Sands authored
      operand of a VECTOR_SHUFFLE.  The mask is a
      vector of constant integers.  The code in
      LegalizeDAG doesn't bother to legalize the
      mask, since it's basically just storage for
      a bunch of constants, however LegalizeTypes
      is more picky.  The problem is that there may
      not exist any legal vector-of-integers type
      with a legal element type, so it is impossible
      to create a legal mask!  Unless of course you
      cheat by creating a BUILD_VECTOR where the
      operands have a different type to the element
      type of the vector being built...  This is
      pretty ugly but works - all relevant tests in
      the testsuite pass, and produce the same
      assembler with and without LegalizeTypes.
      
      llvm-svn: 47670
      2111bd2e
    • Duncan Sands's avatar
      LegalizeTypes support for INSERT_VECTOR_ELT. · 5d5bc484
      Duncan Sands authored
      llvm-svn: 47669
      5d5bc484
    • Evan Cheng's avatar
      Don't track max alignment during stack object allocations since they can be... · 8ae8e2d5
      Evan Cheng authored
      Don't track max alignment during stack object allocations since they can be deleted later. Let PEI compute it.
      
      llvm-svn: 47668
      8ae8e2d5
    • Duncan Sands's avatar
      Support for legalizing MEMBARRIER. · 96658d01
      Duncan Sands authored
      llvm-svn: 47667
      96658d01
    • Bill Wendling's avatar
      Final de-tabification. · 97925ec7
      Bill Wendling authored
      llvm-svn: 47663
      97925ec7
    • Chris Lattner's avatar
      Add path separator support, patch by Sam Bishop. · 6fca9389
      Chris Lattner authored
      llvm-svn: 47662
      6fca9389
    • Chris Lattner's avatar
      Make X86TargetLowering::LowerSINT_TO_FP return without creating a dead · 83263b8c
      Chris Lattner authored
      stack slot and store if the  SINT_TO_FP is actually legal.  This allows
      us to compile:
      
      double a(double b) {return (unsigned)b;}
      
      to:
      
      _a:
      	cvttsd2siq	%xmm0, %rax
      	movl	%eax, %eax
      	cvtsi2sdq	%rax, %xmm0
      	ret
      
      instead of:
      
      _a:
      	subq	$8, %rsp
      	cvttsd2siq	%xmm0, %rax
      	movl	%eax, %eax
      	cvtsi2sdq	%rax, %xmm0
      	addq	$8, %rsp
      	ret
      
      crazy.
      
      llvm-svn: 47660
      83263b8c
    • Chris Lattner's avatar
      this code is correct but strange looking ;-) · 5fe95a04
      Chris Lattner authored
      llvm-svn: 47659
      5fe95a04
    • Chris Lattner's avatar
      Compile x86-64-and-mask.ll into: · 3c7d3d57
      Chris Lattner authored
      _test:
      	movl	%edi, %eax
      	ret
      
      instead of:
      
      _test:
              movl    $4294967295, %ecx
              movq    %rdi, %rax
              andq    %rcx, %rax
              ret
      
      It would be great to write this as a Pat pattern that used subregs 
      instead of a 'pseudo' instruction, but I don't know how to do that
      in td files.
      
      llvm-svn: 47658
      3c7d3d57
    • Evan Cheng's avatar
      Spiller now remove unused spill slots. · 6d56368c
      Evan Cheng authored
      llvm-svn: 47657
      6d56368c
    • Dan Gohman's avatar
      Teach Legalize how to expand an EXTRACT_ELEMENT. · 66272a54
      Dan Gohman authored
      llvm-svn: 47656
      66272a54
    • Dan Gohman's avatar
      Convert the last remaining users of the non-APInt form of · f19609ab
      Dan Gohman authored
      ComputeMaskedBits to use the APInt form, and remove the
      non-APInt form.
      
      llvm-svn: 47654
      f19609ab
Loading