Skip to content
  1. Oct 24, 2008
  2. Oct 23, 2008
  3. Oct 22, 2008
  4. Oct 21, 2008
    • Dale Johannesen's avatar
      Add an SSE2 algorithm for uint64->f64 conversion. · 28929589
      Dale Johannesen authored
      The same one Apple gcc uses, faster.  Also gets the
      extreme case in gcc.c-torture/execute/ieee/rbug.c
      correct which we weren't before; this is not
      sufficient to get the test to pass though, there
      is another bug.
      
      llvm-svn: 57926
      28929589
    • Dan Gohman's avatar
      Fix SelectionDAGBuild lowering of Select instructions to · 8b44b88e
      Dan Gohman authored
      handle first-class aggregate values. Also, fix a bug in
      the Ret handling for empty aggregates.
      
      llvm-svn: 57925
      8b44b88e
    • Daniel Dunbar's avatar
      Clear raw_fd_ostream error string on success and explain behavior in · 1ca20df4
      Daniel Dunbar authored
      documentation.
      
      Add C++ header marker.
      
      llvm-svn: 57923
      1ca20df4
    • Dan Gohman's avatar
      Implement the optimized FCMP_OEQ/FCMP_UNE code for x86 fast-isel. · 4ddf7a4c
      Dan Gohman authored
      llvm-svn: 57915
      4ddf7a4c
    • Jim Grosbach's avatar
      use pre-UAL mnemonics for push/pop for compilaton callback function · cfebc18d
      Jim Grosbach authored
      llvm-svn: 57911
      cfebc18d
    • Nuno Lopes's avatar
      fix a tricky bug in the JIT global variable emitter, that was triggered when... · 94844e20
      Nuno Lopes authored
      fix a tricky bug in the JIT global variable emitter, that was triggered when JITing a variable independently of a function. This lead to sharing memory memory between functions and GVs thus changing the value of a GV could change the code in execution. more details on the ML.
      
      llvm-svn: 57900
      94844e20
    • Dan Gohman's avatar
      Disable constant-offset folding for PowerPC, as the PowerPC target · c14e5227
      Dan Gohman authored
      isn't yet prepared for it.
      
      llvm-svn: 57886
      c14e5227
    • Dan Gohman's avatar
      Don't create TargetGlobalAddress nodes with offsets that don't fit · 269246b0
      Dan Gohman authored
      in the 32-bit signed offset field of addresses. Even though this
      may be intended, some linkers refuse to relocate code where the
      relocated address computation overflows.
      
      Also, fix the sign-extension of constant offsets to use the
      actual pointer size, rather than the size of the GlobalAddress
      node, which may be different, for example on x86-64 where MVT::i32
      is used when the address is being fit into the 32-bit displacement
      field.
      
      llvm-svn: 57885
      269246b0
    • Dan Gohman's avatar
      Optimized FCMP_OEQ and FCMP_UNE for x86. · 97d95d6d
      Dan Gohman authored
      Where previously LLVM might emit code like this:
      
              ucomisd %xmm1, %xmm0
              setne   %al
              setp    %cl
              orb     %al, %cl
              jne     .LBB4_2
      
      it now emits this:
      
              ucomisd %xmm1, %xmm0
              jne     .LBB4_2
              jp      .LBB4_2
      
      It has fewer instructions and uses fewer registers, but it does
      have more branches. And in the case that this code is followed by
      a non-fallthrough edge, it may be followed by a jmp instruction,
      resulting in three branch instructions in sequence. Some effort
      is made to avoid this situation.
      
      To achieve this, X86ISelLowering.cpp now recognizes FCMP_OEQ and
      FCMP_UNE in lowered form, and replace them with code that emits
      two branches, except in the case where it would require converting
      a fall-through edge to an explicit branch.
      
      Also, X86InstrInfo.cpp's branch analysis and transform code now
      knows now to handle blocks with multiple conditional branches. It
      uses loops instead of having fixed checks for up to two
      instructions. It can now analyze and transform code generated
      from FCMP_OEQ and FCMP_UNE.
      
      llvm-svn: 57873
      97d95d6d
    • Dan Gohman's avatar
      When the coalescer is doing rematerializing, have it remove · c835458d
      Dan Gohman authored
      the copy instruction from the instruction list before asking the
      target to create the new instruction. This gets the old instruction
      out of the way so that it doesn't interfere with the target's
      rematerialization code. In the case of x86, this helps it find
      more cases where EFLAGS is not live.
      
      Also, in the X86InstrInfo.cpp, teach isSafeToClobberEFLAGS to check
      to see if it reached the end of the block after scanning each
      instruction, instead of just before. This lets it notice when the
      end of the block is only two instructions away, without doing any
      additional scanning.
      
      These changes allow rematerialization to clobber EFLAGS in more
      cases, for example using xor instead of mov to set the return value
      to zero in the included testcase.
      
      llvm-svn: 57872
      c835458d
    • Dan Gohman's avatar
      Make the NaN test come second, heuristically assuming · 97d3f6cf
      Dan Gohman authored
      that NaNs are less common.
      
      llvm-svn: 57871
      97d3f6cf
    • Dan Gohman's avatar
      72e66eed
Loading