Skip to content
  1. Apr 15, 2007
  2. Apr 13, 2007
    • Chris Lattner's avatar
      Now that codegen prepare isn't defeating me, I can finally fix what I set · efd3051d
      Chris Lattner authored
      out to do! :)
      
      This fixes a problem where LSR would insert a bunch of code into each MBB
      that uses a particular subexpression (e.g. IV+base+C).  The problem is that
      this code cannot be CSE'd back together if inserted into different blocks.
      
      This patch changes LSR to attempt to insert a single copy of this code and
      share it, allowing codegenprepare to duplicate the code if it can be sunk
      into various addressing modes.  On CodeGen/ARM/lsr-code-insertion.ll,
      for example, this gives us code like:
      
              add r8, r0, r5
              str r6, [r8, #+4]
      ..
              ble LBB1_4      @cond_next
      LBB1_3: @cond_true
              str r10, [r8, #+4]
      LBB1_4: @cond_next
      ...
      LBB1_5: @cond_true55
              ldr r6, LCPI1_1
              str r6, [r8, #+4]
      
      instead of:
      
              add r10, r0, r6
              str r8, [r10, #+4]
      ...
              ble LBB1_4      @cond_next
      LBB1_3: @cond_true
              add r8, r0, r6
              str r10, [r8, #+4]
      LBB1_4: @cond_next
      ...
      LBB1_5: @cond_true55
              add r8, r0, r6
              ldr r10, LCPI1_1
              str r10, [r8, #+4]
      
      Besides being smaller and more efficient, this makes it immediately
      obvious that it is profitable to predicate LBB1_3 now :)
      
      llvm-svn: 35972
      efd3051d
  3. Apr 10, 2007
  4. Apr 07, 2007
  5. Apr 03, 2007
  6. Apr 02, 2007
  7. Mar 26, 2007
  8. Mar 20, 2007
  9. Mar 13, 2007
  10. Mar 09, 2007
  11. Mar 06, 2007
  12. Mar 03, 2007
  13. Mar 02, 2007
  14. Feb 10, 2007
  15. Feb 06, 2007
  16. Jan 15, 2007
  17. Jan 08, 2007
    • Reid Spencer's avatar
      For PR1097: · bf96e02a
      Reid Spencer authored
      Enable complex addressing modes on 64-bit platforms involving two induction
      variables by keeping a size and scale in 64-bits not 32.
      Patch by Dan Gohman.
      
      llvm-svn: 33011
      bf96e02a
  18. Jan 06, 2007
  19. Dec 31, 2006
    • Reid Spencer's avatar
      For PR950: · c635f47d
      Reid Spencer authored
      This patch replaces signed integer types with signless ones:
      1. [US]Byte -> Int8
      2. [U]Short -> Int16
      3. [U]Int   -> Int32
      4. [U]Long  -> Int64.
      5. Removal of isSigned, isUnsigned, getSignedVersion, getUnsignedVersion
         and other methods related to signedness. In a few places this warranted
         identifying the signedness information from other sources.
      
      llvm-svn: 32785
      c635f47d
  20. Dec 23, 2006
    • Reid Spencer's avatar
      For PR950: · 266e42b3
      Reid Spencer authored
      This patch removes the SetCC instructions and replaces them with the ICmp
      and FCmp instructions. The SetCondInst instruction has been removed and
      been replaced with ICmpInst and FCmpInst.
      
      llvm-svn: 32751
      266e42b3
  21. Dec 19, 2006
  22. Dec 13, 2006
  23. Dec 12, 2006
  24. Dec 07, 2006
  25. Dec 06, 2006
  26. Nov 27, 2006
    • Reid Spencer's avatar
      For PR950: · 6c38f0bb
      Reid Spencer authored
      The long awaited CAST patch. This introduces 12 new instructions into LLVM
      to replace the cast instruction. Corresponding changes throughout LLVM are
      provided. This passes llvm-test, llvm/test, and SPEC CPUINT2000 with the
      exception of 175.vpr which fails only on a slight floating point output
      difference.
      
      llvm-svn: 31931
      6c38f0bb
  27. Nov 26, 2006
  28. Nov 17, 2006
    • Chris Lattner's avatar
      If an indvar with a variable stride is used by the exit condition, go ahead · 21eba2da
      Chris Lattner authored
      and handle it like constant stride vars.  This fixes some bad codegen in
      variable stride cases.  For example, it compiles this:
      
      void foo(int k, int i) {
        for (k=i+i; k <= 8192; k+=i)
          flags2[k] = 0;
      }
      
      to:
      
      LBB1_1: #bb.preheader
              movl %eax, %ecx
              addl %ecx, %ecx
              movl L_flags2$non_lazy_ptr, %edx
      LBB1_2: #bb
              movb $0, (%edx,%ecx)
              addl %eax, %ecx
              cmpl $8192, %ecx
              jle LBB1_2      #bb
      LBB1_5: #return
              ret
      
      or (if the array is local and we are in dynamic-nonpic or static mode):
      
      LBB3_2: #bb
              movb $0, _flags2(%ecx)
              addl %eax, %ecx
              cmpl $8192, %ecx
              jle LBB3_2      #bb
      
      and:
      
              lis r2, ha16(L_flags2$non_lazy_ptr)
              lwz r2, lo16(L_flags2$non_lazy_ptr)(r2)
              slwi r3, r4, 1
      LBB1_2: ;bb
              li r5, 0
              add r6, r4, r3
              stbx r5, r2, r3
              cmpwi cr0, r6, 8192
              bgt cr0, LBB1_5 ;return
      
      instead of:
      
              leal (%eax,%eax,2), %ecx
              movl %eax, %edx
              addl %edx, %edx
              addl L_flags2$non_lazy_ptr, %edx
              xorl %esi, %esi
      LBB1_2: #bb
              movb $0, (%edx,%esi)
              movl %eax, %edi
              addl %esi, %edi
              addl %ecx, %esi
              cmpl $8192, %esi
              jg LBB1_5       #return
      
      and:
      
              lis r2, ha16(L_flags2$non_lazy_ptr)
              lwz r2, lo16(L_flags2$non_lazy_ptr)(r2)
              mulli r3, r4, 3
              slwi r5, r4, 1
              li r6, 0
              add r2, r2, r5
      LBB1_2: ;bb
              li r5, 0
              add r7, r3, r6
              stbx r5, r2, r6
              add r6, r4, r6
              cmpwi cr0, r7, 8192
              ble cr0, LBB1_2 ;bb
      
      This speeds up Benchmarks/Shootout/sieve from 8.533s to 6.464s and
      implements LoopStrengthReduce/var_stride_used_by_compare.ll
      
      llvm-svn: 31809
      21eba2da
  29. Nov 02, 2006
    • Reid Spencer's avatar
      For PR786: · de46e484
      Reid Spencer authored
      Turn on -Wunused and -Wno-unused-parameter. Clean up most of the resulting
      fall out by removing unused variables. Remaining warnings have to do with
      unused functions (I didn't want to delete code without review) and unused
      variables in generated code. Maintainers should clean up the remaining
      issues when they see them. All changes pass DejaGnu tests and Olden.
      
      llvm-svn: 31380
      de46e484
  30. Oct 28, 2006
Loading