Skip to content
  1. Jul 02, 2012
  2. Jul 01, 2012
  3. Jun 30, 2012
  4. Jun 29, 2012
    • Manman Ren's avatar
      Add SrcReg2 to analyzeCompare and optimizeCompareInstr to handle Compare · 6fa76dc0
      Manman Ren authored
      instructions with two register operands.
      
      llvm-svn: 159465
      6fa76dc0
    • Chandler Carruth's avatar
      Move llvm/Support/IRBuilder.h -> llvm/IRBuilder.h · aafe0918
      Chandler Carruth authored
      This was always part of the VMCore library out of necessity -- it deals
      entirely in the IR. The .cpp file in fact was already part of the VMCore
      library. This is just a mechanical move.
      
      I've tried to go through and re-apply the coding standard's preferred
      header sort, but at 40-ish files, I may have gotten some wrong. Please
      let me know if so.
      
      I'll be committing the corresponding updates to Clang and Polly, and
      Duncan has DragonEgg.
      
      Thanks to Bill and Eric for giving the green light for this bit of cleanup.
      
      llvm-svn: 159421
      aafe0918
    • Andrew Trick's avatar
      Revert "Make NumMicroOps a variable in the subtarget's instruction itinerary." · 51a8cf77
      Andrew Trick authored
      This reverts commit r159406. I noticed a performance regression so I'll back out for now.
      
      llvm-svn: 159411
      51a8cf77
    • Rafael Espindola's avatar
      In the initial exec mode we always do a load to find the address of a variable. · efdfb1e6
      Rafael Espindola authored
      Before this patch in pic 32 bit code we would add the global base register
      and not load from that address. This is a really old bug, but before the
      introduction of the tls attributes we would never select initial exec for
      pic code.
      
      llvm-svn: 159409
      efdfb1e6
    • Andrew Trick's avatar
      Make NumMicroOps a variable in the subtarget's instruction itinerary. · 1f50152b
      Andrew Trick authored
      The TargetInstrInfo::getNumMicroOps API does not change, but soon it
      will be used by MachineScheduler. Now each subtarget can specify the
      number of micro-ops per itinerary class. For ARM, this is currently
      always dynamic (-1), because it is used for load/store multiple which
      depends on the number of register operands.
      
      Zero is now a valid number of micro-ops. This can be used for
      nop pseudo-instructions or instructions that the hardware can squash
      during dispatch.
      
      llvm-svn: 159406
      1f50152b
    • Manman Ren's avatar
      X86: add more GATHER intrinsics in LLVM · 98a5bf24
      Manman Ren authored
      Corrected type for index of llvm.x86.avx2.gather.d.pd.256
        from 256-bit to 128-bit.
      Corrected types for src|dst|mask of llvm.x86.avx2.gather.q.ps.256
        from 256-bit to 128-bit.
      
      Support the following intrinsics:
        llvm.x86.avx2.gather.d.q, llvm.x86.avx2.gather.q.q
        llvm.x86.avx2.gather.d.q.256, llvm.x86.avx2.gather.q.q.256
        llvm.x86.avx2.gather.d.d, llvm.x86.avx2.gather.q.d
        llvm.x86.avx2.gather.d.d.256, llvm.x86.avx2.gather.q.d.256
      
      llvm-svn: 159402
      98a5bf24
  5. Jun 28, 2012
    • Jack Carter's avatar
      Changed the formatting sequence of a curly brace to · 27747b57
      Jack Carter authored
      the comment per code review feedback.
      
      llvm-svn: 159376
      27747b57
    • Bill Wendling's avatar
      Remove layering violation #include. · b2f11986
      Bill Wendling authored
      llvm-svn: 159372
      b2f11986
    • Jack Carter's avatar
      The Mips specific inline asm operand modifier 'z' has the · 6c0bc0b3
      Jack Carter authored
      following description in the gnu sources:
      
          Print $0 if operand is zero otherwise print the op normally.
      
      llvm-svn: 159324
      6c0bc0b3
    • Bill Wendling's avatar
      Move lib/Analysis/DebugInfo.cpp to lib/VMCore/DebugInfo.cpp and · e38859dc
      Bill Wendling authored
      include/llvm/Analysis/DebugInfo.h to include/llvm/DebugInfo.h.
      
      The reasoning is because the DebugInfo module is simply an interface to the
      debug info MDNodes and has nothing to do with analysis.
      
      llvm-svn: 159312
      e38859dc
    • Jack Carter's avatar
      This allows hello world to be compiled for Mips 64 direct object. · ef40238a
      Jack Carter authored
      It takes advantage of r159299 which introduces relocation support for N64. 
      elf-dump needed to be upgraded to support N64 relocations as well.
      
      This passes make check.
      
      Jack
      
      llvm-svn: 159302
      ef40238a
    • Jack Carter's avatar
      This allows hello world to be compiled for Mips 64 direct object. · b9f9de93
      Jack Carter authored
      It takes advantage of r159299 which introduces relocation support for N64. 
      elf-dump needed to be upgraded to support N64 relocations as well.
      
      This passes make check.
      
      Jack
      
      llvm-svn: 159301
      b9f9de93
    • Chad Rosier's avatar
      Whitespace. · 51afe639
      Chad Rosier authored
      llvm-svn: 159300
      51afe639
    • Jack Carter's avatar
      The ELF relocation record format is different for N64 · 8ad0c272
      Jack Carter authored
      which many Mips 64 ABIs use than for O64 which many 
      if not all other target ABIs use.
      
      Most architectures have the following 64 bit relocation record format:
      
        typedef struct
        {
          Elf64_Addr   r_offset; /* Address of reference */
          Elf64_Xword  r_info;   /* Symbol index and type of relocation */
        } Elf64_Rel;
      
        typedef struct
        {
          Elf64_Addr    r_offset;
          Elf64_Xword   r_info;
          Elf64_Sxword  r_addend;
        } Elf64_Rela;
      
      Whereas N64 has the following format:
      
        typedef struct
        {
          Elf64_Addr    r_offset;/* Address of reference */
          Elf64_Word  r_sym;     /* Symbol index */
          Elf64_Byte  r_ssym;    /* Special symbol */
          Elf64_Byte  r_type3;   /* Relocation type */
          Elf64_Byte  r_type2;   /* Relocation type */
          Elf64_Byte  r_type;    /* Relocation type */
        } Elf64_Rel;
      
        typedef struct
        {
          Elf64_Addr    r_offset;/* Address of reference */
          Elf64_Word  r_sym;     /* Symbol index */
          Elf64_Byte  r_ssym;    /* Special symbol */
          Elf64_Byte  r_type3;   /* Relocation type */
          Elf64_Byte  r_type2;   /* Relocation type */
          Elf64_Byte  r_type;    /* Relocation type */
          Elf64_Sxword  r_addend;
        } Elf64_Rela;
      
      The structure is the same size, but the r_info data element 
      is now 5 separate elements. Besides the content aspects, 
      endian byte reordering will be different for the area with 
      each element being endianized separately.
      
      I treat this as generic and continue to pass r_type as 
      an integer masking and unmasking the byte sized N64 
      values for N64 mode. I've implemented this and it causes no 
      affect on other current targets.
      
      This passes make check.
      
      Jack
      
      llvm-svn: 159299
      8ad0c272
  6. Jun 27, 2012
  7. Jun 26, 2012
    • Manman Ren's avatar
      X86: add GATHER intrinsics (AVX2) in LLVM · a0982041
      Manman Ren authored
      Support the following intrinsics:
      llvm.x86.avx2.gather.d.pd, llvm.x86.avx2.gather.q.pd
      llvm.x86.avx2.gather.d.pd.256, llvm.x86.avx2.gather.q.pd.256
      llvm.x86.avx2.gather.d.ps, llvm.x86.avx2.gather.q.ps
      llvm.x86.avx2.gather.d.ps.256, llvm.x86.avx2.gather.q.ps.256
      
      Modified Disassembler to handle VSIB addressing mode.
      
      llvm-svn: 159221
      a0982041
    • Jack Carter's avatar
      There are a number of generic inline asm operand modifiers that · 5e69cffe
      Jack Carter authored
      up to r158925 were handled as processor specific. Making them 
      generic and putting tests for these modifiers in the CodeGen/Generic
      directory caused a number of targets to fail. 
      
      This commit addresses that problem by having the targets call 
      the generic routine for generic modifiers that they don't currently
      have explicit code for.
      
      For now only generic print operands 'c' and 'n' are supported.vi
      
      
      Affected files:
      
          test/CodeGen/Generic/asm-large-immediate.ll
          lib/Target/PowerPC/PPCAsmPrinter.cpp
          lib/Target/NVPTX/NVPTXAsmPrinter.cpp
          lib/Target/ARM/ARMAsmPrinter.cpp
          lib/Target/XCore/XCoreAsmPrinter.cpp
          lib/Target/X86/X86AsmPrinter.cpp
          lib/Target/Hexagon/HexagonAsmPrinter.cpp
          lib/Target/CellSPU/SPUAsmPrinter.cpp
          lib/Target/Sparc/SparcAsmPrinter.cpp
          lib/Target/MBlaze/MBlazeAsmPrinter.cpp
          lib/Target/Mips/MipsAsmPrinter.cpp
          
      MSP430 isn't represented because it did not even run with
      the long existing 'c' modifier and it was not apparent what
      needs to be done to get it inline asm ready.
      
      Contributer: Jack Carter
      llvm-svn: 159203
      5e69cffe
    • Elena Demikhovsky's avatar
      Removed unused variable · 863d2d32
      Elena Demikhovsky authored
      llvm-svn: 159197
      863d2d32
    • Bill Wendling's avatar
      Rename to match other X86_64* names. · 8ed44466
      Bill Wendling authored
      llvm-svn: 159196
      8ed44466
    • Elena Demikhovsky's avatar
      Shuffle optimization for AVX/AVX2. · 26088d2e
      Elena Demikhovsky authored
      The current patch optimizes frequently used shuffle patterns and gives these instruction sequence reduction.
      Before:
            vshufps $-35, %xmm1, %xmm0, %xmm2 ## xmm2 = xmm0[1,3],xmm1[1,3]
             vpermilps       $-40, %xmm2, %xmm2 ## xmm2 = xmm2[0,2,1,3]
             vextractf128    $1, %ymm1, %xmm1
             vextractf128    $1, %ymm0, %xmm0
             vshufps $-35, %xmm1, %xmm0, %xmm0 ## xmm0 = xmm0[1,3],xmm1[1,3]
             vpermilps       $-40, %xmm0, %xmm0 ## xmm0 = xmm0[0,2,1,3]
             vinsertf128     $1, %xmm0, %ymm2, %ymm0
      After:
            vshufps $13, %ymm0, %ymm1, %ymm1 ## ymm1 = ymm1[1,3],ymm0[0,0],ymm1[5,7],ymm0[4,4]
            vshufps $13, %ymm0, %ymm0, %ymm0 ## ymm0 = ymm0[1,3,0,0,5,7,4,4]
            vunpcklps       %ymm1, %ymm0, %ymm0 ## ymm0 = ymm0[0],ymm1[0],ymm0[1],ymm1[1],ymm0[4],ymm1[4],ymm0[5],ymm1[5]
      
      llvm-svn: 159188
      26088d2e
    • Craig Topper's avatar
      Remove some duplicate instructions that exist only to given different... · 94bf0f38
      Craig Topper authored
      Remove some duplicate instructions that exist only to given different mnemonics for the assembler. Use InstAlias instead.
      
      llvm-svn: 159184
      94bf0f38
    • Eli Friedman's avatar
      Make some ugly hacks for inline asm operands which name a specific register a... · bbcd09cc
      Eli Friedman authored
      Make some ugly hacks for inline asm operands which name a specific register a bit more thorough.  PR13196.
      
      llvm-svn: 159176
      bbcd09cc
  8. Jun 25, 2012
    • Manman Ren's avatar
      ARM: update peephole optimization. · 606953fb
      Manman Ren authored
      More condition codes are included when deciding whether to remove cmp after
      a sub instruction. Specifically, we extend from GE|LT|GT|LE to 
      GE|LT|GT|LE|HS|LS|HI|LO|EQ|NE. If we have "sub a, b; cmp b, a; movhs", we
      should be able to replace with "sub a, b; movls".
      
      rdar: 11725965
      llvm-svn: 159166
      606953fb
Loading