Skip to content
  1. Jul 02, 2015
  2. Jul 01, 2015
    • Pete Cooper's avatar
      Pack MCSymbol::Flags in to the bitfield with other members. NFC. · abcac11d
      Pete Cooper authored
      All file formats only needed 16-bits right now which is enough to fit
      in to the padding with other fields.
      
      This reduces the size of MCSymbol to 24-bytes on a 64-bit system.  The
      layout is now
      
         0 | class llvm::MCSymbol
         0 |   class llvm::PointerIntPair SectionOrFragmentAndHasName
         0 |     intptr_t Value
           |   [sizeof=8, dsize=8, align=8
           |    nvsize=8, nvalign=8]
      
         8 |   unsigned int IsTemporary
         8 |   unsigned int IsRedefinable
         8 |   unsigned int IsUsed
         8 |   _Bool IsRegistered
         8 |   unsigned int IsExternal
         8 |   unsigned int IsPrivateExtern
         8 |   unsigned int Kind
         9 |   unsigned int IsUsedInReloc
         9 |   unsigned int SymbolContents
         9 |   unsigned int CommonAlignLog2
        10 |   uint32_t Flags
        12 |   uint32_t Index
        16 |   union
        16 |     uint64_t Offset
        16 |     uint64_t CommonSize
        16 |     const class llvm::MCExpr * Value
           |   [sizeof=8, dsize=8, align=8
           |    nvsize=8, nvalign=8]
      
           | [sizeof=24, dsize=24, align=8
           |  nvsize=24, nvalign=8]
      
      llvm-svn: 241196
      abcac11d
    • Dan Gohman's avatar
      [WebAssembly] Define separate Target instances for 32-bit and 64-bit. · d82494bb
      Dan Gohman authored
      llvm-svn: 241193
      d82494bb
    • Jingyue Wu's avatar
      [NVPTX] expand extload/truncstore for vectors of floats · a0a56601
      Jingyue Wu authored
      Summary:
      According to PTX ISA:
      
      For convenience, ld, st, and cvt instructions permit source and destination data operands to be wider than the instruction-type size, so that narrow values may be loaded, stored, and converted using regular-width registers. For example, 8-bit or 16-bit values may be held directly in 32-bit or 64-bit registers when being loaded, stored, or converted to other types and sizes. The operand type checking rules are relaxed for bit-size and integer (signed and unsigned) instruction types; floating-point instruction types still require that the operand type-size matches exactly, unless the operand is of bit-size type.
      
      So, the ISA does not support load with extending/store with truncatation for floating numbers. This is reflected in setting the loadext/truncstore actions to expand in the code for floating numbers, but vectors of floating numbers are not taken care of.
      
      As a result, loading a vector of floats followed by a fp_extend may be combined by DAGCombiner to a extload, and the extload may be lowered to NVPTXISD::LoadV2 with extending information. However, NVPTXISD::LoadV2 does not perform extending, and no extending instructions are inserted. Finally, PTX instructions with mismatched types are generated, like
      ld.v2.f32 {%fd3, %fd4}, [%rd2]
      
      This patch adds the correct actions for vectors of floats, so DAGCombiner would not create loads with extending, and correct code is generated.
      
      Patched by Gang Hu. 
      
      Test Plan: Test case attached.
      
      Reviewers: jingyue
      
      Reviewed By: jingyue
      
      Subscribers: llvm-commits, jholewinski
      
      Differential Revision: http://reviews.llvm.org/D10876
      
      llvm-svn: 241191
      a0a56601
    • Pete Cooper's avatar
      Encode MCSymbol alignment as log2(align). · acfd55b0
      Pete Cooper authored
      Given that alignments are always powers of 2, just encode it this way.
      
      This matches how we encode alignment on IR GlobalValue's for example.
      
      This compresses the CommonAlign member down to 5 bits which allows it
      to pack better with the surrounding fields.
      
      Reviewed by Duncan Exon Smith.
      
      llvm-svn: 241189
      acfd55b0
    • Reid Kleckner's avatar
      [WinEH] Use llvm.x86.seh.recoverfp in WinEHPrepare · 6511c8bb
      Reid Kleckner authored
      Don't pattern match for frontend outlined finally calls on non-x64
      platforms. The 32-bit runtime uses a different funclet prototype.  Now,
      the frontend is pre-outlining the finally bodies so that it ends up
      doing most of the heavy lifting for variable capturing. We're just
      outlining the callsite, and adapting the frameaddress(0) call to line up
      the frame pointer recovery.
      
      llvm-svn: 241186
      6511c8bb
    • Jingyue Wu's avatar
      [NVPTX] Move NVPTXPeephole after NVPTXPrologEpilogPass · 77b5b385
      Jingyue Wu authored
      Summary:
      Offset of frame index is calculated by NVPTXPrologEpilogPass. Before
      that the correct offset of stack objects cannot be obtained, which
      leads to wrong offset if there are more than 2 frame objects. This patch
      move NVPTXPeephole after NVPTXPrologEpilogPass. Because the frame index
      is already replaced by %VRFrame in NVPTXPrologEpilogPass, we check
      VRFrame register instead, and try to remove the VRFrame if there
      is no usage after NVPTXPeephole pass.
      
      Patched by Xuetian Weng. 
      
      Test Plan:
      Strengthened test/CodeGen/NVPTX/local-stack-frame.ll to check the
      offset calculation based on SP and SPL.
      
      Reviewers: jholewinski, jingyue
      
      Reviewed By: jingyue
      
      Subscribers: jholewinski, llvm-commits
      
      Differential Revision: http://reviews.llvm.org/D10853
      
      llvm-svn: 241185
      77b5b385
    • Bill Schmidt's avatar
      [PPC64LE] Enable missing lxvdsx optimization, and related swap optimization · ae94f11d
      Bill Schmidt authored
      When adding little-endian vector support for PowerPC last year, I
      inadvertently disabled an optimization that recognizes a load-splat
      idiom and generates the lxvdsx instruction.  This patch moves the
      offending logic so lxvdsx is once again generated.
      
      This pattern is frequently generated by the vectorizer for scalar
      loads of an effective constant.  Previously the lxvdsx instruction was
      wrongly listed as lane-sensitive for the VSX swap optimization (since
      both doublewords are identical, swaps are safe).  This patch fixes
      this as well, so that vectorized code using lxvdsx can now have swaps
      removed from the computation.
      
      There is an existing test (@test50) in test/CodeGen/PowerPC/vsx.ll
      that checks for the missing optimization.  However, vsx.ll was only
      being tested for POWER7 with big-endian code generation.  I've added
      a little-endian RUN statement and expected LE code generation for all
      the tests in vsx.ll to give us a bit better VSX coverage, including
      what's needed for this patch.
      
      llvm-svn: 241183
      ae94f11d
    • Sanjay Patel's avatar
      add a cl::opt override for TargetLoweringBase's JumpIsExpensive · 943829a1
      Sanjay Patel authored
      This patch is not intended to change existing codegen behavior for any target. 
      It just exposes the JumpIsExpensive setting on the command-line to allow for
      easier testing and emergency overrides.
      
      Also, change the existing regression test to use FileCheck, explicitly specify
      the jump-is-expensive option, and use more precise checks.
      
      Differential Revision: http://reviews.llvm.org/D10846
      
      llvm-svn: 241179
      943829a1
    • Jon Roelofs's avatar
    • David Blaikie's avatar
      Revert "[DWARF] Fix debug info generation for function static variables, typedefs, and records" · d51dea67
      David Blaikie authored
      Caused PR24008
      
      This reverts commit 37cb5f1c2db9f42d29f26b215585f56bb64ae4f5.
      
      llvm-svn: 241176
      d51dea67
    • Sanjay Patel's avatar
      fix formatting; NFC · 910d5daa
      Sanjay Patel authored
      llvm-svn: 241175
      910d5daa
    • Sanjay Patel's avatar
      fix typos in comment; NFC · e4d95c6c
      Sanjay Patel authored
      llvm-svn: 241174
      e4d95c6c
    • Matthias Braun's avatar
      LivePhysRegs: Add support to add pristine registers when populating with... · e1cd96bf
      Matthias Braun authored
      LivePhysRegs: Add support to add pristine registers when populating with live-in/live-out registers.
      
      Differential Revision: http://reviews.llvm.org/D10139
      
      llvm-svn: 241172
      e1cd96bf
    • Reid Kleckner's avatar
      [SEH] Don't assert if the parent function lacks a personality · f8063668
      Reid Kleckner authored
      The EH code might have been deleted as unreachable and the personality
      pruned while the filter is still present.  Currently I'm hitting this at
      -O0 due to the clang bug PR24009.
      
      llvm-svn: 241170
      f8063668
    • Benjamin Kramer's avatar
      [AsmPrinter] Hide implementation details · 286d4660
      Benjamin Kramer authored
      NFC.
      
      llvm-svn: 241169
      286d4660
    • Arnaud A. de Grandmaison's avatar
      [AArch64] Implement add/adds/sub/subs/cmp/cmn with negative immediate aliases · 650c5200
      Arnaud A. de Grandmaison authored
      This patch teaches the AsmParser to accept add/adds/sub/subs/cmp/cmn
      with a negative immediate operand and convert them as shown:
      
        add  Rd, Rn, -imm -> sub  Rd, Rn, imm
        sub  Rd, Rn, -imm -> add  Rd, Rn, imm
        adds Rd, Rn, -imm -> subs Rd, Rn, imm
        subs Rd, Rn, -imm -> adds Rd, Rn, imm
        cmp  Rn, -imm     -> cmn  Rn, imm
        cmn  Rn, -imm     -> cmp  Rn, imm
      
      Those instructions are an alternate syntax available to assembly coders,
      and are needed in order to support code already compiling with some other
      assemblers (gas). They are documented in the "ARMv8 Instruction Set
      Overview", in the "Arithmetic (immediate)" section. This makes llvm-mc
      a programmer-friendly assembler !
      
      This also fixes PR20978: "Assembly handling of adding negative numbers
      not as smart as gas".
      
      llvm-svn: 241166
      650c5200
    • Benjamin Kramer's avatar
      [SDAG] Give InstrEmitter hidden visibility · 85b2815a
      Benjamin Kramer authored
      NFC.
      
      llvm-svn: 241165
      85b2815a
    • Benjamin Kramer's avatar
      [CodeGen] Reduce visibility of implementation details · f4c20253
      Benjamin Kramer authored
      NFC.
      
      llvm-svn: 241164
      f4c20253
    • James Y Knight's avatar
      [Sparc] Rearrange SparcInstrInfo, no change. · a8a8c605
      James Y Knight authored
      Move some instructions into order of sections in the spec, as the rest
      already were.
      
      Differential Revision: http://reviews.llvm.org/D9102
      
      llvm-svn: 241163
      a8a8c605
    • Michael Kuperstein's avatar
      Test committed in r241153 is more target-specific than I thought. · 33d99d60
      Michael Kuperstein authored
      Moving the (original, x86-only) test to the X86 directory.
      
      llvm-svn: 241162
      33d99d60
Loading