Skip to content
  1. Oct 15, 2012
  2. Oct 14, 2012
  3. Oct 13, 2012
  4. Oct 12, 2012
  5. Oct 11, 2012
  6. Oct 10, 2012
    • Nadav Rotem's avatar
      Patch by Shuxin Yang <shuxin.llvm@gmail.com>. · 17418964
      Nadav Rotem authored
      Original message:
      
      The attached is the fix to radar://11663049. The optimization can be outlined by following rules:
      
         (select (x != c), e, c) -> select (x != c), e, x),
         (select (x == c), c, e) -> select (x == c), x, e)
      where the <c> is an integer constant.
      
       The reason for this change is that : on x86, conditional-move-from-constant needs two instructions;
      however, conditional-move-from-register need only one instruction.
      
        While the LowerSELECT() sounds to be the most convenient place for this optimization, it turns out to be a bad place. The reason is that by replacing the constant <c> with a symbolic value, it obscure some instruction-combining opportunities which would otherwise be very easy to spot. For that reason, I have to postpone the change to last instruction-combining phase.
      
        The change passes the test of "make check-all -C <build-root/test" and "make -C project/test-suite/SingleSource".
      
      llvm-svn: 165661
      17418964
    • Bill Schmidt's avatar
      When generating spill and reload code for vector registers on PowerPC, · b9bc4740
      Bill Schmidt authored
      the compiler makes use of GPR0.  However, there are two flavors of
      GPR0 defined by the target:  the 32-bit GPR0 (R0) and the 64-bit GPR0
      (X0).  The spill/reload code makes use of R0 regardless of whether we
      are generating 32- or 64-bit code.
      
      This patch corrects the problem in the obvious manner, using X0 and
      ADDI8 for 64-bit and R0 and ADDI for 32-bit.
      
      llvm-svn: 165658
      b9bc4740
    • Bill Schmidt's avatar
      The PowerPC VRSAVE register has been somewhat of an odd beast since · 38d94587
      Bill Schmidt authored
      the Altivec extensions were introduced.  Its use is optional, and
      allows the compiler to communicate to the operating system which
      vector registers should be saved and restored during a context switch.
      In practice, this information is ignored by the various operating
      systems using the SVR4 ABI; the kernel saves and restores the entire
      register state.  Setting the VRSAVE register is no longer performed by
      the AIX XL compilers, the IBM i compilers, or by GCC on Power Linux
      systems.  It seems best to avoid this logic within LLVM as well.
      
      This patch avoids generating code to update and restore VRSAVE for the
      PowerPC SVR4 ABIs (32- and 64-bit).  The code remains in place for the
      Darwin ABI.
      
      llvm-svn: 165656
      38d94587
    • Michael Liao's avatar
      Add support for FP_ROUND from v2f64 to v2f32 · e999b865
      Michael Liao authored
      - Due to the current matching vector elements constraints in
        ISD::FP_ROUND, rounding from v2f64 to v4f32 (after legalization from
        v2f32) is scalarized. Add a customized v2f32 widening to convert it
        into a target-specific X86ISD::VFPROUND to work around this
        constraints.
      
      llvm-svn: 165631
      e999b865
    • Michael Liao's avatar
      Add alternative support for FP_ROUND from v2f32 to v2f64 · effae0c8
      Michael Liao authored
      - Due to the current matching vector elements constraints in ISD::FP_EXTEND,
        rounding from v2f32 to v2f64 is scalarized. Add a customized v2f32 widening
        to convert it into a target-specific X86ISD::VFPEXT to work around this
        constraints. This patch also reverts a previous attempt to fix this issue by
        recovering the scalarized ISD::FP_EXTEND pattern and thus significantly
        reduces the overhead of supporting non-power-2 vector FP extend.
      
      llvm-svn: 165625
      effae0c8
    • Stepan Dyatkovskiy's avatar
      Fix for LDRB instruction: · 283baa00
      Stepan Dyatkovskiy authored
      SDNode for LDRB_POST_IMM is invalid: number of registers added to SDNode fewer
      that described in .td.
      
      7 ops is needed, but SDNode with only 6 is created.
      
      In more details:
      In ARMInstrInfo.td, in multiclass AI2_ldridx, in definition _POST_IMM, offset
      operand is defined as am2offset_imm. am2offset_imm is complex parameter type,
      and actually it consists from dummy register and imm itself. As I understood
      trick with dummy reg was made for AsmParser. In ARMISelLowering.cpp, this dummy
      register was not added to SDNode, and it cause crash in Peephole Optimizer pass.
      
      The problem fixed by setting up additional dummy reg when emitting
      LDRB_POST_IMM instruction.
      
      llvm-svn: 165617
      283baa00
    • Stepan Dyatkovskiy's avatar
      Issue description: · f13dbb8e
      Stepan Dyatkovskiy authored
      SchedulerDAGInstrs::buildSchedGraph ignores dependencies between FixedStack
      objects and byval parameters. So loading byval parameters from stack may be
      inserted *before* it will be stored, since these operations are treated as
      independent.
      
      Fix:
      Currently ARMTargetLowering::LowerFormalArguments saves byval registers with
      FixedStack MachinePointerInfo. To fix the problem we need to store byval
      registers with MachinePointerInfo referenced to first the "byval" parameter.
      
      Also commit adds two new fields to the InputArg structure: Function's argument
      index and InputArg's part offset in bytes relative to the start position of
      Function's argument. E.g.: If function's argument is 128 bit width and it was
      splitted onto 32 bit regs, then we got 4 InputArg structs with same arg index,
      but different offset values. 
      
      llvm-svn: 165616
      f13dbb8e
    • Bill Wendling's avatar
      Remove the final bits of Attributes being declared in the Attribute · bbcdf4e2
      Bill Wendling authored
      namespace. Use the attribute's enum value instead. No functionality change
      intended.
      
      llvm-svn: 165610
      bbcdf4e2
    • Andrew Trick's avatar
      misched: Use the TargetSchedModel interface wherever possible. · dd79f0fc
      Andrew Trick authored
      Allows the new machine model to be used for NumMicroOps and OutputLatency.
      
      Allows the HazardRecognizer to be disabled along with itineraries.
      
      llvm-svn: 165603
      dd79f0fc
    • Andrew Trick's avatar
      whitespace · d9296ec2
      Andrew Trick authored
      llvm-svn: 165601
      d9296ec2
Loading