Skip to content
  1. Aug 30, 2014
  2. Aug 29, 2014
    • Robin Morisset's avatar
      Fix typos in comments, NFC · 039781ef
      Robin Morisset authored
      Summary: Just fixing comments, no functional change.
      
      Test Plan: N/A
      
      Reviewers: jfb
      
      Subscribers: mcrosier, llvm-commits
      
      Differential Revision: http://reviews.llvm.org/D5130
      
      llvm-svn: 216784
      039781ef
    • Reid Kleckner's avatar
      Add a const and munge some comments · dccd0cbe
      Reid Kleckner authored
      llvm-svn: 216781
      dccd0cbe
    • Reid Kleckner's avatar
      musttail: Forward regparms of variadic functions on x86_64 · 16e55412
      Reid Kleckner authored
      Summary:
      If a variadic function body contains a musttail call, then we copy all
      of the remaining register parameters into virtual registers in the
      function prologue. We track the virtual registers through the function
      body, and add them as additional registers to pass to the call. Because
      this is all done in virtual registers, the register allocator usually
      gives us good code. If the function does a call, however, it will have
      to spill and reload all argument registers (ew).
      
      Forwarding regparms on x86_32 is not implemented because most compilers
      don't support varargs in 32-bit with regparms.
      
      Reviewers: majnemer
      
      Subscribers: aemerson, llvm-commits
      
      Differential Revision: http://reviews.llvm.org/D5060
      
      llvm-svn: 216780
      16e55412
    • Reid Kleckner's avatar
      Verifier: Don't reject varargs callee cleanup functions · 329d4a2b
      Reid Kleckner authored
      We've rejected these kinds of functions since r28405 in 2006 because
      it's impossible to lower the return of a callee cleanup varargs
      function. However there are lots of legal ways to leave such a function
      without returning, such as aborting. Today we can leave a function with
      a musttail call to another function with the correct prototype, and
      everything works out.
      
      I'm removing the verifier check declaring that a normal return from such
      a function is UB.
      
      Reviewed By: nlewycky
      
      Differential Revision: http://reviews.llvm.org/D5059
      
      llvm-svn: 216779
      329d4a2b
    • Louis Gerbarg's avatar
      Remove spurious mask operations from AArch64 add->compares on 16 and 8 bit values · 03c627e8
      Louis Gerbarg authored
      This patch checks for DAG patterns that are an add or a sub followed by a
      compare on 16 and 8 bit inputs. Since AArch64 does not support those types
      natively they are legalized into 32 bit values, which means that mask operations
      are inserted into the DAG to emulate overflow behaviour. In many cases those
      masks do not change the result of the processing and just introduce a dependent
      operation, often in the middle of a hot loop.
      
      This patch detects the relevent DAG patterns and then tests to see if the
      transforms are equivalent with and without the mask, removing the mask if
      possible. The exact mechanism of this patch was discusses in
      http://lists.cs.uiuc.edu/pipermail/llvmdev/2014-July/074444.html
      
      There is a reasonably good chance there are missed oppurtunities due to similiar
      (but not identical) DAG patterns that could be funneled into this test, adding
      them should be simple if we see test cases.
      
      Tests included.
      
      rdar://13754426
      
      llvm-svn: 216776
      03c627e8
    • Reid Kleckner's avatar
      X86: Fix conflict over ESI between base register and rep;movsl · ab99e24e
      Reid Kleckner authored
      The new solution is to not use this lowering if there are any dynamic
      allocas in the current function. We know up front if there are dynamic
      allocas, but we don't know if we'll need to create stack temporaries
      with large alignment during lowering. Conservatively assume that we will
      need such temporaries.
      
      Reviewed By: hans
      
      Differential Revision: http://reviews.llvm.org/D5128
      
      llvm-svn: 216775
      ab99e24e
    • Robin Morisset's avatar
      Relax the constraint more in MemoryDependencyAnalysis.cpp · 163ef040
      Robin Morisset authored
      Even loads/stores that have a stronger ordering than monotonic can be safe.
      The rule is no release-acquire pair on the path from the QueryInst, assuming that
      the QueryInst is not atomic itself.
      
      llvm-svn: 216771
      163ef040
    • Robin Morisset's avatar
      [X86] Refactor X86ISelDAGToDAG::SelectAtomicLoadArith - NFC · 5ce0ce44
      Robin Morisset authored
      Summary:
      Mostly renaming the (not very explicit) variables Tmp0, .. Tmp4, and grouping
      related statements together, along with a few lines of comments for the
      surprising parts.
      
      No functional change intended.
      
      Test Plan: make check-all
      
      Reviewers: jfb
      
      Subscribers: llvm-commits
      
      Differential Revision: http://reviews.llvm.org/D5088
      
      llvm-svn: 216768
      5ce0ce44
    • Jean-Luc Duprat's avatar
      Tablegen fixes for new syntax when initializing bits from variables. · 6d7b4561
      Jean-Luc Duprat authored
      Followup to r215086.
      
      llvm-svn: 216757
      6d7b4561
    • Juergen Ributzka's avatar
      [FastISel][AArch64] Fix an incorrect kill flag due to a bug in SelectTrunc. · f6ee7a7c
      Juergen Ributzka authored
      When we select a trunc instruction we don't emit any code if the type is already
      i32 or smaller. This is because the instruction that uses the truncated value
      will deal with it.
      
      This behavior can incorrectly transfer a kill flag, which was meant for the
      result of the truncate, onto the source register.
      
      %2 = trunc i32 %1 to i16
      ... = ... %2                -> ... = ... vreg1 <kill>
      ... = ... %1                   ... = ... vreg1
      
      This commit fixes this by emitting a COPY instruction, so that the result and
      source register are distinct virtual registers.
      
      This fixes rdar://problem/18178188.
      
      llvm-svn: 216750
      f6ee7a7c
    • Hal Finkel's avatar
      Fix a typo in AddAliasScopeMetadata · 2d3d6da4
      Hal Finkel authored
      llvm-svn: 216741
      2d3d6da4
    • Matt Arsenault's avatar
      Make fabs safe to speculatively execute · 85cbc7e3
      Matt Arsenault authored
      llvm-svn: 216736
      85cbc7e3
    • Matt Arsenault's avatar
      R600/SI: Use mad for fsub + fmul · 8675db15
      Matt Arsenault authored
      We can use a negate source modifier to match
      this for fsub.
      
      llvm-svn: 216735
      8675db15
    • Tim Northover's avatar
      AArch64: only try to get operand of a known node. · 3c0915e8
      Tim Northover authored
      A bug in r216725 meant we tried to discover the type of a SETCC before
      confirming the node actually was a SETCC.
      
      llvm-svn: 216734
      3c0915e8
    • Sanjay Patel's avatar
      typo · a065eb44
      Sanjay Patel authored
      llvm-svn: 216732
      a065eb44
    • Jingyue Wu's avatar
      [NVPTX] Make the alignment an explicit argument to ldu/ldg · cb83a155
      Jingyue Wu authored
      Summary:
      Instead of specifying the alignment as metadata which may be destroyed by
      transformation passes, make the alignment the second argument to ldu/ldg
      intrinsic calls.
      
      Test Plan:
      ldu-ldg.ll
      ldu-i8.ll
      ldu-reg-plus-offset.ll
      
      Reviewers: eliben, meheff, jholewinski
      
      Reviewed By: meheff, jholewinski
      
      Subscribers: jholewinski, llvm-commits
      
      Differential Revision: http://reviews.llvm.org/D5093
      
      llvm-svn: 216731
      cb83a155
    • Tim Northover's avatar
      AArch64: skip select/setcc combine in complex case. · c1c05aeb
      Tim Northover authored
      In an llvm-stress generated test, we were trying to create a v0iN type and
      asserting when that failed. This case could probably be handled by the
      function, but not without added complexity and the situation it arises in is
      sufficiently odd that there's probably no benefit anyway.
      
      Should fix PR20775.
      
      llvm-svn: 216725
      c1c05aeb
    • Arnaud A. de Grandmaison's avatar
      [AArch64] FPLoadBalancing: move ownership of the chain to its current accumulator register · 6afbf2aa
      Arnaud A. de Grandmaison authored
      and forget about the previously used accumulator.
      
      Coming up with a simple testcase is not easy, as this highly depends on
      what the register allocator is doing: this issue showed up while working
      with the PBQP allocator, which produced a different allocation scheme.
      A testcase would need to come up with chain starting in D[0-7], then
      moving to D[8-15], followed by a call to a function whose regmask
      clobbers the starting accumulator in D[0-7], then another use of the chain.
      
      Fixed some formatting, added some invariant checks while there.
      
      llvm-svn: 216721
      6afbf2aa
    • Frederic Riss's avatar
      Use DwarfDebug::attachLowHighPC for the compilation unit DIE. · 5732301a
      Frederic Riss authored
      llvm-svn: 216719
      5732301a
    • Robert Khasanov's avatar
      [SKX] Enable lowering of integer CMP operations. · a651a623
      Robert Khasanov authored
      Added new types to Legalizer.
      Fixed getSetCCResultType function
      Added lowering tests.
      
      Reviewed by Elena Demikhovsky.
      
      llvm-svn: 216717
      a651a623
    • Job Noorman's avatar
      Do not assume the value passed to memset is an i32. · 9b31bd6b
      Job Noorman authored
      The code in SelectionDAG::getMemset for some reason assumes the value passed to
      memset is an i32. This breaks the generated code for targets that only have
      registers smaller than 32 bits because the value might get split into multiple
      registers by the calling convention. See the test for the MSP430 target included
      in the patch for an example.
      
      This patch ensures that nothing is assumed about the type of the value. Instead,
      the type is taken from the selected overload of the llvm.memset intrinsic.
      
      llvm-svn: 216716
      9b31bd6b
Loading