Skip to content
  1. Dec 03, 2014
    • Charlie Turner's avatar
      Emit ABI_FP_rounding attribute. · f02c9248
      Charlie Turner authored
      LLVM understands a -enable-sign-dependent-rounding-fp-math codegen option. When
      the user has specified this option, the Tag_ABI_FP_rounding attribute should be
      emitted with value 1. This option currently does not appear to disable
      transformations and optimizations that assume default floating point rounding
      behavior, AFAICT, but the intention should be recorded in the build attributes,
      regardless of what the compiler actually does with the intention.
      
      Change-Id: If838578df3dc652b6f2796b8d152545674bcb30e
      llvm-svn: 223218
      f02c9248
    • Charlie Turner's avatar
      Add tests for default value of Tag_ABI_FP_rounding. · 1620a69f
      Charlie Turner authored
      Change-Id: I051866d073fc6ce87ce3e693a3762da6d81f4393
      llvm-svn: 223217
      1620a69f
    • Benjamin Poulain's avatar
      Fix a typo in the documentation of LTO · 78e1380a
      Benjamin Poulain authored
      Fix defininitions->definitions.
      
      Reviewed by David Blaikie.
      
      llvm-svn: 223216
      78e1380a
    • Rafael Espindola's avatar
      Ask the module for its the identified types. · 2fa1e43a
      Rafael Espindola authored
      When lazy reading a module, the types used in a function will not be visible to
      a TypeFinder until the body is read.
      
      This patch fixes that by asking the module for its identified struct types.
      If a materializer is present, the module asks it. If not, it uses a TypeFinder.
      
      This fixes pr21374.
      
      I will be the first to say that this is ugly, but it was the best I could find.
      
      Some of the options I looked at:
      
      * Asking the LLVMContext. This could be made to work for gold, but not currently
        for ld64. ld64 will load multiple modules into a single context before merging
        them. This causes us to see types from future merges. Unfortunately,
        MappedTypes is not just a cache when it comes to opaque types. Once the
        mapping has been made, we have to remember it for as long as the key may
        be used. This would mean moving MappedTypes to the Linker class and having
        to drop the Linker::LinkModules static methods, which are visible from C.
      
      * Adding an option to ignore function bodies in the TypeFinder. This would
        fix the PR by picking the worst result. It would work, but unfortunately
        we are currently quite dependent on the upfront type merging. I will
        try to reduce our dependency, but it is not clear that we will be able
        to get rid of it for now.
      
      The only clean solution I could think of is making the Module own the types.
      This would have other advantages, but it is a much bigger change. I will
      propose it, but it is nice to have this fixed while that is discussed.
      
      With the gold plugin, this patch takes the number of types in the LTO clang
      binary from 52817 to 49669.
      
      llvm-svn: 223215
      2fa1e43a
    • Duncan P. N. Exon Smith's avatar
      ADT: Rename argument in emplace_back_impl · 74864449
      Duncan P. N. Exon Smith authored
      Rename a functor argument in r223201 from `emplace` to `construct` to
      reduce confusion.
      
      llvm-svn: 223212
      74864449
    • Nick Lewycky's avatar
      Revert r222997. The newly added compile-time checks are finding missing... · a4acb449
      Nick Lewycky authored
      Revert r222997. The newly added compile-time checks are finding missing origins, testcase is being reduced and a PR will be posted shortly.
      
      llvm-svn: 223211
      a4acb449
    • Duncan P. N. Exon Smith's avatar
      LoopVectorize: Remove unnecessary RAUW · a48bd07e
      Duncan P. N. Exon Smith authored
      Remove an unnecessary `MDNode::replaceAllUsesWith()`.  In the preceding
      line, `TheLoop->setLoopID()` visits all backedges and sets the new loop
      ID.  This sufficiently updates the loop metadata.
      
      Metadata RAUW is going away as part of PR21532.
      
      llvm-svn: 223210
      a48bd07e
    • Matt Arsenault's avatar
      R600/SI: Fix SIFixSGPRCopies for copies to physical registers · 120a0c92
      Matt Arsenault authored
      This shows up when operands required to be passed in VCC are copied
      to.
      
      llvm-svn: 223208
      120a0c92
    • Matt Arsenault's avatar
      R600/SI: Remove incorrect assertion · 88652a00
      Matt Arsenault authored
      This can be a COPY to a physical register, such as VCC
      
      llvm-svn: 223207
      88652a00
    • Matt Arsenault's avatar
      R600/SI: Remove i1 pseudo VALU ops · becd656c
      Matt Arsenault authored
      Select i1 logical ops directly to 64-bit SALU instructions.
      Vector i1 values are always really in SGPRs, with each
      bit for each item in the wave. This saves about 4 instructions
      when and/or/xoring any condition, and also helps write conditions
      that need to be passed in vcc.
      
      This should work correctly now that the SGPR live range
      fixing pass works. More work is needed to eliminate the VReg_1
      pseudo regclass and possibly the entire SILowerI1Copies pass.
      
      llvm-svn: 223206
      becd656c
    • Matt Arsenault's avatar
      R600/SI: Fix suspicious indexing · 2f470c62
      Matt Arsenault authored
      The loop is over the operands of an instruction, and checks the
      register with the sub reg index of the dest register. This probably
      meant to be checking the sub reg index of the same operand.
      
      llvm-svn: 223205
      2f470c62
    • Matt Arsenault's avatar
      R600/SI: Fix running SILowerI1Copies a second time · 691ae3d6
      Matt Arsenault authored
      llvm-svn: 223204
      691ae3d6
    • Matt Arsenault's avatar
      R600/SI: Fix live range error hidden by SIFoldOperands · 0d2832ae
      Matt Arsenault authored
      m0 is treated as a virtual register class with a single register
      rather than the physical register it really is. This was updating
      the live range of the used virtual copy of m0 from the first ds_read
      instruction, and leaving the unused copy unchanged. This resulted in a
      "Live segment doesn't end at a valid instruction" verifier error because
      the erased instructions. Update the live range of the second copy (which
      should be dead).
      
      No test since I'm not sure how to trigger this with SIFoldOperands
      enabled.
      
      llvm-svn: 223203
      0d2832ae
    • Duncan P. N. Exon Smith's avatar
      ADT: Add SmallVector<>::emplace_back(): fixup · 78116d7a
      Duncan P. N. Exon Smith authored
      Add missing `void` return type from `!LLVM_HAS_VARIADIC_TEMPLATES` case
      in r223201.
      
      llvm-svn: 223202
      78116d7a
    • Duncan P. N. Exon Smith's avatar
      ADT: Add SmallVector<>::emplace_back() · f2396e65
      Duncan P. N. Exon Smith authored
      llvm-svn: 223201
      f2396e65
    • Tom Stellard's avatar
      StructurizeCFG: Use LoopInfo analysis for better loop detection · 1f0dded0
      Tom Stellard authored
      We were assuming that each back-edge in a region represented a unique
      loop, which is not always the case.  We need to use LoopInfo to
      correctly determine which back-edges are loops.
      
      llvm-svn: 223199
      1f0dded0
    • Duncan P. N. Exon Smith's avatar
      NVPTX: Delete dead code · c280ff0e
      Duncan P. N. Exon Smith authored
      `MDNode` does not inherit from `User`, and it never has a name.
      
      llvm-svn: 223198
      c280ff0e
    • Tom Stellard's avatar
      R600/SI: Enable inline assembly · 36930806
      Tom Stellard authored
      We just needed to remove the assertion in
      AMDGPURegisterInfo::getFrameRegister(), which is called when
      initializing the parser for inline assembly.
      
      llvm-svn: 223197
      36930806
    • Peter Zotov's avatar
      [OCaml] [cmake] Disable OCaml bindings if ctypes >=0.3 is not found. · fcefcf96
      Peter Zotov authored
      llvm-svn: 223195
      fcefcf96
    • Matt Arsenault's avatar
      R600/SI: Change mubuf offsets to print as decimal · fb13b22d
      Matt Arsenault authored
      This matches SC's behavior.
      
      llvm-svn: 223194
      fb13b22d
    • Nick Lewycky's avatar
      Emit the entry block first and the exit block second, then all the blocks in... · 2e8a6219
      Nick Lewycky authored
      Emit the entry block first and the exit block second, then all the blocks in between afterwards. This is what gcc always does, and some out of tree tools depend on that.
      
      llvm-svn: 223193
      2e8a6219
    • NAKAMURA Takumi's avatar
      GCRelocateOperands: Try to appease msc17. · 0a64776c
      NAKAMURA Takumi authored
      llvm-svn: 223192
      0a64776c
    • Peter Collingbourne's avatar
      Prologue support · 51d2de7b
      Peter Collingbourne authored
      Patch by Ben Gamari!
      
      This redefines the `prefix` attribute introduced previously and
      introduces a `prologue` attribute.  There are a two primary usecases
      that these attributes aim to serve,
      
        1. Function prologue sigils
      
        2. Function hot-patching: Enable the user to insert `nop` operations
           at the beginning of the function which can later be safely replaced
           with a call to some instrumentation facility
      
        3. Runtime metadata: Allow a compiler to insert data for use by the
           runtime during execution. GHC is one example of a compiler that
           needs this functionality for its tables-next-to-code functionality.
      
      Previously `prefix` served cases (1) and (2) quite well by allowing the user
      to introduce arbitrary data at the entrypoint but before the function
      body. Case (3), however, was poorly handled by this approach as it
      required that prefix data was valid executable code.
      
      Here we redefine the notion of prefix data to instead be data which
      occurs immediately before the function entrypoint (i.e. the symbol
      address). Since prefix data now occurs before the function entrypoint,
      there is no need for the data to be valid code.
      
      The previous notion of prefix data now goes under the name "prologue
      data" to emphasize its duality with the function epilogue.
      
      The intention here is to handle cases (1) and (2) with prologue data and
      case (3) with prefix data.
      
      References
      ----------
      
      This idea arose out of discussions[1] with Reid Kleckner in response to a
      proposal to introduce the notion of symbol offsets to enable handling of
      case (3).
      
      [1] http://lists.cs.uiuc.edu/pipermail/llvmdev/2014-May/073235.html
      
      Test Plan: testsuite
      
      Differential Revision: http://reviews.llvm.org/D6454
      
      llvm-svn: 223189
      51d2de7b
    • NAKAMURA Takumi's avatar
      4c71cc1d
    • Ahmed Bougacha's avatar
      [X86][MC] Intel syntax: accept implicit memory operand sizes larger than 80. · d65f787a
      Ahmed Bougacha authored
      The X86AsmParser intel handling was refactored in r216481, making it
      try each different memory operand size to see which one matches.
      Operand sizes larger than 80 ("[xyz]mmword ptr") were forgotten, which
      led to an "invalid operand" error for code such as:
        movdqa [rax], xmm0
      
      llvm-svn: 223187
      d65f787a
    • Lang Hames's avatar
      [MCJIT] Unique-ptrify the RTDyldMemoryManager member of MCJIT. NFC. · 4a5697e6
      Lang Hames authored
      llvm-svn: 223183
      4a5697e6
    • Hal Finkel's avatar
      [PowerPC] Fix readcyclecounter to be custom expanded for all 32-bit targets · 01fa7701
      Hal Finkel authored
      We need to use the custom expansion of readcyclecounter on all 32-bit targets
      (even those with 64-bit registers). This should fix the ppc64 buildbot.
      
      llvm-svn: 223182
      01fa7701
    • Tim Northover's avatar
      AArch64: strengthen Darwin ABI alignment assumptions · 4a8ac260
      Tim Northover authored
      A global variable without an explicit alignment specified should be assumed to
      be ABI-aligned according to its type, like on other platforms. This allows us
      to use better memory operations when accessing it.
      
      rdar://18533701
      
      llvm-svn: 223180
      4a8ac260
    • Pete Cooper's avatar
      Use a typed enum instead of 'unsigned char' for packed field. NFC. · 521b5a8a
      Pete Cooper authored
      This makes it easier to debug Twine as the 'Kind' fields now show their enum values in lldb and not escaped characters.
      
      llvm-svn: 223178
      521b5a8a
    • Tim Northover's avatar
      AArch64: don't be too greedy when folding :lo12: accesses into mem ops. · ec7ebebe
      Tim Northover authored
      This frequently leads to cases like:
         ldr xD, [xN, :lo12:var]
         add xA, xN, :lo12:var
         ldr xD, [xA, #8]
      
      where the ADD would have been needed anyway, and the two distinct addressing
      modes can prevent the formation of an ldp. Because of how we handle ADRP
      (aggressively forming an ADRP/ADD pseudo-inst at ISel time), this pattern also
      results in duplicated ADRP instructions (one on its own to cover the ldr, and
      one combined with the add).
      
      llvm-svn: 223172
      ec7ebebe
  2. Dec 02, 2014
Loading