Skip to content
  1. Feb 10, 2017
    • Craig Topper's avatar
      [SelectionDAG] Dump the DAG after legalizing vector ops and after the second type legalization · a9f11218
      Craig Topper authored
      Summary:
      With -debug, we aren't dumping the DAG after legalizing vector ops. In particular, on X86 with AVX1 only, we don't dump the DAG after we split 256-bit integer ops into pairs of 128-bit ADDs since this occurs during vector legalization.
      
      I'm only dumping if the legalize vector ops changes something since we don't print anything during legalize vector ops. So this dump shows up right after the first type-legalization dump happens. So if nothing changed this second dump is unnecessary.
      
      Having said that though, I think we should probably fix legalize vector ops to log what its doing.
      
      Reviewers: RKSimon, eli.friedman, spatel, arsenm, chandlerc
      
      Reviewed By: RKSimon
      
      Subscribers: wdng, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D29554
      
      llvm-svn: 294711
      a9f11218
    • Eric Christopher's avatar
      Temporarily revert "For X86-64 linux and PPC64 linux align int128 to 16 bytes." · 0824096c
      Eric Christopher authored
      until we can get better TargetMachine::isCompatibleDataLayout to compare - otherwise
      we can't code generate existing bitcode without a string equality data layout.
      
      This reverts commit r294702.
      
      llvm-svn: 294709
      0824096c
    • Eric Christopher's avatar
      For X86-64 linux and PPC64 linux align int128 to 16 bytes. · 42b92488
      Eric Christopher authored
      For other platforms we should find out what they need and likely
      make the same change, however, a smaller additional change is easier
      for platforms we know have it specified	in the ABI. As part of this
      rewrite some of the handling in the backends for data layout and update
      a bunch of testcases.
      
      Based on a patch by Simonas Kazlauskas!
      
      llvm-svn: 294702
      42b92488
    • Matt Arsenault's avatar
      AMDGPU: Fix trailing whitespace · b4493e90
      Matt Arsenault authored
      llvm-svn: 294694
      b4493e90
    • Wei Ding's avatar
      AMDGPU : Add trap handler support. · 205bfdb3
      Wei Ding authored
      Differential Revision: http://reviews.llvm.org/D26010
      
      llvm-svn: 294692
      205bfdb3
    • Stanislav Mekhanoshin's avatar
      [AMDGPU] Override PSet for M0 · 6dec2431
      Stanislav Mekhanoshin authored
      This change returns empty PSet list for M0 register. Otherwise its
      PSet as defined by tablegen is SReg_32. This results in incorrect
      register pressure calculation every time an instruction uses M0.
      Such uses count as SReg_32 PSet and inadequately increase pressure
      on SGPRs.
      
      Differential Revision: https://reviews.llvm.org/D29798
      
      llvm-svn: 294691
      6dec2431
    • Eric Fiselier's avatar
      [CMake] Fix pthread handling for out-of-tree builds · 87c87f4c
      Eric Fiselier authored
      LLVM defines `PTHREAD_LIB` which is used by AddLLVM.cmake and various projects
      to correctly link the threading library when needed. Unfortunately
      `PTHREAD_LIB` is defined by LLVM's `config-ix.cmake` file which isn't installed
      and therefore can't be used when configuring out-of-tree builds. This causes
      such builds to fail since `pthread` isn't being correctly linked.
      
      This patch attempts to fix that problem by renaming and exporting
      `LLVM_PTHREAD_LIB` as part of`LLVMConfig.cmake`. I renamed `PTHREAD_LIB`
      because It seemed likely to cause collisions with downstream users of
      `LLVMConfig.cmake`.
      
      llvm-svn: 294690
      87c87f4c
    • Marcos Pividori's avatar
      [libFuzzer] Export external functions on tests. · a0b23b8e
      Marcos Pividori authored
      We need to export external functions so they are found when calling
      GetProcAddress() on Windows. But we can't use `__declspec(dllexport)` because
      we want the targets to be completely independent from the fuzz engines and don't
      depend on other header files. Also, we don't want to include platform specific
      code managed with conditional macros.
      So, the solution is to add the exported symbols with linker flags in cmake.
      
      Differential revision: https://reviews.llvm.org/D29752
      
      llvm-svn: 294688
      a0b23b8e
    • Marcos Pividori's avatar
      [libFuzzer] Use dynamic loading for External Functions on Windows. · 0ae27e80
      Marcos Pividori authored
      Replace weak aliases with dynamic loading.
      Weak aliases were generating some problems when linking for MT on Windows. For
      MT, compiler-rt's libraries are statically linked to the main executable the
      same than libFuzzer, so if we use weak aliases, we are providing two different
      default implementations for the same weak function and the linker fails.
      
      In this diff I re implement ExternalFunctions() using dynamic loading, so it
      works in both cases (MD and MT). Also, dynamic loading is simpler, since we are
      not defining any auxiliary external function, and we don't need to deal with
      weak aliases.
      This is equivalent to the implementation using dlsym(RTLD_DEFAULT, FnName) for
      Posix.
      
      Differential revision: https://reviews.llvm.org/D29751
      
      llvm-svn: 294687
      0ae27e80
    • Eugene Zelenko's avatar
    • Dan Gohman's avatar
      [WebAssembly] Pass an MCContext to WebAssemblyMCCodeEmitter. NFC. · df4f4d45
      Dan Gohman authored
      llvm-svn: 294679
      df4f4d45
    • Matthias Braun's avatar
      Fix syntax error · 2bef2a08
      Matthias Braun authored
      llvm-svn: 294678
      2bef2a08
    • Matthias Braun's avatar
      ARMSubtarget.h: Change to one line per enum element; NFC · 62e1e853
      Matthias Braun authored
      Change syntax to have enum elements sorted alphabetically and one per
      line as that is more merge/cherry pick friendly.
      
      llvm-svn: 294677
      62e1e853
    • Eric Christopher's avatar
      e4b10f5d
    • Chandler Carruth's avatar
      [PM] Add Argument Promotion to the pass pipeline. · 0ede22e1
      Chandler Carruth authored
      This needs explicit requires of the optimization remark emission before
      loop pass pipelines containing LICM as we no longer get it from the
      inliner -- Argument Promotion may invalidate it. Technically the inliner
      could also have broken this, but it never came up in testing.
      
      Differential Revision: https://reviews.llvm.org/D29595
      
      llvm-svn: 294670
      0ede22e1
    • Chandler Carruth's avatar
      [PM] Port ArgumentPromotion to the new pass manager. · addcda48
      Chandler Carruth authored
      Now that the call graph supports efficient replacement of a function and
      spurious reference edges, we can port ArgumentPromotion to the new pass
      manager very easily.
      
      The old PM-specific bits are sunk into callbacks that the new PM simply
      doesn't use. Unlike the old PM, the new PM simply does argument
      promotion and afterward does the update to LCG reflecting the promoted
      function.
      
      Differential Revision: https://reviews.llvm.org/D29580
      
      llvm-svn: 294667
      addcda48
    • Peter Collingbourne's avatar
      WholeProgramDevirt: Check that VCP candidate functions are defined before evaluating them. · 17febdbb
      Peter Collingbourne authored
      This was crashing before.
      
      llvm-svn: 294666
      17febdbb
    • Chandler Carruth's avatar
      [PM/LCG] Teach LCG to support spurious reference edges. · 1f8fcfea
      Chandler Carruth authored
      Somewhat amazingly, this only requires teaching it to clean them up when
      deleting a dead function from the graph. And we already have exactly the
      necessary data structures to do that in the parent RefSCCs.
      
      This allows ArgPromote to work in a much simpler way be merely letting
      reference edges linger in the graph after the causing IR is deleted. We
      will clean up these edges when we run any function pass over the IR, but
      don't remove them eagerly.
      
      This avoids all of the quadratic update issues both in the current pass
      manager and in my previous attempt with the new pass manager.
      
      Differential Revision: https://reviews.llvm.org/D29579
      
      llvm-svn: 294663
      1f8fcfea
    • George Burgess IV's avatar
      [ARM] Add support for armv7ve triple in llvm (PR31358). · ccf11c2f
      George Burgess IV authored
      Gcc supports target armv7ve which is armv7-a with virtualization
      extensions. This change adds support for this in llvm for gcc
      compatibility.
      
      Also remove redundant FeatureHWDiv, FeatureHWDivARM for a few models as
      this is specified automatically by FeatureVirtualization.
      
      Patch by Manoj Gupta.
      
      Differential Revision: https://reviews.llvm.org/D29472
      
      llvm-svn: 294661
      ccf11c2f
    • Chandler Carruth's avatar
      [PM/LCG] Teach the LazyCallGraph how to replace a function without · aaad9f84
      Chandler Carruth authored
      disturbing the graph or having to update edges.
      
      This is motivated by porting argument promotion to the new pass manager.
      Because of how LLVM IR Function objects work, in order to change their
      signature a new object needs to be created. This is efficient and
      straight forward in the IR but previously was very hard to implement in
      LCG. We could easily replace the function a node in the graph
      represents. The challenging part is how to handle updating the edges in
      the graph.
      
      LCG previously used an edge to a raw function to represent a node that
      had not yet been scanned for calls and references. This was the core
      of its laziness. However, that model causes this kind of update to be
      very hard:
      1) The keys to lookup an edge need to be `Function*`s that would all
         need to be updated when we update the node.
      2) There will be some unknown number of edges that haven't transitioned
         from `Function*` edges to `Node*` edges.
      
      All of this complexity isn't necessary. Instead, we can always build
      a node around any function, always pointing edges at it and always using
      it as the key to lookup an edge. To maintain the laziness, we need to
      sink the *edges* of a node into a secondary object and explicitly model
      transitioning a node from empty to populated by scanning the function.
      This design seems much cleaner in a number of ways, but importantly
      there is now exactly *one* place where the `Function*` has to be
      updated!
      
      Some other cleanups that fall out of this include having something to
      model the *entry* edges more accurately. Rather than hand rolling parts
      of the node in the graph itself, we have an explicit `EdgeSequence`
      object that gives us exactly the functionality needed. We also have
      a consistent place to define the edge iterators and can use them for
      both the entry edges and the internal edges of the graph.
      
      The API used to model the separation between a node and its edges is
      intentionally very thin as most clients are expected to deal with nodes
      that have populated edges. We model this exactly as an optional does
      with an additional method to populate the edges when that is
      a reasonable thing for a client to do. This is based on API design
      suggestions from Richard Smith and David Blaikie, credit goes to them
      for helping pick how to model this without it being either too explicit
      or too implicit.
      
      The patch is somewhat noisy due to shifting around iterator types and
      new syntax for walking the edges of a node, but most of the
      functionality change is in the `Edge`, `EdgeSequence`, and `Node` types.
      
      Differential Revision: https://reviews.llvm.org/D29577
      
      llvm-svn: 294653
      aaad9f84
    • Dan Gohman's avatar
      b6afd207
    • Sanjay Patel's avatar
      [InstCombine] allow (X * C2) << C1 --> X * (C2 << C1) for vectors · f38bab73
      Sanjay Patel authored
      This fold already existed for vectors but only when 'C1' was a splat
      constant (but 'C2' could be any constant). 
      
      There were no tests for any vector constants, so I'm adding a test
      that shows non-splat constants for both operands.  
      
      llvm-svn: 294650
      f38bab73
    • Peter Collingbourne's avatar
      De-duplicate some code for creating an AARGetter suitable for the legacy PM. · cea1e4e7
      Peter Collingbourne authored
      I'm about to use this in a couple more places.
      
      Differential Revision: https://reviews.llvm.org/D29793
      
      llvm-svn: 294648
      cea1e4e7
  2. Feb 09, 2017
Loading