Skip to content
  1. Apr 18, 2018
    • Lei Huang's avatar
      [Power9]Legalize and emit code for converting (Un)Signed Word to Quad-Precision · 198e6785
      Lei Huang authored
      Legalize and emit code for converting (Un)Signed Word to quad-precision via:
      
      xscvsdqp
      xscvudqp
      
      Differential Revision: https://reviews.llvm.org/D45389
      
      llvm-svn: 330273
      198e6785
    • Alexey Bataev's avatar
      [DEBUG] Initial adaptation of NVPTX target for debug info emission. · 242706b8
      Alexey Bataev authored
      Summary:
      Patch adds initial emission of the debug info for NVPTX target.
      Currently, only .file and .loc directives are emitted, everything else is
      commented out to not break the compilation of Cuda.
      
      Reviewers: echristo, jlebar, tra, jholewinski
      
      Subscribers: mgorny, aprantl, JDevlieghere, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D41827
      
      llvm-svn: 330271
      242706b8
    • Chandler Carruth's avatar
      [x86] Switch EFLAGS copy lowering to use reg-reg form of testing for · ccd3ecb9
      Chandler Carruth authored
      a zero register.
      
      Previously I tried this and saw LLVM unable to transform this to fold
      with memory operands such as spill slot rematerialization. However, it
      clearly works as shown in this patch. We turn these into `cmpb $0,
      <mem>` when useful for folding a memory operand without issue. This form
      has no disadvantage compared to `testb $-1, <mem>`. So overall, this is
      likely no worse and may be slightly smaller in some cases due to the
      `testb %reg, %reg` form.
      
      Differential Revision: https://reviews.llvm.org/D45475
      
      llvm-svn: 330269
      ccd3ecb9
    • Aaron Smith's avatar
      [support] Revert the changes made to Path.inc for the default Windows code page · 02caafd7
      Aaron Smith authored
      Path.inc/widenPath tries to decode the path using both UTF-8 and the default Windows code page.
      This is no longer necessary with the new InitLLVM method which ensures that the command line
      arguemnts are already UTF-8 on Windows.
       
      
      llvm-svn: 330266
      02caafd7
    • Chandler Carruth's avatar
      [x86] Fix PR37100 by teaching the EFLAGS copy lowering to rewrite uses · 1f87618f
      Chandler Carruth authored
      across basic blocks in the limited cases where it is very straight
      forward to do so.
      
      This will also be useful for other places where we do some limited
      EFLAGS propagation across CFG edges and need to handle copy rewrites
      afterward. I think this is rapidly approaching the maximum we can and
      should be doing here. Everything else begins to require either heroic
      analysis to prove how to do PHI insertion manually, or somehow managing
      arbitrary PHI-ing of EFLAGS with general PHI insertion. Neither of these
      seem at all promising so if those cases come up, we'll almost certainly
      need to rewrite the parts of LLVM that produce those patterns.
      
      We do now require dominator trees in order to reliably diagnose patterns
      that would require PHI nodes. This is a bit unfortunate but it seems
      better than the completely mysterious crash we would get otherwise.
      
      Differential Revision: https://reviews.llvm.org/D45673
      
      llvm-svn: 330264
      1f87618f
    • Sanjay Patel's avatar
      [SimplifyLibcalls] Realloc(null, N) -> Malloc(N) · b2ab3f28
      Sanjay Patel authored
      Patch by Dávid Bolvanský!
      
      Differential Revision: https://reviews.llvm.org/D45413
      
      llvm-svn: 330259
      b2ab3f28
    • David Stuttard's avatar
      [AMDGPU] Fix issues for backend divergence tracking · 31f482c2
      David Stuttard authored
      Summary:
      A change to use divergence analysis in the AMDGPU backend was getting formal
      arguments incorrect (not tagged as divergent) unless they were VGPR0, VGPR1 or
      VGPR2
      
      For graphics shaders it is possible to have more than these passed in as VGPR
      
      Modified the checking code to check for any VGPR registers passed in as formal
      arguments.
      
      Also, some intrinsics that are sources of divergence may have been lowered
      during instruction selection and are missed on subsequent calls to
      isSDNodeSourceOfDivergence - added the relevant AMDGPUISD checks as well.
      
      Finally, the FunctionLoweringInfo tracks virtual registers that are live across
      basic block boundaries. This is used to check for divergence of CopyFromRegister
      registers using the DivergenceAnalysis analysis. For multiple blocks the lazily
      evaluated inverted map VirtReg2Value was not cleared when the ValueMap map was.
      
      Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tpr, t-tye, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D45372
      
      Change-Id: I112f3bd6dfe0f62e63ce9b43b893982778e4bee3
      llvm-svn: 330257
      31f482c2
    • Sam Parker's avatar
      [IRCE] Only check for NSW on equality predicates · 3c19051b
      Sam Parker authored
      After investigation discussed in D45439, it would seem that the nsw
      flag restriction is unnecessary in most cases. So the IsInductionVar
      lambda has been removed, the functionality extracted, and now only
      require nsw when using eq/ne predicates.
      
      Differential Revision: https://reviews.llvm.org/D45617
      
      llvm-svn: 330256
      3c19051b
    • Pavel Labath's avatar
      [cmake] Improve pthread_[gs]etname_np detection code · 8f5a456e
      Pavel Labath authored
      Summary:
      Due to some android peculiarities, in some build configurations
      (statically linked executables targeting older releases) we could detect
      the presence of these functions (because they are present in libc.a,
      where check_library_exists searches), but then fail to build because the
      headers did not include the definition.
      
      This attempts to remedy that by upgrading the check_library_exists to
      check_symbol_exists, which will check that the function is declared too.
      
      I am hoping that a more thorough check will make the messy #ifdef we
      have accumulated in the code obsolete, so I optimistically try to remove
      them.
      
      Reviewers: zturner, kparzysz, danalbert
      
      Subscribers: srhines, mgorny, krytarowski, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D45359
      
      llvm-svn: 330251
      8f5a456e
    • Florian Hahn's avatar
      [LoopUnroll] Only peel if a predicate becomes known in the loop body. · ac277588
      Florian Hahn authored
      If a predicate does not become known after peeling, peeling is unlikely
      to be beneficial.
      
      Reviewers: mcrosier, efriedma, mkazantsev, junbuml
      
      Reviewed By: mkazantsev
      
      Differential Revision: https://reviews.llvm.org/D44983
      
      llvm-svn: 330250
      ac277588
    • Pavel Labath's avatar
      [CodeGen/Dwarf] Make debug_names compatible with split-dwarf · 3fb39c79
      Pavel Labath authored
      Summary:
      Previously we crashed for the combination of the two features because we
      tried to reference the dwo CU from the main object file. The fix
      consists of two items:
      - reference the skeleton CU from the name index (the consumer is
        expected to use the skeleton CU to find the real data).
      - use the main object file string pool for the strings in the index
      
      Reviewers: JDevlieghere, aprantl, dblaikie
      
      Subscribers: llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D45566
      
      llvm-svn: 330249
      3fb39c79
    • Bjorn Pettersson's avatar
      [DebugInfo] Sink related dbg users when sinking in InstCombine · bc4f19b6
      Bjorn Pettersson authored
      Summary:
      When sinking an instruction in InstCombine we now also sink
      the DbgInfoIntrinsics that are using the sunken value.
      
      Example)
      
      When sinking the load in this input
      
      bb.X:
        %0 = load i64, i64* %start, align 4, !dbg !31
        tail call void @llvm.dbg.value(metadata i64 %0, ...)
        br i1 %cond, label %for.end, label %for.body.lr.ph
      for.body.lr.ph:
        br label %for.body
      
      we now also move the dbg.value, like this
      
      bb.X:
        br i1 %cond, label %for.end, label %for.body.lr.ph
      for.body.lr.ph:
        %0 = load i64, i64* %start, align 4, !dbg !31
        tail call void @llvm.dbg.value(metadata i64 %0, ...)
        br label %for.body
      
      In the past we haven't moved the dbg.value so we got
      
      bb.X:
        tail call void @llvm.dbg.value(metadata i64 %0, ...)
        br i1 %cond, label %for.end, label %for.body.lr.ph
      for.body.lr.ph:
        %0 = load i64, i64* %start, align 4, !dbg !31
        br label %for.body
      
      
      So in the past we got a debug-use before the def of %0.
      And that dbg.value was also on the path jumping to %for.end, for
      which %0 never was defined.
      
      CodeGenPrepare normally comes to rescue later (when not moving
      the dbg.value), since it moves dbg.value instrinsics quite
      brutally, without really analysing if it is correct to move
      the intrinsic (see PR31878).
      So at the moment this patch isn't expected to have much impact,
      besides that it is moving the dbg.value already in opt, making
      the IR look more sane directly.
      
      This can be seen as a preparation to (hopefully) make it possible
      to turn off CodeGenPrepare::placeDbgValues later as a solution
      to PR31878.
      
      I also adjusted test/DebugInfo/X86/sdagsplit-1.ll to make the
      IR in the test case up-to-date with this behavior in InstCombine.
      
      Reviewers: rnk, vsk, aprantl
      
      Reviewed By: vsk, aprantl
      
      Subscribers: mattd, JDevlieghere, llvm-commits
      
      Tags: #debug-info
      
      Differential Revision: https://reviews.llvm.org/D45425
      
      llvm-svn: 330243
      bc4f19b6
    • Craig Topper's avatar
      dfccafe1
    • Craig Topper's avatar
      [X86] Give CMOV 2 cycle latency on SLM. · 513e11bb
      Craig Topper authored
      llvm-svn: 330239
      513e11bb
    • Craig Topper's avatar
      [X86] Don't crash on bad operand modifiers in inline assembly · 87046124
      Craig Topper authored
      Summary: Previously if a modifer was placed on a non-GPR register class we would hit an assert or crash.
      
      Reviewers: echristo
      
      Reviewed By: echristo
      
      Subscribers: eraman, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D45751
      
      llvm-svn: 330238
      87046124
    • Sanjay Patel's avatar
      [InstCombine] peek through bitcasted vector/array pointer GEP operand · aea15131
      Sanjay Patel authored
      The bitcast may be interfering with other combines or vectorization 
      as shown in PR16739:
      https://bugs.llvm.org/show_bug.cgi?id=16739
      
      Most pointer-related optimizations are probably able to look through 
      this bitcast, but removing the bitcast shrinks the IR, so it's at
      least a size savings.
      
      Differential Revision: https://reviews.llvm.org/D44833
      
      llvm-svn: 330237
      aea15131
    • Bob Haarman's avatar
      Fix lock order inversion between ManagedStatic and Statistic · 37a9269c
      Bob Haarman authored
      Summary:
      Statistic and ManagedStatic both use mutexes. There was a lock order
      inversion where, during initialization, Statistic's mutex would be
      held while taking ManagedStatic's, and in llvm_shutdown,
      ManagedStatic's mutex would be held while taking Statistic's
      mutex. This change causes Statistic's initialization code to avoid
      holding its mutex while calling ManagedStatic's methods, avoiding the
      inversion.
      
      Reviewers: dsanders, rtereshin
      
      Reviewed By: dsanders
      
      Subscribers: hiraditya, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D45398
      
      llvm-svn: 330236
      37a9269c
    • Stanislav Mekhanoshin's avatar
      [AMDGPU] Enabled v2.16 literals for VOP3P · 8b20b7dc
      Stanislav Mekhanoshin authored
      Literal encoding needs op_sel_hi to select low 16 bit in this case.
      
      Differential Revision: https://reviews.llvm.org/D45745
      
      llvm-svn: 330230
      8b20b7dc
    • Vedant Kumar's avatar
      [Mem2Reg] Create merged debug locations for inserted phis · b0585893
      Vedant Kumar authored
      Track the debug locations of the incoming values to newly-created phis,
      and apply merged debug locations to the phis.
      
      A merged location will be on line 0, but will have the correct scope
      set. This improves crash reporting when an inlined instruction with a
      merged location triggers a machine exception. A debugger will be able to
      narrow down the crash to the correct inlined scope, instead of simply
      pointing to the outer scope of the caller.
      
      Taken together with a change allows generating merged line-0 locations
      for  instructions which aren't calls, this results in a 0.5% increase in
      the uncompressed size of the .debug_line section of a stage2+Release
      build of clang (-O3 -g).
      
      rdar://33858697
      
      Differential Revision: https://reviews.llvm.org/D45397
      
      llvm-svn: 330227
      b0585893
    • Vedant Kumar's avatar
      [Mem2Reg] Make RenamePassData a struct, NFC · 4b29172d
      Vedant Kumar authored
      llvm-svn: 330226
      4b29172d
  2. Apr 17, 2018
Loading