Skip to content
  1. Feb 16, 2017
  2. Feb 15, 2017
    • George Burgess IV's avatar
      [Modules] Consider enable_if attrs in isSameEntity. · 9584508d
      George Burgess IV authored
      Two functions that differ only in their enable_if attributes are
      considered overloads, so we should check for those when we're trying to
      figure out if two functions are mergeable.
      
      We need to do the same thing for pass_object_size, as well. Looks like
      that'll be a bit less trivial, since we sometimes do these merging
      checks before we have pass_object_size attributes available (see the
      merge checks in ASTDeclReader::VisitFunctionDecl that happen before we
      read parameters, and merge checks in calls to ReadDeclAs<>()).
      
      llvm-svn: 295252
      9584508d
    • Tim Shen's avatar
      c6ce73b1
    • Tim Shen's avatar
      [XRay] Fix gtest error code comparison. NFC. · b3937285
      Tim Shen authored
      llvm-svn: 295248
      b3937285
    • Matt Arsenault's avatar
      AMDGPU: Remove dead node definitions · 824de226
      Matt Arsenault authored
      llvm-svn: 295247
      824de226
    • Matt Arsenault's avatar
      Fix typos · 900b21c3
      Matt Arsenault authored
      llvm-svn: 295246
      900b21c3
    • Rafael Espindola's avatar
      Fix the static build. · 46f68f50
      Rafael Espindola authored
      llvm-svn: 295245
      46f68f50
    • Matt Arsenault's avatar
      AMDGPU: Consolidate sendmsg/sendmsghalt handling and tests · a78ca62c
      Matt Arsenault authored
      llvm-svn: 295244
      a78ca62c
    • Eugene Zelenko's avatar
    • Matt Arsenault's avatar
      DAG: Do not scalarize fsub if fneg is legal · 5de8dc9c
      Matt Arsenault authored
      Tests will be included with future commit.
      
      llvm-svn: 295242
      5de8dc9c
    • Peter Collingbourne's avatar
      Re-apply r295110 and r295144 with a fix for the ASan issue. · 50cbd7cc
      Peter Collingbourne authored
      llvm-svn: 295241
      50cbd7cc
    • Ed Schouten's avatar
      Make --export-dynamic work on non-PIC/PIE targets. · 3bf71304
      Ed Schouten authored
      For CloudABI I'm only interested in generating non-PIC/PIE executables
      on armv6 and i686, as PIE introduces larger overhead than on aarch64 and
      x86_64. Still, I want to be able to instruct the linker to generate a
      dynamic symbol table if requested. One example use for this is that
      dynamic symbol tables can be used by programs to print nicely formatted
      stacktraces, including symbol names.
      
      Right now there seems to be some logic in LLD that it only wants to emit
      dynamic symbol tables when either linking against libraries or when
      building PIC. Let's extend this to also take --export-dynamic into
      account.
      
      Reviewed by:	ruiu
      Differential Revision:	https://reviews.llvm.org/D29982
      
      llvm-svn: 295240
      3bf71304
    • Matt Arsenault's avatar
      AMDGPU: Replace assert with report_fatal_error · d122abea
      Matt Arsenault authored
      Also use a more refined condition.
      
      llvm-svn: 295239
      d122abea
    • Keno Fischer's avatar
      [GlobalObject] Fix setSection("") · 5e1e5918
      Keno Fischer authored
      Summary:
      In rL291613, the section name was interned in LLVMContext. However,
      this broke the ability to remove the section from a GlobalObject,
      because it tried to intern empty strings, which is not allowed.
      Fix that and add an appropriate regression test.
      
      Reviewed By: rnk
      Differential Revision: https://reviews.llvm.org/D29795
      
      llvm-svn: 295238
      5e1e5918
    • Sanjay Patel's avatar
      [InstCombine] improve formatting; NFC · 845ea963
      Sanjay Patel authored
      llvm-svn: 295237
      845ea963
    • Peter Collingbourne's avatar
      AssumptionCache: Disable the verifier by default, move it behind a hidden... · 9421c2dc
      Peter Collingbourne authored
      AssumptionCache: Disable the verifier by default, move it behind a hidden cl::opt and verify from releaseMemory().
      
      This is a short term solution to the problem that many passes currently fail
      to update the assumption cache. In the long term the verifier should not
      be controllable with a flag. We should either fix all passes to correctly
      update the assumption cache and enable the verifier unconditionally or
      somehow arrange for the assumption list to be updated automatically by passes.
      
      Differential Revision: https://reviews.llvm.org/D30003
      
      llvm-svn: 295236
      9421c2dc
    • Simon Pilgrim's avatar
      [X86][SSE] Don't call EltsFromConsecutiveLoads if any element is missing. · 5b4c30fb
      Simon Pilgrim authored
      Minor performance speedup - if any call to getShuffleScalarElt fails to get a result, don't both calling for the remaining elements as EltsFromConsecutiveLoads will fail anyhow.
      
      llvm-svn: 295235
      5b4c30fb
    • Etienne Bergeron's avatar
      [compiler-rt][asan|win] Fix ASAN exception handler missing import · b69639e2
      Etienne Bergeron authored
      Summary:
      This patch is adding a missing ASAN API redirection from an instrumented DLL.
      The bug was introduced here:
        https://reviews.llvm.org/D29463
      
      This is causing this chromium bug:
        https://bugs.chromium.org/p/chromium/issues/detail?id=692580
      
      Reviewers: rnk
      
      Reviewed By: rnk
      
      Subscribers: kubamracek, dberris, llvm-commits, chrisha, thakis
      
      Differential Revision: https://reviews.llvm.org/D30001
      
      llvm-svn: 295232
      b69639e2
    • George Rokos's avatar
      [OpenMP] libomptarget: Protect parent struct from being deallocated · 15a6e7da
      George Rokos authored
      Fixed bug due to which a parent struct was deallocated when one of the struct's pointers was being unmapped.
      
      Differential Revision: https://reviews.llvm.org/D29914
      
      llvm-svn: 295231
      15a6e7da
    • Arnold Schwaighofer's avatar
      AddressSanitizer: don't track swifterror memory addresses · 8d61e003
      Arnold Schwaighofer authored
      They are register promoted by ISel and so it makes no sense to treat them as
      memory.
      
      Inserting calls to the thread sanitizer would also generate invalid IR.
      
      You would hit:
      
      "swifterror value can only be loaded and stored from, or as a swifterror
      argument!"
      
      llvm-svn: 295230
      8d61e003
    • Ahmed Bougacha's avatar
      [AArch64] Make am_ldrlit an iPTR - not OtherVT - operand. NFC-ish. · f8acf568
      Ahmed Bougacha authored
      am_ldrlit diverged from am_brcond in r207105, but kept the OtherVT
      operand type.  It made sense for branch targets, as those are
      represented as MVT::Other in SDAG.  But loads operate on pointers.
      
      This shouldn't have an observable effect on any in-tree code, but helps
      make the patterns consistent for external users.
      
      llvm-svn: 295229
      f8acf568
    • Ahmed Bougacha's avatar
      [OptDiag] Pass const Values/Types to Argument. NFC. · 36026006
      Ahmed Bougacha authored
      llvm-svn: 295228
      36026006
    • Ahmed Bougacha's avatar
      [IR] Accept 'const Type &' in the Type operator<<. NFC. · f9e5a1dd
      Ahmed Bougacha authored
      Type::print is const; there's no reason for the operator not to be.
      
      llvm-svn: 295227
      f9e5a1dd
    • Tobias Edler von Koch's avatar
      [LTO] Add ability to emit assembly to new LTO API · f454b9ea
      Tobias Edler von Koch authored
      Summary:
      Add a field to LTO::Config, CGFileType, to select the file type to emit (object
      or assembly). This is useful for testing and to implement -save-temps.
      
      Reviewers: tejohnson, mehdi_amini, pcc
      
      Reviewed By: mehdi_amini
      
      Subscribers: davide, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D29475
      
      llvm-svn: 295226
      f454b9ea
    • Rui Ueyama's avatar
      Apply different tokenization rules to linker script expressions. · 731a66ae
      Rui Ueyama authored
      The linker script lexer is context-sensitive. In the regular context,
      arithmetic operator characters are regular characters, but in the
      expression context, they are independent tokens. This afects how the
      lexer tokenizes "3*4", for example. (This kind of expression is real;
      the Linux kernel uses it.)
      
      This patch defines function `maybeSplitExpr`. This function splits the
      current token into multiple expression tokens if the lexer is in the
      expression context.
      
      Differential Revision: https://reviews.llvm.org/D29963
      
      llvm-svn: 295225
      731a66ae
    • Richard Smith's avatar
    • Kyle Butt's avatar
      Codegen: Make chains from trellis-shaped CFGs · 7fbec9bd
      Kyle Butt authored
      Lay out trellis-shaped CFGs optimally.
      A trellis of the shape below:
      
        A     B
        |\   /|
        | \ / |
        |  X  |
        | / \ |
        |/   \|
        C     D
      
      would be laid out A; B->C ; D by the current layout algorithm. Now we identify
      trellises and lay them out either A->C; B->D or A->D; B->C. This scales with an
      increasing number of predecessors. A trellis is a a group of 2 or more
      predecessor blocks that all have the same successors.
      
      because of this we can tail duplicate to extend existing trellises.
      
      As an example consider the following CFG:
      
          B   D   F   H
         / \ / \ / \ / \
        A---C---E---G---Ret
      
      Where A,C,E,G are all small (Currently 2 instructions).
      
      The CFG preserving layout is then A,B,C,D,E,F,G,H,Ret.
      
      The current code will copy C into B, E into D and G into F and yield the layout
      A,C,B(C),E,D(E),F(G),G,H,ret
      
      define void @straight_test(i32 %tag) {
      entry:
        br label %test1
      test1: ; A
        %tagbit1 = and i32 %tag, 1
        %tagbit1eq0 = icmp eq i32 %tagbit1, 0
        br i1 %tagbit1eq0, label %test2, label %optional1
      optional1: ; B
        call void @a()
        br label %test2
      test2: ; C
        %tagbit2 = and i32 %tag, 2
        %tagbit2eq0 = icmp eq i32 %tagbit2, 0
        br i1 %tagbit2eq0, label %test3, label %optional2
      optional2: ; D
        call void @b()
        br label %test3
      test3: ; E
        %tagbit3 = and i32 %tag, 4
        %tagbit3eq0 = icmp eq i32 %tagbit3, 0
        br i1 %tagbit3eq0, label %test4, label %optional3
      optional3: ; F
        call void @c()
        br label %test4
      test4: ; G
        %tagbit4 = and i32 %tag, 8
        %tagbit4eq0 = icmp eq i32 %tagbit4, 0
        br i1 %tagbit4eq0, label %exit, label %optional4
      optional4: ; H
        call void @d()
        br label %exit
      exit:
        ret void
      }
      
      here is the layout after D27742:
      straight_test:                          # @straight_test
      ; ... Prologue elided
      ; BB#0:                                 # %entry ; A (merged with test1)
      ; ... More prologue elided
      	mr 30, 3
      	andi. 3, 30, 1
      	bc 12, 1, .LBB0_2
      ; BB#1:                                 # %test2 ; C
      	rlwinm. 3, 30, 0, 30, 30
      	beq	 0, .LBB0_3
      	b .LBB0_4
      .LBB0_2:                                # %optional1 ; B (copy of C)
      	bl a
      	nop
      	rlwinm. 3, 30, 0, 30, 30
      	bne	 0, .LBB0_4
      .LBB0_3:                                # %test3 ; E
      	rlwinm. 3, 30, 0, 29, 29
      	beq	 0, .LBB0_5
      	b .LBB0_6
      .LBB0_4:                                # %optional2 ; D (copy of E)
      	bl b
      	nop
      	rlwinm. 3, 30, 0, 29, 29
      	bne	 0, .LBB0_6
      .LBB0_5:                                # %test4 ; G
      	rlwinm. 3, 30, 0, 28, 28
      	beq	 0, .LBB0_8
      	b .LBB0_7
      .LBB0_6:                                # %optional3 ; F (copy of G)
      	bl c
      	nop
      	rlwinm. 3, 30, 0, 28, 28
      	beq	 0, .LBB0_8
      .LBB0_7:                                # %optional4 ; H
      	bl d
      	nop
      .LBB0_8:                                # %exit ; Ret
      	ld 30, 96(1)                    # 8-byte Folded Reload
      	addi 1, 1, 112
      	ld 0, 16(1)
      	mtlr 0
      	blr
      
      The tail-duplication has produced some benefit, but it has also produced a
      trellis which is not laid out optimally. With this patch, we improve the layouts
      of such trellises, and decrease the cost calculation for tail-duplication
      accordingly.
      
      This patch produces the layout A,C,E,G,B,D,F,H,Ret. This layout does have
      back edges, which is a negative, but it has a bigger compensating
      positive, which is that it handles the case where there are long strings
      of skipped blocks much better than the original layout. Both layouts
      handle runs of executed blocks equally well. Branch prediction also
      improves if there is any correlation between subsequent optional blocks.
      
      Here is the resulting concrete layout:
      
      straight_test:                          # @straight_test
      ; BB#0:                                 # %entry ; A (merged with test1)
      	mr 30, 3
      	andi. 3, 30, 1
      	bc 12, 1, .LBB0_4
      ; BB#1:                                 # %test2 ; C
      	rlwinm. 3, 30, 0, 30, 30
      	bne	 0, .LBB0_5
      .LBB0_2:                                # %test3 ; E
      	rlwinm. 3, 30, 0, 29, 29
      	bne	 0, .LBB0_6
      .LBB0_3:                                # %test4 ; G
      	rlwinm. 3, 30, 0, 28, 28
      	bne	 0, .LBB0_7
      	b .LBB0_8
      .LBB0_4:                                # %optional1 ; B (Copy of C)
      	bl a
      	nop
      	rlwinm. 3, 30, 0, 30, 30
      	beq	 0, .LBB0_2
      .LBB0_5:                                # %optional2 ; D (Copy of E)
      	bl b
      	nop
      	rlwinm. 3, 30, 0, 29, 29
      	beq	 0, .LBB0_3
      .LBB0_6:                                # %optional3 ; F (Copy of G)
      	bl c
      	nop
      	rlwinm. 3, 30, 0, 28, 28
      	beq	 0, .LBB0_8
      .LBB0_7:                                # %optional4 ; H
      	bl d
      	nop
      .LBB0_8:                                # %exit
      
      Differential Revision: https://reviews.llvm.org/D28522
      
      llvm-svn: 295223
      7fbec9bd
    • Rui Ueyama's avatar
      Simplify operator tests. · a4601b5d
      Rui Ueyama authored
      llvm-svn: 295222
      a4601b5d
    • Rui Ueyama's avatar
      Rename a test as they are tests for operators. · fd5edff8
      Rui Ueyama authored
      llvm-svn: 295221
      fd5edff8
    • Xinliang David Li's avatar
      include function name in dot filename · 538d6668
      Xinliang David Li authored
      Differential Revision: http://reviews.llvm.org/D29975
      
      llvm-svn: 295220
      538d6668
Loading