Skip to content
  1. May 23, 2018
    • Lei Huang's avatar
      [Power9]Legalize and emit code for DW vector extract and convert to QP · 8990168a
      Lei Huang authored
      Implemente patterns to extract [Un]signed DWord vector element and convert to
      quad-precision.
      
      Differential Revision: https://reviews.llvm.org/D46333
      
      llvm-svn: 333112
      8990168a
    • Changpeng Fang's avatar
      StructurizeCFG: Adjust the loop depth for a subregion to order the nodes correctly · 5f915461
      Changpeng Fang authored
      Summary:
        StructurizeCFG::orderNodes basically uses a reverse post-order (RPO) traversal of the region list to get the order.
      The only problem with it is that sometimes backedges for outer loops will be visited before backedges for inner loops.
      To solve this problem, a loop depth based approach has been used to make sure all blocks in this loop has been visited
      before moving on to outer loop.
      
      However, we found a problem for a SubRegion which is a loop itself:
      
      --> BB1 --> BB2 --> BB3 -->
      
      In this case, BB2 is a SubRegion (loop), and thus its loopdepth is different than that of BB1 and BB3. This fact will lead
      BB2 to be placed in the wrong order.
      
      In this work, we treat the SubRegion as a special case and use its exit block to determine the loop and its depth
      to guard the sorting.
      
      Reviewers:
        arsenm, jlebar
      
      Differential Revision:
        https://reviews.llvm.org/D46912
      
      llvm-svn: 333111
      5f915461
    • Chad Rosier's avatar
      [CodeGen][AArch64] Use RegUnits to track register aliases. (NFC) · 3f663631
      Chad Rosier authored
      Use RegUnits to track register aliases in AArch64RedundantCopyElimination.
      
      Differential Revision: https://reviews.llvm.org/D47269
      
      llvm-svn: 333107
      3f663631
    • Roman Lebedev's avatar
      [InstCombine] Fold unfolded masked merge pattern with variable mask! · 6b6c553b
      Roman Lebedev authored
      Summary:
      Finally fixes [[ https://bugs.llvm.org/show_bug.cgi?id=6773 | PR6773 ]].
      
      Now that the backend is all done, we can finally fold it!
      
      The canonical unfolded masked merge pattern is
      ```(x &  m) | (y & ~m)```
      There is a second, equivalent variant:
      ```(x | ~m) & (y |  m)```
      Only one of them (the or-of-and's i think) is canonical.
      And if the mask is not a constant, we should fold it to:
      ```((x ^ y) & M) ^ y```
      
      https://rise4fun.com/Alive/ndQw
      
      Reviewers: spatel, craig.topper
      
      Reviewed By: spatel
      
      Subscribers: nicholas, RKSimon, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D46814
      
      llvm-svn: 333106
      6b6c553b
    • Jakub Kuderski's avatar
      [Dominators] Add PDT constructor from Function · ef33edd9
      Jakub Kuderski authored
      Summary: This patch adds a PDT constructor from Function and lets codes previously using a local class to do this use PostDominatorTree class directly.
      
      Reviewers: davide, kuhar, grosser, dberlin
      
      Reviewed By: kuhar
      
      Author: NutshellySima
      
      Subscribers: llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D46709
      
      llvm-svn: 333102
      ef33edd9
    • Craig Topper's avatar
    • Petar Jovanovic's avatar
      Silence warnings introduced with r333093 · 7d37bb42
      Petar Jovanovic authored
      r333093 introduced several warnings (-Wlogical-not-parentheses,
      -Wbool-compare).
      Adding parentheses in MipsSEInstrInfo::isCopyInstr() to silence it.
      
      llvm-svn: 333097
      7d37bb42
    • Matt Davis's avatar
      [llvm-mca] Fix header comments. NFC. · 6172c746
      Matt Davis authored
      llvm-svn: 333096
      6172c746
    • Andrea Di Biagio's avatar
      [llvm-mca] Print the "Block RThroughput" in the SummaryView. · 3fc20c9c
      Andrea Di Biagio authored
      This patch implements the "block reciprocal throughput" computation in the
      SummaryView.
      
      The block reciprocal throughput is computed as the MAX of:
        - NumMicroOps / DispatchWidth
        - Resource Cycles / #Units   (for every resource consumed).
      
      The block throughput is bounded from above by the hardware dispatch throughput.
      That is because the DispatchWidth is an upper bound on how many opcodes can be part
      of a single dispatch group.
      
      The block throughput is also limited by the amount of hardware parallelism. The
      number of available resource units affects how the resource pressure is
      distributed, and also how many blocks can be delivered every cycle.
      
      llvm-svn: 333095
      3fc20c9c
    • Petar Jovanovic's avatar
      [X86][MIPS][ARM] New machine instruction property 'isMoveReg' · c051000b
      Petar Jovanovic authored
      This property is needed in order to follow values movement between
      registers. This property is used in TII to implement method that
      returns true if simple copy like instruction is recognized, along
      with source and destination machine operands.
      
      Patch by Nikola Prica.
      
      Differential Revision: https://reviews.llvm.org/D45204
      
      llvm-svn: 333093
      c051000b
    • Nicola Zaghen's avatar
      Remove DEBUG macro. · 03d0b91f
      Nicola Zaghen authored
      Now that the LLVM_DEBUG() macro landed on the various sub-projects
      the DEBUG macro can be removed.
      Also change the new uses of DEBUG to LLVM_DEBUG.
      
      Differential Revision: https://reviews.llvm.org/D46952
      
      llvm-svn: 333091
      03d0b91f
    • Aaron Ballman's avatar
      Update my information in the CREDITS file. · 6d55b90a
      Aaron Ballman authored
      llvm-svn: 333088
      6d55b90a
    • Erich Keane's avatar
      Add myself to CREDITS.txt · 2b5b5a4c
      Erich Keane authored
      llvm-svn: 333087
      2b5b5a4c
    • Alex Bradbury's avatar
      [RISCV] Add symbol diff relocation support for RISC-V · 257d5b56
      Alex Bradbury authored
      For RISC-V it is desirable to have relaxation happen in the linker once 
      addresses are known, and as such the size between two instructions/byte 
      sequences in a section could change.
      
      For most assembler expressions, this is fine, as the absolute address results 
      in the expression being converted to a fixup, and finally relocations. 
      However, for expressions such as .quad .L2-.L1, the assembler folds this down 
      to a constant once fragments are laid out, under the assumption that the 
      difference can no longer change, although in the case of linker relaxation the 
      differences can change at link time, so the constant is incorrect. One place 
      where this commonly appears is in debug information, where the size of a 
      function expression is in a form similar to the above.
      
      This patch extends the assembler to allow an AsmBackend to declare that it 
      does not want the assembler to fold down this expression, and instead generate 
      a pair of relocations that allow the linker to carry out the calculation. In 
      this case, the expression is not folded, but when it comes to emitting a 
      fixup, the generic FK_Data_* fixups are converted into a pair, one for the 
      addition half, one for the subtraction, and this is passed to the relocation 
      generating methods as usual. I have named these FK_Data_Add_* and 
      FK_Data_Sub_* to indicate which half these are for.
      
      For RISC-V, which supports this via e.g. the R_RISCV_ADD64, R_RISCV_SUB64 pair 
      of relocations, these are also set to always emit relocations relative to 
      local symbols rather than section offsets. This is to deal with the fact that 
      if relocations were calculated on e.g. .text+8 and .text+4, the result 12 
      would be stored rather than 4 as both addends are added in the linker.
      
      Differential Revision: https://reviews.llvm.org/D45181
      Patch by Simon Cook.
      
      llvm-svn: 333079
      257d5b56
    • Alex Bradbury's avatar
      [Sparc] Use addAliasForDirective to support data directives · 3fa69dd0
      Alex Bradbury authored
      The Sparc asm parser currently has custom parsing logic for .half, .word, 
      .nword and .xword. Rather than use this custom logic, we can just use 
      addAliasForDirective to enable the reuse of AsmParser::parseDirectiveValue.
      
      https://reviews.llvm.org/D47003
      
      llvm-svn: 333078
      3fa69dd0
    • Alex Bradbury's avatar
      [AArch64] Use addAliasForDirective to support data directives · 0a59f189
      Alex Bradbury authored
      The AArch64 asm parser currently has custom parsing logic for .hword, .word, 
      and .xword. Rather than use this custom logic, we can just use 
      addAliasForDirective to enable the reuse of AsmParser::parseDirectiveValue.
      
      Differential Revision: https://reviews.llvm.org/D47000
      
      llvm-svn: 333077
      0a59f189
    • Alex Bradbury's avatar
      [RISCV] Correctly report sizes for builtin fixups · 1c010d0f
      Alex Bradbury authored
      This is a different approach to fixing the problem described in D46746. 
      RISCVAsmBackend currently depends on the getSize helper function returning the 
      number of bytes a fixup may change (note: some other backends have a similar 
      helper named getFixupNumKindBytes). As noted in that review, this doesn't 
      return the correct size for FK_Data_1, FK_Data_2, or FK_Data_8 meaning that 
      too few bytes will be written in the case of FK_Data_8, and there's the 
      potential of writing outside the Data array for the smaller fixups.
      
      D46746 extends getSize to recognise some of the builtin fixup types. Rather 
      than having a function that needs to be kept up to date as new builtin or 
      target-specific fixups are added, We can calculate an appropriate bound on the 
      number of bytes that might be touched using Info.TargetSize and 
      Info.TargetOffset.
      
      Differential Revision: https://reviews.llvm.org/D46965
      
      llvm-svn: 333076
      1c010d0f
    • Max Kazantsev's avatar
      [LoopUnswitch] Fix SCEV invalidation in unswitching · d99f3bac
      Max Kazantsev authored
      Loop unswitching makes substantial changes to a loop that can also affect cached
      SCEV info in its outer loops as well, but it only cares to invalidate SCEV cache for the
      innermost loop in case of full unswitching and does not invalidate anything at all in
      case of trivial unswitching. As result, we may end up with incorrect data in cache.
      
      Differential Revision: https://reviews.llvm.org/D46045
      Reviewed By: mzolotukhin
      
      llvm-svn: 333072
      d99f3bac
    • Piotr Padlewski's avatar
      Fix aliasing of launder.invariant.group · d6f7346a
      Piotr Padlewski authored
      Summary:
      Patch for capture tracking broke
      bootstrap of clang with -fstict-vtable-pointers
      which resulted in debbugging nightmare. It was fixed
      https://reviews.llvm.org/D46900 but as it turned
      out, there were other parts like inliner (computing of
      noalias metadata) that I found after bootstraping with enabled
      assertions.
      
      Reviewers: hfinkel, rsmith, chandlerc, amharc, kuhar
      
      Subscribers: JDevlieghere, eraman, llvm-commits, hiraditya
      
      Differential Revision: https://reviews.llvm.org/D47088
      
      llvm-svn: 333070
      d6f7346a
    • Daniel Cederman's avatar
      [Sparc] Add mnemonic aliases for flush, stb, stba, sth, and stha · 6356571e
      Daniel Cederman authored
      Reviewers: jyknight
      
      Reviewed By: jyknight
      
      Subscribers: fedor.sergeev, jrtc27, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D47140
      
      llvm-svn: 333068
      6356571e
    • Serguei Katkov's avatar
      SafepointIRVerifier is made unreachable block tolerant · 46ef8fff
      Serguei Katkov authored
      SafepointIRVerifier crashed while traversing blocks without a DomTreeNode.
      This could happen with a custom pipeline or when some optional passes were skipped by OptBisect.
      
      SafepointIRVerifier is fixed to traverse basic blocks that are reachable from entry. Test are added.
      
      Patch Author: Yevgeny Rouban!
      Reviewers: anna, reames, dneilson, DaniilSuchkov, skatkov
      Reviewed By: reames
      Subscribers: llvm-commits
      Differential Revision: https://reviews.llvm.org/D47011
      
      llvm-svn: 333063
      46ef8fff
    • David Bolvansky's avatar
      [InstCombine] [NFC] Added more tests for unlocked IO transformation · cd3eb990
      David Bolvansky authored
      Subscribers: llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D47243
      
      llvm-svn: 333057
      cd3eb990
    • Roman Tereshin's avatar
      [GlobalISel][ARM] Adding HPR and QPR regclasses to FPRB regbank · e79d656c
      Roman Tereshin authored
      Also bringing ARMRegisterBankInfo::getRegBankFromRegClass
      implementation up to speed with the *.td-definition.
      
      Reviewed By: qcolombet
      
      Differential Revision: https://reviews.llvm.org/D43982
      
      llvm-svn: 333056
      e79d656c
    • Roman Tereshin's avatar
      [GlobalISel][InstructionSelect] MatchTable second level grouping, perf patch 5 · fedae33e
      Roman Tereshin authored
      This patch continues a series of patches started by r332907 (reapplied
      as r332917)
      
      In this commit we start grouping rules with common first condition on
      the second level of the table.
      
      This is expected to decrease time GlobalISel spends in its
      InstructionSelect pass by roughly 13% for an -O0 build as measured on
      sqlite3-amalgamation (http://sqlite.org/download.html) targeting
      AArch64.
      
      Reviewers: qcolombet, dsanders, bogner, aemerson, javed.absar
      
      Reviewed By: qcolombet
      
      Subscribers: rovka, llvm-commits, kristof.beyls
      
      Differential Revision: https://reviews.llvm.org/D44700
      
      llvm-svn: 333053
      fedae33e
    • Heejin Ahn's avatar
      [WebAssembly] Add functions for EHScopes · 1e4d3504
      Heejin Ahn authored
      Summary:
      There are functions using the term 'funclet' to refer to both
      1. an EH scopes, the structure of BBs that starts with
      catchpad/cleanuppad and ends with catchret/cleanupret, and
      2. a small function that gets outlined in AsmPrinter, which is the
      original meaning of 'funclet'.
      
      So far the two have been the same thing; EH scopes are always outlined
      in AsmPrinter as funclets at the end of the compilation pipeline. But
      now wasm also uses scope-based EH but does not outline those, so we now
      need to correctly distinguish those two use cases in functions.
      
      This patch splits `MachineBasicBlock::isFuncletEntry` into
      `isFuncletEntry` and `isEHScopeEntry`, and
      `MachineFunction::hasFunclets` into `hasFunclets` and `hasEHScopes`, in
      order to distinguish the two different use cases. And this also changes
      some uses of the term 'funclet' to 'scope' in `getFuncletMembership` and
      change the function name to `getEHScopeMembership` because this function
      is not about outlined funclets but about EH scope memberships.
      
      This change is in the same vein as D45559.
      
      Reviewers: majnemer, dschuff
      
      Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D47005
      
      llvm-svn: 333045
      1e4d3504
    • Sanjay Patel's avatar
      [InstCombine] use nsw negation for abs libcalls · 4b96935b
      Sanjay Patel authored
      Also, produce the canonical IR abs (s<0) to be more efficient. 
      
      This is the libcall equivalent of the clang builtin change from:
      rL333038
      
      Pasting from that commit message:
      The stdlib functions are defined in section 7.20.6.1 of the C standard with:
      "If the result cannot be represented, the behavior is undefined."
      
      That lets us mark the negation with 'nsw' because "sub i32 0, INT_MIN" would
      be UB/poison.
      
      llvm-svn: 333042
      4b96935b
    • Sanjay Patel's avatar
      [InstCombine] move misplaced test file and regenerate checks; NFC · 3ef8f858
      Sanjay Patel authored
      llvm-svn: 333039
      3ef8f858
    • Eli Friedman's avatar
      [docs] Clarify usage of "vector" in Programmer's Manual. · 35756e89
      Eli Friedman authored
      The explanation is specifically referring to std::vector; this might
      not be clear from the context.
      
      llvm-svn: 333036
      35756e89
    • Max Moroz's avatar
      [Coverage] Update CSS to make HTML reports copy-paste friendly. · 5b1083d7
      Max Moroz authored
      Summary:
      This minor change allows to copy snippets from HTML reports so they
      will be pasted in the following format:
      %LineNumber%\t%HitCount%\t%CodeLine%
      
      rather then being split onto multiple lines. To see this in action, try copy
      pasting from https://chromium-coverage.appspot.com/reports/560344/linux/chromium/src/third_party/zlib/compress.c.html
      
      Requested in https://bugs.chromium.org/p/chromium/issues/detail?id=845571
      
      Reviewers: vsk
      
      Reviewed By: vsk
      
      Subscribers: llvm-commits, morehouse, kcc
      
      Differential Revision: https://reviews.llvm.org/D47231
      
      llvm-svn: 333034
      5b1083d7
  2. May 22, 2018
Loading