Skip to content
  1. Jun 12, 2017
  2. Jun 08, 2017
  3. Jun 07, 2017
  4. Jun 06, 2017
    • Chandler Carruth's avatar
      Sort the remaining #include lines in include/... and lib/.... · 6bda14b3
      Chandler Carruth authored
      I did this a long time ago with a janky python script, but now
      clang-format has built-in support for this. I fed clang-format every
      line with a #include and let it re-sort things according to the precise
      LLVM rules for include ordering baked into clang-format these days.
      
      I've reverted a number of files where the results of sorting includes
      isn't healthy. Either places where we have legacy code relying on
      particular include ordering (where possible, I'll fix these separately)
      or where we have particular formatting around #include lines that
      I didn't want to disturb in this patch.
      
      This patch is *entirely* mechanical. If you get merge conflicts or
      anything, just ignore the changes in this patch and run clang-format
      over your #include lines in the files.
      
      Sorry for any noise here, but it is important to keep these things
      stable. I was seeing an increasing number of patches with irrelevant
      re-ordering of #include lines because clang-format was used. This patch
      at least isolates that churn, makes it easy to skip when resolving
      conflicts, and gets us to a clean baseline (again).
      
      llvm-svn: 304787
      6bda14b3
  5. May 31, 2017
  6. May 30, 2017
    • Matthias Braun's avatar
      TargetPassConfig: Keep a reference to an LLVMTargetMachine; NFC · 5e394c3d
      Matthias Braun authored
      TargetPassConfig is not useful for targets that do not use the CodeGen
      library, so we may just as well store a pointer to an
      LLVMTargetMachine instead of just to a TargetMachine.
      
      While at it, also change the constructor to take a reference instead of a
      pointer as the TM must not be nullptr.
      
      llvm-svn: 304247
      5e394c3d
    • Craig Topper's avatar
      [SelectionDAG] Set ISD::FPOWI to Expand by default · f6d4dc5b
      Craig Topper authored
      Summary:
      Currently FPOWI defaults to Legal and LegalizeDAG.cpp turns Legal into Expand for this opcode because Legal is a "lie".
      
      This patch changes the default for this opcode to Expand and removes the hack from LegalizeDAG.cpp. It also removes all the code in the targets that set this opcode to Expand themselves since they can just rely on the default.
      
      Reviewers: spatel, RKSimon, efriedma
      
      Reviewed By: RKSimon
      
      Subscribers: jfb, dschuff, sbc100, jgravelle-google, nemanjai, javed.absar, andrew.w.kaylor, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D33530
      
      llvm-svn: 304215
      f6d4dc5b
  7. May 29, 2017
    • Hiroshi Inoue's avatar
      [PPC] Fix assertion failure during binary encoding with -mcpu=pwr9 · e3c14ebb
      Hiroshi Inoue authored
      Summary
      clang -c -mcpu=pwr9 test/CodeGen/PowerPC/build-vector-tests.ll causes an assertion failure during the binary encoding.
      The failure occurs when a D-form load instruction takes two register operands instead of a register + an immediate.
      
      This patch fixes the problem and also adds an assertion to catch this failure earlier before the binary encoding (i.e. during lit test).
      The fix is from Nemanja Ivanovic @nemanjai.
      
      Differential Revision: https://reviews.llvm.org/D33482
      
      llvm-svn: 304133
      e3c14ebb
  8. May 26, 2017
  9. May 25, 2017
    • Kyle Butt's avatar
      PPC: Correct Size for GETtlsADDR · 13379d7c
      Kyle Butt authored
      PPC::GETtlsADDR is lowered to a branch and a nop, by the assembly
      printer. Its size was incorrectly marked as 4, correct it to 8. The
      incorrect size can cause incorrect branch relaxation in
      PPCBranchSelector under the right conditions.
      
      llvm-svn: 303904
      13379d7c
    • Tony Jiang's avatar
      [PowerPC] Fix a performance bug for PPC::XXSLDWI. · 0a429f04
      Tony Jiang authored
      There are some VectorShuffle Nodes in SDAG which can be selected to XXSLDWI
      instruction, this patch recognizes them and does the selection to improve the
      PPC performance.
      
      llvm-svn: 303822
      0a429f04
  10. May 24, 2017
  11. May 21, 2017
    • Hiroshi Inoue's avatar
      Summary · 37e63b1b
      Hiroshi Inoue authored
      PPC backend eliminates compare instructions by using record-form instructions in PPCInstrInfo::optimizeCompareInstr, which is called from peephole optimization pass.
      This patch improves this optimization to eliminate more compare instructions in two types of common case.
      
      
      - comparison against a constant 1 or -1
      
      The record-form instructions set CR bit based on signed comparison against 0. So, the current implementation does not exploit the record-form instruction for comparison against a non-zero constant.
      This patch enables record-form optimization for constant of 1 or -1 if possible; it changes the condition "greater than -1" into "greater than or equal to 0" and "less than 1" into "less than or equal to 0".
      With this patch, compare can be eliminated in the following code sequence, as an example.
      
      uint64_t a, b;
      if ((a | b) & 0x8000000000000000ull) { ... }
      else { ... }
      
      
      - andi for 32-bit comparison on PPC64
      
      Since record-form instructions execute 64-bit signed comparison and so we have limitation in eliminating 32-bit comparison, i.e. with cmplwi, using the record-form. The original implementation already has such checks but andi. is not recognized as an instruction which executes implicit zero extension and hence safe to convert into record-form if used for equality check.
      
      %1 = and i32 %a, 10
      %2 = icmp ne i32 %1, 0
      br i1 %2, label %foo, label %bar
      
      In this simple example, LLVM generates andi. + cmplwi + beq on PPC64.
      This patch make it possible to eliminate the cmplwi for this case.
      I added andi. for optimization targets if it is safe to do so.
      
      Differential Revision: https://reviews.llvm.org/D30081
      
      llvm-svn: 303500
      37e63b1b
  12. May 18, 2017
    • Francis Visoiu Mistrih's avatar
      [LegacyPassManager] Remove TargetMachine constructors · 8b61764c
      Francis Visoiu Mistrih authored
      This provides a new way to access the TargetMachine through
      TargetPassConfig, as a dependency.
      
      The patterns replaced here are:
      
      * Passes handling a null TargetMachine call
        `getAnalysisIfAvailable<TargetPassConfig>`.
      
      * Passes not handling a null TargetMachine
        `addRequired<TargetPassConfig>` and call
        `getAnalysis<TargetPassConfig>`.
      
      * MachineFunctionPasses now use MF.getTarget().
      
      * Remove all the TargetMachine constructors.
      * Remove INITIALIZE_TM_PASS.
      
      This fixes a crash when running `llc -start-before prologepilog`.
      
      PEI needs StackProtector, which gets constructed without a TargetMachine
      by the pass manager. The StackProtector pass doesn't handle the case
      where there is no TargetMachine, so it segfaults.
      
      Related to PR30324.
      
      Differential Revision: https://reviews.llvm.org/D33222
      
      llvm-svn: 303360
      8b61764c
  13. May 17, 2017
  14. May 16, 2017
  15. May 12, 2017
  16. May 11, 2017
  17. May 09, 2017
    • Tim Shen's avatar
      [Atomic] Remove IsStore/IsLoad in the interface, and pass the instruction instead. NFC. · 04de70d3
      Tim Shen authored
      Now both emitLeadingFence and emitTrailingFence take the instruction
      itself, instead of taking IsLoad/IsStore pairs.
      Instruction::mayReadFromMemory and Instrucion::mayWriteToMemory are used
      for determining those two booleans.
      
      The instruction argument is also useful for later D32763, in
      emitTrailingFence. For emitLeadingFence, it seems to have cleaner
      interface with the proposed change.
      
      Differential Revision: https://reviews.llvm.org/D32762
      
      llvm-svn: 302539
      04de70d3
    • Serge Pavlov's avatar
      Add extra operand to CALLSEQ_START to keep frame part set up previously · d526b13e
      Serge Pavlov authored
      Using arguments with attribute inalloca creates problems for verification
      of machine representation. This attribute instructs the backend that the
      argument is prepared in stack prior to  CALLSEQ_START..CALLSEQ_END
      sequence (see http://llvm.org/docs/InAlloca.htm for details). Frame size
      stored in CALLSEQ_START in this case does not count the size of this
      argument. However CALLSEQ_END still keeps total frame size, as caller can
      be responsible for cleanup of entire frame. So CALLSEQ_START and
      CALLSEQ_END keep different frame size and the difference is treated by
      MachineVerifier as stack error. Currently there is no way to distinguish
      this case from actual errors.
      
      This patch adds additional argument to CALLSEQ_START and its
      target-specific counterparts to keep size of stack that is set up prior to
      the call frame sequence. This argument allows MachineVerifier to calculate
      actual frame size associated with frame setup instruction and correctly
      process the case of inalloca arguments.
      
      The changes made by the patch are:
      - Frame setup instructions get the second mandatory argument. It
        affects all targets that use frame pseudo instructions and touched many
        files although the changes are uniform.
      - Access to frame properties are implemented using special instructions
        rather than calls getOperand(N).getImm(). For X86 and ARM such
        replacement was made previously.
      - Changes that reflect appearance of additional argument of frame setup
        instruction. These involve proper instruction initialization and
        methods that access instruction arguments.
      - MachineVerifier retrieves frame size using method, which reports sum of
        frame parts initialized inside frame instruction pair and outside it.
      
      The patch implements approach proposed by Quentin Colombet in
      https://bugs.llvm.org/show_bug.cgi?id=27481#c1.
      It fixes 9 tests failed with machine verifier enabled and listed
      in PR27481.
      
      Differential Revision: https://reviews.llvm.org/D32394
      
      llvm-svn: 302527
      d526b13e
  18. May 05, 2017
  19. May 04, 2017
  20. May 03, 2017
  21. May 02, 2017
  22. May 01, 2017
Loading