Skip to content
  1. Apr 04, 2018
  2. Apr 03, 2018
    • Jessica Paquette's avatar
      [MachineOutliner] Keep track of fns that use a redzone in AArch64FunctionInfo · 642f6c61
      Jessica Paquette authored
      This patch adds a hasRedZone() function to AArch64MachineFunctionInfo. It
      returns true if the function is known to use a redzone, false if it is known
      to not use a redzone, and no value otherwise.
      
      This removes the requirement to pass -mno-red-zone when outlining for AArch64.
      
      https://reviews.llvm.org/D45189
      
      llvm-svn: 329120
      642f6c61
    • Farhana Aleen's avatar
      Revert "MSG" · 93694734
      Farhana Aleen authored
      This reverts commit 9a0ce889d1c39c74d69ecad5ce9c875155ae55de.
      
      This was committed by mistake.
      
      llvm-svn: 329119
      93694734
    • Farhana Aleen's avatar
      MSG · 3ab409dc
      Farhana Aleen authored
      llvm-svn: 329114
      3ab409dc
    • Jun Bum Lim's avatar
      [CodeGen]Add NoVRegs property on PostRASink and ShrinkWrap · 7ab1b32b
      Jun Bum Lim authored
      Summary:
      This change declare that PostRAMachineSinking and ShrinkWrap require NoVRegs
      property, so now the MachineFunctionPass can enforce this check.
      These passes are disabled in NVPTX & WebAssembly.
      
      Reviewers: dschuff, jlebar, tra, jgravelle-google, MatzeB, sebpop, thegameg, mcrosier
      
      Reviewed By: dschuff, thegameg
      
      Subscribers: jholewinski, jfb, sbc100, aheejin, sunfish, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D45183
      
      llvm-svn: 329095
      7ab1b32b
    • Krzysztof Parzyszek's avatar
      [Hexagon] Remove -mhvx-double and the corresponding subtarget feature · 9fa6ffe2
      Krzysztof Parzyszek authored
      Specifying the HVX vector length should be done via the -mhvx-length
      option.
      
      llvm-svn: 329079
      9fa6ffe2
    • Andrea Di Biagio's avatar
      [MC][Tablegen] Allow the definition of processor register files in the... · 9da4d6db
      Andrea Di Biagio authored
      [MC][Tablegen] Allow the definition of processor register files in the scheduling model for llvm-mca
      
      This patch allows the description of register files in processor scheduling
      models. This addresses PR36662.
      
      A new tablegen class named 'RegisterFile' has been added to TargetSchedule.td.
      Targets can optionally describe register files for their processors using that
      class. In particular, class RegisterFile allows to specify:
       - The total number of physical registers.
       - Which target registers are accessible through the register file.
       - The cost of allocating a register at register renaming stage.
      
      Example (from this patch - see file X86/X86ScheduleBtVer2.td)
      
        def FpuPRF : RegisterFile<72, [VR64, VR128, VR256], [1, 1, 2]>
      
      Here, FpuPRF describes a register file for MMX/XMM/YMM registers. On Jaguar
      (btver2), a YMM register definition consumes 2 physical registers, while MMX/XMM
      register definitions only cost 1 physical register.
      
      The syntax allows to specify an empty set of register classes.  An empty set of
      register classes means: this register file models all the registers specified by
      the Target.  For each register class, users can specify an optional register
      cost. By default, register costs default to 1.  A value of 0 for the number of
      physical registers means: "this register file has an unbounded number of
      physical registers".
      
      This patch is structured in two parts.
      
      * Part 1 - MC/Tablegen *
      
      A first part adds the tablegen definition of RegisterFile, and teaches the
      SubtargetEmitter how to emit information related to register files.
      
      Information about register files is accessible through an instance of
      MCExtraProcessorInfo.
      The idea behind this design is to logically partition the processor description
      which is only used by external tools (like llvm-mca) from the processor
      information used by the llvm machine schedulers.
      I think that this design would make easier for targets to get rid of the extra
      processor information if they don't want it.
      
      * Part 2 - llvm-mca related *
      
      The second part of this patch is related to changes to llvm-mca.
      
      The main differences are:
       1) class RegisterFile now needs to take into account the "cost of a register"
      when allocating physical registers at register renaming stage.
       2) Point 1. triggered a minor refactoring which lef to the removal of the
      "maximum 32 register files" restriction.
       3) The BackendStatistics view has been updated so that we can print out extra
      details related to each register file implemented by the processor.
      
      The effect of point 3. is also visible in tests register-files-[1..5].s.
      
      Differential Revision: https://reviews.llvm.org/D44980
      
      llvm-svn: 329067
      9da4d6db
    • Hiroshi Inoue's avatar
      [PowerPC] reorder entries in P9InstrResources.td in alphabetical order; NFC · 08a1775f
      Hiroshi Inoue authored
      Reorder entries added in my previous commit (rL328969) to keep alphabetical order.
      
      llvm-svn: 329064
      08a1775f
    • Craig Topper's avatar
      [X86] Reduce number of OpPrefix bits in TSFlags to 2. NFCI · 9b6a65b9
      Craig Topper authored
      TSFlag doesn't need to disambiguate NoPrfx from PS. So shift the encodings so PS is NoPrfx|0x4.
      
      llvm-svn: 329049
      9b6a65b9
    • Yonghong Song's avatar
      bpf: fix incorrect SELECT_CC lowering · d3b522f5
      Yonghong Song authored
      
      
      Commit 37962a331c77 ("bpf: Improve expanding logic in LowerSELECT_CC")
      intended to improve code quality for certain jmp conditions. The
      commit, however, has a couple of issues:
        (1). In code, just swap is not enough, ConditionalCode CC
             should also be swapped, otherwise incorrect code will
             be generated.
        (2). The ConditionalCode swap should be subject to
             getHasJmpExt(). If getHasJmpExt() is False, certain
             conditional codes will not be supported and swap
             may generate incorrect code.
      
      The original goal for this patch is to optimize jmp operations
      which does not have JmpExt turned on. If JmpExt is on,
      better code could be generated. For example, the test
      select_ri.ll is introduced to demonstrate the optimization.
      The same result can be achieved with -mcpu=v2 flag.
      
      Signed-off-by: default avatarYonghong Song <yhs@fb.com>
      Acked-by: default avatarAlexei Starovoitov <ast@kernel.org>
      llvm-svn: 329043
      d3b522f5
    • Ikhlas Ajbar's avatar
      peel loops with runtime small trip counts · b7322e8a
      Ikhlas Ajbar authored
      For Hexagon, peeling loops with small runtime trip count is beneficial for our
      benchmarks. We set PeelCount in HexagonTargetInfo.cpp and we use PeelCount set
      by the target for computing the desired peel count.
      
      Differential Revision: https://reviews.llvm.org/D44880
      
      llvm-svn: 329042
      b7322e8a
  3. Apr 02, 2018
Loading