Skip to content
  1. Jul 18, 2016
    • Tobias Grosser's avatar
      GPGPU: collect array references · b9fc860a
      Tobias Grosser authored
      Initialize the list of references to a GPU array to ensure that the arrays that
      need to be passed to kernel calls are computed correctly.  Furthermore, the very
      same information is also necessary to compute synchronization correctly. As the
      functionality to compute these references is already available, what is left for
      us to do is only to connect the necessary functionality to compute array
      reference information.
      
      llvm-svn: 275798
      b9fc860a
    • Tobias Grosser's avatar
      test: Add missing 'REQUIRES' line · 05aad8db
      Tobias Grosser authored
      llvm-svn: 275784
      05aad8db
    • Tobias Grosser's avatar
      GPGPU: Create host control flow · 38fc0aed
      Tobias Grosser authored
      Create LLVM-IR for all host-side control flow of a given GPU AST. We implement
      this by introducing a new GPUNodeBuilder class derived from IslNodeBuilder.  The
      IslNodeBuilder will take care of generating all general-purpose ast nodes, but
      we provide our own createUser implementation to handle the different GPU
      specific user statements. For now, we just skip any user statement and only
      generate a host-code sceleton, but in subsequent commits we will add handling of
      normal ScopStmt's performing computations, kernel calls, as well as host-device
      data transfers. We will also introduce run-time check generation and LICM in
      subsequent commits.
      
      llvm-svn: 275783
      38fc0aed
  2. Jul 15, 2016
    • Tobias Grosser's avatar
      GPGPU: Format statements scheduled on the host ourselves · 20251734
      Tobias Grosser authored
      Otherwise ppcg would try to call into pet functionality that this not available,
      which obviously will cause trouble. As we can easily print these statements
      ourselves, we just do so.
      
      llvm-svn: 275579
      20251734
    • Tobias Grosser's avatar
      GPGPU: Use schedule whole components for scheduler · 2341fe9e
      Tobias Grosser authored
      This option increases the scalability of the scheduler and allows us to remove
      the 'gisting' workaround we introduced in r275565 to handle a more complicated
      test case. Another benefit of using this option is also that the generated
      code looks a lot more streamlined.
      
      Thanks to Sven Verdoolaege for reminding me of this option.
      
      llvm-svn: 275573
      2341fe9e
    • Tobias Grosser's avatar
      GPGPU: Drop domain constraints from flow dependences · e4725437
      Tobias Grosser authored
      This works around a shortcoming of the isl scheduler, which even for some
      smaller test cases does not terminate in case domain constraints are part
      of the flow dependences.
      
      llvm-svn: 275565
      e4725437
    • Tobias Grosser's avatar
      GPGPU: Test scalar/array types i1/i3/i8/i32/i60/i64/i80/i120/i128/i3000 · 225dca78
      Tobias Grosser authored
      Arrays with integer base type are similar to arrays with floating point types,
      with the exception that LLVM's integer types can take some odd values. We
      add a selection of different values to make sure we correctly round these
      types when necessary.
      
      References to scalar integer types are special, as we currently do not model
      these types as array accesses as they are considered 'synthesizable' by Polly.
      As a result, we do not generate explicit data-transfers for them, but instead
      will need to keep track of all references to 'synthesizable' values separately.
      
      At the current stage, this is only visible by missing host-to-device
      data-transfer calls. In the future, we will also require special code generation
      strategies.
      
      llvm-svn: 275551
      225dca78
    • Tobias Grosser's avatar
      GPGPU: Test scalar parameters of type half/float/double/fp128/x86_fp80/ppc_fp128 · 8d9dcfc5
      Tobias Grosser authored
      We currently only test that the code structure we generate for these scalar
      parameters is correct and we add these types to make sure later code generation
      additions have sufficient test coverage.
      
      In case some of these types cannot be mapped due to missing hardware support
      on the GPU some of these test cases may need to be updated later on.
      
      llvm-svn: 275548
      8d9dcfc5
    • Tobias Grosser's avatar
      GPGPU: Make sure scops with more than one array work · 2d010daf
      Tobias Grosser authored
      We use this opportunity to add a test case containing a scalar parameter.
      
      llvm-svn: 275547
      2d010daf
    • Tobias Grosser's avatar
      GPGPU: Model array access information · 60f63b49
      Tobias Grosser authored
      This allows us to derive host-device and device-host data-transfers.
      
      llvm-svn: 275535
      60f63b49
    • Tobias Grosser's avatar
      GPGPU: Use CHECK-NEXT to harden test cases · eeb8a95a
      Tobias Grosser authored
      A sequence of CHECK lines allows additional statements to appear in the
      output of the tested program without any test failures appearing. As we do
      not want this to happen, switch this test case to use CHECK-NEXT.
      
      llvm-svn: 275534
      eeb8a95a
  3. Jul 14, 2016
    • Tobias Grosser's avatar
      GPGPU: Generate an AST for the GPU-mapped schedule · 69b46751
      Tobias Grosser authored
      For this we need to provide an explicit list of statements as they occur in
      the polly::Scop to ppcg.
      
      We also setup basic AST printing facilities to facilitate debugging. To allow
      code reuse some (minor) changes in ppcg are have been necessary.
      
      llvm-svn: 275436
      69b46751
    • Tobias Grosser's avatar
      GPGPU: Use a tile size of 32 by default · 4eaedde5
      Tobias Grosser authored
      The tile size was previously uninitialized. As a result, it was often zero (aka.
      no tiling), which is not what we want in general. More importantly, there was
      the risk for arbitrary tile sizes to be choosen, which we did not observe, but
      which still is highly problematic.
      
      llvm-svn: 275418
      4eaedde5
    • Tobias Grosser's avatar
      GPGPU: Map initial schedule to GPU schedule · aef5196f
      Tobias Grosser authored
      This change now applies ppcg's GPU mapping on our initial schedule. For this
      to work, we need to also initialize the set of all names (isl_ids) used in
      the scop as well as the program context.
      
      llvm-svn: 275396
      aef5196f
    • Tobias Grosser's avatar
      GPGPU: compute new schedule from polly scop · f384594d
      Tobias Grosser authored
      To do so we copy the necessary information to compute an initial schedule from
      polly::Scop to ppcg's scop. Most of the necessary information is directly
      available and only needs to be passed on to ppcg, with the exception of 'tagged'
      access relations, access relations that additionally carry information about
      which memory access an access relation originates from.
      
      We could possibly perform the construction of tagged accesses as part of
      ScopInfo, but as this format is currently specific to ppcg we do not do this
      yet, but keep this functionality local to our GPU code generation.
      
      After the scop has been initialized, we compute data dependences and ask ppcg to
      compute an initial schedule. Some of this functionality is already available in
      polly::DependenceInfo and polly::ScheduleOptimizer, but to keep differences
      to ppcg small we use ppcg's functionality here. We may later investiage if
      a closer integration of these tools makes sense.
      
      llvm-svn: 275390
      f384594d
    • Tobias Grosser's avatar
      GPGPU: create default initialized PPCG scop and gpu program · e938517e
      Tobias Grosser authored
      At this stage, we do not yet modify the IR but just generate a default
      initialized ppcg_scop and gpu_prog and free both immediately. Both will later be
      filled with data from the polly::Scop and are needed to use PPCG for GPU
      schedule generation. This commit does not yet perform any GPU code generation,
      but ensures that the basic infrastructure has been put in place.
      
      We also add a simple test case to ensure the new code is run and use this
      opportunity to verify that GPU_CODEGEN tests are only run if GPU code generation
      has been enabled in cmake.
      
      llvm-svn: 275389
      e938517e
  4. Jul 12, 2016
    • Michael Kruse's avatar
      Add CHECK line to test case. NFC. · 3b0a9934
      Michael Kruse authored
      Check not only that the compiler is not crashing, but also whether the
      probablematic part (The sequence of instructions simplified to '4') is reflected
      in the output.
      
      Thanks to Tobias for the hint.
      
      llvm-svn: 275189
      3b0a9934
    • Michael Kruse's avatar
      [SCEVAffinator] Fix assertion checking for constant divisor. · e4483643
      Michael Kruse authored
      An assertion in visitSDivInstruction() checked whether the divisor is constant
      by checking whether the argument is a ConstantInt. However, SCEVValidator allows
      the divisor to be simplified to a constant by ScalarEvolution.
      
      We synchronize the implementation of SCEVValidator and SCEVAffinator to both
      accept simplified SCEV expressions.
      
      llvm-svn: 275174
      e4483643
  5. Jul 11, 2016
  6. Jul 08, 2016
  7. Jul 06, 2016
  8. Jul 01, 2016
  9. Jun 27, 2016
  10. Jun 22, 2016
  11. Jun 12, 2016
    • Tobias Grosser's avatar
      Update isl to isl-0.17.1-57-g1879898 · 07b20952
      Tobias Grosser authored
      With this update the isl AST generation extracts disjunctive constraints early
      on. As a result, code that previously resulted in two branches with (close-to)
      identical code within them:
      
        if (P <= -1) {
          for (int c0 = 0; c0 < N; c0 += 1)
            Stmt_store(c0);
        } else if (P >= 1)
          for (int c0 = 0; c0 < N; c0 += 1)
             Stmt_store(c0);
      
      results now in only a single branch body:
      
        if (P <= -1 || P >= 1)
          for (int c0 = 0; c0 < N; c0 += 1)
             Stmt_store(c0);
      
      This resolves http://llvm.org/PR27559
      
      Besides the above change, this isl update brings better simplification of
      sets/maps containing existentially quantified dimensions and fixes a bug in
      isl's coalescing.
      
      llvm-svn: 272500
      07b20952
    • Tobias Grosser's avatar
      Expand test cases affected by next commit · 8620679e
      Tobias Grosser authored
      As these test cases will be changed in a subsequent commit, we expand and
      tighten them to make the subsequent changes to them more obvious. As part of
      this we add more context to some test cases and add CHECK-NEXT lines to ensure
      no intermediate lines are missed by accident.
      
      llvm-svn: 272499
      8620679e
  12. Jun 11, 2016
    • Tobias Grosser's avatar
      Recommit: "[FIX] Determine insertion point during SCEV expansion" · 971336d3
      Tobias Grosser authored
      This patch was originally contributed by Johannes Doerfert in r271892, but
      was in conflict with the revert in r272483.
      
      llvm-svn: 272486
      971336d3
    • Tobias Grosser's avatar
      Recommit: "Look through IntToPtr & PtrToInt instructions" · 423642a5
      Tobias Grosser authored
      IntToPtr and PtrToInt instructions are basically no-ops that we can handle as
      such. In order to generate them properly as parameters we had to improve the
      ScopExpander, though the change is the first in the direction of a more
      aggressive scalar synthetization.
      
      This patch was originally contributed by Johannes Doerfert in r271888, but was
      in conflict with the revert in r272483. This is a recommit with some minor
      adjustment to the test cases to take care of differing instruction names.
      
      llvm-svn: 272485
      423642a5
    • Tobias Grosser's avatar
      This reverts recent expression type changes · 3717aa5d
      Tobias Grosser authored
      The recent expression type changes still need more discussion, which will happen
      on phabricator or on the mailing list. The precise list of commits reverted are:
      
      - "Refactor division generation code"
      - "[NFC] Generate runtime checks after the SCoP"
      - "[FIX] Determine insertion point during SCEV expansion"
      - "Look through IntToPtr & PtrToInt instructions"
      - "Use minimal types for generated expressions"
      - "Temporarily promote values to i64 again"
      - "[NFC] Avoid unnecessary comparison for min/max expressions"
      - "[Polly] Fix -Wunused-variable warnings (NFC)"
      - "[NFC] Simplify min/max expression generation"
      - "Simplify the type adjustment in the IslExprBuilder"
      
      Some of them are just reverted as we would otherwise get conflicts. I will try
      to re-commit them if possible.
      
      llvm-svn: 272483
      3717aa5d
  13. Jun 07, 2016
  14. Jun 06, 2016
    • Johannes Doerfert's avatar
      Refactor division generation code · 8448071d
      Johannes Doerfert authored
        This patch refactors the code generation for divisions. This allows to
        always generate a shift for a power-of-two division and to utilize
        information about constant divisors in order to truncate the result
        type.
      
      llvm-svn: 271898
      8448071d
    • Johannes Doerfert's avatar
      [NFC] Generate runtime checks after the SCoP · c0ece9b6
      Johannes Doerfert authored
        We now generate runtime checks __after__ the SCoP code generation and
        not before, though they are still inserted at the same position int
        the code. This allows to modify the runtime check during SCoP code
        generation.
      
      llvm-svn: 271894
      c0ece9b6
    • Johannes Doerfert's avatar
      [FIX] Determine insertion point during SCEV expansion · 4db8d807
      Johannes Doerfert authored
      llvm-svn: 271892
      4db8d807
    • Johannes Doerfert's avatar
      Look through IntToPtr & PtrToInt instructions · dedb7693
      Johannes Doerfert authored
        IntToPtr and PtrToInt instructions are basically no-ops that we can handle as
        such. In order to generate them properly as parameters we had to improve the
        ScopExpander, though the change is the first in the direction of a more
        aggressive scalar synthetization.
      
      llvm-svn: 271888
      dedb7693
    • Johannes Doerfert's avatar
      [FIX] Do not recognize division by 0 as affine · 4b2fd892
      Johannes Doerfert authored
      llvm-svn: 271885
      4b2fd892
    • Johannes Doerfert's avatar
      Use minimal types for generated expressions · 0767a511
      Johannes Doerfert authored
        We now use the minimal necessary bit width for the generated code. If
        operations might overflow (add/sub/mul) we will try to adjust the types in
        order to ensure a non-wrapping computation. If the type adjustment is not
        possible, thus the necessary type is bigger than the type value of
        --polly-max-expr-bit-width, we will use assumptions to verify the computation
        will not wrap. However, for run-time checks we cannot build assumptions but
        instead utilize overflow tracking intrinsics.
      
      llvm-svn: 271878
      0767a511
  15. Jun 03, 2016
  16. Jun 02, 2016
Loading