Skip to content
  1. Aug 03, 2012
    • Tobias Grosser's avatar
      Add preliminary implementation for GPGPU code generation. · 6217e18a
      Tobias Grosser authored
      Translate the selected parallel loop body into a ptx string and run it with the
      cuda driver API. We limit this preliminary implementation to target the
      following special test cases:
      
        - Support only 2-dimensional parallel loops with or without only one innermost
          non-parallel loop.
        - Support write memory access to only one array in a SCoP.
      
      The patch was committed with smaller changes to the build system:
      
      There is now a flag to enable gpu code generation explictly. This was required
      as we need the llvm.codegen() patch applied on the llvm sources, to compile this
      feature correctly. Also, enabling gpu code generation does not require cuda.
      This requirement was removed to allow 'make polly-test' runs, even without an
      installed cuda runtime.
      
      Contributed by:  Yabin Hu  <yabin.hwu@gmail.com>
      
      llvm-svn: 161239
      6217e18a
  2. Aug 02, 2012
  3. Jun 06, 2012
  4. May 07, 2012
  5. May 04, 2012
  6. Oct 04, 2011
  7. Apr 29, 2011
Loading