Unverified Commit afe40062 authored Oct 04, 2023 by Guray Ozen Committed by GitHub Oct 04, 2023

[MLIR] Use `test-lower-to-nvvm` for sm_90 Integration Tests on GitHub (#68184)

This PR enables `test-lower-to-nvvm` pass pipeline for the integration
tests for NVIDIA sm_90 architecture.

This PR adjusts `test-lower-to-nvvm` pass in two ways: 

1) Calls `createConvertNVGPUToNVVMPass` before the outlining process.
This particular pass is responsible for generating both device and host
code. On the host, it calls the CUDA driver to build the TMA descriptor
(`cuTensorMap`).

2) Integrates the `createConvertNVVMToLLVMPass` to generate PTXs for
NVVM Ops.

parent 20fc2ffb

Show whitespace changes

Inline Side-by-side

Please to comment