[mlir][tensor] Implement TilingInterface for tensor.pack op.
We can compute the offsets and sizes for the slice of the input because the iteration domain is defined over the outer loops. If a dimension is tiled, the i-th index is the product of offset_i and inner_tile_i.

Unlike tiling a pad op, we do not have to deal with reading zero data from the input, because the tiling sizes apply to the packed outer dimensions. For each packed tile we read either the entire tile or a partial tile, so the scf.if and tensor.generate ops are not needed in this context.

Co-authored-by: Lorenzo Chelini <l.chelini@icloud.com>

Reviewed By: rengolin, mravishankar

Differential Revision: https://reviews.llvm.org/D138631
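
For illustration only, the offset and size computation described in this message can be sketched in plain Python. This is not the code in the patch: the function name, its parameters, and the dictionary mapping input dimensions to inner tile sizes are all hypothetical. The sketch also assumes no outer dimension permutation and that each outer offset lies inside the packed outer range.

```python
def input_slice_for_packed_tile(outer_offsets, outer_sizes, inner_tiles, input_shape):
    """Map a tile of the packed outer dimensions back to a slice of the input.

    Hypothetical helper, not the MLIR implementation.
    inner_tiles maps an input dimension to its inner tile size; dimensions
    that are not packed are simply absent from the mapping.
    """
    offsets, sizes = [], []
    for dim, (off, size) in enumerate(zip(outer_offsets, outer_sizes)):
        tile = inner_tiles.get(dim)
        if tile is None:
            # Untiled dimension: the outer loop indexes the input directly.
            offsets.append(off)
            sizes.append(size)
            continue
        # Tiled dimension: the input index is offset_i * inner_tile_i.
        start = off * tile
        # Read a full run of tiles, or a partial one at the input boundary.
        extent = min(size * tile, input_shape[dim] - start)
        offsets.append(start)
        sizes.append(extent)
    return offsets, sizes


# A 10x30 input packed with inner tiles 4 and 8 (outer dims 3x4).
interior = input_slice_for_packed_tile([0, 0], [2, 2], {0: 4, 1: 8}, [10, 30])
boundary = input_slice_for_packed_tile([2, 2], [1, 2], {0: 4, 1: 8}, [10, 30])
```

Here `interior` covers only whole tiles, while `boundary` is clamped to what remains of the input, so the read never goes out of bounds and no conditional read is required.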