You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The pipeline and async copy feature in CUDA 11 can overlap data movement and computation.
TVM has already supported compiler passes such as InjectSoftwarePipeline/InjectDoubleBuffer/etc (https://github.com/apache/tvm-rfcs/blob/main/rfcs/0077-async-pipeline.md), however, we have not made these passes compatible with sparse workloads, which limits the performance of SparseTIR generated kernels.
The text was updated successfully, but these errors were encountered:
yzh119
changed the title
[Tracking Issue] SparseTIR Compatibility with Software Pipelining
[Tracking Issue] SparseTIR Compatibility with Software Pipelining and Async Copy
Nov 10, 2022
The pipeline and async copy feature in CUDA 11 can overlap data movement and computation.
TVM has already supported compiler passes such as InjectSoftwarePipeline/InjectDoubleBuffer/etc (https://github.com/apache/tvm-rfcs/blob/main/rfcs/0077-async-pipeline.md), however, we have not made these passes compatible with sparse workloads, which limits the performance of SparseTIR generated kernels.
The text was updated successfully, but these errors were encountered: