[CUDA] Add cupti #129
cuda.yml
on: pull_request
linux-arm64: 32m 5s
linux-x86_64: 24m 2s
windows-x86_64: 56m 22s
redeploy: 8s