[WIP] Support for reusing the input to W_k and W_v #9519
Triggered via pull request
January 13, 2025 16:49
ShashankMosaicML
synchronize
#1710
Status
Success
Total duration
12m 50s
Artifacts
–
pr-gpu.yaml
on: pull_request_target
Matrix: pytest-gpu-1
Matrix: pytest-gpu-2
Matrix: pytest-gpu-4