[WIP] Support for reusing the input to W_k and W_v #9538
Triggered via pull request
January 25, 2025 05:13
ShashankMosaicML
synchronize
#1710
Status
Success
Total duration
12m 13s
Artifacts
–
pr-gpu.yaml
on: pull_request_target
Matrix: pytest-gpu-1
Matrix: pytest-gpu-2
Matrix: pytest-gpu-4