Fix: get_balanced_memory when using multiple GPUs with small models or quantized models with a large vocabulary #5231
Triggered via pull request: November 19, 2024 15:09
Status: Success
Total duration: 10m 22s
Artifacts: –
Annotations: 20 warnings