-
Notifications
You must be signed in to change notification settings - Fork 268
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Will veRL support deepspeed? #221
Comments
in the short term - no. we have limited staff maintaining the repo. That being said, we always welcome contribution from the community. |
Thanks for the question! We had a deepspeed backend one year ago but deprecate it as not enough man power to maintain it. Also, we found that torch FSDP is comparable to (or even better) DeepSpeed. It can support training up to 70B models with high MFU |
How to set FSDP zero3? |
It is said in the paper that
However, I can only find support for FSDP and Megatron-LM in the current version. Is there any plan to support Deepspeed in the near future?
I think Deepspeed has some advantages over FSDP and is more feasible for large-scale training, and its advantages are also orthogonal to that of Megatron-LM. Therefore we may achieve higher speedup if we can support Deepspeed.
The text was updated successfully, but these errors were encountered: