We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
I'm looking at the GRPO documentation
https://huggingface.co/docs/trl/main/en/grpo_trainer
#2565
I see that it is not complete, could you please make an example in Google colab since it is easier to implement and reproduce the code.
can flash attn be used? can it be used with unsloth?
thank you very much
The text was updated successfully, but these errors were encountered:
No branches or pull requests
I'm looking at the GRPO documentation
https://huggingface.co/docs/trl/main/en/grpo_trainer
#2565
I see that it is not complete, could you please make an example in Google colab since it is easier to implement and reproduce the code.
can flash attn be used?
can it be used with unsloth?
thank you very much
The text was updated successfully, but these errors were encountered: