Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

train GRPO Google colab #2609

Open
NickyDark1 opened this issue Jan 22, 2025 · 0 comments
Open

train GRPO Google colab #2609

NickyDark1 opened this issue Jan 22, 2025 · 0 comments
Labels
📚 documentation Improvements or additions to documentation 🦥 unsloth Related to Unsloth

Comments

@NickyDark1
Copy link

NickyDark1 commented Jan 22, 2025

I'm looking at the GRPO documentation

https://huggingface.co/docs/trl/main/en/grpo_trainer

#2565

I see that it is not complete, could you please make an example in Google colab since it is easier to implement and reproduce the code.

can flash attn be used?
can it be used with unsloth?

thank you very much

@github-actions github-actions bot added 📚 documentation Improvements or additions to documentation 🦥 unsloth Related to Unsloth labels Jan 22, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
📚 documentation Improvements or additions to documentation 🦥 unsloth Related to Unsloth
Projects
None yet
Development

No branches or pull requests

1 participant