Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PPO Algorithm Convergence Issue: Ladder Degradation Problem #17

Open
heping103 opened this issue Sep 12, 2024 · 1 comment
Open

PPO Algorithm Convergence Issue: Ladder Degradation Problem #17

heping103 opened this issue Sep 12, 2024 · 1 comment

Comments

@heping103
Copy link

屏幕截图 2024-09-12 102525
I customized an environment and trained it with the PPO algorithm,Why does my strategy suddenly collapse as the model is trained?
Is this a problem with my environment? Or is it a common problem in reinforcement learning? How do I fix it?Thank you for your teaching and look forward to receiving a response。

@ericyangyu
Copy link
Owner

Hi, thanks for reaching out. This can be many several reasons off the top of my head, but I cannot say much unless I know more about the task you want to train on. I know it's been a few weeks since you've posted this but if you still have questions on this, feel free to email me!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants