Critic Network Error and Poor Rewards in mujoco_sac.py #1166
Unanswered
payamparvizi
asked this question in
Q&A
Replies: 1 comment
-
Hi, sorry for the late answer. It's better to open an issue instead of a discussion, I rarely look at those ;) I'm going to look into it |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello,
I am trying to run the code in
tianshou/examples/mujoco/mujoco_sac.py
. However, I got an error in the critic networks (net_c1
andnet_c2
). The error message indicates that settingconcat=True
causes a size mismatch.I commented that part and ran the code, but I did not get good rewards from the Ant-v4 environment as expected. To enable
concat=True
, I customized the network, but the results were still not sufficient. The rewards I get are always below 0, no matter how long I run it.Is it possible that the hyperparameters are incorrect?
Please let me know how I can solve this problem.
Regards
Beta Was this translation helpful? Give feedback.
All reactions