The value network update should be:
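alpha_w = 1e-3
# initialization: build the optimizer from the value network's parameters
optimizer_w = optim.Adam(s_value_func.parameters(), lr=alpha_w)
optimizer_w.zero_grad()
policy_loss_w = -delta
policy_loss_w.backward(retain_graph=True)
# clip the gradients of the network parameters, not the loss tensor
clip_grad_norm_(s_value_func.parameters(), 0.1)
optimizer_w.step()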
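The update described above can be sketched end to end as follows. This is a minimal sketch under stated assumptions, not the repository's code: the network shape, the 4-dimensional state, and the target return are made up for illustration, and it uses a squared-error loss on `delta` instead of the `-delta` loss from the post. The helper name `update_value` is hypothetical.

```python
import torch
import torch.nn as nn
import torch.optim as optim
from torch.nn.utils import clip_grad_norm_

torch.manual_seed(0)

# Hypothetical stand-in for the repository's value network (assumed 4-dim state).
s_value_func = nn.Sequential(nn.Linear(4, 16), nn.ReLU(), nn.Linear(16, 1))

alpha_w = 1e-3
# Create the optimizer once, outside the training loop.
optimizer_w = optim.Adam(s_value_func.parameters(), lr=alpha_w)


def update_value(state, target):
    """Run one value-network step and return the loss as a float."""
    value = s_value_func(state).squeeze(-1)
    delta = target - value  # delta = G - v(s)
    # Assumption: squared-error loss; its gradient is -delta * grad v(s).
    loss_w = 0.5 * delta.pow(2).mean()

    optimizer_w.zero_grad()
    loss_w.backward()  # this graph is used only once, so no retain_graph
    # Clip the gradients stored on the parameters, not the loss tensor.
    clip_grad_norm_(s_value_func.parameters(), 0.1)
    optimizer_w.step()
    return loss_w.item()


state = torch.randn(1, 4)      # made-up state
target = torch.tensor([2.0])   # made-up return G
losses = [update_value(state, target) for _ in range(200)]
```

Note that `clip_grad_norm_` expects an iterable of parameters (or a single tensor whose `.grad` is populated), which is why it receives `s_value_func.parameters()` here.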
There seems to be an error in this code. When I run it, I get an error about the computation graph. Did you run into the same problem?
Same problem here. You can debug it step by step to see the errors.
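The thread does not quote the exact error message, so the following is only a guess at one frequent cause: when the same `delta` feeds both the policy loss and the value loss, the first `backward()` frees the shared graph and the second one raises a `RuntimeError`. The sketch below reproduces that situation with a hypothetical `value_net` and a stand-in `log_prob`, and shows that detaching `delta` for the policy loss avoids it without `retain_graph=True`.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical value network and inputs, for illustration only.
value_net = nn.Sequential(nn.Linear(4, 8), nn.Tanh(), nn.Linear(8, 1))
state = torch.randn(1, 4)
G = torch.tensor([[1.0]])  # made-up return


def two_backward_passes(detach_for_policy):
    """Backpropagate a policy loss, then a value loss, sharing delta."""
    value_net.zero_grad()
    delta = G - value_net(state)
    # Stand-in for the log-probability produced by a policy network.
    log_prob = torch.tensor([-0.5], requires_grad=True)

    # Detaching cuts the policy loss off from the value network's graph.
    weight = delta.detach() if detach_for_policy else delta
    policy_loss = -(weight * log_prob).sum()
    value_loss = 0.5 * delta.pow(2).sum()
    try:
        policy_loss.backward()
        value_loss.backward()  # fails if the shared graph was already freed
        return "ok"
    except RuntimeError:
        return "RuntimeError"


shared = two_backward_passes(detach_for_policy=False)
detached = two_backward_passes(detach_for_policy=True)
```

If the error in the repository has a different cause, wrapping the training step in `torch.autograd.set_detect_anomaly(True)` is one way to get a more detailed traceback while stepping through it.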