Policy iteration code problem #10

Open
FUNKYQ opened this issue Feb 21, 2023 · 1 comment

FUNKYQ commented Feb 21, 2023

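Every time update_V runs, the compute_V call that follows uses a policy derived from the current V, not the policy produced by the previous policy-improvement step. Doesn't that make this equivalent to value iteration? It doesn't show policy evaluation and policy improvement as two separate steps.
Could someone take a look at this for me?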
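
For reference, here is a minimal sketch of the distinction being asked about. It does not reproduce the repository's update_V or compute_V, whose code is not shown in this thread; the toy chain MDP and every function name below are hypothetical. In policy iteration the evaluation step reads its action from a stored policy and the policy changes only in a separate improvement step, whereas value iteration takes a max over actions inside every backup.

```python
# Hypothetical 4-state chain MDP, used only for illustration.
GAMMA = 0.9
N_STATES = 4        # states 0..3; state 3 is terminal
ACTIONS = (0, 1)    # 0 = move left, 1 = move right


def step(s, a):
    """Deterministic transition: return (next_state, reward)."""
    if s == N_STATES - 1:
        return s, 0.0                       # terminal state absorbs, no reward
    s2 = max(s - 1, 0) if a == 0 else s + 1
    return s2, -1.0                         # every move costs 1


def q_value(V, s, a):
    """One-step lookahead value of taking action a in state s."""
    s2, r = step(s, a)
    return r + GAMMA * V[s2]


def policy_evaluation(policy, V, tol=1e-8):
    """Evaluate a FIXED policy: the action comes from `policy`, never from a max over V."""
    while True:
        delta = 0.0
        for s in range(N_STATES):
            v_new = q_value(V, s, policy[s])
            delta = max(delta, abs(v_new - V[s]))
            V[s] = v_new
        if delta < tol:
            return V


def policy_improvement(policy, V):
    """Make the policy greedy with respect to V; return True if it did not change."""
    stable = True
    for s in range(N_STATES):
        best = max(ACTIONS, key=lambda a: q_value(V, s, a))
        if best != policy[s]:
            policy[s] = best
            stable = False
    return stable


def policy_iteration():
    """Alternate full evaluation of the stored policy with a separate improvement step."""
    policy = [0] * N_STATES                 # start with "always left"
    V = [0.0] * N_STATES
    while True:
        policy_evaluation(policy, V)
        if policy_improvement(policy, V):
            return policy, V


def value_iteration(tol=1e-8):
    """For contrast: each backup maximises over actions, with no stored policy."""
    V = [0.0] * N_STATES
    while True:
        delta = 0.0
        for s in range(N_STATES):
            v_new = max(q_value(V, s, a) for a in ACTIONS)
            delta = max(delta, abs(v_new - V[s]))
            V[s] = v_new
        if delta < tol:
            return V


if __name__ == "__main__":
    pi, V_pi = policy_iteration()
    print("policy iteration:", pi, V_pi)
    print("value iteration: ", value_iteration())
```

Both routines converge to the same optimal values on this toy problem; what differs is where the action comes from during each backup. If an evaluation routine picks its action greedily from the current V instead of reading a stored policy, it behaves like the `value_iteration` loop above, which would match the concern raised here.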


Easyboy0405 commented Feb 21, 2023 via email
