Support for mixed discrete-continuous actions #856

CloudyDory · 2023-04-20T07:26:14Z

CloudyDory
Apr 20, 2023

Hi, I have a custom environment that needs to input both discrete and continuous actions. The continuous action should be a Gaussian policy with learned mean and standard deviation. I hope to train a PPO agent to solve the task. Is it still possible to use Tianshou's default Collector and OnPolicyTrainer, or should I write my custom Collector and OnPolicyTrainer?

Trinkle23897 · 2023-04-20T20:05:06Z

Trinkle23897
Apr 20, 2023
Maintainer

imo you only need to inherit PPOPolicy and rewrite parts of them (forward, learn) to let it support combined action input. No need to rewrite Collector and Trainer

2 replies

CloudyDory Apr 23, 2023
Author

Hi, I guess the collector should also be rewritten as well?

Trinkle23897 Apr 23, 2023
Maintainer

I don't think so. You can pack your action as a dict (similar to dict action space) each time, which will be converted to Batch automatically in Collector.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for mixed discrete-continuous actions #856

{{title}}

Replies: 1 comment 2 replies

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

Select a reply

Support for mixed discrete-continuous actions #856

CloudyDory Apr 20, 2023

Replies: 1 comment · 2 replies

Trinkle23897 Apr 20, 2023 Maintainer

CloudyDory Apr 23, 2023 Author

Trinkle23897 Apr 23, 2023 Maintainer

CloudyDory
Apr 20, 2023

Replies: 1 comment 2 replies

Trinkle23897
Apr 20, 2023
Maintainer

CloudyDory Apr 23, 2023
Author

Trinkle23897 Apr 23, 2023
Maintainer