Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Proposal] Add more datasets for discrete-action envs #258

Open
carlosgmartin opened this issue Nov 3, 2024 · 3 comments
Open

[Proposal] Add more datasets for discrete-action envs #258

carlosgmartin opened this issue Nov 3, 2024 · 3 comments

Comments

@carlosgmartin
Copy link

Proposal

Currently, there are only 2 datasets for discrete-action envs:

Both are for MiniGrid.

Would it be possible to add a greater number and variety of datasets for discrete-action envs?

@younik
Copy link
Member

younik commented Nov 3, 2024

Hi @carlosgmartin,

Robotic tasks are usually the most interesting for offline RL, and they usually have continuous action space.
Do you have any environment in mind that you would like to see in our datasets?

@carlosgmartin
Copy link
Author

carlosgmartin commented Nov 3, 2024

@younik Thanks for your quick response. I'd love to see datasets for the following discrete-action environments:

To make the task easier, here's a potential systematic way to generate a dataset for each environment:

  1. Pick a state-of-the-art RL algorithm (to keep training time as short as possible).
  2. Save every Nth training episode to the dataset.

That way the dataset includes a mixture of different levels of skill.

For example, if the environment is Breakout and the algorithm is PPO, we could create a dataset ALE/breakout/ppo-v0.

We could also create a dataset for each environment based on the random policy, e.g. ALE/breakout/random-v0.

@younik
Copy link
Member

younik commented Nov 3, 2024

Thanks for the proposal, I would love to host these datasets in our remote!
I believe ALE and minigrid expert datasets would be especially interesting for the community.

The way we usually proceed for expert dataset is:

  1. Train an agent on the env
  2. Publish the model on our HF space
  3. Publish a simple collection script on our script repo, like this for example

Would you be interested in contributing to it?
The random datasets are less interesting as it is easy for the user to generate them, but we can have for minigrid.
For ALE, it would be amazing to have a small human dataset, but of course is more work.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants