[Proposal] Add more datasets for discrete-action envs #258

carlosgmartin · 2024-11-03T03:43:58Z

Proposal

Currently, there are only 2 datasets for discrete-action envs:

Both are for MiniGrid.

Would it be possible to add a greater number and variety of datasets for discrete-action envs?

younik · 2024-11-03T10:27:34Z

Robotic tasks are usually the most interesting for offline RL, and they usually have continuous action spaces.
Do you have any environment in mind that you would like to see in our datasets?

carlosgmartin · 2024-11-03T16:00:31Z

@younik Thanks for your quick response. I'd love to see datasets for the following discrete-action environments:

- Arcade Learning Environments. Examples (all used in the original 2013 DQN paper):
  - Pong
  - Breakout
  - Space Invaders
  - Seaquest
  - Beam Rider
  - Enduro
  - Q*bert
- Minigrid Environments, beyond just Four Rooms.
- Classic Control Environments + Lunar Lander.

To make the task easier, here's a potential systematic way to generate a dataset for each environment:

- Pick a state-of-the-art RL algorithm (to keep training time as short as possible).
- Save every Nth training episode to the dataset.

That way the dataset includes a mixture of different levels of skill.
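As a rough illustration of the "save every Nth training episode" idea, here is a minimal Python sketch. The `EveryNthRecorder` class and the `fake_training_episode` stand-in are hypothetical names made up for this example; a real version would hook into the training loop of whichever algorithm is chosen and write to the dataset format rather than to a list.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

# One transition as (observation, action, reward); the types are illustrative only.
Step = Tuple[int, int, float]
Episode = List[Step]


@dataclass
class EveryNthRecorder:
    """Hypothetical helper that keeps every n-th episode seen during training."""

    n: int
    episodes: List[Episode] = field(default_factory=list)
    indices: List[int] = field(default_factory=list)
    _seen: int = 0

    def offer(self, episode: Episode) -> bool:
        """Store the episode if its index is a multiple of n; return whether it was kept."""
        keep = self._seen % self.n == 0
        if keep:
            self.episodes.append(episode)
            self.indices.append(self._seen)
        self._seen += 1
        return keep


def fake_training_episode(i: int) -> Episode:
    # Stand-in for a real rollout: the reward simply grows with the episode
    # index, mimicking an agent that improves over the course of training.
    return [(0, 0, float(i))]


recorder = EveryNthRecorder(n=100)
for i in range(1000):
    recorder.offer(fake_training_episode(i))
```

Because the kept episodes are spread evenly over training, early (low-skill) and late (high-skill) behavior both end up in the data.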

For example, if the environment is Breakout and the algorithm is PPO, we could create a dataset ALE/breakout/ppo-v0.

We could also create a dataset for each environment based on the random policy, e.g. ALE/breakout/random-v0.
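For the random-policy case, a collection loop could look roughly like the sketch below. To stay self-contained it uses a toy `ChainEnv` with two discrete actions instead of a real ALE environment; `ChainEnv`, `collect_random`, and the episode dictionary layout are all assumptions for illustration, not an existing API.

```python
import random
from typing import Dict, List


class ChainEnv:
    """Toy discrete-action env (hypothetical): walk on positions 0..size-1, goal at the right end."""

    n_actions = 2  # 0 = move left, 1 = move right

    def __init__(self, size: int = 5, max_steps: int = 20):
        self.size = size
        self.max_steps = max_steps
        self.pos = 0
        self.t = 0

    def reset(self) -> int:
        self.pos, self.t = 0, 0
        return self.pos

    def step(self, action: int):
        delta = 1 if action == 1 else -1
        self.pos = max(0, min(self.size - 1, self.pos + delta))
        self.t += 1
        terminated = self.pos == self.size - 1
        truncated = (not terminated) and self.t >= self.max_steps
        reward = 1.0 if terminated else 0.0
        return self.pos, reward, terminated, truncated


def collect_random(env: ChainEnv, n_episodes: int, seed: int) -> List[Dict[str, list]]:
    """Roll out a uniformly random policy and return one dict per episode."""
    rng = random.Random(seed)  # seeded so the dataset is reproducible
    dataset = []
    for _ in range(n_episodes):
        obs = env.reset()
        episode = {"observations": [obs], "actions": [], "rewards": []}
        done = False
        while not done:
            action = rng.randrange(env.n_actions)
            obs, reward, terminated, truncated = env.step(action)
            episode["observations"].append(obs)
            episode["actions"].append(action)
            episode["rewards"].append(reward)
            done = terminated or truncated
        dataset.append(episode)
    return dataset


dataset = collect_random(ChainEnv(), n_episodes=10, seed=0)
```

Each episode stores one more observation than actions, since the initial observation from `reset` is included.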

younik · 2024-11-03T17:05:58Z

Thanks for the proposal, I would love to host these datasets in our remote!
I believe ALE and minigrid expert datasets would be especially interesting for the community.

The way we usually proceed for expert datasets is:

- Train an agent on the env.
- Publish the model on our HF space.
- Publish a simple collection script on our script repo, like this for example.

Would you be interested in contributing to it?
The random datasets are less interesting, as it is easy for the user to generate them, but we can have them for MiniGrid.
For ALE, it would be amazing to have a small human dataset, but of course that is more work.
