I set out to learn reinforcement learning by training agents on a game built with Pygame.

I have documented my whole journey while learning here: https://medium.com/@manubotija/list/my-trip-into-reinforcement-learning-d6c244d5aa29

Install using conda or venv

pip install -r requirements.txt

This project uses a specific branch of Stable-Baselines3 (sb3) that supports gym 0.26. Install it with the following command:

pip install git+https://github.com/carlosluis/stable-baselines3@fix_tests

See DLR-RM/stable-baselines3#780 for more information.
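
To confirm that the environment ended up with the expected packages, a quick check like the one below may help. It is only a sketch: the helper names `installed_version` and `is_gym_026` are made up for illustration, and it assumes the packages are registered under the distribution names `gym` and `stable-baselines3`.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version string of a distribution, or None if missing."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


def is_gym_026(ver):
    """True if a version string belongs to the gym 0.26 series."""
    return ver is not None and ver.split(".")[:2] == ["0", "26"]


if __name__ == "__main__":
    # Distribution names are assumptions; adjust them if your install differs.
    for name in ("gym", "stable-baselines3"):
        print(name, installed_version(name) or "not installed")
    if not is_gym_026(installed_version("gym")):
        print("Warning: this project expects gym 0.26")
```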

Running

Running python main.py -h lists all available options. Some examples:

Play a game:

python main.py play mid-barrier-no-proj config-4

Train a model:

python main.py train mid-barrier-no-proj config-4 --time_steps 300000 --project_name TEST

Evaluate a model:

python main.py evaluate mid-barrier-no-proj config-4 --model_path path/to/best_model.zip --render
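
For readers curious how a command line of this shape can be wired up, here is a minimal argparse sketch that accepts the three example commands above. It is not the actual main.py: the function name `build_parser` and the argument names `scenario` and `reward_config` are assumptions, and the real script may define more options or different defaults.

```python
import argparse


def build_parser():
    """Build a parser that mirrors the play / train / evaluate examples."""
    parser = argparse.ArgumentParser(prog="main.py")
    sub = parser.add_subparsers(dest="command", required=True)
    for name in ("play", "train", "evaluate"):
        cmd = sub.add_parser(name)
        # Positional names are hypothetical labels for the two example values.
        cmd.add_argument("scenario")       # e.g. mid-barrier-no-proj
        cmd.add_argument("reward_config")  # e.g. config-4
        if name == "train":
            cmd.add_argument("--time_steps", type=int)
            cmd.add_argument("--project_name")
        if name == "evaluate":
            cmd.add_argument("--model_path")
            cmd.add_argument("--render", action="store_true")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args)
```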

TODO:

Run hyperparam search from CLI

Training metrics/heuristics

On my MacBook Air M1, I get 3-4k fps.
PPO usually starts showing improvements after 200-300k steps. Progress flattens at 1M steps.
The best result so far for the mid-barrier-no-proj scenario is a success rate of 70%, with an average score of 0.5-0.6 (reward scheme config-4).
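
These figures give a rough idea of wall-clock training time. The sketch below assumes that "fps" means environment steps per second and that throughput stays constant over the whole run; both are assumptions, and the helper name `training_minutes` is made up for illustration.

```python
def training_minutes(total_steps, steps_per_second):
    """Estimate wall-clock minutes for a run at a constant step rate."""
    return total_steps / steps_per_second / 60


if __name__ == "__main__":
    # Step counts and rates are taken from the heuristics above.
    for steps in (300_000, 1_000_000):
        for fps in (3_000, 4_000):
            minutes = training_minutes(steps, fps)
            print(f"{steps:>9} steps at {fps} fps: about {minutes:.1f} min")
```

Under these assumptions, a 1M-step run takes roughly 4 to 6 minutes, and the first improvements at 200-300k steps would show up within the first two minutes.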

Other tricks

Since the scenario requires Pygame, when training on a VM in the cloud you may need to apply this trick to prevent Pygame from failing to launch:

import os
os.environ["SDL_VIDEODRIVER"] = "dummy"
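
A slightly more defensive variant sets the variable only when no display seems to be available, so the same script still opens a window on a desktop machine. This is a sketch: the helper name `enable_headless_sdl` is made up, and treating a missing DISPLAY or WAYLAND_DISPLAY on Linux as "headless" is an assumption that may not hold on every setup.

```python
import os
import sys


def enable_headless_sdl(force=False):
    """Point SDL at its dummy video driver when no display seems available.

    Returns True if this call set SDL_VIDEODRIVER, False otherwise.
    """
    # Assumption: on Linux, no DISPLAY / WAYLAND_DISPLAY means a headless machine.
    has_display = bool(os.environ.get("DISPLAY") or os.environ.get("WAYLAND_DISPLAY"))
    headless_linux = sys.platform.startswith("linux") and not has_display
    if force or headless_linux:
        os.environ["SDL_VIDEODRIVER"] = "dummy"
        return True
    return False


# Call this before Pygame initializes its display; the simplest way is to
# call it before `import pygame`.
enable_headless_sdl()
```

With the dummy driver active no window is shown, so rendering to the screen is not useful in that mode.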

Old code

Under old_code is my implementation of DQN, various utilities, and the training pipeline (up to DAY 12). Probably none of it works out of the box, since I have not kept the game or the wrapper backwards compatible.

