Reinforcement Learning

This is an implementation of code for a reinforcement learning course.

Multi-armed Bandits

This repository implements a set of algorithms to solve the multi-armed bandit problem:

Furthermore, we implemented 2 sample bandit interfaces as examples of how the algorithms (agent) can interact with bandits (environment).

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
bandits		bandits
cartpole		cartpole
gridworld		gridworld
.gitignore		.gitignore
README.md		README.md
environment.yml		environment.yml
pyproject.toml		pyproject.toml
setup.py		setup.py