Markov Game Learning Convergence Experiment

Description

The purpose of this project was to test the convergence of four learning algorithms in a simple zero-sum Markov game (a two-player soccer game on a 4x2 grid). Each algorithm simulates 1,000,000 turns and checks whether the Q-value of a particular state converges. The algorithms tested are Q-Learning, Friend Q-Learning, Foe Q-Learning, and Correlated Q-Learning. Foe-Q and Correlated-Q use linear programming.
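As a rough illustration of what is being tracked, the sketch below shows the textbook tabular Q-learning update and a Friend-Q style state value (the maximum over joint actions). It is not taken from this repository: the function names, the dictionary-based Q-table, and the values of `alpha` and `gamma` are all assumptions made for the example.

```python
def q_learning_update(q, state, action, reward, next_state, actions,
                      alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning step and return (pre, post) Q-values.

    q is a dict keyed by (state, action); missing entries count as 0.0.
    alpha (learning rate) and gamma (discount) are illustrative defaults.
    """
    pre = q.get((state, action), 0.0)
    # Greedy estimate of the value of the next state.
    best_next = max(q.get((next_state, a), 0.0) for a in actions)
    post = pre + alpha * (reward + gamma * best_next - pre)
    q[(state, action)] = post
    return pre, post


def friend_value(q, state, actions, opponent_actions):
    """Friend-Q style state value: the best entry over all joint actions.

    q is a dict keyed by (state, action, opponent_action).
    """
    return max(q.get((state, a, o), 0.0)
               for a in actions
               for o in opponent_actions)
```

The absolute difference between the returned `pre` and `post` values for one fixed state-action pair is the kind of quantity one would watch to judge convergence.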

Project structure

MarkovGameLearning
- soccer
  - actions.py
  - player.py
  - soccer_game.py
  - solver.py
  - state.py
- main.py
- README.md

How To Run

Install Python 3.5.
Install cvxopt and its required dependencies: http://cvxopt.org/install/
Using Python 3.5, run main.py. Some logging is printed to standard output to indicate how far along each test is.
Results for each type of learning are written to the files q-learning.csv, friend-q.csv, foe-q.csv, and ce-q.csv. Each file is a CSV with the following columns: time-step, q-value-diff, pre-q-value, post-q-value, and (for foe-q and ce-q only) the action or joint-action probabilities.
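The result files can be loaded with the standard library for inspection or plotting. The following is a minimal sketch, assuming the files have no header row, that time-step is an integer, and that the probabilities appear as extra comma-separated fields at the end of each row. The function name `read_results` and the sample rows are hypothetical and do not come from an actual run.

```python
import csv
import io


def read_results(handle):
    """Parse result rows from an open text file or file-like object.

    Assumed column order (from the README): time-step, q-value-diff,
    pre-q-value, post-q-value, then zero or more probabilities.
    """
    rows = []
    for record in csv.reader(handle):
        if not record:
            continue  # skip blank lines
        rows.append({
            "time_step": int(record[0]),
            "q_value_diff": float(record[1]),
            "pre_q_value": float(record[2]),
            "post_q_value": float(record[3]),
            # Empty for q-learning and friend-q files.
            "probabilities": [float(p) for p in record[4:]],
        })
    return rows


# Made-up sample data in the assumed layout, for illustration only.
sample = io.StringIO("10,0.5,0.0,0.5\n20,0.25,0.5,0.75,0.4,0.6\n")
results = read_results(sample)
```

To read a real file, the same function could be called as `read_results(open("foe-q.csv", newline=""))`.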
