Adapted from Minimalistic Gridworld Environment (MiniGrid)
Install Python >= 3.5 and clone this project:
$ git clone https://github.com/DVSimon/295bEnv.git
$ cd 295bEnv
Set up the virtual environment:
$ python3 -m venv env
$ source env/bin/activate
Install project dependencies:
$ pip3 install -r requirements.txt
Observation-based Q-Learning simulation and training
./QL-obs.py
Location-based DQN training
./DQN-loc.py
Observation-based DQN training
./DQN-obs.py
Manually control agents with keyboard input
./manual_control.py
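The observation-based Q-learning script follows the standard tabular update. The sketch below is illustrative, not the project's code: the hyperparameter names (ALPHA, GAMMA, EPSILON) and the action set are assumptions.

```python
import random
from collections import defaultdict

# Hypothetical sketch of a tabular Q-learning step, as typically done in
# an observation-based script like QL-obs.py. Hyperparameters are assumed.
ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.1
ACTIONS = ["up", "down", "left", "right"]

Q = defaultdict(float)  # maps (observation, action) -> estimated value

def choose_action(obs, rng=random):
    """Epsilon-greedy selection over the agent's current observation."""
    if rng.random() < EPSILON:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: Q[(obs, a)])

def update(obs, action, reward, next_obs):
    """One Q-learning step: Q <- Q + alpha * (target - Q)."""
    target = reward + GAMMA * max(Q[(next_obs, a)] for a in ACTIONS)
    Q[(obs, action)] += ALPHA * (target - Q[(obs, action)])

# A single illustrative transition: from observation "s0", moving right
# earns +1 reward and leads to observation "s1".
update("s0", "right", 1.0, "s1")
print(Q[("s0", "right")])  # 0.1 after one update from a zero table
```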
- Navigate to the config.yml file
- grid_size sets the height and width of the grid environment, including the outer walls
- obstacles sets the number of obstacles placed (at random) in the environment
- agents sets the number of agents in the environment
- obs_radius sets how far into the surrounding grid each agent can see
- reward_type of 0 uses the generic +1/-1 reward formula
- reward_type of 1 uses a custom reward formula based on the number of times a cell has been visited
- seed sets the seed used for environment generation, for reproducibility
- These parameters only affect the Q-learning implementation, not DQN
- Set grid_render to True
- Set grid_obs_render to True
- Set obs_render to True
- Set sleep to desired time (in seconds)
- Set regression_type to one of null, lin, quad, or exp
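Putting the parameters above together, a config.yml might look like the following sketch (the values are illustrative examples, not project defaults):

```yaml
# Illustrative config.yml -- example values, not project defaults
grid_size: 10         # height/width of the grid, including outer walls
obstacles: 5          # number of randomly placed obstacles
agents: 2             # number of agents in the environment
obs_radius: 2         # visibility radius around each agent
reward_type: 1        # 0 = generic +1/-1, 1 = visit-count-based reward
seed: 42              # seed for reproducible environment generation
grid_render: True
grid_obs_render: True
obs_render: True
sleep: 0.1            # delay between rendered frames, in seconds
regression_type: lin  # one of null/lin/quad/exp
```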
The DQN_loc.py script is the location-based DQN implementation.
Each agent takes its action one at a time.
The entire image of the environment is fed to a neural network (NN).
The NN outputs a single optimal action for each agent in turn.
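This location-based scheme can be sketched in a few lines. The network below is a toy two-layer NumPy model standing in for the project's actual DQN, and the grid contents, sizes, and weights are all made-up assumptions:

```python
import numpy as np

# Toy sketch (not the project's code) of the location-based scheme:
# the full grid image is flattened, fed through a tiny two-layer
# network, and each agent queries it in turn for a single best action.
rng = np.random.default_rng(0)
N_ACTIONS = 4
GRID = rng.integers(0, 3, size=(8, 8)).astype(float)  # toy grid image

W1 = rng.normal(scale=0.1, size=(64, 16))  # 8*8 inputs -> 16 hidden
W2 = rng.normal(scale=0.1, size=(16, N_ACTIONS))

def q_values(grid):
    """Forward pass: whole grid image in, one Q-value per action out."""
    h = np.maximum(grid.reshape(-1) @ W1, 0.0)  # ReLU hidden layer
    return h @ W2

# Agents act one by one: each takes the greedy action for the grid.
for agent_id in range(2):
    action = int(np.argmax(q_values(GRID)))
    print(f"agent {agent_id} takes action {action}")
```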
The DQN_obs.py script is the observation-based DQN implementation.
Each agent's observation space is fed to the NN.
The NN outputs action values for all agents individually and simultaneously.
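The key difference from the location-based variant is batching: all agents' local observations go through the network in one forward pass. The sketch below assumes a flattened 5x5 observation window and a toy NumPy network, neither of which is taken from the project's code:

```python
import numpy as np

# Toy sketch (assumed, not the project's code) of the observation-based
# scheme: each agent's local observation is stacked into a batch and one
# forward pass yields action values for all agents simultaneously.
rng = np.random.default_rng(1)
N_AGENTS, OBS_DIM, N_ACTIONS = 3, 25, 4  # e.g. a 5x5 window, flattened

W1 = rng.normal(scale=0.1, size=(OBS_DIM, 16))
W2 = rng.normal(scale=0.1, size=(16, N_ACTIONS))

def batched_q(observations):
    """One forward pass over a (n_agents, obs_dim) batch of observations."""
    h = np.maximum(observations @ W1, 0.0)  # ReLU hidden layer
    return h @ W2  # shape: (n_agents, n_actions)

obs_batch = rng.normal(size=(N_AGENTS, OBS_DIM))
actions = np.argmax(batched_q(obs_batch), axis=1)  # one action per agent
print(actions.shape)  # (3,)
```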