Keras implementation of Branching Dueling Q-Network (BDQ algorithm)

This repository is a Keras implementation of the Branching Dueling Q-Network (BDQ) algorithm. It is based on https://github.com/MoMe36/BranchingDQN, on the paper https://arxiv.org/pdf/1711.08946.pdf, and on the paper authors' implementation https://github.com/atavakol/action-branching-agents/tree/master/agents/bdq

BDQ allows a Q-learning agent to select multiple actions simultaneously. The number of network outputs scales linearly with the action space dimension, which addresses the 'curse of dimensionality' that DQN faces when a multi-dimensional action space is discretized. The same principle could also be applied to other RL algorithms that suffer from the curse of action-space dimensionality.
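To make the scaling argument concrete, here is a minimal NumPy sketch of the branching idea as described in the paper: one shared state value, one advantage vector per action dimension (branch), and an independent greedy choice in each branch. The function names, the branch and bin counts, and the random advantages are illustrative assumptions; they are not taken from `network.py` or `agent.py` in this repository.

```python
import numpy as np

def branch_q_values(state_value, advantages):
    """Combine a shared state value with per-branch advantages.

    state_value: scalar V(s)
    advantages:  array of shape (num_branches, bins_per_branch)
    Returns Q-values of the same shape, subtracting the mean advantage
    within each branch (the dueling aggregation used per branch).
    """
    advantages = np.asarray(advantages, dtype=float)
    return state_value + advantages - advantages.mean(axis=1, keepdims=True)

def greedy_action(q_values):
    """Pick the best sub-action index in every branch independently."""
    return q_values.argmax(axis=1)

# Illustrative sizes only (assumed, not read from the repository).
num_branches, bins_per_branch = 4, 6

rng = np.random.default_rng(0)
advantages = rng.normal(size=(num_branches, bins_per_branch))
q = branch_q_values(state_value=1.5, advantages=advantages)
action = greedy_action(q)  # one sub-action index per branch

# Output count of a branching head vs. a single head over all joint actions.
branching_outputs = num_branches * bins_per_branch  # grows linearly
joint_outputs = bins_per_branch ** num_branches     # grows exponentially
print(branching_outputs, joint_outputs)
```

With 4 branches of 6 bins each, the branching head needs 24 advantage outputs, whereas a single DQN head over every joint action would need 1296.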

This Keras implementation of BDQ is demonstrated on the BipedalWalker-v3 environment.
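BipedalWalker-v3 has a continuous action space with 4 dimensions, each bounded in [-1, 1], so a branching agent has to map one discrete choice per branch back to a continuous value. The sketch below shows one common way to do that with evenly spaced bins. The bin count and the helper name `to_continuous` are assumptions made for illustration and may differ from what this repository does.

```python
import numpy as np

ACTION_DIM = 4        # BipedalWalker-v3 has 4 continuous action dimensions
BINS_PER_BRANCH = 6   # assumed value, not taken from the repository
LOW, HIGH = -1.0, 1.0 # action bounds of the environment

# Evenly spaced candidate values shared by every branch.
bin_values = np.linspace(LOW, HIGH, BINS_PER_BRANCH)

def to_continuous(branch_indices):
    """Map one discrete bin index per branch to a continuous action vector."""
    idx = np.asarray(branch_indices)
    return bin_values[idx]

# Example: lowest bin, highest bin, and two middle bins.
torques = to_continuous([0, 5, 2, 3])
print(torques)
```

A finer grid gives smoother control at the cost of more outputs per branch; the total still grows linearly with the number of branches.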

How to use:

To train an agent, run:

python train.py

To see the agent perform:

python enjoy.py

