DDPG with Curiosity Driven Exploration and Multi-Criteria Hindsight Experience Replay

(modified from OpenAI Baselines commit #3900f2a4473ce6b26a8129372ca8d5e02c766c9c)

Prerequisites

Baselines requires python3 (>=3.5) with the development headers. You'll also need system packages CMake, OpenMPI and zlib. Those can be installed as follows

Ubuntu

sudo apt-get update && sudo apt-get install cmake libopenmpi-dev python3-dev zlib1g-dev

Mac OS X

Installation of system packages on Mac requires Homebrew. With Homebrew installed, run the follwing:

brew install cmake openmpi

Virtual environment

From the general python package sanity perspective, it is a good idea to use virtual environments (virtualenvs) to make sure packages from different projects do not interfere with each other. You can install virtualenv (which is itself a pip package) via

pip install virtualenv

Virtualenvs are essentially folders that have copies of python executable and all python packages. To create a virtualenv called venv with python3, one runs

virtualenv /path/to/venv --python=python3

To activate a virtualenv:

. /path/to/venv/bin/activate

More thorough tutorial on virtualenvs and options can be found here

Installation

Clone the repo and cd into it:

git clone https://github.com/CDMCH/ddpg-with-curiosity-and-multi-criteria-her.git
cd ddpg-with-curiosity-and-multi-criteria-her

If using virtualenv, create a new virtualenv and activate it

virtualenv env --python=python3
. env/bin/activate

Install baselines package

pip install -e .

Block Stacking Environments

The block stacking environments associated with the paper can be found here.

They use the MuJoCo physics simulator, which is proprietary and requires binaries and a license (temporary 30-day license can be obtained from www.mujoco.org). Instructions on setting up MuJoCo can be found here

Example training script

After installing the block stacking environments, you can run the example script to train an agent to stack 2 blocks with sparse rewards:

./train_on_stack2_sparse_full_curriculum_curiosity_multi_criteria.sh

Or visualize the pretrained agents with:

./watch_stack2.sh
./watch_stack3.sh
./watch_stack4.sh

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
ddpg_curiosity_mc_her		ddpg_curiosity_mc_her
trained_models		trained_models
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
setup.py		setup.py
train_on_stack2_sparse_full_curriculum_curiosity_multi_criteria.sh		train_on_stack2_sparse_full_curriculum_curiosity_multi_criteria.sh
watch_stack2.sh		watch_stack2.sh
watch_stack3.sh		watch_stack3.sh
watch_stack4.sh		watch_stack4.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DDPG with Curiosity Driven Exploration and Multi-Criteria Hindsight Experience Replay

Prerequisites

Ubuntu

Mac OS X

Virtual environment

Installation

Block Stacking Environments

Example training script

About

Releases

Packages

Languages

License

julio-design/ddpg-curiosity-and-multi-criteria-her

Folders and files

Latest commit

History

Repository files navigation

DDPG with Curiosity Driven Exploration and Multi-Criteria Hindsight Experience Replay

Prerequisites

Ubuntu

Mac OS X

Virtual environment

Installation

Block Stacking Environments

Example training script

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages