This repository is the implementation of the paper "Leveraging Symmetries in Gaits for Reinforcement Learning: A Case Study on Quadrupedal Gaits", built on Isaac Gym and the Isaac Gym Benchmark Environments.
Features:
- Symmetry-based Reward Design for RL: Incorporate three symmetries (temporal symmetry, time-reversal symmetry, and morphological symmetry) into the reward function, and train four gaits for the quadrupedal robot Bittle (an illustrative sketch of one such reward term follows below).
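As an illustration of the general idea (not the exact reward formulation used in the paper), a morphological-symmetry term can reward the agent for matching each joint trajectory with its mirrored counterpart. In the sketch below, the mirror permutation, the kernel width, and the function name are hypothetical.

```python
import torch

def morphological_symmetry_reward(dof_pos, mirror_idx, sigma=0.25):
    """Illustrative symmetry reward term (not the paper's exact formulation).

    dof_pos    : (num_envs, num_dof) tensor of joint positions
    mirror_idx : index permutation mapping each joint to its morphological mirror
    sigma      : hypothetical kernel width controlling how sharply asymmetry is penalized
    """
    # Squared mismatch between each joint and its mirrored counterpart
    asymmetry = torch.sum(torch.square(dof_pos - dof_pos[:, mirror_idx]), dim=-1)
    # Perfectly symmetric configurations receive reward 1; asymmetric ones decay toward 0
    return torch.exp(-asymmetry / sigma)
```

Temporal- and time-reversal-symmetry terms can be built analogously by comparing the current state against a time-shifted or time-reversed reference within one gait cycle.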
Authors: Jiayu Ding ([email protected]), Xulin Chen ([email protected])
Affiliation: DLAR Lab
This project was initially developed at Syracuse University (Dynamic Locomotion and Robotics Lab).
This work has been submitted to IROS 2024. If you use this work in an academic context, please cite the following publication: https://arxiv.org/submit/5474477.
Download Isaac Gym from the NVIDIA website and follow its installation instructions. We recommend using a dedicated conda environment.
Once Isaac Gym is properly installed, download this repository and run the following commands
cd Bittle_Leveraging_Symmetries_in_RL/
pip install -e .
pip install -r requirements.txt
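To confirm the installation worked, the packages can be imported from Python; note that Isaac Gym generally has to be imported before torch. The package name isaacgymenvs below assumes this repository installs under the same name as the upstream Isaac Gym Benchmark Environments.

```python
# Minimal installation sanity check (run from inside the activated conda environment).
# Isaac Gym should be imported before torch, otherwise the import may fail.
from isaacgym import gymapi   # core Isaac Gym API
import torch                  # PyTorch backend used for training
import isaacgymenvs           # this repository's package, installed via `pip install -e .`

print("Isaac Gym imported, CUDA available:", torch.cuda.is_available())
```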
Key files and directories:
- cfg/task/DLARBittle_PRD_v2.yaml: Parameters for creating the Bittle environment.
- cfg/train/DLARBittlePPO_LSTM.yaml: Configuration for RL policy training (PPO algorithm with an LSTM network).
- tasks/dlar_bittle_PRD_v2.py: Python definition of the Bittle environment.
- runs/: Directory where trained policies are saved.
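For orientation only: tasks in Isaac Gym Benchmark Environments are typically Python classes that derive from VecTask and implement pre- and post-physics hooks. The skeleton below is a hypothetical sketch of how tasks/dlar_bittle_PRD_v2.py is likely structured, not its actual contents.

```python
from isaacgymenvs.tasks.base.vec_task import VecTask  # base class used by IsaacGymEnvs tasks

class DLARBittlePRD(VecTask):
    """Hypothetical skeleton of the Bittle task class (names are illustrative)."""

    def __init__(self, cfg, **kwargs):
        self.cfg = cfg
        super().__init__(config=cfg, **kwargs)

    def pre_physics_step(self, actions):
        # Apply the policy's joint targets before stepping the simulation
        ...

    def post_physics_step(self):
        # Refresh simulation state tensors, then compute observations and (symmetry) rewards
        self.compute_observations()
        self.compute_reward()
```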
Before running any code, change into the working directory and activate your conda environment:
cd isaacgymenvs/
conda activate your_conda_env_name
To train policies, run
./train.sh
The trained policies are saved under runs/DLARBittle_ww-xx-yy-zz/. In this directory:
- nn/DLARBittle.pth stores the policy parameters that achieved the best performance.
- nn/last_DLARBittle_ep_x_rew_y.pth files store the policy parameters at training epoch x, which achieved reward y.
- summaries/ contains the training logs. To visualize them, install TensorBoard and run tensorboard --logdir=/path/to/log/file
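Assuming the .pth files are ordinary PyTorch checkpoints (as produced by rl_games-style training), they can be inspected directly; the run directory below is just an example.

```python
import torch

# Load a saved policy checkpoint on the CPU and list what it contains.
# Replace the path with your own run directory.
ckpt = torch.load("runs/DLARBittle_B2_0.1-0.8/nn/DLARBittle.pth", map_location="cpu")
if isinstance(ckpt, dict):
    # Such checkpoints typically bundle network weights with optimizer state and metadata.
    print(list(ckpt.keys()))
else:
    print(type(ckpt))
```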
To visualize policies, set checkpoint=/path/to/your/policy in visualize.sh, then run
./visualize.sh
We provide four pretrained policies:
- Bounding: runs/DLARBittle_B2_0.1-0.8/
- Galloping: runs/DLARBittle_GP_0.1-0.8/
- Half-bounding: runs/DLARBittle_HB_H2_0.3-0.6/
- Pronking: runs/DLARBittle_PK_0.1-0.8/
For example, setting checkpoint=runs/DLARBittle_B2_0.1-0.8/nn/DLARBittle.pth in visualize.sh plays back the bounding policy.
To record a video of a policy, set checkpoint=/path/to/your/policy in record_video.sh, then run
./record_video.sh
A demo video of the learned gaits is included in the repository: Sym_Guided_RL_Video_v2.mp4