Constraints As Terminations (CaT)

About this repository

This repository contains the code associated with the article CaT: Constraints as Terminations for Legged Locomotion Reinforcement Learning by Elliot Chane-Sane*, Pierre-Alexandre Leziart*, Thomas Flayols, Olivier Stasse, Philippe Souères and Nicolas Mansard.

This paper has been accepted for the 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024).

This code relies on the high performance RL library RL Games and a reinforcement learning wrapper for the NVIDIA Isaac Gym simulator provided by IsaacGymEnvs.

We lightly modified both RL Games and IsaacGymEnvs to integrate our method in a minimal way, feel free to check cat_common.py, cat_experience.py and isaacgymenvs_make.py in the utils directory.

In utils/constraint_manager.py, we provide a ConstraintManager class to easily implement CaT:

cstr_manager = ConstraintManager()

# Add all constraints
cstr_manager.add("torque", cstr_torque, max_p=0.1) # Add a soft constraint
cstr_manager.add("base_contact", cstr_base_contact, max_p=1.0) # Add a hard constraint

# Get the reward termination probabilities
cstr_delta = cstr_manager.get_probs()

# Rescale the rewards and rewrite the termination values
rewards *= (1.0 - cstr_delta)
dones = cstr_delta

Installation

Download the Isaac Gym Preview 4 release from the Isaac Gym website, then follow the installation instructions in the documentation. You can use the conda environment provided by create_conda_env_rlgpu.sh or manually install Isaac in a virtual Python environment.

Virtual environment: as binaries for Isaac Gym Preview 4 have been compiled a few years ago, precompiled bindings are only provided for Python 3.6, 3.7 and 3.8 so you might encounter issues trying to use them with newer Python versions. On a recent system we recommand to use Python 3.8.16 through pyenv which will allow you to install 3.8.16 without messing with your main Python installation. You can then create a .python-version file in your work directory, in which you just write 3.8.16. If you type python in a terminal when placed in this directory, it should now use pyenv's Python 3.8.16 automatically. Finally, you can create a virtual environment for your Python 3.8.16 (that we call venv).

python3 -m pip install virtualenv
python3 -m venv venv
source venv/bin/activate

If you chose to use a virtual environment, you can install Isaac Gym with:

cd isaacgym/python/
python3 -m pip install -e .

Ensure that Isaac Gym works on your system by running one of the examples from the python/examples directory, like joint_monkey.py or 1080_balls_of_solitude.py. Follow troubleshooting steps described in the Isaac Gym Preview 4 install instructions if you have any trouble running the samples.

Once Isaac Gym is installed and samples work within your current python environment, install the RL games repository that contains a PPO implementation in Pytorch.

git clone https://github.com/Denys88/rl_games.git
python3 -m pip install -e .

Once RL Games is installed, install the Isaac Gym Benchmark Environments that contains environments and utils to run reinforcement learning in Isaac Gym, relying on RL Games.

git clone https://github.com/NVIDIA-Omniverse/IsaacGymEnvs.git
python3 -m pip install -e .

You will likely need to install Ray as it is not included in the requirements:

python3 -m pip install ray

In the following, if you encounter an error AttributeError: module 'numpy' has no attribute 'int'., you can manually patch the problematic package.

nano venv/lib/python3.8/site-packages/networkx/readwrite/graphml.py

Look for # prepend so that python types are created upon read (last entry wins) and replace (np.int, "int") by (np.int_, "int")

To check your installation, while inside /IsaacGymEnvs/isaacgymenvs, you can run a basic example like the Cartpole training:

python3 train.py task=Cartpole headless=True

And check the result with:

python3 train.py task=Cartpole checkpoint=runs/Cartpole/nn/Cartpole.pth test=True num_envs=32

Check IsaacGymEnvs' README for more information about the available parameters.

Running CaT

Install the texttable module for monitoring purpose:

python3 -m pip install texttable

Place yourself in /constraint-as-terminations and launch a basic training setup on flat ground:

python3 train.py task=SoloTerrain  headless=True

If everything goes well, you will see monitoring statistics in the terminal as the training progresses. At the end, you can check the result with:

python3 train.py task=SoloTerrain num_envs=1 test=True task.env.enableDebugVis=True task.env.onlyForwards=True checkpoint=runs/SoloTerrain_xx-xx-xx-xx/nn/SoloTerrain.pth

Citing

Please cite this work as:

@inproceedings{chane2024cat,
      title={CaT: Constraints as Terminations for Legged Locomotion Reinforcement Learning},
      author={Elliot Chane-Sane and Pierre-Alexandre Leziart and Thomas Flayols and Olivier Stasse and Philippe Sou{\`e}res and Nicolas Mansard},
      booktitle={IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
      year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
assets		assets
cfg		cfg
solo12		solo12
tasks		tasks
utils		utils
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Constraints As Terminations (CaT)

About this repository

Installation

Running CaT

Citing

About

Releases

Packages

Languages

License

Gepetto/constraints-as-terminations

Folders and files

Latest commit

History

Repository files navigation

Constraints As Terminations (CaT)

About this repository

Installation

Running CaT

Citing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages