first commit

eltociear · Sep 29, 2024 · 53dbfce · 53dbfce
commit 53dbfce
Show file tree

Hide file tree

Showing 1,078 changed files with 261,933 additions and 0 deletions.
diff --git a/.gitignore b/.gitignore
@@ -0,0 +1,35 @@
+runs
+*.pkl
+data
+save
+*.pyc
+*.sh.log*
+.idea/
+*__pycache__
+*.pyc
+.idea/*
+*.so
+*egg-info*
+*.avi
+*.pdf
+*.xyz
+*.zip
+venv
+wandb
+assets/demonstrations
+*.svg
+.vscode
+output
+outputs
+test_shooting
+multirun
+~/tmp
+huggingface_model
+src/plot
+sim_env/mujoco/demonstrations
+.pre-commit-config.yaml
+cache
+backup_output
+misc/wandb_data
+misc/wandb_output
+*.DS_Store
diff --git a/LICENSE b/LICENSE
@@ -0,0 +1,21 @@
+MIT License
+
+Copyright (c) 2020  
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
diff --git a/README.md b/README.md
@@ -0,0 +1,121 @@
+# 🦾 Heterogenous Pre-trained Transformers
+[![HF Models](https://img.shields.io/badge/%F0%9F%A4%97-Models-yellow?style=flat-square)](https://huggingface.co/liruiw/hpt-base)
+[![License](https://img.shields.io/badge/License-MIT-blue.svg?style=flat-square)](LICENSE)
+[![Paper](https://badgen.net/badge/icon/arXiv?icon=awesome&label&color=red&style=flat-square)]()
+[![Website](https://img.shields.io/badge/Website-hpt-blue?style=flat-square)](https://liruiw.github.io/hpt)
+[![Python](https://img.shields.io/badge/Python-%3E=3.8-blue?style=flat-square)]()
+[![PyTorch](https://img.shields.io/badge/PyTorch-%3E=2.0-orange?style=flat-square)]()
+
+[Lirui Wang](https://liruiw.github.io/), [Xinlei Chen](https://xinleic.xyz/), [Jialiang Zhao](https://alanz.info/), [Kaiming He](https://people.csail.mit.edu/kaiming/)
+
+Neural Information Processing Systems (Spotlight), 2024
+
+
+
+<hr style="border: 2px solid gray;"></hr>
+
+
+This is a pytorch implementation for pre-training Heterogenous Pre-trained Transformers (HPTs). The pre-training procedure train on mixture of embodiment datasets with a supervised learning objective. The pre-training process can take some time, so we also provide pre-trained checkpoints below. You can find more details on our [project page](https://liruiw.github.io/hpt). An alternative clean implementation of HPT in Hugging Face can also be found [here](https://github.com/liruiw/lerobot/tree/hpt_squash/lerobot/common/policies/hpt).
+
+
+**TL;DR:** HPT aligns different embodiment to a shared latent space and investigates the scaling behaviors in policy learning. Put a scalable transformer in the middle of your policy and don’t train from scratch!
+
+
+
+
+## ⚙️ Setup
+1. ```pip install -e .```
+2. This repository should share the `data` folder and the `output` folder as the [pretraining repo](https://github.com/liruiw/HPT-pretrain).
+
+<details>
+<summary><span style="font-weight: bold;">Install (old-version) Mujoco</span></summary>
+
+```
+mkdir ~/.mujoco
+cd ~/.mujoco
+wget https://mujoco.org/download/mujoco210-linux-x86_64.tar.gz  -O mujoco210.tar.gz --no-check-certificate
+tar -xvzf mujoco210.tar.gz
+
+# add the following line to ~/.bashrc if needed
+export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:${HOME}/.mujoco/mujoco210/bin
+export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/lib/nvidia
+export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/cuda/lib64
+export MUJOCO_GL=egl
+```
+
+</details>
+
+## 🚶 Usage
+0. Check out ``quickstart.ipynb`` for how to use the pretrained HPTs.
+1. ```python -m hpt.run``` train policies on each environment. Add `+mode=debug`  for debugging.
+2. ```bash experiments/scripts/metaworld/train_test_metaworld_1task.sh test test 1 +mode=debug``` for example script.
+3. Change ``train.pretrained_dir`` for loading pre-trained trunk transformer. The model can be loaded either from local checkpoint folder or huggingface [repository](https://huggingface.co/liruiw/hpt-xlarge).
+4. Run the following scripts for mujoco experiments.
+
+<details>
+  <summary><span style="font-weight: bold;">Metaworld 20 Task Experiments</span></summary>
+
+```
+bash experiments/scripts/metaworld/train_test_metaworld_20task_finetune.sh hf://liruiw/hpt-base
+```
+5. See [here](experiments/config/config.yaml) for defining and modifying the hyperparameters.
+6. We use [wandb](https://wandb.ai/home) to log the training process.
+
+</details>
+
+## 🤖 Try this On Your Own Dataset
+0. For training, it requires a dataset conversion  `convert_dataset` function for packing your own datasets. Check [this](env/realworld) for example.
+1. For evaluation, it requires a `rollout_runner.py` file for each benchmark and  a ``learner_trajectory_generator`` evaluation function that provides rollouts.
+2. If needed, modify the [config](experiments/configs/config.yaml) for changing the perception stem networks and action head networks in the models. Take a look at [`realrobot_image.yaml`](experiments/configs/env/realrobot_image.yaml) for example script in the real world.
+3. Add `dataset.use_disk=True` for saving and loading the dataset in disk.
+
+## 💽 Checkpoints
+You can find pretrained HPT checkpoints here. At the moment we provide the following model versions:
+
+| Model                                                                  |   Size         |
+|--------------------------------------------------------------------------------|----------------|
+| [HPT-XLarge](https://huggingface.co/liruiw/hpt-xlarge)                 |  226.8M Params  |
+| [HPT-Large](https://huggingface.co/liruiw/hpt-large)                 |  50.5M Params  |
+| [HPT-Base](https://huggingface.co/liruiw/hpt-base)                 |  12.6M Params  |
+| [HPT-Small](https://huggingface.co/liruiw/hpt-small)                 |  3.1M Params   |
+| [HPT-Large (With Language)](https://huggingface.co/liruiw/hpt-base-lang)       |  50.6M Params  |
+
+
+---
+## 💾 File Structure
+```angular2html
+├── ...
+├── HPT
+|   ├── data            # cached datasets
+|   ├── output          # trained models and figures
+|   ├── env             # environment wrappers
+|   ├── hpt             # model training and dataset source code
+|   |   ├── models      # network models
+|   |   ├── datasets    # dataset related
+|   |   ├── run         # transfer learning main loop
+|   |   ├── run_eval    # evaluation main loop
+|   |   └── ...
+|   ├── experiments     # training configs
+|   |   ├── configs     # modular configs
+└── ...
+```
+
+### 🕹️ Citation
+If you find HPT useful in your research, please consider citing:
+```
+@inproceedings{wang2024hpt,
+author    = {Lirui Wang, Xinlei Chen, Jialiang Zhao, Kaiming He},
+title     = {Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers},
+booktitle = {Neurips},
+year      = {2024}
+}
+```
+
+
+## Contact
+
+If you have any questions, feel free to contact me through email ([email protected]). Enjoy!
+
+![](doc/framework.png)
+
+
diff --git a/doc/framework.png b/doc/framework.png
diff --git a/doc/hpt_new.mp4 b/doc/hpt_new.mp4
diff --git a/env/__init__.py b/env/__init__.py
diff --git a/env/mujoco/__init__.py b/env/mujoco/__init__.py
diff --git a/env/mujoco/metaworld/LICENSE b/env/mujoco/metaworld/LICENSE
@@ -0,0 +1,21 @@
+MIT License
+
+Copyright (c) 2019 Meta-World Team
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
diff --git a/env/mujoco/metaworld/README.md b/env/mujoco/metaworld/README.md
@@ -0,0 +1,14 @@
+# citation
+@inproceedings{yu2019meta,
+  title={Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning},
+  author={Tianhe Yu and Deirdre Quillen and Zhanpeng He and Ryan Julian and Karol Hausman and Chelsea Finn and Sergey Levine},
+  booktitle={Conference on Robot Learning (CoRL)},
+  year={2019}
+  eprint={1910.10897},
+  archivePrefix={arXiv},
+  primaryClass={cs.LG}
+  url={https://arxiv.org/abs/1910.10897}
+}
+
+# link
+https://github.com/Farama-Foundation/Metaworld