DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning

Gaoyue Zhou, Hengkai Pan, Yann LeCun and Lerrel Pinto, New York University, Meta AI

Getting Started

Installation
Datasets
Train a DINO-WM
Plan with a DINO-WM

Installation

Setup an environment

git clone https://github.com/gaoyuezhou/dino_wm.git
cd dino_wm
conda env create -f environment.yaml
conda activate dino_wm
pip install hydra-submitit-launcher

Install Mujoco

Create the .mujoco directory and download Mujoco210 using wget:

mkdir -p ~/.mujoco
wget https://mujoco.org/download/mujoco210-linux-x86_64.tar.gz -P ~/.mujoco/
cd ~/.mujoco
tar -xzvf mujoco210-linux-x86_64.tar.gz

Append the following lines to your ~/.bashrc:

# Mujoco Path. Replace `<username>` with your actual username if necessary.
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/home/<username>/.mujoco/mujoco210/bin

# NVIDIA Library Path (if using NVIDIA GPUs)
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/lib/nvidia

Reload your shell configuration to apply the environment variable changes:

source ~/.bashrc

Notes

For GPU-accelerated simulations, ensure the NVIDIA drivers are correctly installed.
If you encounter issues, confirm that the paths in your LD_LIBRARY_PATH are correct.
If problems persist, refer to these GitHub issue pages for potential solutions: openai/mujoco-py#773, ethz-asl/reinmav-gym#35.

The following are optional installation steps for planning in the deformable environments.

Install PyFlex (optional for deformable environments)

Install PyFleX if you need to plan within the deformable environments. These installation instructions are adapted from AdaptiGraph.

We are using a docker image to compile PyFleX. Make sure you have the following packages:

Full installation:

pip install "pybind11[global]"
sudo docker pull xingyu/softgym

Run bash install_pyflex.sh. You may need to source ~/.bashrc to import PyFleX.

Or you can manually run

# compile pyflex in docker image
# re-compile if source code changed
# make sure ${PWD}/PyFleX is the pyflex root path when re-compiling
sudo docker run \
    -v ${PWD}/PyFleX:/workspace/PyFleX \
    -v ${CONDA_PREFIX}:/workspace/anaconda \
    -v /tmp/.X11-unix:/tmp/.X11-unix \
    --gpus all \
    -e DISPLAY=$DISPLAY \
    -e QT_X11_NO_MITSHM=1 \
    -it xingyu/softgym:latest bash \
    -c "export PATH=/workspace/anaconda/bin:$PATH; cd /workspace/PyFleX; export PYFLEXROOT=/workspace/PyFleX; export PYTHONPATH=/workspace/PyFleX/bindings/build:$PYTHONPATH; export LD_LIBRARY_PATH=$PYFLEXROOT/external/SDL2-2.0.4/lib/x64:$LD_LIBRARY_PATH; cd bindings; mkdir build; cd build; /usr/bin/cmake ..; make -j"

# import to system paths. run these if you do not have these paths yet in ~/.bashrc
echo '# PyFleX' >> ~/.bashrc
echo "export PYFLEXROOT=${PWD}/PyFleX" >> ~/.bashrc
echo 'export PYTHONPATH=${PYFLEXROOT}/bindings/build:$PYTHONPATH' >> ~/.bashrc
echo 'export LD_LIBRARY_PATH=${PYFLEXROOT}/external/SDL2-2.0.4/lib/x64:$LD_LIBRARY_PATH' >> ~/.bashrc
echo '' >> ~/.bashrc

Datasets

Dataset for each task can be downloaded here.

Once the datasets are downloaded, unzip them. For the deformable dataset, you need to combine all parts and then unzip:

zip -s- deformable.zip -O deformable_full.zip
unzip deformable_full.zip

Set an environment variable pointing to your dataset folder:

# Replace /path/to/data with the actual path to your dataset folder.
export DATASET_DIR=/path/to/data

Inside the dataset folder, you should find the following structure:

data
├── deformable
│   ├── granular
│   └── rope
├── point_maze
├── pusht_noise
└── wall_single

Train a DINO-WM

Once you have completed the above steps, you can check whether you could launch training with an example command like this:

python train.py --config-name train.yaml env=point_maze frameskip=5 num_hist=3

You may specify models' output directory at ckpt_base_path in conf/train.yaml.

Plan with a DINO-WM

Once a world model has been trained, you may use it for planning with an example command like this:

python plan.py model_name=<model_name> n_evals=5 planner=cem goal_H=5 goal_source='random_state' planner.opt_steps=30

where the model is saved at folder <ckpt_base_path>/<model_name>, and <ckpt_base_path> can be specified in conf/plan.yaml.

Citation

@misc{zhou2024dinowmworldmodelspretrained,
      title={DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning}, 
      author={Gaoyue Zhou and Hengkai Pan and Yann LeCun and Lerrel Pinto},
      year={2024},
      eprint={2411.04983},
      archivePrefix={arXiv},
      primaryClass={cs.RO},
      url={https://arxiv.org/abs/2411.04983}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
assets		assets
conf		conf
datasets		datasets
distributed_fn		distributed_fn
env		env
metrics		metrics
models		models
planning		planning
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
custom_resolvers.py		custom_resolvers.py
environment.yaml		environment.yaml
plan.py		plan.py
preprocessor.py		preprocessor.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning

Getting Started

Installation

Install Mujoco

Notes

Install PyFlex (optional for deformable environments)

Datasets

Train a DINO-WM

Plan with a DINO-WM

Citation

About

Releases

Packages

Languages

License

dustland/dino_wm

Folders and files

Latest commit

History

Repository files navigation

DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning

Getting Started

Installation

Install Mujoco

Notes

Install PyFlex (optional for deformable environments)

Datasets

Train a DINO-WM

Plan with a DINO-WM

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages