Qianqian Wang<sup>1,2*</sup>, Vickie Ye<sup>1*</sup>, Hang Gao<sup>1*</sup>, Weijia Zeng<sup>1*</sup>, Jake Austin<sup>1</sup>, Zhengqi Li<sup>2</sup>, Angjoo Kanazawa<sup>1</sup>

<sup>1</sup>UC Berkeley, <sup>2</sup>Google Research

<sup>*</sup>Equal contribution
We provide preprocessed versions of the Nvidia dataset and our custom dataset, which can be found here. We used MegaSaM to obtain cameras and depth maps for the custom dataset.
To train on the Nvidia dataset:
```bash
python run_training.py \
  --work-dir <OUTPUT_DIR> \
  data:nvidia \
  --data.data-dir </path/to/data>
```
To train on a custom dataset:
```bash
python run_training.py \
  --work-dir <OUTPUT_DIR> \
  data:custom \
  --data.data-dir </path/to/data>
```
To get better scene geometry, we use 2D Gaussian Splatting:
```bash
python run_training.py \
  --work-dir <OUTPUT_DIR> \
  --use_2dgs \
  data:custom \
  --data.data-dir </path/to/data>
```
```bash
git clone --recurse-submodules https://github.com/vye16/shape-of-motion
cd shape-of-motion/
conda create -n som python=3.10
conda activate som
```
Update `requirements.txt` with the correct CUDA version for PyTorch and cuML, i.e., replace `cu122` and `cu12` with the tags matching your CUDA version.
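For example, on a machine with CUDA 11.8 the replacement might look like the following (a minimal sketch; the exact wheel tags depend on your setup):

```bash
# Hypothetical example for CUDA 11.8: swap the CUDA wheel tags in place.
# Replace cu122 (PyTorch) before the broader cu12 pattern (cuML),
# so the second substitution does not clobber the first.
sed -i -e 's/cu122/cu118/g' -e 's/cu12/cu11/g' requirements.txt
```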
```bash
pip install -r requirements.txt
pip install git+https://github.com/nerfstudio-project/gsplat.git
```
We depend on the third-party libraries in `preproc/` to generate depth maps, object masks, camera estimates, and 2D tracks. Please follow the guide in the preprocessing README.
First, download our processed iPhone dataset from this link. To train on a sequence, e.g., `paper-windmill`, run:
```bash
python run_training.py \
  --work-dir <OUTPUT_DIR> \
  --port <PORT> \
  data:iphone \
  --data.data-dir </path/to/paper-windmill/>
```
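For instance, a concrete invocation might look like this (the paths and port number are illustrative, not prescribed):

```bash
# Example run on the paper-windmill sequence; adjust paths to your setup.
python run_training.py \
  --work-dir ./outputs/paper-windmill \
  --port 6007 \
  data:iphone \
  --data.data-dir ./datasets/iphone/paper-windmill/
```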
After optimization, the numerical results can be evaluated via:
```bash
PYTHONPATH='.' python scripts/evaluate_iphone.py \
  --data_dir </path/to/paper-windmill/> \
  --result_dir <OUTPUT_DIR> \
  --seq_names paper-windmill
```
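To evaluate several sequences in one go, you can loop over them with the same script; the sequence names and directory layout below are only examples:

```bash
# Hypothetical batch evaluation; sequence names and paths are illustrative.
for seq in paper-windmill apple block; do
  PYTHONPATH='.' python scripts/evaluate_iphone.py \
    --data_dir "/path/to/iphone/${seq}" \
    --result_dir "outputs/${seq}" \
    --seq_names "${seq}"
done
```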
If you find this work useful, please consider citing:

```bibtex
@article{som2024,
  title   = {Shape of Motion: 4D Reconstruction from a Single Video},
  author  = {Wang, Qianqian and Ye, Vickie and Gao, Hang and Zeng, Weijia and Austin, Jake and Li, Zhengqi and Kanazawa, Angjoo},
  journal = {arXiv preprint arXiv:2407.13764},
  year    = {2024}
}
```