Temporally Consistent Referring Video Object Segmentation with Hybrid Memory

The official implementation of the paper:

Temporally Consistent Referring Video Object Segmentation with Hybrid Memory

Introduction

Referring Video Object Segmentation (R-VOS) methods face challenges in maintaining consistent object segmentation due to temporal context variability and the presence of other visually similar objects. We propose an end-to-end R-VOS paradigm that explicitly models temporal instance consistency alongside the referring segmentation. Furthermore, we propose a new Mask Consistency Score (MCS) metric to evaluate the temporal consistency of video segmentation. Extensive experiments demonstrate that our approach enhances temporal consistency by a significant margin, leading to top-ranked performance on popular R-VOS benchmarks.

Demo video: demo_video.mp4

Installation and Data Preparation

Please refer to SgMg for installation and data preparation.

Evaluation

The checkpoint for HTR w/ SwinL is available at HTR-SwinL.

To evaluate HTR on Ref-DAVIS or Ref-YouTube-VOS, run the corresponding command below in the scripts folder (listed as scripts_eval in this repository):

sh dist_test_davis_swinl.sh

sh dist_test_ytv_swinl.sh

MCS Metric for Temporal Consistency

The code for MCS evaluation is in get_mcs.py. On the Ref-YTVOS evaluation server, click "View scoring output log" to download the stdout.txt of your submission.

Then run the script on that file to obtain the MCS score under different thresholds.
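To give a feel for what a thresholded consistency score looks like, here is a minimal sketch. It is not the code in get_mcs.py and may differ from the MCS definition in the paper. It assumes, purely for illustration, that per-frame IoU values are already available for each video/expression pair and that a sequence counts as consistent only if every frame reaches the threshold. The function name `consistency_score` and the sample keys and values are hypothetical.

```python
from typing import Dict, Sequence


def consistency_score(per_frame_iou: Dict[str, Sequence[float]], threshold: float) -> float:
    """Fraction of sequences whose IoU is at or above `threshold` in every frame.

    Assumption: this is one plausible reading of a temporal consistency
    score, not necessarily the exact MCS computed by get_mcs.py.
    """
    if not per_frame_iou:
        raise ValueError("no sequences given")
    consistent = sum(
        1
        for ious in per_frame_iou.values()
        # An empty sequence is treated as not consistent.
        if len(ious) > 0 and min(ious) >= threshold
    )
    return consistent / len(per_frame_iou)


# Hypothetical per-frame IoU values; real values would come from an evaluation log.
ious = {
    "video_a/exp_0": [0.92, 0.90, 0.88],
    "video_b/exp_0": [0.95, 0.40, 0.93],  # one bad frame breaks consistency
    "video_c/exp_0": [0.75, 0.72, 0.70],
    "video_d/exp_0": [0.60, 0.65, 0.55],
}

# Score under several thresholds, mirroring the "different thresholds" idea.
scores = {t: consistency_score(ious, t) for t in (0.5, 0.7, 0.9)}
for t, s in scores.items():
    print(f"score@{t}: {s:.2f}")
```

Using the minimum over frames means a single failed frame lowers the score, which a frame-averaged metric would largely hide. That is the kind of behavior a temporal consistency measure is meant to expose.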

Citation

@article{miao2024htr,
  title={Temporally Consistent Referring Video Object Segmentation with Hybrid Memory},
  author={Miao, Bo and Bennamoun, Mohammed and Gao, Yongsheng and Shah, Mubarak and Mian, Ajmal},
  journal={IEEE Transactions on Circuits and Systems for Video Technology},
  year={2024},
  publisher={IEEE}
}

Acknowledgements

Contact

If you have any questions about this project, please feel free to contact [email protected].
