MACA: Mask and Caption what you see

This is 2 implementations of MACA on python3, keras and tensorflow2. The model generates pixel-level masks for each instance of an object in the image and bounding-box-level captions for each area with meaningful contents. It is based on Mask R-CNN and DenseCap.

The repository includes:

Jupyter notebook of using MACA to detect single images and videos
Jupyter notebook of training MACA further on CocoVG dataset
Evaluation metrics on CocoVG dataset (mAP, BLUE, CIDEr, ROUGE)
Requirements file of building programming environments

The basic theory source code is documented and designed to be easy to extend. If you use it in your research, please consider citing this repository (bibtex below).

Getting Started

maca_detecting.ipynb is a good start for user to detect images and videos directly, this notebook contains the way to download necessary materials and how to use them to detect on single images or videos. This note book also contains tutorial about how to calculate the evaluation metrics of MACA on CocoVG validation & and training dataset.
maca_training.ipynb is a tutorial about how to download necessary materials and implement them to further train MACA on CocoVG dataset.
assets directory contains the images used in the dissertation.

Necessary materials include:

Source code of MACA
Well-trained model weights (maca_cocovg.hdf5)
Data of CocoVG dataset

(All of the necessary materials will be downloaded automatically by the code in jupyter notebook)

Note: Running online in google colab is recommended (you do not need to set up any coding environments), you can check this link to learn how to run jupyter notebook on google colab. Also you can run all of the jupyter notebooks locally on your device, you need to make sure to successfully setup python3 + jupyter notebook environments on your device (also cudnn + Nvidia environments if you want to use GPU to accelerate), the tutorial of setting up environment can be seen here.

Other Informations

If you are interested in background basic theories and the way to code them, please check my personal github repository here.

If you are interested in well-trained weights or CocoVG dataset, please check the release here

Citation

@misc{Askfk_maca_2020,
  title={MACA: Mask and Caption what you see},
  author={Yiming Li},
  year={2020},
  publisher={Github},
  journal={GitHub repository},
  howpublished={\url{https://github.com/Askfk/maca}},
}

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.idea		.idea
BackBones		BackBones
assets		assets
demo		demo
keras_flops		keras_flops
layers		layers
macacripts		macacripts
pics		pics
pycocoevalcap		pycocoevalcap
tutorials		tutorials
video		video
MACA.py		MACA.py
README.md		README.md
__init__.py		__init__.py
config.py		config.py
requirements.txt		requirements.txt
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MACA: Mask and Caption what you see

Getting Started

Necessary materials include:

Other Informations

Citation

About

Releases 1

Packages

Languages

Askfk/maca

Folders and files

Latest commit

History

Repository files navigation

MACA: Mask and Caption what you see

Getting Started

Necessary materials include:

Other Informations

Citation

About

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages