Food_caption

In this project, I use the code from show attend and tell, which is implemented base on paper: show attend and tell

Since the goal is to predict the label of the image, and eventually hope one can generate the ingredient of the food image, so we think using the image caption generation technique is more appropriate than image classification. But due to the limit of the dataset, for now, I can only use the image label as the caption. Overall though, the result is not bad.

General Idea

Extract Image feature by CNN (VGG-19)
Use attention base LSTM model to generate the caption of the image, which in this case, is the label of the image

Result

Note: The white part is the attention area that the machine focus on to make the caption/label prediction

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
Result		Result
caffe_model		caffe_model
Feat_Extraction.ipynb		Feat_Extraction.ipynb
README.md		README.md
cnn_util.py		cnn_util.py
model_tensorflow.py		model_tensorflow.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Food_caption

General Idea

Result

About

Releases

Packages

Languages

2g-XzenG/Food_caption

Folders and files

Latest commit

History

Repository files navigation

Food_caption

General Idea

Result

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages