Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update README.md #1

Open
wants to merge 1 commit into
base: master
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 1 addition & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,8 +2,7 @@

# Dataset link:
https://drive.google.com/file/d/1OVb4_3Uec_xbyUk90aWC6LFpKsIOtR7v/view?usp=sharing



Several deep learning models have been reviewed for captioning tasks, improving the possibilities of textual understanding,
and captioning of both simple and complex images. In this project, a deep learning model for describing the difference between a pair of images textually. supervised model uses several deep learning techniques with natural language processing techniques to explore the practicability of capturing, aligning and computing the differences between two image features and to produce textual description of the same. First, a Siamese encoder consisting of convolutional neural networks is used to extract the feature vector related to the differences in a pair of images. Attention model is used to select salient features of the vector. The resultant feature vector is processed by a long-short term memory decoder to generate textual description. The model is evaluated on a Spot-the-diff baseline dataset which contains different pairs of images with appropriate annotations.

Expand Down