
Ask_Attend_and_Answer

Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering

Code

Instructions for training and testing the "SMem-VQA Two-Hop" model:

  1. Download the provided caffe folder and install Caffe following the instructions at http://caffe.berkeleyvision.org/installation.html.

  2. Download the MSCOCO images and the VQA annotations and questions:

    cd example/data/

    ./get_image.sh

  3. Generate the hdf5 data for training and testing:

    cd example/

    python ./data/generate_h5_data/generate_h5_data.py

  4. Train the model:

    cd example/

    ./train/train_mm.sh

  5. Alternatively, use the model already trained on the VQA dataset: SMem-VQA

  6. Predict the answers for the images and questions in VQA test-dev dataset:

    cd example/

    python ./prediction/predict_json.py
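Before training (step 3), the questions must be converted into fixed-length index sequences for the HDF5 files. The sketch below illustrates the kind of preprocessing a script like generate_h5_data.py typically performs — building a vocabulary from the training questions and zero-padding each encoded question. All function names, the padding convention, and the maximum length are illustrative assumptions, not the repository's actual API.

```python
# Hypothetical sketch of question preprocessing for VQA training data.
# Builds a word->index vocabulary (0 reserved for padding) and encodes
# each question as a fixed-length list of indices. Names and conventions
# here are assumptions for illustration, not the repo's actual code.

def build_vocab(questions):
    """Map each distinct word to a positive integer index; 0 is padding."""
    vocab = {}
    for q in questions:
        for word in q.lower().rstrip("?").split():
            if word not in vocab:
                vocab[word] = len(vocab) + 1
    return vocab

def encode(question, vocab, max_len=15):
    """Encode a question as max_len word indices, truncated or zero-padded.

    Out-of-vocabulary words map to 0 in this sketch; a real pipeline
    would more likely use a dedicated <unk> index.
    """
    ids = [vocab.get(w, 0) for w in question.lower().rstrip("?").split()]
    ids = ids[:max_len]
    return ids + [0] * (max_len - len(ids))

questions = ["What color is the cat?", "How many dogs are there?"]
vocab = build_vocab(questions)
print(encode("What color is the dog?", vocab, max_len=8))
# -> [1, 2, 3, 4, 0, 0, 0, 0]  ("dog" is out of vocabulary)
```

The resulting index arrays, together with the image features, would then be written to HDF5 for the training and prediction scripts to consume.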

Citation

@article{xu2015ask,
    Author = {Xu, Huijuan and Saenko, Kate},
    Title = {Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering},
    Journal = {arXiv preprint arXiv:1511.05234},
    Year = {2015}
}
