Text-to-image-synthesis-with-GANs

Python 3.7+ and Pytorch 1.x

Referenced from: https://github.com/taoxugit/AttnGAN

Play with this model: Demo Link

Sneak-peek into the webapp

AttnGAN

Pytorch implementation for reproducing AttnGAN results in the paper AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks by Tao Xu, Pengchuan Zhang, Qiuyuan Huang, Han Zhang, Zhe Gan, Xiaolei Huang, Xiaodong He. (This work was performed when Tao was an intern with Microsoft Research).

Data

Download preprocessed metadata for birds coco and save them to data/
Download the birds image data. Extract them to data/birds/
Download coco dataset and extract the images to data/coco/

Training

Pre-train DAMSM models:
- For bird dataset: python pretrain_DAMSM.py --cfg cfg/DAMSM/bird.yml --gpu 0
- For coco dataset: python pretrain_DAMSM.py --cfg cfg/DAMSM/coco.yml --gpu 1
Train AttnGAN models:
- For bird dataset: python main.py --cfg cfg/bird_attn2.yml --gpu 2
- For coco dataset: python main.py --cfg cfg/coco_attn2.yml --gpu 3
*.yml files are example configuration files for training/evaluation our models.

Pretrained Model

DAMSM for bird. Download and save it to DAMSMencoders/
DAMSM for coco. Download and save it to DAMSMencoders/
AttnGAN for bird. Download and save it to models/
AttnGAN for coco. Download and save it to models/
AttnDCGAN for bird. Download and save it to models/
- This is an variant of AttnGAN which applies the proposed attention mechanisms to DCGAN framework.

Sampling

Run python main.py --cfg cfg/eval_bird.yml --gpu 1 to generate examples from captions in files listed in "./data/birds/example_filenames.txt". Results are saved to DAMSMencoders/.
Change the eval_*.yml files to generate images from other pre-trained models.
Input your own sentence in "./data/birds/example_captions.txt" if you wannt to generate images from customized sentences.

Validation

To generate images for all captions in the validation dataset, change B_VALIDATION to True in the eval_*.yml. and then run python main.py --cfg cfg/eval_bird.yml --gpu 1
We compute inception score for models trained on birds using StackGAN-inception-model.
We compute inception score for models trained on coco using improved-gan/inception_score.

Creating an API

Evaluation code embedded into a callable containerized API is included in the eval\ folder.

Citing AttnGAN

If you find AttnGAN useful in your research, please consider citing:

@article{Tao18attngan,
  author    = {Tao Xu, Pengchuan Zhang, Qiuyuan Huang, Han Zhang, Zhe Gan, Xiaolei Huang, Xiaodong He},
  title     = {AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks},
  Year = {2018},
  booktitle = {{CVPR}}
}

Reference

References

Note: This is a rough Readme as I am quite overloaded with work right now, this Readme will be updated soon with all the details (results, benchmarks, training hardware, model configurations, etc)

Name		Name	Last commit message	Last commit date
Latest commit History 121 Commits
.streamlit		.streamlit
DAMSMencoders/bird		DAMSMencoders/bird
data/birds		data/birds
demo		demo
img		img
models		models
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
attngan_explanation.py		attngan_explanation.py
config.py		config.py
demo.py		demo.py
eval_bird.yml		eval_bird.yml
multiapp.py		multiapp.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text-to-image-synthesis-with-GANs

Python 3.7+ and Pytorch 1.x

Play with this model: Demo Link

Sneak-peek into the webapp

AttnGAN

Creating an API

Citing AttnGAN

References

Note: This is a rough Readme as I am quite overloaded with work right now, this Readme will be updated soon with all the details (results, benchmarks, training hardware, model configurations, etc)

About

Releases

Packages

Languages

License

Gladiator07/Text-to-image-synthesis-with-AttnGAN

Folders and files

Latest commit

History

Repository files navigation

Text-to-image-synthesis-with-GANs

Python 3.7+ and Pytorch 1.x

Play with this model: Demo Link

Sneak-peek into the webapp

AttnGAN

Creating an API

Citing AttnGAN

References

Note: This is a rough Readme as I am quite overloaded with work right now, this Readme will be updated soon with all the details (results, benchmarks, training hardware, model configurations, etc)

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages