Variational autoencoders are prominent generative models for discrete data. However, with flexible decoders, they tend to ignore the latent codes. In this paper, we study a VAE model with a deterministic decoder (DD-VAE) for sequential data that selects the highest-scoring tokens instead of sampling. With deterministic decoding, the latent code is the only source of diversity in the generated objects, which improves the structure of the learned manifold. To implement DD-VAE, we propose a new class of bounded-support proposal distributions and derive the Kullback-Leibler divergence for Gaussian and uniform priors. We also study a continuous relaxation of the deterministic decoding objective and analyze the relation between reconstruction accuracy and the relaxation parameters. We demonstrate the performance of DD-VAE on multiple datasets, including molecular generation and optimization problems.
For more details, please refer to the full paper.
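As a rough illustration of the deterministic decoding idea, the sketch below (our own minimal PyTorch example, not code from this repository; the function names are illustrative) contrasts standard stochastic decoding with picking the highest-scoring token at each step, so that all diversity in the output has to come from the latent code:

```python
import torch

def sample_tokens(logits: torch.Tensor) -> torch.Tensor:
    """Standard stochastic decoding: sample a token at every step.

    logits: tensor of shape (batch, seq_len, vocab_size).
    """
    probs = torch.softmax(logits, dim=-1)
    return torch.distributions.Categorical(probs=probs).sample()

def deterministic_tokens(logits: torch.Tensor) -> torch.Tensor:
    """Deterministic decoding: take the argmax token at every step,
    so the latent code is the only source of variability."""
    return logits.argmax(dim=-1)
```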
In this repository, we provide all the code and data necessary to reproduce the results from the paper. To reproduce the experiments, we recommend using a Docker image built from the provided `Dockerfile`:
```bash
nvidia-docker build -t dd_vae .
nvidia-docker run -it --shm-size 10G --network="host" --name dd_vae -w=/code/dd_vae dd_vae
```
All the code will be available inside the `/code/dd_vae` folder. For more details on using Docker, please refer to the Docker manual.
You can also install `dd_vae` locally by running the `python setup.py install` command.
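For example, from the repository root (the editable install in the comment is our suggestion and not part of the original instructions):

```bash
python setup.py install
# Alternatively, assuming a standard setup.py, an editable install should also work:
# pip install -e .
```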
You can train any model using the `train.py` script. This script takes only two arguments: `--config` (the path to an `.ini` file that sets up the experiment) and `--device` (a PyTorch-style device name such as `cuda:0`). We provide all configuration files in the `configs/` folder. For each experiment, we provide a separate Jupyter notebook with further instructions on reproducing the experiments.
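For example, a single training run might look like the following (the config file name is a placeholder; substitute any `.ini` file from `configs/`):

```bash
python train.py --config configs/<experiment>.ini --device cuda:0
```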
```bibtex
@InProceedings{pmlr-v108-polykovskiy20a,
  title     = {Deterministic Decoding for Discrete Data in Variational Autoencoders},
  author    = {Polykovskiy, Daniil and Vetrov, Dmitry},
  booktitle = {Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics},
  pages     = {3046--3056},
  year      = {2020},
  editor    = {Silvia Chiappa and Roberto Calandra},
  volume    = {108},
  series    = {Proceedings of Machine Learning Research},
  address   = {Online},
  month     = {26--28 Aug},
  publisher = {PMLR}
}
```