PBT (Population Based Training) is an optimization algorithm that maximizes the performance of a network by jointly optimizing a population of models and their hyperparameters. It determines a schedule of hyperparameter settings using an evolutionary strategy of exploitation and exploration - a much more powerful method than training with a fixed set of hyperparameters throughout, or than grid search and hand-tuning, which are time-consuming and difficult.
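The exploit/explore loop at the heart of PBT can be sketched on a toy population. This is an illustrative sketch only, not the code in pbt_main.py; the member structure, the truncation fraction (25%), and the perturbation factors (0.8x / 1.2x) are assumptions:

```python
import random

def pbt_step(population, explore_scale=(0.8, 1.2)):
    """One exploit/explore step of PBT on a toy population.

    Each member is a dict with a fitness 'score' and one hyperparameter 'lr'.
    """
    ranked = sorted(population, key=lambda m: m["score"], reverse=True)
    cutoff = max(1, len(ranked) // 4)  # bottom 25% copy from the top 25%
    for loser in ranked[-cutoff:]:
        winner = random.choice(ranked[:cutoff])
        # Exploit: copy the winner's hyperparameters (and, in real PBT, weights);
        # Explore: perturb the copied hyperparameter.
        loser["lr"] = winner["lr"] * random.choice(explore_scale)
        loser["score"] = winner["score"]
    return population
```

Repeating this step at fixed intervals during training is what produces the hyperparameter schedules shown in the plots below, rather than a single fixed setting.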
Implementation of the PBT-GAN experiments from the paper.
(Refer here for the toy experiments from the paper.)
It is recommended to run from a virtual environment to ensure all dependencies are met.
Compatible with both Python 2 and 3.
virtualenv -p python pbt_env
source pbt_env/bin/activate.csh
pip install -r requirements.txt
Per-worker GPU memory limits (as a percentage) can be set by uncommenting the gpu_options lines in pbt_main.py. This can be desirable for synchronous training, where several workers share one GPU.
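For reference, the TF1 option in question typically looks like the following. This is a sketch of the TF1 session-config API only; the actual lines and the memory fraction used live in pbt_main.py:

```python
import tensorflow as tf  # TF1-style API assumed

# Cap each worker's GPU memory (here ~25% of the device) so that
# multiple workers can run synchronously on a single GPU.
gpu_options = tf.GPUOptions(per_process_gpu_memory_fraction=0.25)
config = tf.ConfigProto(gpu_options=gpu_options)
sess = tf.Session(config=config)
```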
python pbt_main.py --ps_hosts=localhost:2222 --worker_hosts=localhost:2223,localhost:2224,localhost:2225,localhost:2226 --job_name=ps --task_index=0
python pbt_main.py --ps_hosts=localhost:2222 --worker_hosts=localhost:2223,localhost:2224,localhost:2225,localhost:2226 --job_name=worker --task_index=0
...
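The elided worker commands follow the same pattern with an incremented --task_index. A small helper (illustrative only, using the hosts from the example above) makes the pattern explicit:

```python
# Build the ps and worker launch commands shown above (illustrative helper).
PS_HOSTS = "localhost:2222"
WORKER_HOSTS = "localhost:2223,localhost:2224,localhost:2225,localhost:2226"

def make_commands(ps_hosts=PS_HOSTS, worker_hosts=WORKER_HOSTS):
    common = ["python", "pbt_main.py",
              "--ps_hosts=" + ps_hosts,
              "--worker_hosts=" + worker_hosts]
    commands = [common + ["--job_name=ps", "--task_index=0"]]
    for i in range(len(worker_hosts.split(","))):  # one command per worker
        commands.append(common + ["--job_name=worker", "--task_index=%d" % i])
    return commands
```

Each command is launched in its own process (e.g. one terminal or tmux pane per ps/worker task).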
python pbt_sequential.py
Results for synchronous training with 20 workers.
Left: PBT with 20 workers. Right: PBT compared with no-PBT (grey).
Note that PBT significantly outperforms the baseline (no PBT) by about 0.5 in inception score.
Left: smoothed plot. Right: raw plot.
Blue: with PBT; grey: no PBT.
Learning schedules discovered by PBT
The code will automatically restore from a previous save-point under ./checkpoint if one exists. TensorBoard files are stored under ./logs, and images under ./images. A pretrained model / checkpoint for 1 worker is provided under ./checkpoint. Unfortunately, due to space limitations, TensorBoard logs could not be uploaded.