This repo contains code for the first- and second-order minimax optimization methods studied in *Newton-type Methods for Minimax Optimization*: gradient descent ascent (GDA), total gradient descent ascent (TGDA), follow-the-ridge (FR), gradient descent Newton (GDN), and complete Newton (CN). We run these algorithms on several tasks: estimating the mean and covariance of a single Gaussian, learning a mixture of Gaussians, and generating digits from the MNIST 0/1 subset. A GPU with CUDA is required to run all of our experiments.
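As a quick illustration of the simplest of these updates, the sketch below runs simultaneous GDA on a toy saddle problem. This is only a sketch of the idea, not the implementation used in `run.py`.

```python
# Minimal sketch of simultaneous gradient descent ascent (GDA) on
# min_x max_y f(x, y). Illustrative only; not the implementation in run.py.
import torch

def gda_step(f, x, y, lr=0.1):
    """One GDA step: gradient descent on x, gradient ascent on y."""
    gx, gy = torch.autograd.grad(f(x, y), [x, y])
    with torch.no_grad():
        x -= lr * gx  # min player descends
        y += lr * gy  # max player ascends
    return x, y

# Toy saddle problem f(x, y) = x^2 - y^2, whose saddle point is (0, 0).
x = torch.tensor(1.0, requires_grad=True)
y = torch.tensor(1.0, requires_grad=True)
for _ in range(200):
    x, y = gda_step(lambda u, v: u**2 - v**2, x, y)
print(x.item(), y.item())  # both approach 0
```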
- `model.py` contains the neural net architectures for the various tasks.
- `data.py` generates the various datasets.
- `run.py` is the main Python script for comparing the different algorithms.
- `utils.py` implements various helper functions, including the Hessian-vector product and conjugate gradient.
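For background on those last two helpers, the sketch below shows one common way to compute a Hessian-vector product with double backpropagation and to solve a linear system with conjugate gradient. It is only an illustration under standard PyTorch assumptions, not the exact code in `utils.py`.

```python
# Illustrative Hessian-vector product (double backprop) and conjugate
# gradient solver; a sketch of the idea, not the exact code in utils.py.
import torch

def hvp(loss, params, vec):
    """Hessian-vector product H @ vec, where H = d^2 loss / d params^2."""
    grads = torch.autograd.grad(loss, params, create_graph=True)
    flat = torch.cat([g.reshape(-1) for g in grads])
    prod = torch.autograd.grad(flat @ vec, params, retain_graph=True)
    return torch.cat([p.reshape(-1) for p in prod])

def conjugate_gradient(mvp, b, iters=10, tol=1e-10):
    """Solve A x = b given only the matrix-vector product mvp(v) = A v."""
    x = torch.zeros_like(b)
    r = b.clone()              # residual b - A x (x starts at zero)
    p = r.clone()
    rs_old = r @ r
    for _ in range(iters):
        Ap = mvp(p)
        alpha = rs_old / (p @ Ap)
        x += alpha * p
        r -= alpha * Ap
        rs_new = r @ r
        if rs_new.sqrt() < tol:
            break
        p = r + (rs_new / rs_old) * p
        rs_old = rs_new
    return x
```

Second-order updates of the kind listed above typically combine the two to apply an (approximate) inverse Hessian to a vector, e.g. `conjugate_gradient(lambda v: hvp(loss, params, v), g)`.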
The following bash scripts contain the configurations for the various experiments. Uncomment the relevant parts of each script to run the different algorithms (GDA/TGDA/FR/GDN/CN).
- `bash_gaussian_mean.sh`: estimation of the mean of a Gaussian
- `bash_gaussian_covariance.sh`: estimation of the covariance of a Gaussian
- `bash_gmm.sh`: learning a mixture of Gaussians
- `bash_gmm.sh`: learning to generate digits from the MNIST 0/1 dataset
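For example, `bash bash_gaussian_mean.sh` launches the Gaussian mean estimation experiment with whichever algorithm blocks are left uncommented in the script.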
The folder `./checkpoints` contains two pretrained models, one for GMM and one for MNIST. We used the model trained by GDA as initialization.
- `plot_gmm.py`: plotting script for visualizing the Gaussian mixtures generated by different algorithms
- `plot_mnist.py`: plotting script for visualizing the digits generated by different algorithms
If you use our code, please cite the following reference:
```
@inproceedings{zhang2020newton,
  title={Newton-type methods for minimax optimization},
  author={Zhang, Guojun and Wu, Kaiwen and Poupart, Pascal and Yu, Yaoliang},
  booktitle={ICML Workshop on Beyond First-Order Methods in ML Systems},
  year={2021},
  url={https://arxiv.org/abs/2006.14592}
}
```