Stochastic Non-convex Optimization

This is the repository of the Machine learning course project, working with the

The main purpose of this projects was to explore, implement some of the most used optimization algorithms and compare them with Stocastic Cubic Regularization (SCRN) and Stocastic Cubic Regularization with Momentum (SCRN_Momentum) in a non-convex problem.

Optimizer implemented in this repository:

SGD
Adam
Stochastic Cubic Regularization (SCRN)
Stochastic Cubic Regularization with Momentum (SCRN_Momentum)

Files in this repository:

optimizers.py: contains the optimizers implemented in this repository
models.py: Contain the final model implemented, an enconder with 3 conv net and 3 linear layers.
utils.py

How to run the code

Create a conda environment with the following command:

conda create --name <env> --file requirements.txt

conda  activate <env>

To run one model you have the following args

dataset: dataset to use, MNIST, CIFAR10 or CIFAR100.
conv_numbers: number of conv layers, default 3.
linear_numbers: number of linear layers, default 3.
hidden: size of the linera layers, default 128.
epochs: number of epochs to run the training defautl 2.
batch_size: batch size, default 100.
lr: learning rate, default 0.001.
optimizer: optimizer to use, SGD, Adam, Sophia, SCRN or SCRN_Momentum. can be one or a list
activation: activation function, default relu.
scheduler: set a scheduler for the learning rate.
verbose: Whether to print detailed training progress.
save: save the model, default False.
save_path: path to save the model, default ./models/.
model_selection: for grid search on learning rate on the optimizers required.

Examples

run one optimizer over one learning rate

Run Neural network of 3 convolutaional layer, 3 fully connected layers, over 50 epochs over MNIST dataset, over Adam SGD, SCRN and SCRN_Momentum optimizer with learning rate 0.001, 0.1, 0.001 and 0.001 respectively, verbose and save the model.

python main.py --dataset MNIST --num_layers 3 --conv_number 3 --epochs 50 --lr 0.001,0.1,0.001,0.001 --optimizer Adam,SGD,SCRN,SCRN_Momentum --verbose --save

Datsets

MNIST
CIFAR10
CIFAR100

Principal references

Stochastic Cubic Regularization for Non-Convex Optimization
Adam: A Method for Stochastic Optimization

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Stochastic Non-convex Optimization

How to run the code

Examples

run one optimizer over one learning rate

Datsets

Principal references

Files

README.md

Latest commit

History

README.md

File metadata and controls

Stochastic Non-convex Optimization

How to run the code

Examples

run one optimizer over one learning rate

Datsets

Principal references