Optimizing PyTorch models with 8-bit quantization using OpenVINO's Neural Network Compression Framework (NNCF).
This tutorial demonstrates how to use NNCF 8-bit quantization to optimize a PyTorch model for inference with the OpenVINO Toolkit. For more advanced usage, refer to the NNCF examples.
This notebook is based on the 'ImageNet training in PyTorch' example. To keep download and training times short, we use a ResNet-18 model with the Tiny ImageNet dataset.
It consists of the following steps:
- Transform the original FP32 model to INT8
- Use fine-tuning to restore the accuracy
- Export both the optimized and the original model to ONNX, then convert them to OpenVINO
- Measure and compare the performance of the models
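As background for the first step, 8-bit quantization maps FP32 values onto 256 INT8 codes using a scale and a zero point. The sketch below illustrates the affine quantize/dequantize round trip in plain Python; it is a standalone illustration of the arithmetic, not NNCF's internal implementation, and the range and values are made up for the example:

```python
def quantize(x, scale, zero_point, qmin=-128, qmax=127):
    """Affine 8-bit quantization: map a real value to an INT8 code."""
    q = round(x / scale) + zero_point
    return max(qmin, min(qmax, q))  # clamp into the INT8 range

def dequantize(q, scale, zero_point):
    """Map an INT8 code back to an approximate real value."""
    return (q - zero_point) * scale

# Derive scale/zero_point from an assumed observed FP32 range [-1.0, 1.0]
lo, hi = -1.0, 1.0
scale = (hi - lo) / 255   # 256 INT8 codes span the observed range
zero_point = 0            # symmetric range -> zero offset

x = 0.3
q = quantize(x, scale, zero_point)
x_hat = dequantize(q, scale, zero_point)
print(q, round(x_hat, 4))  # the reconstruction error is bounded by scale/2
```

The quantization error introduced here is what the fine-tuning step in this tutorial compensates for.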
If you have not done so already, please follow the Installation Guide to install all required dependencies.