Quantization-Aware Training: An Example for Resnet18 in PyTorch

This example demonstrates how to use Post-Training Quantization API from Neural Network Compression Framework (NNCF) to quantize and train PyTorch models on the example of Resnet18 quantization aware training, pretrained on Tiny ImageNet-200 dataset.

The example includes the following steps:

Loading the Tiny ImageNet-200 dataset (~237 Mb) and the Resnet18 PyTorch model pretrained on this dataset.
Quantizing the model using NNCF Post-Training Quantization algorithm.
Fine tuning quantized model for one epoch to improve quantized model metrics.
Output of the following characteristics of the quantized model:
- Accuracy drop of the quantized model (INT8) over the pre-trained model (FP32)
- Compression rate of the quantized model file size relative to the pre-trained model file size
- Performance speed up of the quantized model (INT8)

Prerequisites

Before running this example, ensure you have Python 3.10+ installed and set up your environment:

1. Create and activate a virtual environment

python3 -m venv nncf_env
source nncf_env/bin/activate  # On Windows: nncf_env\Scripts\activate.bat

2. Install NNCF and other dependencies

python3 -m pip install ../../../../ -r requirements.txt

Run Example

It's pretty simple. The example does not require additional preparation. It will do the preparation itself, such as loading the dataset and model, etc.

python main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Quantization-Aware Training: An Example for Resnet18 in PyTorch

Prerequisites

1. Create and activate a virtual environment

2. Install NNCF and other dependencies

Run Example

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Quantization-Aware Training: An Example for Resnet18 in PyTorch

Prerequisites

1. Create and activate a virtual environment

2. Install NNCF and other dependencies

Run Example