Calibration dataset in INT8 Qunatization #44

phanben110 · 2022-12-21T08:55:45Z

phanben110
Dec 21, 2022

Why does quantization INT8 need to have calibration dataset, but FP16, FP32 don't need ?

Dec 21, 2022

Since we generally train models in FP32, we don't require any further calibration. And when you convert your model from FP32 to FP16, we just drop half of those 32bits and you loose some precision.

However, if we do the same to go from FP16 to INT8, we'd loose all the precision and most of the numbers would just become zero. That's why instead of dropping bits, we map FP16 values to INT8 values to capture the range. And to capture this range properly, we run our FP16 models with a group of expected inputs values, which helps us in minimizing precision loss. To know more about this topic, I'd recommend this article

Now what should be your calibration dataset, depends on you…

View full answer

mavihs7 · 2022-12-21T20:39:21Z

mavihs7
Dec 21, 2022
Collaborator

Hi @phanben110,

Since we generally train models in FP32, we don't require any further calibration. And when you convert your model from FP32 to FP16, we just drop half of those 32bits and you loose some precision.

However, if we do the same to go from FP16 to INT8, we'd loose all the precision and most of the numbers would just become zero. That's why instead of dropping bits, we map FP16 values to INT8 values to capture the range. And to capture this range properly, we run our FP16 models with a group of expected inputs values, which helps us in minimizing precision loss. To know more about this topic, I'd recommend this article

Now what should be your calibration dataset, depends on your use case. Sometimes people just randomly sample from their training dataset, or sometimes they selectively sample for a specific use case. For example, let's say you trained a model with animal dataset where images were captured in all conditions (i.e. different weathers, low light, etc.). But you're deploying this model to certain parts of the forest and you expect to capture only certain types of images. So when doing the INT8 quantization, you'd want to calibrate with a more selective and relevant dataset to minimize precision loss for that specific use case.

Hope this helps.

2 replies

phanben110 Dec 22, 2022
Author

Thanks for your answer @mavihs7 ❤️,

I have one more question to ask you. Have you ever done INT8 Quantization Aware Training task with yolov5s model?

I have completed the Post Training Quantization task from your repository team, and I want to do the QAT (Quantization Aware Training) task for evaluation performance.

Thanks!

mavihs7 Dec 22, 2022
Collaborator

Glad to be of help @phanben110 !

I haven't trained yolov5s in some time, so I'm not sure if ultralytics now support QAT. I would recommend you to directly ask this question on their official repo https://github.com/ultralytics/yolov5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Calibration dataset in INT8 Qunatization #44

{{title}}

Replies: 1 comment 2 replies

{{title}}

{{title}}

{{title}}

Select a reply

Calibration dataset in INT8 Qunatization #44

phanben110 Dec 21, 2022

Replies: 1 comment · 2 replies

mavihs7 Dec 21, 2022 Collaborator

phanben110 Dec 22, 2022 Author

mavihs7 Dec 22, 2022 Collaborator

phanben110
Dec 21, 2022

Replies: 1 comment 2 replies

mavihs7
Dec 21, 2022
Collaborator

phanben110 Dec 22, 2022
Author

mavihs7 Dec 22, 2022
Collaborator