My name is Daniele Affinita, and this report documents the work I completed during the Google Summer of Code 2024 for OpenCV, under the mentorship of Yuantao Feng.
Integrated Blockwise Quantization into OpenCV's DNN module, focusing on compressing model size for deployment on memory-constrained devices. Developed a tool to quantize models blockwise, achieving a 2-4x reduction in model size. Evaluated the performance, showing that blockwise quantized models retain accuracy closer to the original compared to standard int8 quantization.