Real-time portrait segmentation for mobile devices
Updated Jan 17, 2021 - Jupyter Notebook
Generate a quantization parameter file for ncnn framework int8 inference
BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).
Quantization Aware Training
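A record and summary of common problems encountered when deploying on-device models, along with their solutions, in the hope of helping others.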
TensorRT INT8 Python sample.
GPT-J 6B inference on TensorRT with INT-8 precision
VB.NET API wrapper for the LLM inference library chatllm.cpp
Generating a TensorRT model from ONNX
C# API wrapper for the LLM inference library chatllm.cpp
Compressed CNNs for airplane classification in satellite images (APoZ-based parameter pruning, INT8 weight quantization)
Supports OpenVINO-converted models such as yolov7-int.xml and yolov7x