KCAPTCHA-Solver

Simple Numerical KCAPTCHA solver using YOLOv8

Usage

Training

Install requirements with below command
```
pip3 install -r requirements.txt
```
Generate KCAPTCHA dataset
1. Generate Image-label set with kcaptcha-generator
  - Recommand to generate >50K image-label set for training
2. Split them into train, test, val set
3. Reformat BBOX label to YOLO label format using bbox2yolo.py
```
usage: bbox2yolo.py [-h] [--path PATH]

options:
-h, --help   show this help message and exit
--path PATH  dataset root path
```
Configure your training config files as you want

Train YOLO with below command

usage: train.py [-h] [--config CONFIG] [--log-path PATH] [--devices DEVICES] [--dataloader-workers WORKERS] [--img-size IMG_SIZE] [--model MODEL] [--epochs EPOCHS]
                [--batch-size BATCH_SIZE] [--no-finetune] [--optimizer OPTIMIZER] [--val]

options:
-h, --help            show this help message and exit
--config CONFIG       path to yolo config file
--log-path PATH       Relative path to save model
--devices DEVICES     Devices to use on training
--dataloader-workers WORKERS
                        Dataloader workers count
--img-size IMG_SIZE   max width of image
--model MODEL         model to use
--epochs EPOCHS       Epochs to train
--batch-size BATCH_SIZE
                        batch size to use on training
--no-finetune         Train network from scratch
--optimizer OPTIMIZER
                        Optimizer to use
--val                 Use validation on training

Inference

Pretrained checkpoint available in huggingface.

Inference KCAPTCHA with below command

usage: inference.py [-h] [--model MODEL] [--input INPUT] [--label LABEL]

options:
-h, --help     show this help message and exit
--model MODEL  Checkpoint path to use
--input INPUT  KCAPTCHA image file to read
--label LABEL  classID-to-Label mapped yaml path

On test, Model's benchmark accuracy is 97.6%

Onnx export

Export model to onnx with below command

python3 export_to_onnx.py --input <model path>

Quantizaion

Prepare kcaptcha dataset(for best, prepare different dataset that doesn't used on training)

Export Quantized model with below command

usage: export_to_onnx.py [-h] --input INPUT [--output OUTPUT] [--quantize] [--calibration-dataset CALIBRATION_DATASET] [--quant_format {QOperator,QDQ}]
                        [--per_channel PER_CHANNEL]

options:
-h, --help            show this help message and exit
--input INPUT         input yolov8 torch model
--output OUTPUT       output onnx model path for quantized model
--quantize            flag for quantize onnx model. Require --calibration-dataset and --quant-format
--calibration-dataset CALIBRATION_DATASET calibration data set path
--quant_format {QOperator,QDQ}
--per_channel PER_CHANNEL

Onnx Inference

Inference KCAPTCHA with below command with onnx weight

usage: inference_onnx.py [-h] [--model MODEL] [--input INPUT] [--label LABEL] [--c_thres C_THRES] [--iou_thres IOU_THRES]

options:
-h, --help            show this help message and exit
--model MODEL         Checkpoint path to use
--input INPUT         KCAPTCHA image file to read
--label LABEL         classID-to-Label mapped yaml path
--c_thres C_THRES     Confidence threshold for filtering detections.
--iou_thres IOU_THRES IoU threshold for non-maximum suppression.

Server Usage

Pretrained checkpoint available in huggingface.

Only ONNX format weight accepted by server for performance reason. Quantized model(best_int8.onnx) is highly recommanded.

default exposed port is 8000

Local inference

Install requirements with below command
```
pip3 install -r requirements-server.txt
```

Start inference server with below command.

usage: main.py [-h] [--host HOST] [--port PORT] [--model MODEL] [--label LABEL] [--c_thres C_THRES] [--iou_thres IOU_THRES]

options:
-h, --help            show this help message and exit
--host HOST           Server host
--port PORT           Server port
--model MODEL         Checkpoint path to use
--label LABEL         classID-to-Label mapped yaml path
--c_thres C_THRES     Confidence threshold for filtering detections.
--iou_thres IOU_THRES IoU threshold for non-maximum suppression.

Docker

Docker run

docker run -d --restart always \
  -e MODEL_FILE_NAME=best_int8.onnx \
  -p 8000:8000 \
  -v ~/.model:/model \
  ghcr.io/returntofirst/kcaptcha-solver:latest

deploy model file on ~/.model and run this command.

Docker-compose

Use docker-compose.yaml to deploy with docker-compose. deploy model file on ./model and run this command.

Kubenetes

Use k8s-example.yaml to deploy with kubectl.

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
.github/workflows		.github/workflows
configs		configs
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
bbox2yolo.py		bbox2yolo.py
docker-compose.yaml		docker-compose.yaml
export_to_onnx.py		export_to_onnx.py
inference.py		inference.py
inference_onnx.py		inference_onnx.py
k8s-example.yaml		k8s-example.yaml
labels.yaml		labels.yaml
main.py		main.py
requirements-server.txt		requirements-server.txt
requirements.txt		requirements.txt
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KCAPTCHA-Solver

Usage

Training

Inference

Onnx export

Quantizaion

Onnx Inference

Server Usage

Local inference

Docker

Docker run

Docker-compose

Kubenetes

About

Releases

Packages

Languages

License

ReturnToFirst/KCAPTCHA-Solver

Folders and files

Latest commit

History

Repository files navigation

KCAPTCHA-Solver

Usage

Training

Inference

Onnx export

Quantizaion

Onnx Inference

Server Usage

Local inference

Docker

Docker run

Docker-compose

Kubenetes

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages