
Adds model weights pruning #16256

Draft

Burhan-Q wants to merge 46 commits into main from prune

Conversation

@Burhan-Q
Contributor

@Burhan-Q Burhan-Q commented Sep 12, 2024

Summary

This is something I experimented with a while ago, but results were poor due to global pruning methods. Thanks to @lordofkillz for sharing his experiments, pruning now incurs only a minor accuracy drop with a small gain in inference speed.

Note

Saved model weights appear to have a slightly larger file size than the original weights. It is unclear why this occurs.
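One possible cause, assuming `prune_model` uses `torch.nn.utils.prune` under the hood: PyTorch's pruning reparametrization keeps both the original tensor (`weight_orig`) and a binary mask (`weight_mask`) on the module until `prune.remove()` is called, so a checkpoint saved before removal carries extra tensors. A minimal sketch of the `state_dict` difference:

```python
import torch.nn as nn
import torch.nn.utils.prune as prune

# Prune a single conv layer and inspect which tensors get saved.
conv = nn.Conv2d(3, 8, 3)
prune.l1_unstructured(conv, name="weight", amount=0.3)

# While the reparametrization is active, both the original weights and
# the binary mask are part of the state_dict, inflating the saved size.
print(sorted(conv.state_dict().keys()))  # ['bias', 'weight_mask', 'weight_orig']

# prune.remove() folds the mask into the weight and drops the extras.
prune.remove(conv, "weight")
print(sorted(conv.state_dict().keys()))  # ['bias', 'weight']
```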

Example

from ultralytics import YOLO
from ultralytics.utils.torch_utils import prune_model

# Load trained model
model = YOLO("yolov8m.pt")

prune_model(model, 0.3)  # model pruned in-place
# >>> Model sparsity achieved 29.96% from 0.00%

model.save("yolov8m-sparse-30.pt", False)

Performance

Testing on COCO128 using YOLOv8m shows a dip in performance when pruning at a target of 0.3 from the original weights. Only the summary (all) row and the first five class results are shown for brevity.

Normal model validation

yolo val model="yolov8m.pt" data="coco128.yaml" batch=1
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95):
                   all       5000      36335      0.716       0.61      0.667      0.501
                person       2693      10777      0.821      0.745      0.829      0.616
               bicycle        149        314      0.742      0.525      0.626      0.402
                   car        535       1918      0.765      0.637      0.713      0.498
            motorcycle        159        367      0.811      0.678      0.793      0.547
              airplane         97        143       0.84      0.884      0.925      0.776

Speed: 0.3ms preprocess, 8.7ms inference, 0.0ms loss, 1.1ms postprocess per image

Pruned model validation

yolo val model="yolov8m-sparse-30.pt" data="coco128.yaml" batch=1
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95):
                   all       5000      36335      0.706      0.595      0.653      0.489
                person       2693      10777       0.89      0.659      0.819      0.609
               bicycle        149        314      0.692      0.544      0.613      0.386
                   car        535       1918       0.79      0.595      0.704      0.487
            motorcycle        159        367      0.796      0.689      0.781       0.54
              airplane         97        143      0.821      0.902      0.916      0.767

Speed: 0.3ms preprocess, 8.7ms inference, 0.0ms loss, 1.1ms postprocess per image

Repeated inference evaluation results

Inference was run repeatedly on a directory of images and the overall inference runtime averaged.

model type      AVG inference time (ms)
Normal          17.570
Pruned @30%     17.492
Repeated evaluation code

import timeit
from pathlib import Path

import cv2 as cv

from ultralytics import YOLO
from ultralytics.utils import ASSETS

N = 17
R = 5
im = ASSETS / "bus.jpg"
img = cv.imread(str(im))
img_dir = Path(r"Q:\datasets\coco128\images\valid")
# Load all images
images = [cv.imread(str(f)) for f in img_dir.iterdir() if f.is_file()]

normal_model = YOLO("yolov8m.pt")
sparse_model = YOLO("yolov8m-sparse-30.pt")

def infer_sparse_model():
    sparse_model.predict(images, verbose=False)

def infer_normal_model():
    normal_model.predict(images, verbose=False)

if __name__ == '__main__':

    _ = normal_model.predict([img] * 15)  # warmup
    print(
        "Normal model averaged inference time:",
        round(sum(timeit.repeat(infer_normal_model, number=N, repeat=R)) / R, 3)
    )
    
    _ = sparse_model.predict([img] * 15)  # warmup
    print(
        "Sparse model averaged inference time:",
        round(sum(timeit.repeat(infer_sparse_model, number=N, repeat=R)) / R, 3)
    )
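For context on the printed values: `timeit.repeat` returns one wall-clock total per repeat, each covering `number=N` calls, so each printed value is the average time of N full passes over the image list, not a per-image figure. A standalone sketch of the same pattern on a dummy workload:

```python
import timeit


def task():
    """Stand-in for one full pass over the image list."""
    sum(range(10_000))


N, R = 17, 5
# timeit.repeat returns R totals; each total is the wall time of N calls to task().
totals = timeit.repeat(task, number=N, repeat=R)
avg_total = sum(totals) / R          # average wall time of one repeat (N calls)
per_call_ms = avg_total / N * 1e3    # average time of a single call, in ms
print(f"{per_call_ms:.3f} ms per call")
```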

Environment info

Ultralytics YOLOv8.2.92 🚀 Python-3.10.9 torch-2.3.1+cu121 CUDA:0 (NVIDIA GeForce RTX 3080, 12288MiB)
Setup complete ✅ (12 CPUs, 31.9 GB RAM, 778.9/1863.0 GB disk)

OS                  Windows-10-10.0.19045-SP0
Environment         Windows
Python              3.10.9
Install             git
RAM                 31.86 GB
CPU                 Intel Core(TM) i5-10600K 4.10GHz
CUDA                12.1

matplotlib          ✅ 3.8.1>=3.3.0
opencv-python       ✅ 4.8.1.78>=4.6.0
pillow              ✅ 9.3.0>=7.1.2
pyyaml              ✅ 6.0.1>=5.3.1
requests            ✅ 2.31.0>=2.23.0
scipy               ✅ 1.11.3>=1.4.1
torch               ✅ 2.3.1+cu121>=1.8.0
torchvision         ✅ 0.18.1+cu121>=0.9.0
tqdm                ✅ 4.66.1>=4.64.0
psutil              ✅ 5.9.6
py-cpuinfo          ✅ 9.0.0
thop                ✅ 0.1.1-2209072238>=0.1.1
pandas              ✅ 2.1.3>=1.1.4
seaborn             ✅ 0.13.0>=0.11.0

🛠️ PR Summary

Made with ❤️ by Ultralytics Actions

🌟 Summary

New functions for model pruning and zero-count testing were added to improve efficiency and analysis.

📊 Key Changes

  • Added zero_count function to count zero-valued parameters in PyTorch models.
  • Introduced prune_model function for L1 unstructured pruning of convolutional layers.
  • Implemented substr_in_set function to check substring presence in a set.
  • Added tests for these new functions to ensure proper functionality.
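
The approach named above (L1 unstructured pruning of convolutional layers) can be sketched with `torch.nn.utils.prune`. The helper names here (`prune_conv_layers`, `sparsity`) are hypothetical stand-ins for illustration, not the PR's actual `prune_model`/`zero_count` code:

```python
import torch.nn as nn
import torch.nn.utils.prune as prune


def prune_conv_layers(model: nn.Module, amount: float = 0.3) -> None:
    """Apply L1 unstructured pruning to every Conv2d layer, in-place.

    Hypothetical sketch of the described approach, not the PR's code.
    """
    for module in model.modules():
        if isinstance(module, nn.Conv2d):
            prune.l1_unstructured(module, name="weight", amount=amount)
            prune.remove(module, "weight")  # fold mask in, making zeros permanent


def sparsity(model: nn.Module) -> float:
    """Fraction of zero-valued weights across all Conv2d layers."""
    zeros = total = 0
    for module in model.modules():
        if isinstance(module, nn.Conv2d):
            zeros += int((module.weight == 0).sum())
            total += module.weight.numel()
    return zeros / total


model = nn.Sequential(nn.Conv2d(3, 16, 3), nn.ReLU(), nn.Conv2d(16, 32, 3))
prune_conv_layers(model, 0.3)
print(f"Sparsity: {sparsity(model):.2%}")  # ~30%
```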

🎯 Purpose & Impact

  • Efficiency: Pruning reduces model size, potentially improving performance and reducing resource usage. 🏋️‍♀️
  • Analysis: zero_count helps in assessing model sparsity, which is useful for optimization. 📊
  • Flexibility: The ability to exclude certain layers from pruning allows for more controlled and informed pruning strategies. 🎯

@Burhan-Q Burhan-Q added the enhancement New feature or request label Sep 12, 2024
@Burhan-Q Burhan-Q self-assigned this Sep 12, 2024
@UltralyticsAssistant UltralyticsAssistant added the detect Object Detection issues, PR's label Sep 12, 2024
@UltralyticsAssistant
Member

👋 Hello @Burhan-Q, thank you for submitting a PR to the ultralytics/ultralytics repository! 🚀 This is an automated response. An Ultralytics engineer will review your PR shortly.

To ensure a smooth integration of your work, please review the following checklist:

  • Define a Purpose: Clearly explain the purpose of your changes. It seems you have already done a great job describing your pruning improvements and performance results. Make sure your commit messages adhere to project conventions.
  • Synchronize with Source: Ensure your PR is up-to-date with the main branch. If not, rebase or merge the latest changes.
  • Verify CI Checks: Confirm all Continuous Integration (CI) checks are passing. Fix any issues if they arise.
  • Update Documentation: Modify relevant documentation for any new or updated features you introduced.
  • Include Tests: If applicable, update or provide new tests that cover your changes, ensuring all tests pass successfully.
  • Sign the CLA: If this is your first contribution, please sign the Contributor License Agreement by commenting, "I have read the CLA Document and I sign the CLA" below.

For more details, please refer to our Contributing Guide. Feel free to comment if you have any questions. Thank you for enhancing Ultralytics with your contributions! 🎉

@codecov

codecov bot commented Sep 12, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 69.87%. Comparing base (0ae9ee8) to head (2d68019).
Report is 1 commit behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main   #16256      +/-   ##
==========================================
+ Coverage   69.84%   69.87%   +0.02%     
==========================================
  Files         129      129              
  Lines       17095    17112      +17     
==========================================
+ Hits        11940    11957      +17     
  Misses       5155     5155              
Flag          Coverage Δ
Benchmarks    34.45% <30.00%> (-0.02%) ⬇️
GPU           36.10% <30.00%> (-0.02%) ⬇️
Tests         66.32% <100.00%> (+0.03%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@Burhan-Q Burhan-Q requested a review from Laughing-q September 13, 2024 12:28
@Laughing-q Laughing-q self-assigned this Oct 9, 2024
@github-actions

👋 Hello there! We wanted to let you know that we've decided to close this pull request due to inactivity. We appreciate the effort you put into contributing to our project, but unfortunately, not all contributions are suitable or aligned with our product roadmap.

We hope you understand our decision, and please don't let it discourage you from contributing to open source projects in the future. We value all of our community members and their contributions, and we encourage you to keep exploring new projects and ways to get involved.

For additional resources and information, please see the links below:

Thank you for your contributions to YOLO 🚀 and Vision AI ⭐

@github-actions github-actions bot added the Stale Stale and schedule for closing soon label Mar 12, 2025