Stars
Free PyCharm image viewer plugin for visualizing and debugging NumPy, OpenCV, PyTorch, TensorFlow, JAX, and PIL data.
P2P web chat and file transfer, file synchronization application using WebRTC
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
A 3D computer vision development toolkit based on PaddlePaddle. It supports point-cloud object detection, segmentation, and monocular 3D object detection models.
A KDE wallpaper plugin integrating Wallpaper Engine
Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts
fabio-sim / LightGlue-ONNX
Forked from cvg/LightGlue
ONNX-compatible LightGlue: Local Feature Matching at Light Speed. Supports TensorRT, OpenVINO
pix2pix3D: Generating 3D Objects from 2D User Inputs
Real-Time end-to-end 2D-to-3D Video Conversion, based on deep learning.
[ECCV 2024] Official implementation of the paper "TAPTR: Tracking Any Point with Transformers as Detection"
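Uses GitHub Actions to mirror Docker images from overseas registries into a private Alibaba Cloud registry for use by servers in mainland China. Free and easy to use.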
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, L…
[CVPR 2024] The official implementation of "MoCha-Stereo: Motif Channel Attention Network for Stereo Matching". & [Arxiv] The official implementation of "Motif Channel Opened in a White-Box: Stereo …
ffmpegcv is an FFmpeg-backed video reader and writer with an OpenCV-like interface
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Sample code for world-class Artificial Intelligence SoCs for computer vision applications.
Linux BSP app & sample for axpi pro (ax650n)
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
A WebGPU-accelerated ONNX inference run-time written 100% in Rust, ready for native and the web
WyattBlue / pyav
Forked from PyAV-Org/PyAV
Python bindings for ffmpeg libraries
Inspects Windows Bluetooth A2DP Codec without WPA tooling