    Repositories list

    • diffusers

      Public
      🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
      Python
      Apache License 2.0
      6.9k100 Updated Mar 24, 2026
    • sglang

      Public
      SGLang is a fast serving framework for large language models and vision language models.
      Python
      Apache License 2.0
      5k100 Updated Mar 24, 2026
    • LeetCUDA

      Public
      📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
      Cuda
      GNU General Public License v3.0
      1k10k20 Updated Mar 23, 2026
    • quack

      Public
      A Quirky Assortment of CuTe Kernels
      Python
      Apache License 2.0
      98200 Updated Mar 23, 2026
    • cutlass

      Public
      CUDA Templates and Python DSLs for High-Performance Linear Algebra
      C++
      Other
      1.7k100 Updated Mar 19, 2026
    • 📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉
      Python
      GNU General Public License v3.0
      2652800 Updated Mar 19, 2026
    • Awesome-LLM-Inference

      Public
      📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
      Python
      GNU General Public License v3.0
      3495.1k10 Updated Mar 19, 2026
    • lite.ai.toolkit

      Public
      🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉
      C++
      GNU General Public License v3.0
      7754.4k10 Updated Mar 19, 2026
    • vllm-omni

      Public
      A framework for efficient model inference with omni-modality models
      Python
      Apache License 2.0
      616100 Updated Mar 12, 2026
    • ao

      Public
      PyTorch native quantization and sparsity for training and inference
      Python
      Other
      466100 Updated Mar 10, 2026
    • nunchaku

      Public
      [ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
      Python
      Apache License 2.0
      232300 Updated Feb 24, 2026
    • ffpa-attn

      Public
      🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.
      Cuda
      GNU General Public License v3.0
      1425500 Updated Feb 13, 2026
    • Cache-DiT Node for Comfyui
      Python
      Apache License 2.0
      13100 Updated Feb 3, 2026
    • Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics a…
      Cuda
      Apache License 2.0
      378000 Updated Jan 22, 2026
    • cache-dit

      Public
      A Unified and Flexible Inference Engine with Hybrid Cache Acceleration and Parallelism for 🤗DiTs.
      Python
      Apache License 2.0
      66400 Updated Jan 21, 2026
    • flux-fast

      Public
      A forked version of flux-fast that makes flux-fast even faster with cache-dit.
      Python
      17400 Updated Jan 5, 2026
    • Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
      Python
      Apache License 2.0
      463100 Updated Jan 1, 2026
    • Z-Image

      Public
      Python
      Apache License 2.0
      712100 Updated Dec 25, 2025
    • NVIDIA cuTile learn
      Python
      2000 Updated Dec 9, 2025
    • .github

      Public
      0100 Updated Nov 25, 2025
    • ImageReward

      Public
      [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
      Python
      Apache License 2.0
      90000 Updated Oct 30, 2025
    • 🔥LongCat-Video 1.7x🎉 speedup: cache acceleration and 4/8-bits weight only.
      Python
      0810 Updated Oct 28, 2025
    • Python
      MIT License
      331000 Updated Oct 28, 2025
    • ComfyUI

      Public
      The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
      Python
      GNU General Public License v3.0
      12k000 Updated Oct 27, 2025
    • ⚡️Qwen-Image 4.8x🎉 speedup with Hybrid Acceleration for low VRAM GPUs
      Python
      Apache License 2.0
      01740 Updated Oct 24, 2025
    • Kandinsky 5.0: A family of diffusion models for Video & Image generation
      Python
      Apache License 2.0
      56000 Updated Oct 22, 2025
    • Wan2.1

      Public
      Wan: Open and Advanced Large-Scale Video Generative Models
      Python
      Apache License 2.0
      2.5k100 Updated Oct 17, 2025
    • Wan2.2

      Public
      Wan: Open and Advanced Large-Scale Video Generative Models
      Python
      Apache License 2.0
      1.8k000 Updated Oct 17, 2025
    • Enjoy the magic of Diffusion models!
      Python
      Apache License 2.0
      1.2k000 Updated Oct 13, 2025
    • HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
      Python
      Other
      150100 Updated Oct 4, 2025