    Repositories list

    • Building the Virtuous Cycle for AI-driven LLM Systems
      Python
      1711249 Updated Jan 7, 2026
    • whl

      Public
      Pre-built wheels for the flashinfer Python package.
      HTML
      4200 Updated Jan 7, 2026
    • FlashInfer: Kernel Library for LLM Serving
      Python
      6254.4k27365 Updated Jan 7, 2026
    • Project website of FlashInfer project
      SCSS
      4020 Updated Jan 3, 2026
    • cubloaty

      Public
      A size profiler for CUDA binaries
      Python
      06910 Updated Oct 7, 2025
    • web-data

      Public
      0000 Updated Jun 25, 2025
    • Python
      36500 Updated Apr 26, 2025
    • Simple Python library for generating your own Perfetto traces for your application. It can be used for both app instrumentation and custom trace generation.
      Python
      7100 Updated Apr 16, 2025
    • flashinfer-nightly

      Public archive
      FlashInfer Nightly
      1600 Updated Apr 9, 2025
    • 0400 Updated Apr 2, 2025
    • Jupyter Notebook
      0200 Updated Jan 10, 2025
    • Debug print operator for CUDA graph debugging
      Cuda
      21411 Updated Aug 2, 2024
    • The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
      16k000 Updated Apr 21, 2024
    • candle

      Public
      Minimalist ML framework for Rust
      Rust
      1.4k000 Updated Mar 7, 2024