Pinned Loading
Repositories
Showing 10 of 152 repositories
- vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
recogni/vllm’s past year of commit activity - open-register-design-tool Public Forked from Juniper/open-register-design-tool
Tool to generate register RTL, models, and docs using SystemRDL or JSpec input
recogni/open-register-design-tool’s past year of commit activity - hf-hub Public Forked from huggingface/hf-hub
Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package
recogni/hf-hub’s past year of commit activity - microxcaling_traceable Public Forked from microsoft/microxcaling
PyTorch emulation library for Microscaling (MX)-compatible data formats.
recogni/microxcaling_traceable’s past year of commit activity - foundation-model-stack Public Forked from foundation-model-stack/foundation-model-stack
🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.
recogni/foundation-model-stack’s past year of commit activity - licenseheaders Public Forked from torsten-pf/licenseheaders
Simple python script to add/replace license headers in a directory tree of source files
recogni/licenseheaders’s past year of commit activity - Atom_communication Public Forked from efeslab/Atom
[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
recogni/Atom_communication’s past year of commit activity