kvcache-reuse

Here are 2 public repositories matching this topic...

jjang-ai / vmlx

vMLX - JANGTQ Uber Compressed MLX Models - L2 Disk Cache (survives restart) + L1 Paged (super fast ttft) + Hybrid SSM Scheduler + Cont Batching + etc!

macbook persistent-memory mlx openai-api llm lmstudio anthropic-api mcp-server kvcache-optimization kvcache-compression openclaw kvcache-reuse openclaw-agent prefix-cache mlxllm mlxstudio vmlx omlx omlx-alternative

Updated Jul 2, 2026
Python

BJTU-ANT / CacheRoute

Star

CacheRoute is an innovative LLM scheduling scheme dedicated to enabling flexible KV cache reuse across LLM systems, improving task performance and system efficiency.

network routing knowledge-injection llm vllm llm-inference kvcache lmcache llm-task-scheduling kvcache-reuse

Updated Jul 2, 2026
Python

Improve this page

Add a description, image, and links to the kvcache-reuse topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the kvcache-reuse topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly