Skip to content

v0.12.0: Multi-LoRA prefix caching, fp8 kv cache, Mllama, function calling

Latest
Compare
Choose a tag to compare
@tgaddair tgaddair released this 06 Nov 21:21
· 21 commits to main since this release
e03f989

🎉 Enhancements

🐛 Bugfixes

📝 Docs

  • added metrics docs, updated links in main docs by @noyoshi in #663

🔧 Maintenance

New Contributors

Full Changelog: v0.11.0...v0.12.0