Skip to content
View huangyz0918's full-sized avatar

Organizations

@gsoc-cn @msra-alumni @MLSysOps

Block or report huangyz0918

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Making Long-Context LLM Inference 10x Faster and 10x Cheaper

Python 240 25 Updated Nov 22, 2024

Large Language Model Text Generation Inference

Python 9,134 1,075 Updated Nov 22, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 71,669 8,519 Updated Nov 13, 2024

A fully open-sourced alternative NotebookLM

2 Updated Oct 17, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 16,258 1,625 Updated Oct 15, 2024

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 523 57 Updated Nov 1, 2024

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Python 1,243 50 Updated Nov 6, 2024

An open-source RAG-based tool for chatting with your documents.

Python 17,485 1,352 Updated Nov 20, 2024

A 3DGS framework for omni urban scene reconstruction and simulation.

Python 595 51 Updated Sep 6, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 6,177 524 Updated Nov 22, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 19,348 1,914 Updated Nov 22, 2024

Set of tools to assess and improve LLM security.

Python 2,731 452 Updated Nov 19, 2024

The Memory layer for your AI apps

Python 22,951 2,113 Updated Nov 22, 2024

🕹️ Open-source, developer-first LLMOps platform designed to streamline prompt design, version management, instant delivery, collaboration, troubleshooting, observability and more.

TypeScript 2,532 210 Updated Nov 20, 2024

Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

Python 13,558 1,318 Updated Nov 22, 2024

Network Analysis in Python

Python 14,987 3,253 Updated Nov 21, 2024

A Blazing Fast AI Gateway with integrated Guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.

TypeScript 6,324 451 Updated Nov 22, 2024

A survey of Code Agents / Foundation Models for improving development productivity. Become 10x SWE, MLE, etc.

10 Updated Aug 20, 2024

aider is AI pair programming in your terminal

Python 22,382 2,076 Updated Nov 23, 2024

Build resilient language agents as graphs.

Python 6,788 1,096 Updated Nov 23, 2024

Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capa…

Python 9,647 1,013 Updated Oct 22, 2024

Drag & drop UI to build your customized LLM flow

TypeScript 31,719 16,547 Updated Nov 21, 2024

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 16,218 1,516 Updated Nov 20, 2024

Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

Python 18,416 1,290 Updated Nov 23, 2024

The first real AI developer

Python 31,927 3,215 Updated Oct 3, 2024

Vision agent

Python 1,332 139 Updated Nov 22, 2024

LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path Planning

Python 29 Updated Sep 19, 2024

A self-organizing file system with llama 3

Jupyter Notebook 4,966 314 Updated Oct 24, 2024

🚀CodiumAI PR-Agent: An AI-Powered 🤖 Tool for Automated Pull Request Analysis, Feedback, Suggestions and More! 💻🔍

Python 6,128 597 Updated Nov 21, 2024

🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Anthropic, Ollama, etc s…

Python 1,098 49 Updated Nov 19, 2024
Next