Skip to content
View yinqiwen's full-sized avatar
:octocat:
:octocat:

Block or report yinqiwen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A general-purpose lightweight C++ graph library

C++ 163 40 Updated Nov 19, 2024

ALP: Adaptive Lossless Floating-Point Compression

C++ 60 7 Updated Nov 20, 2024

C library for accessing the PostgreSQL parser outside of the server environment

C 1,211 182 Updated Oct 31, 2024

GLake: optimizing GPU memory management and IO transmission.

Python 381 33 Updated Aug 3, 2024

C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types and contiguous memory storage

C++ 2,528 314 Updated Nov 22, 2024

libshmcache is a local cache in the share memory for multi processes. high performance due to read is lockless. libshmcache is 100+ times faster than a remote interface such as redis.

C 450 136 Updated Jan 23, 2024

GLM-4-Voice | 端到端中英语音对话模型

Python 2,314 187 Updated Nov 11, 2024

A High-Performance JIT-Based C++ Expression/Script Execution Engine with SIMD Vectorization Support

C++ 69 4 Updated Nov 18, 2024

Official inference framework for 1-bit LLMs

C++ 11,395 769 Updated Nov 11, 2024

Ultra High-performance Lightweight Embedded and Server OLTP RDBMS✨

C 184 12 Updated Nov 15, 2024

A native PyTorch Library for large model training

Python 2,641 205 Updated Nov 23, 2024

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 29,243 12,071 Updated Nov 23, 2024

Vector class library, latest version

C++ 1,309 148 Updated Feb 1, 2024

Dynamic Memory Management for Serving LLMs without PagedAttention

C 241 16 Updated Nov 14, 2024

Low-bit LLM inference on CPU with lookup table

C++ 588 44 Updated Nov 19, 2024

Very fast, high quality, platform-independent hashing algorithm.

C++ 206 13 Updated Nov 5, 2024

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 65,244 33,363 Updated Nov 23, 2024

LLM101n: Let's build a Storyteller

30,242 1,652 Updated Aug 1, 2024

DuckDB is an analytical in-process SQL database management system

C++ 24,498 1,937 Updated Nov 22, 2024

A composable and fully extensible C++ execution engine library for data management systems.

C++ 3,532 1,161 Updated Nov 23, 2024

Library for specialized dense and sparse matrix operations, and deep learning primitives.

C 850 183 Updated Nov 22, 2024

C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))

C++ 2,218 258 Updated Nov 21, 2024

Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 768 144 Updated Nov 22, 2024

Tile primitives for speedy kernels

Cuda 1,665 70 Updated Nov 23, 2024

Scalable radix top-k selection on GPUs

Cuda 8 1 Updated May 6, 2024

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 5,994 509 Updated Nov 22, 2024

Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

Python 18,428 1,292 Updated Nov 23, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 6,179 525 Updated Nov 23, 2024

cuDF - GPU DataFrame Library

C++ 8,461 908 Updated Nov 23, 2024

[Start here!] Flow-IPC - Modern C++ toolkit for high-speed inter-process communication (IPC)

C++ 297 11 Updated Nov 14, 2024
Next