🎯
Developing LLM & its applications

AnchorAttention: Improved attention for LLMs long-context training

Python 156 4 Updated Nov 21, 2024

📰 Must-read papers and blogs on Speculative Decoding ⚡️

480 22 Updated Nov 19, 2024

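Offline package for converting latitude/longitude to province, city, district/county, and township. Uses a spatial query algorithm and is fast (50,000 lookups/s on a single thread), with 100% accuracy at the province, city, and district/county levels.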

Scala 178 74 Updated Oct 13, 2020

Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

Python 18,520 1,297 Updated Nov 23, 2024

Transformers components but in Triton

Python 27 Updated Nov 18, 2024
Python 5 Updated Nov 5, 2024

Official PyTorch implementation of IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact

Python 33 1 Updated May 24, 2024

Implementation of the DPO algorithm

Python 17 1 Updated Jun 12, 2024

A Collection on Large Language Models for Optimization

152 17 Updated Oct 31, 2024

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 4,891 371 Updated Nov 5, 2024

[ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding

Python 5 2 Updated Oct 6, 2024
Jupyter Notebook 527 66 Updated Mar 14, 2024

[NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.

Python 63 1 Updated Oct 18, 2024

The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.

Python 39 Updated Oct 18, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,241 218 Updated Nov 25, 2024

A bibliography and survey of the papers surrounding o1

TeX 801 36 Updated Nov 16, 2024

📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

143 3 Updated Nov 25, 2024

O1 Replication Journey: A Strategic Progress Report – Part I

1,388 39 Updated Nov 24, 2024

Re-organized codes for developing LLM-based Chatbots, including DataProc, SFT, DPO, Demo, etc.

Python 8 1 Updated Aug 30, 2024

Codebase for Aria - an Open Multimodal Native MoE

Jupyter Notebook 854 72 Updated Nov 21, 2024
Jupyter Notebook 63 1 Updated Nov 25, 2024

Unified KV Cache Compression Methods for Auto-Regressive Models

Jupyter Notebook 821 103 Updated Nov 22, 2024

[MQM-APE] Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators.

Python 3 2 Updated Sep 24, 2024

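Welcome to LLM-Dojo, an open-source place for learning about large models. It uses concise, easy-to-read code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩‍🎓👨‍🎓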

Python 352 30 Updated Nov 8, 2024

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 2,348 464 Updated Oct 18, 2024

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python 126 5 Updated Nov 20, 2024

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 148 9 Updated Nov 19, 2024

Code for Quiet-STaR

Python 654 89 Updated Aug 21, 2024

Code Gen with "Uncertainty Aware Selective Contrastive Decoding"

Python 4 Updated Sep 23, 2024

Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719

Python 20 Updated Jun 5, 2024