alphadl

🎯

Developing LLM & its applications

Liang Ding alphadl

NLP/ML researcher (developing LLM and exploring its way to human-centric AGI).

197 followers · 198 following

JD Explore Academy, JD.com Inc.
Shanghai(CN) & Sydney(AU)
liamding.cc
@liangdingNLP
https://scholar.google.com/citations?user=lFCLvOAAAAAJ

Stars

haonan3 / AnchorContext

AnchorAttention: Improved attention for LLMs long-context training

Python 156 4 Updated Nov 21, 2024

hemingkx / SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

480 22 Updated Nov 19, 2024

deng0515001 / lnglat2Geo

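Offline package for converting latitude/longitude to province, city, district/county, and township. Uses a spatial query algorithm and is fast (50,000 lookups/s on a single thread), with 100% accuracy at the province, city, and district/county levels.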

Scala 178 74 Updated Oct 13, 2020

unslothai / unsloth

Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

Python 18,520 1,297 Updated Nov 23, 2024

dame-cell / Triformer

Transformers components but in Triton

Python 27 Updated Nov 18, 2024

zanchangtong / CSR4mBART

Python 5 Updated Nov 5, 2024

ruikangliu / IntactKV

Official PyTorch implementation of IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact

Python 33 1 Updated May 24, 2024

OctopusMind / DPO

Implementation of the DPO algorithm

Python 17 1 Updated Jun 12, 2024

FeiLiu36 / LLM4Opt

A Collection on Large Language Models for Optimization

152 17 Updated Oct 31, 2024

microsoft / OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 4,891 371 Updated Nov 5, 2024

p1k0pan / ICD

[ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding

Python 5 2 Updated Oct 6, 2024

jwkirchenbauer / lm-watermarking

Jupyter Notebook 527 66 Updated Mar 14, 2024

sail-sg / CPO

[NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.

Python 63 1 Updated Oct 18, 2024

sail-sg / SimLayerKV

The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.

Python 39 Updated Oct 18, 2024

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,241 218 Updated Nov 25, 2024

srush / awesome-o1

A bibliography and survey of the papers surrounding o1

TeX 801 36 Updated Nov 16, 2024

October2001 / Awesome-KV-Cache-Compression

📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

143 3 Updated Nov 25, 2024

GAIR-NLP / O1-Journey

O1 Replication Journey: A Strategic Progress Report – Part I

1,388 39 Updated Nov 24, 2024

wxjiao / LLM-Hands-On

Re-organized codes for developing LLM-based Chatbots, including DataProc, SFT, DPO, Demo, etc.

Python 8 1 Updated Aug 30, 2024

rhymes-ai / Aria

Codebase for Aria - an Open Multimodal Native MoE

Jupyter Notebook 854 72 Updated Nov 21, 2024

NumberChiffre / mcts-llm

Jupyter Notebook 63 1 Updated Nov 25, 2024

Zefan-Cai / KVCache-Factory

Unified KV Cache Compression Methods for Auto-Regressive Models

Jupyter Notebook 821 103 Updated Nov 22, 2024

Coldmist-Lu / MQM_APE

[MQM-APE] Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators.

Python 3 2 Updated Sep 24, 2024

mst272 / LLM-Dojo

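Welcome to LLM-Dojo, an open-source place for learning about large models. It uses concise, easy-to-read code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩‍🎓👨‍🎓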

Python 352 30 Updated Nov 8, 2024

HandsOnLLM / Hands-On-Large-Language-Models

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 2,348 464 Updated Oct 18, 2024

openpsi-project / ReaLHF

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python 126 5 Updated Nov 20, 2024

facebookresearch / RAM

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 148 9 Updated Nov 19, 2024

ezelikman / quiet-star

Code for Quiet-STaR

Python 654 89 Updated Aug 21, 2024

alphadl / CodeGen-USCD

Code Gen with "Uncertainty Aware Selective Contrastive Decoding"

Python 4 Updated Sep 23, 2024

juvi21 / CoPE-cuda

Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719

Python 20 Updated Jun 5, 2024
