Machine Learning & Backend Engineer
- Seoul • Bay Area (SF)
- sigridjin.medium.com
- @sigridjin_eth
- in/jinhyungp1
cohere-finetune Public
Forked from cohere-ai/cohere-finetune
A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models
-
RAG-Retrieval Public
Forked from NLPJCL/RAG-Retrieval
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.
Python MIT License Updated Nov 11, 2024 -
zerox Public
Forked from getomni-ai/zerox
Zero-shot PDF OCR with gpt-4o-mini
Python MIT License Updated Oct 23, 2024 -
kickstart.go Public template
Forked from raeperd/kickstart.go
Minimalistic HTTP server template in Go
Go MIT License Updated Oct 22, 2024 -
squirrel-vault Public
Forked from Hyune-s-lab/squirrel-vault
[side-project] Billing information collection & processing system - Squirrel Vault
Kotlin Updated Oct 19, 2024 -
py-clean-arch Public
Forked from cdddg/py-clean-arch
A Python implementation of Clean Architecture, inspired by Uncle Bob's book
Python Updated Oct 11, 2024 -
FlashRank Public
Forked from PrithivirajDamodaran/FlashRank
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & C…
-
late-chunking Public
Forked from jina-ai/late-chunking
Code for explaining and evaluating late chunking (chunked pooling)
Python Apache License 2.0 Updated Oct 8, 2024 -
entropix Public
Forked from xjdr-alt/entropix
Entropy Based Sampling and Parallel CoT Decoding
Jupyter Notebook Apache License 2.0 Updated Oct 5, 2024 -
Gokapi Public
Forked from Forceu/Gokapi
Lightweight self-hosted Firefox Send alternative without public upload. AWS S3 supported.
Go GNU Affero General Public License v3.0 Updated Sep 30, 2024 -
llama-stack Public
Forked from meta-llama/llama-stack
Model components of the Llama Stack APIs
Python MIT License Updated Sep 25, 2024 -
tutorials-kr Public
Forked from PyTorchKorea/tutorials-kr
🇰🇷 Repository for translating the tutorials provided by PyTorch into Korean 🇰🇷
Jupyter Notebook BSD 3-Clause "New" or "Revised" License Updated Sep 20, 2024 -
infinity Public
Forked from michaelfeil/infinity
Infinity is a high-throughput, low-latency REST API for serving text-embeddings, reranking models and CLIP
-
ComfyUI-Docker Public
Forked from YanWenKun/ComfyUI-Docker
🐳 Dockerfile for 🎨 ComfyUI. | Container image and startup scripts
Dockerfile Other Updated Sep 11, 2024 -
gpt_server Public
Forked from shell-nlp/gpt_server
gpt_server is an open-source framework for production-grade deployment of LLMs or embedding models.
Python Apache License 2.0 Updated Sep 8, 2024 -
latent-sae Public
Forked from enjalot/latent-sae
Training code for Sparse Autoencoders on Embedding models
Jupyter Notebook Updated Sep 7, 2024 -
rank_llm Public
Forked from castorini/rank_llm
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.
Python Apache License 2.0 Updated Sep 4, 2024 -
ebpf_exporter Public
Forked from cloudflare/ebpf_exporter
Prometheus exporter for custom eBPF metrics
Go MIT License Updated Sep 3, 2024 -
fastapi-best-practices Public
Forked from zhanymkanov/fastapi-best-practices
FastAPI Best Practices and Conventions we used at our startup
Updated Sep 3, 2024 -
mteb_ko_leaderboard Public
Forked from su-park/mteb_ko_leaderboard
Korean text embedding model leaderboard
Updated Sep 2, 2024 -
1.5-Pints Public
Forked from Pints-AI/1.5-Pints
A compact LLM pretrained in 9 days using high-quality data
Python MIT License Updated Aug 30, 2024 -
Liger-Kernel Public
Forked from linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
Python BSD 2-Clause "Simplified" License Updated Aug 27, 2024 -
tevatron Public
Forked from texttron/tevatron
Tevatron - A flexible toolkit for neural retrieval research and development.
Python Apache License 2.0 Updated Aug 24, 2024 -
Minitron Public
Forked from NVlabs/Minitron
A family of compressed models obtained via pruning and knowledge distillation
Updated Aug 23, 2024 -
swiftide Public
Forked from bosun-ai/swiftide
Fast, streaming indexing and query library for AI (RAG) applications, written in Rust
Rust MIT License Updated Aug 19, 2024 -
sglang Public
Forked from sgl-project/sglang
SGLang is yet another fast serving framework for large language models and vision language models.
Python Apache License 2.0 Updated Aug 17, 2024 -
marlin Public
Forked from IST-DASLab/marlin
FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.
Python Apache License 2.0 Updated Aug 15, 2024 -
text-embeddings-inference Public
Forked from huggingface/text-embeddings-inference
A blazing fast inference solution for text embeddings models
Rust Apache License 2.0 Updated Aug 15, 2024