Skip to content
View dosco's full-sized avatar
The usual
The usual

Block or report dosco

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The unofficial DSPy framework. Build LLM powered Agents and "Agentic workflows" based on the Stanford DSP paper.

TypeScript 1,157 80 Updated Nov 22, 2024

TypeScript to C++ compiler.

TypeScript 104 2 Updated Sep 10, 2024

@mention people in a textarea

JavaScript 6 4 Updated Mar 10, 2023

Train a tiny Llama3 model with parquet datasets, using JavaScript.

JavaScript 7 Updated Jul 20, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 8,229 1,153 Updated Nov 8, 2024

Efficient Triton Kernels for LLM Training

Python 3,490 208 Updated Nov 23, 2024

Simple and fast low-bit matmul kernels in CUDA / Triton

Python 147 12 Updated Nov 23, 2024

A MLX port of FLUX based on the Huggingface Diffusers implementation.

Python 1,010 60 Updated Nov 23, 2024

Official repository of Evolutionary Optimization of Model Merging Recipes

Python 1,231 90 Updated Mar 30, 2024

Low code framework to build and launch a crew of AI agents with shared state. Built with https://axllm.dev.

TypeScript 6 4 Updated Nov 23, 2024

Fast and memory-efficient exact attention

Python 14,350 1,344 Updated Nov 23, 2024

A plugin for Jupyter Notebook to run CUDA C/C++ code

Jupyter Notebook 201 88 Updated Sep 13, 2024

LLM101n: Let's build a Storyteller

30,253 1,652 Updated Aug 1, 2024

LLM training in simple, raw C/CUDA

Cuda 87 6 Updated May 1, 2024

Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.

Python 16 3 Updated Sep 29, 2024

Efficient GPU kernels for block-sparse matrix multiplication and convolution

Cuda 1,027 202 Updated Jun 8, 2023

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,622 206 Updated Nov 22, 2024

Implementation of Diffusion Transformer (DiT) in JAX

Python 252 4 Updated Jun 11, 2024

CUDA Learning guide

Cuda 257 26 Updated Jun 20, 2024

Sparse autoencoders

Python 346 49 Updated Nov 22, 2024
Jupyter Notebook 83 5 Updated Feb 29, 2024
Python 198 10 Updated Jul 15, 2024

GPU programming related news and material links

1,244 74 Updated Sep 23, 2024

Make triton easier

Python 41 Updated Jun 12, 2024

Gemma 2B with 10M context length using Infini-attention.

Python 950 60 Updated May 12, 2024

The simplest way to set up your tsconfig.json

889 17 Updated Aug 24, 2024
Python 189 7 Updated May 1, 2024

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 2,414 256 Updated Aug 13, 2024

Simple and readable code for training and sampling from diffusion models

Python 208 18 Updated Nov 19, 2024

Top-down 2D RPG

Zig 232 10 Updated Jun 10, 2024
Next