- Tokyo
- 10:20 (UTC +09:00)
- @azooKey_dev
- @miwa_ensan
Starred repositories
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
CUDA accelerated rasterization of gaussian splatting
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
Famous Vision Language Models and Their Architectures
First-two-char input method using transformer-based language model and n-gram model.
Documentation of OpenType shaping behavior
Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"
This repository contains programs for reconstructing 3D space using OpenSfM and Gaussian Splatting techniques. It allows users to generate point clouds from images captured by a 360-degree camera u…
A curated list of awesome LLM for Autonomous Driving resources (continually updated)
This repo contains the code for 1D tokenizer and generator
Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223
regrettable-username / llm.metal
Forked from karpathy/llm.c
LLM training in simple, raw C/Metal Shading Language
WIP
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Swift Package to implement a transformers-like API in Swift