Skip to content
View Confusezius's full-sized avatar
🥦
🥦

Highlights

  • Pro

Block or report Confusezius

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]

35 1 Updated Aug 26, 2024

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 1,348 70 Updated Nov 23, 2024

[NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization

Python 102 8 Updated Oct 12, 2024
Python 10 Updated May 31, 2024
Jupyter Notebook 7 2 Updated Mar 23, 2024

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,346 158 Updated Aug 23, 2024

Pretrained deep learning models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc.

Python 238 24 Updated Aug 12, 2023

Official repository of Evolutionary Optimization of Model Merging Recipes

Python 1,231 90 Updated Mar 30, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 2,078 154 Updated Nov 23, 2024
Python 702 46 Updated Mar 6, 2024

(ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life

Python 316 13 Updated Jul 10, 2024

VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning

Python 89 10 Updated Sep 17, 2024

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 478 236 Updated Jul 4, 2024

Official repository for "Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model" [ICLR 2024 spotlight]

7 Updated Feb 20, 2024

Unofficial implementation of "SODA: Bottleneck Diffusion Models for Representation Learning"

Jupyter Notebook 77 4 Updated Mar 21, 2024

[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"

Python 50 5 Updated Jul 4, 2024

✨✨Latest Advances on Multimodal Large Language Models

12,754 812 Updated Nov 22, 2024

Consistency Distilled Diff VAE

Python 2,137 76 Updated Nov 7, 2023

DataComp: In search of the next generation of multimodal datasets

Python 660 55 Updated Jan 2, 2024

A curated list of plugins that you can add to your FiftyOne install!

Python 101 15 Updated Nov 23, 2024

Refine high-quality datasets and visual AI models

Python 8,903 565 Updated Nov 23, 2024

Python package to download and use the SSB datasets

Python 11 3 Updated Aug 3, 2023

{KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch

Python 207 20 Updated Oct 7, 2024

This is the repository for the Photorealistic Unreal Graphics (PUG) datasets for representation learning.

Jupyter Notebook 230 12 Updated Apr 4, 2024

Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".

Python 281 14 Updated Oct 22, 2024

Implementation of Discrete Key / Value Bottleneck, in Pytorch

Python 87 3 Updated Jul 9, 2023

[ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Tianlong Chen, Ying Ding, and Zhangyang Wang

Python 11 2 Updated Nov 28, 2023

An extension of the PyTorch library containing various tools for performing deep learning in hyperbolic space.

Python 137 9 Updated Jun 10, 2024
Next