Skip to content
View tsujuifu's full-sized avatar
⚔️
RS @ Apple
⚔️
RS @ Apple

Highlights

  • Pro

Block or report tsujuifu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 2,850 222 Updated Nov 16, 2024

Lightweight Python framework that provides a high-level API for creating and rendering scenes with Blender.

Python 763 20 Updated Nov 18, 2024

[NeurIPS 2024] NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing

Python 137 11 Updated Oct 20, 2024

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 4,846 505 Updated Nov 20, 2024

A curated list of papers, code and resources pertaining to image composition/compositing or object insertion, which aims to generate realistic composite image.

1,182 127 Updated Nov 24, 2024

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 523 58 Updated Nov 1, 2024

Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models

Python 50 3 Updated May 25, 2024

A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.

Python 84 8 Updated Sep 21, 2024

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Python 2,668 188 Updated Nov 22, 2024

we propose FlexEdit, an end-to-end image editing method that leverages both free-shape masks and language instructions for Flexible Editing.

24 Updated Aug 22, 2024

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

Python 177 10 Updated Sep 16, 2024

SigLIP-based Aesthetic Score Predictor

Python 142 1 Updated Oct 22, 2024

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Jupyter Notebook 400 12 Updated May 24, 2024

This is the official reproduction of FancyVideo.

Python 820 71 Updated Oct 30, 2024
Python 68 12 Updated Oct 17, 2024
Python 176 9 Updated Jul 23, 2024
Python 139 9 Updated Sep 12, 2024

[CVPR 2024] Official code for "Text-Driven Image Editing via Learnable Regions"

Python 266 21 Updated Sep 28, 2024

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Jupyter Notebook 8,115 861 Updated Jul 26, 2024

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,176 111 Updated Nov 3, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 12,546 1,171 Updated Oct 14, 2024

[MM 2024 Oral] Refiner for AIGC

Jupyter Notebook 23 1 Updated Jul 29, 2024

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Python 1,448 121 Updated Jul 17, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,696 889 Updated Oct 22, 2024
Python 2,936 252 Updated Oct 16, 2024

Code and dataset for AAAI 2022 paper "CAISE: Conversational Agent for Image Search and Editing" Hyounghun Kim, Doo Soon Kim, Seunghyun Yoon, Franck Dernoncourt, Trung Bui, and Mohit Bansal

Python 9 Updated May 6, 2022

Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation

Python 12 2 Updated Oct 28, 2024

Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"

Python 503 22 Updated Aug 16, 2024
Jupyter Notebook 21 Updated Nov 13, 2024
Next