Stars
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)
Official inference repo for FLUX.1 models
Official implementation of "SketchDeco: Decorating B&W Sketches with Colour"
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Exploring Fine-Grained Representation and Recomposition for Cloth-Changing Person Re-Identification [TIFS 2024]
A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"
Deep Q-Network (DQN) and Fitted Q-Iteration (FQI) tutorial for RL Summer School 2023
✨✨Latest Advances on Multimodal Large Language Models
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
[CVPR2024] CapHuman: Capture Your Moments in Parallel Universes
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"
Official implementation of the paper "LivePhoto: Real Image Animation with Text-guided Motion Control"
[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD, TMLR 2024)
🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)"
"代码随想录" LeetCode problem-solving guide: a recommended order for 200 classic problems, 600k words of detailed illustrated explanations, video breakdowns of difficult points, 50+ mind maps, and solutions in C++, Java, Python, Go, JavaScript, and more. No more confusion when learning algorithms! 🔥🔥 Take a look, and you'll wish you had found it sooner! 🚀
⚡LLM Zoo is a project that provides data, models, and evaluation benchmarks for large language models.⚡
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
[CVPR 2024] The official implementation of the paper "Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding"
Generative Models by Stability AI