Fudan University safety education test

HTML 14 4 Updated Aug 29, 2023

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,514 481 Updated May 31, 2024

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)

Python 266 11 Updated Nov 5, 2024

Official inference repo for FLUX.1 models

Python 17,482 1,237 Updated Nov 21, 2024

Official implementation of "SketchDeco: Decorating B&W Sketches with Colour"

Python 54 2 Updated Jun 26, 2024

Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model

Python 234 13 Updated Aug 6, 2024

Diffusion Feedback Helps CLIP See Better

Python 218 12 Updated Aug 24, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 12,607 1,177 Updated Oct 14, 2024
Python 7 Updated Jul 26, 2024

Exploring Fine-Grained Representation and Recomposition for Cloth-Changing Person Re-Identification [TIFS 2024]

Python 11 1 Updated Jul 1, 2024

A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"

1,209 58 Updated Mar 14, 2024

Deep Q-Network (DQN) and Fitted Q-Iteration (FQI) tutorial for RL Summer School 2023

Jupyter Notebook 52 6 Updated Nov 5, 2024

✨✨Latest Advances on Multimodal Large Language Models

12,779 813 Updated Nov 24, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,830 182 Updated Oct 31, 2024

[CVPR2024] CapHuman: Capture Your Moments in Parallel Universes

Python 92 7 Updated Nov 20, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Python 2,530 155 Updated Oct 10, 2024

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

Jupyter Notebook 1,697 99 Updated Oct 10, 2024

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Python 2,976 265 Updated Oct 22, 2024

[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Python 851 54 Updated Aug 21, 2024

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3,487 204 Updated Nov 21, 2024

Official implementations for paper: Anydoor: zero-shot object-level image customization

Python 4,008 368 Updated Apr 8, 2024

Official implementations for paper: LivePhoto: Real Image Animation with Text-guided Motion Control

183 3 Updated Dec 18, 2023

[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper

Python 128 7 Updated May 7, 2024

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD, TMLR 2024)

Python 435 28 Updated Sep 9, 2024

🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)"

Python 154 7 Updated Apr 9, 2024

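《代码随想录》 LeetCode practice guide: a recommended order for 200 classic problems, about 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, and more than 50 mind maps, with versions in C++, Java, Python, Go, JavaScript, and other languages, so that learning algorithms is no longer confusing! 🔥🔥 Take a look; you will wish you had found it sooner! 🚀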

Shell 52,208 11,571 Updated Nov 25, 2024

⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡

Python 2,934 201 Updated Nov 26, 2023

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,507 976 Updated Jul 26, 2024

[CVPR 2024] The official implementation of the paper "Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding"

Jupyter Notebook 30 Updated Nov 12, 2024

Generative Models by Stability AI

Python 24,711 2,744 Updated Sep 4, 2024