xin-ran-w

Follow

🔬

brain storm

Xinran Wang xin-ran-w

🔬

brain storm

Follow

I am currently pursuing a PhD 🎓 in the field of computer vision (CV).

17 followers · 41 following

Achievements

Achievements

Organizations

xin-ran-w/README.md

Hi there 👋

🎓 I'm currently pursuing a PhD in the field of multimodal learning.
🔭 I’m currently working on expressing visual content using language (e.g., image captioning, object description).
🌱 I’m looking to collaborate on using high-quality captions to train a diffusion / auto-regressive generation model to generate high-quality visual content (video/image/3D model).
📫 How to reach me: [email protected].

Pinned Loading

PRIS-CV/CineTechBench PRIS-CV/CineTechBench Public

A Benchmark for Cinematographic Technique Understanding and Generation

Python 23
CapAgent CapAgent Public

From Simple to Professional - A Combinatorial Controllable Image Captioning Agent

Python 7 1
PRIS-CV/ControllableObjectDescription PRIS-CV/ControllableObjectDescription Public

A training-free pipeline to control dimension details in object description.

Python 5 1
Caption2SceneGraph Caption2SceneGraph Public

A parser tool using large language model and vision experts to parse the input caption into a scene graph

Python