-
Zhejiang University Research Fellow | PhD Fudan University & Westlake University
- Hangzhou, Zhejiang, China
Stars
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
The jetson-examples repository by Seeed Studio offers a seamless, one-line command deployment to run vision AI and Generative AI models on the NVIDIA Jetson platform.
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…
中文nlp解决方案(大模型、数据、模型、训练、推理)
Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey"
Stonefish - an advanced C++ simulation library designed for (but not limited to) marine robotics.
ROS package implementing an interface for the Stonefish library.
Dataset proposed in "Self-supervised Monocular Underwater Depth Recovery, Image Restoration, and a Real-sea Video Dataset", ICCV 2023
aaronhd / CoACD
Forked from SarahWeiii/CoACD[SIGGRAPH2022] Approximate Convex Decomposition for 3D Meshes with Collision-Aware Concavity and Tree Search
aaronhd / roboagent
Forked from robopen/roboagentRepository to train and evaluate our universal agent (Code coming soon! Stay tuned!)
✨✨Latest Advances on Multimodal Large Language Models
Teriyaki: A Framework to Generate Neurosymbolic PDDL-compliant Planners
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos, ECCV 2022
Code for the paper Learning Visible Connectivity Dynamics for Cloth Smoothing
[ECCV 2022] Compositional Generation using Diffusion Models
Community for applying LLMs to robotics and a robot simulator with ChatGPT integration
Large language models for PDDL domains
Neural Grasp Distance Fields for Robot Manipulation