Famous Vision Language Models and Their Architectures
Updated Feb 24, 2025
A minimal codebase for fine-tuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v, and others.
Mark web pages for use with vision-language models
A Qwen-VL base model for use with Autodistill.