Skip to content

Issues: huggingface/transformers

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

LlavaForConditionalGeneration._merge_input_ids_with_image_features throws error bug Multimodal WIP Label your PR/Issue with WIP for some long outstanding Issues/PRs that are work in progress
#35169 opened Dec 9, 2024 by NicolasDrapier
4 tasks done
Add GOT-OCR 2.0 to Transformers Multimodal New model run-slow
#34721 opened Nov 13, 2024 by yonigozlan Loading…
3 tasks done
Compile Grounding DINO Compilation Issues related to torchdynamo and torchinductor Feature request Request for a new feature Multimodal Vision
#34556 opened Nov 1, 2024 by pspdada
Vision (Auto)Processor multiple images finetuning example. Examples Which is related to examples in general Feature request Request for a new feature Multimodal
#34489 opened Oct 29, 2024 by lovodkin93
Add support for Aria model Feature request Request for a new feature Multimodal New model
#34078 opened Oct 10, 2024 by fakerybakery
Video Processor as a separate class Feature request Request for a new feature Multimodal Vision
#33504 opened Sep 16, 2024 by zucchini-nlp
6 tasks
Track progress for VLMs refactoring Generation Multimodal Vision WIP Label your PR/Issue with WIP for some long outstanding Issues/PRs that are work in progress
#33374 opened Sep 8, 2024 by zucchini-nlp
13 of 16 tasks
ProTip! Find all open issues with in progress development work with linked:pr.