-
Notifications
You must be signed in to change notification settings - Fork 27.3k
Issues: huggingface/transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
LlavaForConditionalGeneration._merge_input_ids_with_image_features throws error
bug
Multimodal
WIP
Label your PR/Issue with WIP for some long outstanding Issues/PRs that are work in progress
#35169
opened Dec 9, 2024 by
NicolasDrapier
4 tasks done
[Idefics 3] Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1
bug
Multimodal
Vision
#35031
opened Nov 30, 2024 by
shyshin
2 of 4 tasks
Flash attention 2 broke when batch inference
bug
Multimodal
Vision
#34824
opened Nov 20, 2024 by
pspdada
2 of 4 tasks
Add GOT-OCR 2.0 to Transformers
Multimodal
New model
run-slow
#34721
opened Nov 13, 2024 by
yonigozlan
Loading…
3 tasks done
uniformize kwargs for SAM
Multimodal
Processing
Vision
#34578
opened Nov 2, 2024 by
tibor-reiss
Loading…
Compile Grounding DINO
Compilation
Issues related to torchdynamo and torchinductor
Feature request
Request for a new feature
Multimodal
Vision
#34556
opened Nov 1, 2024 by
pspdada
uniformize kwargs for OneFormer
Multimodal
Processing
Vision
#34547
opened Oct 31, 2024 by
tibor-reiss
Loading…
Vision (Auto)Processor multiple images finetuning example.
Examples
Which is related to examples in general
Feature request
Request for a new feature
Multimodal
#34489
opened Oct 29, 2024 by
lovodkin93
Add support for Aria model
Feature request
Request for a new feature
Multimodal
New model
#34078
opened Oct 10, 2024 by
fakerybakery
Add Loss Functions for QFormer Training in BLIP-2 Model (ITC, ITM, and ITG)
Feature request
Request for a new feature
Multimodal
#34019
opened Oct 8, 2024 by
thisisiron
Enabled Flash Attention for PaliGemma models
Flash Attention
Multimodal
run-slow
#34009
opened Oct 7, 2024 by
aroun-coumar
Loading…
1 of 5 tasks
Add ColPali to 🤗 transformers
Multimodal
New model
run-slow
Vision
#33736
opened Sep 26, 2024 by
tonywu71
Loading…
16 tasks done
Add support for Molmo
Feature request
Request for a new feature
Multimodal
New model
Vision
#33710
opened Sep 26, 2024 by
fakerybakery
Qwen2-VL: Multi-GPU training
bug
Distributed Training / Models
Feature request
Request for a new feature
Multimodal
trainer
Vision
#33666
opened Sep 23, 2024 by
ManuelFay
2 of 4 tasks
Video Processor as a separate class
Feature request
Request for a new feature
Multimodal
Vision
#33504
opened Sep 16, 2024 by
zucchini-nlp
6 tasks
The same situation as #31377 occurred when using Qwen/Qwen2-VL-7B-Instruct
bug
Cache
Multimodal
#33399
opened Sep 10, 2024 by
toondata
3 of 4 tasks
Track progress for VLMs refactoring
Generation
Multimodal
Vision
WIP
Label your PR/Issue with WIP for some long outstanding Issues/PRs that are work in progress
#33374
opened Sep 8, 2024 by
zucchini-nlp
13 of 16 tasks
Support Unified Multimodal Model
Feature request
Request for a new feature
Multimodal
New model
#33368
opened Sep 7, 2024 by
KevinZeng08
Kosmos-2.5 implementation in transformers
Multimodal
New model
#30877
opened May 17, 2024 by
Natyren
2 tasks done
Mixture of All Intelligence (MoAI)
Multimodal
New model
#29823
opened Mar 23, 2024 by
Dev-Khant
2 tasks done
ProTip!
Find all open issues with in progress development work with linked:pr.