
VLMs: major clean up 🧼 #34502

Open · zucchini-nlp wants to merge 20 commits into main
Conversation

@zucchini-nlp (Member) commented Oct 30, 2024

What does this PR do?

We have updated all the configs for VLMs on the Hub, so this PR removes the legacy path for these models; it has been around for three releases already, since v4.44. It also fixes some things that broke along the way, such as generating from text-only input in LLaVA models.
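For reference, text-only generation (one of the cases this fixes) looks roughly like the sketch below; the checkpoint name and prompt are illustrative assumptions, not part of this PR:

```python
# Minimal sketch of the text-only LLaVA generation path this PR fixes.
# The checkpoint and prompt are illustrative assumptions, not from the PR itself.
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "llava-hf/llava-1.5-7b-hf"  # assumed checkpoint, for illustration only
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(model_id)

# No image is passed and the prompt contains no <image> token, so the model
# should generate from text alone instead of erroring out.
inputs = processor(text="USER: Tell me a fun fact about llamas. ASSISTANT:", return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=50)
print(processor.batch_decode(output_ids, skip_special_tokens=True)[0])
```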

For Video-LLaVA the Hub configs cannot be updated, as the repo owner has been unresponsive for several months. Since there is only one model with this architecture, we can hardcode the default value for the patch count and remove the legacy path there as well.
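Roughly speaking, the hardcoded default can be derived from the vision settings; this is only a sketch with assumed numbers (the actual constant lives in the modeling code):

```python
# Sketch only: deriving a fixed default patch count for Video-LLaVA instead of
# reading it from Hub configs that cannot be updated. The image_size / patch_size
# values below are assumptions for illustration, not guaranteed to match the model.
def default_num_patches(image_size: int = 224, patch_size: int = 14) -> int:
    # One token per vision patch, plus one for the CLS token when it is kept.
    return (image_size // patch_size) ** 2 + 1

print(default_num_patches())  # 257 with the assumed values above
```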

fixes #34824, fixes #35169

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@ArthurZucker (Collaborator) left a comment


I don't think we need this; we deprecated the legacy path, so we can just remove it now, no?
I don't remember what we said for 4.46, but better to go with the non-legacy path now if we can!

@zucchini-nlp (Member, Author) commented Oct 30, 2024

We can remove it after updating the files on the Hub, and that also means we need to change the warning into an error so users have a chance to see the reason for the failure (a rough sketch of what that could look like is below).

I think the earliest we can remove it is the next release, because the blocking PR will probably be merged next week. After that I will take time to update all the Hub configs. Maybe then we'll wait for the blocking PR and remove all deprecation warnings?
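To make the failure visible, the legacy warning would become a hard error, something along these lines (the checked attribute and message are hypothetical placeholders, not the actual transformers code):

```python
# Hypothetical sketch: fail loudly instead of warning when a Hub config still
# uses the legacy format. The checked field and wording are placeholders.
def validate_vlm_config(config):
    if getattr(config, "required_new_field", None) is None:  # placeholder check
        # Before: logger.warning("Config is outdated, falling back to the legacy path...")
        # After: raise, so users immediately see why loading/generation fails.
        raise ValueError(
            "This checkpoint's config uses the legacy VLM format, which is no "
            "longer supported. Please update the config on the Hub or pin an "
            "older transformers release."
        )
```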

@zucchini-nlp changed the title from "Fix llava tests" to "VLMs: major clean up 🧼" on Nov 24, 2024
@ArthurZucker (Collaborator) commented

Sounds good, let's wait a bit!

@zucchini-nlp (Member, Author) commented

@ArthurZucker I think this can be reviewed now :)


Successfully merging this pull request may close these issues.

- LlavaForConditionalGeneration._merge_input_ids_with_image_features throws error
- Flash attention 2 broke when batch inference