[WIP] Add flex attention for Qwen2VL #35112
Conversation
```python
score += causal_mask[b][0][q_idx][kv_idx]
return score

attn_output, attn_weights = flex_attention(
```
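For context, here is a minimal, self-contained sketch of the FlexAttention pattern quoted above: a `score_mod` that adds a precomputed additive mask bias, passed to `torch.nn.attention.flex_attention.flex_attention` (PyTorch >= 2.5). The tensor shapes, the `causal_mask` construction, and the use of `return_lse=True` are illustrative assumptions, not the PR's actual implementation.

```python
# Minimal sketch (not the PR's code): exercising torch's flex_attention with a
# score_mod that adds a precomputed additive mask, assuming torch >= 2.5.
import torch
from torch.nn.attention.flex_attention import flex_attention

batch, heads, q_len, kv_len, head_dim = 1, 8, 16, 16, 64
query = torch.randn(batch, heads, q_len, head_dim)
key = torch.randn(batch, heads, kv_len, head_dim)
value = torch.randn(batch, heads, kv_len, head_dim)

# Illustrative additive mask of shape [batch, 1, q_len, kv_len]:
# 0.0 where attention is allowed, a large negative value where it is masked.
causal_mask = torch.triu(
    torch.full((q_len, kv_len), -1e9), diagonal=1
)[None, None, :, :].expand(batch, 1, q_len, kv_len)

def score_mod(score, b, h, q_idx, kv_idx):
    # Add the mask bias for this (batch, query, key) position, as in the hunk above.
    return score + causal_mask[b][0][q_idx][kv_idx]

# return_lse=True additionally returns the log-sum-exp of the attention scores;
# whether the PR surfaces that tensor as "attn_weights" is an assumption here.
attn_output, lse = flex_attention(query, key, value, score_mod=score_mod, return_lse=True)
print(attn_output.shape, lse.shape)  # (1, 8, 16, 64), (1, 8, 16)
```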
I'm having a hard time getting tests to pass with flex attention.
Can I merge the refactor for now, and add flex attention as a follow-up?
ArthurZucker left a comment
Hey, sorry for the late review! We ended up refactoring the API a bit more, and I was off for a week! 🤗
Hope we did not deter you from contributing, and thanks for opening the PR! 🤗
What does this PR do?
Addresses #34809 (issue)
Who can review?
@ArthurZucker