
Conversation

@MekkCyber
Contributor

What does this PR do?

The model used to test the GGML conversion of falcon-7b in fp16 format is wrong:

[screenshot: tensor listing of the previous test model]

You can see that it contains some Q4 weights, which is unexpected in an `fp16` model, and its size is only 4GB when it should be around 7B parameters × 2 bytes ≈ 14GB. I did my own model conversion to GGUF to fix the issue:

[screenshot: tensor listing of the re-converted model]
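For anyone hitting the same problem, a quick way to catch it is to inspect the tensor dtypes recorded in the GGUF file itself. The sketch below uses the `gguf` Python package that ships with llama.cpp; the file name is a placeholder, and the snippet is an illustration rather than part of this PR:

```python
# Minimal sanity check for an "fp16" GGUF file, assuming the `gguf`
# package from llama.cpp (pip install gguf). The path is hypothetical.
from gguf import GGUFReader, GGMLQuantizationType

reader = GGUFReader("falcon-7b-f16.gguf")  # placeholder path

total_bytes = 0
for tensor in reader.tensors:
    total_bytes += int(tensor.n_bytes)
    # A genuine fp16 export should only contain F16 tensors (plus F32
    # for small tensors such as norms); any Q4_* entry means the file
    # is actually quantized despite its name.
    if tensor.tensor_type not in (GGMLQuantizationType.F16, GGMLQuantizationType.F32):
        print(f"unexpected dtype {tensor.tensor_type.name} in {tensor.name}")

# ~7B parameters at 2 bytes each should land near 14GB.
print(f"total tensor payload: {total_bytes / 1e9:.1f} GB")
```

Run against the old file, a check like this would flag the Q4 tensors and report a ~4GB payload, which is the mismatch described above showing up programmatically.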

Who can review?

@SunMarc

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@MekkCyber MekkCyber requested a review from SunMarc December 10, 2024 15:35
Member

@SunMarc SunMarc left a comment


Nice, thanks for uploading the right model! cc @Isotr0py

@MekkCyber MekkCyber merged commit 85eb339 into main Dec 16, 2024
12 checks passed
@MekkCyber MekkCyber deleted the fix_falcon_ggml branch December 16, 2024 12:21