Confusing error message #34658

Open
jpiabrantes opened this issue Nov 8, 2024 · 7 comments

@jpiabrantes

"A decoder-only architecture is being used, but right-padding was detected! For correct "

The error message says:

"A decoder-only architecture is being used, but right-padding was detected! For correct generation results, please set padding_side='left' when initializing the tokenizer."

But the code does not check whether tokenizer.padding_side == "left". Instead, it checks whether the last token id of the input is the padding token, which is often the case when people set tokenizer.pad_token_id = tokenizer.eos_token_id.
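For concreteness, here is a minimal sketch of the false positive (the model name and prompts are illustrative, and the exact condition inside generate() may differ slightly across transformers versions):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Padding is already on the left, as the warning recommends.
tok = AutoTokenizer.from_pretrained("gpt2", padding_side="left")
tok.pad_token_id = tok.eos_token_id  # common workaround for models without a pad token
model = AutoModelForCausalLM.from_pretrained("gpt2")

# A batch of prompts whose final token happens to be EOS (e.g. a turn terminator).
batch = tok(["Hello there", "Nice to meet you"], return_tensors="pt", padding=True)
eos_col = torch.full((batch.input_ids.shape[0], 1), tok.eos_token_id)
input_ids = torch.cat([batch.input_ids, eos_col], dim=-1)
attention_mask = torch.cat([batch.attention_mask, torch.ones_like(eos_col)], dim=-1)

# generate() only inspects the last column of input_ids; because it equals
# pad_token_id (== eos_token_id here), the right-padding warning fires even
# though padding_side is "left".
out = model.generate(input_ids, attention_mask=attention_mask, max_new_tokens=5)
```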

@LysandreJik
Member

Indeed! cc @gante, @zucchini-nlp

@Rocketknight1
Member

gentle ping @gante @zucchini-nlp

@zucchini-nlp
Member

@jpiabrantes sorry for the late reply. Do you mean there are cases where you want to generate from an input that ends with an EOS token?

The warning is mostly advice for beginners, since generating with right padding can produce gibberish or lower-quality text. So we point out the best practices to those who don't have much experience with generation. If you already have padding set on the correct side, you can ignore the warning.
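As a sketch of the practice the warning points to (model name and prompts are illustrative):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2", padding_side="left")
tok.pad_token = tok.eos_token
model = AutoModelForCausalLM.from_pretrained("gpt2")

batch = tok(["The capital of France is", "Hi"], return_tensors="pt", padding=True)

# With left padding, the last position of every row is a real prompt token,
# so each sequence is continued from its own prompt rather than from pad tokens.
out = model.generate(**batch, max_new_tokens=10, pad_token_id=tok.eos_token_id)
print(tok.batch_decode(out, skip_special_tokens=True))
```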

@jpiabrantes
Author

jpiabrantes commented Dec 11, 2024 via email

@zucchini-nlp
Member

@jpiabrantes yep, we could check the tokenizer's attribute directly, but since the tokenizer is not a required kwarg when calling generate(), we opted to check the inputs instead. What I can think of now is to change the warning level so that the warning can be suppressed.
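Until then, one way to silence it is to lower the library's log verbosity; a sketch, assuming the message goes through the transformers logger (if your version emits it via Python's warnings module instead, a warnings filter applies):

```python
import warnings
from transformers.utils import logging

# If the message goes through the transformers logger: hide warning-level
# messages, keep errors.
logging.set_verbosity_error()

# If it is emitted via Python's warnings module instead:
warnings.filterwarnings("ignore", message="A decoder-only architecture is being used")
```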

some generations will reach eos faster than

True, but we don't try to generate from an EOS token, right? When we just pass inputs to the generate() method, the tokenizer doesn't add any EOS, so the prompt can be continued by an LLM.

@jpiabrantes
Author

jpiabrantes commented Dec 11, 2024 via email

@zucchini-nlp
Member

Yes, that's an option in case users decide to pass the tokenizer. The initial idea was to allow the tokenizer as an arg in special cases, like when we have stop_strings or assisted decoding. Would you like to open a PR for that? Feel free to tag gante and myself :)
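For reference, a sketch of the existing pattern where the tokenizer is already passed to generate() (here for stop_strings); a padding_side check could piggyback on the same kwarg whenever it is provided. Model name and prompt are illustrative:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2", padding_side="left")
tok.pad_token = tok.eos_token
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tok("Q: What is 2+2?\nA:", return_tensors="pt")
out = model.generate(
    **inputs,
    max_new_tokens=20,
    stop_strings=["\n"],   # stop_strings requires passing the tokenizer
    tokenizer=tok,         # when present, generate() could also verify padding_side
    pad_token_id=tok.eos_token_id,
)
print(tok.decode(out[0], skip_special_tokens=True))
```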
