tokenizer 'padding' param is not correct. #669

xgwang · 2025-04-10T15:58:09Z

background:
i was evaluating models (such as Qwen2.5-7B-Instruct) against AIME 2024 dataset, the output seems not good. after digging, the tokenizer pad the input to be max_length so the output is always 1 token.
after param changed to 'longest', the generation works good.

otherwise, the response length is always 1 which is unexpected

HuggingFaceDocBuilderDev · 2025-04-17T10:39:40Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

xgw and others added 3 commits April 7, 2025 10:35

change tokenizer to pad to 'longest' sequence, instead of 'max_length'

1104426

otherwise, the response length is always 1 which is unexpected

Merge remote-tracking branch 'origin/main' into xg-fix-greeduntil

362c249

Merge branch 'main' into xg-fix-greeduntil

ca80c46

NathanHB mentioned this pull request Apr 17, 2025

[BUG] Transformers model padding should be to "longest" #663

Closed

NathanHB merged commit 88e3a3b into huggingface:main Apr 22, 2025
4 checks passed

xgwang deleted the xg-fix-greeduntil branch April 23, 2025 03:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tokenizer 'padding' param is not correct. #669

tokenizer 'padding' param is not correct. #669

xgwang commented Apr 10, 2025

HuggingFaceDocBuilderDev commented Apr 17, 2025

tokenizer 'padding' param is not correct. #669

tokenizer 'padding' param is not correct. #669

Conversation

xgwang commented Apr 10, 2025

HuggingFaceDocBuilderDev commented Apr 17, 2025