[BUG] Transformers model padding should be to "longest" #663


Closed
pingzhili opened this issue Apr 7, 2025 · 1 comment
Labels
bug Something isn't working

Comments

@pingzhili

Describe the bug

The transformers model padding is inefficient:

padding="max_length", # we pad to the longest sequence

As the comment says, "we pad to the longest sequence", so the argument should be padding="longest" instead.
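
A minimal sketch of the difference between the two strategies, using the Hugging Face tokenizer API (the prompts and max_length=1024 below are illustrative, not lighteval's actual values):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # gpt2 has no pad token by default

batch = ["a short prompt", "a slightly longer prompt with more tokens"]

# padding="longest" pads only up to the longest sequence in this batch
longest = tokenizer(batch, padding="longest", return_tensors="pt")

# padding="max_length" pads every sequence to max_length,
# spending compute on pad tokens the batch never needed
max_len = tokenizer(batch, padding="max_length", max_length=1024, return_tensors="pt")

print(longest["input_ids"].shape)  # (2, length of the longest prompt in the batch)
print(max_len["input_ids"].shape)  # (2, 1024), regardless of actual prompt lengths
```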

To Reproduce

lighteval accelerate \
    "pretrained=gpt2" \
    "leaderboard|truthfulqa:mc|0|0"

Expected behavior

The padded sequences only need to be as long as the longest sequence in the batch, not the model's maximum length.

Version info

0.8.0

pingzhili added the bug label Apr 7, 2025
pingzhili changed the title from [BUG] to [BUG] Transformers model padding should be to "longest" Apr 7, 2025
@NathanHB
Member

Hey! This is being fixed in #669.
